Abstract
We studied the reproducibility of ultrasonographic screening examination of the hip when read by diagnostic radiographers. In order to determine interobserver variability, 200 ultrasonograms were classified according to Graf’s method by five observers (four radiographers and one radiologist). The kappa values for interobserver variability indicated moderate agreement (kappa 0.47) for the exact Graf classification and substantial agreement (kappa 0.65) for the classification of normal (type I) versus abnormal (type IIa-IV). Agreement was significantly different for normal, immature and abnormal hips. Comparison of the findings in our interobserver study with existing information based on other examinations and treatment revealed that only a small number of infants with mildly dysplastic hips would have been typed as normal by some observers as a result of observer variability. In conclusion, the interobserver agreement on the ultrasound assessment of the hip was good enough for screening purposes. Observer variability did not result in any severe cases being missed.