Abstract
Ultrasound scans were made of the hips of 209 neonates born consecutively over a two-week period. Of the 418 scans, 62 images were selected at random and 25 of these were duplicated to give a total of 87 scans. These static images were then presented to five experienced observers, who each made nine different assessments and measurements. Interobserver and intraobserver agreement were calculated and expressed as kappa values. Our results showed poor reliability on both counts.
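As a rough illustration of how agreement can be expressed as a kappa value, the sketch below computes Cohen's kappa for two sets of categorical ratings of the same images. The abstract does not state which kappa variant was used, so the choice of Cohen's kappa is an assumption, and the function name `cohen_kappa` and the example ratings are hypothetical rather than study data.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two equal-length sequences of categorical ratings.

    Assumption: this is the two-rater, unweighted form; the study may have
    used a different variant.
    """
    if len(ratings_a) != len(ratings_b) or len(ratings_a) == 0:
        raise ValueError("ratings must be non-empty and of equal length")

    n = len(ratings_a)

    # Observed agreement: fraction of images given the same category.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n

    # Chance agreement: product of each rater's marginal proportions,
    # summed over categories.
    count_a = Counter(ratings_a)
    count_b = Counter(ratings_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)

    if expected == 1.0:
        # Both raters used a single identical category throughout.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical ratings (1 = abnormal, 0 = normal), not data from the study.
observer_1 = [1, 1, 1, 0]
observer_2 = [1, 1, 0, 0]
kappa = cohen_kappa(observer_1, observer_2)
```

The same function could be applied to interobserver agreement (two observers rating the same image) or to intraobserver agreement (one observer rating an image and its duplicate). A kappa near 0 indicates agreement no better than chance, and a kappa of 1 indicates perfect agreement.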