The Bone & Joint Journal
Vol. 102-B, Issue 8 | Pages 1041 - 1047
1 Aug 2020
Hamoodi Z Singh J Elvey MH Watts AC

Aims. The Wrightington classification system of fracture-dislocations of the elbow divides these injuries into six subtypes depending on the involvement of the coronoid and the radial head. The aim of this study was to assess the reliability and reproducibility of this classification system. Methods. This was a blinded study using radiographs and CT scans of 48 consecutive patients managed according to the Wrightington classification system between 2010 and 2018. Four trauma and orthopaedic consultants, two post-CCT fellows, and one speciality registrar based in the UK classified the injuries. The seven observers reviewed preoperative radiographs and CT scans twice, with a minimum four-week interval. Radiographs and CT scans were reviewed separately. Inter- and intraobserver reliability were calculated using Fleiss and Cohen kappa coefficients. The Landis and Koch criteria were used to interpret the strength of the kappa values. Validity was assessed by calculating the percentage agreement against intraoperative findings. Results. Of the 48 patients, three (6%) had a type A injury, 11 (23%) type B, 16 (33%) type B+, 16 (33%) type C, two (4%) type D+, and none had a type D injury. All 48 patients had anteroposterior (AP) and lateral radiographs, 44 had 2D CT scans, and 39 had 3D reconstructions. The interobserver reliability kappa value was 0.52 for radiographs, 0.71 for 2D CT scans, and 0.73 for a combination of 2D and 3D reconstruction CT scans. The median intraobserver reliability was 0.75 (interquartile range (IQR) 0.62 to 0.79) for radiographs, 0.77 (IQR 0.73 to 0.94) for 2D CT scans, and 0.89 (IQR 0.77 to 0.93) for the combination of 2D and 3D reconstruction. Validity analysis showed that accuracy significantly improved when using CT scans (p = 0.018 and p = 0.028 respectively). Conclusion. The Wrightington classification system is a reliable and valid method of classifying fracture-dislocations of the elbow. CT scans are significantly more accurate than radiographs when identifying the pattern of injury, with good intra- and interobserver reproducibility. Cite this article: Bone Joint J 2020;102-B(8):1041–1047


The Journal of Bone & Joint Surgery British Volume
Vol. 94-B, Issue 1 | Pages 32 - 36
1 Jan 2012
Nho J Lee Y Kim HJ Ha Y Suh Y Koo K

A variety of radiological methods of measuring version of the acetabular component after total hip replacement (THR) have been described. The aim of this study was to evaluate the reliability and validity of six methods (those of Lewinnek; Widmer; Hassan et al; Ackland, Bourne and Uhthoff; Liaw et al; and Woo and Morrey) that are currently in use. In 36 consecutive patients who underwent THR, version of the acetabular component was measured by three independent examiners on plain radiographs using these six methods and compared with measurements using CT scans. The intra- and interobserver reliabilities of each measurement were estimated. All measurements on both radiographs and CT scans had excellent intra- and interobserver reliability and the results from each of the six methods correlated well with the CT measurements. However, measurements made using the methods of Widmer and of Ackland, Bourne and Uhthoff were significantly different from the CT measurements (both p < 0.001), whereas measurements made using the remaining four methods were similar to the CT measurements. With regard to reliability and convergent validity, we recommend the use of the methods described by Lewinnek, Hassan et al, Liaw et al and Woo and Morrey for measurement of version of the acetabular component


The Bone & Joint Journal
Vol. 102-B, Issue 4 | Pages 478 - 484
1 Apr 2020
Daniels AM Wyers CE Janzing HMJ Sassen S Loeffen D Kaarsemaker S van Rietbergen B Hannemann PFW Poeze M van den Bergh JP

Aims

Besides conventional radiographs, the use of MRI, CT, and bone scintigraphy is frequent in the diagnosis of a fracture of the scaphoid. However, which technique gives the best results remains unknown. The investigation of a new imaging technique initially requires an analysis of its precision. The primary aim of this study was to investigate the interobserver agreement of high-resolution peripheral quantitative CT (HR-pQCT) in the diagnosis of a scaphoid fracture. A secondary aim was to investigate the interobserver agreement for the presence of other fractures and for the classification of scaphoid fracture.

Methods

Two radiologists and two orthopaedic trauma surgeons evaluated HR-pQCT scans of 31 patients with a clinically-suspected scaphoid fracture. The observers were asked to determine the presence of a scaphoid or other fracture and to classify the scaphoid fracture based on the Herbert classification system. Fleiss kappa statistics were used to calculate the interobserver agreement for the diagnosis of a fracture. Intraclass correlation coefficients (ICCs) were used to assess the agreement for the classification of scaphoid fracture.
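
As an illustration of the Fleiss kappa statistic named in the Methods above, the following is a minimal Python sketch, not the authors' analysis code. The function name fleiss_kappa and the example counts are hypothetical; the counts imagine four observers rating five scans as "fracture" or "no fracture" and are not data from the study.

```python
def fleiss_kappa(counts):
    """Fleiss' kappa for a table of rating counts.

    counts[i][j] is the number of observers who put subject i in category j.
    Every row must sum to the same number of observers (n >= 2).
    """
    n_subjects = len(counts)
    n_raters = sum(counts[0])
    if any(sum(row) != n_raters for row in counts):
        raise ValueError("every subject must be rated by the same number of observers")

    # Observed agreement: mean over subjects of the proportion of agreeing rater pairs.
    per_subject = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_observed = sum(per_subject) / n_subjects

    # Chance agreement: sum of squared overall category proportions.
    n_categories = len(counts[0])
    proportions = [
        sum(row[j] for row in counts) / (n_subjects * n_raters)
        for j in range(n_categories)
    ]
    p_expected = sum(p * p for p in proportions)

    if p_expected == 1:
        raise ValueError("kappa is undefined when all ratings fall in one category")
    return (p_observed - p_expected) / (1 - p_expected)


# Hypothetical example: 5 scans, 4 observers, columns = [fracture, no fracture].
example = [[4, 0], [0, 4], [3, 1], [4, 0], [1, 3]]
kappa = fleiss_kappa(example)
```

For these made-up counts the observed agreement is 0.80 and the chance agreement is 0.52, so kappa is 0.28 / 0.48, or about 0.58.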


The Bone & Joint Journal
Vol. 103-B, Issue 8 | Pages 1339 - 1344
1 Aug 2021
Jain S Mohrir G Townsend O Lamb JN Palan J Aderinto J Pandit H

Aims. The aim of this study was to assess the reliability and validity of the Unified Classification System (UCS) for postoperative periprosthetic femoral fractures (PFFs) around cemented polished taper-slip (PTS) stems. Methods. Radiographs of 71 patients with a PFF admitted consecutively at two centres between 25 February 2012 and 19 May 2020 were collated by an independent investigator. Six observers (three hip consultants and three trainees) were familiarized with the UCS. Each PFF was classified on two separate occasions, with a mean time between assessments of 22.7 days (16 to 29). Interobserver reliability for more than two observers was assessed using percentage agreement and Fleiss’ kappa statistic. Intraobserver reliability between two observers was calculated with the Cohen kappa statistic. Validity was tested on surgically managed UCS type B PFFs where stem stability was documented in operation notes (n = 50). Validity was assessed using percentage agreement and the Cohen kappa statistic between radiological assessment and intraoperative findings. Kappa statistics were interpreted using Landis and Koch criteria. All six observers were blinded to operation notes and postoperative radiographs. Results. Interobserver reliability percentage agreement was 58.5% and the overall kappa value was 0.442 (moderate agreement). Lowest kappa values were seen for type B fractures (0.095 to 0.360). The mean intraobserver reliability kappa value was 0.672 (0.447 to 0.867), indicating substantial agreement. Validity percentage agreement was 65.7% and the mean kappa value was 0.300 (0.160 to 0.440), indicating only fair agreement. Conclusion. This study demonstrates that the UCS is unsatisfactory for the classification of PFFs around PTS stems, and that it has considerably lower reliability and validity than previously described for other stem types. Radiological PTS stem loosening in the presence of PFF is poorly defined and formal intraoperative testing of stem stability is recommended. Cite this article: Bone Joint J 2021;103-B(8):1339–1344
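
The Landis and Koch bands used above to label kappa values can be written as a small lookup. This is a sketch using the commonly quoted cut-offs (below 0 poor, 0.00 to 0.20 slight, 0.21 to 0.40 fair, 0.41 to 0.60 moderate, 0.61 to 0.80 substantial, 0.81 to 1.00 almost perfect); the function name landis_koch is hypothetical and the code is not taken from the study.

```python
def landis_koch(kappa):
    """Return the Landis and Koch label for a kappa value (assumed cut-offs)."""
    if kappa > 1:
        raise ValueError("kappa cannot exceed 1")
    if kappa < 0:
        return "poor"
    # Upper bound of each band, checked in ascending order.
    bands = [
        (0.20, "slight"),
        (0.40, "fair"),
        (0.60, "moderate"),
        (0.80, "substantial"),
        (1.00, "almost perfect"),
    ]
    for upper, label in bands:
        if kappa <= upper:
            return label


# Values reported in the abstract above.
labels = [landis_koch(k) for k in (0.442, 0.672, 0.300)]
```

Applied to the values reported in the abstract, the lookup gives "moderate", "substantial" and "fair", matching the interpretations stated there.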


The Journal of Bone & Joint Surgery British Volume
Vol. 92-B, Issue 4 | Pages 571 - 575
1 Apr 2010
Clint SA Morris TP Shaw OM Oddy MJ Rudge B Barry M

The databases of the Picture Archiving and Communication Systems of two hospitals were searched and all children who had a lateral radiograph of the ankle during their attendance at the emergency department were identified. In 227 radiographs, Bohler’s and Gissane’s angles were measured on two separate occasions and by two separate authors to allow calculation of inter- and intra-observer variation. Intraclass correlation coefficients were used to assess the reliability of the measurements. For Bohler’s angle, the intraclass correlation coefficient was 0.90 for overall inter-observer reliability and 0.95 for intra-observer reliability, indicating excellent agreement. This reliability was maintained across the age groups. For Gissane’s angle, inter- and intra-observer reliability was only fair or poor across most age groups. Further analysis of Bohler’s angle showed a significant variation in the mean angle with age. Contrary to published opinion, the angle is not uniformly lower than that of adults but varies with age, peaking towards the end of the first decade before attaining adult values. The age-related radiological changes presented here may help in the interpretation of injuries to the hindfoot in children.


The Journal of Bone & Joint Surgery British Volume
Vol. 88-B, Issue 8 | Pages 1048 - 1052
1 Aug 2006
Jerosch-Herold C Rosén B Shepstone L

Locognosia, the ability to localise touch, is one aspect of tactile spatial discrimination which relies on the integrity of peripheral end-organs as well as the somatosensory representation of the surface of the body in the brain. The test presented here is a standardised assessment which uses a protocol for testing locognosia in the zones of the hand supplied by the median and/or ulnar nerves. The test-retest reliability and discriminant validity were investigated in 39 patients with injuries to the median or ulnar nerve. Intraclass correlation coefficients were used to calculate the test-retest reliability. Discriminant validity was assessed by comparing the injured with the unaffected hand. Excellent test-retest reliability was demonstrated for the injuries to the median (intraclass correlation coefficient 0.924, 95% confidence interval 0.848 to 1.00) and the ulnar nerves (intraclass correlation coefficient 0.859, 95% confidence interval 0.693 to 1.00). The magnitude of the difference in scores between affected and unaffected hands showed good discriminant validity. For injuries to the median nerve the mean difference was 11.1 points (1 to 33; SD 7.4), which was statistically significant (p < 0.0001, paired t-test) and for those of the ulnar nerve it was 4.75 points (1 to 13.5; SD 3.16), which was also statistically significant (paired t-test, p < 0.0001). The locognosia test has excellent test-retest reliability, is a valid test of tactile spatial discrimination and should be included in the evaluation of outcome after injury to peripheral nerves.


The Journal of Bone & Joint Surgery British Volume
Vol. 91-B, Issue 7 | Pages 903 - 906
1 Jul 2009
Trickett RW Hodgson P Forster MC Robertson A

We aimed to determine the reliability, accuracy and the clinical role of digital templating in the pre-operative work-up for total knee replacement. Initially a sample of ten pre-operative digital radiographs were templated by four independent observers to determine the inter- and intra-observer reliability of the process. Digital templating was then performed on the radiographs of 40 consecutive patients undergoing total knee replacement by a consultant surgeon not involved with the operation, who was blinded to the size of the implant inserted. The Press Fit Condylar Sigma Knee system was used in all the patients. The size of the implant as judged by templating was then compared to that of the size used. Good inter- and intra-observer agreement was demonstrated for both femoral and tibial templating. However, the correct size of the implant was predicted in only 48% of the femoral and 55% of the tibial components. Albeit reproducible, digital templating does not currently predict the correct size of component often enough to be of clinical benefit


The Journal of Bone & Joint Surgery British Volume
Vol. 80-B, Issue 4 | Pages 670 - 672
1 Jul 1998
Flinkkilä T Nikkola-Sihto A Kaarela O Päakkö E Raatikainen T

Interobserver reliability of the AO system of classification of fractures of the distal radius was assessed using plain radiographs and CT. Five observers classified 30 Colles’-type fractures using only plain radiographs; two months later they were reclassified using CT in addition. Interobserver reliability was poor in both series when detailed classification was used. By reducing the categories to five, interobserver reliability was slightly improved, but was still poor. When only two AO types were used, the reliability was moderate using plain radiographs and good to excellent with the addition of CT. The use of CT as well as plain radiographs brings interobserver reliability to a good level in assessment of the presence or absence of articular involvement, but is otherwise of minor value in improving the interobserver reliability of the AO system of classification of fractures of the distal radius


The Bone & Joint Journal
Vol. 96-B, Issue 12 | Pages 1669 - 1673
1 Dec 2014
Van der Merwe JM Haddad FS Duncan CP

The Unified Classification System (UCS) was introduced because of a growing need to have a standardised universal classification system of periprosthetic fractures. It combines and simplifies many existing classification systems, and can be applied to any fracture around any partial or total joint replacement occurring during or after operation. Our goal was to assess the inter- and intra-observer reliability of the UCS in association with knee replacement when classifying fractures affecting one or more of the femur, tibia or patella. We used an international panel of ten orthopaedic surgeons with subspecialty fellowship training and expertise in adult hip and knee reconstruction (‘experts’) and ten residents of orthopaedic surgery in the last two years of training (‘pre-experts’). They each received 15 radiographs for evaluation. After six weeks they evaluated the same radiographs again but in a different order. The reliability was assessed using the Kappa and weighted Kappa values. The Kappa values for inter-observer reliability for the experts and the pre-experts were 0.741 (95% confidence interval (CI) 0.707 to 0.774) and 0.765 (95% CI 0.733 to 0.797), respectively. The weighted Kappa values for intra-observer reliability for the experts and pre-experts were 0.898 (95% CI 0.846 to 0.950) and 0.878 (95% CI 0.815 to 0.942) respectively. The UCS has substantial inter-observer reliability and ‘near perfect’ intra-observer reliability when used for periprosthetic fractures in association with knee replacement in the hands of experienced and inexperienced users. Cite this article: Bone Joint J 2014;96-B:1669–73


The Bone & Joint Journal
Vol. 98-B, Issue 2 | Pages 166 - 172
1 Feb 2016
Langlois J Hamadouche M

Previous standards for assessing the reliability of a measurement tool have lacked consistency. We reviewed the most current American Society for Testing and Materials and International Organisation for Standardisation (ISO) recommendations, and propose an algorithm for orthopaedic surgeons. When assessing a measurement tool, conditions of the experimental set-up and clear formulae used to compile the results should be strictly reported. According to these recent guidelines, accuracy is a confusing word with an overly broad meaning and should therefore be abandoned. Depending on the experimental conditions, one should be referring to bias (when the study protocol involves accepted reference values), and repeatability (sr, r) or reproducibility (SR, R). In the absence of accepted reference values, only repeatability (sr, r) or reproducibility (SR, R) should be provided. Take home message: Assessing the reliability of a measurement tool involves reporting bias, repeatability and/or reproducibility depending on the defined conditions, instead of precision or accuracy. Cite this article: Bone Joint J 2016;98-B2:166–72


The Bone & Joint Journal
Vol. 97-B, Issue 5 | Pages 611 - 616
1 May 2015
Shin WC Lee SM Lee KW Cho HJ Lee JS Suh KT

There is no single standardised method of measuring the orientation of the acetabular component on plain radiographs after total hip arthroplasty. We assessed the reliability and accuracy of three methods of assessing anteversion of the acetabular component for 551 THAs using the PolyWare software and the methods of Liaw et al, and of Woo and Morrey. All measurements of the three methods had excellent intra- and inter-observer reliability. The values of the PolyWare software, which determines version of the acetabular component by edge detection were regarded as the reference standard. Although the PolyWare software and the method of Liaw et al were similarly precise, the method of Woo and Morrey was significantly less accurate (p < 0.001). The method of Liaw et al seemed to be more accurate than that of Woo and Morrey when compared with the measurements using the PolyWare software. If the qualified lateral radiograph was selected, anteversion measured using the method of Woo and Morrey was considered to be relatively reliable. Cite this article: Bone Joint J 2015; 97-B:611–16


The Journal of Bone & Joint Surgery British Volume
Vol. 88-B, Issue 9 | Pages 1204 - 1206
1 Sep 2006
Malek IA Machani B Mevcha AM Hyder NH

Our aim was to assess the reproducibility and the reliability of the Weber classification system for fractures of the ankle based on anteroposterior and lateral radiographs. Five observers with varying clinical experience reviewed 50 sets of blinded radiographs. The same observers reviewed the same radiographs again after an interval of four weeks. Inter- and intra-observer agreement was assessed based on the proportion of agreement and the values of the kappa coefficient. For inter-observer agreement, the mean kappa value was 0.61 (0.59 to 0.63) and the proportion of agreement was 78% (76% to 79%) and for intra-observer agreement the mean kappa value was 0.74 (0.39 to 0.86) with an 85% (60% to 93%) observed agreement. These results show that the Weber classification of fractures of the ankle based on two radiological views has substantial inter-observer reliability and intra-observer reproducibility
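
The two agreement measures reported above, proportion of agreement and the kappa coefficient, can be sketched for a single pair of readings as follows. This is an illustrative Python sketch under stated assumptions: the function name cohen_kappa and the ten Weber gradings are hypothetical and are not data from the study.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Return (proportion of agreement, Cohen's kappa) for two rating lists."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("need two non-empty rating lists of equal length")
    n = len(ratings_a)

    # Observed agreement: share of cases given the same category twice.
    p_observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n

    # Chance agreement from each reading's marginal category frequencies.
    freq_a, freq_b = Counter(ratings_a), Counter(ratings_b)
    categories = set(freq_a) | set(freq_b)
    p_expected = sum(freq_a[c] * freq_b[c] for c in categories) / (n * n)

    if p_expected == 1:
        raise ValueError("kappa is undefined when both readings use one category")
    return p_observed, (p_observed - p_expected) / (1 - p_expected)


# Hypothetical Weber types assigned to the same 10 radiographs on two readings.
first = ["A", "A", "B", "B", "B", "B", "C", "C", "C", "B"]
second = ["A", "B", "B", "B", "B", "B", "C", "C", "B", "B"]
agreement, kappa = cohen_kappa(first, second)
```

In this made-up example the readings agree on 8 of 10 radiographs (80%), while kappa is about 0.65 because 43% agreement would be expected by chance alone.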


The Journal of Bone & Joint Surgery British Volume
Vol. 91-B, Issue 6 | Pages 766 - 771
1 Jun 2009
Brunner A Honigmann P Treumann T Babst R

We evaluated the impact of stereo-visualisation of three-dimensional volume-rendering CT datasets on the inter- and intraobserver reliability assessed by kappa values on the AO/OTA and Neer classifications in the assessment of proximal humeral fractures. Four independent observers classified 40 fractures according to the AO/OTA and Neer classifications using plain radiographs, two-dimensional CT scans and with stereo-visualised three-dimensional volume-rendering reconstructions. Both classification systems showed moderate interobserver reliability with plain radiographs and two-dimensional CT scans. Three-dimensional volume-rendered CT scans improved the interobserver reliability of both systems to good. Intraobserver reliability was moderate for both classifications when assessed by plain radiographs. Stereo visualisation of three-dimensional volume rendering improved intraobserver reliability to good for the AO/OTA method and to excellent for the Neer classification. These data support our opinion that stereo visualisation of three-dimensional volume-rendering datasets is of value when analysing and classifying complex fractures of the proximal humerus


The Journal of Bone & Joint Surgery British Volume
Vol. 84-B, Issue 1 | Pages 42 - 47
1 Jan 2002
Brismar BH Wredmark T Movin T Leandersson J Svensson O

We studied 19 videotaped knee arthroscopies in 19 patients with mild to moderate osteoarthritis (OA) of the knee in order to compare the intraobserver and interobserver reliability and the patterns of disagreement between four orthopaedic surgeons. The classifications of OA of Collins, Outerbridge and the French Society of Arthroscopy were used. Intraobserver and interobserver agreements using kappa measures were 0.42 to 0.66 and 0.43 to 0.49, respectively. Only 6% to 8% of paired intraobserver classifications differed by more than one category. Observer-specific disagreement was evident both within and between observers. A small, but significant, occasional variation was also seen. Although reliability may improve by an analysis of disagreement, it appears that the arthroscopic grading of early osteoarthritic lesions is inexact


The Bone & Joint Journal
Vol. 100-B, Issue 2 | Pages 242 - 246
1 Feb 2018
Ghoshal A Enninghorst N Sisak K Balogh ZJ

Aims. To evaluate interobserver reliability of the Orthopaedic Trauma Association’s open fracture classification system (OTA-OFC). Patients and Methods. Patients of any age with a first presentation of an open long bone fracture were included. Standard radiographs, wound photographs, and a short clinical description were given to eight orthopaedic surgeons, who independently evaluated the injury using both the Gustilo and Anderson (GA) and OTA-OFC classifications. The responses were compared for variability using Cohen’s kappa. Results. The overall interobserver agreement was κ = 0.44 for the GA classification and κ = 0.49 for OTA-OFC, which reflects moderate agreement (0.41 to 0.60) for both classifications. The agreement in the five categories of OTA-OFC was: for skin, κ = 0.55 (moderate); for muscle, κ = 0.44 (moderate); for arterial injury, κ = 0.74 (substantial); for contamination, κ = 0.35 (fair); and for bone loss, κ = 0.41 (moderate). Conclusion. Although the OTA-OFC, with similar interobserver agreement to GA, offers a more detailed description of open fractures, further development may be needed to make it a reliable and robust tool. Cite this article: Bone Joint J 2018;100-B:242–6


The Bone & Joint Journal
Vol. 96-B, Issue 5 | Pages 597 - 603
1 May 2014
Nomura T Naito M Nakamura Y Ida T Kuroda D Kobayashi T Sakamoto T Seo H

Several radiological methods of measuring anteversion of the acetabular component after total hip replacement (THR) have been described. These studies used different definitions and reference planes to compare methods, allowing for misinterpretation of the results. We compared the reliability and accuracy of five current methods using plain radiographs (those of Lewinnek, Widmer, Liaw, Pradhan, and Woo and Morrey) with CT measurements, using the same definition and reference plane. We retrospectively studied the plain radiographs and CT scans in 84 hips of 84 patients who underwent primary THR. Intra- and inter-observer reliability were high for the measurement of inclination and anteversion with all methods on plain radiographs and CT scans. The measurements of inclination on plain radiographs were similar to the measurements using CT (p = 0.043). The mean difference between CT measurements was 0.6° (-5.9° to 6.8°). Measurements using Widmer’s method were the most similar to those using CT (p = 0.088), with a mean difference between CT measurements of -0.9° (-10.4° to 9.1°), whereas the other four methods differed significantly from those using CT (p < 0.001). This study has shown that Widmer’s method is the best for evaluating the anteversion of the acetabular component on plain radiographs. Cite this article: Bone Joint J 2014; 96-B:597–603


The Journal of Bone & Joint Surgery British Volume
Vol. 75-B, Issue 3 | Pages 479 - 482
1 May 1993
Dias J Thomas I Lamont A Mody B Thompson

Ultrasound scans were made of the hips of 209 neonates born consecutively over a two-week period. Of the 418 scans, 62 images were selected at random and 25 of these were duplicated to give a total of 87 scans. These static images were then presented to five experienced observers who each made nine different assessments and measurements. Interobserver and intraobserver agreement were calculated and expressed as kappa values. Our results showed poor reliability on both counts.


The Journal of Bone & Joint Surgery British Volume
Vol. 79-B, Issue 4 | Pages 570 - 575
1 Jul 1997
Boniforti FG Fujii G Angliss RD Benson MKD

We have evaluated the reliability of the measurement of radiological indicators in developmental dysplasia of the hip. Three observers each independently assessed 60 pelvic radiographs from infants aged from 3 to 36 months. Errors from the true value of a single measurement made by a single observer (E1), of the average of two measurements by a single observer (E2), and of the average of two single measurements by two different observers (E3) were established for the acetabular index of Hilgenreiner, for the assessment of superior and lateral femoral displacement and for indicators of pelvic alignment. The errors for the assessment of the acetabular index were E1 ± 5°, E2 ± 5°, and E3 ± 3.5°. There was a significant correlation between the presence of an acetabular notch on the radiograph and an increased error in measurement (p = 0.01). Yamamuro’s measurement of lateral femoral displacement was more reliable than the Hilgenreiner distance. The errors of indicators of pelvic alignment showed a correlation with the age of the infant; the quotient of pelvic rotation was more reliable after seven months of age (p < 0.0001). The errors of the measurement of the symphysis os-ischium angle tended to increase with age and those of the measurement of the index of pelvic tilt decreased with skeletal maturation (p = 0.002)


The Journal of Bone & Joint Surgery British Volume
Vol. 68-B, Issue 4 | Pages 614 - 615
1 Aug 1986
Christensen F Soballe K Ejsted R Luxhoj T

The reliability of the Catterall grouping of Perthes' disease was examined by determining the agreement between pairs of observers using weighted kappa statistics. Anteroposterior and lateral radiographs of 100 hip joints were grouped independently by four experienced observers. There was a low, and in our opinion, unacceptable degree of inter-observer agreement even when Groups 2 and 3 were combined


The Journal of Bone & Joint Surgery British Volume
Vol. 74-B, Issue 2 | Pages 287 - 291
1 Mar 1992
Wright J Feinstein A