The Journal of Bone & Joint Surgery British Volume
Vol. 88-B, Issue 9 | Pages 1204 - 1206
1 Sep 2006
Malek IA Machani B Mevcha AM Hyder NH

Our aim was to assess the reproducibility and the reliability of the Weber classification system for fractures of the ankle based on anteroposterior and lateral radiographs. Five observers with varying clinical experience reviewed 50 sets of blinded radiographs. The same observers reviewed the same radiographs again after an interval of four weeks. Inter- and intra-observer agreement was assessed based on the proportion of agreement and the values of the kappa coefficient. For inter-observer agreement, the mean kappa value was 0.61 (0.59 to 0.63) and the proportion of agreement was 78% (76% to 79%) and for intra-observer agreement the mean kappa value was 0.74 (0.39 to 0.86) with an 85% (60% to 93%) observed agreement. These results show that the Weber classification of fractures of the ankle based on two radiological views has substantial inter-observer reliability and intra-observer reproducibility
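The two statistics reported above, the proportion of agreement and the kappa coefficient, can be illustrated with a minimal sketch for a single pair of observers. The abstract does not say which kappa variant was used or how the pairwise values were averaged, so the unweighted Cohen's kappa below is an assumption, and the function name `cohen_kappa` and the example gradings are hypothetical.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Return (proportion of agreement, unweighted Cohen's kappa) for two raters."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must grade the same non-empty set of cases")
    n = len(rater_a)
    # Observed agreement: share of cases given the same category by both raters.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of the two raters' marginal proportions per category.
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return observed, (observed - expected) / (1 - expected)


# Hypothetical Weber types (A, B, C) assigned to eight radiographs by two observers.
observer_1 = list("AABBCCAB")
observer_2 = list("AABCCCAA")
agreement, kappa = cohen_kappa(observer_1, observer_2)
print(f"agreement = {agreement:.2f}, kappa = {kappa:.2f}")
```

Kappa comes out lower than the raw agreement because it discounts the agreement the two observers would reach by chance given how often each uses each category.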


The Journal of Bone & Joint Surgery British Volume
Vol. 91-B, Issue 6 | Pages 766 - 771
1 Jun 2009
Brunner A Honigmann P Treumann T Babst R

We evaluated the impact of stereo-visualisation of three-dimensional volume-rendering CT datasets on the inter- and intraobserver reliability assessed by kappa values on the AO/OTA and Neer classifications in the assessment of proximal humeral fractures. Four independent observers classified 40 fractures according to the AO/OTA and Neer classifications using plain radiographs, two-dimensional CT scans and with stereo-visualised three-dimensional volume-rendering reconstructions. Both classification systems showed moderate interobserver reliability with plain radiographs and two-dimensional CT scans. Three-dimensional volume-rendered CT scans improved the interobserver reliability of both systems to good. Intraobserver reliability was moderate for both classifications when assessed by plain radiographs. Stereo visualisation of three-dimensional volume rendering improved intraobserver reliability to good for the AO/OTA method and to excellent for the Neer classification. These data support our opinion that stereo visualisation of three-dimensional volume-rendering datasets is of value when analysing and classifying complex fractures of the proximal humerus


Bone & Joint Research
Vol. 7, Issue 1 | Pages 36 - 45
1 Jan 2018
Kleinlugtenbelt YV Krol RG Bhandari M Goslings JC Poolman RW Scholtes VAB

Objectives. The patient-rated wrist evaluation (PRWE) and the Disabilities of the Arm, Shoulder and Hand (DASH) questionnaire are patient-reported outcome measures (PROMs) used for clinical and research purposes. Methodological high-quality clinimetric studies that determine the measurement properties of these PROMs when used in patients with a distal radial fracture are lacking. This study aimed to validate the PRWE and DASH in Dutch patients with a displaced distal radial fracture (DRF). Methods. The intraclass correlation coefficient (ICC) was used for test-retest reliability, between PROMs completed twice with a two-week interval at six to eight months after DRF. Internal consistency was determined using Cronbach’s α for the dimensions found in the factor analysis. The measurement error was expressed by the smallest detectable change (SDC). A semi-structured interview was conducted between eight and 12 weeks after DRF to assess the content validity. Results. A total of 119 patients (mean age 58 years (SD 15)), 74% female, completed PROMs at a mean time of six months (SD 1) post-fracture. One overall meaningful dimension was found for the PRWE and the DASH. Internal consistency was excellent for both PROMs (Cronbach’s α 0.96 (PRWE) and 0.97 (DASH)). Test-retest reliability was good for the PRWE (ICC 0.87) and excellent for the DASH (ICC 0.91). The SDC was 20 for the PRWE and 14 for the DASH. No floor or ceiling effects were found. The content validity was good for both questionnaires. Conclusion. The PRWE and DASH are valid and reliable PROMs in assessing function and disability in Dutch patients with a displaced DRF. However, due to the high SDC, the PRWE and DASH are less useful for individual patients with a distal radial fracture in clinical practice. Cite this article: Y. V. Kleinlugtenbelt, R. G. Krol, M. Bhandari, J. C. Goslings, R. W. Poolman, V. A. B. Scholtes. Are the patient-rated wrist evaluation (PRWE) and the disabilities of the arm, shoulder and hand (DASH) questionnaire used in distal radial fractures truly valid and reliable? Bone Joint Res 2018;7:36–45. DOI: 10.1302/2046-3758.71.BJR-2017-0081.R1
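The abstract does not state how the SDC was derived. One commonly used route, assumed here, goes through the standard error of measurement: SEM = SD × √(1 − ICC), then SDC = 1.96 × √2 × SEM. A minimal sketch follows; the function names and the score SD of 10 points are hypothetical, and only the ICC of 0.87 is taken from the abstract.

```python
import math


def sem_from_icc(score_sd, icc):
    """Standard error of measurement from the score SD and a test-retest ICC (assumed formula)."""
    if not 0.0 <= icc <= 1.0:
        raise ValueError("ICC is expected to lie between 0 and 1")
    return score_sd * math.sqrt(1.0 - icc)


def sdc_individual(sem):
    """Smallest detectable change for one patient at the 95% level (assumed formula)."""
    return 1.96 * math.sqrt(2.0) * sem


# Hypothetical score SD of 10 points; the abstract does not report the score SDs.
sem = sem_from_icc(score_sd=10.0, icc=0.87)
print(f"SEM = {sem:.2f}, SDC = {sdc_individual(sem):.2f}")
```

Under these assumptions a high ICC can still give a large SDC when scores are widely spread, which fits the abstract's point that good group-level reliability does not guarantee usefulness for individual patients.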


The Journal of Bone & Joint Surgery British Volume
Vol. 84-B, Issue 1 | Pages 42 - 47
1 Jan 2002
Brismar BH Wredmark T Movin T Leandersson J Svensson O

We studied 19 videotaped knee arthroscopies in 19 patients with mild to moderate osteoarthritis (OA) of the knee in order to compare the intraobserver and interobserver reliability and the patterns of disagreement between four orthopaedic surgeons. The classifications of OA of Collins, Outerbridge and the French Society of Arthroscopy were used. Intraobserver and interobserver agreements using kappa measures were 0.42 to 0.66 and 0.43 to 0.49, respectively. Only 6% to 8% of paired intraobserver classifications differed by more than one category. Observer-specific disagreement was evident both within and between observers. A small, but significant, occasional variation was also seen. Although reliability may improve by an analysis of disagreement, it appears that the arthroscopic grading of early osteoarthritic lesions is inexact


Bone & Joint Research
Vol. 2, Issue 1 | Pages 1 - 8
1 Jan 2013
Costa AJ Lustig S Scholes CJ Balestro J Fatima M Parker DA

Objectives. There remains a lack of data on the reliability of methods to estimate tibial coverage achieved during total knee replacement. In order to address this gap, the intra- and interobserver reliability of a three-dimensional (3D) digital templating method was assessed with one symmetric and one asymmetric prosthesis design. Methods. A total of 120 template procedures were performed according to specific rotational and over-hang criteria by three observers at time zero and again two weeks later. Total and sub-region coverage were calculated and the reliability of the templating and measurement method was evaluated. Results. Excellent intra- and interobserver reliability was observed for total coverage, when minimal component overhang (intraclass correlation coefficient (ICC) = 0.87) or no component overhang (ICC = 0.92) was permitted, regardless of rotational restrictions. Conclusions. Measurement of tibial coverage can be reliable using the templating method described even if the rotational axis selected still has a minor influence


The Bone & Joint Journal
Vol. 100-B, Issue 2 | Pages 242 - 246
1 Feb 2018
Ghoshal A Enninghorst N Sisak K Balogh ZJ

Aims. To evaluate interobserver reliability of the Orthopaedic Trauma Association’s open fracture classification system (OTA-OFC). Patients and Methods. Patients of any age with a first presentation of an open long bone fracture were included. Standard radiographs, wound photographs, and a short clinical description were given to eight orthopaedic surgeons, who independently evaluated the injury using both the Gustilo and Anderson (GA) and OTA-OFC classifications. The responses were compared for variability using Cohen’s kappa. Results. The overall interobserver agreement was κ = 0.44 for the GA classification and κ = 0.49 for OTA-OFC, which reflects moderate agreement (0.41 to 0.60) for both classifications. The agreement in the five categories of OTA-OFC was: for skin, κ = 0.55 (moderate); for muscle, κ = 0.44 (moderate); for arterial injury, κ = 0.74 (substantial); for contamination, κ = 0.35 (fair); and for bone loss, κ = 0.41 (moderate). Conclusion. Although the OTA-OFC, with similar interobserver agreement to GA, offers a more detailed description of open fractures, further development may be needed to make it a reliable and robust tool. Cite this article: Bone Joint J 2018;100-B:242–6
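The verbal labels attached to the kappa values above can be reproduced with a small lookup. The abstract states only the moderate band (0.41 to 0.60); the other cut-offs below are the widely used Landis and Koch bands, which is an assumption about the scale the authors applied. The function name `landis_koch_label` is hypothetical.

```python
def landis_koch_label(kappa):
    """Map a kappa value to a Landis and Koch style agreement label (assumed bands)."""
    if not -1.0 <= kappa <= 1.0:
        raise ValueError("kappa must lie between -1 and 1")
    if kappa < 0.0:
        return "poor"
    if kappa <= 0.20:
        return "slight"
    if kappa <= 0.40:
        return "fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"


# Per-category kappa values as reported in the abstract for the OTA-OFC.
reported = {"skin": 0.55, "muscle": 0.44, "arterial injury": 0.74,
            "contamination": 0.35, "bone loss": 0.41}
for category, kappa in reported.items():
    print(f"{category}: {kappa:.2f} ({landis_koch_label(kappa)})")
```

Other abstracts in this listing use different wording ("good", "excellent"), so the labels depend on which published scale an author follows.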


The Bone & Joint Journal
Vol. 96-B, Issue 5 | Pages 597 - 603
1 May 2014
Nomura T Naito M Nakamura Y Ida T Kuroda D Kobayashi T Sakamoto T Seo H

Several radiological methods of measuring anteversion of the acetabular component after total hip replacement (THR) have been described. These studies used different definitions and reference planes to compare methods, allowing for misinterpretation of the results. We compared the reliability and accuracy of five current methods using plain radiographs (those of Lewinnek, Widmer, Liaw, Pradhan, and Woo and Morrey) with CT measurements, using the same definition and reference plane. We retrospectively studied the plain radiographs and CT scans in 84 hips of 84 patients who underwent primary THR. Intra- and inter-observer reliability were high for the measurement of inclination and anteversion with all methods on plain radiographs and CT scans. The measurements of inclination on plain radiographs were similar to the measurements using CT (p = 0.043). The mean difference between CT measurements was 0.6° (-5.9° to 6.8°). Measurements using Widmer’s method were the most similar to those using CT (p = 0.088), with a mean difference between CT measurements of -0.9° (-10.4° to 9.1°), whereas the other four methods differed significantly from those using CT (p < 0.001). This study has shown that Widmer’s method is the best for evaluating the anteversion of the acetabular component on plain radiographs. Cite this article: Bone Joint J 2014; 96-B:597–603


The Journal of Bone & Joint Surgery British Volume
Vol. 75-B, Issue 3 | Pages 479 - 482
1 May 1993
Dias J Thomas I Lamont A Mody B Thompson

Ultrasound scans were made of the hips of 209 neonates born consecutively over a two-week period. Of the 418 scans, 62 images were selected at random and 25 of these were duplicated to give a total of 87 scans. These static images were then presented to five experienced observers who each made nine different assessments and measurements. Interobserver and intraobserver agreement was calculated and expressed as kappa values. Our results showed poor reliability on both counts.


The Journal of Bone & Joint Surgery British Volume
Vol. 79-B, Issue 4 | Pages 570 - 575
1 Jul 1997
Boniforti FG Fujii G Angliss RD Benson MKD

We have evaluated the reliability of the measurement of radiological indicators in developmental dysplasia of the hip. Three observers each independently assessed 60 pelvic radiographs from infants aged from 3 to 36 months. Errors from the true value of a single measurement made by a single observer (E1), of the average of two measurements by a single observer (E2), and of the average of two single measurements by two different observers (E3) were established for the acetabular index of Hilgenreiner, for the assessment of superior and lateral femoral displacement and for indicators of pelvic alignment. The errors for the assessment of the acetabular index were E1 ± 5°, E2 ± 5°, and E3 ± 3.5°. There was a significant correlation between the presence of an acetabular notch on the radiograph and an increased error in measurement (p = 0.01). Yamamuro’s measurement of lateral femoral displacement was more reliable than the Hilgenreiner distance. The errors of indicators of pelvic alignment showed a correlation with the age of the infant; the quotient of pelvic rotation was more reliable after seven months of age (p < 0.0001). The errors of the measurement of the symphysis os-ischium angle tended to increase with age and those of the measurement of the index of pelvic tilt decreased with skeletal maturation (p = 0.002)


The Journal of Bone & Joint Surgery British Volume
Vol. 68-B, Issue 4 | Pages 614 - 615
1 Aug 1986
Christensen F Soballe K Ejsted R Luxhoj T

The reliability of the Catterall grouping of Perthes' disease was examined by determining the agreement between pairs of observers using weighted kappa statistics. Anteroposterior and lateral radiographs of 100 hip joints were grouped independently by four experienced observers. There was a low, and in our opinion, unacceptable degree of inter-observer agreement even when Groups 2 and 3 were combined


The Journal of Bone & Joint Surgery British Volume
Vol. 74-B, Issue 2 | Pages 287 - 291
1 Mar 1992
Wright J Feinstein A


The Journal of Bone & Joint Surgery British Volume
Vol. 72-B, Issue 5 | Pages 924 - 924
1 Sep 1990
Asirvatham R Watts H Ware B Rooney R


The Journal of Bone & Joint Surgery British Volume
Vol. 83-B, Issue 5 | Pages 775 - 777
1 Jul 2001
Rushton N


The Journal of Bone & Joint Surgery British Volume
Vol. 85-B, Issue 3 | Pages 463 - 464
1 Apr 2003
MENCHE DS


The Journal of Bone & Joint Surgery British Volume
Vol. 71-B, Issue 1 | Pages 6 - 8
1 Jan 1989
Broughton N Brougham D Cole W Menelaus M

We investigated the reproducibility of the various radiological methods of assessment of hip dysplasia by making 474 assessments of hips and quantifying the inter-observer and intra-observer variation. There was a wide range of variability between the readings made by different observers and by one observer on two occasions. A measurement of acetabular index has to be given a range of +/- 6 degrees in order to be 95% confident of including the true measurement. We found the most helpful measurements to be the acetabular index, up to the age of eight years; the centre-edge angle, over the age of five years; and Smith's c/b ratio and neck-shaft angle. We feel, however, that the change in value over a series of radiographs in the same child is much more valuable. Single readings of all the radiological measurements investigated in this study were unreliable.


The Bone & Joint Journal
Vol. 97-B, Issue 8 | Pages 1139 - 1143
1 Aug 2015
Hutt JRB Ortega-Briones A Daurka JS Bircher MD Rickman MS

The most widely used classification system for acetabular fractures was developed by Judet, Judet and Letournel over 50 years ago primarily to aid surgical planning. As population demographics and injury mechanisms have altered over time, the fracture patterns also appear to be changing. We conducted a retrospective review of the imaging of 100 patients with a mean age of 54.9 years (19 to 94) and a male to female ratio of 69:31 seen between 2010 and 2013 with acetabular fractures in order to determine whether the current spectrum of injury patterns can be reliably classified using the original system.

Three consultant pelvic and acetabular surgeons and one senior fellow analysed anonymous imaging. Inter-observer agreement for the classification of fractures that fitted into defined categories was substantial, (κ = 0.65, 95% confidence interval (CI) 0.51 to 0.76) with improvement to near perfect on inclusion of CT imaging (κ = 0.80, 95% CI 0.69 to 0.91). However, a high proportion of injuries (46%) were felt to be unclassifiable by more than one surgeon; there was moderate agreement on which these were (κ = 0.42 95% CI 0.31 to 0.54).

Further review of the unclassifiable fractures in this cohort of 100 patients showed that they tended to occur in an older population (mean age 59.1 years; 22 to 94 vs 47.2 years; 19 to 94; p = 0.003) and within this group, there was a recurring pattern of anterior column and quadrilateral plate involvement, with or without an incomplete posterior element injury.

Cite this article: Bone Joint J 2015;97-B:1139–43.


The Journal of Bone & Joint Surgery British Volume
Vol. 94-B, Issue 11 | Pages 1522 - 1528
1 Nov 2012
Wallander H Saebö M Jonsson K Bjönness T Hansson G

We investigated 60 patients (89 feet) with a mean age of 64 years (61 to 67) treated for congenital clubfoot deformity, using standardised weight-bearing radiographs of both feet and ankles together with a functional evaluation. Talocalcaneal and talonavicular relationships were measured and the degree of osteo-arthritic change in the ankle and talonavicular joints was assessed. The functional results were evaluated using a modified Laaveg-Ponseti score. The talocalcaneal (TC) angles in the clubfeet were significantly lower in both anteroposterior (AP) and lateral projections than in the unaffected feet (p < 0.001 for both views). There was significant medial subluxation of the navicular in the clubfeet compared with the unaffected feet (p < 0.001). Severe osteoarthritis in the ankle joint was seen in seven feet (8%) and in the talonavicular joint in 11 feet (12%). The functional result was excellent or good (≥ 80 points) in 29 patients (48%), and fair or poor (< 80 points) in 31 patients (52%). Patients who had undergone few (0 to 1) surgical procedures had better functional outcomes than those who had undergone two or more procedures (p < 0.001). There was a significant correlation between the functional result and the degree of medial subluxation of the navicular (p < 0.001, r² = 0.164), the talocalcaneal angle on AP projection (p < 0.02, r² = 0.025) and extent of osteoarthritis in the ankle joint (p < 0.001).

We conclude that poor functional outcome in patients with congenital clubfoot occurs more frequently in those with medial displacement of the navicular, osteoarthritis of the talonavicular and ankle joints, and a low talocalcaneal angle on the AP projection, and in patients who have undergone two or more surgical procedures. However, the ankle joint in these patients appeared relatively resistant to the development of osteoarthritis.


Aims. The purpose of this study was to assess the reliability and responsiveness to hip surgery of a four-point modified Care and Comfort Hypertonicity Questionnaire (mCCHQ) scoring tool in children with cerebral palsy (CP) in Gross Motor Function Classification System (GMFCS) levels IV and V. Methods. This was a population-based cohort study in children with CP from a national surveillance programme. Reliability was assessed from 20 caregivers who completed the mCCHQ questionnaire on two occasions three weeks apart. Test-retest reliability of the mCCHQ was calculated, and responsiveness before and after surgery for a displaced hip was evaluated in a cohort of children. Results. Test-retest reliability for the overall mCCHQ score was good (intraclass correlation coefficient 0.78), and no dimension demonstrated poor reliability. The surgical intervention cohort comprised ten children who had preoperative and postoperative mCCHQ scores at a minimum of six months postoperatively. The mCCHQ tool demonstrated a significant improvement in overall score from preoperative assessment to six-month postoperative follow-up assessment (p < 0.001). Conclusion. The mCCHQ demonstrated responsiveness to intervention and good test-retest reliability. The mCCHQ is proposed as an outcome tool for use within a national surveillance programme for children with CP. Cite this article: Bone Jt Open 2023;4(8):580–583


Bone & Joint Research
Vol. 13, Issue 1 | Pages 19 - 27
5 Jan 2024
Baertl S Rupp M Kerschbaum M Morgenstern M Baumann F Pfeifer C Worlicek M Popp D Amanatullah DF Alt V

Aims. This study aimed to evaluate the clinical application of the PJI-TNM classification for periprosthetic joint infection (PJI) by determining intraobserver and interobserver reliability. To facilitate its use in clinical practice, an educational app was subsequently developed and evaluated. Methods. A total of ten orthopaedic surgeons classified 20 cases of PJI based on the PJI-TNM classification. Subsequently, the classification was re-evaluated using the PJI-TNM app. Classification accuracy was calculated separately for each subcategory (reinfection, tissue and implant condition, non-human cells, and morbidity of the patient). Fleiss’ kappa and Cohen’s kappa were calculated for interobserver and intraobserver reliability, respectively. Results. Overall, interobserver and intraobserver agreements were substantial across the 20 classified cases. Analyses for the variable ‘reinfection’ revealed an almost perfect interobserver and intraobserver agreement with a classification accuracy of 94.8%. The category 'tissue and implant conditions' showed moderate interobserver and substantial intraobserver reliability, while the classification accuracy was 70.8%. For 'non-human cells,' accuracy was 81.0% and interobserver agreement was moderate with an almost perfect intraobserver reliability. The classification accuracy of the variable 'morbidity of the patient' reached 73.5% with a moderate interobserver agreement, whereas the intraobserver agreement was substantial. The application of the app yielded comparable results across all subgroups. Conclusion. The PJI-TNM classification system captures the heterogeneity of PJI and can be applied with substantial inter- and intraobserver reliability. The PJI-TNM educational app aims to facilitate application in clinical practice. A major limitation was the correct assessment of the implant situation. To eliminate this, a re-evaluation according to intraoperative findings is strongly recommended. 
Cite this article: Bone Joint Res 2024;13(1):19–27
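Fleiss' kappa, named above for interobserver reliability, extends chance-corrected agreement to more than two raters. The sketch below takes a table of counts, one row per case and one column per category, where each cell is the number of raters choosing that category. The function name `fleiss_kappa` and the example table are hypothetical; the study's own data are not reproduced here.

```python
def fleiss_kappa(counts):
    """Fleiss' kappa from a cases-by-categories table of rater counts."""
    if not counts or not counts[0]:
        raise ValueError("counts must be a non-empty table")
    n_cases = len(counts)
    n_raters = sum(counts[0])
    if n_raters < 2 or any(sum(row) != n_raters for row in counts):
        raise ValueError("every case needs the same number (at least 2) of ratings")
    n_categories = len(counts[0])
    # Share of all ratings that fall in each category.
    p_category = [sum(row[j] for row in counts) / (n_cases * n_raters)
                  for j in range(n_categories)]
    # Per-case agreement: proportion of rater pairs that chose the same category.
    p_case = [(sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
              for row in counts]
    p_observed = sum(p_case) / n_cases
    p_expected = sum(p * p for p in p_category)
    if p_expected == 1.0:
        raise ValueError("kappa is undefined when all ratings fall in one category")
    return (p_observed - p_expected) / (1 - p_expected)


# Hypothetical table: four cases, ten raters, three categories.
example = [[10, 0, 0], [8, 2, 0], [1, 7, 2], [0, 3, 7]]
print(f"Fleiss' kappa = {fleiss_kappa(example):.2f}")
```

Cohen's kappa, used in the abstract for intraobserver reliability, is the two-rating counterpart and suits a comparison of one observer's first and second reading.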


The Bone & Joint Journal
Vol. 105-B, Issue 1 | Pages 56 - 63
1 Jan 2023
de Klerk HH Oosterhoff JHF Schoolmeesters B Nieboer P Eygendaal D Jaarsma RL IJpma FFA van den Bekerom MPJ Doornberg JN

Aims. This study aimed to answer the following questions: do 3D-printed models lead to a more accurate recognition of the pattern of complex fractures of the elbow?; do 3D-printed models lead to a more reliable recognition of the pattern of these injuries?; and do junior surgeons benefit more from 3D-printed models than senior surgeons? Methods. A total of 15 orthopaedic trauma surgeons (seven juniors, eight seniors) evaluated 20 complex elbow fractures for their overall pattern (i.e. varus posterior medial rotational injury, terrible triad injury, radial head fracture with posterolateral dislocation, anterior (trans-)olecranon fracture-dislocation, posterior (trans-)olecranon fracture-dislocation) and their specific characteristics. First, fractures were assessed based on radiographs and 2D and 3D CT scans; and in a subsequent round, one month later, with additional 3D-printed models. Diagnostic accuracy (acc) and inter-surgeon reliability (κ) were determined for each assessment. Results. Accuracy significantly improved with 3D-printed models for the whole group on pattern recognition (acc_2D/3D = 0.62 vs acc_3Dprint = 0.69; Δacc = 0.07 (95% confidence interval (CI) 0.00 to 0.14); p = 0.025). A significant improvement was also seen in reliability for pattern recognition with the additional 3D-printed models (κ_2D/3D = 0.41 (moderate) vs κ_3Dprint = 0.59 (moderate); Δκ = 0.18 (95% CI 0.14 to 0.22); p ≤ 0.001). Accuracy was comparable between junior and senior surgeons with the 3D-printed model (acc_junior = 0.70 vs acc_senior = 0.68; Δacc = -0.02 (95% CI -0.17 to 0.13); p = 0.904). Reliability was also comparable between junior and senior surgeons without the 3D-printed model (κ_junior = 0.39 (fair) vs κ_senior = 0.43 (moderate); Δκ = 0.03 (95% CI -0.03 to 0.10); p = 0.318). However, junior surgeons showed greater improvement regarding reliability than seniors with 3D-printed models (κ_junior = 0.65 (substantial) vs κ_senior = 0.54 (moderate); Δκ = 0.11 (95% CI 0.04 to 0.18); p = 0.002). Conclusion. The use of 3D-printed models significantly improved the accuracy and reliability of recognizing the pattern of complex fractures of the elbow. However, the current long printing time and non-reusable materials could limit the usefulness of 3D-printed models in clinical practice. They could be suitable as a reusable tool for teaching residents. Cite this article: Bone Joint J 2023;105-B(1):56–63