The Journal of Bone & Joint Surgery British Volume
Vol. 80-B, Issue 4 | Pages 670 - 672
1 Jul 1998
Flinkkilä T Nikkola-Sihto A Kaarela O Päakkö E Raatikainen T

Interobserver reliability of the AO system of classification of fractures of the distal radius was assessed using plain radiographs and CT. Five observers classified 30 Colles’-type fractures using only plain radiographs; two months later they were reclassified using CT in addition. Interobserver reliability was poor in both series when detailed classification was used. By reducing the categories to five, interobserver reliability was slightly improved, but was still poor. When only two AO types were used, the reliability was moderate using plain radiographs and good to excellent with the addition of CT. The use of CT as well as plain radiographs brings interobserver reliability to a good level in assessment of the presence or absence of articular involvement, but is otherwise of minor value in improving the interobserver reliability of the AO system of classification of fractures of the distal radius.


Aims. Classifying trochlear dysplasia (TD) is useful to determine the treatment options for patients suffering from patellofemoral instability (PFI). There is no consensus on which classification system is more reliable and reproducible for the purpose of guiding clinicians’ management of PFI. There are also concerns about the validity of the Dejour Classification (DJC), which is the most widely used classification for TD, having only a fair reliability score. The Oswestry-Bristol Classification (OBC) is a recently proposed system of classification of TD, and the authors report a fair-to-good interobserver agreement and good-to-excellent intraobserver agreement in the assessment of TD. The aim of this study was to compare the reliability and reproducibility of these two classifications. Methods. In all, six assessors (four consultants and two registrars) independently evaluated 100 axial MRIs of the patellofemoral joint (PFJ) for TD and classified them according to OBC and DJC. These assessments were again repeated by all raters after four weeks. The inter- and intraobserver reliability scores were calculated using Cohen’s kappa and Cronbach’s α. Results. Both classifications showed good to excellent interobserver reliability with high α scores. The OBC classification showed a substantial intraobserver agreement (mean kappa 0.628; p < 0.005) whereas the DJC showed a moderate agreement (mean kappa 0.572; p < 0.005). There was no significant difference in the kappa values when comparing the assessments by consultants with those by registrars, in either classification system. Conclusion. This large study from a non-founding institute shows both classification systems to be reliable for classifying TD based on axial MRIs of the PFJ, with the simple-to-use OBC having a higher intraobserver reliability score than that of the DJC. Cite this article: Bone Jt Open 2023;4(7):532–538


The Bone & Joint Journal
Vol. 100-B, Issue 2 | Pages 242 - 246
1 Feb 2018
Ghoshal A Enninghorst N Sisak K Balogh ZJ

Aims. To evaluate interobserver reliability of the Orthopaedic Trauma Association’s open fracture classification system (OTA-OFC). Patients and Methods. Patients of any age with a first presentation of an open long bone fracture were included. Standard radiographs, wound photographs, and a short clinical description were given to eight orthopaedic surgeons, who independently evaluated the injury using both the Gustilo and Anderson (GA) and OTA-OFC classifications. The responses were compared for variability using Cohen’s kappa. Results. The overall interobserver agreement was κ = 0.44 for the GA classification and κ = 0.49 for OTA-OFC, which reflects moderate agreement (0.41 to 0.60) for both classifications. The agreement in the five categories of OTA-OFC was: for skin, κ = 0.55 (moderate); for muscle, κ = 0.44 (moderate); for arterial injury, κ = 0.74 (substantial); for contamination, κ = 0.35 (fair); and for bone loss, κ = 0.41 (moderate). Conclusion. Although the OTA-OFC, with similar interobserver agreement to GA, offers a more detailed description of open fractures, further development may be needed to make it a reliable and robust tool. Cite this article: Bone Joint J 2018;100-B:242–6
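The agreement labels quoted in this abstract (fair, moderate, substantial) are consistent with the commonly cited Landis and Koch bands, although the abstract does not name that scale, so treating it as the source of the labels is an assumption. A minimal Python sketch of the mapping follows; the function name `landis_koch_label` is hypothetical, and the kappa values are the ones reported above.

```python
def landis_koch_label(kappa: float) -> str:
    """Map a kappa value to a Landis and Koch agreement band.

    Assumed bands: < 0 poor, 0.00-0.20 slight, 0.21-0.40 fair,
    0.41-0.60 moderate, 0.61-0.80 substantial, 0.81-1.00 almost perfect.
    """
    if kappa < 0.0:
        return "poor"
    if kappa <= 0.20:
        return "slight"
    if kappa <= 0.40:
        return "fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"


# Kappa values for the five OTA-OFC categories as reported in the abstract.
reported = {
    "skin": 0.55,
    "muscle": 0.44,
    "arterial injury": 0.74,
    "contamination": 0.35,
    "bone loss": 0.41,
}

# Label each category using the assumed bands.
labels = {name: landis_koch_label(value) for name, value in reported.items()}
for name, label in labels.items():
    print(f"{name}: {label}")
```

Under these assumed bands the sketch gives the same labels as the abstract for all five categories.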


The Bone & Joint Journal
Vol. 102-B, Issue 4 | Pages 478 - 484
1 Apr 2020
Daniels AM Wyers CE Janzing HMJ Sassen S Loeffen D Kaarsemaker S van Rietbergen B Hannemann PFW Poeze M van den Bergh JP

Aims

Besides conventional radiographs, the use of MRI, CT, and bone scintigraphy is frequent in the diagnosis of a fracture of the scaphoid. However, which techniques give the best results remains unknown. The investigation of a new imaging technique initially requires an analysis of its precision. The primary aim of this study was to investigate the interobserver agreement of high-resolution peripheral quantitative CT (HR-pQCT) in the diagnosis of a scaphoid fracture. A secondary aim was to investigate the interobserver agreement for the presence of other fractures and for the classification of scaphoid fracture.

Methods

Two radiologists and two orthopaedic trauma surgeons evaluated HR-pQCT scans of 31 patients with a clinically-suspected scaphoid fracture. The observers were asked to determine the presence of a scaphoid or other fracture and to classify the scaphoid fracture based on the Herbert classification system. Fleiss kappa statistics were used to calculate the interobserver agreement for the diagnosis of a fracture. Intraclass correlation coefficients (ICCs) were used to assess the agreement for the classification of scaphoid fracture.


Bone & Joint Research
Vol. 5, Issue 4 | Pages 116 - 121
1 Apr 2016
Leow JM Clement ND Tawonsawatruk T Simpson CJ Simpson AHRW

Objectives

The radiographic union score for tibial (RUST) fractures was developed by Whelan et al to assess the healing of tibial fractures following intramedullary nailing. In the current study, the repeatability and reliability of the RUST score was evaluated in an independent centre (a) using the original description, (b) after further interpretation of the description of the score, and (c) with the immediate post-operative radiograph available for comparison.

Methods

A total of 15 radiographs of tibial shaft fractures treated by intramedullary nailing (IM) were scored by three observers using the RUST system. Following discussion on how the criteria of the RUST system should be implemented, 45 sets (i.e. AP and lateral) of radiographs of IM nailed tibial fractures were scored by five observers. Finally, these 45 sets of radiographs were rescored with the baseline post-operative radiograph available for comparison.


Bone & Joint Open
Vol. 5, Issue 11 | Pages 962 - 970
4 Nov 2024
Suter C Mattila H Ibounig T Sumrein BO Launonen A Järvinen TLN Lähdeoja T Rämö L

Aims. Though most humeral shaft fractures heal nonoperatively, up to one-third may lead to nonunion with inferior outcomes. The Radiographic Union Score for HUmeral Fractures (RUSHU) was created to identify high-risk patients for nonunion. Our study evaluated the RUSHU’s prognostic performance at six and 12 weeks in discriminating nonunion within a significantly larger cohort than before. Methods. Our study included 226 nonoperatively treated humeral shaft fractures. We evaluated the interobserver reliability and intraobserver reproducibility of RUSHU scoring using intraclass correlation coefficients (ICCs). Additionally, we determined the optimal cut-off thresholds for predicting nonunion using the receiver operating characteristic (ROC) method. Results. The RUSHU demonstrated good interobserver reliability with an ICC of 0.78 (95% CI 0.72 to 0.83) at six weeks and 0.77 (95% CI 0.71 to 0.82) at 12 weeks. Intraobserver reproducibility was good or excellent for all analyses. Area under the curve in the ROC analysis was 0.83 (95% CI 0.77 to 0.88) at six weeks and 0.89 (95% CI 0.84 to 0.93) at 12 weeks, indicating excellent discrimination. The optimal cut-off values for predicting nonunion were ≤ eight points at six weeks and ≤ nine points at 12 weeks, providing the best specificity-sensitivity trade-off. Conclusion. The RUSHU proves to be a reliable and reproducible radiological scoring system that aids in identifying patients at risk of nonunion at both six and 12 weeks post-injury during non-surgical treatment of humeral shaft fractures. The statistically optimal cut-off values for predicting nonunion are ≤ eight at six weeks and ≤ nine points at 12 weeks post-injury.


The Bone & Joint Journal
Vol. 103-B, Issue 8 | Pages 1345 - 1350
1 Aug 2021
Czubak-Wrzosek M Nitek Z Sztwiertnia P Czubak J Grzelecki D Kowalczewski J Tyrakowski M

Aims. The aim of the study was to compare two methods of calculating pelvic incidence (PI) and pelvic tilt (PT), either by using the femoral heads or acetabular domes to determine the bicoxofemoral axis, in patients with unilateral or bilateral primary hip osteoarthritis (OA). Methods. PI and PT were measured on standing lateral radiographs of the spine in two groups: 50 patients with unilateral (Group I) and 50 patients with bilateral hip OA (Group II), using the femoral heads or acetabular domes to define the bicoxofemoral axis. Agreement between the methods was determined by intraclass correlation coefficient (ICC) and the standard error of measurement (SEm). The intraobserver reproducibility and interobserver reliability of the two methods were analyzed on 31 radiographs in both groups to calculate ICC and SEm. Results. In both groups, excellent agreement between the two methods was obtained, with ICC of 0.99 and SEm 0.3° for Group I, and ICC 0.99 and SEm 0.4° for Group II. The intraobserver reproducibility was excellent for both methods in both groups, with an ICC of at least 0.97 and SEm not exceeding 0.8°. The study also revealed excellent interobserver reliability for both methods in both groups, with ICC 0.99 and SEm 0.5° or less. Conclusion. Either the femoral heads or acetabular domes can be used to define the bicoxofemoral axis on the lateral standing radiographs of the spine for measuring PI and PT in patients with idiopathic unilateral or bilateral hip OA. Cite this article: Bone Joint J 2021;103-B(8):1345–1350


The Bone & Joint Journal
Vol. 106-B, Issue 5 | Pages 468 - 474
1 May 2024
d'Amato M Flevas DA Salari P Bornes TD Brenneis M Boettner F Sculco PK Baldini A

Aims. Obtaining solid implant fixation is crucial in revision total knee arthroplasty (rTKA) to avoid aseptic loosening, a major reason for re-revision. This study aims to validate a novel grading system that quantifies implant fixation across three anatomical zones (epiphysis, metaphysis, diaphysis). Methods. Based on pre-, intra-, and postoperative assessments, the novel grading system allocates a quantitative score (0, 0.5, or 1 point) for the quality of fixation achieved in each anatomical zone. The criteria used by the algorithm to assign the score include the bone quality, the size of the bone defect, and the type of fixation used. A consecutive cohort of 245 patients undergoing rTKA from 2012 to 2018 were evaluated using the current novel scoring system and followed prospectively. In addition, 100 first-time revision cases were assessed radiologically from the original cohort and graded by three observers to evaluate the intra- and inter-rater reliability of the novel radiological grading system. Results. At a mean follow-up of 90 months (64 to 130), only two out of 245 cases failed due to aseptic loosening. Intraoperative grading yielded mean scores of 1.87 (95% confidence interval (CI) 1.82 to 1.92) for the femur and 1.96 (95% CI 1.92 to 2.0) for the tibia. Only 3.7% of femoral and 1.7% of tibial reconstructions fell below the 1.5-point threshold, which included the two cases of aseptic loosening. Interobserver reliability for postoperative radiological grading was 0.97 for the femur and 0.85 for the tibia. Conclusion. A minimum score of 1.5 points for each skeletal segment appears to be a reasonable cut-off to define sufficient fixation in rTKA. There were no revisions for aseptic loosening at mid-term follow-up when this fixation threshold was achieved or exceeded. When assessing first-time revisions, this novel grading system has shown excellent intra- and interobserver reliability. Cite this article: Bone Joint J 2024;106-B(5):468–474


The Bone & Joint Journal
Vol. 103-B, Issue 8 | Pages 1339 - 1344
1 Aug 2021
Jain S Mohrir G Townsend O Lamb JN Palan J Aderinto J Pandit H

Aims. The aim of this study was to assess the reliability and validity of the Unified Classification System (UCS) for postoperative periprosthetic femoral fractures (PFFs) around cemented polished taper-slip (PTS) stems. Methods. Radiographs of 71 patients with a PFF admitted consecutively at two centres between 25 February 2012 and 19 May 2020 were collated by an independent investigator. Six observers (three hip consultants and three trainees) were familiarized with the UCS. Each PFF was classified on two separate occasions, with a mean time between assessments of 22.7 days (16 to 29). Interobserver reliability for more than two observers was assessed using percentage agreement and Fleiss’ kappa statistic. Intraobserver reliability between two observers was calculated with Cohen kappa statistic. Validity was tested on surgically managed UCS type B PFFs where stem stability was documented in operation notes (n = 50). Validity was assessed using percentage agreement and Cohen kappa statistic between radiological assessment and intraoperative findings. Kappa statistics were interpreted using Landis and Koch criteria. All six observers were blinded to operation notes and postoperative radiographs. Results. Interobserver reliability percentage agreement was 58.5% and the overall kappa value was 0.442 (moderate agreement). Lowest kappa values were seen for type B fractures (0.095 to 0.360). The mean intraobserver reliability kappa value was 0.672 (0.447 to 0.867), indicating substantial agreement. Validity percentage agreement was 65.7% and the mean kappa value was 0.300 (0.160 to 0.440) indicating only fair agreement. Conclusion. This study demonstrates that the UCS is unsatisfactory for the classification of PFFs around PTS stems, and that it has considerably lower reliability and validity than previously described for other stem types. Radiological PTS stem loosening in the presence of PFF is poorly defined and formal intraoperative testing of stem stability is recommended. Cite this article: Bone Joint J 2021;103-B(8):1339–1344
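For readers unfamiliar with the statistic, Fleiss' kappa extends chance-corrected agreement to more than two observers. The following is a minimal Python sketch of the standard formula, not the authors' analysis code. The function name `fleiss_kappa` and the toy ratings are hypothetical; the input is assumed to be a table with one row per case and one column per category, each cell holding the number of observers who chose that category.

```python
def fleiss_kappa(counts):
    """Compute Fleiss' kappa from a cases-by-categories table of rater counts."""
    n_cases = len(counts)
    n_raters = sum(counts[0])
    if n_raters < 2 or any(sum(row) != n_raters for row in counts):
        raise ValueError("every case needs the same number (>= 2) of ratings")
    n_categories = len(counts[0])

    # Share of all ratings that fell into each category.
    p_category = [
        sum(row[j] for row in counts) / (n_cases * n_raters)
        for j in range(n_categories)
    ]
    # Observed agreement among rater pairs, computed per case.
    p_case = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_observed = sum(p_case) / n_cases
    p_expected = sum(p * p for p in p_category)
    if p_expected == 1.0:
        raise ValueError("kappa is undefined when all ratings share one category")
    return (p_observed - p_expected) / (1.0 - p_expected)


# Made-up ratings: three cases, six observers, three categories.
toy_counts = [
    [6, 0, 0],  # all six observers agree
    [3, 3, 0],  # observers split evenly between two categories
    [1, 2, 3],  # observers spread over all three categories
]
print(round(fleiss_kappa(toy_counts), 3))
```

The toy table uses six observers only to mirror the study design; the values carry no clinical meaning.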


The Bone & Joint Journal
Vol. 103-B, Issue 8 | Pages 1380 - 1385
2 Aug 2021
Kim Y Ryu J Kim JK Al-Dhafer BAA Shin YH

Aims. The aim of this study was to assess arthritis of the basal joint of the thumb quantitatively using bone single-photon emission CT/CT (SPECT/CT) and evaluate its relationship with patients’ pain and function. Methods. We retrospectively reviewed 30 patients (53 hands) with symptomatic basal joint arthritis of the thumb between April 2019 and March 2020. Visual analogue scale (VAS) scores for pain, grip strength, and pinch power of both hands and Patient-Rated Wrist/Hand Evaluation (PRWHE) scores were recorded for all patients. Basal joint arthritis was classified according to the modified Eaton-Glickel stage using routine radiographs and the CT scans of SPECT/CT, respectively. The maximum standardized uptake value (SUVmax) from SPECT/CT was measured in the four peritrapezial joints and the highest uptake was used for analysis. Results. According to Eaton-Glickel classification, 11, 17, 17, and eight hands were stage 0 to I, II, III, and IV, respectively. The interobserver reliability for determining the stage of arthritis was moderate for radiographs (k = 0.41) and substantial for CT scans (k = 0.67). In a binary categorical analysis using SUVmax, pain (p < 0.001) and PRWHE scores (p = 0.004) were significantly higher in hands with higher SUVmax. Using multivariate linear regression to estimate the pain VAS, only SUVmax (B 0.172 (95% confidence interval (CI) 0.065 to 0.279); p = 0.002) showed a significant association. Estimating the variation of PRWHE scores using the same model, only SUVmax (B 1.378 (95% CI 0.082 to 2.674); p = 0.038) showed a significant association. Conclusion. The CT scans of SPECT/CT provided better interobserver reliability than routine radiographs for evaluating the severity of arthritis. A higher SUVmax in SPECT/CT was associated with more pain and functional disabilities of basal joint arthritis of the thumb. This approach could be used to complement radiographs for the evaluation of patients with this condition.
Cite this article: Bone Joint J 2021;103-B(8):1380–1385


Bone & Joint Research
Vol. 13, Issue 1 | Pages 19 - 27
5 Jan 2024
Baertl S Rupp M Kerschbaum M Morgenstern M Baumann F Pfeifer C Worlicek M Popp D Amanatullah DF Alt V

Aims. This study aimed to evaluate the clinical application of the PJI-TNM classification for periprosthetic joint infection (PJI) by determining intraobserver and interobserver reliability. To facilitate its use in clinical practice, an educational app was subsequently developed and evaluated. Methods. A total of ten orthopaedic surgeons classified 20 cases of PJI based on the PJI-TNM classification. Subsequently, the classification was re-evaluated using the PJI-TNM app. Classification accuracy was calculated separately for each subcategory (reinfection, tissue and implant condition, non-human cells, and morbidity of the patient). Fleiss’ kappa and Cohen’s kappa were calculated for interobserver and intraobserver reliability, respectively. Results. Overall, interobserver and intraobserver agreements were substantial across the 20 classified cases. Analyses for the variable ‘reinfection’ revealed an almost perfect interobserver and intraobserver agreement with a classification accuracy of 94.8%. The category 'tissue and implant conditions' showed moderate interobserver and substantial intraobserver reliability, while the classification accuracy was 70.8%. For 'non-human cells,' accuracy was 81.0% and interobserver agreement was moderate with an almost perfect intraobserver reliability. The classification accuracy of the variable 'morbidity of the patient' reached 73.5% with a moderate interobserver agreement, whereas the intraobserver agreement was substantial. The application of the app yielded comparable results across all subgroups. Conclusion. The PJI-TNM classification system captures the heterogeneity of PJI and can be applied with substantial inter- and intraobserver reliability. The PJI-TNM educational app aims to facilitate application in clinical practice. A major limitation was the correct assessment of the implant situation. To eliminate this, a re-evaluation according to intraoperative findings is strongly recommended. 
Cite this article: Bone Joint Res 2024;13(1):19–27


The Bone & Joint Journal
Vol. 105-B, Issue 1 | Pages 21 - 28
1 Jan 2023
Ndlovu S Naqshband M Masunda S Ndlovu K Chettiar K Anugraha A

Aims. Clinical management of open fractures is challenging and frequently requires complex reconstruction procedures. The Gustilo-Anderson classification lacks uniform interpretation, has poor interobserver reliability, and fails to account for injuries to musculotendinous units and bone. The Ganga Hospital Open Injury Severity Score (GHOISS) was designed to address these concerns. The major aim of this review was to ascertain the evidence available on accuracy of the GHOISS in predicting successful limb salvage in patients with mangled limbs. Methods. We searched electronic databases including PubMed, CENTRAL, EMBASE, CINAHL, Scopus, and Web of Science to identify studies that employed the GHOISS risk tool in managing complex limb injuries published from April 2006, when the score was introduced, until April 2021. Primary outcome was the measured sensitivity and specificity of the GHOISS risk tool for predicting amputation at a specified threshold score. Secondary outcomes included length of stay, need for plastic surgery, deep infection rate, time to fracture union, and functional outcome measures. Diagnostic test accuracy meta-analysis was performed using a random effects bivariate binomial model. Results. We identified 1,304 records, of which six prospective cohort studies and two retrospective cohort studies evaluating a total of 788 patients were deemed eligible for inclusion. A diagnostic test meta-analysis conducted on five cohort studies, with 474 participants, showed that GHOISS at a threshold score of 14 has a pooled sensitivity of 93.4% (95% confidence interval (CI) 78.4 to 98.2) and a specificity of 95% (95% CI 88.7 to 97.9) for predicting primary or secondary amputations in people with complex lower limb injuries. Conclusion. GHOISS is highly accurate in predicting success of limb salvage, and can inform management and predict secondary outcomes. However, there is a need for high-quality multicentre trials to confirm these findings and investigate the effectiveness of the score in children, and in predicting secondary amputations. Cite this article: Bone Joint J 2023;105-B(1):21–28


The Bone & Joint Journal
Vol. 106-B, Issue 3 | Pages 227 - 231
1 Mar 2024
Todd NV Casey A Birch NC

The diagnostic sub-categorization of cauda equina syndrome (CES) is used to aid communication between doctors and other healthcare professionals. It is also used to determine the need for, and urgency of, MRI and surgery in these patients. A recent paper by Hoeritzauer et al (2023) in this journal examined the interobserver reliability of the widely accepted subcategories in 100 patients with cauda equina syndrome. They found that there is no useful interobserver agreement for the subcategories, even for experienced spinal surgeons. This observation is supported by the largest prospective study of the treatment of cauda equina syndrome in the UK by Woodfield et al (2023). If the accepted subcategories are unreliable, they cannot be used in the way that they are currently, and they should be revised or abandoned. This paper presents a reassessment of the diagnostic and prognostic subcategories of cauda equina syndrome in the light of this evidence, with a suggested cure based on a more inclusive synthesis of symptoms, signs, bladder ultrasound scan results, and pre-intervention urinary catheterization. Cite this article: Bone Joint J 2024;106-B(3):227–231


Bone & Joint Open
Vol. 3, Issue 5 | Pages 423 - 431
1 May 2022
Leong JWY Singhal R Whitehouse MR Howell JR Hamer A Khanduja V Board TN

Aims. The aim of this modified Delphi process was to create a structured Revision Hip Complexity Classification (RHCC) which can be used as a tool to help direct multidisciplinary team (MDT) discussions of complex cases in local or regional revision networks. Methods. The RHCC was developed with the help of a steering group and an invitation through the British Hip Society (BHS) to members to apply, forming an expert panel of 35. We ran a mixed-method modified Delphi process (three rounds of questionnaires and one virtual meeting). Round 1 consisted of identifying the factors that govern the decision-making and complexities, with weighting given to factors considered most important by experts. Participants were asked to identify classification systems where relevant. Rounds 2 and 3 focused on grouping each factor into H1, H2, or H3, creating a hierarchy of complexity. This was followed by a virtual meeting in an attempt to achieve consensus on the factors which had not achieved consensus in preceding rounds. Results. The expert group achieved strong consensus in 32 out of 36 factors following the Delphi process. The RHCC used the existing Paprosky (acetabulum and femur), Unified Classification System, and American Society of Anesthesiologists (ASA) classification systems. Patients with ASA grade III/IV are recognized with a qualifier of an asterisk added to the final classification. The classification has good intraobserver and interobserver reliability with Kappa values of 0.88 to 0.92 and 0.77 to 0.85, respectively. Conclusion. The RHCC has been developed through a modified Delphi technique. RHCC will provide a framework to allow discussion of complex cases as part of a local or regional hip revision MDT. We believe that adoption of the RHCC will provide a comprehensive and reproducible method to describe each patient’s case with regard to surgical complexity, in addition to medical comorbidities that may influence their management. 
Cite this article: Bone Jt Open 2022;3(5):423–431


The Journal of Bone & Joint Surgery British Volume
Vol. 91-B, Issue 6 | Pages 766 - 771
1 Jun 2009
Brunner A Honigmann P Treumann T Babst R

We evaluated the impact of stereo-visualisation of three-dimensional volume-rendering CT datasets on the inter- and intraobserver reliability assessed by kappa values on the AO/OTA and Neer classifications in the assessment of proximal humeral fractures. Four independent observers classified 40 fractures according to the AO/OTA and Neer classifications using plain radiographs, two-dimensional CT scans and with stereo-visualised three-dimensional volume-rendering reconstructions. Both classification systems showed moderate interobserver reliability with plain radiographs and two-dimensional CT scans. Three-dimensional volume-rendered CT scans improved the interobserver reliability of both systems to good. Intraobserver reliability was moderate for both classifications when assessed by plain radiographs. Stereo visualisation of three-dimensional volume rendering improved intraobserver reliability to good for the AO/OTA method and to excellent for the Neer classification. These data support our opinion that stereo visualisation of three-dimensional volume-rendering datasets is of value when analysing and classifying complex fractures of the proximal humerus.


Bone & Joint Open
Vol. 2, Issue 10 | Pages 858 - 864
18 Oct 2021
Guntin J Plummer D Della Valle C DeBenedetti A Nam D

Aims. Prior studies have identified that malseating of a modular dual mobility liner can occur, with previous reported incidences between 5.8% and 16.4%. The aim of this study was to determine the incidence of malseating in dual mobility implants at our institution, assess for risk factors for liner malseating, and investigate whether liner malseating has any impact on clinical outcomes after surgery. Methods. We retrospectively reviewed the radiographs of 239 primary and revision total hip arthroplasties with a modular dual mobility liner. Two independent reviewers assessed radiographs for each patient twice for evidence of malseating, with a third observer acting as a tiebreaker. Univariate analysis was conducted to determine risk factors for malseating with Youden’s index used to identify cut-off points. Cohen’s kappa test was used to measure interobserver and intraobserver reliability. Results. In all, 12 liners (5.0%), including eight Stryker (6.8%) and four Zimmer Biomet (3.3%), had radiological evidence of malseating. Interobserver reliability was found to be 0.453 (95% confidence interval (CI) 0.26 to 0.64), suggesting weak inter-rater agreement, with strong agreement being greater than 0.8. We found component size of 50 mm or less to be associated with liner malseating on univariate analysis (p = 0.031). Patients with malseated liners appeared to have no associated clinical consequences, and none required revision surgery at a mean of 14 months (1.4 to 99.2) postoperatively. Conclusion. The incidence of liner malseating was 5.0%, which is similar to other reports. Component size of 50 mm or smaller was identified as a risk factor for malseating. Surgeons should be aware that malseating can occur and implant design changes or changes in instrumentation should be considered to lower the risk of malseating. Although further follow-up is needed, it remains to be seen if malseating is associated with any clinical consequences. 
Cite this article: Bone Jt Open 2021;2(10):858–864
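The abstract above mentions Youden's index for identifying cut-off points. As a minimal sketch of that technique, the index J = sensitivity + specificity - 1 can be evaluated at each candidate cut-off and the maximum taken. The function name `youden_best_cutoff`, the component sizes, and the outcome flags below are all hypothetical, and the rule "positive when the value is at or below the cut-off" is an assumption chosen to match a small-component risk factor.

```python
def youden_best_cutoff(values, is_positive):
    """Return (cutoff, J) maximising Youden's J = sensitivity + specificity - 1.

    A case is predicted positive when its value is <= cutoff (assumed rule).
    """
    n_pos = sum(is_positive)
    n_neg = len(is_positive) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one positive and one negative case")

    best_cutoff, best_j = None, -1.0
    for cutoff in sorted(set(values)):
        # True positives: positive cases at or below the cut-off.
        tp = sum(1 for v, p in zip(values, is_positive) if p and v <= cutoff)
        # True negatives: negative cases above the cut-off.
        tn = sum(1 for v, p in zip(values, is_positive) if not p and v > cutoff)
        j = tp / n_pos + tn / n_neg - 1.0
        if j > best_j:
            best_cutoff, best_j = cutoff, j
    return best_cutoff, best_j


# Made-up component sizes (mm) and outcome flags, for illustration only.
sizes = [46, 48, 50, 52, 54, 56]
malseated = [True, False, True, False, False, False]
cutoff, j = youden_best_cutoff(sizes, malseated)
print(cutoff, j)
```

Ties are resolved in favour of the smallest cut-off, which is a design choice of this sketch rather than part of the index itself.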


The Journal of Bone & Joint Surgery British Volume
Vol. 93-B, Issue 6 | Pages 777 - 781
1 Jun 2011
Kalra S Smith TO Berko B Walton NP

The Oxford unicompartmental knee replacement gives good results in patients with symptomatic osteoarthritis of the medial compartment. Previous studies have suggested that the presence of radiolucent lines (RLLs) does not reflect a poor outcome in such patients. However, the reliability and validity of this assessment have not been determined. Our aim was to assess the intra- and interobserver reliability and the sensitivity and specificity of the assessment of RLLs around both tibial and femoral components using standard radiographs. Two reviewers assessed the radiographs of 45 patients who had loosening of the tibial or femoral component confirmed at revision surgery and compared them with those of a series of 45 asymptomatic patients matched for age and gender. The results suggested that, using standard radiographs, tibial RLLs were 63.6% sensitive and 94.4% specific and femoral RLLs 63.9% sensitive and 72.7% specific for loosening. Overall intra- and interobserver reliability was highly variable, but zonal analysis showed that lucency at the tip of the femoral peg was significantly associated with loosening of the femoral component. Fluoroscopically guided radiographs may improve the zonal reliability of the assessment of RLLs, but further independent and comparative studies are required. In the meantime, the innocence of the physiological RLLs detected by standard radiographs should be viewed with caution.


The Journal of Bone & Joint Surgery British Volume
Vol. 87-B, Issue 9 | Pages 1267 - 1271
1 Sep 2005
Allami MK Jamil W Fourie B Ashton V Gregg PJ

The Department of Health and the Public Health Laboratory Service established the Nosocomial Infection National Surveillance Scheme in order to standardise the collection of information about infections acquired in hospital in the United Kingdom and provide national data with which hospitals could measure their own performance. The definition of superficial incisional infection (skin and subcutaneous tissue), set by the Center for Disease Control (CDC), should meet at least one of the defined criteria which would confirm the diagnosis and determine the need for specific treatment. We have assessed the interobserver reliability of the criteria for superficial incisional infection set by the CDC in our current practice. The incisional site of 50 patients who had an elective primary arthroplasty of the hip or knee was evaluated independently by two orthopaedic clinical research fellows and two orthopaedic ward sisters for the presence or absence of surgical-site infection. Interobserver reliability was assessed by comparison of the criteria for wound infection used by the four observers using kappa reliability coefficients. Our study demonstrated that some of the components of the current CDC criteria were unreliable and we recommend their revision.


The Bone & Joint Journal
Vol. 95-B, Issue 10 | Pages 1396 - 1401
1 Oct 2013
Gabbe BJ Esser M Bucknill A Russ MK Hofstee D Cameron PA Handley C deSteiger RN

We describe the routine imaging practices of Level 1 trauma centres for patients with severe pelvic ring fractures, and the interobserver reliability of the classification systems of these fractures using plain radiographs and three-dimensional (3D) CT reconstructions. Clinical and imaging data for 187 adult patients (139 men and 48 women, mean age 43 years (15 to 101)) with a severe pelvic ring fracture managed at two Level 1 trauma centres between July 2007 and June 2010 were extracted. Three experienced orthopaedic surgeons classified the plain radiographs and 3D CT reconstruction images of 100 patients using the Tile/AO and Young–Burgess systems. Reliability was compared using kappa statistics. A total of 115 patients (62%) had plain radiographs as well as two-dimensional (2D) CT and 3D CT reconstructions, 52 patients (28%) had plain films only, 12 (6.4%) had 2D and 3D CT reconstructions images only, and eight patients (4.3%) had no available images. The plain radiograph was limited to an anteroposterior pelvic view. Patients without imaging, or only plain films, were more severely injured. A total of 72 patients (39%) were imaged with a pelvic binder in situ. Interobserver reliability for the Tile/AO (Kappa 0.10 to 0.17) and Young–Burgess (Kappa 0.09 to 0.21) was low, and insufficient for clinical and research purposes. Severe pelvic ring fractures are difficult to classify due to their complexity, the increasing use of early treatment such as with pelvic binders, and the absence of imaging altogether in important patient sub-groups, such as those who die early of their injuries. Cite this article: Bone Joint J 2013;95-B:1396–1401


The Bone & Joint Journal
Vol. 102-B, Issue 8 | Pages 1041 - 1047
1 Aug 2020
Hamoodi Z Singh J Elvey MH Watts AC

Aims. The Wrightington classification system of fracture-dislocations of the elbow divides these injuries into six subtypes depending on the involvement of the coronoid and the radial head. The aim of this study was to assess the reliability and reproducibility of this classification system. Methods. This was a blinded study using radiographs and CT scans of 48 consecutive patients managed according to the Wrightington classification system between 2010 and 2018. Four trauma and orthopaedic consultants, two post CCT fellows, and one speciality registrar based in the UK classified the injuries. The seven observers reviewed preoperative radiographs and CT scans twice, with a minimum four-week interval. Radiographs and CT scans were reviewed separately. Inter- and intraobserver reliability were calculated using Fleiss and Cohen kappa coefficients. The Landis and Koch criteria were used to interpret the strength of the kappa values. Validity was assessed by calculating the percentage agreement against intraoperative findings. Results. Of the 48 patients, three (6%) had type A injury, 11 (23%) type B, 16 (33%) type B+, 16 (33%) type C, two (4%) type D+, and none had a type D injury. All 48 patients had anteroposterior (AP) and lateral radiographs, 44 had 2D CT scans, and 39 had 3D reconstructions. The interobserver reliability kappa value was 0.52 for radiographs, 0.71 for 2D CT scans, and 0.73 for a combination of 2D and 3D reconstruction CT scans. The median intraobserver reliability was 0.75 (interquartile range (IQR) 0.62 to 0.79) for radiographs, 0.77 (IQR 0.73 to 0.94) for 2D CT scans, and 0.89 (IQR 0.77 to 0.93) for the combination of 2D and 3D reconstruction. Validity analysis showed that accuracy significantly improved when using CT scans (p = 0.018 and p = 0.028 respectively). Conclusion. The Wrightington classification system is a reliable and valid method of classifying fracture-dislocations of the elbow. CT scans are significantly more accurate than radiographs when identifying the pattern of injury, with good intra- and interobserver reproducibility. Cite this article: Bone Joint J 2020;102-B(8):1041–1047
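The Landis and Koch criteria named in the abstract above map a kappa value to a verbal strength of agreement. A minimal sketch of that mapping follows, using the commonly quoted bands (below 0 poor, 0.00 to 0.20 slight, 0.21 to 0.40 fair, 0.41 to 0.60 moderate, 0.61 to 0.80 substantial, 0.81 to 1.00 almost perfect). The function name is an assumption for illustration, and the abstract does not state how values between bands were handled.

```python
def landis_koch(kappa):
    """Return the Landis and Koch agreement label for a kappa value."""
    if not -1.0 <= kappa <= 1.0:
        raise ValueError("kappa must lie between -1 and 1")
    if kappa < 0.0:
        return "poor"
    if kappa <= 0.20:
        return "slight"
    if kappa <= 0.40:
        return "fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"


# Interobserver kappa values reported in the abstract above.
for modality, kappa in [("radiographs", 0.52), ("2D CT", 0.71), ("2D and 3D CT", 0.73)]:
    print(modality, kappa, landis_koch(kappa))
```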


The Bone & Joint Journal
Vol. 100-B, Issue 5 | Pages 596 - 602
1 May 2018
Bock P Pittermann M Chraim M Rois S

Aims. Various radiological parameters are used to evaluate a flatfoot deformity and their measurements may differ. The aims of this study were to answer the following questions: 1) Which of the 11 parameters have the best inter- and intraobserver reliability in a standardized radiological setting? 2) Are pre- and postoperative assessments equally reliable? 3) What are the identifiable sources of variation? Patients and Methods. Measurements of the 11 parameters were recorded on anteroposterior and lateral weight-bearing radiographs of 38 feet before and after surgery for flatfoot, by three observers with different experience in foot surgery (A, ten years; B, three years; C, third-year orthopaedic resident). The inter- and intraobserver reliability was calculated. Results. Preoperative interobserver reliability was high for four, moderate for five, and low for two parameters. Postoperative interobserver reliability was high for four, moderate for five, and low for two parameters. Intraobserver reliability was excellent for all parameters preoperatively as recorded by observer A (PB) and B (MP), and for eight parameters as recorded by observer C (SR). Intraobserver reliability was excellent for ten parameters postoperatively as recorded by observer A and B, and for eight parameters as recorded by observer C. Conclusion. The following parameters can be recommended. For preoperative and postoperative evaluation of flatfoot: anteroposterior, talonavicular coverage angle; lateral, talometatarsal I angle, calcaneal pitch angle, and cuneiform-medial height (high interobserver reliability); and anteroposterior, talometatarsal II angle; lateral, talocalcaneal angle, tibiocalcaneal angle (moderate interobserver reliability). For more experienced observers, we also recommend the anteroposterior talometatarsal I angle (moderate reliability). The inter- and intraobserver reliability for most parameters were similar pre- and postoperatively. The experience of the observer and the definition and ability to measure the parameters themselves were sources of variation. Cite this article: Bone Joint J 2018;100-B:596–602


The Bone & Joint Journal
Vol. 102-B, Issue 3 | Pages 301 - 309
1 Mar 2020
Keenan OJF Holland G Maempel JF Keating JF Scott CEH

Aims. Although knee osteoarthritis (OA) is diagnosed and monitored radiologically, actual full-thickness cartilage loss (FTCL) has rarely been correlated with radiological classification. This study aims to analyze which classification system correlates best with FTCL and to assess their reliability. Methods. A prospective study of 300 consecutive patients undergoing unilateral total knee arthroplasty (TKA) for OA (mean age 69 years (44 to 91; standard deviation (SD) 9.5), 178 (59%) female). Two blinded examiners independently graded preoperative radiographs using five common systems: Kellgren-Lawrence (KL); International Knee Documentation Committee (IKDC); Fairbank; Brandt; and Ahlbäck. Interobserver agreement was assessed using the intraclass correlation coefficient (ICC). Intraoperatively, anterior cruciate ligament (ACL) status and the presence of FTCL in 16 regions of interest were recorded. Radiological classification and FTCL were correlated using the Spearman correlation coefficient. Results. Knees had a mean of 6.8 regions of FTCL (SD 3.1), most common medially. The commonest patterns of FTCL were medial ± patellofemoral (143/300, 48%) and tricompartmental (89/300, 30%). ACL status was associated with pattern of FTCL (p = 0.023). All radiological classification systems demonstrated moderate ICC, but this was highest for the IKDC: whole knee 0.68 (95% confidence interval (CI) 0.60 to 0.74); medial compartment 0.84 (95% CI 0.80 to 0.87); and lateral compartment 0.79 (95% CI 0.73 to 0.83). Correlation with actual FTCL was strongest for Ahlbäck (Spearman rho 0.27 to 0.39) and KL (0.30 to 0.33) systems, although all systems demonstrated medium correlation. The Ahlbäck score was the most discriminating in severe knee OA. Osteophyte presence in the medial compartment had high positive predictive value (PPV) for FTCL, but not in the lateral compartment. Conclusion. The Ahlbäck and KL systems had the highest correlation with confirmed cartilage loss at TKA. However, the IKDC system displayed the best interobserver reliability, with favourable correlation with FTCL in medial and lateral compartments, although it was less discriminating in more severe disease. Cite this article: Bone Joint J 2020;102-B(3):301–309


Bone & Joint Research
Vol. 7, Issue 7 | Pages 468 - 475
1 Jul 2018
He Q Sun H Shu L Zhu Y Xie X Zhan Y Luo C

Objectives. Researchers continue to seek easier ways to evaluate the quality of bone and screen for osteoporosis and osteopenia. Until recently, radiographic images of various parts of the body, except the distal femur, have been reappraised in the light of dual-energy X-ray absorptiometry (DXA) findings. The incidence of osteoporotic fractures around the knee joint in the elderly continues to increase. The aim of this study was to propose two new radiographic parameters of the distal femur for the assessment of bone quality. Methods. Anteroposterior radiographs of the knee and bone mineral density (BMD) and T-scores from DXA scans of 361 healthy patients were prospectively analyzed. The mean cortical bone thickness (CBTavg) and the distal femoral cortex index (DFCI) were the two parameters that were proposed and measured. Intra- and interobserver reliabilities were assessed. Correlations between the BMD and T-score and these parameters were investigated and their value in the diagnosis of osteoporosis and osteopenia was evaluated. Results. The DFCI, as a ratio, had higher reliability than the CBTavg. Both showed significant correlation with BMD and T-score. When compared with DFCI, CBTavg showed better correlation and was better for predicting osteoporosis and osteopenia. Conclusion. The CBTavg and DFCI are simple and reliable screening tools for the prediction of osteoporosis and osteopenia. The CBTavg is more accurate but the DFCI is easier to use in clinical practice. Cite this article: Q-F. He, H. Sun, L-Y. Shu, Y. Zhu, X-T. Xie, Y. Zhan, C-F. Luo. Radiographic predictors for bone mineral loss: Cortical thickness and index of the distal femur. Bone Joint Res 2018;7:468–475. DOI: 10.1302/2046-3758.77.BJR-2017-0332.R1


The Bone & Joint Journal
Vol. 101-B, Issue 12 | Pages 1578 - 1584
1 Dec 2019
Batailler C Weidner J Wyatt M Pfluger D Beck M

Aims. A borderline dysplastic hip can behave as either stable or unstable and this makes surgical decision making challenging. While an unstable hip may be best treated by acetabular reorientation, stable hips can be treated arthroscopically. Several imaging parameters can help to identify the appropriate treatment, including the Femoro-Epiphyseal Acetabular Roof (FEAR) index, measured on plain radiographs. The aim of this study was to assess the reliability and the sensitivity of FEAR index on MRI compared with its radiological measurement. Patients and Methods. The technique of measuring the FEAR index on MRI was defined and its reliability validated. A retrospective study assessed three groups of 20 patients: an unstable group of ‘borderline dysplastic hips’ with lateral centre edge angle (LCEA) less than 25° treated successfully by periacetabular osteotomy; a stable group of ‘borderline dysplastic hips’ with LCEA less than 25° treated successfully by impingement surgery; and an asymptomatic control group with LCEA between 25° and 35°. The following measurements were performed on both standardized radiographs and on MRI: LCEA, acetabular index, femoral anteversion, and FEAR index. Results. The FEAR index showed excellent intraobserver and interobserver reliability on both MRI and radiographs. The FEAR index was more reliable on radiographs than on MRI. The FEAR index on MRI was lower in the stable borderline group (mean -4.2° (SD 9.1°)) compared with the unstable borderline group (mean 7.9° (SD 6.8°)). With a FEAR index cut-off value of 2°, 90% of patients were correctly identified as stable or unstable using the radiological FEAR index, compared with 82.5% using the FEAR index on MRI. The FEAR index was a better predictor of instability on plain radiographs than on MRI. Conclusion. The FEAR index measured on MRI is less reliable and less sensitive than the FEAR index measured on radiographs. The cut-off value of 2° for radiological FEAR index predicted hip stability with 90% probability. Cite this article: Bone Joint J 2019;101-B:1578–1584


The Bone & Joint Journal
Vol. 100-B, Issue 2 | Pages 176 - 182
1 Feb 2018
Petrie MJ Blakey CM Chadwick C Davies HG Blundell CM Davies MB

Aims. Fractures of the navicular can occur in isolation but, owing to the intimate anatomical and biomechanical relationships, are often associated with other injuries to the neighbouring bones and joints in the foot. As a result, they can lead to long-term morbidity and poor function. Our aim in this study was to identify patterns of injury in a new classification system of traumatic fractures of the navicular, with consideration being given to the commonly associated injuries to the midfoot. Patients and Methods. We undertook a retrospective review of 285 consecutive patients presenting over an eight-year period with a fracture of the navicular. Five common patterns of injury were identified and classified according to the radiological features. Type 1 fractures are dorsal avulsion injuries related to the capsule of the talonavicular joint. Type 2 fractures are isolated avulsion injuries to the tuberosity of the navicular. Type 3 fractures are a variant of tarsometatarsal fracture/dislocations creating instability of the medial ray. Type 4 fractures involve the body of the navicular with no associated injury to the lateral column and type 5 fractures occur in conjunction with disruption of the midtarsal joint with crushing of the medial or lateral, or both, columns of the foot. Results. In order to test the reliability and reproducibility of this new classification, a cohort of 30 patients with a fracture of the navicular were classified by six independent assessors at two separate times, six months apart. Interobserver reliability and intraobserver reproducibility both had substantial agreement, with kappa values of 0.80 and 0.72, respectively. Conclusion. We propose a logical, all-inclusive, and mutually exclusive classification system for fractures of the navicular that gives associated injuries involving the lateral column due consideration. We have shown that this system is reliable and reproducible and have described the rationale for the subsequent treatment of each type. Cite this article: Bone Joint J 2018;100-B:176–82


The Journal of Bone & Joint Surgery British Volume
Vol. 86-B, Issue 3 | Pages 413 - 425
1 Apr 2004
Edelson G Kelly I Vigder F Reis ND

Existing classifications of fractures of the head of the humerus are inadequate in terms of interobserver reliability and the predictability of the clinical outcome. From a combined study of 73 fracture specimens in museums and 84 CT-three-dimensional reconstructions in patients, we have devised a classification which appears to be more useful clinically. Common patterns of fracture and a plausible mechanism of injury were observed. In 3-D most proximal humeral fractures can be organised into five basic types. These correspond in some degree to the Codman/Neer classification, but differ significantly in regard to the more complex patterns of fracture. We observed a logical progression from simple to complex fractures. An interobserver reliability study was carried out which indicated the improved usefulness of this new 3-D concept in providing a common language among clinicians for classifying these injuries. When surgery is indicated, the 3-D concept is also invaluable in guiding the restitution of anatomy through either open or percutaneous means


The Journal of Bone & Joint Surgery British Volume
Vol. 91-B, Issue 9 | Pages 1191 - 1196
1 Sep 2009
Pagenstert GI Barg A Leumann AG Rasch H Müller-Brand J Hintermann B Valderrabano V

The precise localisation of osteoarthritic changes is crucial for selective surgical treatment. Single photon-emission CT-CT (SPECT-CT) combines both morphological and biological information. We hypothesised that SPECT-CT increased the intra- and interobserver reliability to localise increased uptake compared with traditional evaluation of CT and bone scanning together. We evaluated 20 consecutive patients with pain of uncertain origin in the foot and ankle by radiography and SPECT-CT, available as fused SPECT-CT, and by separate bone scanning and CT. Five observers assessed the presence or absence of arthritis. The images were blinded and randomly ordered. They were evaluated twice at an interval of six weeks. Kappa and multirater kappa values were calculated. The mean intraobserver reliability for SPECT-CT was excellent (κ = 0.86; 95% CI 0.81 to 0.88) and significantly higher than that for CT and bone scanning together. SPECT-CT had significantly higher interobserver agreement, especially when evaluating the naviculocuneiform and tarsometatarsal joints. SPECT-CT is useful in localising active arthritis especially in areas where the number and configuration of joints are complex


Bone & Joint Research
Vol. 2, Issue 1 | Pages 1 - 8
1 Jan 2013
Costa AJ Lustig S Scholes CJ Balestro J Fatima M Parker DA

Objectives. There remains a lack of data on the reliability of methods to estimate tibial coverage achieved during total knee replacement. In order to address this gap, the intra- and interobserver reliability of a three-dimensional (3D) digital templating method was assessed with one symmetric and one asymmetric prosthesis design. Methods. A total of 120 template procedures were performed according to specific rotational and over-hang criteria by three observers at time zero and again two weeks later. Total and sub-region coverage were calculated and the reliability of the templating and measurement method was evaluated. Results. Excellent intra- and interobserver reliability was observed for total coverage, when minimal component overhang (intraclass correlation coefficient (ICC) = 0.87) or no component overhang (ICC = 0.92) was permitted, regardless of rotational restrictions. Conclusions. Measurement of tibial coverage can be reliable using the templating method described even if the rotational axis selected still has a minor influence


The Journal of Bone & Joint Surgery British Volume
Vol. 93-B, Issue 5 | Pages 629 - 633
1 May 2011
Hirschmann MT Konala P Amsler F Iranpour F Friederich NF Cobb JP

We studied the intra- and interobserver reliability of measurements of the position of the components after total knee replacement (TKR) using a combination of radiographs and axial two-dimensional (2D) and three-dimensional (3D) reconstructed CT images to identify which method is best for this purpose. A total of 30 knees after primary TKR were assessed by two independent observers (an orthopaedic surgeon and a radiologist) using radiographs and CT scans. Plain radiographs were highly reliable at measuring the tibial slope, but showed wide variability for all other measurements; 2D-CT also showed wide variability. 3D-CT was highly reliable, even when measuring rotation of the femoral components, and significantly better than 2D-CT. Interobserver reliability of the measurements on radiographs was good (intraclass correlation coefficient (ICC) 0.65 to 0.82), but that of rotational measurements on 2D-CT was poor (ICC 0.29). On 3D-CT it was near perfect (ICC 0.89 to 0.99), and significantly better than on 2D-CT (p < 0.001). 3D-reconstructed images are sufficiently reliable to enable reporting of the position and orientation of the components. Rotational measurements in particular should be performed on 3D-reconstructed CT images. When faced with a poorly functioning TKR with concerns over component positioning, we recommend 3D-CT as the investigation of choice.


The Bone & Joint Journal
Vol. 100-B, Issue 7 | Pages 862 - 866
1 Jul 2018
Darrith B Bell JA Culvern C Della Valle CJ

Aims. Accurate placement of the acetabular component is essential in total hip arthroplasty (THA). The purpose of this study was to determine if the ability to achieve inclination of the acetabular component within the ‘safe-zone’ of 30° to 50° could be improved with the use of an inclinometer. Patients and Methods. We reviewed 167 primary THAs performed by a single surgeon over a period of 14 months. Procedures were performed at two institutions: an inpatient hospital, where an inclinometer was used (inclinometer group); and an ambulatory centre, where an inclinometer was not used as it could not be adequately sterilized (control group). We excluded 47 patients with a body mass index (BMI) of > 40 kg/m², age of > 68 years, or a surgical indication other than osteoarthritis whose treatment could not be undertaken in the ambulatory centre. There were thus 120 patients in the study, 68 in the inclinometer group and 52 in the control group. The inclination angles of the acetabular component were measured from de-identified plain radiographs by two blinded investigators who were not involved in the surgery. The effect of the use of the inclinometer on the inclination angle was determined using multivariate regression analysis. Results. The mean inclination angle for the THAs in the inclinometer group was 42.9° (95% confidence interval (CI) 41.7° to 44.0°; range 29.0° to 63.8°) and 46.5° (95% CI 45.2° to 47.7°; range 32.8° to 63.2°) in the control group (p < 0.001). Regression analysis identified a 9.1% difference in inclination due to the use of an inclinometer (p < 0.001), and THAs performed without the inclinometer were three times more likely to result in inclination angles of > 50° (odds ratio (OR) 2.8, p = 0.036). The correlation coefficient for the interobserver reliability of the measurement of the two investigators was 0.95 (95% CI 0.93 to 0.97). Conclusion. The use of a simple inclinometer resulted in a significant reduction in the number of outliers compared with a freehand technique. Cite this article: Bone Joint J 2018;100-B:862–6


The Journal of Bone & Joint Surgery British Volume
Vol. 91-B, Issue 8 | Pages 1049 - 1053
1 Aug 2009
Braunstein V Kirchhoff C Ockert B Sprecher CM Korner M Mutschler W Wiedemann E Biberthaler P

In 100 patients the fulcrum axis, which is the line connecting the anterior tip of the coracoid and the posterolateral angle of the acromion, was used to position true anteroposterior radiographs of the shoulder. This method was then compared with the conventional radiological technique in a further 100 patients. Three orthopaedic surgeons counted the number of images without overlap between the humeral head and glenoid and calculated the amount of the glenoid surface visible in each radiograph. The analysis was repeated for intraobserver reliability. The learning curves of both techniques were studied. The amount of free visible glenoid space was significantly higher using the fulcrum-axis method (64 vs 31) and the comparable glenoid size increased significantly (8.56 vs 6.47). Thus the accuracy of the anteroposterior radiographs of the shoulder is improved by using this technique. The intra- and interobserver reliability showed a high consistency. No learning curve was observed for either technique.


Bone & Joint Research
Vol. 6, Issue 9 | Pages 530 - 534
1 Sep 2017
Krakow L Klockow A Roehner E Brodt S Eijer H Bossert J Matziolis G

Objectives. The determination of the volumetric polyethylene wear on explanted material requires complicated equipment, which is not available in many research institutions. Our aim in this study was to present and validate a method that only requires a set of polyetheretherketone balls and a laboratory balance to determine wear. Methods. The insert to be measured was placed on a balance, and a ball of the appropriate diameter was inserted. The cavity remaining between the ball and insert caused by wear was filled with contrast medium and the weight of the contrast medium was recorded. The volume was calculated from the known density of the liquid. The precision and the inter- and intraobserver reliability were determined by four investigators on four days using nine inserts with specified wear (0.094 ml to 1.626 ml), and the intraclass correlation coefficient was calculated. The feasibility of using this method in routine clinical practice and the time required for measurement were tested on 84 explanted inserts by one investigator. Results. Averaged over all investigators and determinations, the deviation between the measured and specified wear was -0.08 ml (SD 0.12; -0.21 to 0.11). The interobserver reliability was 0.989 (95% confidence interval (CI) 0.964 to 0.997) and the intraobserver reliability was 0.941 for observer 1 (95% CI 0.846 to 0.985), 0.983 for observer 2 (95% CI 0.956 to 0.995), 0.939 for observer 3 (95% CI 0.855 to 0.984), and 0.934 for observer 4 (95% CI 0.790 to 0.984). The mean time required to examine the samples was two minutes (SD 2; 1 to 5). Conclusion. The method presented here was shown to be sufficiently precise for many settings and is a cost-effective and quick method of determining the volumetric wear of explanted acetabular components. However, the measurement of wear for scientific purposes will probably continue to involve more accurate and dedicated laboratory equipment. Cite this article: Bone Joint Res 2017;6:530–534
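The volume calculation described in the abstract above is a division of the recorded weight of the contrast medium by its known density. A minimal sketch follows. The function name and the numbers used are hypothetical, since the abstract does not give the density of the contrast medium or any individual weighings.

```python
def wear_volume_ml(contrast_mass_g, contrast_density_g_per_ml):
    """Volume of the wear cavity from the mass of liquid that fills it."""
    if contrast_mass_g < 0:
        raise ValueError("mass must be non-negative")
    if contrast_density_g_per_ml <= 0:
        raise ValueError("density must be positive")
    # volume = mass / density
    return contrast_mass_g / contrast_density_g_per_ml


# Hypothetical weighing: 0.54 g of a liquid with an assumed density of 1.35 g/ml.
volume = wear_volume_ml(0.54, 1.35)
print(f"{volume:.3f} ml")
```

Because the density enters as a divisor, any error in the assumed density carries over proportionally to the computed wear volume.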


The Journal of Bone & Joint Surgery British Volume
Vol. 88-B, Issue 8 | Pages 1105 - 1109
1 Aug 2006
Kandemir U Allaire RB Jolly JT Debski RE McMahon PJ

Our aim was to determine the most repeatable three-dimensional measurement of glenoid orientation and to compare it between shoulders with intact and torn rotator cuffs. Our null hypothesis was that glenoid orientation in the scapulae of shoulders with a full-thickness tear of the rotator cuff was the same as that in shoulders with an intact rotator cuff. We studied 24 shoulders in cadavers, 12 with an intact rotator cuff and 12 with a full-thickness tear. Two different observers used a three-dimensional digitising system to measure glenoid orientation in the scapular plane (ie glenoid inclination) using six different techniques. Glenoid version was also measured. The overall precision of the measurements revealed an error of less than 0.6°. Intraobserver reliability (correlation coefficients of 0.990 and 0.984 for each observer) and interobserver reliability (correlation coefficient of 0.985) were highest for measurement of glenoid inclination based on the angle obtained from a line connecting the superior and inferior points of the glenoid and that connecting the most superior point of the glenoid and the most superior point on the body of the scapula. There were no differences in glenoid inclination (p = 0.34) or glenoid version (p = 0.12) in scapulae from shoulders with an intact rotator cuff and those with a full-thickness tear. Abnormal glenoid orientation was not present in shoulders with a torn rotator cuff


The Journal of Bone & Joint Surgery British Volume
Vol. 92-B, Issue 1 | Pages 47 - 50
1 Jan 2010
Konan S Rayan F Haddad FS

The radiological evaluation of the anterolateral femoral head is an essential tool for the assessment of the cam type of femoroacetabular impingement. CT, MRI and frog lateral plain radiographs have all been suggested as imaging options for this type of lesion. The alpha angle is accepted as a reliable indicator of the cam type of impingement and may also be used as an assessment for the successful operative correction of the cam lesion. We studied the alpha angles of 32 consecutive patients with femoroacetabular impingement. The angle measured on frog lateral radiographs using templating tools was compared with that measured on CT scans in order to assess the reliability of the frog lateral view in analysing the alpha angle in cam impingement. A high interobserver reliability was noted for the assessment of the alpha angle on the frog lateral view with an intraclass correlation coefficient of 0.83. The mean alpha angle measured on the frog lateral view was 58.71° (32° to 83.3°) and that by CT was 65.11° (30° to 102°). A poor intraclass correlation coefficient (0.08) was noted between the measurements using the two systems. The frog lateral plain radiograph is not reliable for measuring the alpha angle. Various factors may be responsible for this such as the projection of the radiograph, the positioning of the patient and the quality of the image. CT may be necessary for accurate measurement of the alpha angle


The Journal of Bone & Joint Surgery British Volume
Vol. 93-B, Issue 3 | Pages 332 - 336
1 Mar 2011
Konan S Rayan F Meermans G Witt J Haddad FS

There have been considerable recent advances in the understanding and management of femoroacetabular impingement and associated labral and chondral pathology. We have developed a classification system for acetabular chondral lesions. In our system, we use the six acetabular zones previously described by Ilizaliturri et al. The cartilage is then graded on a scale of 0 to 4 as follows: grade 0, normal articular cartilage lesions; grade 1, softening or wave sign; grade 2, cleavage lesion; grade 3, delamination; and grade 4, exposed bone. The site of the lesion is further classed as A, B or C based on whether the lesion is less than one-third of the distance from the acetabular rim to the cotyloid fossa, one-third to two-thirds of the same distance and greater than two-thirds of the distance, respectively. In order to validate the classification system, six surgeons graded ten video recordings of hip arthroscopy. Our findings showed a high intra-observer reliability of the classification system with an intraclass correlation coefficient of 0.81 and a high interobserver reliability with an intraclass correlation coefficient of 0.88. We have developed a simple reproducible classification system for lesions of the acetabular cartilage, which it is hoped will allow standardised documentation to be made of damage to the articular cartilage, particularly that associated with femoroacetabular impingement


The Bone & Joint Journal
Vol. 95-B, Issue 11 | Pages 1500 - 1507
1 Nov 2013
Zaidi R Cro S Gurusamy K Sivanadarajah N Macgregor A Henricson A Goldberg A

We performed a systematic review and meta-analysis of modern total ankle replacements (TARs) to determine the survivorship, outcome, complications, radiological findings and range of movement, in patients with end-stage osteoarthritis (OA) of the ankle who undergo this procedure. We used the methodology of the Cochrane Collaboration, which uses risk of bias profiling to assess the quality of papers in favour of a domain-based approach. Continuous outcome scores were pooled across studies using the generic inverse variance method and the random-effects model was used to incorporate clinical and methodological heterogeneity. We included 58 papers (7942 TARs) with an interobserver reliability (Kappa) for selection, performance, attrition, detection and reporting bias of between 0.83 and 0.98. The overall survivorship was 89% at ten years with an annual failure rate of 1.2% (95% confidence interval (CI) 0.7 to 1.6). The mean American Orthopaedic Foot and Ankle Society score changed from 40 (95% CI 36 to 43) pre-operatively to 80 (95% CI 76 to 84) at a mean follow-up of 8.2 years (7 to 10) (p < 0.01). Radiolucencies were identified in up to 23% of TARs after a mean of 4.4 years (2.3 to 9.6). The mean total range of movement improved from 23° (95% CI 19 to 26) to 34° (95% CI 26 to 41) (p = 0.01). Our study demonstrates that TAR has a positive impact on patients’ lives, with benefits lasting ten years, as judged by improvement in pain and function, as well as improved gait and increased range of movement. However, the quality of evidence is weak and fraught with biases and high quality randomised controlled trials are required to compare TAR with other forms of treatment such as fusion. Cite this article: Bone Joint J 2013;95-B:1500–7


The Journal of Bone & Joint Surgery British Volume
Vol. 90-B, Issue 12 | Pages 1576 - 1579
1 Dec 2008
Rayan F Dodd M Haddad FS

The Vancouver classification has been shown by its developers to be a valid and reliable method for categorising the configuration of periprosthetic proximal femoral fractures and for planning their management. We have re-validated this classification system independently using the radiographs of 30 patients with periprosthetic fractures. These were reviewed by six experienced consultant orthopaedic surgeons, six trainee surgeons and six medical students in order to assess intra- and interobserver reliability and reproducibility. Each observer read the radiographs on two separate occasions. The results were subjected to weighted kappa statistical analysis. The respective kappa values for interobserver agreement were 0.72 and 0.74 for consultants, 0.68 and 0.70 for trainees on the first and second readings of the radiographs and 0.61 for medical students. The intra-observer agreement for the consultants was 0.64 and 0.67, for the trainees 0.61 and 0.64, and for the medical students 0.59 and 0.60 for the first and second readings, respectively. The validity of the classification was studied by comparing the pre-operative radiological findings within B subgroups with the operative findings. This revealed agreement for 77% of these type-B fractures, with a kappa value of 0.67. Our data confirm the reliability and reproducibility of this classification system in a European setting and for inexperienced staff. This is a reliable system which can be used by non-experts, between centres and across continents


The Journal of Bone & Joint Surgery British Volume
Vol. 83-B, Issue 4 | Pages 565 - 568
1 May 2001
Katayose M Magee DJ

We have established a reference standard for the cross-sectional area (CSA) of supraspinatus as measured by diagnostic ultrasound. The influence of hand dominance and of ageing on the CSA was also assessed. We examined 72 subjects aged from 20 to 79 years. Standard values of the CSA were determined with a high measure of interobserver reliability. Although the CSA on the dominant side was significantly larger (p < 0.001) by 0.16 cm² (95% CI 0.072 to 0.249) than that on the non-dominant side, this difference had no clinical significance. The CSA of supraspinatus decreased significantly with ageing.


The Journal of Bone & Joint Surgery British Volume
Vol. 80-B, Issue 4 | Pages 679 - 683
1 Jul 1998
Blundell CM Parker MJ Pryor GA Hopkinson-Woolley J Bhonsle SS

There are a number of classification systems for intracapsular fractures of the proximal femur, but none has been shown to be practical with satisfactory reproducibility and accurate predictive value. We have investigated the AO classification and evaluated intra- and interobserver accuracy and its value in predicting treatment and outcome. We found it to have very poor intra- and interobserver reliability and to be of limited predictive use for the outcome of treatment. A simplified system in which the subdivisions were allocated to one of three groups of undisplaced, displaced and basal fractures was found to be of value. We conclude that this is the only division which is appropriate for these fractures and that the AO system for intracapsular fractures is too complicated and should not be used.


The Journal of Bone & Joint Surgery British Volume
Vol. 84-B, Issue 1 | Pages 42 - 47
1 Jan 2002
Brismar BH Wredmark T Movin T Leandersson J Svensson O

We studied 19 videotaped knee arthroscopies in 19 patients with mild to moderate osteoarthritis (OA) of the knee in order to compare the intraobserver and interobserver reliability and the patterns of disagreement between four orthopaedic surgeons. The classifications of OA of Collins, Outerbridge and the French Society of Arthroscopy were used. Intraobserver and interobserver agreements using kappa measures were 0.42 to 0.66 and 0.43 to 0.49, respectively. Only 6% to 8% of paired intraobserver classifications differed by more than one category. Observer-specific disagreement was evident both within and between observers. A small, but significant, occasional variation was also seen. Although reliability may improve by an analysis of disagreement, it appears that the arthroscopic grading of early osteoarthritic lesions is inexact


The Journal of Bone & Joint Surgery British Volume
Vol. 90-B, Issue 5 | Pages 579 - 583
1 May 2008
Yiannakopoulos CK Chougle A Eskelinen A Hodgkinson JP Hartofilakidis G

Our study evaluated the reliability of the Crowe and Hartofilakidis classification systems for developmental dysplasia of the hip in adults. The anteroposterior radiographs of the pelvis of 145 patients with 209 osteoarthritic hips were examined twice by three experienced hip surgeons from three European countries and the abnormal hips were rated using both classifications. The inter- and intra-observer agreement was calculated. Interobserver reliability was evaluated using weighted and unweighted kappa coefficients. For the Crowe classification, the kappa coefficient with linear weighting among the three pairs of observers ranged from a minimum of 0.90 for observers A and C to a maximum of 0.92 for observers B and C. For the Hartofilakidis classification, the minimum kappa value was 0.85 for observers A and B, and the maximum value was 0.93 for observers B and C. With regard to intra-observer reliability, the kappa coefficients with linear weighting between the two evaluations of the same observer ranged between 0.86 and 0.95 for the Crowe classification and between 0.80 and 0.93 for the Hartofilakidis classification. The reliability of both systems was substantial to almost perfect both for serial measurements by individual readers and between different readers, although the information offered was dissimilar.


The Journal of Bone & Joint Surgery British Volume
Vol. 81-B, Issue 2 | Pages 266 - 272
1 Mar 1999
Biedermann R Krismer M Stöckl B Mayrhofer P Ornstein E Franzén H

Several methods of measuring the migration of the femoral component after total hip replacement have been described, but they use different reference lines, and have differing accuracies, some unproven. Statistical comparison of different studies is rarely possible. We report a study of the EBRA-FCA method (femoral component analysis using Einzel-Bild-Röntgen-Analyse) to determine its accuracy using three independent assessments, including a direct comparison with the results of roentgen stereophotogrammetric analysis (RSA). The accuracy of EBRA-FCA was better than ±1.5 mm (95% percentile) with a Cronbach’s coefficient alpha for interobserver reliability of 0.84; a very good result. The method had a specificity of 100% and a sensitivity of 78% compared with RSA for the detection of migration of over 1 mm. This is accurate enough to assess the stability of a prosthesis within a relatively limited period. The best reference line for downward migration is between the greater trochanter and the shoulder of the stem, as confirmed by two experimental analyses and a computer-assisted design
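
For readers unfamiliar with the coefficient quoted above, Cronbach's alpha for interobserver reliability can be sketched in Python as follows. This is an illustration under stated assumptions: each observer is treated as an "item" and each radiograph as a subject, sample variances are used, and the function name `cronbach_alpha` and the migration values are hypothetical rather than taken from the study.

```python
from statistics import variance


def cronbach_alpha(measurements_by_observer):
    """Cronbach's alpha with observers as items and cases as subjects.

    `measurements_by_observer` is a list with one equal-length list of
    measurements per observer (assumed layout, at least two cases each).
    """
    k = len(measurements_by_observer)
    if k < 2:
        raise ValueError("need at least two observers")
    if len({len(m) for m in measurements_by_observer}) != 1:
        raise ValueError("every observer must measure the same cases")

    # Sum of each observer's variance across cases.
    item_variance = sum(variance(m) for m in measurements_by_observer)

    # Variance of the per-case totals summed over observers.
    totals = [sum(case) for case in zip(*measurements_by_observer)]
    total_variance = variance(totals)
    if total_variance == 0:
        raise ValueError("alpha is undefined when case totals do not vary")

    return k / (k - 1) * (1.0 - item_variance / total_variance)


if __name__ == "__main__":
    # Hypothetical migration measurements (mm) of five stems by three observers.
    observers = [
        [0.4, 1.2, 2.1, 0.8, 1.6],
        [0.5, 1.0, 2.3, 0.9, 1.5],
        [0.3, 1.3, 2.0, 1.0, 1.8],
    ]
    print(round(cronbach_alpha(observers), 3))
```

Alpha approaches 1 when observers rank and scale the cases consistently, and falls as observer-specific variation grows relative to the variation between cases.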


The Bone & Joint Journal
Vol. 105-B, Issue 10 | Pages 1123 - 1130
1 Oct 2023
Donnan M Anderson N Hoq M Donnan L

Aims

The aim of this study was to investigate the agreement in interpretation of the quality of the paediatric hip ultrasound examination, the reliability of geometric and morphological assessment, and the relationship between these measurements.

Methods

Four investigators evaluated 60 hip ultrasounds and assessed their quality based on the standard plane of Graf et al. They measured geometric parameters, described the morphology of the hip, and assigned the Graf grade of dysplasia. They analyzed one self-selected image and one randomly selected image from the ultrasound series, and repeated the process four weeks later. The intra- and interobserver agreement, and correlations between various parameters were analyzed.


Bone & Joint Research
Vol. 12, Issue 3 | Pages 155 - 164
1 Mar 2023
McCarty CP Nazif MA Sangiorgio SN Ebramzadeh E Park S

Aims

Taper corrosion has been widely reported to be problematic for modular total hip arthroplasty implants. A simple and systematic method to evaluate taper damage with sufficient resolution is needed. We introduce a semiquantitative grading system for modular femoral tapers to characterize taper corrosion damage.

Methods

After examining a unique collection of retrieved cobalt-chromium (CoCr) taper sleeves (n = 465) using the widely-used Goldberg system, we developed an expanded six-point visual grading system intended to characterize the severity, visible material loss, and absence of direct component contact due to corrosion. Female taper sleeve damage was evaluated by three blinded observers using the Goldberg scoring system and the expanded system. A subset (n = 85) was then re-evaluated following destructive cleaning, using both scoring systems. Material loss for this subset was quantified using metrology and correlated with both scoring systems.


The Bone & Joint Journal
Vol. 106-B, Issue 9 | Pages 964 - 969
1 Sep 2024
Wang YC Song JJ Li TT Yang D Lv ZB Wang ZY Zhang ZM Luo Y

Aims

To propose a new method for evaluating paediatric radial neck fractures and improve the accuracy of fracture angulation measurement, particularly in younger children, and thereby facilitate planning treatment in this population.

Methods

Clinical data of 117 children with radial neck fractures in our hospital from August 2014 to March 2023 were collected. A total of 50 children (26 males, 24 females, mean age 7.6 years (2 to 13)) met the inclusion criteria and were analyzed. Cases were excluded for the following reasons: Judet grade I and Judet grade IVb (> 85° angulation) classification; poor radiograph image quality; incomplete clinical information; sagittal plane angulation; severe displacement of the ulna fracture; and Monteggia fractures. For each patient, standard elbow anteroposterior (AP) view radiographs and corresponding CT images were acquired. On radiographs, Angle P (complementary to the angle between the long axis of the radial head and the line perpendicular to the physis), Angle S (complementary to the angle between the long axis of the radial head and the midline through the proximal radial shaft), and Angle U (between the long axis of the radial head and the straight line from the distal tip of the capitellum to the coronoid process) were identified as candidates approximating the true coronal plane angulation of radial neck fractures. On the coronal plane of the CT scan, the angulation of radial neck fractures (CTa) was measured and served as the reference standard for measurement. Inter- and intraobserver reliabilities were assessed by Kappa statistics and intraclass correlation coefficient (ICC).


Bone & Joint Open
Vol. 5, Issue 6 | Pages 524 - 531
24 Jun 2024
Woldeyesus TA Gjertsen J Dalen I Meling T Behzadi M Harboe K Djuv A

Aims

To investigate if preoperative CT improves detection of unstable trochanteric hip fractures.

Methods

A single-centre prospective study was conducted. Patients aged 65 years or older with trochanteric hip fractures admitted to Stavanger University Hospital (Stavanger, Norway) were consecutively included from September 2020 to January 2022. Radiographs and CT images of the fractures were obtained, and surgeons made individual assessments of the fractures based on these. The assessment was conducted according to a systematic protocol including three classification systems (AO/Orthopaedic Trauma Association (OTA), Evans Jensen (EVJ), and Nakano) and questions addressing specific fracture patterns. An expert group provided a gold-standard assessment based on the CT images. Sensitivities and specificities of surgeons’ assessments were estimated and compared in regression models with correlations for the same patients. Intra- and inter-rater reliability were presented as Cohen’s kappa and Gwet’s agreement coefficient (AC1).
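
The two agreement statistics named in the methods above can be compared in a short Python sketch for a pair of raters. It is an illustration only: the function name `agreement_coefficients` and the example ratings are hypothetical, and the formulas are the standard two-rater forms of Cohen's kappa and Gwet's AC1 rather than the study's own analysis code.

```python
from collections import Counter


def agreement_coefficients(rater1, rater2):
    """Return (Cohen's kappa, Gwet's AC1) for two raters on nominal categories."""
    if len(rater1) != len(rater2) or not rater1:
        raise ValueError("both raters must classify the same non-empty set of cases")
    n = len(rater1)
    categories = sorted(set(rater1) | set(rater2))
    q = len(categories)
    if q < 2:
        raise ValueError("both coefficients need at least two observed categories")

    # Observed proportion of cases on which the raters agree.
    p_obs = sum(a == b for a, b in zip(rater1, rater2)) / n
    count1, count2 = Counter(rater1), Counter(rater2)

    # Cohen: chance agreement from the product of each rater's marginals.
    pe_kappa = sum((count1[c] / n) * (count2[c] / n) for c in categories)
    kappa = (p_obs - pe_kappa) / (1.0 - pe_kappa)

    # Gwet: chance agreement from the average marginal of each category.
    pi = [(count1[c] + count2[c]) / (2.0 * n) for c in categories]
    pe_ac1 = sum(p * (1.0 - p) for p in pi) / (q - 1)
    ac1 = (p_obs - pe_ac1) / (1.0 - pe_ac1)
    return kappa, ac1


if __name__ == "__main__":
    # Hypothetical stability assessments of ten fractures by two surgeons.
    surgeon_a = ["stable"] * 9 + ["unstable"]
    surgeon_b = ["stable"] * 8 + ["unstable", "stable"]
    kappa, ac1 = agreement_coefficients(surgeon_a, surgeon_b)
    print(f"kappa = {kappa:.3f}, AC1 = {ac1:.3f}")
```

Reporting both coefficients is useful when one category dominates: kappa can be low or negative despite high raw agreement, whereas AC1 is less sensitive to skewed prevalence.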


Bone & Joint Open
Vol. 3, Issue 10 | Pages 759 - 766
5 Oct 2022
Schmaranzer F Meier MK Lerch TD Hecker A Steppacher SD Novais EN Kiapour AM

Aims

To evaluate how abnormal proximal femoral anatomy affects different femoral version measurements in young patients with hip pain.

Methods

First, femoral version was measured in 50 hips of symptomatic consecutively selected patients with hip pain (mean age 20 years (SD 6), 60% (n = 25) females) on preoperative CT scans using different measurement methods: Lee et al, Reikerås et al, Tomczak et al, and Murphy et al. Neck-shaft angle (NSA) and α angle were measured on coronal and radial CT images. Second, CT scans from three patients with femoral retroversion, normal femoral version, and anteversion were used to create 3D femur models, which were manipulated to generate models with different NSAs and different cam lesions, resulting in eight models per patient. Femoral version measurements were repeated on manipulated femora.


Bone & Joint 360
Vol. 12, Issue 3 | Pages 32 - 35
1 Jun 2023

The June 2023 Trauma Roundup360 looks at: Aspirin or low-molecular-weight heparin for thromboprophylaxis?; Lateral plating or retrograde nailing for distal femur fractures?; Sciatic nerve palsy after acetabular fixation: what about patient position?; How reliable is the new OTA/AO classification for trochanteric hip fractures?; Young hip fractures: is a medial buttress the answer?; When is the best time to ‘flap’ an open fracture?; The mortality burden of nonoperatively managed hip fractures.


The Bone & Joint Journal
Vol. 105-B, Issue 1 | Pages 29 - 34
1 Jan 2023
Fransen BL Bengoa FJ Neufeld ME Sheridan GA Garbuz DS Howard LC

Aims

Several short- and mid-term studies have shown minimal liner wear of highly cross-linked polyethylene (HXLPE) in total hip arthroplasty (THA), but the safety of using thinner HXLPE liners to maximize femoral head size remains uncertain. The objective of this study was to analyze clinical survival and radiological wear rates of patients with HXLPE liners, a 36 mm femoral head, and a small acetabular component with a minimum of ten years’ follow-up.

Methods

We retrospectively identified 55 patients who underwent primary THA performed at a single centre, using HXLPE liners with 36 mm cobalt-chrome heads in acetabular components with an outer diameter of 52 mm or smaller. Patient demographic details, implant details, death, and all-cause revisions were recorded. Cox regression and Kaplan-Meier survival analysis were used to determine all-cause and liner-specific revision. Of these 55 patients, 22 had a minimum radiological follow-up of seven years and were assessed radiologically for linear and volumetric wear.


The Bone & Joint Journal
Vol. 105-B, Issue 6 | Pages 696 - 701
1 Jun 2023
Kurisunkal V Morris G Kaneuchi Y Bleibleh S James S Botchu R Jeys L Parry MC

Aims

Intra-articular (IA) tumours around the knee are treated with extra-articular (EA) resection, which is associated with poor functional outcomes. We aim to evaluate the accuracy of MRI in predicting IA involvement around the knee.

Methods

We identified 63 cases of high-grade sarcomas in or around the distal femur that underwent an EA resection from a prospectively maintained database (January 1996 to April 2020). Suspicion of IA disease was noted in 52 cases, six had IA pathological fracture, two had an effusion, two had prior surgical intervention (curettage/IA intervention), and one had an osseous metastasis in the proximal tibia. To ascertain validity, two musculoskeletal radiologists (R1, R2) reviewed the preoperative imaging (MRI) of 63 consecutive cases on two occasions six weeks apart. The radiological criteria for IA disease comprised evidence of tumour extension within the suprapatellar pouch, intercondylar notch, extension along medial/lateral retinaculum, and presence of IA fracture. The radiological predictions were then confirmed with the final histopathology of the resected specimens.