
Aims. Classifying trochlear dysplasia (TD) is useful to determine the treatment options for patients suffering from patellofemoral instability (PFI). There is no consensus on which classification system is more reliable and reproducible for the purpose of guiding clinicians’ management of PFI. There are also concerns about the validity of the Dejour Classification (DJC), which is the most widely used classification for TD, having only a fair reliability score. The Oswestry-Bristol Classification (OBC) is a recently proposed system of classification of TD, and the authors report a fair-to-good interobserver agreement and good-to-excellent intraobserver agreement in the assessment of TD. The aim of this study was to compare the reliability and reproducibility of these two classifications. Methods. In all, six assessors (four consultants and two registrars) independently evaluated 100 axial MRIs of the patellofemoral joint (PFJ) for TD and classified them according to OBC and DJC. These assessments were again repeated by all raters after four weeks. The inter- and intraobserver reliability scores were calculated using Cohen’s kappa and Cronbach’s α. Results. Both classifications showed good to excellent interobserver reliability with high α scores. The OBC classification showed a substantial intraobserver agreement (mean kappa 0.628; p < 0.005) whereas the DJC showed a moderate agreement (mean kappa 0.572; p < 0.005). There was no significant difference in the kappa values when comparing the assessments by consultants with those by registrars, in either classification system. Conclusion. This large study from a non-founding institute shows both classification systems to be reliable for classifying TD based on axial MRIs of the PFJ, with the simple-to-use OBC having a higher intraobserver reliability score than that of the DJC. Cite this article: Bone Jt Open 2023;4(7):532–538


The Bone & Joint Journal
Vol. 102-B, Issue 4 | Pages 478 - 484
1 Apr 2020
Daniels AM Wyers CE Janzing HMJ Sassen S Loeffen D Kaarsemaker S van Rietbergen B Hannemann PFW Poeze M van den Bergh JP

Aims. Besides conventional radiographs, the use of MRI, CT, and bone scintigraphy is frequent in the diagnosis of a fracture of the scaphoid. However, which technique gives the best results remains unknown. The investigation of a new imaging technique initially requires an analysis of its precision. The primary aim of this study was to investigate the interobserver agreement of high-resolution peripheral quantitative CT (HR-pQCT) in the diagnosis of a scaphoid fracture. A secondary aim was to investigate the interobserver agreement for the presence of other fractures and for the classification of scaphoid fracture. Methods. Two radiologists and two orthopaedic trauma surgeons evaluated HR-pQCT scans of 31 patients with a clinically-suspected scaphoid fracture. The observers were asked to determine the presence of a scaphoid or other fracture and to classify the scaphoid fracture based on the Herbert classification system. Fleiss kappa statistics were used to calculate the interobserver agreement for the diagnosis of a fracture. Intraclass correlation coefficients (ICCs) were used to assess the agreement for the classification of scaphoid fracture. Results. A total of nine (29%) scaphoid fractures and 12 (39%) other fractures were diagnosed in 20 patients (65%) using HR-pQCT across the four observers. The interobserver agreement was 91% for the identification of a scaphoid fracture (95% confidence interval (CI) 0.76 to 1.00) and 80% for other fractures (95% CI 0.72 to 0.87). The mean ICC for the classification of a scaphoid fracture in the seven patients diagnosed with scaphoid fracture by all four observers was 73% (95% CI 0.42 to 0.94). Conclusion. We conclude that the diagnosis of scaphoid and other fractures is reliable when using HR-pQCT in patients with a clinically-suspected fracture. Cite this article: Bone Joint J 2020;102-B(4):478–484


The Journal of Bone & Joint Surgery British Volume
Vol. 88-B, Issue 4 | Pages 484 - 488
1 Apr 2006
Rogers BA Thornton-Bott P Cannon SR Briggs TWR

We assessed the reproducibility and accuracy of four ratios used to measure patellar height, namely the Blackburne-Peel, Caton-Deschamps, Insall-Salvati and modified Insall-Salvati, before and after total knee arthroplasty. The patellar height was measured, by means of the four ratios, on the pre- and post-operative lateral radiographs of 44 patients (45 knees) who had undergone total knee arthroplasty. Two independent observers measured the films sequentially, in identical conditions, totalling 720 measurements per observer. Statistical analysis, comparing both observers and ratios, was carried out using the intraclass correlation coefficient. Before operation there was greater interobserver variation using either the Insall-Salvati or modified Insall-Salvati ratios than when using the Caton-Deschamps or Blackburne-Peel methods. This was because of difficulty in identifying the insertion of the patellar tendon. Before operation, there was a minimal difference in reliability between these methods. After operation the interobserver difference was greatly reduced using both the Caton-Deschamps and Blackburne-Peel methods, which use the prosthetic joint line, compared with the Insall-Salvati and modified Insall-Salvati, which reference from the insertion of the patellar tendon. The theoretical advantage of using the Insall-Salvati and modified Insall-Salvati ratios in measuring true patellar height after total knee arthroplasty needs to be balanced against their significant interobserver variability and inferior reliability when compared with other ratios


The Bone & Joint Journal
Vol. 100-B, Issue 2 | Pages 242 - 246
1 Feb 2018
Ghoshal A Enninghorst N Sisak K Balogh ZJ

Aims. To evaluate interobserver reliability of the Orthopaedic Trauma Association’s open fracture classification system (OTA-OFC). Patients and Methods. Patients of any age with a first presentation of an open long bone fracture were included. Standard radiographs, wound photographs, and a short clinical description were given to eight orthopaedic surgeons, who independently evaluated the injury using both the Gustilo and Anderson (GA) and OTA-OFC classifications. The responses were compared for variability using Cohen’s kappa. Results. The overall interobserver agreement was ĸ = 0.44 for the GA classification and ĸ = 0.49 for OTA-OFC, which reflects moderate agreement (0.41 to 0.60) for both classifications. The agreement in the five categories of OTA-OFC was: for skin, ĸ = 0.55 (moderate); for muscle, ĸ = 0.44 (moderate); for arterial injury, ĸ = 0.74 (substantial); for contamination, ĸ = 0.35 (fair); and for bone loss, ĸ = 0.41 (moderate). Conclusion. Although the OTA-OFC, with similar interobserver agreement to GA, offers a more detailed description of open fractures, further development may be needed to make it a reliable and robust tool. Cite this article: Bone Joint J 2018;100-B:242–6
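The agreement labels quoted in this abstract (for example, 0.41 to 0.60 as "moderate") match the bands commonly attributed to Landis and Koch. As a minimal sketch, assuming those bands are the ones the authors applied, the following Python function maps a kappa value to its label and re-derives the labels reported for the five OTA-OFC categories. The function name `landis_koch` is illustrative and does not come from the study.

```python
def landis_koch(kappa: float) -> str:
    """Map a kappa value to a Landis and Koch agreement label.

    Assumed bands: <0 poor, 0.00-0.20 slight, 0.21-0.40 fair,
    0.41-0.60 moderate, 0.61-0.80 substantial, 0.81-1.00 almost perfect.
    """
    if kappa < 0:
        return "poor"
    # Upper bound of each band, in ascending order.
    bands = [
        (0.20, "slight"),
        (0.40, "fair"),
        (0.60, "moderate"),
        (0.80, "substantial"),
        (1.00, "almost perfect"),
    ]
    for upper, label in bands:
        if kappa <= upper:
            return label
    raise ValueError("kappa cannot exceed 1")


# Kappa values as reported in the abstract for the OTA-OFC categories.
ota_ofc = {
    "skin": 0.55,
    "muscle": 0.44,
    "arterial injury": 0.74,
    "contamination": 0.35,
    "bone loss": 0.41,
}
labels = {category: landis_koch(k) for category, k in ota_ofc.items()}
for category, label in labels.items():
    print(f"{category}: kappa = {ota_ofc[category]:.2f} ({label})")
```

The labels produced this way agree with those given in the abstract, which suggests, but does not confirm, that the same interpretive scale was used.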


The Journal of Bone & Joint Surgery British Volume
Vol. 80-B, Issue 4 | Pages 670 - 672
1 Jul 1998
Flinkkilä T Nikkola-Sihto A Kaarela O Päakkö E Raatikainen T

Interobserver reliability of the AO system of classification of fractures of the distal radius was assessed using plain radiographs and CT. Five observers classified 30 Colles’-type fractures using only plain radiographs; two months later they were reclassified using CT in addition. Interobserver reliability was poor in both series when detailed classification was used. By reducing the categories to five, interobserver reliability was slightly improved, but was still poor. When only two AO types were used, the reliability was moderate using plain radiographs and good to excellent with the addition of CT. The use of CT as well as plain radiographs brings interobserver reliability to a good level in assessment of the presence or absence of articular involvement, but is otherwise of minor value in improving the interobserver reliability of the AO system of classification of fractures of the distal radius


The Journal of Bone & Joint Surgery British Volume
Vol. 72-B, Issue 2 | Pages 202 - 204
1 Mar 1990
Simmons E Graham H Szalai J

Fifteen independent observers of three levels of experience (consultant staff, fellows, residents) assessed 40 radiographs of children presenting with Perthes' disease using the Catterall and the Salter-Thompson grading systems. Each observer was supplied with descriptions and illustrations of the classifications and each hip was grouped by both systems by each observer. The results were statistically analysed using 'kappa' statistics. The level of interobserver agreement was higher for the Salter-Thompson system and correlated with the level of experience of the observer. Both systems can give acceptable levels of interobserver agreement, but the Salter-Thompson grouping is simpler and easier to apply in the earlier stages of the disease when treatment must be decided, and has a higher degree of reproducibility amongst more experienced observers


The Journal of Bone & Joint Surgery British Volume
Vol. 84-B, Issue 7 | Pages 950 - 954
1 Sep 2002
Brorson S Bagger J Sylvest A Hróbjartsson A

We investigated whether training doctors to classify proximal fractures of the humerus according to the Neer system could improve interobserver agreement. Fourteen doctors were randomised to two training sessions, or to no training, and asked to categorise 42 unselected pairs of plain radiographs of fractures of the proximal humerus according to the Neer system. The mean kappa difference between the training and control groups was 0.30 (95% CI 0.10 to 0.50, p = 0.006). In the training group the mean kappa value for interobserver variation improved from 0.27 (95% CI 0.24 to 0.31) to 0.62 (95% CI 0.57 to 0.67). The improvement was particularly notable for specialists in whom kappa increased from 0.30 (95% CI 0.23 to 0.37) to 0.79 (95% CI 0.70 to 0.88). These results suggest that formal training in the Neer system is a prerequisite for its use in clinical practice and research


The Journal of Bone & Joint Surgery British Volume
Vol. 84-B, Issue 1 | Pages 48 - 49
1 Jan 2002
Javed A Siddique M Vaghela M Hui ACW

We carried out a prospective study in order to establish to what extent the intra-articular evaluation undertaken during arthroscopy of the knee differed between surgeons. Two senior specialist registrars and a consultant orthopaedic surgeon with a special interest in knee surgery were involved. A total of 78 knee arthroscopies (78 patients) was studied. Arthroscopy was first carried out by the trainee and then by the senior author (ACWH). The intra-articular evaluation during the arthroscopy was recorded independently by a third person in the operating theatre. Data were collected to record variations in examination under anaesthesia, the morphology and pathology of the menisci and anterior cruciate ligament and the state of the articular surfaces. The overall interobserver variation was 20% in all categories. We question the published results of intra-articular evaluation during knee arthroscopy when surgeons of different levels of experience are involved in a single study


The Journal of Bone & Joint Surgery British Volume
Vol. 82-B, Issue 5 | Pages 636 - 642
1 Jul 2000
Wainwright AM Williams JR Carr AJ

We assessed the inter- and intraobserver variation in classification systems for fractures of the distal humerus. Three orthopaedic trauma consultants, three trauma registrars and three consultant musculoskeletal radiologists independently classified 33 sets of radiographs of such fractures on two occasions, each using three separate systems. For interobserver variation, the Riseborough and Radin system produced ‘moderate’ agreement (kappa = 0.513), but half of the fractures were not classifiable by this system. For the complete AO system, agreement was ‘fair’ (kappa = 0.343), but if only AO type and group or AO type alone was used, agreement improved to ‘moderate’ and ‘substantial’, respectively (kappa = 0.52 and 0.66). Agreement for the system of Jupiter and Mehne was ‘fair’ (kappa = 0.295). Similar levels of intraobserver variation were found. Systems of classification are useful in decision-making and evaluation of outcome only if there is agreement and consistency among observers. Our study casts doubt on these aspects of the systems currently available for fractures of the distal humerus


Bone & Joint Research
Vol. 9, Issue 5 | Pages 242 - 249
1 May 2020
Bali K Smit K Ibrahim M Poitras S Wilkin G Galmiche R Belzile E Beaulé PE

Aims

The aim of the current study was to assess the reliability of the Ottawa classification for symptomatic acetabular dysplasia.

Methods

In all, 134 consecutive hips that underwent periacetabular osteotomy were categorized using a validated software (Hip2Norm) into four categories of normal, lateral/global, anterior, or posterior. A total of 74 cases were selected for reliability analysis, and these included 44 dysplastic and 30 normal hips. A group of six blinded fellowship-trained raters, provided with the classification system, looked at these radiographs at two separate timepoints to classify the hips using standard radiological measurements. Thereafter, a consensus meeting was held where a modified flow diagram was devised, before a third reading by four raters using a separate set of 74 radiographs took place.


The Journal of Bone & Joint Surgery British Volume
Vol. 84-B, Issue 1 | Pages 15 - 18
1 Jan 2002
Whelan DB Bhandari M McKee MD Guyatt GH Kreder HJ Stephen D Schemitsch EH

The reliability of the radiological assessment of the healing of tibial fractures remains undetermined. We examined the inter- and intraobserver agreement of the healing of such fractures among four orthopaedic trauma surgeons who, on two separate occasions eight weeks apart, independently assessed the radiographs of 30 patients with fractures of the tibial shaft which had been treated by intramedullary fixation. The radiographs were selected from a database to represent fractures at various stages of healing. For each radiograph, the surgeon scored the degree of union, quantified the number of cortices bridged by callus or with a visible fracture line, described the extent and quality of the callus, and provided an overall rating of healing. The interobserver chance-corrected agreement using a quadratically weighted kappa (κ) statistic in which values of 0.61 to 0.80 represented substantial agreement were as follows: radiological union scale (κ = 0.60); number of cortices bridged by callus (κ = 0.75); number of cortices with a visible fracture line (κ = 0.70); the extent of the callus (κ = 0.57); and general impression of fracture healing (κ = 0.67). The intraobserver agreement of the overall impression of healing (κ = 0.89) and the number of cortices bridged by callus (κ = 0.82) or with a visible fracture line (κ = 0.83) was almost perfect. There are no validated scales which allow surgeons to grade fracture healing radiologically. Among those examined, the number of cortices bridged by bone appears to be a reliable, and easily measured radiological variable to assess the healing of fractures after intramedullary fixation


Bone & Joint Research
Vol. 13, Issue 1 | Pages 19 - 27
5 Jan 2024
Baertl S Rupp M Kerschbaum M Morgenstern M Baumann F Pfeifer C Worlicek M Popp D Amanatullah DF Alt V

Aims. This study aimed to evaluate the clinical application of the PJI-TNM classification for periprosthetic joint infection (PJI) by determining intraobserver and interobserver reliability. To facilitate its use in clinical practice, an educational app was subsequently developed and evaluated. Methods. A total of ten orthopaedic surgeons classified 20 cases of PJI based on the PJI-TNM classification. Subsequently, the classification was re-evaluated using the PJI-TNM app. Classification accuracy was calculated separately for each subcategory (reinfection, tissue and implant condition, non-human cells, and morbidity of the patient). Fleiss’ kappa and Cohen’s kappa were calculated for interobserver and intraobserver reliability, respectively. Results. Overall, interobserver and intraobserver agreements were substantial across the 20 classified cases. Analyses for the variable ‘reinfection’ revealed an almost perfect interobserver and intraobserver agreement with a classification accuracy of 94.8%. The category 'tissue and implant conditions' showed moderate interobserver and substantial intraobserver reliability, while the classification accuracy was 70.8%. For 'non-human cells,' accuracy was 81.0% and interobserver agreement was moderate with an almost perfect intraobserver reliability. The classification accuracy of the variable 'morbidity of the patient' reached 73.5% with a moderate interobserver agreement, whereas the intraobserver agreement was substantial. The application of the app yielded comparable results across all subgroups. Conclusion. The PJI-TNM classification system captures the heterogeneity of PJI and can be applied with substantial inter- and intraobserver reliability. The PJI-TNM educational app aims to facilitate application in clinical practice. A major limitation was the correct assessment of the implant situation. To eliminate this, a re-evaluation according to intraoperative findings is strongly recommended. 
Cite this article: Bone Joint Res 2024;13(1):19–27


The Bone & Joint Journal
Vol. 105-B, Issue 10 | Pages 1123 - 1130
1 Oct 2023
Donnan M Anderson N Hoq M Donnan L

Aims. The aim of this study was to investigate the agreement in interpretation of the quality of the paediatric hip ultrasound examination, the reliability of geometric and morphological assessment, and the relationship between these measurements. Methods. Four investigators evaluated 60 hip ultrasounds and assessed their quality based on the standard plane of Graf et al. They measured geometric parameters, described the morphology of the hip, and assigned the Graf grade of dysplasia. They analyzed one self-selected image and one randomly selected image from the ultrasound series, and repeated the process four weeks later. The intra- and interobserver agreement, and correlations between various parameters were analyzed. Results. In the assessment of quality, there was a moderate to substantial intraobserver agreement for each element investigated, but interobserver agreement was poor. Morphological features showed weak to moderate agreement across all parameters but improved to significant when responses were reduced. The geometric measurements showed nearly perfect agreement, and the relationship between them and the morphological features showed a dose response across all parameters with moderate to substantial correlations. There were strong correlations between geometric measurements. The Graf classification showed a fair to moderate interobserver agreement, and moderate to substantial intraobserver agreement. Conclusion. This investigation into the reliability of the interpretation of hip ultrasound scans identified the difficulties in defining what is a high-quality ultrasound. We confirmed that geometric measurements are reliably interpreted and may be useful as a further measurement of quality. Morphological features are generally poorly interpreted, but a simpler binary classification considerably improves agreement. As there is a clear dose-response relationship between geometric and morphological measurements, the importance of morphology in the diagnosis of hip dysplasia should be questioned. Cite this article: Bone Joint J 2023;105-B(10):1123–1130


The Bone & Joint Journal
Vol. 106-B, Issue 3 | Pages 227 - 231
1 Mar 2024
Todd NV Casey A Birch NC

The diagnostic sub-categorization of cauda equina syndrome (CES) is used to aid communication between doctors and other healthcare professionals. It is also used to determine the need for, and urgency of, MRI and surgery in these patients. A recent paper by Hoeritzauer et al (2023) in this journal examined the interobserver reliability of the widely accepted subcategories in 100 patients with cauda equina syndrome. They found that there is no useful interobserver agreement for the subcategories, even for experienced spinal surgeons. This observation is supported by the largest prospective study of the treatment of cauda equina syndrome in the UK by Woodfield et al (2023). If the accepted subcategories are unreliable, they cannot be used in the way that they are currently, and they should be revised or abandoned. This paper presents a reassessment of the diagnostic and prognostic subcategories of cauda equina syndrome in the light of this evidence, with a suggested cure based on a more inclusive synthesis of symptoms, signs, bladder ultrasound scan results, and pre-intervention urinary catheterization. Cite this article: Bone Joint J 2024;106-B(3):227–231


Bone & Joint Open
Vol. 5, Issue 11 | Pages 962 - 970
4 Nov 2024
Suter C Mattila H Ibounig T Sumrein BO Launonen A Järvinen TLN Lähdeoja T Rämö L

Aims. Though most humeral shaft fractures heal nonoperatively, up to one-third may lead to nonunion with inferior outcomes. The Radiographic Union Score for HUmeral Fractures (RUSHU) was created to identify high-risk patients for nonunion. Our study evaluated the RUSHU’s prognostic performance at six and 12 weeks in discriminating nonunion within a significantly larger cohort than before. Methods. Our study included 226 nonoperatively treated humeral shaft fractures. We evaluated the interobserver reliability and intraobserver reproducibility of RUSHU scoring using intraclass correlation coefficients (ICCs). Additionally, we determined the optimal cut-off thresholds for predicting nonunion using the receiver operating characteristic (ROC) method. Results. The RUSHU demonstrated good interobserver reliability with an ICC of 0.78 (95% CI 0.72 to 0.83) at six weeks and 0.77 (95% CI 0.71 to 0.82) at 12 weeks. Intraobserver reproducibility was good or excellent for all analyses. Area under the curve in the ROC analysis was 0.83 (95% CI 0.77 to 0.88) at six weeks and 0.89 (95% CI 0.84 to 0.93) at 12 weeks, indicating excellent discrimination. The optimal cut-off values for predicting nonunion were ≤ eight points at six weeks and ≤ nine points at 12 weeks, providing the best specificity-sensitivity trade-off. Conclusion. The RUSHU proves to be a reliable and reproducible radiological scoring system that aids in identifying patients at risk of nonunion at both six and 12 weeks post-injury during non-surgical treatment of humeral shaft fractures. The statistically optimal cut-off values for predicting nonunion are ≤ eight at six weeks and ≤ nine points at 12 weeks post-injury


The Bone & Joint Journal
Vol. 106-B, Issue 9 | Pages 898 - 906
1 Sep 2024
Kayani B Wazir MUK Mancino F Plastow R Haddad FS

Aims. The primary objective of this study was to develop a validated classification system for assessing iatrogenic bone trauma and soft-tissue injury during total hip arthroplasty (THA). The secondary objective was to compare macroscopic bone trauma and soft-tissue injury in conventional THA (CO THA) versus robotic arm-assisted THA (RO THA) using this classification system. Methods. This study included 30 CO THAs versus 30 RO THAs performed by a single surgeon. Intraoperative photographs of the osseous acetabulum and periacetabular soft tissues were obtained prior to implantation of the acetabular component, which were used to develop the proposed classification system. Interobserver and intraobserver variabilities of the proposed classification system were assessed. Results. The BOne trauma and Soft-Tissue Injury classification system in total Hip arthroplasty (BOSTI Hip) grades osseous acetabular trauma and periarticular muscle damage during THA. The classification system has an interclass correlation coefficient of 0.90 (95% CI 0.86 to 0.93) for interobserver agreement and 0.89 (95% CI 0.84 to 0.93) for intraobserver agreement. RO THA was associated with improved BOSTI Hip scores (p = 0.002) and more pristine osseous surfaces in the anterior superior (p = 0.001) and posterior superior (p < 0.001) acetabular quadrants compared with CO THA. There were no differences between the groups in relation to injury to the gluteus medius (p = 0.084), obturator internus (p = 0.241), piriformis (p = 0.081), superior gemellus (p = 0.116), inferior gemellus (p = 0.132), quadratus femoris (p = 0.208), and vastus lateralis (p = 0.135), but overall combined muscle injury was reduced in RO THA compared with CO THA (p = 0.023). Discussion. The proposed BOSTI Hip classification provides a reproducible grading system for stratifying iatrogenic bone trauma and soft-tissue injury during THA. RO THA was associated with improved BOSTI Hip scores, more pristine osseous acetabular surfaces, and reduced combined periarticular muscle injury compared with CO THA. Further research is required to understand if these intraoperative findings translate to differences in clinical outcomes between the treatment groups. Cite this article: Bone Joint J 2024;106-B(9):898–906


The Bone & Joint Journal
Vol. 106-B, Issue 5 | Pages 468 - 474
1 May 2024
d'Amato M Flevas DA Salari P Bornes TD Brenneis M Boettner F Sculco PK Baldini A

Aims. Obtaining solid implant fixation is crucial in revision total knee arthroplasty (rTKA) to avoid aseptic loosening, a major reason for re-revision. This study aims to validate a novel grading system that quantifies implant fixation across three anatomical zones (epiphysis, metaphysis, diaphysis). Methods. Based on pre-, intra-, and postoperative assessments, the novel grading system allocates a quantitative score (0, 0.5, or 1 point) for the quality of fixation achieved in each anatomical zone. The criteria used by the algorithm to assign the score include the bone quality, the size of the bone defect, and the type of fixation used. A consecutive cohort of 245 patients undergoing rTKA from 2012 to 2018 were evaluated using the current novel scoring system and followed prospectively. In addition, 100 first-time revision cases were assessed radiologically from the original cohort and graded by three observers to evaluate the intra- and inter-rater reliability of the novel radiological grading system. Results. At a mean follow-up of 90 months (64 to 130), only two out of 245 cases failed due to aseptic loosening. Intraoperative grading yielded mean scores of 1.87 (95% confidence interval (CI) 1.82 to 1.92) for the femur and 1.96 (95% CI 1.92 to 2.0) for the tibia. Only 3.7% of femoral and 1.7% of tibial reconstructions fell below the 1.5-point threshold, which included the two cases of aseptic loosening. Interobserver reliability for postoperative radiological grading was 0.97 for the femur and 0.85 for the tibia. Conclusion. A minimum score of 1.5 points for each skeletal segment appears to be a reasonable cut-off to define sufficient fixation in rTKA. There were no revisions for aseptic loosening at mid-term follow-up when this fixation threshold was achieved or exceeded. When assessing first-time revisions, this novel grading system has shown excellent intra- and interobserver reliability. Cite this article: Bone Joint J 2024;106-B(5):468–474


The Bone & Joint Journal
Vol. 103-B, Issue 8 | Pages 1345 - 1350
1 Aug 2021
Czubak-Wrzosek M Nitek Z Sztwiertnia P Czubak J Grzelecki D Kowalczewski J Tyrakowski M

Aims. The aim of the study was to compare two methods of calculating pelvic incidence (PI) and pelvic tilt (PT), either by using the femoral heads or acetabular domes to determine the bicoxofemoral axis, in patients with unilateral or bilateral primary hip osteoarthritis (OA). Methods. PI and PT were measured on standing lateral radiographs of the spine in two groups: 50 patients with unilateral (Group I) and 50 patients with bilateral hip OA (Group II), using the femoral heads or acetabular domes to define the bicoxofemoral axis. Agreement between the methods was determined by intraclass correlation coefficient (ICC) and the standard error of measurement (SEm). The intraobserver reproducibility and interobserver reliability of the two methods were analyzed on 31 radiographs in both groups to calculate ICC and SEm. Results. In both groups, excellent agreement between the two methods was obtained, with ICC of 0.99 and SEm 0.3° for Group I, and ICC 0.99 and SEm 0.4° for Group II. The intraobserver reproducibility was excellent for both methods in both groups, with an ICC of at least 0.97 and SEm not exceeding 0.8°. The study also revealed excellent interobserver reliability for both methods in both groups, with ICC 0.99 and SEm 0.5° or less. Conclusion. Either the femoral heads or acetabular domes can be used to define the bicoxofemoral axis on the lateral standing radiographs of the spine for measuring PI and PT in patients with idiopathic unilateral or bilateral hip OA. Cite this article: Bone Joint J 2021;103-B(8):1345–1350


Bone & Joint Open
Vol. 2, Issue 10 | Pages 858 - 864
18 Oct 2021
Guntin J Plummer D Della Valle C DeBenedetti A Nam D

Aims. Prior studies have identified that malseating of a modular dual mobility liner can occur, with previous reported incidences between 5.8% and 16.4%. The aim of this study was to determine the incidence of malseating in dual mobility implants at our institution, assess for risk factors for liner malseating, and investigate whether liner malseating has any impact on clinical outcomes after surgery. Methods. We retrospectively reviewed the radiographs of 239 primary and revision total hip arthroplasties with a modular dual mobility liner. Two independent reviewers assessed radiographs for each patient twice for evidence of malseating, with a third observer acting as a tiebreaker. Univariate analysis was conducted to determine risk factors for malseating with Youden’s index used to identify cut-off points. Cohen’s kappa test was used to measure interobserver and intraobserver reliability. Results. In all, 12 liners (5.0%), including eight Stryker (6.8%) and four Zimmer Biomet (3.3%), had radiological evidence of malseating. Interobserver reliability was found to be 0.453 (95% confidence interval (CI) 0.26 to 0.64), suggesting weak inter-rater agreement, with strong agreement being greater than 0.8. We found component size of 50 mm or less to be associated with liner malseating on univariate analysis (p = 0.031). Patients with malseated liners appeared to have no associated clinical consequences, and none required revision surgery at a mean of 14 months (1.4 to 99.2) postoperatively. Conclusion. The incidence of liner malseating was 5.0%, which is similar to other reports. Component size of 50 mm or smaller was identified as a risk factor for malseating. Surgeons should be aware that malseating can occur and implant design changes or changes in instrumentation should be considered to lower the risk of malseating. Although further follow-up is needed, it remains to be seen if malseating is associated with any clinical consequences. 
Cite this article: Bone Jt Open 2021;2(10):858–864
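A low kappa can coexist with high raw agreement when the finding being rated is uncommon, as malseating was in this series. The sketch below computes Cohen's kappa for two raters from first principles (observed agreement corrected for the agreement expected by chance). The ten ratings are invented purely for illustration and are not data from the study, and the function name `cohens_kappa` is likewise hypothetical.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters who each labelled the same items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty set of items")
    n = len(rater_a)
    # Proportion of items on which the two raters gave the same label.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance, from each rater's marginal label frequencies.
    count_a = Counter(rater_a)
    count_b = Counter(rater_b)
    expected = sum(count_a[label] * count_b[label] for label in count_a) / (n * n)
    if expected == 1:
        raise ValueError("kappa is undefined when both raters use one identical label")
    return (observed - expected) / (1 - expected)


# Invented ratings of ten radiographs (NOT study data): each rater calls two
# liners malseated, but they agree on only one of those calls.
rater_1 = ["malseated", "malseated"] + ["seated"] * 8
rater_2 = ["malseated", "seated", "malseated"] + ["seated"] * 7

kappa = cohens_kappa(rater_1, rater_2)
print(f"raw agreement = 0.80, kappa = {kappa:.3f}")
```

In this invented example the raters agree on eight of ten radiographs, yet chance alone would produce 68% agreement, so kappa is much lower than the raw figure suggests.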


The Bone & Joint Journal
Vol. 103-B, Issue 8 | Pages 1339 - 1344
1 Aug 2021
Jain S Mohrir G Townsend O Lamb JN Palan J Aderinto J Pandit H

Aims. This aim of this study was to assess the reliability and validity of the Unified Classification System (UCS) for postoperative periprosthetic femoral fractures (PFFs) around cemented polished taper-slip (PTS) stems. Methods. Radiographs of 71 patients with a PFF admitted consecutively at two centres between 25 February 2012 and 19 May 2020 were collated by an independent investigator. Six observers (three hip consultants and three trainees) were familiarized with the UCS. Each PFF was classified on two separate occasions, with a mean time between assessments of 22.7 days (16 to 29). Interobserver reliability for more than two observers was assessed using percentage agreement and Fleiss’ kappa statistic. Intraobserver reliability between two observers was calculated with Cohen kappa statistic. Validity was tested on surgically managed UCS type B PFFs where stem stability was documented in operation notes (n = 50). Validity was assessed using percentage agreement and Cohen kappa statistic between radiological assessment and intraoperative findings. Kappa statistics were interpreted using Landis and Koch criteria. All six observers were blinded to operation notes and postoperative radiographs. Results. Interobserver reliability percentage agreement was 58.5% and the overall kappa value was 0.442 (moderate agreement). Lowest kappa values were seen for type B fractures (0.095 to 0.360). The mean intraobserver reliability kappa value was 0.672 (0.447 to 0.867), indicating substantial agreement. Validity percentage agreement was 65.7% and the mean kappa value was 0.300 (0.160 to 0.4400) indicating only fair agreement. Conclusion. This study demonstrates that the UCS is unsatisfactory for the classification of PFFs around PTS stems, and that it has considerably lower reliability and validity than previously described for other stem types. 
Radiological PTS stem loosening in the presence of PFF is poorly defined and formal intraoperative testing of stem stability is recommended. Cite this article: Bone Joint J 2021;103-B(8):1339–1344
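
As an illustration of the statistics named in the abstract above, the following is a minimal Python sketch of Cohen's kappa for two sets of ratings, together with the Landis and Koch bands used to label a kappa value. The function names and the toy ratings are hypothetical and are not taken from the study; only the three kappa values passed to the interpreter at the end (0.442, 0.672, 0.300) come from the abstract.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two equal-length lists of categorical ratings."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(ratings_a)
    # Observed agreement: share of items given the same category twice.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Chance agreement: product of the two marginal proportions per category.
    freq_a, freq_b = Counter(ratings_a), Counter(ratings_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    # Note: undefined (division by zero) if both raters use a single category.
    return (observed - expected) / (1 - expected)


def landis_koch(kappa):
    """Label a kappa value using the Landis and Koch bands."""
    if kappa < 0:
        return "poor"
    bands = [(0.20, "slight"), (0.40, "fair"),
             (0.60, "moderate"), (0.80, "substantial")]
    for upper, label in bands:
        if kappa <= upper:
            return label
    return "almost perfect"


# Hypothetical ratings of four fractures on two occasions (toy data).
first_read = ["A", "B", "B", "C"]
second_read = ["A", "B", "C", "C"]
toy_kappa = cohen_kappa(first_read, second_read)

# Kappa values reported in the abstract, mapped to their bands.
reported = {k: landis_koch(k) for k in (0.442, 0.672, 0.300)}
```

With these bands, the reported values map to moderate, substantial, and fair agreement respectively, which matches the wording of the abstract.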


The Bone & Joint Journal
Vol. 103-B, Issue 8 | Pages 1380 - 1385
2 Aug 2021
Kim Y Ryu J Kim JK Al-Dhafer BAA Shin YH

Aims. The aim of this study was to assess arthritis of the basal joint of the thumb quantitatively using bone single-photon emission CT/CT (SPECT/CT) and evaluate its relationship with patients’ pain and function. Methods. We retrospectively reviewed 30 patients (53 hands) with symptomatic basal joint arthritis of the thumb between April 2019 and March 2020. Visual analogue scale (VAS) scores for pain, grip strength, and pinch power of both hands and Patient-Rated Wrist/Hand Evaluation (PRWHE) scores were recorded for all patients. Basal joint arthritis was classified according to the modified Eaton-Glickel stage using routine radiographs and the CT scans of SPECT/CT, respectively. The maximum standardized uptake value (SUVmax) from SPECT/CT was measured in the four peritrapezial joints and the highest uptake was used for analysis. Results. According to Eaton-Glickel classification, 11, 17, 17, and eight hands were stage 0 to I, II, III, and IV, respectively. The interobserver reliability for determining the stage of arthritis was moderate for radiographs (k = 0.41) and substantial for CT scans (k = 0.67). In a binary categorical analysis using SUVmax, pain (p < 0.001) and PRWHE scores (p = 0.004) were significantly higher in hands with higher SUVmax. Using multivariate linear regression to estimate the pain VAS, only SUVmax (B 0.172 (95% confidence interval (CI) 0.065 to 0.279); p = 0.002) showed a significant association. Estimating the variation of PRWHE scores using the same model, only SUVmax (B 1.378 (95% CI 0.082 to 2.674); p = 0.038) showed a significant association. Conclusion. The CT scans of SPECT/CT provided better interobserver reliability than routine radiographs for evaluating the severity of arthritis. A higher SUVmax in SPECT/CT was associated with more pain and functional disabilities of basal joint arthritis of the thumb. This approach could be used to complement radiographs for the evaluation of patients with this condition.
Cite this article: Bone Joint J 2021;103-B(8):1380–1385


The Bone & Joint Journal
Vol. 105-B, Issue 1 | Pages 21 - 28
1 Jan 2023
Ndlovu S Naqshband M Masunda S Ndlovu K Chettiar K Anugraha A

Aims. Clinical management of open fractures is challenging and frequently requires complex reconstruction procedures. The Gustilo-Anderson classification lacks uniform interpretation, has poor interobserver reliability, and fails to account for injuries to musculotendinous units and bone. The Ganga Hospital Open Injury Severity Score (GHOISS) was designed to address these concerns. The major aim of this review was to ascertain the evidence available on accuracy of the GHOISS in predicting successful limb salvage in patients with mangled limbs. Methods. We searched electronic databases including PubMed, CENTRAL, EMBASE, CINAHL, Scopus, and Web of Science to identify studies that employed the GHOISS risk tool in managing complex limb injuries published from April 2006, when the score was introduced, until April 2021. Primary outcome was the measured sensitivity and specificity of the GHOISS risk tool for predicting amputation at a specified threshold score. Secondary outcomes included length of stay, need for plastic surgery, deep infection rate, time to fracture union, and functional outcome measures. Diagnostic test accuracy meta-analysis was performed using a random effects bivariate binomial model. Results. We identified 1,304 records, of which six prospective cohort studies and two retrospective cohort studies evaluating a total of 788 patients were deemed eligible for inclusion. A diagnostic test meta-analysis conducted on five cohort studies, with 474 participants, showed that GHOISS at a threshold score of 14 has a pooled sensitivity of 93.4% (95% confidence interval (CI) 78.4 to 98.2) and a specificity of 95% (95% CI 88.7 to 97.9) for predicting primary or secondary amputations in people with complex lower limb injuries. Conclusion. GHOISS is highly accurate in predicting success of limb salvage, and can inform management and predict secondary outcomes.
However, there is a need for high-quality multicentre trials to confirm these findings and investigate the effectiveness of the score in children, and in predicting secondary amputations. Cite this article: Bone Joint J 2023;105-B(1):21–28


The Bone & Joint Journal
Vol. 105-B, Issue 6 | Pages 696 - 701
1 Jun 2023
Kurisunkal V Morris G Kaneuchi Y Bleibleh S James S Botchu R Jeys L Parry MC

Aims. Intra-articular (IA) tumours around the knee are treated with extra-articular (EA) resection, which is associated with poor functional outcomes. We aim to evaluate the accuracy of MRI in predicting IA involvement around the knee. Methods. We identified 63 cases of high-grade sarcomas in or around the distal femur that underwent an EA resection from a prospectively maintained database (January 1996 to April 2020). Suspicion of IA disease was noted in 52 cases, six had IA pathological fracture, two had an effusion, two had prior surgical intervention (curettage/IA intervention), and one had an osseous metastasis in the proximal tibia. To ascertain validity, two musculoskeletal radiologists (R1, R2) reviewed the preoperative imaging (MRI) of 63 consecutive cases on two occasions six weeks apart. The radiological criteria for IA disease comprised evidence of tumour extension within the suprapatellar pouch, intercondylar notch, extension along medial/lateral retinaculum, and presence of IA fracture. The radiological predictions were then confirmed with the final histopathology of the resected specimens. Results. The resection histology revealed 23 cases (36.5%) showing IA disease involvement compared with 40 cases without (62%). The intraobserver variability of R1 was 0.85 (p < 0.001) compared to R2 with κ = 0.21 (p = 0.007). The interobserver variability was κ = 0.264 (p = 0.003). Knee effusion was found to be the most sensitive indicator of IA involvement, with a sensitivity of 91.3% but specificity of only 35%. However, when combined with a pathological fracture, this rose to 97.5% and 100% when disease was visible in Hoffa’s fat pad. Conclusion. MRI imaging can sometimes overestimate IA joint involvement and needs to be correlated with clinical signs. In the light of our findings, we would recommend EA resections when imaging shows effusion combined with either disease in Hoffa’s fat pad or retinaculum, or pathological fractures. 
Cite this article: Bone Joint J 2023;105-B(6):696–701


Bone & Joint Open
Vol. 3, Issue 10 | Pages 826 - 831
28 Oct 2022
Jukes C Dirckx M Bellringer S Chaundy W Phadnis J

Aims. The conventionally described mechanism of distal biceps tendon rupture (DBTR) is of a ‘considerable extension force suddenly applied to a resisting, actively flexed forearm’. This has been commonly paraphrased as an ‘eccentric contracture to a flexed elbow’. Both definitions have been frequently used in the literature with little objective analysis or citation. The aim of the present study was to use video footage of real time distal biceps ruptures to revisit and objectively define the mechanism of injury. Methods. An online search identified 61 videos reporting a DBTR. Videos were independently reviewed by three surgeons to assess forearm rotation, elbow flexion, shoulder position, and type of muscle contraction being exerted at the time of rupture. Prospective data on mechanism of injury and arm position was also collected concurrently for 22 consecutive patients diagnosed with an acute DBTR in order to corroborate the video analysis. Results. Four videos were excluded, leaving 57 for final analysis. Mechanisms of injury included deadlift, bicep curls, calisthenics, arm wrestling, heavy lifting, and boxing. In all, 98% of ruptures occurred with the arm in supination and 89% occurred at 0° to 10° of elbow flexion. Regarding muscle activity, 88% occurred during isometric contraction, 7% during eccentric contraction, and 5% during concentric contraction. Interobserver correlation scores were calculated as 0.66 to 0.89 using the free-marginal Fleiss Kappa tool. The prospectively collected patient data was consistent with the video analysis, with 82% of injuries occurring in supination and 95% in relative elbow extension. Conclusion. Contrary to the classically described injury mechanism, in this study the usual arm position during DBTR was forearm supination and elbow extension, and the muscle contraction was typically isometric. This was demonstrated for both video analysis and ‘real’ patients across a range of activities leading to rupture. 
Cite this article: Bone Jt Open 2022;3(10):826–831
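
The interobserver statistic mentioned in the abstract above can be sketched in Python as follows. This assumes that the 'free-marginal Fleiss Kappa tool' refers to a free-marginal multirater kappa, in which chance agreement is fixed at one over the number of categories rather than estimated from the raters' marginal totals; the abstract does not give the formula. The function name and the toy counts (three raters, two categories) are hypothetical and are not data from the study.

```python
def free_marginal_kappa(counts):
    """Free-marginal multirater kappa.

    counts[i][j] is the number of raters who assigned item i to
    category j; every row must sum to the same number of raters.
    """
    n_items = len(counts)
    n_raters = sum(counts[0])
    n_categories = len(counts[0])
    if n_raters < 2 or n_categories < 2:
        raise ValueError("need at least two raters and two categories")
    if any(sum(row) != n_raters for row in counts):
        raise ValueError("every item must be rated by the same number of raters")
    # Observed agreement: agreeing rater pairs over all possible rater pairs.
    agreeing_pairs = sum(c * (c - 1) for row in counts for c in row)
    observed = agreeing_pairs / (n_items * n_raters * (n_raters - 1))
    # Free-marginal chance agreement: raters are not forced into fixed quotas.
    chance = 1 / n_categories
    return (observed - chance) / (1 - chance)


# Hypothetical example: four videos, three raters, two categories
# (for instance supinated versus not supinated).
toy_counts = [[3, 0], [2, 1], [0, 3], [3, 0]]
toy_result = free_marginal_kappa(toy_counts)
```

Fixing chance agreement in this way suits designs in which raters are free to place any number of items in each category, as when reviewers independently describe arm position in a set of videos.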


Bone & Joint Open
Vol. 4, Issue 4 | Pages 262 - 272
11 Apr 2023
Batailler C Naaim A Daxhelet J Lustig S Ollivier M Parratte S

Aims. The impact of a diaphyseal femoral deformity on knee alignment varies according to its severity and localization. The aims of this study were to determine a method of assessing the impact of diaphyseal femoral deformities on knee alignment for the varus knee, and to evaluate the reliability and the reproducibility of this method in a large cohort of osteoarthritic patients. Methods. All patients who underwent a knee arthroplasty from 2019 to 2021 were included. Exclusion criteria were genu valgus, flexion contracture (> 5°), previous femoral osteotomy or fracture, total hip arthroplasty, and femoral rotational disorder. A total of 205 patients met the inclusion criteria. The mean age was 62.2 years (SD 8.4). The mean BMI was 33.1 kg/m² (SD 5.5). The radiological measurements were performed twice by two independent reviewers, and included hip knee ankle (HKA) angle, mechanical medial distal femoral angle (mMDFA), anatomical medial distal femoral angle (aMDFA), femoral neck shaft angle (NSA), femoral bowing angle (FBow), the distance between the knee centre and the top of the FBow (DK), and the angle representing the FBow impact on the knee (C’KS angle). Results. The FBow impact on the mMDFA can be measured by the C’KS angle. The C’KS angle took the localization (length DK) and the importance (FBow angle) of the FBow into consideration. The mean FBow angle was 4.4° (SD 2.4; 0 to 12.5). The mean C’KS angle was 1.8° (SD 1.1; 0 to 5.8). Overall, 84 knees (41%) had a severe FBow (> 5°). The radiological measurements showed very good to excellent intraobserver and interobserver agreements. The C’KS increased significantly when the length DK decreased and the FBow angle increased (p < 0.001). Conclusion. The impact of the diaphyseal femoral deformity on the mechanical femoral axis is measured by the C’KS angle, a reliable and reproducible measurement. Cite this article: Bone Jt Open 2023;4(4):262–272


The Bone & Joint Journal
Vol. 102-B, Issue 8 | Pages 1041 - 1047
1 Aug 2020
Hamoodi Z Singh J Elvey MH Watts AC

Aims. The Wrightington classification system of fracture-dislocations of the elbow divides these injuries into six subtypes depending on the involvement of the coronoid and the radial head. The aim of this study was to assess the reliability and reproducibility of this classification system. Methods. This was a blinded study using radiographs and CT scans of 48 consecutive patients managed according to the Wrightington classification system between 2010 and 2018. Four trauma and orthopaedic consultants, two post-CCT fellows, and one speciality registrar based in the UK classified the injuries. The seven observers reviewed preoperative radiographs and CT scans twice, with a minimum four-week interval. Radiographs and CT scans were reviewed separately. Inter- and intraobserver reliability were calculated using Fleiss and Cohen kappa coefficients. The Landis and Koch criteria were used to interpret the strength of the kappa values. Validity was assessed by calculating the percentage agreement against intraoperative findings. Results. Of the 48 patients, three (6%) had type A injury, 11 (23%) type B, 16 (33%) type B+, 16 (33%) type C, two (4%) type D+, and none had a type D injury. All 48 patients had anteroposterior (AP) and lateral radiographs, 44 had 2D CT scans, and 39 had 3D reconstructions. The interobserver reliability kappa value was 0.52 for radiographs, 0.71 for 2D CT scans, and 0.73 for a combination of 2D and 3D reconstruction CT scans. The median intraobserver reliability was 0.75 (interquartile range (IQR) 0.62 to 0.79) for radiographs, 0.77 (IQR 0.73 to 0.94) for 2D CT scans, and 0.89 (IQR 0.77 to 0.93) for the combination of 2D and 3D reconstruction. Validity analysis showed that accuracy significantly improved when using CT scans (p = 0.018 and p = 0.028 respectively). Conclusion. The Wrightington classification system is a reliable and valid method of classifying fracture-dislocations of the elbow.
CT scans are significantly more accurate than radiographs when identifying the pattern of injury, with good intra- and interobserver reproducibility. Cite this article: Bone Joint J 2020;102-B(8):1041–1047


Bone & Joint Research
Vol. 9, Issue 7 | Pages 360 - 367
1 Jul 2020
Kawahara S Hara T Sato T Kitade K Shimoto T Nakamura T Mawatari T Higaki H Nakashima Y

Aims. Appropriate acetabular component placement has been proposed for prevention of postoperative dislocation in total hip arthroplasty (THA). Manual placements often cause outliers in spite of attempts to insert the component within the intended safe zone; therefore, some surgeons routinely evaluate intraoperative pelvic radiographs to exclude excessive acetabular component malposition. However, their evaluation is often ambiguous in case of the tilted or rotated pelvic position. The purpose of this study was to develop the computational analysis to digitalize the acetabular component orientation regardless of the pelvic tilt or rotation. Methods. Intraoperative pelvic radiographs of 50 patients who underwent THA were collected retrospectively. The 3D pelvic bone model and the acetabular component were image-matched to the intraoperative pelvic radiograph. The radiological anteversion (RA) and radiological inclination (RI) of the acetabular component were calculated and those measurement errors from the postoperative CT data were compared relative to those of the 2D measurements. In addition, the intra- and interobserver differences of the image-matching analysis were evaluated. Results. Mean measurement errors of the image-matching analyses were significantly small (2.5° (SD 1.4°) and 0.1° (SD 0.9°) in the RA and RI, respectively) relative to those of the 2D measurements. Intra- and interobserver differences were similarly small from the clinical perspective. Conclusion. We have developed a computational analysis of acetabular component orientation using an image-matching technique with small measurement errors compared to visual evaluations regardless of the pelvic tilt or rotation. Cite this article: Bone Joint Res 2020;9(7):360–367


The Bone & Joint Journal
Vol. 102-B, Issue 1 | Pages 102 - 107
1 Jan 2020
Sharma N Brown A Bouras T Kuiper JH Eldridge J Barnett A

Aims. Trochlear dysplasia is a significant risk factor for patellofemoral instability. The Dejour classification is currently considered the standard for classifying trochlear dysplasia, but numerous studies have reported poor reliability on both plain radiography and MRI. The severity of trochlear dysplasia is important to establish in order to guide surgical management. We have developed an MRI-specific classification system to assess the severity of trochlear dysplasia, the Oswestry-Bristol Classification (OBC). This is a four-part classification system comprising normal, mild, moderate, and severe to represent a normal, shallow, flat, and convex trochlear, respectively. The purpose of this study was to assess the inter- and intraobserver reliability of the OBC and compare it with that of the Dejour classification. Methods. Four observers (two senior and two junior orthopaedic surgeons) independently assessed 32 CT and axial MRI scans for trochlear dysplasia and classified each according to the OBC and the Dejour classification systems. Assessments were repeated following a four-week interval. The inter- and intraobserver agreement was determined by using Fleiss’ generalization of Cohen’s kappa statistic and S-statistic nominal and linear weights. Results. The OBC showed fair-to-good interobserver agreement and good-to-excellent intraobserver agreement (mean kappa 0.68). The Dejour classification showed poor interobserver agreement and fair-to-good intraobserver agreement (mean kappa 0.52). Conclusion. The OBC can be used to assess the severity of trochlear dysplasia. It can be applied in clinical practice to simplify and standardize surgical decision-making in patients with recurrent patella instability. Cite this article: Bone Joint J 2020;102-B(1):102–107


The Bone & Joint Journal
Vol. 100-B, Issue 5 | Pages 596 - 602
1 May 2018
Bock P Pittermann M Chraim M Rois S

Aims. Various radiological parameters are used to evaluate a flatfoot deformity and their measurements may differ. The aims of this study were to answer the following questions: 1) Which of the 11 parameters have the best inter- and intraobserver reliability in a standardized radiological setting? 2) Are pre- and postoperative assessments equally reliable? 3) What are the identifiable sources of variation? Patients and Methods. Measurements of the 11 parameters were recorded on anteroposterior and lateral weight-bearing radiographs of 38 feet before and after surgery for flatfoot, by three observers with different experience in foot surgery (A, ten years; B, three years; C, third-year orthopaedic resident). The inter- and intraobserver reliability was calculated. Results. Preoperative interobserver reliability was high for four, moderate for five, and low for two parameters. Postoperative interobserver reliability was high for four, moderate for five, and low for two parameters. Intraobserver reliability was excellent for all parameters preoperatively as recorded by observer A (PB) and B (MP), and for eight parameters as recorded by observer C (SR). Intraobserver reliability was excellent for ten parameters postoperatively as recorded by observer A and B, and for eight parameters as recorded by observer C. Conclusion. The following parameters can be recommended. For preoperative and postoperative evaluation of flatfoot: anteroposterior, talonavicular coverage angle; lateral, talometatarsal I angle, calcaneal pitch angle, and cuneiform-medial height (high interobserver reliability); and anteroposterior, talometatarsal II angle; lateral, talocalcaneal angle, tibiocalcaneal angle (moderate interobserver reliability). For more experienced observers, we also recommend the anteroposterior talometatarsal I angle (moderate reliability). The inter- and intraobserver reliability for most parameters were similar pre- and postoperatively.
The experience of the observer and the definition and ability to measure the parameters themselves were sources of variation. Cite this article: Bone Joint J 2018;100-B:596–602


Bone & Joint Open
Vol. 3, Issue 5 | Pages 423 - 431
1 May 2022
Leong JWY Singhal R Whitehouse MR Howell JR Hamer A Khanduja V Board TN

Aims. The aim of this modified Delphi process was to create a structured Revision Hip Complexity Classification (RHCC) which can be used as a tool to help direct multidisciplinary team (MDT) discussions of complex cases in local or regional revision networks. Methods. The RHCC was developed with the help of a steering group and an invitation through the British Hip Society (BHS) to members to apply, forming an expert panel of 35. We ran a mixed-method modified Delphi process (three rounds of questionnaires and one virtual meeting). Round 1 consisted of identifying the factors that govern the decision-making and complexities, with weighting given to factors considered most important by experts. Participants were asked to identify classification systems where relevant. Rounds 2 and 3 focused on grouping each factor into H1, H2, or H3, creating a hierarchy of complexity. This was followed by a virtual meeting in an attempt to achieve consensus on the factors which had not achieved consensus in preceding rounds. Results. The expert group achieved strong consensus in 32 out of 36 factors following the Delphi process. The RHCC used the existing Paprosky (acetabulum and femur), Unified Classification System, and American Society of Anesthesiologists (ASA) classification systems. Patients with ASA grade III/IV are recognized with a qualifier of an asterisk added to the final classification. The classification has good intraobserver and interobserver reliability with Kappa values of 0.88 to 0.92 and 0.77 to 0.85, respectively. Conclusion. The RHCC has been developed through a modified Delphi technique. RHCC will provide a framework to allow discussion of complex cases as part of a local or regional hip revision MDT. We believe that adoption of the RHCC will provide a comprehensive and reproducible method to describe each patient’s case with regard to surgical complexity, in addition to medical comorbidities that may influence their management. 
Cite this article: Bone Jt Open 2022;3(5):423–431


The Bone & Joint Journal
Vol. 103-B, Issue 11 | Pages 1662 - 1668
1 Nov 2021
Bhanushali A Chimutengwende-Gordon M Beck M Callary SA Costi K Howie DW Solomon LB

Aims. The aims of this study were to compare clinically relevant measurements of hip dysplasia on radiographs taken in the supine and standing position, and to compare Hip2Norm software and Picture Archiving and Communication System (PACS)-derived digital radiological measurements. Methods. Preoperative supine and standing radiographs of 36 consecutive patients (43 hips) who underwent periacetabular osteotomy surgery were retrospectively analyzed from a single-centre, two-surgeon cohort. Anterior coverage (AC), posterior coverage (PC), lateral centre-edge angle (LCEA), acetabular inclination (AI), sharp angle (SA), pelvic tilt (PT), retroversion index (RI), femoroepiphyseal acetabular roof (FEAR) index, femoroepiphyseal horizontal angle (FEHA), leg length discrepancy (LLD), and pelvic obliquity (PO) were analyzed using both Hip2Norm software and PACS-derived measurements where applicable. Results. Analysis of supine and standing radiographs resulted in significant variation for measurements of PT (p < 0.001) and AC (p = 0.005). The variation in PT correlated with the variation in AC in a limited number of patients (R² = 0.378; p = 0.012). Conclusion. The significant variation in PT and AC between supine and standing radiographs suggests that it may benefit surgeons to have both radiographs when planning surgical correction of hip dysplasia. We also recommend using PACS-derived measurements of AI and SA due to the poor interobserver error on Hip2Norm. Cite this article: Bone Joint J 2021;103-B(11):1662–1668


The Bone & Joint Journal
Vol. 102-B, Issue 3 | Pages 301 - 309
1 Mar 2020
Keenan OJF Holland G Maempel JF Keating JF Scott CEH

Aims. Although knee osteoarthritis (OA) is diagnosed and monitored radiologically, actual full-thickness cartilage loss (FTCL) has rarely been correlated with radiological classification. This study aims to analyze which classification system correlates best with FTCL and to assess their reliability. Methods. A prospective study of 300 consecutive patients undergoing unilateral total knee arthroplasty (TKA) for OA (mean age 69 years (44 to 91; standard deviation (SD) 9.5), 178 (59%) female). Two blinded examiners independently graded preoperative radiographs using five common systems: Kellgren-Lawrence (KL); International Knee Documentation Committee (IKDC); Fairbank; Brandt; and Ahlbäck. Interobserver agreement was assessed using the intraclass correlation coefficient (ICC). Intraoperatively, anterior cruciate ligament (ACL) status and the presence of FTCL in 16 regions of interest were recorded. Radiological classification and FTCL were correlated using the Spearman correlation coefficient. Results. Knees had a mean of 6.8 regions of FTCL (SD 3.1), most common medially. The commonest patterns of FTCL were medial ± patellofemoral (143/300, 48%) and tricompartmental (89/300, 30%). ACL status was associated with pattern of FTCL (p = 0.023). All radiological classification systems demonstrated moderate ICC, but this was highest for the IKDC: whole knee 0.68 (95% confidence interval (CI) 0.60 to 0.74); medial compartment 0.84 (95% CI 0.80 to 0.87); and lateral compartment 0.79 (95% CI 0.73 to 0.83). Correlation with actual FTCL was strongest for Ahlbäck (Spearman rho 0.27 to 0.39) and KL (0.30 to 0.33) systems, although all systems demonstrated medium correlation. The Ahlbäck score was the most discriminating in severe knee OA. Osteophyte presence in the medial compartment had high positive predictive value (PPV) for FTCL, but not in the lateral compartment. Conclusion. The Ahlbäck and KL systems had the highest correlation with confirmed cartilage loss at TKA. 
However, the IKDC system displayed the best interobserver reliability, with favourable correlation with FTCL in medial and lateral compartments, although it was less discriminating in more severe disease. Cite this article: Bone Joint J 2020;102-B(3):301–309


The Bone & Joint Journal
Vol. 103-B, Issue 5 | Pages 872 - 880
1 May 2021
Young PS Macarico DT Silverwood RK Farhan-Alanie OM Mohammed A Periasamy K Nicol A Meek RMD

Aims. Uncemented metal acetabular components show good osseointegration, but material stiffness causes stress shielding and retroacetabular bone loss. Cemented monoblock polyethylene components load more physiologically; however, the cement bone interface can suffer fibrous encapsulation and loosening. It was hypothesized that an uncemented titanium-sintered monoblock polyethylene component may offer the optimum combination of osseointegration and anatomical loading. Methods. A total of 38 patients were prospectively enrolled and received an uncemented monoblock polyethylene acetabular (pressfit) component. This single cohort was then retrospectively compared with previously reported randomized cohorts of cemented monoblock (cemented) and trabecular metal (trabecular) acetabular implants. The primary outcome measure was periprosthetic bone density using dual-energy x-ray absorptiometry over two years. Secondary outcomes included radiological and clinical analysis. Results. Although there were differences in the number of males and females in each group, no significant sex bias was noted (p = 0.080). Furthermore, there was no significant difference in age (p = 0.910) or baseline lumbar bone mineral density (BMD) (p = 0.998) found between any of the groups (pressfit, cemented, or trabecular). The pressfit implant initially behaved like the trabecular component with an immediate fall in BMD in the inferior and medial regions, with preserved BMD laterally, suggesting lateral rim loading. However, the pressfit component subsequently showed a reversal in BMD medially with recovery back towards baseline, and a continued rise in lateral BMD. This would suggest that the pressfit component begins to reload the medial bone over time, more akin to the cemented component. Analysis of postoperative radiographs revealed no pressfit component subsidence or movement up to two years postoperatively (100% interobserver reliability). 
Medial defects seen immediately postoperatively in five cases had completely resolved by two years in four patients. Conclusion. Initially, the uncemented monoblock component behaved similarly to the rigid trabecular metal component with lateral rim loading; however, over two years this changed to more closely resemble the loading pattern of a cemented polyethylene component with increasing medial pelvic loading. This indicates that the uncemented monoblock acetabular component may result in optimized fixation and preservation of retroacetabular bone stock. Cite this article: Bone Joint J 2021;103-B(5):872–880


The Bone & Joint Journal
Vol. 101-B, Issue 10 | Pages 1300 - 1306
1 Oct 2019
Oliver WM Smith TJ Nicholson JA Molyneux SG White TO Clement ND Duckworth AD

Aims. The primary aim of this study was to develop a reliable, effective radiological score to assess the healing of humeral shaft fractures, the Radiographic Union Score for HUmeral fractures (RUSHU). The secondary aim was to assess whether the six-week RUSHU was predictive of nonunion at six months after the injury. Patients and Methods. Initially, 20 patients with radiographs six weeks following a humeral shaft fracture were selected at random from a trauma database and scored by three observers, based on the Radiographic Union Scale for Tibial fractures system. After refinement of the RUSHU criteria, a second group of 60 patients with radiographs six weeks after injury, 40 with fractures that united and 20 with fractures that developed nonunion, were scored by two blinded observers. Results. After refinement, the interobserver intraclass correlation coefficient (ICC) was 0.79 (95% confidence interval (CI) 0.67 to 0.87), indicating substantial agreement. At six weeks after injury, patients whose fractures united had a significantly higher median score than those who developed nonunion (10 vs 7; p < 0.001). A receiver operating characteristic curve determined that a RUSHU cut-off of < 8 was predictive of nonunion (area under the curve = 0.84, 95% CI 0.74 to 0.94). The sensitivity was 75% and specificity 80% with a positive predictive value (PPV) of 65% and a negative predictive value of 86%. Patients with a RUSHU < 8 (n = 23) were more likely to develop nonunion than those with a RUSHU ≥ 8 (n = 37, odds ratio 12.0, 95% CI 3.4 to 42.9). Based on a PPV of 65%, if all patients with a RUSHU < 8 underwent fixation, the number of procedures needed to avoid one nonunion would be 1.5. Conclusion. The RUSHU is reliable and effective in identifying patients at risk of nonunion of a humeral shaft fracture at six weeks after injury. This tool requires external validation but could potentially reduce the morbidity associated with delayed treatment of an established nonunion. 
Cite this article: Bone Joint J 2019;101-B:1300–1306
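
The accuracy figures in the abstract above can be cross-checked with a short Python sketch. The 2 × 2 cell counts below are not reported in the abstract; they are back-calculated here, as an assumption, from the 20 nonunions, 40 unions, 75% sensitivity, and 80% specificity that the abstract does report. The function and variable names are hypothetical.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard accuracy measures from the four cells of a 2 x 2 table."""
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "odds_ratio": (tp * tn) / (fp * fn),
    }


# Assumed counts, reconstructed from the abstract rather than reported in it:
# 20 nonunions at 75% sensitivity -> 15 with RUSHU < 8, 5 with RUSHU >= 8
# 40 unions at 80% specificity    -> 32 with RUSHU >= 8, 8 with RUSHU < 8
rushu = diagnostic_metrics(tp=15, fp=8, fn=5, tn=32)

# If every patient with RUSHU < 8 were fixed, the number of procedures
# needed to avoid one nonunion is the reciprocal of the PPV.
procedures_per_nonunion = 1 / rushu["ppv"]
```

Under these assumed counts, 23 patients score below 8 and 37 score 8 or above, the positive predictive value is 15/23 (about 65%), the negative predictive value is 32/37 (about 86%), the odds ratio is 12.0, and 1/PPV is about 1.5, all of which agree with the figures in the abstract.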


The Bone & Joint Journal
Vol. 102-B, Issue 5 | Pages 593 - 599
1 May 2020
Amanatullah DF Cheng RZ Huddleston III JI Maloney WJ Finlay AK Kappagoda S Suh GA Goodman SB

Aims. To establish the utility of adding the laboratory-based synovial alpha-defensin immunoassay to the traditional diagnostic work-up of a prosthetic joint infection (PJI). Methods. A group of four physicians evaluated 158 consecutive patients who were worked up for PJI, of whom 94 underwent revision arthroplasty. Each physician reviewed the diagnostic data and decided on the presence of PJI according to the 2014 Musculoskeletal Infection Society (MSIS) criteria (yes, no, or undetermined). Their initial randomized review of the available data before or after surgery was blinded to each alpha-defensin result and a subsequent randomized review was conducted with each result. Multilevel logistic regression analysis assessed the effect of having the alpha-defensin result on the ability to diagnose PJI. Alpha-defensin was correlated to the number of synovial white blood cells (WBCs) and percentage of polymorphonuclear cells (%PMN). Results. Intraobserver reliability and interobserver agreement did not change when the alpha-defensin result was available. Positive alpha-defensin results had greater synovial WBCs (mean 31,854 cells/μL, SD 32,594) and %PMN (mean 93.0%, SD 5.5%) than negative alpha-defensin results (mean 974 cells/μL, SD 3,988; p < 0.001 and mean 39.4%, SD 28.6%; p < 0.001). Adding the alpha-defensin result did not alter the diagnosis of a PJI using preoperative (odds ratio (OR) 0.52, 95% confidence interval (CI) 0.14 to 1.88; p = 0.315) or operative (OR 0.52, CI 0.18 to 1.55; p = 0.242) data when clinicians already decided that PJI was present or absent with traditionally available testing. However, when undetermined with traditional preoperative testing, alpha-defensin helped diagnose (OR 0.44, CI 0.30 to 0.64; p < 0.001) or rule out (OR 0.41, CI 0.17 to 0.98; p = 0.044) PJI. Of the 27 undecided cases with traditional testing, 24 (89%) benefited from the addition of alpha-defensin testing. Conclusion. The laboratory-based synovial alpha-defensin immunoassay did not help diagnose or rule out a PJI when added to routine serologies and synovial fluid analyses except in cases where the diagnosis of PJI was unclear. We recommend against the routine use of alpha-defensin and suggest using it only when traditional testing is indeterminate. Cite this article: Bone Joint J 2020;102-B(5):593–599


The Bone & Joint Journal
Vol. 101-B, Issue 12 | Pages 1578 - 1584
1 Dec 2019
Batailler C Weidner J Wyatt M Pfluger D Beck M

Aims. A borderline dysplastic hip can behave as either stable or unstable and this makes surgical decision making challenging. While an unstable hip may be best treated by acetabular reorientation, stable hips can be treated arthroscopically. Several imaging parameters can help to identify the appropriate treatment, including the Femoro-Epiphyseal Acetabular Roof (FEAR) index, measured on plain radiographs. The aim of this study was to assess the reliability and the sensitivity of FEAR index on MRI compared with its radiological measurement. Patients and Methods. The technique of measuring the FEAR index on MRI was defined and its reliability validated. A retrospective study assessed three groups of 20 patients: an unstable group of ‘borderline dysplastic hips’ with lateral centre edge angle (LCEA) less than 25° treated successfully by periacetabular osteotomy; a stable group of ‘borderline dysplastic hips’ with LCEA less than 25° treated successfully by impingement surgery; and an asymptomatic control group with LCEA between 25° and 35°. The following measurements were performed on both standardized radiographs and on MRI: LCEA, acetabular index, femoral anteversion, and FEAR index. Results. The FEAR index showed excellent intraobserver and interobserver reliability on both MRI and radiographs. The FEAR index was more reliable on radiographs than on MRI. The FEAR index on MRI was lower in the stable borderline group (mean -4.2° (SD 9.1°)) compared with the unstable borderline group (mean 7.9° (SD 6.8°)). With a FEAR index cut-off value of 2°, 90% of patients were correctly identified as stable or unstable using the radiological FEAR index, compared with 82.5% using the FEAR index on MRI. The FEAR index was a better predictor of instability on plain radiographs than on MRI. Conclusion. The FEAR index measured on MRI is less reliable and less sensitive than the FEAR index measured on radiographs. The cut-off value of 2° for radiological FEAR index predicted hip stability with 90% probability. Cite this article: Bone Joint J 2019;101-B:1578–1584


Bone & Joint Research
Vol. 2, Issue 1 | Pages 1 - 8
1 Jan 2013
Costa AJ Lustig S Scholes CJ Balestro J Fatima M Parker DA

Objectives. There remains a lack of data on the reliability of methods to estimate tibial coverage achieved during total knee replacement. In order to address this gap, the intra- and interobserver reliability of a three-dimensional (3D) digital templating method was assessed with one symmetric and one asymmetric prosthesis design. Methods. A total of 120 template procedures were performed according to specific rotational and overhang criteria by three observers at time zero and again two weeks later. Total and sub-region coverage were calculated and the reliability of the templating and measurement method was evaluated. Results. Excellent intra- and interobserver reliability was observed for total coverage, when minimal component overhang (intraclass correlation coefficient (ICC) = 0.87) or no component overhang (ICC = 0.92) was permitted, regardless of rotational restrictions. Conclusions. Measurement of tibial coverage can be reliable using the templating method described even if the rotational axis selected still has a minor influence.


The Journal of Bone & Joint Surgery British Volume
Vol. 91-B, Issue 9 | Pages 1191 - 1196
1 Sep 2009
Pagenstert GI Barg A Leumann AG Rasch H Müller-Brand J Hintermann B Valderrabano V

The precise localisation of osteoarthritic changes is crucial for selective surgical treatment. Single photon-emission CT-CT (SPECT-CT) combines both morphological and biological information. We hypothesised that SPECT-CT increased the intra- and interobserver reliability to localise increased uptake compared with traditional evaluation of CT and bone scanning together. We evaluated 20 consecutive patients with pain of uncertain origin in the foot and ankle by radiography and SPECT-CT, available as fused SPECT-CT, and by separate bone scanning and CT. Five observers assessed the presence or absence of arthritis. The images were blinded and randomly ordered. They were evaluated twice at an interval of six weeks. Kappa and multirater kappa values were calculated. The mean intraobserver reliability for SPECT-CT was excellent (κ = 0.86; 95% CI 0.81 to 0.88) and significantly higher than that for CT and bone scanning together. SPECT-CT had significantly higher interobserver agreement, especially when evaluating the naviculocuneiform and tarsometatarsal joints. SPECT-CT is useful in localising active arthritis especially in areas where the number and configuration of joints are complex


The Journal of Bone & Joint Surgery British Volume
Vol. 91-B, Issue 6 | Pages 766 - 771
1 Jun 2009
Brunner A Honigmann P Treumann T Babst R

We evaluated the impact of stereo-visualisation of three-dimensional volume-rendering CT datasets on the inter- and intraobserver reliability assessed by kappa values on the AO/OTA and Neer classifications in the assessment of proximal humeral fractures. Four independent observers classified 40 fractures according to the AO/OTA and Neer classifications using plain radiographs, two-dimensional CT scans and with stereo-visualised three-dimensional volume-rendering reconstructions. Both classification systems showed moderate interobserver reliability with plain radiographs and two-dimensional CT scans. Three-dimensional volume-rendered CT scans improved the interobserver reliability of both systems to good. Intraobserver reliability was moderate for both classifications when assessed by plain radiographs. Stereo visualisation of three-dimensional volume rendering improved intraobserver reliability to good for the AO/OTA method and to excellent for the Neer classification. These data support our opinion that stereo visualisation of three-dimensional volume-rendering datasets is of value when analysing and classifying complex fractures of the proximal humerus


The Journal of Bone & Joint Surgery British Volume
Vol. 93-B, Issue 5 | Pages 629 - 633
1 May 2011
Hirschmann MT Konala P Amsler F Iranpour F Friederich NF Cobb JP

We studied the intra- and interobserver reliability of measurements of the position of the components after total knee replacement (TKR) using a combination of radiographs and axial two-dimensional (2D) and three-dimensional (3D) reconstructed CT images to identify which method is best for this purpose. A total of 30 knees after primary TKR were assessed by two independent observers (an orthopaedic surgeon and a radiologist) using radiographs and CT scans. Plain radiographs were highly reliable at measuring the tibial slope, but showed wide variability for all other measurements; 2D-CT also showed wide variability. 3D-CT was highly reliable, even when measuring rotation of the femoral components, and significantly better than 2D-CT. Interobserver variability in the measurements on radiographs was good (intraclass correlation coefficient (ICC) 0.65 to 0.82), but rotational measurements on 2D-CT were poor (ICC 0.29). On 3D-CT they were near perfect (ICC 0.89 to 0.99), and significantly more reliable than 2D-CT (p < 0.001). 3D-reconstructed images are sufficiently reliable to enable reporting of the position and orientation of the components. Rotational measurements in particular should be performed on 3D-reconstructed CT images. When faced with a poorly functioning TKR with concerns over component positioning, we recommend 3D-CT as the investigation of choice.


The Journal of Bone & Joint Surgery British Volume
Vol. 80-B, Issue 2 | Pages 321 - 324
1 Mar 1998
Bar-On E Meyer S Harati G Porat S

Ultrasonography of the hip was performed sequentially by two different examiners in 75 infants. The ultrasound strips were reviewed twice by three paediatric orthopaedic surgeons and classified by the Graf method. The intraobserver and interobserver agreement between the interpretations was analysed using simple and weighted kappa coefficients calculated for agreement on the Graf classification and for grouping as normal (types 1A to 2A), and abnormal requiring treatment (types 2B to 4). When examining the same ultrasound strip, intraobserver agreement for the Graf classification was substantial (mean kappa 0.61), but interobserver agreement was only moderate (kappa 0.50). For the grouping into normal and abnormal, the mean kappa value for intraobserver agreement was 0.67 and for interobserver agreement 0.57. Because of the significant differences in agreement between normal and abnormal hips, we analysed a subgroup of those with at least one abnormal interpretation. Intraobserver agreement within this subgroup showed moderate reliability (kappa 0.41), but interobserver agreement was only fair (kappa 0.28). Interpretations of two different strips performed sequentially showed significantly lower agreement with an intraobserver kappa value of 0.29 and an interobserver value of 0.28. In the subgroup with at least one abnormal reading, the intraobserver kappa was 0.09 and the interobserver 0.1. Our findings suggest that both the technique of performing ultrasonography and the interpretation of the image may influence the result
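The verbal labels used in this abstract (substantial, moderate, fair) are consistent with the widely used Landis and Koch bands for kappa, although the abstract does not name the scale it applied. The sketch below assumes that scale; the function name `kappa_band` is illustrative and the kappa values are the ones reported above.

```python
def kappa_band(kappa: float) -> str:
    """Map a kappa value to its Landis and Koch (1977) agreement label.

    Assumption: the abstract used this scale; it does not say so explicitly.
    """
    if kappa < 0:
        return "poor"
    if kappa <= 0.20:
        return "slight"
    if kappa <= 0.40:
        return "fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"


# Kappa values reported in the abstract above.
reported = {
    "same strip, Graf classification, intraobserver": 0.61,
    "same strip, Graf classification, interobserver": 0.50,
    "abnormal subgroup, intraobserver": 0.41,
    "abnormal subgroup, interobserver": 0.28,
    "different strips, intraobserver": 0.29,
    "different strips, abnormal subgroup, intraobserver": 0.09,
}

for setting, kappa in reported.items():
    print(f"{setting}: kappa {kappa:.2f} -> {kappa_band(kappa)}")
```

Under this assumed scale, the values for two different strips in the abnormal subgroup (0.09 and 0.1) fall in the lowest non-negative band.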


The Journal of Bone & Joint Surgery British Volume
Vol. 93-B, Issue 8 | Pages 1021 - 1026
1 Aug 2011
Kalteis T Sendtner E Beverland D Archbold PA Hube R Schuster T Renkawitz T Grifka J

Orientation of the native acetabular plane as defined by the transverse acetabular ligament (TAL) and the posterior labrum was measured intra-operatively using computer-assisted navigation in 39 hips. In order to assess the influence of alignment on impingement, the range of movement was calculated for that defined by the TAL and the posterior labrum and compared with a standard acetabular component position (abduction 45°/anteversion 15°). With respect to the registration of the plane defined by the TAL and the posterior labrum, there was moderate interobserver agreement (r = 0.64, p < 0.001) and intra-observer reproducibility (r = 0.73, p < 0.001). The mean acetabular component orientation achieved was abduction of 41° (32° to 51°) and anteversion of 18° (−1° to 36°). With respect to the Lewinnek safe zone (abduction 40° ±10°, anteversion 15° ±10°), 35 of the 39 acetabular components were within this zone. However, there was no improvement in the range of movement (p = 0.94) and no significant difference in impingement (p = 0.085). Alignment of the acetabular component with the TAL and the posterior labrum might reduce the variability of acetabular component placement in total hip replacement. However, there is only a moderate interobserver agreement and intra-observer reliability in the alignment of the acetabular component using the TAL and the posterior labrum. No reduction in impingement was found when the acetabular component was aligned with the TAL and the posterior labrum, compared with a standard acetabular component position


The Bone & Joint Journal
Vol. 101-B, Issue 7 | Pages 848 - 851
1 Jul 2019
Sautet P Parratte S Mékidèche T Abdel MP Flécher X Argenson J Ollivier M

Aims. The aims of this study were to compare the mean duration of antibiotic release and the mean zone of inhibition between vancomycin-loaded porous tantalum cylinders and antibiotic-loaded bone cement at intervals, and to evaluate potential intrinsic antimicrobial properties of tantalum in an in vitro medium environment against methicillin-sensitive Staphylococcus aureus (MSSA). Materials and Methods. Ten porous tantalum cylinders and ten cylinders of cement were used. The tantalum cylinders were impregnated with vancomycin, which was also added during preparation of the cylinders of cement. The cylinders were then placed on agar plates inoculated with MSSA. The diameter of the inhibition zone was measured each day, and the cylinders were transferred to a new inoculated plate. Inhibition zones were measured with a Vernier caliper and using an automated computed evaluation, and the intra- and interobserver reproducibility were measured. The mean inhibition zones between the two groups were compared with Wilcoxon’s test. Results. MSSA was inhibited for 12 days by the tantalum cylinders and for nine days by the cement cylinders. At day one, the mean zone of inhibition was 28.6 mm for the tantalum and 19.8 mm for the cement group (p < 0.001). At day ten, the mean zone of inhibition was 3.8 mm for the tantalum and 0 mm for the cement group (p < 0.001). The porous tantalum cylinders soaked only with phosphate buffered solution showed no zone of inhibition. Conclusion. Compared with cement, tantalum could release antibiotics for longer. Further studies should assess the advantages of using antibiotic-loaded porous tantalum implants at revision arthroplasty. Cite this article: Bone Joint J 2019;101-B:848–851


The Bone & Joint Journal
Vol. 95-B, Issue 10 | Pages 1396 - 1401
1 Oct 2013
Gabbe BJ Esser M Bucknill A Russ MK Hofstee D Cameron PA Handley C deSteiger RN

We describe the routine imaging practices of Level 1 trauma centres for patients with severe pelvic ring fractures, and the interobserver reliability of the classification systems of these fractures using plain radiographs and three-dimensional (3D) CT reconstructions. Clinical and imaging data for 187 adult patients (139 men and 48 women, mean age 43 years (15 to 101)) with a severe pelvic ring fracture managed at two Level 1 trauma centres between July 2007 and June 2010 were extracted. Three experienced orthopaedic surgeons classified the plain radiographs and 3D CT reconstruction images of 100 patients using the Tile/AO and Young–Burgess systems. Reliability was compared using kappa statistics. A total of 115 patients (62%) had plain radiographs as well as two-dimensional (2D) CT and 3D CT reconstructions, 52 patients (28%) had plain films only, 12 (6.4%) had 2D and 3D CT reconstruction images only, and eight patients (4.3%) had no available images. The plain radiograph was limited to an anteroposterior pelvic view. Patients without imaging, or only plain films, were more severely injured. A total of 72 patients (39%) were imaged with a pelvic binder in situ. Interobserver reliability for the Tile/AO (Kappa 0.10 to 0.17) and Young–Burgess (Kappa 0.09 to 0.21) was low, and insufficient for clinical and research purposes. Severe pelvic ring fractures are difficult to classify due to their complexity, the increasing use of early treatment such as with pelvic binders, and the absence of imaging altogether in important patient sub-groups, such as those who die early of their injuries. Cite this article: Bone Joint J 2013;95-B:1396–1401
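As a quick arithmetic check on the imaging breakdown in this abstract, the sketch below recomputes each subgroup's share of the 187 patients. The counts are taken from the abstract; the dictionary keys and variable names are illustrative only.

```python
# Patient counts by imaging availability, as reported in the abstract.
total_patients = 187
imaging_counts = {
    "radiographs + 2D CT + 3D CT": 115,
    "plain films only": 52,
    "2D and 3D CT only": 12,
    "no available images": 8,
}

# The four subgroups should account for every patient.
assert sum(imaging_counts.values()) == total_patients

# Share of the cohort in each subgroup, as a percentage.
shares = {
    label: 100 * count / total_patients
    for label, count in imaging_counts.items()
}

for label, share in shares.items():
    print(f"{label}: {share:.1f}%")
```

The four counts sum to 187, and the computed shares agree with the reported percentages to within rounding; the first subgroup comes out at about 61.5%, which the abstract gives as 62%.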


The Journal of Bone & Joint Surgery British Volume
Vol. 93-B, Issue 6 | Pages 777 - 781
1 Jun 2011
Kalra S Smith TO Berko B Walton NP

The Oxford unicompartmental knee replacement gives good results in patients with symptomatic osteoarthritis of the medial compartment. Previous studies have suggested that the presence of radiolucent lines (RLLs) does not reflect a poor outcome in such patients. However, the reliability and validity of this assessment have not been determined. Our aim was to assess the intra- and interobserver reliability and the sensitivity and specificity of the assessment of RLLs around both tibial and femoral components using standard radiographs. Two reviewers assessed the radiographs of 45 patients who had loosening of the tibial or femoral component confirmed at revision surgery and compared them with those of a series of 45 asymptomatic patients matched for age and gender. The results suggested that, using standard radiographs, tibial RLLs were 63.6% sensitive and 94.4% specific and femoral RLLs 63.9% sensitive and 72.7% specific for loosening. Overall intra- and interobserver reliability was highly variable, but zonal analysis showed that lucency at the tip of the femoral peg was significantly associated with loosening of the femoral component. Fluoroscopically guided radiographs may improve the zonal reliability of the assessment of RLLs, but further independent and comparative studies are required. In the meantime, the innocence of the physiological RLLs detected by standard radiographs should be viewed with caution


The Bone & Joint Journal
Vol. 101-B, Issue 10 | Pages 1285 - 1291
1 Oct 2019
MacKenzie SA Ng RT Snowden G Powell-Bowns MFR Duckworth AD Scott CEH

Aims. Currently, periprosthetic fractures are excluded from the American Society for Bone and Mineral Research (ASBMR) definition of atypical femoral fractures (AFFs). This study aims to report on a series of periprosthetic femoral fractures (PFFs) that otherwise meet the criteria for AFFs. Secondary aims were to identify predictors of periprosthetic atypical femoral fractures (PAFFs) and quantify the complications of treatment. Patients and Methods. This was a retrospective case-control study of consecutive patients with periprosthetic femoral fractures between 2007 and 2017. Two observers identified 16 PAFF cases (mean age 73.9 years (44 to 88), 14 female patients) and 17 typical periprosthetic fractures in patients on bisphosphonate therapy as controls (mean age 80.7 years (60 to 86), 13 female patients). Univariate and multivariate analysis was performed to identify predictors of PAFF. Management and complications were recorded. Results. Interobserver agreement for the PAFF classification was excellent (kappa = 0.944; p < 0.001). On univariate analysis compared with controls, patients with PAFFs had higher mean body mass indices (28.6 kg/m² (SD 8.9) vs 21.5 kg/m² (SD 3.3); p = 0.009), longer durations of bisphosphonate therapy (median 5.5 years (IQR 3.2 to 10.6) vs 2.4 years (IQR 1.0 to 6.4); p = 0.04), and were less likely to be on alendronate (50% vs 94%; p = 0.02) with an indication of secondary osteoporosis (19% vs 0%; p = 0.049). Duration of bisphosphonate therapy was an independent predictor of PAFF on multivariate analysis (R² = 0.733; p = 0.05). Following primary fracture management, complication rates were higher in PAFFs (9/16, 56%) than controls (5/17, 29%; p = 0.178) with a relative risk of any complication following PAFF of 1.71 (95% confidence interval (CI) 0.77 to 3.8) and of reoperation 2.56 (95% CI 1.3 to 5.2). Conclusion. AFFs do occur in association with prostheses. Longer duration of bisphosphonate therapy is an independent predictor of PAFF. Complication rates are higher following PAFFs compared with typical PFFs, particularly of reoperation and infection. Cite this article: Bone Joint J 2019;101-B:1285–1291


The Journal of Bone & Joint Surgery British Volume
Vol. 87-B, Issue 9 | Pages 1267 - 1271
1 Sep 2005
Allami MK Jamil W Fourie B Ashton V Gregg PJ

The Department of Health and the Public Health Laboratory Service established the Nosocomial Infection National Surveillance Scheme in order to standardise the collection of information about infections acquired in hospital in the United Kingdom and provide national data with which hospitals could measure their own performance. The definition of superficial incisional infection (skin and subcutaneous tissue), set by the Center for Disease Control (CDC), should meet at least one of the defined criteria which would confirm the diagnosis and determine the need for specific treatment. We have assessed the interobserver reliability of the criteria for superficial incisional infection set by the CDC in our current practice. The incisional site of 50 patients who had an elective primary arthroplasty of the hip or knee was evaluated independently by two orthopaedic clinical research fellows and two orthopaedic ward sisters for the presence or absence of surgical-site infection. Interobserver reliability was assessed by comparison of the criteria for wound infection used by the four observers using kappa reliability coefficients. Our study demonstrated that some of the components of the current CDC criteria were unreliable and we recommend their revision
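For readers unfamiliar with the kappa coefficient used in this abstract, the following is a minimal sketch of Cohen's kappa for a single pair of observers rating presence or absence of infection. The ratings are hypothetical, not data from the study, and the study's four-observer analysis may have been computed differently; `cohens_kappa` is an illustrative name.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters who labelled the same cases."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty set of cases")
    n = len(rater_a)

    # Observed agreement: fraction of cases given the same label.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Chance agreement: from each rater's marginal label frequencies.
    freq_a = Counter(rater_a)
    freq_b = Counter(rater_b)
    categories = freq_a.keys() | freq_b.keys()
    expected = sum(freq_a[c] * freq_b[c] for c in categories) / (n * n)

    if expected == 1.0:
        # Both raters used one identical label for every case.
        return 1.0
    return (observed - expected) / (1 - expected)


# Hypothetical ratings for ten wounds (1 = infection present, 0 = absent).
fellow = [1, 1, 0, 0, 0, 0, 1, 0, 0, 0]
ward_sister = [1, 0, 0, 0, 0, 0, 1, 0, 0, 1]

print(f"kappa = {cohens_kappa(fellow, ward_sister):.3f}")
```

In this made-up example the two observers agree on eight of ten wounds, but because most wounds are rated as not infected by both, a large part of that agreement is expected by chance, which is why kappa is lower than the raw agreement.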


Bone & Joint Research
Vol. 7, Issue 7 | Pages 468 - 475
1 Jul 2018
He Q Sun H Shu L Zhu Y Xie X Zhan Y Luo C

Objectives. Researchers continue to seek easier ways to evaluate the quality of bone and screen for osteoporosis and osteopenia. Until recently, radiographic images of various parts of the body, except the distal femur, have been reappraised in the light of dual-energy X-ray absorptiometry (DXA) findings. The incidence of osteoporotic fractures around the knee joint in the elderly continues to increase. The aim of this study was to propose two new radiographic parameters of the distal femur for the assessment of bone quality. Methods. Anteroposterior radiographs of the knee and bone mineral density (BMD) and T-scores from DXA scans of 361 healthy patients were prospectively analyzed. The mean cortical bone thickness (CBTavg) and the distal femoral cortex index (DFCI) were the two parameters that were proposed and measured. Intra- and interobserver reliabilities were assessed. Correlations between the BMD and T-score and these parameters were investigated and their value in the diagnosis of osteoporosis and osteopenia was evaluated. Results. The DFCI, as a ratio, had higher reliability than the CBTavg. Both showed significant correlation with BMD and T-score. When compared with DFCI, CBTavg showed better correlation and was better for predicting osteoporosis and osteopenia. Conclusion. The CBTavg and DFCI are simple and reliable screening tools for the prediction of osteoporosis and osteopenia. The CBTavg is more accurate but the DFCI is easier to use in clinical practice. Cite this article: Q-F. He, H. Sun, L-Y. Shu, Y. Zhu, X-T. Xie, Y. Zhan, C-F. Luo. Radiographic predictors for bone mineral loss: Cortical thickness and index of the distal femur. Bone Joint Res 2018;7:468–475. DOI: 10.1302/2046-3758.77.BJR-2017-0332.R1


The Bone & Joint Journal
Vol. 100-B, Issue 8 | Pages 1100 - 1105
1 Aug 2018
Howard EL Shepherd KL Cribb G Cool P

Aims. The aim of this study was to validate the Mirels score in predicting pathological fractures in metastatic disease of the lower limb. Patients and Methods. A total of 62 patients with confirmed metastatic disease met the inclusion criteria. Of the 62 patients, 32 were female and 30 were male. The mean age of patients was 65 years (35 to 89). The primary malignancy originated from the breast in 27 (44%) patients, prostate in 15 (24%) patients, kidney in seven (11%), and lung in four (6%) patients. One patient (2%) had metastatic carcinoma from the lacrimal gland, two patients (3%) had multiple myeloma, one patient (2%) had lymphoma of bone, and five patients (8%) had metastatic carcinoma of unknown primary. Plain radiographs at the time of initial presentation were scored using the Mirels system by the four authors. The radiographic components of the score (anatomical site, size, and radiographic appearance) were scored two weeks apart. Inter- and intraobserver reliability were calculated with Fleiss’ kappa test. Bland-Altman plots were created to compare the variances of the individual components of the score and the total Mirels score. Results. Kappa values for the interobserver variability of the components of the Mirels score were k = 0.554 (95% CI 0.483 to 0.626) for site, k = 0.342 (95% CI 0.285 to 0.400) for size, k = 0.443 (95% CI 0.387 to 0.499) for radiographic appearance, and k = 0.294 (95% CI 0.258 to 0.331) for the total score. Kappa values for the intra-observer reliability were k = 0.608 (95% CI 0.506 to 0.710) for site, k = 0.579 (95% CI 0.487 to 0.670) for size, k = 0.614 (95% CI 0.522 to 0.703) for radiographic appearance, and k = 0.323 (95% CI 0.266 to 0.379) for total score. Conclusion. Our study showed fair to moderate agreement between authors when using the Mirels score, and moderate to substantial agreement when authors rescored radiographs. The Mirels score is subjective and lacks reproducibility in predicting the risk of pathological fracture. Cite this article: Bone Joint J 2018;100-B:1100–5


The Journal of Bone & Joint Surgery British Volume
Vol. 87-B, Issue 9 | Pages 1227 - 1232
1 Sep 2005
Brouwer RW Bierma-Zeinstra SMA van Koeveringe AJ Verhaar JAN

Our aim was to compare the degree of patellar descent and alteration in angle of the inclination of the tibial plateau in lateral closing-wedge and medial opening-wedge high tibial osteotomy (HTO) in 51 consecutive patients with osteoarthritis of the medial compartment and varus malalignment. Patellar height was measured by the Insall-Salvati (IS) and the Blackburne-Peel (BP) ratios. The tibial inclination was determined by the Moore-Harvey (MH) method. Multivariate linear regression analysis was used to determine the influence of the type of HTO (closing vs opening wedge) on the post-operative patellar height or tibial inclination. The intra- and interobserver variability of these methods was determined before operation and at follow-up at one year. After an opening-wedge HTO the patellar height was significantly more decreased (mean post-operative difference: IS = 0.15; 95% confidence interval (CI) 0.06 to 0.23; BP = 0.11; 95% CI 0.05 to 0.18) compared with a closing-wedge HTO. The angle of tibial inclination differed significantly (mean post-operative difference MH = −6.40°; 95% CI −8.74 to −4.02) between the two HTO techniques, increasing after opening-wedge HTO and decreasing after closing-wedge HTO. There was no clinically-relevant difference in the intra- and interobserver variability of measurements of patellar height either before or after HTO