Advertisement for orthosearch.org.uk
Results 1 - 50 of 194
Results per page:

Aims. Classifying trochlear dysplasia (TD) is useful to determine the treatment options for patients suffering from patellofemoral instability (PFI). There is no consensus on which classification system is more reliable and reproducible for the purpose of guiding clinicians’ management of PFI. There are also concerns about the validity of the Dejour Classification (DJC), which is the most widely used classification for TD, having only a fair reliability score. The Oswestry-Bristol Classification (OBC) is a recently proposed system of classification of TD, and the authors report a fair-to-good interobserver agreement and good-to-excellent intraobserver agreement in the assessment of TD. The aim of this study was to compare the reliability and reproducibility of these two classifications. Methods. In all, six assessors (four consultants and two registrars) independently evaluated 100 axial MRIs of the patellofemoral joint (PFJ) for TD and classified them according to OBC and DJC. These assessments were again repeated by all raters after four weeks. The inter- and intraobserver reliability scores were calculated using Cohen’s kappa and Cronbach’s α. Results. Both classifications showed good to excellent interobserver reliability with high α scores. The OBC classification showed a substantial intraobserver agreement (mean kappa 0.628; p < 0.005) whereas the DJC showed a moderate agreement (mean kappa 0.572; p < 0.005). There was no significant difference in the kappa values when comparing the assessments by consultants with those by registrars, in either classification system. Conclusion. This large study from a non-founding institute shows both classification systems to be reliable for classifying TD based on axial MRIs of the PFJ, with the simple-to-use OBC having a higher intraobserver reliability score than that of the DJC. Cite this article: Bone Jt Open 2023;4(7):532–538


The Bone & Joint Journal
Vol. 100-B, Issue 5 | Pages 596 - 602
1 May 2018
Bock P Pittermann M Chraim M Rois S

Aims. Various radiological parameters are used to evaluate a flatfoot deformity and their measurements may differ. The aims of this study were to answer the following questions: 1) Which of the 11 parameters have the best inter- and intraobserver reliability in a standardized radiological setting? 2) Are pre- and postoperative assessments equally reliable? 3) What are the identifiable sources of variation?. Patients and Methods. Measurements of the 11 parameters were recorded on anteroposterior and lateral weight-bearing radiographs of 38 feet before and after surgery for flatfoot, by three observers with different experience in foot surgery (A, ten years; B, three years; C, third-year orthopaedic resident). The inter- and intraobserver reliability was calculated. Results. Preoperative interobserver reliability was high for four, moderate for five, and low for two parameters. Postoperative interobserver reliability was high for four, moderate for five, and low for two parameters. Intraobserver reliability was excellent for all parameters preoperatively as recorded by observer A (PB) and B (MP), and for eight parameters as recorded by observer C (SR). Intraobserver reliability was excellent for ten parameters postoperatively as recorded by observer A and B, and for eight parameters as recorded by observer C. Conclusion. The following parameters can be recommended. For preoperative and postoperative evaluation of flatfoot: anteroposterior, talonavicular coverage angle; lateral, talometatarsal I angle, calcaneal pitch angle, and cuneiform-medial height (high interobserver reliability); and anteroposterior, talometatarsal II angle; lateral, talocalcaneal angle,tibiocalcaneal angle (moderate interobserver reliability). For more experienced observers, we also recommend the anteroposterior talometatarsal I angle (moderate reliability). The inter- and intraobserver reliability for most parameters were similar pre- and postoperatively. The experience of the observer and the definition and ability to measure the parameters themselves were sources of variation. Cite this article: Bone Joint J 2018;100-B:596–602


The Journal of Bone & Joint Surgery British Volume
Vol. 91-B, Issue 6 | Pages 766 - 771
1 Jun 2009
Brunner A Honigmann P Treumann T Babst R

We evaluated the impact of stereo-visualisation of three-dimensional volume-rendering CT datasets on the inter- and intraobserver reliability assessed by kappa values on the AO/OTA and Neer classifications in the assessment of proximal humeral fractures. Four independent observers classified 40 fractures according to the AO/OTA and Neer classifications using plain radiographs, two-dimensional CT scans and with stereo-visualised three-dimensional volume-rendering reconstructions. Both classification systems showed moderate interobserver reliability with plain radiographs and two-dimensional CT scans. Three-dimensional volume-rendered CT scans improved the interobserver reliability of both systems to good. Intraobserver reliability was moderate for both classifications when assessed by plain radiographs. Stereo visualisation of three-dimensional volume rendering improved intraobserver reliability to good for the AO/OTA method and to excellent for the Neer classification. These data support our opinion that stereo visualisation of three-dimensional volume-rendering datasets is of value when analysing and classifying complex fractures of the proximal humerus


The Journal of Bone & Joint Surgery British Volume
Vol. 82-B, Issue 5 | Pages 636 - 642
1 Jul 2000
Wainwright AM Williams JR Carr AJ

We assessed the inter- and intraobserver variation in classification systems for fractures of the distal humerus. Three orthopaedic trauma consultants, three trauma registrars and three consultant musculoskeletal radiologists independently classified 33 sets of radiographs of such fractures on two occasions, each using three separate systems. For interobserver variation, the Riseborough and Radin system produced ‘moderate’ agreement (kappa = 0.513), but half of the fractures were not classifiable by this system. For the complete AO system, agreement was ‘fair’ (kappa = 0.343), but if only AO type and group or AO type alone was used, agreement improved to ‘moderate’ and ‘substantial’, respectively (kappa = 0.52 and 0.66). Agreement for the system of Jupiter and Mehne was ‘fair’ (kappa = 0.295). Similar levels of intraobserver variation were found. Systems of classification are useful in decision-making and evaluation of outcome only if there is agreement and consistency among observers. Our study casts doubt on these aspects of the systems currently available for fractures of the distal humerus


The Journal of Bone & Joint Surgery British Volume
Vol. 84-B, Issue 1 | Pages 15 - 18
1 Jan 2002
Whelan DB Bhandari M McKee MD Guyatt GH Kreder HJ Stephen D Schemitsch EH

The reliability of the radiological assessment of the healing of tibial fractures remains undetermined. We examined the inter- and intraobserver agreement of the healing of such fractures among four orthopaedic trauma surgeons who, on two separate occasions eight weeks apart, independently assessed the radiographs of 30 patients with fractures of the tibial shaft which had been treated by intramedullary fixation. The radiographs were selected from a database to represent fractures at various stages of healing. For each radiograph, the surgeon scored the degree of union, quantified the number of cortices bridged by callus or with a visible fracture line, described the extent and quality of the callus, and provided an overall rating of healing. The interobserver chance-corrected agreement using a quadratically weighted kappa (κ) statistic in which values of 0.61 to 0.80 represented substantial agreement were as follows: radiological union scale (κ = 0.60); number of cortices bridged by callus (κ = 0.75); number of cortices with a visible fracture line (κ = 0.70); the extent of the callus (κ = 0.57); and general impression of fracture healing (κ = 0.67). The intraobserver agreement of the overall impression of healing (κ = 0.89) and the number of cortices bridged by callus (κ = 0.82) or with a visible fracture line (κ = 0.83) was almost perfect. There are no validated scales which allow surgeons to grade fracture healing radiologically. Among those examined, the number of cortices bridged by bone appears to be a reliable, and easily measured radiological variable to assess the healing of fractures after intramedullary fixation


Bone & Joint Research
Vol. 9, Issue 5 | Pages 242 - 249
1 May 2020
Bali K Smit K Ibrahim M Poitras S Wilkin G Galmiche R Belzile E Beaulé PE

Aims

The aim of the current study was to assess the reliability of the Ottawa classification for symptomatic acetabular dysplasia.

Methods

In all, 134 consecutive hips that underwent periacetabular osteotomy were categorized using a validated software (Hip2Norm) into four categories of normal, lateral/global, anterior, or posterior. A total of 74 cases were selected for reliability analysis, and these included 44 dysplastic and 30 normal hips. A group of six blinded fellowship-trained raters, provided with the classification system, looked at these radiographs at two separate timepoints to classify the hips using standard radiological measurements. Thereafter, a consensus meeting was held where a modified flow diagram was devised, before a third reading by four raters using a separate set of 74 radiographs took place.


Bone & Joint Research
Vol. 4, Issue 12 | Pages 190 - 194
1 Dec 2015
Kleinlugtenbelt YV Hoekstra M Ham SJ Kloen P Haverlag R Simons MP Bhandari M Goslings JC Poolman RW Scholtes VAB

Objectives

Current studies on the additional benefit of using computed tomography (CT) in order to evaluate the surgeons’ agreement on treatment plans for fracture are inconsistent. This inconsistency can be explained by a methodological phenomenon called ‘spectrum bias’, defined as the bias inherent when investigators choose a population lacking therapeutic uncertainty for evaluation. The aim of the study is to determine the influence of spectrum bias on the intra-observer agreement of treatment plans for fractures of the distal radius.

Methods

Four surgeons evaluated 51 patients with displaced fractures of the distal radius at four time points: T1 and T2: conventional radiographs; T3 and T4: radiographs and additional CT scan (radiograph and CT). Choice of treatment plan (operative or non-operative) and therapeutic certainty (five-point scale: very uncertain to very certain) were rated. To determine the influence of spectrum bias, the intra-observer agreement was analysed, using Kappa statistics, for each degree of therapeutic certainty.


Bone & Joint Research
Vol. 13, Issue 1 | Pages 19 - 27
5 Jan 2024
Baertl S Rupp M Kerschbaum M Morgenstern M Baumann F Pfeifer C Worlicek M Popp D Amanatullah DF Alt V

Aims. This study aimed to evaluate the clinical application of the PJI-TNM classification for periprosthetic joint infection (PJI) by determining intraobserver and interobserver reliability. To facilitate its use in clinical practice, an educational app was subsequently developed and evaluated. Methods. A total of ten orthopaedic surgeons classified 20 cases of PJI based on the PJI-TNM classification. Subsequently, the classification was re-evaluated using the PJI-TNM app. Classification accuracy was calculated separately for each subcategory (reinfection, tissue and implant condition, non-human cells, and morbidity of the patient). Fleiss’ kappa and Cohen’s kappa were calculated for interobserver and intraobserver reliability, respectively. Results. Overall, interobserver and intraobserver agreements were substantial across the 20 classified cases. Analyses for the variable ‘reinfection’ revealed an almost perfect interobserver and intraobserver agreement with a classification accuracy of 94.8%. The category 'tissue and implant conditions' showed moderate interobserver and substantial intraobserver reliability, while the classification accuracy was 70.8%. For 'non-human cells,' accuracy was 81.0% and interobserver agreement was moderate with an almost perfect intraobserver reliability. The classification accuracy of the variable 'morbidity of the patient' reached 73.5% with a moderate interobserver agreement, whereas the intraobserver agreement was substantial. The application of the app yielded comparable results across all subgroups. Conclusion. The PJI-TNM classification system captures the heterogeneity of PJI and can be applied with substantial inter- and intraobserver reliability. The PJI-TNM educational app aims to facilitate application in clinical practice. A major limitation was the correct assessment of the implant situation. To eliminate this, a re-evaluation according to intraoperative findings is strongly recommended. Cite this article: Bone Joint Res 2024;13(1):19–27


The Bone & Joint Journal
Vol. 105-B, Issue 10 | Pages 1123 - 1130
1 Oct 2023
Donnan M Anderson N Hoq M Donnan L

Aims. The aim of this study was to investigate the agreement in interpretation of the quality of the paediatric hip ultrasound examination, the reliability of geometric and morphological assessment, and the relationship between these measurements. Methods. Four investigators evaluated 60 hip ultrasounds and assessed their quality based the standard plane of Graf et al. They measured geometric parameters, described the morphology of the hip, and assigned the Graf grade of dysplasia. They analyzed one self-selected image and one randomly selected image from the ultrasound series, and repeated the process four weeks later. The intra- and interobserver agreement, and correlations between various parameters were analyzed. Results. In the assessment of quality, there a was moderate to substantial intraobserver agreement for each element investigated, but interobserver agreement was poor. Morphological features showed weak to moderate agreement across all parameters but improved to significant when responses were reduced. The geometric measurements showed nearly perfect agreement, and the relationship between them and the morphological features showed a dose response across all parameters with moderate to substantial correlations. There were strong correlations between geometric measurements. The Graf classification showed a fair to moderate interobserver agreement, and moderate to substantial intraobserver agreement. Conclusion. This investigation into the reliability of the interpretation of hip ultrasound scans identified the difficulties in defining what is a high-quality ultrasound. We confirmed that geometric measurements are reliably interpreted and may be useful as a further measurement of quality. Morphological features are generally poorly interpreted, but a simpler binary classification considerably improves agreement. As there is a clear dose response relationship between geometric and morphological measurements, the importance of morphology in the diagnosis of hip dysplasia should be questioned. Cite this article: Bone Joint J 2023;105-B(10):1123–1130


Bone & Joint Open
Vol. 1, Issue 7 | Pages 355 - 358
7 Jul 2020
Konrads C Gonser C Ahmad SS

Aims. The Oswestry-Bristol Classification (OBC) was recently described as an MRI-based classification tool for the femoral trochlear. The authors demonstrated better inter- and intraobserver agreement compared to the Dejour classification. As the OBC could potentially provide a very useful MRI-based grading system for trochlear dysplasia, it was the aim to determine the inter- and intraobserver reliability of the classification system from the perspective of the non-founder. Methods. Two orthopaedic surgeons independently assessed 50 MRI scans for trochlear dysplasia and classified each according to the OBC. Both observers repeated the assessments after six weeks. The inter- and intraobserver agreement was determined using Cohen’s kappa statistic and S-statistic nominal and linear weights. Results. The OBC with grading into four different trochlear forms showed excellent inter- and intraobserver agreement with a mean kappa of 0.78. Conclusion. The OBC is a simple MRI-based classification system with high inter- and intraobserver reliability. It could present a useful tool for grading the severity of trochlear dysplasia in daily practice. Cite this article: Bone Joint Open 2020;1-7:355–358


The Bone & Joint Journal
Vol. 106-B, Issue 9 | Pages 964 - 969
1 Sep 2024
Wang YC Song JJ Li TT Yang D Lv ZB Wang ZY Zhang ZM Luo Y

Aims. To propose a new method for evaluating paediatric radial neck fractures and improve the accuracy of fracture angulation measurement, particularly in younger children, and thereby facilitate planning treatment in this population. Methods. Clinical data of 117 children with radial neck fractures in our hospital from August 2014 to March 2023 were collected. A total of 50 children (26 males, 24 females, mean age 7.6 years (2 to 13)) met the inclusion criteria and were analyzed. Cases were excluded for the following reasons: Judet grade I and Judet grade IVb (> 85° angulation) classification; poor radiograph image quality; incomplete clinical information; sagittal plane angulation; severe displacement of the ulna fracture; and Monteggia fractures. For each patient, standard elbow anteroposterior (AP) view radiographs and corresponding CT images were acquired. On radiographs, Angle P (complementary to the angle between the long axis of the radial head and the line perpendicular to the physis), Angle S (complementary to the angle between the long axis of the radial head and the midline through the proximal radial shaft), and Angle U (between the long axis of the radial head and the straight line from the distal tip of the capitellum to the coronoid process) were identified as candidates approximating the true coronal plane angulation of radial neck fractures. On the coronal plane of the CT scan, the angulation of radial neck fractures (CTa) was measured and served as the reference standard for measurement. Inter- and intraobserver reliabilities were assessed by Kappa statistics and intraclass correlation coefficient (ICC). Results. Angle U showed the strongest correlation with CTa (p < 0.001). In the analysis of inter- and intraobserver reliability, Kappa values were significantly higher for Angles S and U compared with Angle P. ICC values were excellent among the three groups. Conclusion. Angle U on AP view was the best substitute for CTa when evaluating radial neck fractures in children. Further studies are required to validate this method. Cite this article: Bone Joint J 2024;106-B(9):964–969


The Bone & Joint Journal
Vol. 106-B, Issue 9 | Pages 898 - 906
1 Sep 2024
Kayani B Wazir MUK Mancino F Plastow R Haddad FS

Aims. The primary objective of this study was to develop a validated classification system for assessing iatrogenic bone trauma and soft-tissue injury during total hip arthroplasty (THA). The secondary objective was to compare macroscopic bone trauma and soft-tissues injury in conventional THA (CO THA) versus robotic arm-assisted THA (RO THA) using this classification system. Methods. This study included 30 CO THAs versus 30 RO THAs performed by a single surgeon. Intraoperative photographs of the osseous acetabulum and periacetabular soft-tissues were obtained prior to implantation of the acetabular component, which were used to develop the proposed classification system. Interobserver and intraobserver variabilities of the proposed classification system were assessed. Results. The BOne trauma and Soft-Tissue Injury classification system in total Hip arthroplasty (BOSTI Hip) grades osseous acetabular trauma and periarticular muscle damage during THA. The classification system has an interclass correlation coefficient of 0.90 (95% CI 0.86 to 0.93) for interobserver agreement and 0.89 (95% CI 0.84 to 0.93) for intraobserver agreement. RO THA was associated with improved BOSTI Hip scores (p = 0.002) and more pristine osseous surfaces in the anterior superior (p = 0.001) and posterior superior (p < 0.001) acetabular quadrants compared with CO THA. There were no differences between the groups in relation to injury to the gluteus medius (p = 0.084), obturator internus (p = 0.241), piriformis (p = 0.081), superior gamellus (p = 0.116), inferior gamellus (p = 0.132), quadratus femoris (p = 0.208), and vastus lateralis (p = 0.135), but overall combined muscle injury was reduced in RO THA compared with CO THA (p = 0.023). Discussion. The proposed BOSTI Hip classification provides a reproducible grading system for stratifying iatrogenic bone trauma and soft-tissue injury during THA. RO THA was associated with improved BOSTI Hip scores, more pristine osseous acetabular surfaces, and reduced combined periarticular muscle injury compared with CO THA. Further research is required to understand if these intraoperative findings translate to differences in clinical outcomes between the treatment groups. Cite this article: Bone Joint J 2024;106-B(9):898–906


The Bone & Joint Journal
Vol. 104-B, Issue 6 | Pages 715 - 720
1 Jun 2022
Dunsmuir RA Nisar S Cruickshank JA Loughenbury PR

Aims. The aim of the study was to determine if there was a direct correlation between the pain and disability experienced by patients and size of their disc prolapse, measured by the disc’s cross-sectional area on T2 axial MRI scans. Methods. Patients were asked to prospectively complete visual analogue scale (VAS) and Oswestry Disability Index (ODI) scores on the day of their MRI scan. All patients with primary disc herniation were included. Exclusion criteria included recurrent disc herniation, cauda equina syndrome, or any other associated spinal pathology. T2 weighted MRI scans were reviewed on picture archiving and communications software. The T2 axial image showing the disc protrusion with the largest cross sectional area was used for measurements. The area of the disc and canal were measured at this level. The size of the disc was measured as a percentage of the cross-sectional area of the spinal canal on the chosen image. The VAS leg pain and ODI scores were each correlated with the size of the disc using the Pearson correlation coefficient (PCC). Intraobserver reliability for MRI measurement was assessed using the interclass correlation coefficient (ICC). We assessed if the position of the disc prolapse (central, lateral recess, or foraminal) altered the symptoms described by the patient. The VAS and ODI scores from central and lateral recess disc prolapses were compared. Results. A total of 56 patients (mean age 41.1 years (22.8 to 70.3)) were included. A high degree of intraobserver reliability was observed for MRI measurement: single measure ICC was 0.99 (95% confidence interval (CI) from 0.97 to 0.99 (p < 0.001)). The PCC comparing VAS leg scores with canal occupancy for herniated disc was 0.056. The PCC comparing ODI for herniated disc was 0.070. We found 13 disc prolapses centrally and 43 lateral recess prolapses. There were no foraminal prolapses in this group. The position of the prolapse was not found to be related to the mean VAS score or ODI experienced by the patients (VAS, p = 0.251; ODI, p = 0.093). Conclusion. The results of the statistical analysis show that there is no direct correlation between the size or position of the disc prolapse and a patient’s symptoms. The symptoms experienced by patients should be the primary concern in deciding to perform discectomy. Cite this article: Bone Joint J 2022;104-B(6):715–720


The Bone & Joint Journal
Vol. 102-B, Issue 1 | Pages 102 - 107
1 Jan 2020
Sharma N Brown A Bouras T Kuiper JH Eldridge J Barnett A

Aims. Trochlear dysplasia is a significant risk factor for patellofemoral instability. The Dejour classification is currently considered the standard for classifying trochlear dysplasia, but numerous studies have reported poor reliability on both plain radiography and MRI. The severity of trochlear dysplasia is important to establish in order to guide surgical management. We have developed an MRI-specific classification system to assess the severity of trochlear dysplasia, the Oswestry-Bristol Classification (OBC). This is a four-part classification system comprising normal, mild, moderate, and severe to represent a normal, shallow, flat, and convex trochlear, respectively. The purpose of this study was to assess the inter- and intraobserver reliability of the OBC and compare it with that of the Dejour classification. Methods. Four observers (two senior and two junior orthopaedic surgeons) independently assessed 32 CT and axial MRI scans for trochlear dysplasia and classified each according to the OBC and the Dejour classification systems. Assessments were repeated following a four-week interval. The inter- and intraobserver agreement was determined by using Fleiss’ generalization of Cohen’s kappa statistic and S-statistic nominal and linear weights. Results. The OBC showed fair-to-good interobserver agreement and good-to-excellent intraobserver agreement (mean kappa 0.68). The Dejour classification showed poor interobserver agreement and fair-to-good intraobserver agreement (mean kappa 0.52). Conclusion. The OBC can be used to assess the severity of trochlear dysplasia. It can be applied in clinical practice to simplify and standardize surgical decision-making in patients with recurrent patella instability. Cite this article: Bone Joint J 2020;102-B(1):102–107


Bone & Joint Research
Vol. 8, Issue 8 | Pages 357 - 366
1 Aug 2019
Zhang B Sun H Zhan Y He Q Zhu Y Wang Y Luo C

Objectives. CT-based three-column classification (TCC) has been widely used in the treatment of tibial plateau fractures (TPFs). In its updated version (updated three-column concept, uTCC), a fracture morphology-based injury mechanism was proposed for effective treatment guidance. In this study, the injury mechanism of TPFs is further explained, and its inter- and intraobserver reliability is evaluated to perfect the uTCC. Methods. The radiological images of 90 consecutive TPF patients were collected. A total of 47 men (52.2%) and 43 women (47.8%) with a mean age of 49.8 years (. sd. 12.4; 17 to 77) were enrolled in our study. Among them, 57 fractures were on the left side (63.3%) and 33 were on the right side (36.7%); no bilateral fracture existed. Four observers were chosen to classify or estimate independently these randomized cases according to the Schatzker classification, TCC, and injury mechanism. With two rounds of evaluation, the kappa values were calculated to estimate the inter- and intrareliability. Results. The overall inter- and intraobserver agreements of the injury mechanism were substantial (κ. inter. = 0.699, κ. intra. = 0.749, respectively). The initial position and the force direction, which are two components of the injury mechanism, had substantial agreement for both inter-reliability or intrareliability. The inter- and intraobserver agreements were lower in high-energy fractures (Schatzker types IV to VI; κ. inter. = 0.605, κ. intra. = 0.721) compared with low-energy fractures (Schatzker types I to III; κ. inter. = 0.81, κ. intra. = 0.832). The inter- and intraobserver agreements were relatively higher in one-column fractures (κ. inter. = 0.759, κ. intra. = 0.801) compared with two-column and three-column fractures. Conclusion. The complete theory of injury mechanism of TPFs was first put forward to make the TCC consummate. It demonstrates substantial inter- and intraobserver agreement generally. Furthermore, the injury mechanism can be promoted clinically. Cite this article: B-B. Zhang, H. Sun, Y. Zhan, Q-F. He, Y. Zhu, Y-K. Wang, C-F. Luo. Reliability and repeatability of tibial plateau fracture assessment with an injury mechanism-based concept. Bone Joint Res 2019;8:357–366. DOI: 10.1302/2046-3758.88.BJR-2018-0331.R1


The Bone & Joint Journal
Vol. 103-B, Issue 8 | Pages 1345 - 1350
1 Aug 2021
Czubak-Wrzosek M Nitek Z Sztwiertnia P Czubak J Grzelecki D Kowalczewski J Tyrakowski M

Aims. The aim of the study was to compare two methods of calculating pelvic incidence (PI) and pelvic tilt (PT), either by using the femoral heads or acetabular domes to determine the bicoxofemoral axis, in patients with unilateral or bilateral primary hip osteoarthritis (OA). Methods. PI and PT were measured on standing lateral radiographs of the spine in two groups: 50 patients with unilateral (Group I) and 50 patients with bilateral hip OA (Group II), using the femoral heads or acetabular domes to define the bicoxofemoral axis. Agreement between the methods was determined by intraclass correlation coefficient (ICC) and the standard error of measurement (SEm). The intraobserver reproducibility and interobserver reliability of the two methods were analyzed on 31 radiographs in both groups to calculate ICC and SEm. Results. In both groups, excellent agreement between the two methods was obtained, with ICC of 0.99 and SEm 0.3° for Group I, and ICC 0.99 and SEm 0.4° for Group II. The intraobserver reproducibility was excellent for both methods in both groups, with an ICC of at least 0.97 and SEm not exceeding 0.8°. The study also revealed excellent interobserver reliability for both methods in both groups, with ICC 0.99 and SEm 0.5° or less. Conclusion. Either the femoral heads or acetabular domes can be used to define the bicoxofemoral axis on the lateral standing radiographs of the spine for measuring PI and PT in patients with idiopathic unilateral or bilateral hip OA. Cite this article: Bone Joint J 2021;103-B(8):1345–1350


The Bone & Joint Journal
Vol. 103-B, Issue 8 | Pages 1339 - 1344
1 Aug 2021
Jain S Mohrir G Townsend O Lamb JN Palan J Aderinto J Pandit H

Aims. This aim of this study was to assess the reliability and validity of the Unified Classification System (UCS) for postoperative periprosthetic femoral fractures (PFFs) around cemented polished taper-slip (PTS) stems. Methods. Radiographs of 71 patients with a PFF admitted consecutively at two centres between 25 February 2012 and 19 May 2020 were collated by an independent investigator. Six observers (three hip consultants and three trainees) were familiarized with the UCS. Each PFF was classified on two separate occasions, with a mean time between assessments of 22.7 days (16 to 29). Interobserver reliability for more than two observers was assessed using percentage agreement and Fleiss’ kappa statistic. Intraobserver reliability between two observers was calculated with Cohen kappa statistic. Validity was tested on surgically managed UCS type B PFFs where stem stability was documented in operation notes (n = 50). Validity was assessed using percentage agreement and Cohen kappa statistic between radiological assessment and intraoperative findings. Kappa statistics were interpreted using Landis and Koch criteria. All six observers were blinded to operation notes and postoperative radiographs. Results. Interobserver reliability percentage agreement was 58.5% and the overall kappa value was 0.442 (moderate agreement). Lowest kappa values were seen for type B fractures (0.095 to 0.360). The mean intraobserver reliability kappa value was 0.672 (0.447 to 0.867), indicating substantial agreement. Validity percentage agreement was 65.7% and the mean kappa value was 0.300 (0.160 to 0.4400) indicating only fair agreement. Conclusion. This study demonstrates that the UCS is unsatisfactory for the classification of PFFs around PTS stems, and that it has considerably lower reliability and validity than previously described for other stem types. Radiological PTS stem loosening in the presence of PFF is poorly defined and formal intraoperative testing of stem stability is recommended. Cite this article: Bone Joint J 2021;103-B(8):1339–1344


The Bone & Joint Journal
Vol. 106-B, Issue 1 | Pages 19 - 27
1 Jan 2024
Tang H Guo S Ma Z Wang S Zhou Y

Aims. The aim of this study was to evaluate the reliability and validity of a patient-specific algorithm which we developed for predicting changes in sagittal pelvic tilt after total hip arthroplasty (THA). Methods. This retrospective study included 143 patients who underwent 171 THAs between April 2019 and October 2020 and had full-body lateral radiographs preoperatively and at one year postoperatively. We measured the pelvic incidence (PI), the sagittal vertical axis (SVA), pelvic tilt, sacral slope (SS), lumbar lordosis (LL), and thoracic kyphosis to classify patients into types A, B1, B2, B3, and C. The change of pelvic tilt was predicted according to the normal range of SVA (0 mm to 50 mm) for types A, B1, B2, and B3, and based on the absolute value of one-third of the PI-LL mismatch for type C patients. The reliability of the classification of the patients and the prediction of the change of pelvic tilt were assessed using kappa values and intraclass correlation coefficients (ICCs), respectively. Validity was assessed using the overall mean error and mean absolute error (MAE) for the prediction of the change of pelvic tilt. Results. The kappa values were 0.927 (95% confidence interval (CI) 0.861 to 0.992) and 0.945 (95% CI 0.903 to 0.988) for the inter- and intraobserver reliabilities, respectively, and the ICCs ranged from 0.919 to 0.997. The overall mean error and MAE for the prediction of the change of pelvic tilt were -0.3° (SD 3.6°) and 2.8° (SD 2.4°), respectively. The overall absolute change of pelvic tilt was 5.0° (SD 4.1°). Pre- and postoperative values and changes in pelvic tilt, SVA, SS, and LL varied significantly among the five types of patient. Conclusion. We found that the proposed algorithm was reliable and valid for predicting the standing pelvic tilt after THA. Cite this article: Bone Joint J 2024;106-B(1):19–27


The Bone & Joint Journal
Vol. 105-B, Issue 6 | Pages 696 - 701
1 Jun 2023
Kurisunkal V Morris G Kaneuchi Y Bleibleh S James S Botchu R Jeys L Parry MC

Aims. Intra-articular (IA) tumours around the knee are treated with extra-articular (EA) resection, which is associated with poor functional outcomes. We aim to evaluate the accuracy of MRI in predicting IA involvement around the knee. Methods. We identified 63 cases of high-grade sarcomas in or around the distal femur that underwent an EA resection from a prospectively maintained database (January 1996 to April 2020). Suspicion of IA disease was noted in 52 cases, six had IA pathological fracture, two had an effusion, two had prior surgical intervention (curettage/IA intervention), and one had an osseous metastasis in the proximal tibia. To ascertain validity, two musculoskeletal radiologists (R1, R2) reviewed the preoperative imaging (MRI) of 63 consecutive cases on two occasions six weeks apart. The radiological criteria for IA disease comprised evidence of tumour extension within the suprapatellar pouch, intercondylar notch, extension along medial/lateral retinaculum, and presence of IA fracture. The radiological predictions were then confirmed with the final histopathology of the resected specimens. Results. The resection histology revealed 23 cases (36.5%) showing IA disease involvement compared with 40 cases without (62%). The intraobserver variability of R1 was 0.85 (p < 0.001) compared to R2 with κ = 0.21 (p = 0.007). The interobserver variability was κ = 0.264 (p = 0.003). Knee effusion was found to be the most sensitive indicator of IA involvement, with a sensitivity of 91.3% but specificity of only 35%. However, when combined with a pathological fracture, this rose to 97.5% and 100% when disease was visible in Hoffa’s fat pad. Conclusion. MRI imaging can sometimes overestimate IA joint involvement and needs to be correlated with clinical signs. In the light of our findings, we would recommend EA resections when imaging shows effusion combined with either disease in Hoffa’s fat pad or retinaculum, or pathological fractures. Cite this article: Bone Joint J 2023;105-B(6):696–701


Bone & Joint Open
Vol. 4, Issue 4 | Pages 262 - 272
11 Apr 2023
Batailler C Naaim A Daxhelet J Lustig S Ollivier M Parratte S

Aims. The impact of a diaphyseal femoral deformity on knee alignment varies according to its severity and localization. The aims of this study were to determine a method of assessing the impact of diaphyseal femoral deformities on knee alignment for the varus knee, and to evaluate the reliability and the reproducibility of this method in a large cohort of osteoarthritic patients. Methods. All patients who underwent a knee arthroplasty from 2019 to 2021 were included. Exclusion criteria were genu valgus, flexion contracture (> 5°), previous femoral osteotomy or fracture, total hip arthroplasty, and femoral rotational disorder. A total of 205 patients met the inclusion criteria. The mean age was 62.2 years (SD 8.4). The mean BMI was 33.1 kg/m. 2. (SD 5.5). The radiological measurements were performed twice by two independent reviewers, and included hip knee ankle (HKA) angle, mechanical medial distal femoral angle (mMDFA), anatomical medial distal femoral angle (aMDFA), femoral neck shaft angle (NSA), femoral bowing angle (FBow), the distance between the knee centre and the top of the FBow (DK), and the angle representing the FBow impact on the knee (C’KS angle). Results. The FBow impact on the mMDFA can be measured by the C’KS angle. The C’KS angle took the localization (length DK) and the importance (FBow angle) of the FBow into consideration. The mean FBow angle was 4.4° (SD 2.4; 0 to 12.5). The mean C’KS angle was 1.8° (SD 1.1; 0 to 5.8). Overall, 84 knees (41%) had a severe FBow (> 5°). The radiological measurements showed very good to excellent intraobserver and interobserver agreements. The C’KS increased significantly when the length DK decreased and the FBow angle increased (p < 0.001). Conclusion. The impact of the diaphyseal femoral deformity on the mechanical femoral axis is measured by the C’KS angle, a reliable and reproducible measurement. Cite this article: Bone Jt Open 2023;4(4):262–272


The Bone & Joint Journal
Vol. 102-B, Issue 8 | Pages 1041 - 1047
1 Aug 2020
Hamoodi Z Singh J Elvey MH Watts AC

Aims. The Wrightington classification system of fracture-dislocations of the elbow divides these injuries into six subtypes depending on the involvement of the coronoid and the radial head. The aim of this study was to assess the reliability and reproducibility of this classification system. Methods. This was a blinded study using radiographs and CT scans of 48 consecutive patients managed according to the Wrightington classification system between 2010 and 2018. Four trauma and orthopaedic consultants, two post CCT fellows, and one speciality registrar based in the UK classified the injuries. The seven observers reviewed preoperative radiographs and CT scans twice, with a minimum four-week interval. Radiographs and CT scans were reviewed separately. Inter- and intraobserver reliability were calculated using Fleiss and Cohen kappa coefficients. The Landis and Koch criteria were used to interpret the strength of the kappa values. Validity was assessed by calculating the percentage agreement against intraoperative findings. Results. Of the 48 patients, three (6%) had type A injury, 11 (23%) type B, 16 (33%) type B+, 16 (33%) Type C, two (4%) type D+, and none had a type D injury. All 48 patients had anteroposterior (AP) and lateral radiographs, 44 had 2D CT scans, and 39 had 3D reconstructions. The interobserver reliability kappa value was 0.52 for radiographs, 0.71 for 2D CT scans, and 0.73 for a combination of 2D and 3D reconstruction CT scans. The median intraobserver reliability was 0.75 (interquartile range (IQR) 0.62 to 0.79) for radiographs, 0.77 (IQR 0.73 to 0.94) for 2D CT scans, and 0.89 (IQR 0.77 to 0.93) for the combination of 2D and 3D reconstruction. Validity analysis showed that accuracy significantly improved when using CT scans (p = 0.018 and p = 0.028 respectively). Conclusion. The Wrightington classification system is a reliable and valid method of classifying fracture-dislocations of the elbow. CT scans are significantly more accurate than radiographs when identifying the pattern of injury, with good intra- and interobserver reproducibility. Cite this article: Bone Joint J 2020;102-B(8):1041–1047


Bone & Joint Open
Vol. 3, Issue 5 | Pages 423 - 431
1 May 2022
Leong JWY Singhal R Whitehouse MR Howell JR Hamer A Khanduja V Board TN

Aims. The aim of this modified Delphi process was to create a structured Revision Hip Complexity Classification (RHCC) which can be used as a tool to help direct multidisciplinary team (MDT) discussions of complex cases in local or regional revision networks. Methods. The RHCC was developed with the help of a steering group and an invitation through the British Hip Society (BHS) to members to apply, forming an expert panel of 35. We ran a mixed-method modified Delphi process (three rounds of questionnaires and one virtual meeting). Round 1 consisted of identifying the factors that govern the decision-making and complexities, with weighting given to factors considered most important by experts. Participants were asked to identify classification systems where relevant. Rounds 2 and 3 focused on grouping each factor into H1, H2, or H3, creating a hierarchy of complexity. This was followed by a virtual meeting in an attempt to achieve consensus on the factors which had not achieved consensus in preceding rounds. Results. The expert group achieved strong consensus in 32 out of 36 factors following the Delphi process. The RHCC used the existing Paprosky (acetabulum and femur), Unified Classification System, and American Society of Anesthesiologists (ASA) classification systems. Patients with ASA grade III/IV are recognized with a qualifier of an asterisk added to the final classification. The classification has good intraobserver and interobserver reliability with Kappa values of 0.88 to 0.92 and 0.77 to 0.85, respectively. Conclusion. The RHCC has been developed through a modified Delphi technique. RHCC will provide a framework to allow discussion of complex cases as part of a local or regional hip revision MDT. We believe that adoption of the RHCC will provide a comprehensive and reproducible method to describe each patient’s case with regard to surgical complexity, in addition to medical comorbidities that may influence their management. Cite this article: Bone Jt Open 2022;3(5):423–431


Bone & Joint Open
Vol. 2, Issue 10 | Pages 858 - 864
18 Oct 2021
Guntin J Plummer D Della Valle C DeBenedetti A Nam D

Aims. Prior studies have identified that malseating of a modular dual mobility liner can occur, with previous reported incidences between 5.8% and 16.4%. The aim of this study was to determine the incidence of malseating in dual mobility implants at our institution, assess for risk factors for liner malseating, and investigate whether liner malseating has any impact on clinical outcomes after surgery. Methods. We retrospectively reviewed the radiographs of 239 primary and revision total hip arthroplasties with a modular dual mobility liner. Two independent reviewers assessed radiographs for each patient twice for evidence of malseating, with a third observer acting as a tiebreaker. Univariate analysis was conducted to determine risk factors for malseating with Youden’s index used to identify cut-off points. Cohen’s kappa test was used to measure interobserver and intraobserver reliability. Results. In all, 12 liners (5.0%), including eight Stryker (6.8%) and four Zimmer Biomet (3.3%), had radiological evidence of malseating. Interobserver reliability was found to be 0.453 (95% confidence interval (CI) 0.26 to 0.64), suggesting weak inter-rater agreement, with strong agreement being greater than 0.8. We found component size of 50 mm or less to be associated with liner malseating on univariate analysis (p = 0.031). Patients with malseated liners appeared to have no associated clinical consequences, and none required revision surgery at a mean of 14 months (1.4 to 99.2) postoperatively. Conclusion. The incidence of liner malseating was 5.0%, which is similar to other reports. Component size of 50 mm or smaller was identified as a risk factor for malseating. Surgeons should be aware that malseating can occur and implant design changes or changes in instrumentation should be considered to lower the risk of malseating. Although further follow-up is needed, it remains to be seen if malseating is associated with any clinical consequences. Cite this article: Bone Jt Open 2021;2(10):858–864


Bone & Joint Research
Vol. 6, Issue 9 | Pages 530 - 534
1 Sep 2017
Krakow L Klockow A Roehner E Brodt S Eijer H Bossert J Matziolis G

Objectives. The determination of the volumetric polyethylene wear on explanted material requires complicated equipment, which is not available in many research institutions. Our aim in this study was to present and validate a method that only requires a set of polyetheretherketone balls and a laboratory balance to determine wear. Methods. The insert to be measured was placed on a balance, and a ball of the appropriate diameter was inserted. The cavity remaining between the ball and insert caused by wear was filled with contrast medium and the weight of the contrast medium was recorded. The volume was calculated from the known density of the liquid. The precision, inter- and intraobserver reliability, were determined by four investigators on four days using nine inserts with specified wear (0.094 ml to 1.626 ml), and the intra-class correlation coefficient was calculated. The feasibility of using this method in routine clinical practice and the time required for measurement were tested on 84 explanted inserts by one investigator. Results. In order to get the mean for all investigators and determinations, the deviation between the measured and specified wear was -0.08 ml . (sd. 0.12; -0.21 to 0.11). The interobserver reliability was 0.989 ml (95% confidence interval (CI) 0.964 to 0.997) and the intraobserver reliability was 0.941 for observer 1 (95% CI 0.846 to 0.985), 0.983 for observer 2 (95% CI 0.956 to 0.995), 0.939 for observer 3 (95% CI 0.855 to 0.984), and 0.934 for observer 4 (95% CI 0.790 to 0.984). The mean time required to examine the samples was two minutes . (sd. 2; 1 to 5). Conclusion. The method presented here was shown to be sufficiently precise for many settings and is a cost-effective and quick method of determining the volumetric wear of explanted acetabular components. However, the measurement of wear for scientific purposes will probably continue to involve more accurate and dedicated laboratory equipment. Cite this article: Bone Joint Res 2017;6:530–534


The Bone & Joint Journal
Vol. 101-B, Issue 9 | Pages 1042 - 1049
1 Sep 2019
Murphy MP Killen CJ Ralles SJ Brown NM Hopkinson WJ Wu K

Aims. Several radiological methods of measuring anteversion of the acetabular component after total hip arthroplasty (THA) have been described. These are limited by low reproducibility, are less accurate than CT 3D reconstruction, and are cumbersome to use. These methods also partly rely on the identification of obscured radiological borders of the component. We propose two novel methods, the Area and Orthogonal methods, which have been designed to maximize use of readily identifiable points while maintaining the same trigonometric principles. Patients and Methods. A retrospective study of plain radiographs was conducted on 160 hips of 141 patients who had undergone primary THA. We compared the reliability and accuracy of the Area and Orthogonal methods with two of the current leading methods: those of Widmer and Lewinnek, respectively. Results. The 160 anteroposterior pelvis films revealed that the proposed Area method was statistically different from those described by Widmer and Lewinnek (p < 0.001 and p = 0.004, respectively). They gave the highest inter- and intraobserver reliability (0.992 and 0.998, respectively), and took less time (27.50 seconds (. sd. 3.19); p < 0.001) to complete. In addition, 21 available CT 3D reconstructions revealed the Area method achieved the highest Pearson’s correlation coefficient (r = 0.956; p < 0.001) and least statistical difference (p = 0.704) from CT with a mean within 1° of CT-3D reconstruction between ranges of 1° to 30° of measured radiological anteversion. Conclusion. Our results support the proposed Area method to be the most reliable, accurate, and speedy. They did not support any statistical superiority of the proposed Orthogonal method to that of the Widmer or Lewinnek method. Cite this article: Bone Joint J 2019;101-B:1042–1049


The Bone & Joint Journal
Vol. 101-B, Issue 12 | Pages 1578 - 1584
1 Dec 2019
Batailler C Weidner J Wyatt M Pfluger D Beck M

Aims. A borderline dysplastic hip can behave as either stable or unstable and this makes surgical decision making challenging. While an unstable hip may be best treated by acetabular reorientation, stable hips can be treated arthroscopically. Several imaging parameters can help to identify the appropriate treatment, including the Femoro-Epiphyseal Acetabular Roof (FEAR) index, measured on plain radiographs. The aim of this study was to assess the reliability and the sensitivity of FEAR index on MRI compared with its radiological measurement. Patients and Methods. The technique of measuring the FEAR index on MRI was defined and its reliability validated. A retrospective study assessed three groups of 20 patients: an unstable group of ‘borderline dysplastic hips’ with lateral centre edge angle (LCEA) less than 25° treated successfully by periacetabular osteotomy; a stable group of ‘borderline dysplastic hips’ with LCEA less than 25° treated successfully by impingement surgery; and an asymptomatic control group with LCEA between 25° and 35°. The following measurements were performed on both standardized radiographs and on MRI: LCEA, acetabular index, femoral anteversion, and FEAR index. Results. The FEAR index showed excellent intraobserver and interobserver reliability on both MRI and radiographs. The FEAR index was more reliable on radiographs than on MRI. The FEAR index on MRI was lower in the stable borderline group (mean -4.2° (. sd. 9.1°)) compared with the unstable borderline group (mean 7.9° (. sd. 6.8°)). With a FEAR index cut-off value of 2°, 90% of patients were correctly identified as stable or unstable using the radiological FEAR index, compared with 82.5% using the FEAR index on MRI. The FEAR index was a better predictor of instability on plain radiographs than on MRI. Conclusion. The FEAR index measured on MRI is less reliable and less sensitive than the FEAR index measured on radiographs. The cut-off value of 2° for radiological FEAR index predicted hip stability with 90% probability. Cite this article: Bone Joint J 2019;101-B:1578–1584


The Bone & Joint Journal
Vol. 102-B, Issue 5 | Pages 593 - 599
1 May 2020
Amanatullah DF Cheng RZ Huddleston III JI Maloney WJ Finlay AK Kappagoda S Suh GA Goodman SB

Aims. To establish the utility of adding the laboratory-based synovial alpha-defensin immunoassay to the traditional diagnostic work-up of a prosthetic joint infection (PJI). Methods. A group of four physicians evaluated 158 consecutive patients who were worked up for PJI, of which 94 underwent revision arthroplasty. Each physician reviewed the diagnostic data and decided on the presence of PJI according to the 2014 Musculoskeletal Infection Society (MSIS) criteria (yes, no, or undetermined). Their initial randomized review of the available data before or after surgery was blinded to each alpha-defensin result and a subsequent randomized review was conducted with each result. Multilevel logistic regression analysis assessed the effect of having the alpha-defensin result on the ability to diagnose PJI. Alpha-defensin was correlated to the number of synovial white blood cells (WBCs) and percentage of polymorphonuclear cells (%PMN). Results. Intraobserver reliability and interobserver agreement did not change when the alpha-defensin result was available. Positive alpha-defensin results had greater synovial WBCs (mean 31,854 cells/μL, SD 32,594) and %PMN (mean 93.0%, SD 5.5%) than negative alpha-defensin results (mean 974 cells/μL, SD 3,988; p < 0.001 and mean 39.4% SD 28.6%; p < 0.001). Adding the alpha-defensin result did not alter the diagnosis of a PJI using preoperative (odds ratio (OR) 0.52, 95% confidence interval (CI) 0.14 to 1.88; p = 0.315) or operative (OR 0.52, CI 0.18 to 1.55; p = 0.242) data when clinicians already decided that PJI was present or absent with traditionally available testing. However, when undetermined with traditional preoperative testing, alpha-defensin helped diagnose (OR 0.44, CI 0.30 to 0.64; p < 0.001) or rule out (OR 0.41, CI 0.17 to 0.98; p = 0.044) PJI. Of the 27 undecided cases with traditional testing, 24 (89%) benefited from the addition of alpha-defensin testing. Conclusion. The laboratory-based synovial alpha-defensin immunoassay did not help diagnose or rule out a PJI when added to routine serologies and synovial fluid analyses except in cases where the diagnosis of PJI was unclear. We recommend against the routine use of alpha-defensin and suggest using it only when traditional testing is indeterminate. Cite this article: Bone Joint J 2020;102-B(5):593–599


The Journal of Bone & Joint Surgery British Volume
Vol. 80-B, Issue 2 | Pages 321 - 324
1 Mar 1998
Bar-On E Meyer S Harati G Porat S

Ultrasonography of the hip was performed sequentially by two different examiners in 75 infants. The ultrasound strips were reviewed twice by three paediatric orthopaedic surgeons and classified by the Graf method. The intraobserver and interobserver agreement between the interpretations was analysed using simple and weighted kappa coefficients calculated for agreement on the Graf classification and for grouping as normal (types 1A to 2A), and abnormal requiring treatment (types 2B to 4). When examining the same ultrasound strip, intraobserver agreement for the Graf classification was substantial (mean kappa 0.61), but interobserver agreement was only moderate (kappa 0.50). For the grouping into normal and abnormal, the mean kappa value for intraobserver agreement was 0.67 and for interobserver agreement 0.57. Because of the significant differences in agreement between normal and abnormal hips, we analysed a subgroup of those with at least one abnormal interpretation. Intraobserver agreement within this subgroup showed moderate reliability (kappa 0.41), but interobserver agreement was only fair (kappa 0.28). Interpretations of two different strips performed sequentially showed significantly lower agreement with an intraobserver kappa value of 0.29 and an interobserver value of 0.28. In the subgroup with at least one abnormal reading, the intraobserver kappa was 0.09 and the interobserver 0.1. Our findings suggest that both the technique of performing ultrasonography and the interpretation of the image may influence the result


The Bone & Joint Journal
Vol. 100-B, Issue 8 | Pages 1100 - 1105
1 Aug 2018
Howard EL Shepherd KL Cribb G Cool P

Aims. The aim of this study was to validate the Mirels score in predicting pathological fractures in metastatic disease of the lower limb. Patients and Methods. A total of 62 patients with confirmed metastatic disease met the inclusion criteria. Of the 62 patients, 32 were female and 30 were male. The mean age of patients was 65 years (35 to 89). The primary malignancy originated from the breast in 27 (44%) patients, prostate in 15 (24%) patients, kidney in seven (11%), and lung in four (6%) of patients. One patient (2%) had metastatic carcinoma from the lacrimal gland, two patients (3%) had multiple myeloma, one patient (2%) had lymphoma of bone, and five patients (8%) had metastatic carcinoma of unknown primary. Plain radiographs at the time of initial presentation were scored using Mirels system by the four authors. The radiographic components of the score (anatomical site, size, and radiographic appearance) were scored two weeks apart. Inter- and intraobserver reliability were calculated with Fleiss’ kappa test. Bland-Altman plots were created to compare the variances of the individual components of the score and the total Mirels score. Results. Kappa values for the interobserver variability of the components of the Mirels score were k = 0.554 (95% CI 0.483 to 0.626) for site, k = 0.342 (95% CI 0.285 to 0.400) for size, k = 0.443 (95% CI 0.387 to 0.499) for radiographic appearance, and k = 0.294 (95% CI 0.258 to 0.331)for the total score. Kappa values for the intra-observer reliability were k = 0.608 (95% CI 0.506 to 0.710) for site, k = 0.579 (95% CI 0.487 to 0.670) for size, k = 0.614 (95% CI 0.522 to 0.703) for radiographic appearance, and k = 0.323 (95% CI 0.266 to 0.379) for total score. Conclusion. Our study showed fair to moderate agreement between authors when using the Mirels score, and moderate to substantial agreement when authors rescored radiographs. The Mirels score is subjective and lacks reproducibility in predicting the risk of pathological fracture. Cite this article: Bone Joint J 2018;100-B:1100–5


The Bone & Joint Journal
Vol. 101-B, Issue 1_Supple_A | Pages 11 - 18
1 Jan 2019
Kayani B Konan S Thakrar RR Huq SS Haddad FS

Objectives. The primary objective of this study was to compare accuracy in restoring the native centre of hip rotation in patients undergoing conventional manual total hip arthroplasty (THA) versus robotic-arm assisted THA. Secondary objectives were to determine differences between these treatment techniques for THA in achieving the planned combined offset, component inclination, component version, and leg-length correction. Materials and Methods. This prospective cohort study included 50 patients undergoing conventional manual THA and 25 patients receiving robotic-arm assisted THA. Patients undergoing conventional manual THA and robotic-arm assisted THA were well matched for age (mean age, 69.4 years (. sd. 5.2) vs 67.5 years (. sd. 5.8) (p = 0.25); body mass index (27.4 kg/m. 2. (. sd. 2.1) vs 26.9 kg/m. 2. (. sd. 2.2); p = 0.39); and laterality of surgery (right = 28, left = 22 vs right = 12, left = 13; p = 0.78). All operative procedures were undertaken by a single surgeon using the posterior approach. Two independent blinded observers recorded all radiological outcomes of interest using plain radiographs. Results. The correlation coefficient was 0.92 (95% confidence interval (CI) 0.88 to 0.95) for intraobserver agreement and 0.88 (95% CI 0.82 to 0.94) for interobserver agreement in all study outcomes. Robotic THA was associated with improved accuracy in restoring the native horizontal (p < 0.001) and vertical (p < 0.001) centres of rotation, and improved preservation of the patient’s native combined offset (p < 0.001) compared with conventional THA. Robotic THA improved accuracy in positioning of the acetabular component within the combined safe zones of inclination and anteversion described by Lewinnek et al (p = 0.02) and Callanan et al (p = 0.01) compared with conventional THA. There was no difference between the two treatment groups in achieving the planned leg-length correction (p = 0.10). Conclusion. Robotic-arm assisted THA was associated with improved accuracy in restoring the native centre of rotation, better preservation of the combined offset, and more precise acetabular component positioning within the safe zones of inclination and anteversion compared with conventional manual THA


The Journal of Bone & Joint Surgery British Volume
Vol. 84-B, Issue 1 | Pages 42 - 47
1 Jan 2002
Brismar BH Wredmark T Movin T Leandersson J Svensson O

We studied 19 videotaped knee arthroscopies in 19 patients with mild to moderate osteoarthritis (OA) of the knee in order to compare the intraobserver and interobserver reliability and the patterns of disagreement between four orthopaedic surgeons. The classifications of OA of Collins, Outerbridge and the French Society of Arthroscopy were used. Intraobserver and interobserver agreements using kappa measures were 0.42 to 0.66 and 0.43 to 0.49, respectively. Only 6% to 8% of paired intraobserver classifications differed by more than one category. Observer-specific disagreement was evident both within and between observers. A small, but significant, occasional variation was also seen. Although reliability may improve by an analysis of disagreement, it appears that the arthroscopic grading of early osteoarthritic lesions is inexact


The Journal of Bone & Joint Surgery British Volume
Vol. 91-B, Issue 9 | Pages 1191 - 1196
1 Sep 2009
Pagenstert GI Barg A Leumann AG Rasch H Müller-Brand J Hintermann B Valderrabano V

The precise localisation of osteoarthritic changes is crucial for selective surgical treatment. Single photon-emission CT-CT (SPECT-CT) combines both morphological and biological information. We hypothesised that SPECT-CT increased the intra- and interobserver reliability to localise increased uptake compared with traditional evaluation of CT and bone scanning together. We evaluated 20 consecutive patients with pain of uncertain origin in the foot and ankle by radiography and SPECT-CT, available as fused SPECT-CT, and by separate bone scanning and CT. Five observers assessed the presence or absence of arthritis. The images were blinded and randomly ordered. They were evaluated twice at an interval of six weeks. Kappa and multirater kappa values were calculated. The mean intraobserver reliability for SPECT-CT was excellent (κ = 0.86; 95% CI 0.81 to 0.88) and significantly higher than that for CT and bone scanning together. SPECT-CT had significantly higher interobserver agreement, especially when evaluating the naviculocuneiform and tarsometatarsal joints. SPECT-CT is useful in localising active arthritis especially in areas where the number and configuration of joints are complex


The Bone & Joint Journal
Vol. 106-B, Issue 1 | Pages 99 - 106
1 Jan 2024
Khal AA Aiba H Righi A Gambarotti M Atherley O'Meally AO Manfrini M Donati DM Errani C

Aims

Low-grade central osteosarcoma (LGCOS), a rare type of osteosarcoma, often has misleading radiological and pathological features that overlap with those of other bone tumours, thereby complicating diagnosis and treatment. We aimed to analyze the clinical, radiological, and pathological features of patients with LGCOS, with a focus on diagnosis, treatment, and outcomes.

Methods

We retrospectively analyzed the medical records of 49 patients with LGCOS (Broder’s grade 1 to 2) treated between January 1985 and December 2017 in a single institute. We examined the presence of malignant features on imaging (periosteal reaction, cortical destruction, soft-tissue invasion), the diagnostic accuracy of biopsy, surgical treatment, and oncological outcome.


Bone & Joint Open
Vol. 5, Issue 6 | Pages 524 - 531
24 Jun 2024
Woldeyesus TA Gjertsen J Dalen I Meling T Behzadi M Harboe K Djuv A

Aims

To investigate if preoperative CT improves detection of unstable trochanteric hip fractures.

Methods

A single-centre prospective study was conducted. Patients aged 65 years or older with trochanteric hip fractures admitted to Stavanger University Hospital (Stavanger, Norway) were consecutively included from September 2020 to January 2022. Radiographs and CT images of the fractures were obtained, and surgeons made individual assessments of the fractures based on these. The assessment was conducted according to a systematic protocol including three classification systems (AO/Orthopaedic Trauma Association (OTA), Evans Jensen (EVJ), and Nakano) and questions addressing specific fracture patterns. An expert group provided a gold-standard assessment based on the CT images. Sensitivities and specificities of surgeons’ assessments were estimated and compared in regression models with correlations for the same patients. Intra- and inter-rater reliability were presented as Cohen’s kappa and Gwet’s agreement coefficient (AC1).


The Bone & Joint Journal
Vol. 105-B, Issue 7 | Pages 775 - 782
1 Jul 2023
Koper MC Spek RWA Reijman M van Es EM Baart SJ Verhaar JAN Bos PK

Aims

The aims of this study were to determine if an increasing serum cobalt (Co) and/or chromium (Cr) concentration is correlated with a decreasing Harris Hip Score (HHS) and Hip disability and Osteoarthritis Outcome Score (HOOS) in patients who received the Articular Surface Replacement (ASR) hip resurfacing arthroplasty (HRA), and to evaluate the ten-year revision rate and show if sex, inclination angle, and Co level influenced the revision rate.

Methods

A total of 62 patients with an ASR-HRA were included and monitored yearly postoperatively. At follow-up, serum Co and Cr levels were measured and the HHS and the HOOS were scored. In addition, preoperative patient and implant variables and the need for revision surgery were recorded. We used a linear mixed model to relate the serum Co and Cr levels to different patient-reported outcome measures (PROMs). For the survival analyses we used the Kaplan-Meier and Cox regression model.


Bone & Joint 360
Vol. 12, Issue 3 | Pages 32 - 35
1 Jun 2023

The June 2023 Trauma Roundup360 looks at: Aspirin or low-molecular-weight heparin for thromboprophylaxis?; Lateral plating or retrograde nailing for distal femur fractures?; Sciatic nerve palsy after acetabular fixation: what about patient position?; How reliable is the new OTA/AO classification for trochanteric hip fractures?; Young hip fractures: is a medial buttress the answer?; When is the best time to ‘flap’ an open fracture?; The mortality burden of nonoperatively managed hip fractures.


Bone & Joint 360
Vol. 12, Issue 6 | Pages 36 - 39
1 Dec 2023

The December 2023 Trauma Roundup360 looks at: Distal femoral arthroplasty: medical risks under the spotlight; Quads repair: tunnels or anchors?; Complex trade-offs in treating severe tibial fractures: limb salvage versus primary amputation; Middle-sized posterior malleolus fractures – to fix?; Bone transport through induced membrane: a randomized controlled trial; Displaced geriatric femoral neck fractures; Risk factors for reoperation to promote union in 1,111 distal femur fractures; New versus old – reliability of the OTA/AO classification for trochanteric hip fractures; Risk factors for fracture-related infection after ankle fracture surgery.


Bone & Joint Open
Vol. 4, Issue 9 | Pages 659 - 667
1 Sep 2023
Nasser AAHH Osman K Chauhan GS Prakash R Handford C Nandra RS Mahmood A

Aims

Periprosthetic fractures (PPFs) following hip arthroplasty are complex injuries. This study evaluates patient demographic characteristics, management, outcomes, and risk factors associated with PPF subtypes over a decade.

Methods

Using a multicentre collaborative study design, independent of registry data, we identified adults from 29 centres with PPFs around the hip between January 2010 and December 2019. Radiographs were assessed for the Unified Classification System (UCS) grade. Patient and injury characteristics, management, and outcomes were compared between UCS grades. A multinomial logistic regression was performed to estimate relative risk ratios (RRR) of variables on UCS grade.


The Bone & Joint Journal
Vol. 104-B, Issue 11 | Pages 1196 - 1201
1 Nov 2022
Anderson CG Brilliant ZR Jang SJ Sokrab R Mayman DJ Vigdorchik JM Sculco PK Jerabek SA

Aims

Although CT is considered the benchmark to measure femoral version, 3D biplanar radiography (hipEOS) has recently emerged as a possible alternative with reduced exposure to ionizing radiation and shorter examination time. The aim of our study was to evaluate femoral stem version in postoperative total hip arthroplasty (THA) patients and compare the accuracy of hipEOS to CT. We hypothesize that there will be no significant difference in calculated femoral stem version measurements between the two imaging methods.

Methods

In this study, 45 patients who underwent THA between February 2016 and February 2020 and had both a postoperative CT and EOS scan were included for evaluation. A fellowship-trained musculoskeletal radiologist and radiological technician measured femoral version for CT and 3D EOS, respectively. Comparison of values for each imaging modality were assessed for statistical significance.


Bone & Joint Open
Vol. 3, Issue 10 | Pages 759 - 766
5 Oct 2022
Schmaranzer F Meier MK Lerch TD Hecker A Steppacher SD Novais EN Kiapour AM

Aims

To evaluate how abnormal proximal femoral anatomy affects different femoral version measurements in young patients with hip pain.

Methods

First, femoral version was measured in 50 hips of symptomatic consecutively selected patients with hip pain (mean age 20 years (SD 6), 60% (n = 25) females) on preoperative CT scans using different measurement methods: Lee et al, Reikerås et al, Tomczak et al, and Murphy et al. Neck-shaft angle (NSA) and α angle were measured on coronal and radial CT images. Second, CT scans from three patients with femoral retroversion, normal femoral version, and anteversion were used to create 3D femur models, which were manipulated to generate models with different NSAs and different cam lesions, resulting in eight models per patient. Femoral version measurements were repeated on manipulated femora.


The Bone & Joint Journal
Vol. 105-B, Issue 8 | Pages 905 - 911
1 Aug 2023
Giannicola G Amura A Sessa P Prigent S Cinotti G

Aims

The aim of this study was to analyze how proximal radial neck resorption (PRNR) starts and progresses radiologically in two types of press-fit radial head arthroplasties (RHAs), and to investigate its clinical relevance.

Methods

A total of 97 patients with RHA were analyzed: 56 received a bipolar RHA (Group 1) while 41 received an anatomical implant (Group 2). Radiographs were performed postoperatively and after three, six, nine, and 12 weeks, six, nine, 12, 18, and 24 months, and annually thereafter. PRNR was measured in all radiographs in the four radial neck quadrants. The Mayo Elbow Performance Score (MEPS), the abbreviated version of the Disabilities of the Arm, Shoulder, and Hand questionnaire (QuickDASH), and the patient-assessed American Shoulder and Elbow Surgeons score - Elbow (pASES-E) were used for the clinical assessment. Radiological signs of implant loosening were investigated.


The Bone & Joint Journal
Vol. 106-B, Issue 3 | Pages 240 - 248
1 Mar 2024
Kim SE Kwak J Ro DH Lee MC Han H

Aims

The aim of this study was to evaluate whether achieving medial joint opening, as measured by the change in the joint line convergence angle (∆JLCA), is a better predictor of clinical outcomes after high tibial osteotomy (HTO) compared with the mechanical axis deviation, and to find individualized targets for the redistribution of load that reflect bony alignment, joint laxity, and surgical technique.

Methods

This retrospective study analyzed 121 knees in 101 patients. Patient-reported outcome measures (PROMs) were collected preoperatively and one year postoperatively, and were analyzed according to the surgical technique (opening or closing wedge), postoperative mechanical axis deviation (deviations above and below 10% from the target), and achievement of medial joint opening (∆JLCA > 1°). Radiological parameters, including JLCA, mechanical axis deviation, and the difference in JLCA between preoperative standing and supine radiographs (JLCAPD), an indicator of medial soft-tissue laxity, were measured. Cut-off points for parameters related to achieving medial joint opening were calculated from receiver operating characteristic (ROC) curves.


The Bone & Joint Journal
Vol. 106-B, Issue 7 | Pages 696 - 704
1 Jul 2024
Barvelink B Reijman M Smidt S Miranda Afonso P Verhaar JAN Colaris JW

Aims

It is not clear which type of casting provides the best initial treatment in adults with a distal radial fracture. Given that between 32% and 64% of adequately reduced fractures redisplace during immobilization in a cast, preventing redisplacement and a disabling malunion or secondary surgery is an aim of treatment. In this study, we investigated whether circumferential casting leads to fewer fracture redisplacements and better one-year outcomes compared to plaster splinting.

Methods

In a pragmatic, open-label, multicentre, two-period cluster-randomized superiority trial, we compared these two types of casting. Recruitment took place in ten hospitals. Eligible patients aged ≥ 18 years with a displaced distal radial fracture, which was acceptably aligned after closed reduction, were included. The primary outcome measure was the rate of redisplacement within five weeks of immobilization. Secondary outcomes were the rate of complaints relating to the cast, clinical outcomes at three months, patient-reported outcome measures (PROMs) (using the numerical rating scale (NRS), the abbreviated version of the Disabilities of the Arm, Shoulder and Hand (QuickDASH), and Patient-Rated Wrist/Hand Evaluation (PRWHE) scores), and adverse events such as the development of compartment syndrome during one year of follow-up. We used multivariable mixed-effects logistic regression for the analysis of the primary outcome measure.


Bone & Joint Open
Vol. 4, Issue 12 | Pages 932 - 941
6 Dec 2023
Oe K Iida H Otsuki Y Kobayashi F Sogawa S Nakamura T Saito T

Aims

Although there are various pelvic osteotomies for acetabular dysplasia of the hip, shelf operations offer effective and minimally invasive osteotomy. Our study aimed to assess outcomes following modified Spitzy shelf acetabuloplasty.

Methods

Between November 2000 and December 2016, we retrospectively evaluated 144 consecutive hip procedures in 122 patients a minimum of five years after undergoing modified Spitzy shelf acetabuloplasty for acetabular dysplasia including osteoarthritis (OA). Our follow-up rate was 92%. The mean age at time of surgery was 37 years (13 to 58), with a mean follow-up of 11 years (5 to 21). Advanced OA (Tönnis grade ≥ 2) was present preoperatively in 16 hips (11%). The preoperative lateral centre-edge angle ranged from -28° to 25°. Survival was determined by Kaplan-Meier analysis, using conversions to total hip arthroplasty as the endpoint. Risk factors for joint space narrowing less than 2 mm were analyzed using a Cox proportional hazards model.


Bone & Joint Open
Vol. 3, Issue 10 | Pages 795 - 803
12 Oct 2022
Liechti EF Attinger MC Hecker A Kuonen K Michel A Klenke FM

Aims

Traditionally, total hip arthroplasty (THA) templating has been performed on anteroposterior (AP) pelvis radiographs. Recently, additional AP hip radiographs have been recommended for accurate measurement of the femoral offset (FO). To verify this claim, this study aimed to establish quantitative data of the measurement error of the FO in relation to leg position and X-ray source position using a newly developed geometric model and clinical data.

Methods

We analyzed the FOs measured on AP hip and pelvis radiographs in a prospective consecutive series of 55 patients undergoing unilateral primary THA for hip osteoarthritis. To determine sample size, a power analysis was performed. Patients’ position and X-ray beam setting followed a standardized protocol to achieve reproducible projections. All images were calibrated with the KingMark calibration system. In addition, a geometric model was created to evaluate both the effects of leg position (rotation and abduction/adduction) and the effects of X-ray source position on FO measurement.


Aims

Total knee arthroplasty (TKA) may provoke ankle symptoms. The aim of this study was to validate the impact of the preoperative mechanical tibiofemoral angle (mTFA), the talar tilt (TT) on ankle symptoms after TKA, and assess changes in the range of motion (ROM) of the subtalar joint, foot posture, and ankle laxity.

Methods

Patients who underwent TKA from September 2020 to September 2021 were prospectively included. Inclusion criteria were primary end-stage osteoarthritis (Kellgren-Lawrence stage IV) of the knee. Exclusion criteria were missed follow-up visit, post-traumatic pathologies of the foot, and neurological disorders. Radiological angles measured included the mTFA, hindfoot alignment view angle, and TT. The Foot Function Index (FFI) score was assessed. Gait analyses were conducted to measure mediolateral changes of the gait line and ankle laxity was tested using an ankle arthrometer. All parameters were acquired one week pre- and three months postoperatively.


The Journal of Bone & Joint Surgery British Volume
Vol. 91-B, Issue 8 | Pages 1049 - 1053
1 Aug 2009
Braunstein V Kirchhoff C Ockert B Sprecher CM Korner M Mutschler W Wiedemann E Biberthaler P

In 100 patients the fulcrum axis which is the line connecting the anterior tip of the coracoid and the posterolateral angle of the acromion, was used to position true anteroposterior radiographs of the shoulder. This method was then compared with the conventional radiological technique in a further 100 patients. Three orthopaedic surgeons counted the number of images without overlap between the humeral head and glenoid and calculated the amount of the glenoid surface visible in each radiograph. The analysis was repeated for intraobserver reliability. The learning curves of both techniques were studied. The amount of free visible glenoid space was significantly higher using the fulcrum-axis method (64 vs 31) and the comparable glenoid size increased significantly (8.56 vs 6.47). Thus the accuracy of the anteroposterior radiographs of the shoulder is impaired by using this technique. The intra and interobserver reliability showed a high consistency. No learning curve was observed for either technique


Bone & Joint Research
Vol. 12, Issue 1 | Pages 58 - 71
17 Jan 2023
Dagneaux L Limberg AK Owen AR Bettencourt JW Dudakovic A Bayram B Gades NM Sanchez-Sotelo J Berry DJ van Wijnen A Morrey ME Abdel MP

Aims

As has been shown in larger animal models, knee immobilization can lead to arthrofibrotic phenotypes. Our study included 168 C57BL/6J female mice, with 24 serving as controls, and 144 undergoing a knee procedure to induce a contracture without osteoarthritis (OA).

Methods

Experimental knees were immobilized for either four weeks (72 mice) or eight weeks (72 mice), followed by a remobilization period of zero weeks (24 mice), two weeks (24 mice), or four weeks (24 mice) after suture removal. Half of the experimental knees also received an intra-articular injury. Biomechanical data were collected to measure passive extension angle (PEA). Histological data measuring area and thickness of posterior and anterior knee capsules were collected from knee sections.


The Journal of Bone & Joint Surgery British Volume
Vol. 78-B, Issue 2 | Pages 191 - 194
1 Mar 1996
McCaskie AW Brown AR Thompson JR Gregg PJ

Three radiological methods are commonly used to assess the outcome of total hip replacement (THR). They aim to record the appearance of lucent areas and migration of the prosthesis in a reproducible manner. Two of them were designed to monitor the implant through time and one to grade the quality of cementing. We have measured the level of inter- and intraobserver agreement in all three systems. We randomised 30 patients to receive either finger packing or retrograde gun cementing during Charnley hip replacements. The postoperative departmental radiographs were evaluated in a blinded study by two orthopaedic trainees, two consultants and two experts in THR. The trainees and consultants repeated the exercise at least two weeks later. We used the unweighted kappa statistic to establish the levels of agreement. In general, intraobserver agreement was moderate but interobserver agreement was poor, with levels similar to or less than those expected by chance. Our results indicate that such systems cannot provide reliable data from centres in different parts of the world, with various levels of surgeon evaluating radiographs at differing time intervals. We discuss the problem and suggest some methods of improvement


Bone & Joint Open
Vol. 3, Issue 2 | Pages 114 - 122
1 Feb 2022
Green GL Arnander M Pearse E Tennent D

Aims

Recurrent dislocation is both a cause and consequence of glenoid bone loss, and the extent of the bony defect is an indicator guiding operative intervention. Literature suggests that loss greater than 25% requires glenoid reconstruction. Measuring bone loss is controversial; studies use different methods to determine this, with no clear evidence of reproducibility. A systematic review was performed to identify existing CT-based methods of quantifying glenoid bone loss and establish their reliability and reproducibility

Methods

A Preferred Reporting Items for Systematic reviews and Meta-Analyses-compliant systematic review of conventional and grey literature was performed.