Bone & Joint Research
Vol. 4, Issue 12 | Pages 190 - 194
1 Dec 2015
Kleinlugtenbelt YV Hoekstra M Ham SJ Kloen P Haverlag R Simons MP Bhandari M Goslings JC Poolman RW Scholtes VAB

Objectives. Current studies on the additional benefit of using computed tomography (CT) in order to evaluate the surgeons’ agreement on treatment plans for fracture are inconsistent. This inconsistency can be explained by a methodological phenomenon called ‘spectrum bias’, defined as the bias inherent when investigators choose a population lacking therapeutic uncertainty for evaluation. The aim of the study is to determine the influence of spectrum bias on the intra-observer agreement of treatment plans for fractures of the distal radius. Methods. Four surgeons evaluated 51 patients with displaced fractures of the distal radius at four time points: T1 and T2: conventional radiographs; T3 and T4: radiographs and additional CT scan (radiograph and CT). Choice of treatment plan (operative or non-operative) and therapeutic certainty (five-point scale: very uncertain to very certain) were rated. To determine the influence of spectrum bias, the intra-observer agreement was analysed, using Kappa statistics, for each degree of therapeutic certainty. Results. In cases with high therapeutic certainty, intra-observer agreement based on radiograph was almost perfect (0.86 to 0.90), but decreased to moderate based on a radiograph and CT (0.47 to 0.60). In cases with high therapeutic uncertainty, intra-observer agreement was slight at best (-0.12 to 0.19), but increased to moderate based on the radiograph and CT (0.56 to 0.57). Conclusion. Spectrum bias influenced the outcome of this agreement study on treatment plans. An additional CT scan improves the intra-observer agreement on treatment plans for a fracture of the distal radius only when there is therapeutic uncertainty. Reporting and analysing intra-observer agreement based on the surgeon’s level of certainty is an appropriate method to minimise spectrum bias. Cite this article: Bone Joint Res 2015;4:190–194


Orthopaedic Proceedings
Vol. 94-B, Issue SUPP_XXXVII | Pages 122 - 122
1 Sep 2012
Jensen C Overgaard S Aagaard P

Introduction. Total leg muscle function in hip OA patients is not well studied. We used a test-retest protocol to evaluate the reproducibility of single- and multi-joint peak muscle torque and rapid torque development in a group of 40–65 yr old hip patients. Both peak torque and torque development are outcome measures associated with functional performance during activities of daily living. Material and Methods. Patients: Twenty patients (age 55.5±3.3, BMI 27.6±4.8) who underwent total hip arthroplasty participated in this study. Reliability: We used the intra-class correlation (ICC) and within subject coefficients of variation (CVws) to evaluate reliability. Agreement: Relative Bland-Altman 95% limits of agreement (LOA) and smallest detectable difference (SDD) were calculated and used for evaluation of measurement accuracy. Parameters: Maximal muscle strength (peak torque, Nm) and rate of torque development (Nm·sec⁻¹) for affected (AF) and non-affected (NA) side were measured during unilateral knee extension-flexion (seated), hip extension-flexion, and hip adduction-abduction (standing), respectively. Contractile RTD100, RTD200 and RTDpeak were derived as the average slope of the torque-time curve (torque/time) at 0–100, 0–200 and 0–peak relative to onset of contraction. Protocol: After 5 min level walking at self-selected and maximum speeds each muscle group was tested using 1–2 sub-maximal contraction efforts followed by 3 maximal contractions of 4 s duration. Statistics: The variance components were estimated using STATA12, with muscle function and occasion as independent variable and patients as random factor, using the restricted maximum likelihood method (α = 0.05). Results. For all exercises and sides, the ICCs for peak torque were good (0.81–0.96) with CVws ranging from 5.0–10.8%. Similar good ICCs were observed for RTD200 on the non-affected side (0.83–0.93), whereas most exercises (4/6) on the affected side showed moderate to good ICC (0.72–0.82).
We found moderate CVws for RTD200 with 12.8–18.7% and 10.3–18.9%, affected and non-affected, respectively. With few exceptions the ICCs and CVws for RTD100 were moderate to poor on the affected side but good to moderate on the non-affected side. The SDDs for peak torque ranged from 14.9 Nm to 39.0 Nm, equal to relative LOA of 13.9–23.8%. For RTD200, the SDDs were 77–257 Nm·sec⁻¹ and 29.2–86.2%, absolute and relative, respectively. With few exceptions interventions measuring RTD100 and RTDpeak would have to find changes exceeding 60% for them to be statistically significant. Conclusions. Our novel set-up for lower limb isometric muscle testing showed overall good reproducibility for peak torque, moderate for RTD200, while poor for RTD100 and RTDpeak. The results for peak torque and RTD200 are promising for defining relevant changes in muscle function in future longitudinal clinical trials in this patient group.


The Bone & Joint Journal
Vol. 103-B, Issue 12 | Pages 1802 - 1808
1 Dec 2021
Bruce J Knight R Parsons N Betteridge R Verdon A Brown J Campolier M Achten J Costa ML

Aims. Deep surgical site infection (SSI) is common after lower limb fracture. We compared the diagnosis of deep SSI using alternative methods of data collection and examined the agreement of clinical photography and in-person clinical assessment by the Centers for Disease Control and Prevention (CDC) criteria after lower limb fracture surgery. Methods. Data from two large, UK-based multicentre randomized controlled major trauma trials investigating SSI and wound healing after surgical repair of open lower limb fractures that could not be primarily closed (UK WOLLF), and surgical incisions for fractures that were primarily closed (UK WHiST), were examined. Trial interventions were standard wound care management and negative pressure wound therapy after initial surgical debridement. Wound outcomes were collected from 30 days to six weeks. We compared the level of agreement between wound photography and clinical assessment of CDC-defined SSI. We also assessed the level of agreement between blinded independent assessors of the photographs. Results. Rates of CDC-defined deep SSI were 7.6% (35/460) after open fracture and 6.3% (95/1519) after closed incisional repair. Photographs were obtained for 77% and 73% of WOLLF and WHiST cohorts respectively (all participants n = 1,478). Agreement between photographic-SSI and CDC-SSI was fair for open fracture wounds (83%; k = 0.27 (95% confidence interval (CI) 0.14 to 0.42)) and for closed incisional wounds (88%; k = 0.29 (95% CI 0.20 to 0.37)) although the rate of photographically detected deep SSIs was twice as high as CDC-SSI (12% vs 6%). Agreement between different assessors for photographic-SSI (WOLLF 88%; k = 0.63 (95% CI 0.52 to 0.72); WHiST 89%; k = 0.61 (95% CI 0.54 to 0.69)) and wound healing was good (WOLLF 90%; k = 0.80 (95% CI 0.73 to 0.86); WHiST 87%; k = 0.57 (95% CI 0.50 to 0.64)). Conclusion.
Although wound photography was feasible within the research context and inter-rater agreement was substantial, digital photographs used in isolation overestimated deep SSI rates when compared to CDC criteria. Wound photography should not replace clinical assessment in pragmatic trials but may be useful for screening purposes where surgical infection outcomes are paramount. Cite this article: Bone Joint J 2021;103-B(12):1802–1808


The Bone & Joint Journal
Vol. 101-B, Issue 10 | Pages 1292 - 1299
1 Oct 2019
Masters J Metcalfe D Parsons NR Achten J Griffin XL Costa ML

Aims. This study explores data quality in operation type and fracture classification recorded as part of a large research study and a national audit with an independent review. Patients and Methods. At 17 centres, an expert surgeon reviewed a randomly selected subset of cases from their centre with regard to fracture classification using the AO system and type of operation performed. Agreement for these variables was then compared with the data collected during conduct of the World Hip Trauma Evaluation (WHiTE) cohort study. Both types of surgery and fracture classification were collapsed to identify the level of detail of reporting that achieved meaningful agreement. In the National Hip Fracture Database (NHFD), the types of operation and fracture classification were explored to identify the proportion of “highly improbable” combinations. Results. The records were reviewed for 903 cases. Agreement for the subtypes of extracapsular fracture was poor; most centres achieved no better than “fair” agreement. When the classification was collapsed to a single option for “extracapsular” fracture, only four centres failed to have at least “moderate” agreement. There was only “moderate” agreement for the subtypes of intracapsular fracture, which improved to “substantial” when collapsed to “intracapsular”. Subtrochanteric fracture types were well reported with “substantial” agreement. There was near “perfect” agreement for internal fixation procedures. “Perfect” or “substantial” agreement was achieved when the type of arthroplasty surgery was reported at the level of “hemiarthroplasty” and “total hip replacement”. When reviewing data submitted to the NHFD, a minimum of 5.2% of cases contained “highly improbable” procedures for the stated fracture classification. Conclusion. The complexity of collecting fracture classification data at a national scale compromises the accuracy with which detailed classification systems can be reported. 
Data around type of surgery performed show similar tendencies. Data capture, reporting, and interpretation in future studies must take this into account. Cite this article: Bone Joint J 2019;101-B:1292–1299


Orthopaedic Proceedings
Vol. 105-B, Issue SUPP_6 | Pages 2 - 2
20 Mar 2023
Brennan C Slevin Z Savaridas T

The suprascapular nerve is an ideal target for nerve blockade to alleviate shoulder pain given its widespread innervation to the shoulder girdle. Many techniques have been described. To widen the availability of this treatment we investigate whether an anatomical landmark technique can be easily learned by novice injectors to provide efficacious blockade. Five injectors were recruited with varying experience, from a novice medical student to an orthopaedic consultant. Five torsos (10 shoulders) were used. A single page of written instruction and illustration of the Dangoisse landmark technique was provided prior to injection of a Thiel embalmed cadaver bilaterally. A pre-mixed injectate with blue dye was used. Cadavers were dissected and the presence or absence of dye staining reported by 3 observers and a consensus agreement reached. Dissection demonstrated diffuse staining in the suprascapular fossa. 90% of shoulders were found to have adequate staining of the suprascapular nerve directly, or its distal branches, in a manner which would provide adequate anaesthesia. The inter-observer agreement was good (k = 0.73) for staining at the supraspinous fossa and excellent (k = 0.87) for staining distally. The technique was easily performed by novice injectors with a 100% success rate. We demonstrate that this technique is reproducible by a range of clinicians to effectively provide anaesthesia of the SScN. The main risks are ineffective block (10% in this series) and intravascular injection. Within a resource-strained healthcare environment greater uptake of this technique is likely to be of benefit to a wider array of patients.


Orthopaedic Proceedings
Vol. 105-B, Issue SUPP_14 | Pages 3 - 3
10 Oct 2023
Verma S Malaviya S Barker S

Technological advancements in orthopaedic surgery have mainly focused on increasing precision during the operation; however, there have been few developments in post-operative physiotherapy. We have developed a computer vision program using machine learning that can virtually measure the range of movement of a joint to track progress after surgery. These data can be used by physiotherapists to change patients’ exercise regimes more objectively and to help patients visualise the progress that they have made. In this study, we tested our program's reliability and validity to find a benchmark for future use on patients. We compared 150 shoulder joint angles, measured using a goniometer, with those calculated by our program, called ArmTracking, in a group of 10 participants (5 males and 5 females). Reliability was tested using adjusted R squared and validity was tested using 95% limits of agreement. Our clinically acceptable limit of agreement was ± 10° for ArmTracking to be used interchangeably with goniometry. ArmTracking showed excellent overall reliability of 97.1% when all shoulder movements were combined but there were lower scores for some movements, such as shoulder extension at 75.8%. There was moderate validity shown when all shoulder movements were combined at 9.6° overestimation and 18.3° underestimation. Computer vision programs have great potential to be used in telerehabilitation to collect useful information as patients carry out prescribed exercises at home. However, they need to be trained well for precise joint detections to reduce the range of errors in readings.


Bone & Joint Open
Vol. 5, Issue 6 | Pages 524 - 531
24 Jun 2024
Woldeyesus TA Gjertsen J Dalen I Meling T Behzadi M Harboe K Djuv A

Aims. To investigate if preoperative CT improves detection of unstable trochanteric hip fractures. Methods. A single-centre prospective study was conducted. Patients aged 65 years or older with trochanteric hip fractures admitted to Stavanger University Hospital (Stavanger, Norway) were consecutively included from September 2020 to January 2022. Radiographs and CT images of the fractures were obtained, and surgeons made individual assessments of the fractures based on these. The assessment was conducted according to a systematic protocol including three classification systems (AO/Orthopaedic Trauma Association (OTA), Evans Jensen (EVJ), and Nakano) and questions addressing specific fracture patterns. An expert group provided a gold-standard assessment based on the CT images. Sensitivities and specificities of surgeons’ assessments were estimated and compared in regression models with correlations for the same patients. Intra- and inter-rater reliability were presented as Cohen’s kappa and Gwet’s agreement coefficient (AC1). Results. We included 120 fractures in 119 patients. Compared to radiographs, CT increased the sensitivity of detecting unstable trochanteric fractures from 63% to 70% (p = 0.028) and from 70% to 76% (p = 0.004) using AO/OTA and EVJ, respectively. Compared to radiographs alone, CT increased the sensitivity of detecting a large posterolateral trochanter major fragment or a comminuted trochanter major fragment from 63% to 76% (p = 0.002) and from 38% to 55% (p < 0.001), respectively. CT improved intra-rater reliability for stability assessment using EVJ (AC1 0.68 to 0.78; p = 0.049) and for detecting a large posterolateral trochanter major fragment (AC1 0.42 to 0.57; p = 0.031). Conclusion. A preoperative CT of trochanteric fractures increased detection of unstable fractures using the AO/OTA and EVJ classification systems. 
Compared to radiographs, CT improved intra-rater reliability when assessing fracture stability and detecting large posterolateral trochanter major fragments. Cite this article: Bone Jt Open 2024;5(6):524–531


Bone & Joint Open
Vol. 5, Issue 3 | Pages 236 - 242
22 Mar 2024
Guryel E McEwan J Qureshi AA Robertson A Ahluwalia R

Aims. Ankle fractures are common injuries and the third most common fragility fracture. In all, 40% of ankle fractures in the frail are open and represent a complex clinical scenario, with morbidity and mortality rates similar to hip fracture patients. They have a higher risk of complications, such as wound infections, malunion, hospital-acquired infections, pressure sores, veno-thromboembolic events, and significant sarcopaenia from prolonged bed rest. Methods. A modified Delphi method was used and a group of experts with a vested interest in best practice were invited from the British Foot and Ankle Society (BOFAS), British Orthopaedic Association (BOA), Orthopaedic Trauma Society (OTS), British Association of Plastic & Reconstructive Surgeons (BAPRAS), British Geriatric Society (BGS), and the British Limb Reconstruction Society (BLRS). Results. In the first stage, there were 36 respondents to the survey, with over 70% stating their unit treats more than 20 such cases per year. There was a 50:50 split regarding if the timing of surgery should be within 36 hours, as per the hip fracture guidelines, or 72 hours, as per the open fracture guidelines. Overall, 75% would attempt primary wound closure and 25% would utilize a local flap. There was no orthopaedic agreement on fixation, and 75% would permit weightbearing immediately. In the second stage, performed at the BLRS meeting, experts discussed the survey results and agreed upon a consensus for the management of open elderly ankle fractures. Conclusion. A mutually agreed consensus from the expert panel was reached to enable the best practice for the management of patients with frailty with an open ankle fracture: 1) all units managing lower limb fragility fractures should do so through a cohorted multidisciplinary pathway. 
This pathway should follow the standards laid down in the "care of the older or frail orthopaedic trauma patient" British Orthopaedic Association Standards for Trauma and Orthopaedics (BOAST) guideline. These patients have low bone density, and we should recommend full falls and bone health assessment; 2) all open lower limb fragility fractures should be treated in a single stage within 24 hours of injury if possible; 3) all patients with fragility fractures of the lower limb should be considered for mobilisation on the day following surgery; 4) all patients with lower limb open fragility fractures should be considered for tissue sparing, with judicious debridement as a default; 5) all patients with open lower limb fragility fractures should be managed by a consultant plastic surgeon with primary closure wherever possible; and 6) the method of fixation must allow for immediate unrestricted weightbearing. Cite this article: Bone Jt Open 2024;5(3):236–242


The Bone & Joint Journal
Vol. 106-B, Issue 4 | Pages 412 - 418
1 Apr 2024
Alqarni AG Nightingale J Norrish A Gladman JRF Ollivere B

Aims. Frailty greatly increases the risk of adverse outcome of trauma in older people. Frailty detection tools appear to be unsuitable for use in traumatically injured older patients. We therefore aimed to develop a method for detecting frailty in older people sustaining trauma using routinely collected clinical data. Methods. We analyzed prospectively collected registry data from 2,108 patients aged ≥ 65 years who were admitted to a single major trauma centre over five years (1 October 2015 to 31 July 2020). We divided the sample equally into two, creating derivation and validation samples. In the derivation sample, we performed univariate analyses followed by multivariate regression, starting with 27 clinical variables in the registry to predict Clinical Frailty Scale (CFS; range 1 to 9) scores. Bland-Altman analyses were performed in the validation cohort to evaluate any biases between the Nottingham Trauma Frailty Index (NTFI) and the CFS. Results. In the derivation cohort, five of the 27 variables were strongly predictive of the CFS (regression coefficient B = 6.383 (95% confidence interval 5.03 to 7.74), p < 0.001): age, Abbreviated Mental Test score, admission haemoglobin concentration (g/l), pre-admission mobility (needs assistance or not), and mechanism of injury (falls from standing height). In the validation cohort, there was strong agreement between the NTFI and the CFS (mean difference 0.02) with no apparent systematic bias. Conclusion. We have developed a clinically applicable tool using easily and routinely measured physiological and functional parameters, which clinicians and researchers can use to guide patient care and to stratify the analysis of quality improvement and research projects. Cite this article: Bone Joint J 2024;106-B(4):412–418


The Bone & Joint Journal
Vol. 103-B, Issue 4 | Pages 775 - 781
1 Apr 2021
Mellema JJ Janssen S Schouten T Haverkamp D van den Bekerom MPJ Ring D Doornberg JN

Aims. This study evaluated variation in the surgical treatment of stable (A1) and unstable (A2) trochanteric hip fractures among an international group of orthopaedic surgeons, and determined the influence of patient, fracture, and surgeon characteristics on choice of implant (intramedullary nailing (IMN) versus sliding hip screw (SHS)). Methods. A total of 128 orthopaedic surgeons in the Science of Variation Group evaluated radiographs of 30 patients with Type A1 and A2 trochanteric hip fractures and indicated their preferred treatment: IMN or SHS. The management of Type A3 (reverse obliquity) trochanteric fractures was not evaluated. Agreement between surgeons was calculated using multirater kappa. Multivariate logistic regression models were used to assess whether patient, fracture, and surgeon characteristics were independently associated with choice of implant. Results. The overall agreement between surgeons on implant choice was fair (kappa = 0.27 (95% confidence interval (CI) 0.25 to 0.28)). Factors associated with preference for IMN included USA compared to Europe or the UK (Europe odds ratio (OR) 0.56 (95% CI 0.47 to 0.67); UK OR 0.16 (95% CI 0.12 to 0.22); p < 0.001); exposure to IMN only during training compared to surgeons that were exposed to both (only IMN during training OR 2.6 (95% CI 2.0 to 3.4); p < 0.001); and A2 compared to A1 fractures (Type A2 OR 10 (95% CI 8.4 to 12); p < 0.001). Conclusion. In an international cohort of orthopaedic surgeons, there was a large variation in implant preference for patients with A1 and A2 trochanteric fractures. This is due to surgeon bias (country of practice and aspects of training). The observation that surgeons favoured the more expensive implant (IMN) in the absence of convincing evidence of its superiority suggests that surgeon de-biasing strategies may be a useful focus for optimizing patient outcomes and promoting value-based healthcare. Cite this article: Bone Joint J 2021;103-B(4):775–781


The Bone & Joint Journal
Vol. 104-B, Issue 8 | Pages 963 - 971
1 Aug 2022
Sun Z Liu W Liu H Li J Hu Y Tu B Wang W Fan C

Aims. Heterotopic ossification (HO) is a common complication after elbow trauma and can cause severe upper limb disability. Although multiple prognostic factors have been reported to be associated with the development of post-traumatic HO, no model has yet been able to combine these predictors more succinctly to convey prognostic information and medical measures to patients. Therefore, this study aimed to identify prognostic factors leading to the formation of HO after surgery for elbow trauma, and to establish and validate a nomogram to predict the probability of HO formation in such particular injuries. Methods. This multicentre case-control study comprised 200 patients with post-traumatic elbow HO and 229 patients who had elbow trauma but without HO formation between July 2019 and December 2020. Features possibly associated with HO formation were obtained. The least absolute shrinkage and selection operator regression model was used to optimize feature selection. Multivariable logistic regression analysis was applied to build the new nomogram: the Shanghai post-Traumatic Elbow Heterotopic Ossification Prediction model (STEHOP). STEHOP was validated by concordance index (C-index) and calibration plot. Internal validation was conducted using bootstrapping validation. Results. Male sex, obesity, open wound, dislocations, late definitive surgical treatment, and lack of use of non-steroidal anti-inflammatory drugs were identified as adverse predictors and incorporated to construct the STEHOP model. It displayed good discrimination with a C-index of 0.80 (95% confidence interval 0.75 to 0.84). A high C-index value of 0.77 could still be reached in the internal validation. The calibration plot showed good agreement between nomogram prediction and observed outcomes. Conclusion. The newly developed STEHOP model is a valid and convenient instrument to predict HO formation after surgery for elbow trauma. 
It could assist clinicians in counselling patients regarding treatment expectations and therapeutic choices. Cite this article: Bone Joint J 2022;104-B(8):963–971


Orthopaedic Proceedings
Vol. 105-B, Issue SUPP_18 | Pages 18 - 18
1 Dec 2023
Fawdry A O'Dowd D

Introduction. Activity scales are used throughout orthopaedics as a component of PROMs. The Tegner Activity Scale (TAS) is commonly used and is validated in various knee injuries in adults. It has a reading age of 18 years, presenting an understanding problem for children. An alternative is HSS-PediFABS, but this looks at specific skills like running, cutting, pivoting rather than sporting level. Our aim was to determine if children understood TAS and whether their answers compared to how their parents scored them and determine if our suggested sporting levels were more appropriate for them. Method. We created a study form to compare levels given by children and their parent. We added our own suggested levels, with a reading age of 9, created by a discussion group of paediatric orthopaedic surgeons. Following ethics approval, a sample size was determined via power calculation. All patients over 7 and their parents presenting to the orthopaedic clinic at SCH over a 4-month period were asked to fill out the TAS, baseline questions and rank the new suggested sporting levels. Results. 51 patients and their parents were recruited, with a mean age of 13 (±0.31, 8–17); 35% were female. The mean TAS score for children rating themselves was 7.04 (±0.32, 2–10) vs 6.43 (±0.37, 0–10) for parents rating the child (p=0.31). The average weekly activity time rated by children was 6.72 hours (±0.84, 0–30) vs 7.48 (±1.02, 0–36) rated by the parent (p=0.68). Our suggested levels for paediatric patients were ordered correctly by both groups (mode score). The mean new activity level for children was 4.9 (±0.24, 2–9) vs 4.81 (±0.26, 1–8) by their parent (p=0.79). The mean score difference for TAS was 1.42 vs 1.2 in the new score (p=0.38). Conclusion. Paediatric patients had difficulty understanding the TAS and there was poor agreement of activity levels between patients and parents.


The Bone & Joint Journal
Vol. 100-B, Issue 2 | Pages 242 - 246
1 Feb 2018
Ghoshal A Enninghorst N Sisak K Balogh ZJ

Aims. To evaluate interobserver reliability of the Orthopaedic Trauma Association’s open fracture classification system (OTA-OFC). Patients and Methods. Patients of any age with a first presentation of an open long bone fracture were included. Standard radiographs, wound photographs, and a short clinical description were given to eight orthopaedic surgeons, who independently evaluated the injury using both the Gustilo and Anderson (GA) and OTA-OFC classifications. The responses were compared for variability using Cohen’s kappa. Results. The overall interobserver agreement was κ = 0.44 for the GA classification and κ = 0.49 for OTA-OFC, which reflects moderate agreement (0.41 to 0.60) for both classifications. The agreement in the five categories of OTA-OFC was: for skin, κ = 0.55 (moderate); for muscle, κ = 0.44 (moderate); for arterial injury, κ = 0.74 (substantial); for contamination, κ = 0.35 (fair); and for bone loss, κ = 0.41 (moderate). Conclusion. Although the OTA-OFC, with similar interobserver agreement to GA, offers a more detailed description of open fractures, further development may be needed to make it a reliable and robust tool. Cite this article: Bone Joint J 2018;100-B:242–6
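Illustrative note (not part of the abstract): the kappa values and verbal labels above can be read against a banding scale. The sketch below computes Cohen's kappa for two raters and maps a kappa value to a label. The banding is an assumption: the labels quoted in the abstract are consistent with the commonly used Landis and Koch bands, but the abstract does not name the scale. The rater data and the function names are hypothetical and are not taken from the study.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters scoring the same cases."""
    n = len(rater_a)
    # Proportion of cases on which the two raters gave the same category.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance, from each rater's marginal frequencies.
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    # Undefined (division by zero) if chance agreement is exactly 1.
    return (observed - expected) / (1 - expected)


def agreement_band(kappa):
    """Verbal label for a kappa value (assumed Landis and Koch bands)."""
    if kappa < 0:
        return "poor"
    for upper, label in [(0.20, "slight"), (0.40, "fair"),
                         (0.60, "moderate"), (0.80, "substantial")]:
        if kappa <= upper:
            return label
    return "almost perfect"


# Hypothetical ratings of four cases by two raters (1 = present, 0 = absent).
example_kappa = cohen_kappa([1, 1, 0, 0], [1, 0, 0, 0])

# Kappa values quoted in the abstract, mapped to their bands.
reported = {"skin": 0.55, "muscle": 0.44, "arterial injury": 0.74,
            "contamination": 0.35, "bone loss": 0.41}
bands = {name: agreement_band(value) for name, value in reported.items()}
```

With eight raters, as in the study, a two-rater statistic such as this would have to be applied pairwise or replaced by a multirater statistic; the abstract does not say which approach was taken.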


The Bone & Joint Journal
Vol. 102-B, Issue 1 | Pages 33 - 41
1 Jan 2020
Norman JG Brealey S Keding A Torgerson D Rangan A

Aims. The aim of this study was to explore whether time to surgery affects functional outcome in displaced proximal humeral fractures. Methods. A total of 250 patients presenting within three weeks of sustaining a displaced proximal humeral fracture involving the surgical neck were recruited at 32 acute NHS hospitals in the United Kingdom between September 2008 and April 2011. Of the 125 participants, 109 received surgery (fracture fixation or humeral head replacement) as per randomization. Data were included for 101 and 67 participants at six-month and five-year follow-up, respectively. Oxford Shoulder Scores (OSS) collected at six, 12, and 24 months and at three, four, and five years following randomization were plotted against time to surgery. Long-term recovery was explored by plotting six-month scores against five-year scores and agreement was illustrated with a Bland-Altman plot. Results. The mean time from initial trauma to surgery was 10.5 days (1 to 33). Earlier surgical intervention did not improve OSS throughout follow-up, nor when stratified by participant age (< 65 years vs ≥ 65 years) and fracture severity (one- and two-part vs three- and four-part fractures). Participants managed later than reported international averages (three days in the United States and Germany, eight days in the United Kingdom) did not have worse outcomes. At five-year follow-up, 50 participants (76%) had the same or improved OSS compared with six months (six-month mean OSS 35.8 (SD 10.0); five-year mean OSS 40.1 (SD 9.1); r = 0.613). A Bland-Altman plot demonstrated a positive mean difference (3.3 OSS points (SD 7.92)) with wide 95% limits of agreement (-12.2 and 18.8 points). Conclusion. Timing of surgery did not affect OSS at any stage of follow-up, irrespective of age or fracture type. Most participants had maximum functional outcome at six months that was maintained at five years. These findings may help guide providers of trauma services on surgical prioritization.
Cite this article: Bone Joint J 2020;102-B(1):33–41
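Illustrative note (not part of the abstract): Bland-Altman 95% limits of agreement are conventionally taken as the mean difference plus or minus 1.96 standard deviations of the differences. The sketch below applies that convention to the mean difference (3.3) and SD (7.92) quoted in the abstract. That the authors used exactly this multiplier is an assumption, and the function name is hypothetical.

```python
def limits_of_agreement(mean_diff, sd_diff, z=1.96):
    """Lower and upper 95% limits of agreement (assumes the usual 1.96 multiplier)."""
    half_width = z * sd_diff
    return mean_diff - half_width, mean_diff + half_width


# Figures quoted in the abstract: mean difference 3.3 OSS points, SD 7.92.
lower, upper = limits_of_agreement(3.3, 7.92)
print(round(lower, 1), round(upper, 1))  # -12.2 18.8
```

Under this assumption the result matches the limits reported in the abstract (-12.2 and 18.8 points).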


Orthopaedic Proceedings
Vol. 104-B, Issue SUPP_6 | Pages 4 - 4
1 Jun 2022
Hoban K Downie S Adamson D MacLean J Cool P Jariwala AC

Mirels’ score predicts the likelihood of sustaining pathological fractures using pain, lesion site, size and morphology. The aim was to investigate its reproducibility, reliability and accuracy in upper limb bony metastases and validate its use in pathological fracture prediction. This was a retrospective cohort study of patients with upper limb metastases referred to an Orthopaedic Trauma Centre (2013–18). Mirels’ score was calculated in 32 patients; plain radiographs at presentation were scored by 6 raters. Radiological aspects were scored twice by each rater, 2 weeks apart. Inter- and intra-observer reliability were calculated (Fleiss’ kappa test). Bland-Altman plots compared variances of individual score components and total Mirels’ score. Mirels’ score of ≥9 did not accurately predict lesions that would fracture (11% 5/46 vs 65.2% Mirels’ score ≤8, p<0.0001). Sensitivity was 14.3% and specificity was 72.7%. When Mirels’ cut-off was lowered to ≥7, patients were more likely to fracture (48% 22/46 versus 28% 13/46, p=0.045). Sensitivity rose to 62.9%, specificity fell to 54.6%. Kappa values for interobserver variability were 0.358 (fair, 0.288–0.429) for lesion size, 0.107 (poor, 0.02–0.193) for radiological appearance and 0.274 (fair, 0.229–0.318) for total Mirels’ score. Values for intraobserver variability were 0.716 (good, 95% CI 0.432–0.999) for lesion size, 0.427 (moderate, 95% CI 0.195–0.768) for radiological appearance and 0.580 (moderate, 0.395–0.765) for total Mirels’ score. We showed moderate to substantial agreement between and within raters using Mirels’ score on upper limb radiographs. Mirels’ score has poor sensitivity and specificity in predicting upper limb fractures; we recommend the cut-off score for prophylactic surgery should be lower than for lower limb lesions.


The Journal of Bone & Joint Surgery British Volume
Vol. 88-B, Issue 9 | Pages 1204 - 1206
1 Sep 2006
Malek IA Machani B Mevcha AM Hyder NH

Our aim was to assess the reproducibility and the reliability of the Weber classification system for fractures of the ankle based on anteroposterior and lateral radiographs. Five observers with varying clinical experience reviewed 50 sets of blinded radiographs. The same observers reviewed the same radiographs again after an interval of four weeks. Inter- and intra-observer agreement was assessed based on the proportion of agreement and the values of the kappa coefficient. For inter-observer agreement, the mean kappa value was 0.61 (0.59 to 0.63) and the proportion of agreement was 78% (76% to 79%) and for intra-observer agreement the mean kappa value was 0.74 (0.39 to 0.86) with an 85% (60% to 93%) observed agreement. These results show that the Weber classification of fractures of the ankle based on two radiological views has substantial inter-observer reliability and intra-observer reproducibility
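
The abstract above reports both the proportion of agreement and the kappa coefficient. The sketch below shows how the two relate for a single pair of readings, using Cohen’s kappa. The Weber A/B/C labels are invented for illustration, and the abstract does not state how pairwise values were combined into the reported means.

```python
from collections import Counter


def agreement_and_kappa(ratings_a, ratings_b):
    """Return (proportion of agreement, Cohen's kappa) for two readings."""
    n = len(ratings_a)
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n

    # Chance agreement: product of each reading's marginal frequencies.
    freq_a = Counter(ratings_a)
    freq_b = Counter(ratings_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)

    kappa = (observed - expected) / (1 - expected)
    return observed, kappa


# Hypothetical Weber types assigned to ten radiographs on two occasions.
first_reading = ["A", "B", "B", "C", "B", "A", "C", "B", "B", "C"]
second_reading = ["A", "B", "C", "C", "B", "A", "C", "B", "A", "C"]

observed, kappa = agreement_and_kappa(first_reading, second_reading)
print(f"agreement {observed:.0%}, kappa {kappa:.2f}")
```

Kappa is always lower than the raw proportion of agreement whenever some agreement is expected by chance, which is why the abstract’s 78% and 85% agreement correspond to kappa values of 0.61 and 0.74.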


Bone & Joint Research
Vol. 5, Issue 10 | Pages 481 - 489
1 Oct 2016
Handoll HHG Brealey SD Jefferson L Keding A Brooksbank AJ Johnstone AJ Candal-Couto JJ Rangan A

Objectives. Accurate characterisation of fractures is essential in fracture management trials. However, this is often hampered by poor inter-observer agreement. This article describes the practicalities of defining the fracture population, based on the Neer classification, within a pragmatic multicentre randomised controlled trial in which surgical treatment was compared with non-surgical treatment in adults with displaced fractures of the proximal humerus involving the surgical neck. Methods. The trial manual illustrated the Neer classification of proximal humeral fractures. However, in addition to surgical neck displacement, surgeons assessing patient eligibility reported on whether either or both of the tuberosities were involved. Anonymised electronic versions of baseline radiographs were sought for all 250 trial participants. A protocol, data collection tool and training presentation were developed and tested in a pilot study. These were then used in a formal assessment and classification of the trial fractures by two independent senior orthopaedic shoulder trauma surgeons. Results. Two or more baseline radiographic views were obtained for each participant. The independent raters confirmed that all fractures would have been considered for surgery in contemporaneous practice. A full description of the fracture population based on the Neer classification was obtained. The agreement between the categorisation at baseline (tuberosity involvement) and Neer classification as assessed by the two raters was only fair (kappa 0.29). However, this disparity did not appear to affect trial findings, specifically in terms of influencing the effect of treatment on the primary outcome of the trial. Conclusions. A key reporting requirement, namely the description of the fracture population, was achieved within the context of a pragmatic multicentre randomised clinical trial. 
This article provides important guidance for researchers designing similar trials on fracture management. Cite this article: H. H. G. Handoll, S. D. Brealey, L. Jefferson, A. Keding, A. J. Brooksbank, A. J. Johnstone, J. J. Candal-Couto, A. Rangan. Defining the fracture population in a pragmatic multicentre randomised controlled trial: PROFHER and the Neer classification of proximal humeral fractures. Bone Joint Res 2016;5:481–489. DOI: 10.1302/2046-3758.510.BJR-2016-0132.R1


The Bone & Joint Journal
Vol. 101-B, Issue 10 | Pages 1300 - 1306
1 Oct 2019
Oliver WM Smith TJ Nicholson JA Molyneux SG White TO Clement ND Duckworth AD

Aims. The primary aim of this study was to develop a reliable, effective radiological score to assess the healing of humeral shaft fractures, the Radiographic Union Score for HUmeral fractures (RUSHU). The secondary aim was to assess whether the six-week RUSHU was predictive of nonunion at six months after the injury. Patients and Methods. Initially, 20 patients with radiographs six weeks following a humeral shaft fracture were selected at random from a trauma database and scored by three observers, based on the Radiographic Union Scale for Tibial fractures system. After refinement of the RUSHU criteria, a second group of 60 patients with radiographs six weeks after injury, 40 with fractures that united and 20 with fractures that developed nonunion, were scored by two blinded observers. Results. After refinement, the interobserver intraclass correlation coefficient (ICC) was 0.79 (95% confidence interval (CI) 0.67 to 0.87), indicating substantial agreement. At six weeks after injury, patients whose fractures united had a significantly higher median score than those who developed nonunion (10 vs 7; p < 0.001). A receiver operating characteristic curve determined that a RUSHU cut-off of < 8 was predictive of nonunion (area under the curve = 0.84, 95% CI 0.74 to 0.94). The sensitivity was 75% and specificity 80% with a positive predictive value (PPV) of 65% and a negative predictive value of 86%. Patients with a RUSHU < 8 (n = 23) were more likely to develop nonunion than those with a RUSHU ≥ 8 (n = 37, odds ratio 12.0, 95% CI 3.4 to 42.9). Based on a PPV of 65%, if all patients with a RUSHU < 8 underwent fixation, the number of procedures needed to avoid one nonunion would be 1.5. Conclusion. The RUSHU is reliable and effective in identifying patients at risk of nonunion of a humeral shaft fracture at six weeks after injury. This tool requires external validation but could potentially reduce the morbidity associated with delayed treatment of an established nonunion. 
Cite this article: Bone Joint J 2019;101-B:1300–1306
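
The accuracy figures in the abstract above can be reproduced from a 2 × 2 table. The abstract does not give the table itself, so the four cell counts below are an assumption, back-calculated from the reported group sizes (20 nonunions, 40 unions, 23 patients with RUSHU < 8, 37 with RUSHU ≥ 8) and the reported sensitivity and specificity.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard accuracy measures from the cells of a 2 x 2 table.

    tp: test positive, condition present   fp: test positive, condition absent
    fn: test negative, condition present   tn: test negative, condition absent
    """
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "odds_ratio": (tp * tn) / (fp * fn),
    }


# Assumed counts (not stated in the abstract): "positive" means RUSHU < 8,
# "condition present" means nonunion at six months.
metrics = diagnostic_metrics(tp=15, fp=8, fn=5, tn=32)

# Procedures needed to avoid one nonunion if every test-positive patient
# underwent fixation, taken as the reciprocal of the PPV.
procedures_per_nonunion_avoided = 1 / metrics["ppv"]

for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
print(f"procedures per nonunion avoided: {procedures_per_nonunion_avoided:.1f}")
```

Under these assumed counts the sketch returns sensitivity 75%, specificity 80%, PPV 65%, NPV 86%, an odds ratio of 12.0 and 1.5 procedures per nonunion avoided, all consistent with the reported values.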


Orthopaedic Proceedings
Vol. 94-B, Issue SUPP_XXXVII | Pages 29 - 29
1 Sep 2012
Bajada S Harrison P Mofidi A Richardson J

Introduction. Regenerative medicine is a rapidly expanding discipline. However, due to a lack of validated outcome measures, clinical trials have been few and far between. This study aims to assess the validity, inter-observer reliability and intra-observer reproducibility of experimental fracture healing assessment on plain radiographs. This technique involves implantation of mesenchymal stem cell (MSC) seeded constructs on only one side of the fracture after randomisation. Methods. We examined inter/intraobserver agreement on the area and “bridging length” of callus formed on opposite sides of the fracture among 16 orthopaedic surgeons with trauma commitments (8 consultants, 8 registrars) on two separate occasions (average 52 days apart). They independently assessed the radiographs (AP or lateral) of 28 patients with fractures of the tibial or femoral shaft. The fractures chosen included non-unions treated with MSC/constructs and fresh fractures at 4–9 months. For each radiograph the assessor indicated which side (medial or lateral) had more callus. Chance-corrected agreement using Fleiss’ kappa was used to compare opinions. Digital analysis software (Image-J) was used to quantify extent/bridging callus and correlate it with the surgeons’ opinion. Results. Inter-observer variation showed a substantial overall agreement (k = 0.716) on the fracture side containing a larger “area” of callus but moderate agreement (k = 0.489) on the side with more “bridging length”. These results were reproducible with a substantial overall intraobserver agreement. MSC/construct treated non-unions showed a larger amount of agreement than fresh fractures for area (k = 0.754 vs 0.613) and bridging (0.550 vs 0.406). Utilizing digital analysis, non-unions showed a significantly larger quantifiable difference between sides than fresh fractures (p = 0.009) for area but not bridging length (p = 0.269).
Digital analysis quantification and the surgeons’ opinion showed an almost perfect agreement for area (k = 0.867) and bridging (k = 0.846). Discussion. In this study we aimed to validate a novel method for studying the efficacy and effect of regenerative techniques on fracture healing, in particular the use of plain radiographs to compare a treatment side with an internal control side. We showed that this method of assessing area of callus is valid, reliable and reproducible. This is particularly so for MSC/construct treated non-unions, where the difference between the two sides is larger as quantified by digital analysis. This is a novel method of experimental fracture healing assessment using an internal control, which decreases the variation between groups and the sample size needed. This makes regenerative medicine clinical trials easier.


The Journal of Bone & Joint Surgery British Volume
Vol. 89-B, Issue 1 | Pages 72 - 76
1 Jan 2007
Patel V Day A Dinah F Kelly M Bircher M

Specific radiological features identified by Brandser and Marsh were selected for the analysis of acetabular fractures according to the classification of Letournel and Judet. The method employs a binary approach that requires the observer to allocate each radiological feature to one of two groups. The inter- and intra-observer variances were assessed. The presence of articular displacement, marginal impaction, incongruity, intra-articular fragments and osteochondral injuries to the femoral head were analysed by a similar method. These factors were termed ‘modifiers’ and are generally considered when planning operative intervention and, critically, they may influence prognosis. Six observers independently assessed 30 sets of plain radiographs and CT scans on two separate occasions, 12 weeks apart. They were asked to determine the presence or absence of specific radiological features. This simple binary approach to classification yields an inter- and intra-observer agreement which ranges from moderate to near-perfect (κ = 0.49 to 0.88 and κ = 0.57 to 0.88, respectively). A similar approach to the modifiers yields only slight to fair inter-observer agreement (κ = 0.20 to 0.34) and slight to moderate intra-observer agreement (κ = 0 to 0.55)