Results 1 - 20 of 1239
The Bone & Joint Journal
Vol. 105-B, Issue 10 | Pages 1123 - 1130
1 Oct 2023
Donnan M Anderson N Hoq M Donnan L

Aims. The aim of this study was to investigate the agreement in interpretation of the quality of the paediatric hip ultrasound examination, the reliability of geometric and morphological assessment, and the relationship between these measurements. Methods. Four investigators evaluated 60 hip ultrasounds and assessed their quality based on the standard plane of Graf et al. They measured geometric parameters, described the morphology of the hip, and assigned the Graf grade of dysplasia. They analyzed one self-selected image and one randomly selected image from the ultrasound series, and repeated the process four weeks later. The intra- and interobserver agreement, and correlations between various parameters were analyzed. Results. In the assessment of quality, there was moderate to substantial intraobserver agreement for each element investigated, but interobserver agreement was poor. Morphological features showed weak to moderate agreement across all parameters but improved to significant when responses were reduced. The geometric measurements showed nearly perfect agreement, and the relationship between them and the morphological features showed a dose response across all parameters with moderate to substantial correlations. There were strong correlations between geometric measurements. The Graf classification showed a fair to moderate interobserver agreement, and moderate to substantial intraobserver agreement. Conclusion. This investigation into the reliability of the interpretation of hip ultrasound scans identified the difficulties in defining what is a high-quality ultrasound. We confirmed that geometric measurements are reliably interpreted and may be useful as a further measurement of quality. Morphological features are generally poorly interpreted, but a simpler binary classification considerably improves agreement. As there is a clear dose response relationship between geometric and morphological measurements, the importance of morphology in the diagnosis of hip dysplasia should be questioned. Cite this article: Bone Joint J 2023;105-B(10):1123–1130


Bone & Joint Research
Vol. 4, Issue 12 | Pages 190 - 194
1 Dec 2015
Kleinlugtenbelt YV Hoekstra M Ham SJ Kloen P Haverlag R Simons MP Bhandari M Goslings JC Poolman RW Scholtes VAB

Objectives. Current studies on the additional benefit of using computed tomography (CT) in order to evaluate the surgeons’ agreement on treatment plans for fracture are inconsistent. This inconsistency can be explained by a methodological phenomenon called ‘spectrum bias’, defined as the bias inherent when investigators choose a population lacking therapeutic uncertainty for evaluation. The aim of the study is to determine the influence of spectrum bias on the intra-observer agreement of treatment plans for fractures of the distal radius. Methods. Four surgeons evaluated 51 patients with displaced fractures of the distal radius at four time points: T1 and T2: conventional radiographs; T3 and T4: radiographs and additional CT scan (radiograph and CT). Choice of treatment plan (operative or non-operative) and therapeutic certainty (five-point scale: very uncertain to very certain) were rated. To determine the influence of spectrum bias, the intra-observer agreement was analysed, using Kappa statistics, for each degree of therapeutic certainty. Results. In cases with high therapeutic certainty, intra-observer agreement based on radiograph was almost perfect (0.86 to 0.90), but decreased to moderate based on a radiograph and CT (0.47 to 0.60). In cases with high therapeutic uncertainty, intra-observer agreement was slight at best (-0.12 to 0.19), but increased to moderate based on the radiograph and CT (0.56 to 0.57). Conclusion. Spectrum bias influenced the outcome of this agreement study on treatment plans. An additional CT scan improves the intra-observer agreement on treatment plans for a fracture of the distal radius only when there is therapeutic uncertainty. Reporting and analysing intra-observer agreement based on the surgeon’s level of certainty is an appropriate method to minimise spectrum bias. Cite this article: Bone Joint Res 2015;4:190–194
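
The kappa statistic used for intra-observer agreement in the abstract above corrects raw agreement for the agreement expected by chance. The following is a minimal sketch of unweighted Cohen's kappa for one observer rating the same cases on two occasions; the function name and the ten ratings are hypothetical illustrations, not data or code from the study.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Unweighted Cohen's kappa for two equally long lists of categorical ratings."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("need two non-empty rating lists of equal length")
    n = len(ratings_a)
    # Observed agreement: proportion of cases given the same label both times.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Chance agreement: product of the marginal proportions, summed over labels.
    count_a, count_b = Counter(ratings_a), Counter(ratings_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / n**2
    if expected == 1:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1 - expected)


# Hypothetical treatment plans chosen by one surgeon at two time points.
first_read = ["op", "op", "non", "non", "op", "non", "op", "op", "non", "non"]
second_read = ["op", "op", "non", "op", "op", "non", "op", "non", "non", "non"]
kappa = cohen_kappa(first_read, second_read)
print(f"kappa = {kappa:.2f}")
```

In this made-up example the two reads agree on 8 of 10 cases and chance agreement is 0.5, so kappa is (0.8 - 0.5) / (1 - 0.5) = 0.6.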


The Journal of Bone & Joint Surgery British Volume
Vol. 70-B, Issue 2 | Pages 299 - 301
1 Mar 1988
Dias J Taylor M Thompson J Brenkel I Gregg P

Inter-observer agreement and reproducibility of opinion were assessed for the radiographic diagnosis of union of scaphoid fractures on films taken 12 weeks after injury. Weighted kappa statistics were used to compare the opinions of eight senior observers reviewing 20 sets of good quality radiographs on two occasions separated by two months. There was poor agreement on whether trabeculae crossed the fracture line, whether there was sclerosis at or near the fracture and on whether the proximal part of the scaphoid was avascular. As a consequence, agreement on union also was poor; it appears that radiographs taken 12 weeks after a scaphoid fracture do not provide reliable and reproducible evidence of healing.


Bone & Joint 360
Vol. 10, Issue 6 | Pages 8 - 10
1 Dec 2021
Spacey K Wimhurst J Hasan R Sharma D


The Journal of Bone & Joint Surgery British Volume
Vol. 30-B, Issue 1 | Pages 4 - 6
1 Feb 1948


The Bone & Joint Journal
Vol. 105-B, Issue 9 | Pages 1007 - 1012
1 Sep 2023
Hoeritzauer I Paterson M Jamjoom AAB Srikandarajah N Soleiman H Poon MTC Copley PC Graves C MacKay S Duong C Leung AHC Eames N Statham PFX Darwish S Sell PJ Thorpe P Shekhar H Roy H Woodfield J

Aims. Patients with cauda equina syndrome (CES) require emergency imaging and surgical decompression. The severity and type of symptoms may influence the timing of imaging and surgery, and help predict the patient’s prognosis. Categories of CES attempt to group patients for management and prognostication purposes. We aimed in this study to assess the inter-rater reliability of dividing patients with CES into categories to assess whether they can be reliably applied in clinical practice and in research. Methods. A literature review was undertaken to identify published descriptions of categories of CES. A total of 100 real anonymized clinical vignettes of patients diagnosed with CES from the Understanding Cauda Equina Syndrome (UCES) study were reviewed by consultant spinal surgeons, neurosurgical registrars, and medical students. All were provided with published category definitions and asked to decide whether each patient had ‘suspected CES’; ‘early CES’; ‘incomplete CES’; or ‘CES with urinary retention’. Inter-rater agreement was assessed for all categories, for all raters, and for each group of raters using Fleiss’s kappa. Results. Each of the 100 participants was rated by four medical students, five neurosurgical registrars, and four consultant spinal surgeons. No groups achieved reasonable inter-rater agreement for any of the categories. CES with retention versus all other categories had the highest inter-rater agreement (kappa 0.34 (95% confidence interval 0.27 to 0.31); minimal agreement). There was no improvement in inter-rater agreement with clinical experience. Across all categories, registrars agreed with each other most often (kappa 0.41), followed by medical students (kappa 0.39). Consultant spinal surgeons had the lowest inter-rater agreement (kappa 0.17). Conclusion. Inter-rater agreement for categorizing CES is low among clinicians who regularly manage these patients. CES categories should be used with caution in clinical practice and research studies, as groups may be heterogenous and not comparable. Cite this article: Bone Joint J 2023;105-B(9):1007–1012


Bone & Joint Research
Vol. 13, Issue 1 | Pages 19 - 27
5 Jan 2024
Baertl S Rupp M Kerschbaum M Morgenstern M Baumann F Pfeifer C Worlicek M Popp D Amanatullah DF Alt V

Aims. This study aimed to evaluate the clinical application of the PJI-TNM classification for periprosthetic joint infection (PJI) by determining intraobserver and interobserver reliability. To facilitate its use in clinical practice, an educational app was subsequently developed and evaluated. Methods. A total of ten orthopaedic surgeons classified 20 cases of PJI based on the PJI-TNM classification. Subsequently, the classification was re-evaluated using the PJI-TNM app. Classification accuracy was calculated separately for each subcategory (reinfection, tissue and implant condition, non-human cells, and morbidity of the patient). Fleiss’ kappa and Cohen’s kappa were calculated for interobserver and intraobserver reliability, respectively. Results. Overall, interobserver and intraobserver agreements were substantial across the 20 classified cases. Analyses for the variable ‘reinfection’ revealed an almost perfect interobserver and intraobserver agreement with a classification accuracy of 94.8%. The category ‘tissue and implant conditions’ showed moderate interobserver and substantial intraobserver reliability, while the classification accuracy was 70.8%. For ‘non-human cells’, accuracy was 81.0% and interobserver agreement was moderate with an almost perfect intraobserver reliability. The classification accuracy of the variable ‘morbidity of the patient’ reached 73.5% with a moderate interobserver agreement, whereas the intraobserver agreement was substantial. The application of the app yielded comparable results across all subgroups. Conclusion. The PJI-TNM classification system captures the heterogeneity of PJI and can be applied with substantial inter- and intraobserver reliability. The PJI-TNM educational app aims to facilitate application in clinical practice. A major limitation was the correct assessment of the implant situation. To eliminate this, a re-evaluation according to intraoperative findings is strongly recommended. Cite this article: Bone Joint Res 2024;13(1):19–27


Aims. Classifying trochlear dysplasia (TD) is useful to determine the treatment options for patients suffering from patellofemoral instability (PFI). There is no consensus on which classification system is more reliable and reproducible for the purpose of guiding clinicians’ management of PFI. There are also concerns about the validity of the Dejour Classification (DJC), which is the most widely used classification for TD, having only a fair reliability score. The Oswestry-Bristol Classification (OBC) is a recently proposed system of classification of TD, and the authors report a fair-to-good interobserver agreement and good-to-excellent intraobserver agreement in the assessment of TD. The aim of this study was to compare the reliability and reproducibility of these two classifications. Methods. In all, six assessors (four consultants and two registrars) independently evaluated 100 axial MRIs of the patellofemoral joint (PFJ) for TD and classified them according to OBC and DJC. These assessments were again repeated by all raters after four weeks. The inter- and intraobserver reliability scores were calculated using Cohen’s kappa and Cronbach’s α. Results. Both classifications showed good to excellent interobserver reliability with high α scores. The OBC classification showed a substantial intraobserver agreement (mean kappa 0.628; p < 0.005) whereas the DJC showed a moderate agreement (mean kappa 0.572; p < 0.005). There was no significant difference in the kappa values when comparing the assessments by consultants with those by registrars, in either classification system. Conclusion. This large study from a non-founding institute shows both classification systems to be reliable for classifying TD based on axial MRIs of the PFJ, with the simple-to-use OBC having a higher intraobserver reliability score than that of the DJC. Cite this article: Bone Jt Open 2023;4(7):532–538


Bone & Joint Research
Vol. 12, Issue 5 | Pages 313 - 320
8 May 2023
Saiki Y Kabata T Ojima T Kajino Y Kubo N Tsuchiya H

Aims. We aimed to assess the reliability and validity of OpenPose, a posture estimation algorithm, for measurement of knee range of motion after total knee arthroplasty (TKA), in comparison to radiography and goniometry. Methods. In this prospective observational study, we analyzed 35 primary TKAs (24 patients) for knee osteoarthritis. We measured the knee angles in flexion and extension using OpenPose, radiography, and goniometry. We assessed the test-retest reliability of each method using intraclass correlation coefficient (1,1). We evaluated the ability to estimate other measurement values from the OpenPose value using linear regression analysis. We used intraclass correlation coefficients (2,1) and Bland–Altman analyses to evaluate the agreement and error between radiography and the other measurements. Results. OpenPose had excellent test-retest reliability (intraclass correlation coefficient (1,1) = 1.000). The R² of all regression models indicated large correlations (0.747 to 0.927). In the flexion position, the intraclass correlation coefficients (2,1) of OpenPose indicated excellent agreement (0.953) with radiography. In the extension position, the intraclass correlation coefficients (2,1) indicated good agreement of OpenPose and radiography (0.815) and moderate agreement of goniometry with radiography (0.593). OpenPose had no systematic error in the flexion position, and a 2.3° fixed error in the extension position, compared to radiography. Conclusion. OpenPose is a reliable and valid tool for measuring flexion and extension positions after TKA. It has better accuracy than goniometry, especially in the extension position. Accurate measurement values can be obtained with low error, high reproducibility, and no contact, independent of the examiner’s skills. Cite this article: Bone Joint Res 2023;12(5):313–320


The Bone & Joint Journal
Vol. 105-B, Issue 12 | Pages 1259 - 1264
1 Dec 2023
Hurley ET Hughes AJ Savage-Elliott I Dejour D Campbell KA Mulcahey MK Wittstein JR Jazrawi LM

Aims. The aim of this study was to establish consensus statements on the diagnosis, nonoperative management, and indications, if any, for medial patellofemoral complex (MPFC) repair in patients with patellar instability, using the modified Delphi approach. Methods. A total of 60 surgeons from 11 countries were invited to develop consensus statements based on their expertise in this area. They were assigned to one of seven working groups defined by subtopics of interest within patellar instability. Consensus was defined as achieving between 80% and 89% agreement, strong consensus was defined as between 90% and 99% agreement, and 100% agreement was considered to be unanimous. Results. Of 27 questions and statements on patellar instability, three achieved unanimous consensus, 14 achieved strong consensus, five achieved consensus, and five did not achieve consensus. Conclusion. The statements that reached unanimous consensus were that an assessment of physeal status is critical for paediatric patients with patellar instability. There was also unanimous consensus on early mobilization and resistance training following nonoperative management once there is no apprehension. The statements that did not achieve consensus were on the importance of immobilization of the knee, the use of orthobiologics in nonoperative management, the indications for MPFC repair, and whether a vastus medialis oblique advancement should be performed. Cite this article: Bone Joint J 2023;105-B(12):1259–1264
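
The consensus bands defined in the abstract above can be written as a small lookup. This is only an illustrative sketch: the function name is hypothetical, and treating the bands as contiguous (so that, for example, 99.5% counts as strong consensus) is an assumption, since the abstract gives whole-number ranges only.

```python
def consensus_level(agreement_pct: float) -> str:
    """Label a Delphi statement by its percentage agreement.

    Bands follow the abstract: 80-89% consensus, 90-99% strong consensus,
    100% unanimous. Values between the stated bands are assigned to the
    lower band, which is an assumption made for this sketch.
    """
    if not 0 <= agreement_pct <= 100:
        raise ValueError("agreement must be a percentage between 0 and 100")
    if agreement_pct == 100:
        return "unanimous"
    if agreement_pct >= 90:
        return "strong consensus"
    if agreement_pct >= 80:
        return "consensus"
    return "no consensus"


# Hypothetical agreement percentages for four statements.
for pct in (100, 93, 85, 72):
    print(pct, consensus_level(pct))
```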


The Bone & Joint Journal
Vol. 103-B, Issue 12 | Pages 1802 - 1808
1 Dec 2021
Bruce J Knight R Parsons N Betteridge R Verdon A Brown J Campolier M Achten J Costa ML

Aims. Deep surgical site infection (SSI) is common after lower limb fracture. We compared the diagnosis of deep SSI using alternative methods of data collection and examined the agreement of clinical photography and in-person clinical assessment by the Centers for Disease Control and Prevention (CDC) criteria after lower limb fracture surgery. Methods. Data from two large, UK-based multicentre randomized controlled major trauma trials investigating SSI and wound healing after surgical repair of open lower limb fractures that could not be primarily closed (UK WOLLF), and surgical incisions for fractures that were primarily closed (UK WHiST), were examined. Trial interventions were standard wound care management and negative pressure wound therapy after initial surgical debridement. Wound outcomes were collected from 30 days to six weeks. We compared the level of agreement between wound photography and clinical assessment of CDC-defined SSI. We also assessed the level of agreement between blinded independent assessors of the photographs. Results. Rates of CDC-defined deep SSI were 7.6% (35/460) after open fracture and 6.3% (95/1519) after closed incisional repair. Photographs were obtained for 77% and 73% of WOLLF and WHiST cohorts respectively (all participants n = 1,478). Agreement between photographic-SSI and CDC-SSI was fair for open fracture wounds (83%; k = 0.27 (95% confidence interval (CI) 0.14 to 0.42)) and for closed incisional wounds (88%; k = 0.29 (95% CI 0.20 to 0.37)) although the rate of photographically detected deep SSIs was twice as high as CDC-SSI (12% vs 6%). Agreement between different assessors for photographic-SSI (WOLLF 88%, k = 0.63 (95% CI 0.52 to 0.72); WHiST 89%; k = 0.61 (95% CI 0.54 to 0.69)); and wound healing was good (WOLLF 90%; k = 0.80 (95% CI 0.73 to 0.86); WHiST 87%; k = 0.57 (95% CI 0.50 to 0.64)). Conclusion. Although wound photography was feasible within the research context and inter-rater assessor agreement substantial, digital photographs used in isolation overestimated deep SSI rates, when compared to CDC criteria. Wound photography should not replace clinical assessment in pragmatic trials but may be useful for screening purposes where surgical infection outcomes are paramount. Cite this article: Bone Joint J 2021;103-B(12):1802–1808


The Bone & Joint Journal
Vol. 105-B, Issue 12 | Pages 1265 - 1270
1 Dec 2023
Hurley ET Sherman SL Chahla J Gursoy S Alaia MJ Tanaka MJ Pace JL Jazrawi LM

Aims. The aim of this study was to establish consensus statements on medial patellofemoral ligament (MPFL) reconstruction, anteromedialization tibial tubercle osteotomy, trochleoplasty, and rehabilitation and return to sporting activity in patients with patellar instability, using the modified Delphi process. Methods. This was the second part of a study dealing with these aspects of management in these patients. As in part I, a total of 60 surgeons from 11 countries contributed to the development of consensus statements based on their expertise in this area. They were assigned to one of seven working groups defined by subtopics of interest. Consensus was defined as achieving between 80% and 89% agreement, strong consensus was defined as between 90% and 99% agreement, and 100% agreement was considered unanimous. Results. Of 41 questions and statements on patellar instability, none achieved unanimous consensus, 19 achieved strong consensus, 15 achieved consensus, and seven did not achieve consensus. Conclusion. Most statements reached some degree of consensus, without any achieving unanimous consensus. There was no consensus on the use of anchors in MPFL reconstruction, and the order of fixation of the graft (patella first versus femur first). There was also no consensus on the indications for trochleoplasty or its effect on the viability of the cartilage after elevation of the osteochondral flap. There was also no consensus on postoperative immobilization or weightbearing, or whether paediatric patients should avoid an early return to sport. Cite this article: Bone Joint J 2023;105-B(12):1265–1270


The Bone & Joint Journal
Vol. 103-B, Issue 8 | Pages 1339 - 1344
1 Aug 2021
Jain S Mohrir G Townsend O Lamb JN Palan J Aderinto J Pandit H

Aims. The aim of this study was to assess the reliability and validity of the Unified Classification System (UCS) for postoperative periprosthetic femoral fractures (PFFs) around cemented polished taper-slip (PTS) stems. Methods. Radiographs of 71 patients with a PFF admitted consecutively at two centres between 25 February 2012 and 19 May 2020 were collated by an independent investigator. Six observers (three hip consultants and three trainees) were familiarized with the UCS. Each PFF was classified on two separate occasions, with a mean time between assessments of 22.7 days (16 to 29). Interobserver reliability for more than two observers was assessed using percentage agreement and Fleiss’ kappa statistic. Intraobserver reliability between two observers was calculated with Cohen kappa statistic. Validity was tested on surgically managed UCS type B PFFs where stem stability was documented in operation notes (n = 50). Validity was assessed using percentage agreement and Cohen kappa statistic between radiological assessment and intraoperative findings. Kappa statistics were interpreted using Landis and Koch criteria. All six observers were blinded to operation notes and postoperative radiographs. Results. Interobserver reliability percentage agreement was 58.5% and the overall kappa value was 0.442 (moderate agreement). Lowest kappa values were seen for type B fractures (0.095 to 0.360). The mean intraobserver reliability kappa value was 0.672 (0.447 to 0.867), indicating substantial agreement. Validity percentage agreement was 65.7% and the mean kappa value was 0.300 (0.160 to 0.440) indicating only fair agreement. Conclusion. This study demonstrates that the UCS is unsatisfactory for the classification of PFFs around PTS stems, and that it has considerably lower reliability and validity than previously described for other stem types. Radiological PTS stem loosening in the presence of PFF is poorly defined and formal intraoperative testing of stem stability is recommended. Cite this article: Bone Joint J 2021;103-B(8):1339–1344
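
The Landis and Koch criteria mentioned in the abstract above map a kappa value to a verbal label. A minimal sketch of that mapping follows; the function name is hypothetical, and the cut-offs are the commonly cited Landis and Koch bands rather than anything specified in the abstract itself.

```python
def landis_koch(kappa: float) -> str:
    """Return the Landis and Koch strength-of-agreement label for a kappa value."""
    if not -1.0 <= kappa <= 1.0:
        raise ValueError("kappa must lie between -1 and 1")
    if kappa < 0:
        return "poor"
    if kappa <= 0.20:
        return "slight"
    if kappa <= 0.40:
        return "fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"


# Kappa values reported in the abstract above.
for value in (0.442, 0.672, 0.300):
    print(value, landis_koch(value))
```

Applied to the reported values, the sketch reproduces the labels used in the abstract: 0.442 is moderate, 0.672 is substantial, and 0.300 is fair.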


The Bone & Joint Journal
Vol. 106-B, Issue 9 | Pages 898 - 906
1 Sep 2024
Kayani B Wazir MUK Mancino F Plastow R Haddad FS

Aims. The primary objective of this study was to develop a validated classification system for assessing iatrogenic bone trauma and soft-tissue injury during total hip arthroplasty (THA). The secondary objective was to compare macroscopic bone trauma and soft-tissue injury in conventional THA (CO THA) versus robotic arm-assisted THA (RO THA) using this classification system. Methods. This study included 30 CO THAs versus 30 RO THAs performed by a single surgeon. Intraoperative photographs of the osseous acetabulum and periacetabular soft-tissues were obtained prior to implantation of the acetabular component, which were used to develop the proposed classification system. Interobserver and intraobserver variabilities of the proposed classification system were assessed. Results. The BOne trauma and Soft-Tissue Injury classification system in total Hip arthroplasty (BOSTI Hip) grades osseous acetabular trauma and periarticular muscle damage during THA. The classification system has an intraclass correlation coefficient of 0.90 (95% CI 0.86 to 0.93) for interobserver agreement and 0.89 (95% CI 0.84 to 0.93) for intraobserver agreement. RO THA was associated with improved BOSTI Hip scores (p = 0.002) and more pristine osseous surfaces in the anterior superior (p = 0.001) and posterior superior (p < 0.001) acetabular quadrants compared with CO THA. There were no differences between the groups in relation to injury to the gluteus medius (p = 0.084), obturator internus (p = 0.241), piriformis (p = 0.081), superior gemellus (p = 0.116), inferior gemellus (p = 0.132), quadratus femoris (p = 0.208), and vastus lateralis (p = 0.135), but overall combined muscle injury was reduced in RO THA compared with CO THA (p = 0.023). Discussion. The proposed BOSTI Hip classification provides a reproducible grading system for stratifying iatrogenic bone trauma and soft-tissue injury during THA. RO THA was associated with improved BOSTI Hip scores, more pristine osseous acetabular surfaces, and reduced combined periarticular muscle injury compared with CO THA. Further research is required to understand if these intraoperative findings translate to differences in clinical outcomes between the treatment groups. Cite this article: Bone Joint J 2024;106-B(9):898–906


The Bone & Joint Journal
Vol. 101-B, Issue 10 | Pages 1292 - 1299
1 Oct 2019
Masters J Metcalfe D Parsons NR Achten J Griffin XL Costa ML

Aims. This study explores data quality in operation type and fracture classification recorded as part of a large research study and a national audit with an independent review. Patients and Methods. At 17 centres, an expert surgeon reviewed a randomly selected subset of cases from their centre with regard to fracture classification using the AO system and type of operation performed. Agreement for these variables was then compared with the data collected during conduct of the World Hip Trauma Evaluation (WHiTE) cohort study. Both types of surgery and fracture classification were collapsed to identify the level of detail of reporting that achieved meaningful agreement. In the National Hip Fracture Database (NHFD), the types of operation and fracture classification were explored to identify the proportion of “highly improbable” combinations. Results. The records were reviewed for 903 cases. Agreement for the subtypes of extracapsular fracture was poor; most centres achieved no better than “fair” agreement. When the classification was collapsed to a single option for “extracapsular” fracture, only four centres failed to have at least “moderate” agreement. There was only “moderate” agreement for the subtypes of intracapsular fracture, which improved to “substantial” when collapsed to “intracapsular”. Subtrochanteric fracture types were well reported with “substantial” agreement. There was near “perfect” agreement for internal fixation procedures. “Perfect” or “substantial” agreement was achieved when the type of arthroplasty surgery was reported at the level of “hemiarthroplasty” and “total hip replacement”. When reviewing data submitted to the NHFD, a minimum of 5.2% of cases contained “highly improbable” procedures for the stated fracture classification. Conclusion. The complexity of collecting fracture classification data at a national scale compromises the accuracy with which detailed classification systems can be reported. Data around type of surgery performed show similar tendencies. Data capture, reporting, and interpretation in future studies must take this into account. Cite this article: Bone Joint J 2019;101-B:1292–1299


Bone & Joint Open
Vol. 2, Issue 8 | Pages 638 - 645
1 Aug 2021
Garner AJ Edwards TC Liddle AD Jones GG Cobb JP

Aims. Joint registries classify all further arthroplasty procedures to a knee with an existing partial arthroplasty as revision surgery, regardless of the actual procedure performed. Relatively minor procedures, including bearing exchanges, are classified in the same way as major operations requiring augments and stems. A new classification system is proposed to acknowledge and describe the detail of these procedures, which has implications for risk, recovery, and health economics. Methods. Classification categories were proposed by a surgical consensus group, then ranked by patients, according to perceived invasiveness and implications for recovery. In round one, 26 revision cases were classified by the consensus group. Results were tested for inter-rater reliability. In round two, four additional cases were added for clarity. Round three repeated the survey one month later, subject to inter- and intrarater reliability testing. In round four, five additional expert partial knee arthroplasty surgeons were asked to classify the 30 cases according to the proposed revision partial knee classification (RPKC) system. Results. Four classes were proposed: PR1, where no bone-implant interfaces are affected; PR2, where surgery does not include conversion to total knee arthroplasty, for example, a second partial arthroplasty to a native compartment; PR3, when a standard primary total knee prosthesis is used; and PR4 when revision components are necessary. Round one resulted in 92% inter-rater agreement (Kendall’s W 0.97; p < 0.005), rising to 93% in round two (Kendall’s W 0.98; p < 0.001). Round three demonstrated 97% agreement (Kendall’s W 0.98; p < 0.001), with high intra-rater reliability (intraclass correlation coefficient (ICC) 0.99; 95% confidence interval 0.98 to 0.99). Round four resulted in 80% agreement (Kendall’s W 0.92; p < 0.001). Conclusion. The RPKC system accounts for all procedures which may be appropriate following partial knee arthroplasty. It has been shown to be reliable, repeatable and pragmatic. The implications for patient care and health economics are discussed. Cite this article: Bone Jt Open 2021;2(8):638–645


The Bone & Joint Journal
Vol. 104-B, Issue 6 | Pages 758 - 764
1 Jun 2022
Gelfer Y Davis N Blanco J Buckingham R Trees A Mavrotas J Tennant S Theologis T

Aims. The aim of this study was to gain an agreement on the management of idiopathic congenital talipes equinovarus (CTEV) up to walking age in order to provide a benchmark for practitioners and guide consistent, high-quality care for children with CTEV. Methods. The consensus process followed an established Delphi approach with a predetermined degree of agreement. The process included the following steps: establishing a steering group; steering group meetings, generating statements, and checking them against the literature; a two-round Delphi survey; and final consensus meeting. The steering group members and Delphi survey participants were all British Society of Children’s Orthopaedic Surgery (BSCOS) members. Descriptive statistics were used for analysis of the Delphi survey results. The Appraisal of Guidelines for Research & Evaluation checklist was followed for reporting of the results. Results. The BSCOS-selected steering group, the steering group meetings, the Delphi survey, and the final consensus meeting all followed the pre-agreed protocol. A total of 153/243 members voted in round 1 Delphi (63%) and 132 voted in round 2 (86%). Out of 61 statements presented to round 1 Delphi, 43 reached ‘consensus in’, no statements reached ‘consensus out’, and 18 reached ‘no consensus’. Four statements were deleted and one new statement added following suggestions from round 1. Out of 15 statements presented to round 2, 12 reached ‘consensus in’, no statements reached ‘consensus out’, and three reached ‘no consensus’ and were discussed and included following the final consensus meeting. Two statements were combined for simplicity. The final consensus document includes 57 statements allocated into six successive stages. Conclusion. We have produced a consensus document for the treatment of idiopathic CTEV up to walking age. This will provide a benchmark for standard of care in the UK and will help to reduce geographical variability in treatment and outcomes. Appropriate dissemination and implementation will be key to its success. Cite this article: Bone Joint J 2022;104-B(6):758–764


Bone & Joint Research
Vol. 10, Issue 12 | Pages 759 - 766
1 Dec 2021
Nicholson JA Oliver WM MacGillivray TJ Robinson CM Simpson AHRW

Aims. The aim of this study was to establish a reliable method for producing 3D reconstruction of sonographic callus. Methods. A cohort of ten closed tibial shaft fractures managed with intramedullary nailing underwent ultrasound scanning at two, six, and 12 weeks post-surgery. Ultrasound capture was performed using infrared tracking technology to map each image to a 3D lattice. Using echo intensity, semi-automated mapping was performed to produce an anatomical 3D representation of the fracture site. Two reviewers independently performed 3D reconstructions and kappa coefficient was used to determine agreement. A further validation study was undertaken with ten reviewers to estimate the clinical application of this imaging technique using the intraclass correlation coefficient (ICC). Results. Nine of the ten patients achieved union at six months. At six weeks, seven patients had bridging callus of ≥ one cortex on the 3D reconstruction and when present all achieved union. By comparison, six-week radiographs showed no bridging callus in any patient. Of the three patients lacking sonographic bridging callus, one went on to a nonunion (77.8% sensitive and 100% specific to predict union). At 12 weeks, nine patients had bridging callus at ≥ one cortex on 3D reconstruction (100% sensitive and 100% specific to predict union). Presence of sonographic bridging callus on 3D reconstruction demonstrated excellent reviewer agreement on ICC at 0.87 (95% confidence interval 0.74 to 0.96). Conclusion. 3D fracture reconstruction can be created using multiple ultrasound images in order to evaluate the presence of bridging callus. This imaging modality has the potential to enhance the usability and accuracy of identification of early fracture healing. Cite this article: Bone Joint Res 2021;10(12):759–766
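
The six-week sensitivity and specificity quoted in the abstract above can be checked from the counts it reports. The sketch below reconstructs a 2x2 table under a stated assumption: of nine unions, seven showed sonographic bridging callus (true positives) and two did not (false negatives), while the single nonunion lacked callus (true negative) and no nonunion showed callus (false positives = 0). The function name is hypothetical.

```python
def sensitivity_specificity(tp: int, fn: int, tn: int, fp: int):
    """Return (sensitivity, specificity) from the four cells of a 2x2 table."""
    sensitivity = tp / (tp + fn)  # proportion of unions correctly predicted
    specificity = tn / (tn + fp)  # proportion of nonunions correctly predicted
    return sensitivity, specificity


# Counts reconstructed from the six-week results in the abstract (assumption).
sens, spec = sensitivity_specificity(tp=7, fn=2, tn=1, fp=0)
print(f"{sens:.1%} sensitive, {spec:.0%} specific")  # 77.8% sensitive, 100% specific
```

With only one nonunion in the cohort, the specificity rests on a single case, which is worth keeping in mind when reading the 100% figure.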


The Bone & Joint Journal
Vol. 102-B, Issue 4 | Pages 478 - 484
1 Apr 2020
Daniels AM Wyers CE Janzing HMJ Sassen S Loeffen D Kaarsemaker S van Rietbergen B Hannemann PFW Poeze M van den Bergh JP

Aims. Besides conventional radiographs, the use of MRI, CT, and bone scintigraphy is frequent in the diagnosis of a fracture of the scaphoid. However, which techniques give the best results remains unknown. The investigation of a new imaging technique initially requires an analysis of its precision. The primary aim of this study was to investigate the interobserver agreement of high-resolution peripheral quantitative CT (HR-pQCT) in the diagnosis of a scaphoid fracture. A secondary aim was to investigate the interobserver agreement for the presence of other fractures and for the classification of scaphoid fracture. Methods. Two radiologists and two orthopaedic trauma surgeons evaluated HR-pQCT scans of 31 patients with a clinically-suspected scaphoid fracture. The observers were asked to determine the presence of a scaphoid or other fracture and to classify the scaphoid fracture based on the Herbert classification system. Fleiss kappa statistics were used to calculate the interobserver agreement for the diagnosis of a fracture. Intraclass correlation coefficients (ICCs) were used to assess the agreement for the classification of scaphoid fracture. Results. A total of nine (29%) scaphoid fractures and 12 (39%) other fractures were diagnosed in 20 patients (65%) using HR-pQCT across the four observers. The interobserver agreement was 91% for the identification of a scaphoid fracture (95% confidence interval (CI) 0.76 to 1.00) and 80% for other fractures (95% CI 0.72 to 0.87). The mean ICC for the classification of a scaphoid fracture in the seven patients diagnosed with scaphoid fracture by all four observers was 73% (95% CI 0.42 to 0.94). Conclusion. We conclude that the diagnosis of scaphoid and other fractures is reliable when using HR-pQCT in patients with a clinically-suspected fracture. Cite this article: Bone Joint J 2020;102-B(4):478–484
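
For readers unfamiliar with the Fleiss kappa statistic named in the abstract above, the following is a minimal sketch of the standard formula for several raters and categorical ratings. The function name and the example ratings are hypothetical and are not the study's data; the abstract does not report per-patient ratings.

```python
def fleiss_kappa(counts):
    """Fleiss kappa for a table where counts[i][j] is the number of
    raters who assigned subject i to category j.

    Assumes every subject was rated by the same number of raters.
    """
    n_subjects = len(counts)
    n_categories = len(counts[0])
    n_raters = sum(counts[0])

    # Observed agreement: proportion of agreeing rater pairs per subject.
    p_i = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_bar = sum(p_i) / n_subjects

    # Chance agreement from the overall share of ratings in each category.
    p_j = [
        sum(row[j] for row in counts) / (n_subjects * n_raters)
        for j in range(n_categories)
    ]
    p_e = sum(p * p for p in p_j)

    return (p_bar - p_e) / (1 - p_e)


# Hypothetical example: four observers, columns = [fracture, no fracture].
hypothetical_ratings = [[4, 0], [0, 4], [3, 1], [0, 4], [4, 0]]
print(f"Fleiss kappa: {fleiss_kappa(hypothetical_ratings):.2f}")
```

The statistic is undefined when all ratings fall into a single category (chance agreement equals 1), which this sketch does not guard against.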


The Bone & Joint Journal
Vol. 106-B, Issue 3 | Pages 227 - 231
1 Mar 2024
Todd NV Casey A Birch NC

The diagnostic sub-categorization of cauda equina syndrome (CES) is used to aid communication between doctors and other healthcare professionals. It is also used to determine the need for, and urgency of, MRI and surgery in these patients. A recent paper by Hoeritzauer et al (2023) in this journal examined the interobserver reliability of the widely accepted subcategories in 100 patients with cauda equina syndrome. They found that there is no useful interobserver agreement for the subcategories, even for experienced spinal surgeons. This observation is supported by the largest prospective study of the treatment of cauda equina syndrome in the UK by Woodfield et al (2023). If the accepted subcategories are unreliable, they cannot be used in the way that they are currently, and they should be revised or abandoned. This paper presents a reassessment of the diagnostic and prognostic subcategories of cauda equina syndrome in the light of this evidence, with a suggested cure based on a more inclusive synthesis of symptoms, signs, bladder ultrasound scan results, and pre-intervention urinary catheterization. Cite this article: Bone Joint J 2024;106-B(3):227–231