Aims. Classifying trochlear dysplasia (TD) is useful to determine the treatment options for patients suffering from patellofemoral instability (PFI). There is no consensus on which classification system is more reliable and reproducible for the purpose of guiding clinicians’ management of PFI. There are also concerns about the validity of the Dejour Classification (DJC), which is the most widely used classification for TD, having only a fair
Aims. We aimed to assess the
Aims. The Wrightington classification system of fracture-dislocations of the elbow divides these injuries into six subtypes depending on the involvement of the coronoid and the radial head. The aim of this study was to assess the
Objectives. CT-based three-column classification (TCC) has been widely used in the treatment of tibial plateau fractures (TPFs). In its updated version (updated three-column concept, uTCC), a fracture morphology-based injury mechanism was proposed for effective treatment guidance. In this study, the injury mechanism of TPFs is further explained, and its inter- and intraobserver
A variety of radiological methods of measuring
version of the acetabular component after total hip replacement (THR)
have been described. The aim of this study was to evaluate the reliability
and validity of six methods (those of Lewinnek; Widmer; Hassan et
al; Ackland, Bourne and Uhthoff; Liaw et al; and Woo and Morrey)
that are currently in use. In 36 consecutive patients who underwent
THR, version of the acetabular component was measured by three independent
examiners on plain radiographs using these six methods and compared
with measurements using CT scans. The intra- and interobserver reliabilities
of each measurement were estimated. All measurements on both radiographs
and CT scans had excellent intra- and interobserver reliability
and the results from each of the six methods correlated well with
the CT measurements. However, measurements made using the methods
of Widmer and of Ackland, Bourne and Uhthoff were significantly
different from the CT measurements (both p <
0.001), whereas
measurements made using the remaining four methods were similar
to the CT measurements. With regard to
Objectives. The diagnosis of surgical site infection following endoprosthetic reconstruction for bone tumours is frequently a subjective diagnosis. Large clinical trials use blinded Central Adjudication Committees (CACs) to minimise the variability and bias associated with assessing a clinical outcome. The aim of this study was to determine the level of inter-rater and intra-rater agreement in the diagnosis of surgical site infection in the context of a clinical trial. Materials and Methods. The Prophylactic Antibiotic Regimens in Tumour Surgery (PARITY) trial CAC adjudicated 29 non-PARITY cases of lower extremity endoprosthetic reconstruction. The CAC members classified each case according to the Centers for Disease Control (CDC) criteria for surgical site infection (superficial, deep, or organ space). Combinatorial analysis was used to calculate the smallest CAC panel size required to maximise agreement. A final meeting was held to establish a consensus. Results. Full or near consensus was reached in 20 of the 29 cases. The Fleiss kappa value was calculated as 0.44 (95% confidence interval (CI) 0.35 to 0.53), or moderate agreement. The greatest statistical agreement was observed in the outcome of no infection, 0.61 (95% CI 0.49 to 0.72, substantial agreement). Panelists reached a full consensus in 12 of 29 cases and near consensus in five of 29 cases when CDC criteria were used (superficial, deep or organ space). A stable maximum Fleiss kappa of 0.46 (95% CI 0.50 to 0.35) at CAC sizes greater than three members was obtained. Conclusions. There is substantial agreement among the members of the PARITY CAC regarding the presence or absence of surgical site infection. Agreement on the level of infection, however, is more challenging. Additional clinical information routinely collected by the prospective PARITY trial may improve the discriminatory capacity of the CAC in the parent study for the diagnosis of infection. Cite this article: J. Nuttall, N. Evaniew, P. Thornley, A. Griffin, B. Deheshi, T. O’Shea, J. Wunder, P. Ferguson, R. L. Randall, R. Turcotte, P. Schneider, P. McKay, M. Bhandari, M. Ghert. The inter-rater
Besides conventional radiographs, the use of MRI, CT, and bone scintigraphy is frequent in the diagnosis of a fracture of the scaphoid. However, which techniques give the best results remain unknown. The investigation of a new imaging technique initially requires an analysis of its precision. The primary aim of this study was to investigate the interobserver agreement of high-resolution peripheral quantitative CT (HR-pQCT) in the diagnosis of a scaphoid fracture. A secondary aim was to investigate the interobserver agreement for the presence of other fractures and for the classification of scaphoid fracture. Two radiologists and two orthopaedic trauma surgeons evaluated HR-pQCT scans of 31 patients with a clinically-suspected scaphoid fracture. The observers were asked to determine the presence of a scaphoid or other fracture and to classify the scaphoid fracture based on the Herbert classification system. Fleiss kappa statistics were used to calculate the interobserver agreement for the diagnosis of a fracture. Intraclass correlation coefficients (ICCs) were used to assess the agreement for the classification of scaphoid fracture.Aims
Methods
Aims. The evidence demonstrating the superiority of early MRI has led to increased use of MRI in clinical pathways for acute wrist trauma. The aim of this study was to describe the radiological characteristics and the inter-observer
Aims. Reimers migration percentage (MP) is a key measure to inform decision-making around the management of hip displacement in cerebral palsy (CP). The aim of this study is to assess validity and inter- and intra-rater
Aims. This aim of this study was to assess the
Aims. To investigate if preoperative CT improves detection of unstable trochanteric hip fractures. Methods. A single-centre prospective study was conducted. Patients aged 65 years or older with trochanteric hip fractures admitted to Stavanger University Hospital (Stavanger, Norway) were consecutively included from September 2020 to January 2022. Radiographs and CT images of the fractures were obtained, and surgeons made individual assessments of the fractures based on these. The assessment was conducted according to a systematic protocol including three classification systems (AO/Orthopaedic Trauma Association (OTA), Evans Jensen (EVJ), and Nakano) and questions addressing specific fracture patterns. An expert group provided a gold-standard assessment based on the CT images. Sensitivities and specificities of surgeons’ assessments were estimated and compared in regression models with correlations for the same patients. Intra- and inter-rater
Aims. The aim of the current study was to assess the
Aims. The Oswestry-Bristol Classification (OBC) was recently described as an MRI-based classification tool for the femoral trochlear. The authors demonstrated better inter- and intraobserver agreement compared to the Dejour classification. As the OBC could potentially provide a very useful MRI-based grading system for trochlear dysplasia, it was the aim to determine the inter- and intraobserver
The databases of the Picture Archiving and Communication Systems of two hospitals were searched and all children who had a lateral radiograph of the ankle during their attendance at the emergency department were identified. In 227 radiographs, Bohler’s and Gissane’s angles were measured on two separate occasions and by two separate authors to allow calculation of inter- and intra-observer variation. Intraclass correlation coefficients were used to assess the
Locognosia, the ability to localise touch, is one aspect of tactile spatial discrimination which relies on the integrity of peripheral end-organs as well as the somatosensory representation of the surface of the body in the brain. The test presented here is a standardised assessment which uses a protocol for testing locognosia in the zones of the hand supplied by the median and/or ulnar nerves. The test-retest
We aimed to determine the
The Unified Classification System (UCS) was introduced
because of a growing need to have a standardised universal classification
system of periprosthetic fractures. It combines and simplifies many
existing classification systems, and can be applied to any fracture
around any partial or total joint replacement occurring during or
after operation. Our goal was to assess the inter- and intra-observer
reliability of the UCS in association with knee replacement when
classifying fractures affecting one or more of the femur, tibia
or patella. We used an international panel of ten orthopaedic surgeons with
subspecialty fellowship training and expertise in adult hip and
knee reconstruction (‘experts’) and ten residents of orthopaedic
surgery in the last two years of training (‘pre-experts’). They
each received 15 radiographs for evaluation. After six weeks they
evaluated the same radiographs again but in a different order. . The
Interobserver
Previous standards for assessing the reliability
of a measurement tool have lacked consistency. We reviewed the most
current American Society for Testing and Materials and International
Organisation for Standardisation (ISO) recommendations, and propose
an algorithm for orthopaedic surgeons. When assessing a measurement
tool, conditions of the experimental set-up and clear formulae used
to compile the results should be strictly reported. According to
these recent guidelines, accuracy is a confusing word with an overly
broad meaning and should therefore be abandoned. Depending on the
experimental conditions, one should be referring to bias (when the study
protocol involves accepted reference values), and repeatability
(sr, r) or reproducibility (SR, R). In the absence of accepted reference
values, only repeatability (sr, r) or reproducibility (SR, R) should
be provided. Take home message: Assessing the
There is no single standardised method of measuring
the orientation of the acetabular component on plain radiographs
after total hip arthroplasty. We assessed the
Our aim was to assess the reproducibility and the
We evaluated the impact of stereo-visualisation of three-dimensional volume-rendering CT datasets on the inter- and intraobserver
Objectives. The patient-rated wrist evaluation (PRWE) and the Disabilities of the Arm, Shoulder and Hand (DASH) questionnaire are patient-reported outcome measures (PROMs) used for clinical and research purposes. Methodological high-quality clinimetric studies that determine the measurement properties of these PROMs when used in patients with a distal radial fracture are lacking. This study aimed to validate the PRWE and DASH in Dutch patients with a displaced distal radial fracture (DRF). Methods. The intraclass correlation coefficient (ICC) was used for test-retest
Objectives. There remains a lack of data on the
We studied 19 videotaped knee arthroscopies in 19 patients with mild to moderate osteoarthritis (OA) of the knee in order to compare the intraobserver and interobserver
Aims. To evaluate interobserver
Several radiological methods of measuring anteversion
of the acetabular component after total hip replacement (THR) have
been described. These studies used different definitions and reference
planes to compare methods, allowing for misinterpretation of the
results. We compared the
Ultrasound scans were made of the hips of 209 neonates born consecutively over a two-week period. Of the 418 scans, 62 images were selected at random and 25 of these were duplicated to give a total of 87 scans. These static images were then presented to five experienced observers who each made nine different assessments and measurements. Interobserver and intraboserver agreement was calculated and expressed as kappa values. Our results showed poor
We have evaluated the
The
We investigated the reproducibility of the various radiological methods of assessment of hip dysplasia by making 474 assessments of hips and quantifying the inter-observer and intra-observer variation. There was a wide range of variability between the readings made by different observers and by one observer on two occasions. A measurement of acetabular index has to be given a range of +/- 6 degrees in order to be 95% confident of including the true measurement. We found the most helpful measurements to be the acetabular index, up to the age of eight years; the centre-edge angle, over the age of five years; and Smith's c/b ratio and neck-shaft angle. We feel, however, that the change in value over a series of radiographs in the same child is much more valuable. Single readings of all the radiological measurements investigated in this study were unreliable.
The most widely used classification system for
acetabular fractures was developed by Judet, Judet and Letournel over
50 years ago primarily to aid surgical planning. As population demographics
and injury mechanisms have altered over time, the fracture patterns
also appear to be changing. We conducted a retrospective review
of the imaging of 100 patients with a mean age of 54.9 years (19
to 94) and a male to female ratio of 69:31 seen between 2010 and
2013 with acetabular fractures in order to determine whether the
current spectrum of injury patterns can be reliably classified using
the original system. Three consultant pelvic and acetabular surgeons and one senior
fellow analysed anonymous imaging. Inter-observer agreement for
the classification of fractures that fitted into defined categories
was substantial, (κ = 0.65, 95% confidence interval (CI) 0.51 to
0.76) with improvement to near perfect on inclusion of CT imaging
(κ = 0.80, 95% CI 0.69 to 0.91). However, a high proportion of injuries
(46%) were felt to be unclassifiable by more than one surgeon; there
was moderate agreement on which these were (κ = 0.42 95% CI 0.31
to 0.54). Further review of the unclassifiable fractures in this cohort
of 100 patients showed that they tended to occur in an older population
(mean age 59.1 years; 22 to 94 Cite this article:
We investigated 60 patients (89 feet) with a
mean age of 64 years (61 to 67) treated for congenital clubfoot deformity,
using standardised weight-bearing radiographs of both feet and ankles
together with a functional evaluation. Talocalcaneal and talonavicular
relationships were measured and the degree of osteo-arthritic change
in the ankle and talonavicular joints was assessed. The functional
results were evaluated using a modified Laaveg-Ponseti score. The
talocalcaneal (TC) angles in the clubfeet were significantly lower
in both anteroposterior (AP) and lateral projections than in the
unaffected feet (p <
0.001 for both views). There was significant
medial subluxation of the navicular in the clubfeet compared with
the unaffected feet (p <
0.001). Severe osteoarthritis in the
ankle joint was seen in seven feet (8%) and in the talonavicular
joint in 11 feet (12%). The functional result was excellent or good
(≥ 80 points) in 29 patients (48%), and fair or poor (<
80 points)
in 31 patients (52%). Patients who had undergone few (0 to 1) surgical
procedures had better functional outcomes than those who had undergone
two or more procedures (p <
0.001). There was a significant correlation
between the functional result and the degree of medial subluxation
of the navicular (p <
0.001, r2 = 0.164), the talocalcaneal
angle on AP projection (p <
0.02, r2 = 0.025) and extent of osteoarthritis
in the ankle joint (p <
0.001). We conclude that poor functional outcome in patients with congenital
clubfoot occurs more frequently in those with medial displacement
of the navicular, osteoarthritis of the talonavicular and ankle
joints, and a low talocalcaneal angle on the AP projection, and
in patients who have undergone two or more surgical procedures. However,
the ankle joint in these patients appeared relatively resistant
to the development of osteoarthritis.
Aims. The purpose of this study was to assess the
Aims. This study aimed to evaluate the clinical application of the PJI-TNM classification for periprosthetic joint infection (PJI) by determining intraobserver and interobserver
Aims. This study aimed to answer the following questions: do 3D-printed models lead to a more accurate recognition of the pattern of complex fractures of the elbow?; do 3D-printed models lead to a more reliable recognition of the pattern of these injuries?; and do junior surgeons benefit more from 3D-printed models than senior surgeons?. Methods. A total of 15 orthopaedic trauma surgeons (seven juniors, eight seniors) evaluated 20 complex elbow fractures for their overall pattern (i.e. varus posterior medial rotational injury, terrible triad injury, radial head fracture with posterolateral dislocation, anterior (trans-)olecranon fracture-dislocation, posterior (trans-)olecranon fracture-dislocation) and their specific characteristics. First, fractures were assessed based on radiographs and 2D and 3D CT scans; and in a subsequent round, one month later, with additional 3D-printed models. Diagnostic accuracy (acc) and inter-surgeon
Aims. The aim of this study was to evaluate the
Aims. Obtaining solid implant fixation is crucial in revision total knee arthroplasty (rTKA) to avoid aseptic loosening, a major reason for re-revision. This study aims to validate a novel grading system that quantifies implant fixation across three anatomical zones (epiphysis, metaphysis, diaphysis). Methods. Based on pre-, intra-, and postoperative assessments, the novel grading system allocates a quantitative score (0, 0.5, or 1 point) for the quality of fixation achieved in each anatomical zone. The criteria used by the algorithm to assign the score include the bone quality, the size of the bone defect, and the type of fixation used. A consecutive cohort of 245 patients undergoing rTKA from 2012 to 2018 were evaluated using the current novel scoring system and followed prospectively. In addition, 100 first-time revision cases were assessed radiologically from the original cohort and graded by three observers to evaluate the intra- and inter-rater
Aims. To identify a core outcome set of postoperative radiographic measurements to assess technical skill in ankle fracture open reduction internal fixation (ORIF), and to validate these against Van der Vleuten’s criteria for effective assessment. Methods. An e-Delphi exercise was undertaken at a major trauma centre (n = 39) to identify relevant parameters. Feasibility was tested by two authors.
Aims. Though most humeral shaft fractures heal nonoperatively, up to one-third may lead to nonunion with inferior outcomes. The Radiographic Union Score for HUmeral Fractures (RUSHU) was created to identify high-risk patients for nonunion. Our study evaluated the RUSHU’s prognostic performance at six and 12 weeks in discriminating nonunion within a significantly larger cohort than before. Methods. Our study included 226 nonoperatively treated humeral shaft fractures. We evaluated the interobserver
Aims. The aim of this study was to investigate the agreement in interpretation of the quality of the paediatric hip ultrasound examination, the
Aims. To determine whether side-bending films in scoliosis are assessed for adequacy in clinical practice; and to introduce a novel method for doing so. Methods. Six surgeons and eight radiographers were invited to participate in four online surveys. The generic survey comprised erect and left and right bending radiographs of eight individuals with scoliosis, with an average age of 14.6 years. Respondents were asked to indicate whether each bending film was optimal (adequate) or suboptimal. In the first survey, they were also asked if they currently assessed the adequacy of bending films. A similar second survey was sent out two weeks later, using the same eight cases but in a different order. In the third survey, a guide for assessing bending film adequacy was attached along with the radiographs to introduce the novel T1-45B method, in which the upper endplate of T1 must tilt ≥ 45° from baseline for the study to be considered optimal. A fourth and final survey was subsequently conducted for confirmation. Results. Overall, 12 (86%) of 14 respondents did not use any criteria to assess the bending film adequacy; the remaining two each described a different invalidated method. In total, 12 (86%) of the respondents felt T1-45B was easy to learn and apply. There was fair to substantial intra-rater
Aims. To propose a new method for evaluating paediatric radial neck fractures and improve the accuracy of fracture angulation measurement, particularly in younger children, and thereby facilitate planning treatment in this population. Methods. Clinical data of 117 children with radial neck fractures in our hospital from August 2014 to March 2023 were collected. A total of 50 children (26 males, 24 females, mean age 7.6 years (2 to 13)) met the inclusion criteria and were analyzed. Cases were excluded for the following reasons: Judet grade I and Judet grade IVb (> 85° angulation) classification; poor radiograph image quality; incomplete clinical information; sagittal plane angulation; severe displacement of the ulna fracture; and Monteggia fractures. For each patient, standard elbow anteroposterior (AP) view radiographs and corresponding CT images were acquired. On radiographs, Angle P (complementary to the angle between the long axis of the radial head and the line perpendicular to the physis), Angle S (complementary to the angle between the long axis of the radial head and the midline through the proximal radial shaft), and Angle U (between the long axis of the radial head and the straight line from the distal tip of the capitellum to the coronoid process) were identified as candidates approximating the true coronal plane angulation of radial neck fractures. On the coronal plane of the CT scan, the angulation of radial neck fractures (CTa) was measured and served as the reference standard for measurement. Inter- and intraobserver
Aims. As an alternative to external fixators, intramedullary lengthening nails (ILNs) can be employed for distraction osteogenesis. While previous studies have demonstrated that typical complications of external devices, such as soft-tissue tethering, and pin site infection can be avoided with ILNs, there is a lack of studies that exclusively investigated tibial distraction osteogenesis with motorized ILNs inserted via an antegrade approach. Methods. A total of 58 patients (median age 17 years (interquartile range (IQR) 15 to 21)) treated by unilateral tibial distraction osteogenesis for a median leg length discrepancy of 41 mm (IQR 34 to 53), and nine patients with disproportionate short stature treated by bilateral simultaneous tibial distraction osteogenesis, with magnetically controlled motorized ILNs inserted via an antegrade approach, were retrospectively analyzed. The median follow-up was 37 months (IQR 30 to 51). Outcome measurements were accuracy, precision,
Aims. The Oxford Shoulder Score (OSS) is a 12-item measure commonly used for the assessment of shoulder surgeries. This study explores whether computerized adaptive testing (CAT) provides a shortened, individually tailored questionnaire while maintaining test accuracy. Methods. A total of 16,238 preoperative OSS were available in the National Joint Registry (NJR) for England, Wales, Northern Ireland, the Isle of Man, and the States of Guernsey dataset (April 2012 to April 2022). Prior to CAT, the foundational item response theory (IRT) assumptions of unidimensionality, monotonicity, and local independence were established. CAT compared sequential item selection with stopping criteria set at standard error (SE) < 0.32 and SE < 0.45 (equivalent to
Aims. Hip dysplasia (HD) leads to premature osteoarthritis. Timely detection and correction of HD has been shown to improve pain, functional status, and hip longevity. Several time-consuming radiological measurements are currently used to confirm HD. An artificial intelligence (AI) software named HIPPO automatically locates anatomical landmarks on anteroposterior pelvis radiographs and performs the needed measurements. The primary aim of this study was to assess the