Advertisement for orthosearch.org.uk
Results 1 - 20 of 127
Results per page:
Bone & Joint Research
Vol. 12, Issue 5 | Pages 313 - 320
8 May 2023
Saiki Y Kabata T Ojima T Kajino Y Kubo N Tsuchiya H

Aims. We aimed to assess the reliability and validity of OpenPose, a posture estimation algorithm, for measurement of knee range of motion after total knee arthroplasty (TKA), in comparison to radiography and goniometry. Methods. In this prospective observational study, we analyzed 35 primary TKAs (24 patients) for knee osteoarthritis. We measured the knee angles in flexion and extension using OpenPose, radiography, and goniometry. We assessed the test-retest reliability of each method using intraclass correlation coefficient (1,1). We evaluated the ability to estimate other measurement values from the OpenPose value using linear regression analysis. We used intraclass correlation coefficients (2,1) and Bland–Altman analyses to evaluate the agreement and error between radiography and the other measurements. Results. OpenPose had excellent test-retest reliability (intraclass correlation coefficient (1,1) = 1.000). The R. 2. of all regression models indicated large correlations (0.747 to 0.927). In the flexion position, the intraclass correlation coefficients (2,1) of OpenPose indicated excellent agreement (0.953) with radiography. In the extension position, the intraclass correlation coefficients (2,1) indicated good agreement of OpenPose and radiography (0.815) and moderate agreement of goniometry with radiography (0.593). OpenPose had no systematic error in the flexion position, and a 2.3° fixed error in the extension position, compared to radiography. Conclusion. OpenPose is a reliable and valid tool for measuring flexion and extension positions after TKA. It has better accuracy than goniometry, especially in the extension position. Accurate measurement values can be obtained with low error, high reproducibility, and no contact, independent of the examiner’s skills. Cite this article: Bone Joint Res 2023;12(5):313–320


Bone & Joint Research
Vol. 8, Issue 8 | Pages 357 - 366
1 Aug 2019
Zhang B Sun H Zhan Y He Q Zhu Y Wang Y Luo C

Objectives. CT-based three-column classification (TCC) has been widely used in the treatment of tibial plateau fractures (TPFs). In its updated version (updated three-column concept, uTCC), a fracture morphology-based injury mechanism was proposed for effective treatment guidance. In this study, the injury mechanism of TPFs is further explained, and its inter- and intraobserver reliability is evaluated to perfect the uTCC. Methods. The radiological images of 90 consecutive TPF patients were collected. A total of 47 men (52.2%) and 43 women (47.8%) with a mean age of 49.8 years (. sd. 12.4; 17 to 77) were enrolled in our study. Among them, 57 fractures were on the left side (63.3%) and 33 were on the right side (36.7%); no bilateral fracture existed. Four observers were chosen to classify or estimate independently these randomized cases according to the Schatzker classification, TCC, and injury mechanism. With two rounds of evaluation, the kappa values were calculated to estimate the inter- and intrareliability. Results. The overall inter- and intraobserver agreements of the injury mechanism were substantial (κ. inter. = 0.699, κ. intra. = 0.749, respectively). The initial position and the force direction, which are two components of the injury mechanism, had substantial agreement for both inter-reliability or intrareliability. The inter- and intraobserver agreements were lower in high-energy fractures (Schatzker types IV to VI; κ. inter. = 0.605, κ. intra. = 0.721) compared with low-energy fractures (Schatzker types I to III; κ. inter. = 0.81, κ. intra. = 0.832). The inter- and intraobserver agreements were relatively higher in one-column fractures (κ. inter. = 0.759, κ. intra. = 0.801) compared with two-column and three-column fractures. Conclusion. The complete theory of injury mechanism of TPFs was first put forward to make the TCC consummate. It demonstrates substantial inter- and intraobserver agreement generally. Furthermore, the injury mechanism can be promoted clinically. Cite this article: B-B. Zhang, H. Sun, Y. Zhan, Q-F. He, Y. Zhu, Y-K. Wang, C-F. Luo. Reliability and repeatability of tibial plateau fracture assessment with an injury mechanism-based concept. Bone Joint Res 2019;8:357–366. DOI: 10.1302/2046-3758.88.BJR-2018-0331.R1


Bone & Joint Research
Vol. 5, Issue 8 | Pages 347 - 352
1 Aug 2016
Nuttall J Evaniew N Thornley P Griffin A Deheshi B O’Shea T Wunder J Ferguson P Randall RL Turcotte R Schneider P McKay P Bhandari M Ghert M

Objectives. The diagnosis of surgical site infection following endoprosthetic reconstruction for bone tumours is frequently a subjective diagnosis. Large clinical trials use blinded Central Adjudication Committees (CACs) to minimise the variability and bias associated with assessing a clinical outcome. The aim of this study was to determine the level of inter-rater and intra-rater agreement in the diagnosis of surgical site infection in the context of a clinical trial. Materials and Methods. The Prophylactic Antibiotic Regimens in Tumour Surgery (PARITY) trial CAC adjudicated 29 non-PARITY cases of lower extremity endoprosthetic reconstruction. The CAC members classified each case according to the Centers for Disease Control (CDC) criteria for surgical site infection (superficial, deep, or organ space). Combinatorial analysis was used to calculate the smallest CAC panel size required to maximise agreement. A final meeting was held to establish a consensus. Results. Full or near consensus was reached in 20 of the 29 cases. The Fleiss kappa value was calculated as 0.44 (95% confidence interval (CI) 0.35 to 0.53), or moderate agreement. The greatest statistical agreement was observed in the outcome of no infection, 0.61 (95% CI 0.49 to 0.72, substantial agreement). Panelists reached a full consensus in 12 of 29 cases and near consensus in five of 29 cases when CDC criteria were used (superficial, deep or organ space). A stable maximum Fleiss kappa of 0.46 (95% CI 0.50 to 0.35) at CAC sizes greater than three members was obtained. Conclusions. There is substantial agreement among the members of the PARITY CAC regarding the presence or absence of surgical site infection. Agreement on the level of infection, however, is more challenging. Additional clinical information routinely collected by the prospective PARITY trial may improve the discriminatory capacity of the CAC in the parent study for the diagnosis of infection. Cite this article: J. Nuttall, N. Evaniew, P. Thornley, A. Griffin, B. Deheshi, T. O’Shea, J. Wunder, P. Ferguson, R. L. Randall, R. Turcotte, P. Schneider, P. McKay, M. Bhandari, M. Ghert. The inter-rater reliability of the diagnosis of surgical site infection in the context of a clinical trial. Bone Joint Res 2016;5:347–352. DOI: 10.1302/2046-3758.58.BJR-2016-0036.R1


Bone & Joint Research
Vol. 9, Issue 5 | Pages 242 - 249
1 May 2020
Bali K Smit K Ibrahim M Poitras S Wilkin G Galmiche R Belzile E Beaulé PE

Aims. The aim of the current study was to assess the reliability of the Ottawa classification for symptomatic acetabular dysplasia. Methods. In all, 134 consecutive hips that underwent periacetabular osteotomy were categorized using a validated software (Hip2Norm) into four categories of normal, lateral/global, anterior, or posterior. A total of 74 cases were selected for reliability analysis, and these included 44 dysplastic and 30 normal hips. A group of six blinded fellowship-trained raters, provided with the classification system, looked at these radiographs at two separate timepoints to classify the hips using standard radiological measurements. Thereafter, a consensus meeting was held where a modified flow diagram was devised, before a third reading by four raters using a separate set of 74 radiographs took place. Results. Intrarater results per surgeon between Time 1 and Time 2 showed substantial to almost perfect agreement among the raters (κappa = 0.416 to 0.873). With respect to inter-rater reliability, at Time 1 and Time 2 there was substantial agreement overall between all surgeons (Time 1 κappa = 0.619; Time 2 κappa = 0.623). Posterior and anterior rating categories had moderate and fair agreement at Time 1 (posterior κappa = 0.557; anterior κappa = 0.438) and Time 2 (posterior κappa = 0.506; anterior κappa = 0.250), respectively. At Time 3, overall reliability (κappa = 0.687) and posterior and anterior reliability (posterior κappa = 0.579; anterior κappa = 0.521) improved from Time 1 and Time 2. Conclusion. The Ottawa classification system provides a reliable way to identify three categories of acetabular dysplasia that are well-aligned with surgical management. The term ‘borderline dysplasia’ should no longer be used. Cite this article: Bone Joint Res. 2020;9(5):242–249


Bone & Joint Research
Vol. 7, Issue 1 | Pages 36 - 45
1 Jan 2018
Kleinlugtenbelt YV Krol RG Bhandari M Goslings JC Poolman RW Scholtes VAB

Objectives. The patient-rated wrist evaluation (PRWE) and the Disabilities of the Arm, Shoulder and Hand (DASH) questionnaire are patient-reported outcome measures (PROMs) used for clinical and research purposes. Methodological high-quality clinimetric studies that determine the measurement properties of these PROMs when used in patients with a distal radial fracture are lacking. This study aimed to validate the PRWE and DASH in Dutch patients with a displaced distal radial fracture (DRF). Methods. The intraclass correlation coefficient (ICC) was used for test-retest reliability, between PROMs completed twice with a two-week interval at six to eight months after DRF. Internal consistency was determined using Cronbach’s α for the dimensions found in the factor analysis. The measurement error was expressed by the smallest detectable change (SDC). A semi-structured interview was conducted between eight and 12 weeks after DRF to assess the content validity. Results. A total of 119 patients (mean age 58 years (. sd. 15)), 74% female, completed PROMs at a mean time of six months (. sd. 1) post-fracture. One overall meaningful dimension was found for the PRWE and the DASH. Internal consistency was excellent for both PROMs (Cronbach’s α 0.96 (PRWE) and 0.97 (DASH)). Test-retest reliability was good for the PRWE (ICC 0.87) and excellent for the DASH (ICC 0.91). The SDC was 20 for the PRWE and 14 for the DASH. No floor or ceiling effects were found. The content validity was good for both questionnaires. Conclusion. The PRWE and DASH are valid and reliable PROMs in assessing function and disability in Dutch patients with a displaced DRF. However, due to the high SDC, the PRWE and DASH are less useful for individual patients with a distal radial fracture in clinical practice. Cite this article: Y. V. Kleinlugtenbelt, R. G. Krol, M. Bhandari, J. C. Goslings, R. W. Poolman, V. A. B. Scholtes. Are the patient-rated wrist evaluation (PRWE) and the disabilities of the arm, shoulder and hand (DASH) questionnaire used in distal radial fractures truly valid and reliable? Bone Joint Res 2018;7:36–45. DOI: 10.1302/2046-3758.71.BJR-2017-0081.R1


Bone & Joint Research
Vol. 2, Issue 1 | Pages 1 - 8
1 Jan 2013
Costa AJ Lustig S Scholes CJ Balestro J Fatima M Parker DA

Objectives. There remains a lack of data on the reliability of methods to estimate tibial coverage achieved during total knee replacement. In order to address this gap, the intra- and interobserver reliability of a three-dimensional (3D) digital templating method was assessed with one symmetric and one asymmetric prosthesis design. Methods. A total of 120 template procedures were performed according to specific rotational and over-hang criteria by three observers at time zero and again two weeks later. Total and sub-region coverage were calculated and the reliability of the templating and measurement method was evaluated. Results. Excellent intra- and interobserver reliability was observed for total coverage, when minimal component overhang (intraclass correlation coefficient (ICC) = 0.87) or no component overhang (ICC = 0.92) was permitted, regardless of rotational restrictions. Conclusions. Measurement of tibial coverage can be reliable using the templating method described even if the rotational axis selected still has a minor influence


Bone & Joint Research
Vol. 13, Issue 1 | Pages 19 - 27
5 Jan 2024
Baertl S Rupp M Kerschbaum M Morgenstern M Baumann F Pfeifer C Worlicek M Popp D Amanatullah DF Alt V

Aims. This study aimed to evaluate the clinical application of the PJI-TNM classification for periprosthetic joint infection (PJI) by determining intraobserver and interobserver reliability. To facilitate its use in clinical practice, an educational app was subsequently developed and evaluated. Methods. A total of ten orthopaedic surgeons classified 20 cases of PJI based on the PJI-TNM classification. Subsequently, the classification was re-evaluated using the PJI-TNM app. Classification accuracy was calculated separately for each subcategory (reinfection, tissue and implant condition, non-human cells, and morbidity of the patient). Fleiss’ kappa and Cohen’s kappa were calculated for interobserver and intraobserver reliability, respectively. Results. Overall, interobserver and intraobserver agreements were substantial across the 20 classified cases. Analyses for the variable ‘reinfection’ revealed an almost perfect interobserver and intraobserver agreement with a classification accuracy of 94.8%. The category 'tissue and implant conditions' showed moderate interobserver and substantial intraobserver reliability, while the classification accuracy was 70.8%. For 'non-human cells,' accuracy was 81.0% and interobserver agreement was moderate with an almost perfect intraobserver reliability. The classification accuracy of the variable 'morbidity of the patient' reached 73.5% with a moderate interobserver agreement, whereas the intraobserver agreement was substantial. The application of the app yielded comparable results across all subgroups. Conclusion. The PJI-TNM classification system captures the heterogeneity of PJI and can be applied with substantial inter- and intraobserver reliability. The PJI-TNM educational app aims to facilitate application in clinical practice. A major limitation was the correct assessment of the implant situation. To eliminate this, a re-evaluation according to intraoperative findings is strongly recommended. Cite this article: Bone Joint Res 2024;13(1):19–27


Bone & Joint Research
Vol. 13, Issue 8 | Pages 392 - 400
5 Aug 2024
Barakat A Evans J Gibbons C Singh HP

Aims. The Oxford Shoulder Score (OSS) is a 12-item measure commonly used for the assessment of shoulder surgeries. This study explores whether computerized adaptive testing (CAT) provides a shortened, individually tailored questionnaire while maintaining test accuracy. Methods. A total of 16,238 preoperative OSS were available in the National Joint Registry (NJR) for England, Wales, Northern Ireland, the Isle of Man, and the States of Guernsey dataset (April 2012 to April 2022). Prior to CAT, the foundational item response theory (IRT) assumptions of unidimensionality, monotonicity, and local independence were established. CAT compared sequential item selection with stopping criteria set at standard error (SE) < 0.32 and SE < 0.45 (equivalent to reliability coefficients of 0.90 and 0.80) to full-length patient-reported outcome measure (PROM) precision. Results. Confirmatory factor analysis (CFA) for unidimensionality exhibited satisfactory fit with root mean square standardized residual (RSMSR) of 0.06 (cut-off ≤ 0.08) but not with comparative fit index (CFI) of 0.85 or Tucker-Lewis index (TLI) of 0.82 (cut-off > 0.90). Monotonicity, measured by H value, yielded 0.482, signifying good monotonic trends. Local independence was generally met, with Yen’s Q3 statistic > 0.2 for most items. The median item count for completing the CAT simulation with a SE of 0.32 was 3 (IQR 3 to 12), while for a SE of 0.45 it was 2 (IQR 2 to 6). This constituted only 25% and 16%, respectively, when compared to the 12-item full-length questionnaire. Conclusion. Calibrating IRT for the OSS has resulted in the development of an efficient and shortened CAT while maintaining accuracy and reliability. Through the reduction of redundant items and implementation of a standardized measurement scale, our study highlights a promising approach to alleviate time burden and potentially enhance compliance with these widely used outcome measures. Cite this article: Bone Joint Res 2024;13(8):392–400


Bone & Joint Research
Vol. 11, Issue 9 | Pages 619 - 628
7 Sep 2022
Yapp LZ Scott CEH Howie CR MacDonald DJ Simpson AHRW Clement ND

Aims. The aim of this study was to report the meaningful values of the EuroQol five-dimension three-level questionnaire (EQ-5D-3L) and EuroQol visual analogue scale (EQ-VAS) in patients undergoing primary knee arthroplasty (KA). Methods. This is a retrospective study of patients undergoing primary KA for osteoarthritis in a university teaching hospital (Royal Infirmary of Edinburgh) (1 January 2013 to 31 December 2019). Pre- and postoperative (one-year) data were prospectively collected for 3,181 patients (median age 69.9 years (interquartile range (IQR) 64.2 to 76.1); females, n = 1,745 (54.9%); median BMI 30.1 kg/m. 2. (IQR 26.6 to 34.2)). The reliability of the EQ-5D-3L was measured using Cronbach’s alpha. Responsiveness was determined by calculating the anchor-based minimal clinically important difference (MCID), the minimal important change (MIC) (cohort and individual), the patient-acceptable symptom state (PASS) predictive of satisfaction, and the minimal detectable change at 90% confidence intervals (MDC-90). Results. The EQ-5D-3L demonstrated good internal consistency with an overall Cronbach alpha of 0.75 (preoperative) and 0.88 (postoperative), respectively. The MCID for the Index score was 0.085 (95% confidence interval (CI) 0.042 to 0.127) and EQ-VAS was 6.41 (95% CI 3.497 to 9.323). The MIC. COHORT. was 0.289 for the EQ-5D and 5.27 for the EQ-VAS. However, the MIC. INDIVIDUAL. for both the EQ-5D-3L Index (0.105) and EQ-VAS (-1) demonstrated poor-to-acceptable reliability. The MDC-90 was 0.023 for the EQ-5D-3L Index and 1.0 for the EQ-VAS. The PASS for the postoperative EQ-5D-3L Index and EQ-VAS scores predictive of patient satisfaction were 0.708 and 77.0, respectively. Conclusion. The meaningful values of the EQ-5D-3L Index and EQ-VAS scores can be used to measure clinically relevant changes in health-related quality of life in patients undergoing primary KA. Cite this article: Bone Joint Res 2022;11(9):619–628


Bone & Joint Research
Vol. 10, Issue 12 | Pages 820 - 829
15 Dec 2021
Schmidutz F Schopf C Yan SG Ahrend M Ihle C Sprecher C

Aims. The distal radius is a major site of osteoporotic bone loss resulting in a high risk of fragility fracture. This study evaluated the capability of a cortical index (CI) at the distal radius to predict the local bone mineral density (BMD). Methods. A total of 54 human cadaver forearms (ten singles, 22 pairs) (19 to 90 years) were systematically assessed by clinical radiograph (XR), dual-energy X-ray absorptiometry (DXA), CT, as well as high-resolution peripheral quantitative CT (HR-pQCT). Cortical bone thickness (CBT) of the distal radius was measured on XR and CT scans, and two cortical indices mean average (CBTavg) and gauge (CBTg) were determined. These cortical indices were compared to the BMD of the distal radius determined by DXA (areal BMD (aBMD)) and HR-pQCT (volumetric BMD (vBMD)). Pearson correlation coefficient (r) and intraclass correlation coefficient (ICC) were used to compare the results and degree of reliability. Results. The CBT could accurately be determined on XRs and highly correlated to those determined on CT scans (r = 0.87 to 0.93). The CBTavg index of the XRs significantly correlated with the BMD measured by DXA (r = 0.78) and HR-pQCT (r = 0.63), as did the CBTg index with the DXA (r = 0.55) and HR-pQCT (r = 0.64) (all p < 0.001). A high correlation of the BMD and CBT was observed between paired specimens (r = 0.79 to 0.96). The intra- and inter-rater reliability was excellent (ICC 0.79 to 0.92). Conclusion. The cortical index (CBTavg) at the distal radius shows a close correlation to the local BMD. It thus can serve as an initial screening tool to estimate the local bone quality if quantitative BMD measurements are unavailable, and enhance decision-making in acute settings on fracture management or further osteoporosis screening. Cite this article: Bone Joint Res 2021;10(12):820–829


Bone & Joint Research
Vol. 13, Issue 6 | Pages 294 - 305
17 Jun 2024
Yang P He W Yang W Jiang L Lin T Sun W Zhang Q Bai X Sun W Guo D

Aims. In this study, we aimed to visualize the spatial distribution characteristics of femoral head necrosis using a novel measurement method. Methods. We retrospectively collected CT imaging data of 108 hips with non-traumatic osteonecrosis of the femoral head from 76 consecutive patients (mean age 34.3 years (SD 8.1), 56.58% male (n = 43)) in two clinical centres. The femoral head was divided into 288 standard units (based on the orientation of units within the femoral head, designated as N[Superior], S[Inferior], E[Anterior], and W[Posterior]) using a new measurement system called the longitude and latitude division system (LLDS). A computer-aided design (CAD) measurement tool was also developed to visualize the measurement of the spatial location of necrotic lesions in CT images. Two orthopaedic surgeons independently performed measurements, and the results were used to draw 2D and 3D heat maps of spatial distribution of necrotic lesions in the femoral head, and for statistical analysis. Results. The results showed that the LLDS has high inter-rater reliability. As illustrated by the heat map, the distribution of Japanese Investigation Committee (JIC) classification type C necrotic lesions exhibited clustering characteristics, with the lesions being concentrated in the northern and eastern regions, forming a hot zone (90% probability) centred on the N4-N6E2, N3-N6E units of outer ring blocks. Statistical results showed that the distribution difference between type C2 and type C1 was most significant in the E1 and E2 units and, combined with the heat map, indicated that the spatial distribution differences at N3-N6E1 and N1-N3E2 units are crucial in understanding type C1 and C2 necrotic lesions. Conclusion. The LLDS can be used to accurately measure the spatial location of necrotic lesions and display their distribution characteristics. Cite this article: Bone Joint Res 2024;13(6):294–305


Bone & Joint Research
Vol. 9, Issue 9 | Pages 623 - 632
5 Sep 2020
Jayadev C Hulley P Swales C Snelling S Collins G Taylor P Price A

Aims. The lack of disease-modifying treatments for osteoarthritis (OA) is linked to a shortage of suitable biomarkers. This study combines multi-molecule synovial fluid analysis with machine learning to produce an accurate diagnostic biomarker model for end-stage knee OA (esOA). Methods. Synovial fluid (SF) from patients with esOA, non-OA knee injury, and inflammatory knee arthritis were analyzed for 35 potential markers using immunoassays. Partial least square discriminant analysis (PLS-DA) was used to derive a biomarker model for cohort classification. The ability of the biomarker model to diagnose esOA was validated by identical wide-spectrum SF analysis of a test cohort of ten patients with esOA. Results. PLS-DA produced a streamlined biomarker model with excellent sensitivity (95%), specificity (98.4%), and reliability (97.4%). The eight-biomarker model produced a fingerprint for esOA comprising type IIA procollagen N-terminal propeptide (PIIANP), tissue inhibitor of metalloproteinase (TIMP)-1, a disintegrin and metalloproteinase with thrombospondin motifs 4 (ADAMTS-4), monocyte chemoattractant protein (MCP)-1, interferon-γ-inducible protein-10 (IP-10), and transforming growth factor (TGF)-β3. Receiver operating characteristic (ROC) analysis demonstrated excellent discriminatory accuracy: area under the curve (AUC) being 0.970 for esOA, 0.957 for knee injury, and 1 for inflammatory arthritis. All ten validation test patients were classified correctly as esOA (accuracy 100%; reliability 100%) by the biomarker model. Conclusion. SF analysis coupled with machine learning produced a partially validated biomarker model with cohort-specific fingerprints that accurately and reliably discriminated esOA from knee injury and inflammatory arthritis with almost 100% efficacy. The presented findings and approach represent a new biomarker concept and potential diagnostic tool to stage disease in therapy trials and monitor the efficacy of such interventions. Cite this article: Bone Joint Res 2020;9(9):623–632


Bone & Joint Research
Vol. 7, Issue 5 | Pages 351 - 356
1 May 2018
Yeoman TFM Clement ND Macdonald D Moran M

Objectives. The primary aim of this study was to assess the reproducibility of the recalled preoperative Oxford Hip Score (OHS) and Oxford Knee Score (OKS) one year following arthroplasty for a cohort of patients. The secondary aim was to assess the reliability of a patient’s recollection of their own preoperative OHS and OKS one year following surgery. Methods. A total of 335 patients (mean age 72.5; 22 to 92; 53.7% female) undergoing total hip arthroplasty (n = 178) and total knee arthroplasty (n = 157) were prospectively assessed. Patients undergoing hip and knee arthroplasty completed an OHS or OKS, respectively, preoperatively and were asked to recall their preoperative condition while completing the same score one year after surgery. Results. A mean difference of 0.04 points (95% confidence intervals (CI) -15.64 to 15.72, p = 0.97) between the actual and the recalled OHS was observed. The mean difference in the OKS was 1.59 points (95% CI -11.57 to 14.75, p = 0.10). There was excellent reliability for the ‘average measures’ intra-class correlation for both the OHS (r = 0.802) and the OKS (r = 0.772). However, this reliability was diminished for the individuals OHS (r = 0.670) and OKS (r = 0.629) using single measures intra-class correlation. Bland–Altman plots demonstrated wide variation in the individual patient’s ability to recall their preoperative score (95% CI ± 16 for OHS, 95% CI ± 13 for OKS). Conclusion. Prospective preoperative collection of OHS and OKS remains the benchmark. Using recalled scores one year following hip and knee arthroplasty is an alternative when used to assess a cohort of patients. However, the recall of an individual patient’s preoperative score should not be relied upon due to the diminished reliability and wide CI. Cite this article: T. F. M. Yeoman, N. D. Clement, D. Macdonald, M. Moran. Recall of preoperative Oxford Hip and Knee Scores one year after arthroplasty is an alternative and reliable technique when used for a cohort of patients. Bone Joint Res 2018;7:351–356. DOI: 10.1302/2046-3758.75.BJR-2017-0259.R1


Bone & Joint Research
Vol. 10, Issue 11 | Pages 714 - 722
1 Nov 2021
Qi W Feng X Zhang T Wu H Fang C Leung F

Aims. To fully verify the reliability and reproducibility of an experimental method in generating standardized micromotion for the rat femur fracture model. Methods. A modularized experimental device has been developed that allows rat models to be used instead of large animal models, with the aim of reducing systematic errors and time and money constraints on grouping. The bench test was used to determine the difference between the measured and set values of the micromotion produced by this device under different simulated loading weights. The displacement of the fixator under different loading conditions was measured by compression tests, which was used to simulate the unexpected micromotion caused by the rat’s ambulation. In vivo preliminary experiments with a small sample size were used to test the feasibility and effectiveness of the whole experimental scheme and surgical scheme. Results. The bench test showed that a weight loading < 500 g did not affect the operation of experimental device. The compression test demonstrated that the stiffness of the device was sufficient to keep the uncontrollable motion between fracture ends, resulting from the rat’s daily activities, within 1% strain. In vivo results on 15 rats prove that the device works reliably, without overburdening the experimental animals, and provides standardized micromotion reproductively at the fracture site according to the set parameters. Conclusion. Our device was able to investigate the effect of micromotion parameters on fracture healing by generating standardized micromotion to small animal models. Cite this article: Bone Joint Res 2021;10(11):714–722


Bone & Joint Research
Vol. 6, Issue 9 | Pages 530 - 534
1 Sep 2017
Krakow L Klockow A Roehner E Brodt S Eijer H Bossert J Matziolis G

Objectives. The determination of the volumetric polyethylene wear on explanted material requires complicated equipment, which is not available in many research institutions. Our aim in this study was to present and validate a method that only requires a set of polyetheretherketone balls and a laboratory balance to determine wear. Methods. The insert to be measured was placed on a balance, and a ball of the appropriate diameter was inserted. The cavity remaining between the ball and insert caused by wear was filled with contrast medium and the weight of the contrast medium was recorded. The volume was calculated from the known density of the liquid. The precision, inter- and intraobserver reliability, were determined by four investigators on four days using nine inserts with specified wear (0.094 ml to 1.626 ml), and the intra-class correlation coefficient was calculated. The feasibility of using this method in routine clinical practice and the time required for measurement were tested on 84 explanted inserts by one investigator. Results. In order to get the mean for all investigators and determinations, the deviation between the measured and specified wear was -0.08 ml . (sd. 0.12; -0.21 to 0.11). The interobserver reliability was 0.989 ml (95% confidence interval (CI) 0.964 to 0.997) and the intraobserver reliability was 0.941 for observer 1 (95% CI 0.846 to 0.985), 0.983 for observer 2 (95% CI 0.956 to 0.995), 0.939 for observer 3 (95% CI 0.855 to 0.984), and 0.934 for observer 4 (95% CI 0.790 to 0.984). The mean time required to examine the samples was two minutes . (sd. 2; 1 to 5). Conclusion. The method presented here was shown to be sufficiently precise for many settings and is a cost-effective and quick method of determining the volumetric wear of explanted acetabular components. However, the measurement of wear for scientific purposes will probably continue to involve more accurate and dedicated laboratory equipment. Cite this article: Bone Joint Res 2017;6:530–534


Bone & Joint Research
Vol. 5, Issue 4 | Pages 153 - 161
1 Apr 2016
Kleinlugtenbelt YV Nienhuis RW Bhandari M Goslings JC Poolman RW Scholtes VAB

Objectives. Patient-reported outcome measures (PROMs) are often used to evaluate the outcome of treatment in patients with distal radial fractures. Which PROM to select is often based on assessment of measurement properties, such as validity and reliability. Measurement properties are assessed in clinimetric studies, and results are often reviewed without considering the methodological quality of these studies. Our aim was to systematically review the methodological quality of clinimetric studies that evaluated measurement properties of PROMs used in patients with distal radial fractures, and to make recommendations for the selection of PROMs based on the level of evidence of each individual measurement property. Methods. A systematic literature search was performed in PubMed, EMbase, CINAHL and PsycINFO databases to identify relevant clinimetric studies. Two reviewers independently assessed the methodological quality of the studies on measurement properties, using the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. Level of evidence (strong / moderate / limited / lacking) for each measurement property per PROM was determined by combining the methodological quality and the results of the different clinimetric studies. Results. In all, 19 out of 1508 identified unique studies were included, in which 12 PROMs were rated. The Patient-rated wrist evaluation (PRWE) and the Disabilities of Arm, Shoulder and Hand questionnaire (DASH) were evaluated on most measurement properties. The evidence for the PRWE is moderate that its reliability, validity (content and hypothesis testing), and responsiveness are good. The evidence is limited that its internal consistency and cross-cultural validity are good, and its measurement error is acceptable. There is no evidence for its structural and criterion validity. The evidence for the DASH is moderate that its responsiveness is good. The evidence is limited that its reliability and the validity on hypothesis testing are good. There is no evidence for the other measurement properties. Conclusion. According to this systematic review, there is, at best, moderate evidence that the responsiveness of the PRWE and DASH are good, as are the reliability and validity of the PRWE. We recommend these PROMs in clinical studies in patients with distal radial fractures; however, more clinimetric studies of higher methodological quality are needed to adequately determine the other measurement properties. Cite this article: Dr Y. V. Kleinlugtenbelt. Are validated outcome measures used in distal radial fractures truly valid?: A critical assessment using the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. Bone Joint Res 2016;5:153–161. DOI: 10.1302/2046-3758.54.2000462


Bone & Joint Research
Vol. 7, Issue 7 | Pages 468 - 475
1 Jul 2018
He Q Sun H Shu L Zhu Y Xie X Zhan Y Luo C

Objectives. Researchers continue to seek easier ways to evaluate the quality of bone and screen for osteoporosis and osteopenia. Until recently, radiographic images of various parts of the body, except the distal femur, have been reappraised in the light of dual-energy X-ray absorptiometry (DXA) findings. The incidence of osteoporotic fractures around the knee joint in the elderly continues to increase. The aim of this study was to propose two new radiographic parameters of the distal femur for the assessment of bone quality. Methods. Anteroposterior radiographs of the knee and bone mineral density (BMD) and T-scores from DXA scans of 361 healthy patients were prospectively analyzed. The mean cortical bone thickness (CBTavg) and the distal femoral cortex index (DFCI) were the two parameters that were proposed and measured. Intra- and interobserver reliabilities were assessed. Correlations between the BMD and T-score and these parameters were investigated and their value in the diagnosis of osteoporosis and osteopenia was evaluated. Results. The DFCI, as a ratio, had higher reliability than the CBTavg. Both showed significant correlation with BMD and T-score. When compared with DFCI, CBTavg showed better correlation and was better for predicting osteoporosis and osteopenia. Conclusion. The CBTavg and DFCI are simple and reliable screening tools for the prediction of osteoporosis and osteopenia. The CBTavg is more accurate but the DFCI is easier to use in clinical practice. Cite this article: Q-F. He, H. Sun, L-Y. Shu, Y. Zhu, X-T. Xie, Y. Zhan, C-F. Luo. Radiographic predictors for bone mineral loss: Cortical thickness and index of the distal femur. Bone Joint Res 2018;7:468–475. DOI: 10.1302/2046-3758.77.BJR-2017-0332.R1


Bone & Joint Research
Vol. 5, Issue 4 | Pages 116 - 121
1 Apr 2016
Leow JM Clement ND Tawonsawatruk T Simpson CJ Simpson AHRW

Objectives. The radiographic union score for tibial (RUST) fractures was developed by Whelan et al to assess the healing of tibial fractures following intramedullary nailing. In the current study, the repeatability and reliability of the RUST score was evaluated in an independent centre (a) using the original description, (b) after further interpretation of the description of the score, and (c) with the immediate post-operative radiograph available for comparison. Methods. A total of 15 radiographs of tibial shaft fractures treated by intramedullary nailing (IM) were scored by three observers using the RUST system. Following discussion on how the criteria of the RUST system should be implemented, 45 sets (i.e. AP and lateral) of radiographs of IM nailed tibial fractures were scored by five observers. Finally, these 45 sets of radiographs were rescored with the baseline post-operative radiograph available for comparison. Results. The initial intraclass correlation (ICC) on the first 15 sets of radiographs was 0.67 (95% CI 0.63 to 0.71). However, the original description was being interpreted in different ways. After agreeing on the interpretation, the ICC on the second cohort improved to 0.75. The ICC improved even further to 0.79, when the baseline post-operative radiographs were available for comparison. Conclusion. This study demonstrates that the RUST scoring system is a reliable and repeatable outcome measure for assessing tibial fracture healing. Further improvement in the reliability of the scoring system can be obtained if the radiographs are compared with the baseline post-operative radiographs. Cite this article: Mr J.M. Leow. The radiographic union scale in tibial (RUST) fractures: Reliability of the outcome measure at an independent centre. Bone Joint Res 2016;5:116–121. DOI: 10.1302/2046-3758.54.2000628


Bone & Joint Research
Vol. 8, Issue 10 | Pages 502 - 508
1 Oct 2019
Mao W Ni H Li L He Y Chen X Tang H Dong Y

Objectives. Different criteria for assessing the reduction quality of trochanteric fractures have been reported. The Baumgaertner reduction quality criteria (BRQC) are relatively common and the Chang reduction quality criteria (CRQC) are relatively new. The objectives of the current study were to compare the reliability of the BRQC and CRQC in predicting mechanical complications and to investigate the clinical implications of the CRQC. Methods. A total of 168 patients were assessed in a retrospective observational study. Clinical information including age, sex, fracture side, American Society of Anesthesiologists (ASA) classification, tip-apex distance (TAD), fracture classification, reduction quality, blade position, BRQC, CRQC, bone quality, and the occurrence of mechanical complications were used in the statistical analysis. Results. A total of 127 patients were included in the full analysis, and mechanical complications were observed in 26 patients. The TAD, blade position, BRQC and CRQC were significantly associated with mechanical complications in the univariate analysis. Only the TAD (p = 0.025) and the CRQC (p < 0.001) showed significant results in the multivariate analysis. In the comparison of the receiver operating characteristic curves, the CRQC also performed better than the BRQC. Conclusion. The CRQC are reliable in predicting mechanical complications and are more reliable than the BRQC. Future studies could use the CRQC to assess fracture reduction quality. Intraoperatively, the surgeon should refer to the CRQC to achieve good reduction in trochanteric fractures. Cite this article: Bone Joint Res 2019;8:502–508


Bone & Joint Research
Vol. 8, Issue 3 | Pages 146 - 155
1 Mar 2019
Langton DJ Natu S Harrington CF Bowsher JG Nargol AVF

Objectives. We investigated the reliability of the cobalt-chromium (CoCr) synovial joint fluid ratio (JFR) in identifying the presence of a severe aseptic lymphocyte-dominated vasculitis-associated lesion (ALVAL) response and/or suboptimal taper performance (SOTP) following metal-on-metal (MoM) hip arthroplasty. We then examined the possibility that the CoCr JFR may influence the serum partitioning of Co and Cr. Methods. For part A, we included all revision surgeries carried out at our unit with the relevant data, including volumetric wear analysis, joint fluid (JF) Co and Cr concentrations, and ALVAL grade (n = 315). Receiver operating characteristic curves were constructed to assess the reliability of the CoCr JFR in identifying severe ALVAL and/or SOTP. For part B, we included only patients with unilateral prostheses who had given matched serum and whole blood samples for Co and Cr analysis (n = 155). Multiple regression was used to examine the influence of JF concentrations on the serum partitioning of Co and Cr in the blood. Results. A CoCr JFR > 1 showed a specificity of 83% (77% to 88%) and sensitivity of 63% (55% to 70%) for the detection of severe ALVAL and/or SOTP. In patients with CoCr JFRs > 1, the median blood Cr to serum Cr ratio was 0.99, compared with 0.71 in patients with CoCr JFRs < 1 (p < 0.001). Regression analysis demonstrated that the blood Cr to serum Cr value was positively associated with the JF Co concentration (p = 0.011) and inversely related to the JF Cr concentration (p < 0.001). Conclusion. Elevations in CoCr JFRs are associated with adverse biological (severe ALVAL) or tribocorrosive processes (SOTP). Comparison of serum Cr with blood Cr concentrations may be a useful additional clinical tool to help to identify these conditions. Cite this article: D. J. Langton, S. Natu, C. F. Harrington, J. G. Bowsher, A. V. F. Nargol. Is the synovial fluid cobalt-to-chromium ratio related to the serum partitioning of metal debris following metal-on-metal hip arthroplasty? Bone Joint Res 2019;8:146–155. DOI: 10.1302/2046-3758.83.BJR-2018-0049.R1