Advertisement for orthosearch.org.uk
Results 1 - 20 of 1019
Results per page:
Bone & Joint Open
Vol. 4, Issue 8 | Pages 584 - 593
15 Aug 2023
Sainio H Rämö L Reito A Silvasti-Lundell M Lindahl J

Aims. Several previously identified patient-, injury-, and treatment-related factors are associated with the development of nonunion in distal femur fractures. However, the predictive value of these factors is not well defined. We aimed to assess the predictive ability of previously identified risk factors in the development of nonunion leading to secondary surgery in distal femur fractures. Methods. We conducted a retrospective cohort study of adult patients with traumatic distal femur fracture treated with lateral locking plate between 2009 and 2018. The patients who underwent secondary surgery due to fracture healing problem or plate failure were considered having nonunion. Background knowledge of risk factors of distal femur fracture nonunion based on previous literature was used to form an initial set of variables. A logistic regression model was used with previously identified patient- and injury-related variables (age, sex, BMI, diabetes, smoking, periprosthetic fracture, open fracture, trauma energy, fracture zone length, fracture comminution, medial side comminution) in the first analysis and with treatment-related variables (different surgeon-controlled factors, e.g. plate length, screw placement, and proximal fixation) in the second analysis to predict the nonunion leading to secondary surgery in distal femur fractures. Results. We were able to include 299 fractures in 291 patients. Altogether, 31/299 fractures (10%) developed nonunion. In the first analysis, pseudo-R. 2. was 0.27 and area under the receiver operating characteristic curve (AUC) was 0.81. BMI was the most important variable in the prediction. In the second analysis, pseudo-R. 2. was 0.06 and AUC was 0.67. Plate length was the most important variable in the prediction. Conclusion. The model including patient- and injury-related factors had moderate fit and predictive ability in the prediction of distal femur fracture nonunion leading to secondary surgery. BMI was the most important variable in prediction of nonunion. Surgeon-controlled factors had a minor role in prediction of nonunion. Cite this article: Bone Jt Open 2023;4(8):584–593


The Bone & Joint Journal
Vol. 99-B, Issue 7 | Pages 927 - 933
1 Jul 2017
Poltaretskyi S Chaoui J Mayya M Hamitouche C Bercik MJ Boileau P Walch G

Aims. Restoring the pre-morbid anatomy of the proximal humerus is a goal of anatomical shoulder arthroplasty, but reliance is placed on the surgeon’s experience and on anatomical estimations. The purpose of this study was to present a novel method, ‘Statistical Shape Modelling’, which accurately predicts the pre-morbid proximal humeral anatomy and calculates the 3D geometric parameters needed to restore normal anatomy in patients with severe degenerative osteoarthritis or a fracture of the proximal humerus. Materials and Methods. From a database of 57 humeral CT scans 3D humeral reconstructions were manually created. The reconstructions were used to construct a statistical shape model (SSM), which was then tested on a second set of 52 scans. For each humerus in the second set, 3D reconstructions of four diaphyseal segments of varying lengths were created. These reconstructions were chosen to mimic severe osteoarthritis, a fracture of the surgical neck of the humerus and a proximal humeral fracture with diaphyseal extension. The SSM was then applied to the diaphyseal segments to see how well it predicted proximal morphology, using the actual proximal humeral morphology for comparison. Results. With the metaphysis included, mimicking osteoarthritis, the errors of prediction for retroversion, inclination, height, radius of curvature and posterior and medial offset of the head of the humerus were 2.9° (± 2.3°), 4.0° (± 3.3°), 1.0 mm (± 0.8 mm), 0.8 mm (± 0.6 mm), 0.7 mm (± 0.5 mm) and 1.0 mm (± 0.7 mm), respectively. With the metaphysis excluded, mimicking a fracture of the surgical neck, the errors of prediction for retroversion, inclination, height, radius of curvature and posterior and medial offset of the head of the humerus were 3.8° (± 2.9°), 3.9° (± 3.4°), 2.4 mm (± 1.9 mm), 1.3 mm (± 0.9 mm), 0.8 mm (± 0.5 mm) and 0.9 mm (± 0.6 mm), respectively. Conclusion. This study reports a novel, computerised method that accurately predicts the pre-morbid proximal humeral anatomy even in challenging situations. This information can be used in the surgical planning and operative reconstruction of patients with severe degenerative osteoarthritis or with a fracture of the proximal humerus. Cite this article: Bone Joint J 2017;99-B:927–33


Bone & Joint Open
Vol. 5, Issue 11 | Pages 962 - 970
4 Nov 2024
Suter C Mattila H Ibounig T Sumrein BO Launonen A Järvinen TLN Lähdeoja T Rämö L

Aims

Though most humeral shaft fractures heal nonoperatively, up to one-third may lead to nonunion with inferior outcomes. The Radiographic Union Score for HUmeral Fractures (RUSHU) was created to identify high-risk patients for nonunion. Our study evaluated the RUSHU’s prognostic performance at six and 12 weeks in discriminating nonunion within a significantly larger cohort than before.

Methods

Our study included 226 nonoperatively treated humeral shaft fractures. We evaluated the interobserver reliability and intraobserver reproducibility of RUSHU scoring using intraclass correlation coefficients (ICCs). Additionally, we determined the optimal cut-off thresholds for predicting nonunion using the receiver operating characteristic (ROC) method.


Bone & Joint Research
Vol. 5, Issue 6 | Pages 232 - 238
1 Jun 2016
Tanaka A Yoshimura Y Aoki K Kito M Okamoto M Suzuki S Momose T Kato H

Objectives. Our objective was to predict the knee extension strength and post-operative function in quadriceps resection for soft-tissue sarcoma of the thigh. Methods. A total of 18 patients (14 men, four women) underwent total or partial quadriceps resection for soft-tissue sarcoma of the thigh between 2002 and 2014. The number of resected quadriceps was surveyed, knee extension strength was measured with the Biodex isokinetic dynamometer system (affected side/unaffected side) and relationships between these were examined. The Musculoskeletal Tumor Society (MSTS) score, Toronto Extremity Salvage Score (TESS), European Quality of Life-5 Dimensions (EQ-5D) score and the Short Form 8 were used to evaluate post-operative function and examine correlations with extension strength. The cutoff value for extension strength to expect good post-operative function was also calculated using a receiver operating characteristic (ROC) curve and Fisher’s exact test. Results. Extension strength decreased when the number of resected quadriceps increased (p < 0.001), and was associated with lower MSTS score, TESS and EQ-5D (p = 0.004, p = 0.005, p = 0.006, respectively). Based on the functional evaluation scales, the cutoff value of extension strength was 56.2%, the equivalent to muscle strength with resection of up to two muscles. Conclusion. Good post-operative results can be expected if at least two quadriceps muscles are preserved. Cite this article: A. Tanaka, Y. Yoshimura, K. Aoki, M. Kito, M. Okamoto, S. Suzuki, T. Momose, H. Kato. Knee extension strength and post-operative functional prediction in quadriceps resection for soft-tissue sarcoma of the thigh. Bone Joint Res 2016;5:232–238. DOI: 10.1302/2046-3758.56.2000631


Bone & Joint Research
Vol. 14, Issue 2 | Pages 111 - 123
18 Feb 2025
Wang J Shan L Hang J Li H Meng Y Cao W Gu C Dai J Tao L

Aims. We aimed to develop and validate a novel prediction model for osteoporosis based on serotonin, fat-soluble vitamins, and bone turnover markers to improve prediction accuracy of osteoporosis. Methods. Postmenopausal women aged 55 to 65 years were recruited and divided into three groups based on DXA (normal, osteopenia, and osteoporosis). A total of 109 participants were included in this study and split into healthy (39/109, 35.8%), osteopenia (35/109, 32.1%), and osteoporosis groups (35/109, 32.1%). Serum concentrations of serotonin, fat-soluble vitamins, and bone turnover markers of participants were measured. Stepwise discriminant analysis was performed to identify efficient predictors for osteoporosis. The prediction model was developed based on Bayes and Fisher’s discriminant functions, and validated via leave-one-out cross-validation. Normal and empirical volume under the receiver operating characteristic (ROC) surface (VUS) tests were used to evaluate predictive effects of variables in the prediction model. Results. Significant variables including oestrogen (E2), total procollagen type 1 amino-terminal propeptide (TP1NP), parathyroid hormone (PTH), BMI, vitamin K, serotonin, osteocalcin (OSTEOC), vitamin A, and vitamin D3 were used for the development of the prediction model. The training accuracy for normal, osteopenia, and osteoporosis is 74.4% (29/39), 80.0% (28/35), and 85.7% (30/35), respectively, while the total training accuracy is 79.8% (87/109). The internal validation showed excellent performance with 72.5% testing accuracy (72/109). Among these variables, serotonin and vitamin K exert important roles in the prediction of osteoporosis. Conclusion. We successfully developed and validated a novel prediction model for osteoporosis based on serum concentrations of serotonin, fat-soluble vitamins, and bone turnover markers. In addition, interactive communication between serotonin and fat-soluble vitamins was observed to be critical for bone health in this study. Cite this article: Bone Joint Res 2025;14(2):111–123


Bone & Joint Open
Vol. 4, Issue 10 | Pages 750 - 757
10 Oct 2023
Brenneis M Thewes N Holder J Stief F Braun S

Aims. Accurate skeletal age and final adult height prediction methods in paediatric orthopaedics are crucial for determining optimal timing of growth-guiding interventions and minimizing complications in treatments of various conditions. This study aimed to evaluate the accuracy of final adult height predictions using the central peak height (CPH) method with long leg X-rays and four different multiplier tables. Methods. This study included 31 patients who underwent temporary hemiepiphysiodesis for varus or valgus deformity of the leg between 2014 and 2020. The skeletal age at surgical intervention was evaluated using the CPH method with long leg radiographs. The true final adult height (FH. TRUE. ) was determined when the growth plates were closed. The final height prediction accuracy of four different multiplier tables (1. Bayley and Pinneau; 2. Paley et al; 3. Sanders – Greulich and Pyle (SGP); and 4. Sanders – peak height velocity (PHV)) was then compared using either skeletal age or chronological age. Results. All final adult height predictions overestimated the FH. TRUE. , with the SGP multiplier table having the lowest overestimation and lowest absolute deviation when using both chronological age and skeletal age. There were no significant differences in final height prediction accuracy between using skeletal age and chronological age with PHV (p = 0.652) or SGP multiplier tables (p = 0.969). Adult height predictions with chronological age and SGP (r = 0.769; p ≤ 0.001), as well as chronological age and PHV (r = 0.822; p ≤ 0.001), showed higher correlations with FH. TRUE. than predictions with skeletal age and SGP (r = 0.657; p ≤ 0.001) or skeletal age and PHV (r = 0.707; p ≤ 0.001). Conclusion. There was no significant improvement in adult height prediction accuracy when using the CPH method compared to chronological age alone. The study concludes that there is no advantage in routinely using the CPH method for skeletal age determination over the simple use of chronological age. The findings highlight the need for more accurate methods to predict final adult height in contemporary patient populations. Cite this article: Bone Jt Open 2023;4(10):750–757


The Bone & Joint Journal
Vol. 106-B, Issue 2 | Pages 203 - 211
1 Feb 2024
Park JH Won J Kim H Kim Y Kim S Han I

Aims. This study aimed to compare the performance of survival prediction models for bone metastases of the extremities (BM-E) with pathological fractures in an Asian cohort, and investigate patient characteristics associated with survival. Methods. This retrospective cohort study included 469 patients, who underwent surgery for BM-E between January 2009 and March 2022 at a tertiary hospital in South Korea. Postoperative survival was calculated using the PATHFx3.0, SPRING13, OPTIModel, SORG, and IOR models. Model performance was assessed with area under the curve (AUC), calibration curve, Brier score, and decision curve analysis. Cox regression analyses were performed to evaluate the factors contributing to survival. Results. The SORG model demonstrated the highest discriminatory accuracy with AUC (0.80 (95% confidence interval (CI) 0.76 to 0.85)) at 12 months. In calibration analysis, the PATHfx3.0 and OPTIModel models underestimated survival, while the SPRING13 and IOR models overestimated survival. The SORG model exhibited excellent calibration with intercepts of 0.10 (95% CI -0.13 to 0.33) at 12 months. The SORG model also had lower Brier scores than the null score at three and 12 months, indicating good overall performance. Decision curve analysis showed that all five survival prediction models provided greater net benefit than the default strategy of operating on either all or no patients. Rapid growth cancer and low serum albumin levels were associated with three-, six-, and 12-month survival. Conclusion. State-of-art survival prediction models for BM-E (PATHFx3.0, SPRING13, OPTIModel, SORG, and IOR models) are useful clinical tools for orthopaedic surgeons in the decision-making process for the treatment in Asian patients, with SORG models offering the best predictive performance. Rapid growth cancer and serum albumin level are independent, statistically significant factors contributing to survival following surgery of BM-E. Further refinement of survival prediction models will bring about informed and patient-specific treatment of BM-E. Cite this article: Bone Joint J 2024;106-B(2):203–211


The Bone & Joint Journal
Vol. 104-B, Issue 9 | Pages 1011 - 1016
1 Sep 2022
Acem I van de Sande MAJ

Prediction tools are instruments which are commonly used to estimate the prognosis in oncology and facilitate clinical decision-making in a more personalized manner. Their popularity is shown by the increasing numbers of prediction tools, which have been described in the medical literature. Many of these tools have been shown to be useful in the field of soft-tissue sarcoma of the extremities (eSTS). In this annotation, we aim to provide an overview of the available prediction tools for eSTS, provide an approach for clinicians to evaluate the performance and usefulness of the available tools for their own patients, and discuss their possible applications in the management of patients with an eSTS. Cite this article: Bone Joint J 2022;104-B(9):1011–1016


Bone & Joint Open
Vol. 5, Issue 3 | Pages 243 - 251
25 Mar 2024
Wan HS Wong DLL To CS Meng N Zhang T Cheung JPY

Aims. This systematic review aims to identify 3D predictors derived from biplanar reconstruction, and to describe current methods for improving curve prediction in patients with mild adolescent idiopathic scoliosis. Methods. A comprehensive search was conducted by three independent investigators on MEDLINE, PubMed, Web of Science, and Cochrane Library. Search terms included “adolescent idiopathic scoliosis”,“3D”, and “progression”. The inclusion and exclusion criteria were carefully defined to include clinical studies. Risk of bias was assessed with the Quality in Prognostic Studies tool (QUIPS) and Appraisal tool for Cross-Sectional Studies (AXIS), and level of evidence for each predictor was rated with the Grading of Recommendations, Assessment, Development, and Evaluations (GRADE) approach. In all, 915 publications were identified, with 377 articles subjected to full-text screening; overall, 31 articles were included. Results. Torsion index (TI) and apical vertebral rotation (AVR) were identified as accurate predictors of curve progression in early visits. Initial TI > 3.7° and AVR > 5.8° were predictive of curve progression. Thoracic hypokyphosis was inconsistently observed in progressive curves with weak evidence. While sagittal wedging was observed in mild curves, there is insufficient evidence for its correlation with curve progression. In curves with initial Cobb angle < 25°, Cobb angle was a poor predictor for future curve progression. Prediction accuracy was improved by incorporating serial reconstructions in stepwise layers. However, a lack of post-hoc analysis was identified in studies involving geometrical models. Conclusion. For patients with mild curves, TI and AVR were identified as predictors of curve progression, with TI > 3.7° and AVR > 5.8° found to be important thresholds. Cobb angle acts as a poor predictor in mild curves, and more investigations are required to assess thoracic kyphosis and wedging as predictors. Cumulative reconstruction of radiographs improves prediction accuracy. Comprehensive analysis between progressive and non-progressive curves is recommended to extract meaningful thresholds for clinical prognostication. Cite this article: Bone Jt Open 2024;5(3):243–251


Bone & Joint Open
Vol. 4, Issue 3 | Pages 168 - 181
14 Mar 2023
Dijkstra H Oosterhoff JHF van de Kuit A IJpma FFA Schwab JH Poolman RW Sprague S Bzovsky S Bhandari M Swiontkowski M Schemitsch EH Doornberg JN Hendrickx LAM

Aims. To develop prediction models using machine-learning (ML) algorithms for 90-day and one-year mortality prediction in femoral neck fracture (FNF) patients aged 50 years or older based on the Hip fracture Evaluation with Alternatives of Total Hip arthroplasty versus Hemiarthroplasty (HEALTH) and Fixation using Alternative Implants for the Treatment of Hip fractures (FAITH) trials. Methods. This study included 2,388 patients from the HEALTH and FAITH trials, with 90-day and one-year mortality proportions of 3.0% (71/2,388) and 6.4% (153/2,388), respectively. The mean age was 75.9 years (SD 10.8) and 65.9% of patients (1,574/2,388) were female. The algorithms included patient and injury characteristics. Six algorithms were developed, internally validated and evaluated across discrimination (c-statistic; discriminative ability between those with risk of mortality and those without), calibration (observed outcome compared to the predicted probability), and the Brier score (composite of discrimination and calibration). Results. The developed algorithms distinguished between patients at high and low risk for 90-day and one-year mortality. The penalized logistic regression algorithm had the best performance metrics for both 90-day (c-statistic 0.80, calibration slope 0.95, calibration intercept -0.06, and Brier score 0.039) and one-year (c-statistic 0.76, calibration slope 0.86, calibration intercept -0.20, and Brier score 0.074) mortality prediction in the hold-out set. Conclusion. Using high-quality data, the ML-based prediction models accurately predicted 90-day and one-year mortality in patients aged 50 years or older with a FNF. The final models must be externally validated to assess generalizability to other populations, and prospectively evaluated in the process of shared decision-making. Cite this article: Bone Jt Open 2023;4(3):168–181


The Bone & Joint Journal
Vol. 106-B, Issue 1 | Pages 19 - 27
1 Jan 2024
Tang H Guo S Ma Z Wang S Zhou Y

Aims. The aim of this study was to evaluate the reliability and validity of a patient-specific algorithm which we developed for predicting changes in sagittal pelvic tilt after total hip arthroplasty (THA). Methods. This retrospective study included 143 patients who underwent 171 THAs between April 2019 and October 2020 and had full-body lateral radiographs preoperatively and at one year postoperatively. We measured the pelvic incidence (PI), the sagittal vertical axis (SVA), pelvic tilt, sacral slope (SS), lumbar lordosis (LL), and thoracic kyphosis to classify patients into types A, B1, B2, B3, and C. The change of pelvic tilt was predicted according to the normal range of SVA (0 mm to 50 mm) for types A, B1, B2, and B3, and based on the absolute value of one-third of the PI-LL mismatch for type C patients. The reliability of the classification of the patients and the prediction of the change of pelvic tilt were assessed using kappa values and intraclass correlation coefficients (ICCs), respectively. Validity was assessed using the overall mean error and mean absolute error (MAE) for the prediction of the change of pelvic tilt. Results. The kappa values were 0.927 (95% confidence interval (CI) 0.861 to 0.992) and 0.945 (95% CI 0.903 to 0.988) for the inter- and intraobserver reliabilities, respectively, and the ICCs ranged from 0.919 to 0.997. The overall mean error and MAE for the prediction of the change of pelvic tilt were -0.3° (SD 3.6°) and 2.8° (SD 2.4°), respectively. The overall absolute change of pelvic tilt was 5.0° (SD 4.1°). Pre- and postoperative values and changes in pelvic tilt, SVA, SS, and LL varied significantly among the five types of patient. Conclusion. We found that the proposed algorithm was reliable and valid for predicting the standing pelvic tilt after THA. Cite this article: Bone Joint J 2024;106-B(1):19–27


Orthopaedic Proceedings
Vol. 106-B, Issue SUPP_19 | Pages 31 - 31
22 Nov 2024
Yoon S Jutte P Soriano A Sousa R Zijlstra W Wouthuyzen-Bakker M
Full Access

Aim. This study aimed to externally validate promising preoperative PJI prediction models in a recent, multinational European cohort. Method. Three preoperative PJI prediction models (by Tan et al., Del Toro et al., and Bülow et al.) which previously demonstrated high levels of accuracy were selected for validation. A multicenter retrospective observational analysis was performed of patients undergoing total hip arthroplasty (THA) and total knee arthroplasty (TKA) between January 2020 and December 2021 and treated at centers in the Netherlands, Portugal, and Spain. Patient characteristics were compared between our cohort and those used to develop the prediction models. Model performance was assessed through discrimination and calibration. Results. A total of 2684 patients were included of whom 60 developed a PJI (2.2%). Our patient cohort differed from the models’ original cohorts in terms of demographic variables, procedural variables, and the prevalence of comorbidities. The c-statistics for the Tan, Del Toro, and Bülow models were 0.72, 0.69, and 0.72 respectively. Calibration was reasonable, but precise percentage estimates for PJI risk were most accurate for predicted risks up to 3-4%; the Tan model overestimated risks above 4%, while the Del Toro model underestimated risks above 3%. Conclusions. In this multinational cohort study, the Tan, Del Toro, and Bülow PJI prediction models were found to be externally valid for classifying high risk patients for developing a PJI. These models hold promise for clinical application to enhance preoperative patient counseling and targeted prevention strategies. Keywords. Periprosthetic Joint Infection (PJI), High Risk Groups, Prediction Models, Validation, Infection Prevention


Bone & Joint Research
Vol. 13, Issue 4 | Pages 184 - 192
18 Apr 2024
Morita A Iida Y Inaba Y Tezuka T Kobayashi N Choe H Ike H Kawakami E

Aims. This study was designed to develop a model for predicting bone mineral density (BMD) loss of the femur after total hip arthroplasty (THA) using artificial intelligence (AI), and to identify factors that influence the prediction. Additionally, we virtually examined the efficacy of administration of bisphosphonate for cases with severe BMD loss based on the predictive model. Methods. The study included 538 joints that underwent primary THA. The patients were divided into groups using unsupervised time series clustering for five-year BMD loss of Gruen zone 7 postoperatively, and a machine-learning model to predict the BMD loss was developed. Additionally, the predictor for BMD loss was extracted using SHapley Additive exPlanations (SHAP). The patient-specific efficacy of bisphosphonate, which is the most important categorical predictor for BMD loss, was examined by calculating the change in predictive probability when hypothetically switching between the inclusion and exclusion of bisphosphonate. Results. Time series clustering allowed us to divide the patients into two groups, and the predictive factors were identified including patient- and operation-related factors. The area under the receiver operating characteristic (ROC) curve (AUC) for the BMD loss prediction averaged 0.734. Virtual administration of bisphosphonate showed on average 14% efficacy in preventing BMD loss of zone 7. Additionally, stem types and preoperative triglyceride (TG), creatinine (Cr), estimated glomerular filtration rate (eGFR), and creatine kinase (CK) showed significant association with the estimated patient-specific efficacy of bisphosphonate. Conclusion. Periprosthetic BMD loss after THA is predictable based on patient- and operation-related factors, and optimal prescription of bisphosphonate based on the prediction may prevent BMD loss. Cite this article: Bone Joint Res 2024;13(4):184–192


The Bone & Joint Journal
Vol. 107-B, Issue 3 | Pages 337 - 345
1 Mar 2025
Wang D Wang Q Cui P Wang S Han D Chen X Lu S

Aims. Adult spinal deformity (ASD) surgery can reduce pain and disability. However, the actual surgical efficacy of ASD in doing so is far from desirable, with frequent complications and limited improvement in quality of life. The accurate prediction of surgical outcome is crucial to the process of clinical decision-making. Consequently, the aim of this study was to develop and validate a model for predicting an ideal surgical outcome (ISO) two years after ASD surgery. Methods. We conducted a retrospective analysis of 458 consecutive patients who had undergone spinal fusion surgery for ASD between January 2016 and June 2022. The outcome of interest was achievement of the ISO, defined as an improvement in patient-reported outcomes exceeding the minimal clinically important difference, with no postoperative complications. Three machine-learning (ML) algorithms – LASSO, RFE, and Boruta – were used to identify key variables from the collected data. The dataset was randomly split into training (60%) and test (40%) sets. Five different ML models were trained, including logistic regression, random forest, XGBoost, LightGBM, and multilayer perceptron. The primary model evaluation metric was area under the receiver operating characteristic curve (AUROC). Results. The analysis included 208 patients (mean age 64.62 years (SD 8.21); 48 male (23.1%), 160 female (76.9%)). Overall, 42.8% of patients (89/208) achieved the ideal surgical outcome. Eight features were identified as key variables affecting prognosis: depression, osteoporosis, frailty, failure of pelvic compensation, relative functional cross-sectional area of the paraspinal muscles, postoperative sacral slope, pelvic tilt match, and sagittal age-adjusted score match. The best prediction model was LightGBM, achieving the following performance metrics: AUROC 0.888 (95% CI 0.810 to 0.966); accuracy 0.843; sensitivity 0.829; specificity 0.854; positive predictive value 0.806; and negative predictive value 0.872. Conclusion. In this prognostic study, we developed a machine-learning model that accurately predicted outcome after surgery for ASD. The model is built on routinely modifiable indicators, thereby facilitating its integration into clinical practice to promote optimized decision-making. Cite this article: Bone Joint J 2025;107-B(3):337–345


Orthopaedic Proceedings
Vol. 105-B, Issue SUPP_16 | Pages 23 - 23
17 Nov 2023
Castagno S Birch M van der Schaar M McCaskie A
Full Access

Abstract. Introduction. Precision health aims to develop personalised and proactive strategies for predicting, preventing, and treating complex diseases such as osteoarthritis (OA), a degenerative joint disease affecting over 300 million people worldwide. Due to OA heterogeneity, which makes developing effective treatments challenging, identifying patients at risk for accelerated disease progression is essential for efficient clinical trial design and new treatment target discovery and development. Objectives. This study aims to create a trustworthy and interpretable precision health tool that predicts rapid knee OA progression based on baseline patient characteristics using an advanced automated machine learning (autoML) framework, “Autoprognosis 2.0”. Methods. All available 2-year follow-up periods of 600 patients from the FNIH OA Biomarker Consortium were analysed using “Autoprognosis 2.0” in two separate approaches, with distinct definitions of clinical outcomes: multi-class predictions (categorising patients into non-progressors, pain-only progressors, radiographic-only progressors, and both pain and radiographic progressors) and binary predictions (categorising patients into non-progressors and progressors). Models were developed using a training set of 1352 instances and all available variables (including clinical, X-ray, MRI, and biochemical features), and validated through both stratified 10-fold cross-validation and hold-out validation on a testing set of 339 instances. Model performance was assessed using multiple evaluation metrics, such as AUC-ROC, AUC-PRC, F1-score, precision, and recall. Additionally, interpretability analyses were carried out to identify important predictors of rapid disease progression. Results. Our final models yielded high accuracy scores for both multi-class predictions (AUC-ROC: 0.858, 95% CI: 0.856–0.860; AUC-PRC: 0.675, 95% CI: 0.671–0.679; F1-score: 0.560, 95% CI: 0.554–0.566) and binary predictions (AUC-ROC: 0.717, 95% CI: 0.712–0.722; AUC-PRC: 0.620, 95% CI: 0.616–0.624; F1-score: 0.676, 95% CI: 0.673–0679). Important predictors of rapid disease progression included the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) scores and MRI features. Our models were further successfully validated using a hold-out dataset, which was previously omitted from model development and training (AUC-ROC: 0.877 for multi-class predictions; AUC-ROC: 0.746 for binary predictions). Additionally, accurate ML models were developed for predicting OA progression in a subgroup of patients aged 65 or younger (AUC-ROC: 0.862, 95% CI: 0.861–0.863 for multi-class predictions; AUC-ROC: 0.736, 95% CI: 0.734–0.738 for binary predictions). Conclusions. This study presents a reliable and interpretable precision health tool for predicting rapid knee OA progression using “Autoprognosis 2.0”. Our models provide accurate predictions and offer insights into important predictors of rapid disease progression. Furthermore, the transparency and interpretability of our methods may facilitate their acceptance by clinicians and patients, enabling effective utilisation in clinical practice. Future work should focus on refining these models by increasing the sample size, integrating additional features, and using independent datasets for external validation. Declaration of Interest. (b) declare that there is no conflict of interest that could be perceived as prejudicing the impartiality of the research reported:I declare that there is no conflict of interest that could be perceived as prejudicing the impartiality of the research project


Orthopaedic Proceedings
Vol. 106-B, Issue SUPP_2 | Pages 19 - 19
2 Jan 2024
Castagno S Birch M van der Schaar M McCaskie A
Full Access

Precision health aims to develop personalised and proactive strategies for predicting, preventing, and treating complex diseases such as osteoarthritis (OA). Due to OA heterogeneity, which makes developing effective treatments challenging, identifying patients at risk for accelerated disease progression is essential for efficient clinical trial design and new treatment target discovery and development. To create a reliable and interpretable precision health tool that predicts rapid knee OA progression over a 2-year period from baseline patient characteristics using an advanced automated machine learning (autoML) framework, “Autoprognosis 2.0”. All available 2-year follow-up periods of 600 patients from the FNIH OA Biomarker Consortium were analysed using “Autoprognosis 2.0” in two separate approaches, with distinct definitions of clinical outcomes: multi-class predictions (categorising disease progression into pain and/or radiographic progression) and binary predictions. Models were developed using a training set of 1352 instances and all available variables (including clinical, X-ray, MRI, and biochemical features), and validated through both stratified 10-fold cross-validation and hold-out validation on a testing set of 339 instances. Model performance was assessed using multiple evaluation metrics. Interpretability analyses were carried out to identify important predictors of progression. Our final models yielded higher accuracy scores for multi-class predictions (AUC-ROC: 0.858, 95% CI: 0.856-0.860) compared to binary predictions (AUC-ROC: 0.717, 95% CI: 0.712-0.722). Important predictors of rapid disease progression included WOMAC scores and MRI features. Additionally, accurate ML models were developed for predicting OA progression in a subgroup of patients aged 65 or younger. This study presents a reliable and interpretable precision health tool for predicting rapid knee OA progression. Our models provide accurate predictions and, importantly, allow specific predictors of rapid disease progression to be identified. Furthermore, the transparency and explainability of our methods may facilitate their acceptance by clinicians and patients, enabling effective translation to clinical practice


Orthopaedic Proceedings
Vol. 105-B, Issue SUPP_8 | Pages 102 - 102
11 Apr 2023
Mosseri J Lex J Abbas A Toor J Ravi B Whyne C Khalil E
Full Access

Total knee and hip arthroplasty (TKA and THA) are the most commonly performed surgical procedures, the costs of which constitute a significant healthcare burden. Improving access to care for THA/TKA requires better efficiency. It is hypothesized that this may be possible through a two-stage approach that utilizes prediction of surgical time to enable optimization of operating room (OR) schedules. Data from 499,432 elective unilateral arthroplasty procedures, including 302,490 TKAs, and 196,942 THAs, performed from 2014-2019 was extracted from the American College of Surgeons (ACS) National Surgical and Quality Improvement (NSQIP) database. A deep multilayer perceptron model was trained to predict duration of surgery (DOS) based on pre-operative clinical and biochemical patient factors. A two-stage approach, utilizing predicted DOS from a held out “test” dataset, was utilized to inform the daily OR schedule. The objective function of the optimization was the total OR utilization, with a penalty for overtime. The scheduling problem and constraints were simulated based on a high-volume elective arthroplasty centre in Canada. This approach was compared to current patient scheduling based on mean procedure DOS. Approaches were compared by performing 1000 simulated OR schedules. The predict then optimize approach achieved an 18% increase in OR utilization over the mean regressor. The two-stage approach reduced overtime by 25-minutes per OR day, however it created a 7-minute increase in underutilization. Better objective value was seen in 85.1% of the simulations. With deep learning prediction and mathematical optimization of patient scheduling it is possible to improve overall OR utilization compared to typical scheduling practices. Maximizing utilization of existing healthcare resources can, in limited resource environments, improve patient's access to arthritis care by increasing patient throughput, reducing surgical wait times and in the immediate future, help clear the backlog associated with the COVID-19 pandemic


The Bone & Joint Journal
Vol. 102-B, Issue 9 | Pages 1183 - 1193
14 Sep 2020
Anis HK Strnad GJ Klika AK Zajichek A Spindler KP Barsoum WK Higuera CA Piuzzi NS

Aims. The purpose of this study was to develop a personalized outcome prediction tool, to be used with knee arthroplasty patients, that predicts outcomes (lengths of stay (LOS), 90 day readmission, and one-year patient-reported outcome measures (PROMs) on an individual basis and allows for dynamic modifiable risk factors. Methods. Data were prospectively collected on all patients who underwent total or unicompartmental knee arthroplasty at a between July 2015 and June 2018. Cohort 1 (n = 5,958) was utilized to develop models for LOS and 90 day readmission. Cohort 2 (n = 2,391, surgery date 2015 to 2017) was utilized to develop models for one-year improvements in Knee Injury and Osteoarthritis Outcome Score (KOOS) pain score, KOOS function score, and KOOS quality of life (QOL) score. Model accuracies within the imputed data set were assessed through cross-validation with root mean square errors (RMSEs) and mean absolute errors (MAEs) for the LOS and PROMs models, and the index of prediction accuracy (IPA), and area under the curve (AUC) for the readmission models. Model accuracies in new patient data sets were assessed with AUC. Results. Within the imputed datasets, the LOS (RMSE 1.161) and PROMs models (RMSE 15.775, 11.056, 21.680 for KOOS pain, function, and QOL, respectively) demonstrated good accuracy. For all models, the accuracy of predicting outcomes in a new set of patients were consistent with the cross-validation accuracy overall. Upon validation with a new patient dataset, the LOS and readmission models demonstrated high accuracy (71.5% and 65.0%, respectively). Similarly, the one-year PROMs improvement models demonstrated high accuracy in predicting ten-point improvements in KOOS pain (72.1%), function (72.9%), and QOL (70.8%) scores. Conclusion. The data-driven models developed in this study offer scalable predictive tools that can accurately estimate the likelihood of improved pain, function, and quality of life one year after knee arthroplasty as well as LOS and 90 day readmission. Cite this article: Bone Joint J 2020;102-B(9):1183–1193


Orthopaedic Proceedings
Vol. 105-B, Issue SUPP_3 | Pages 5 - 5
23 Feb 2023
Jadresic MC Baker J
Full Access

Numerous prediction tools are available for estimating postoperative risk following spine surgery. External validation studies have shown mixed results. We present the development, validation, and comparative evaluation of novel tool (NZSpine) for modelling risk of complications within 30 days of spine surgery. Data was gathered retrospectively from medical records of patients who underwent spine surgery at Waikato Hospital between January 2019 and December 2020 (n = 488). Variables were selected a priori based on previous evidence and clinical judgement. Postoperative adverse events were classified objectively using the Comprehensive Complication Index. Models were constructed for the occurrence of any complication and significant complications (based on CCI >26). Performance and clinical utility of the novel model was compared against SpineSage (. https://depts.washington.edu/spinersk/. ), an extant online tool which we have shown in unpublished work to be valid in our local population. Overall complication rate was 34%. In the multivariate model, higher age, increased surgical invasiveness and the presence of preoperative anemia were most strongly predictive of any postoperative complication (OR = 1.03, 1.09, 2.1 respectively, p <0.001), whereas the occurrence of a major postoperative complication (CCI >26) was most strongly associated with the presence of respiratory disease (OR = 2.82, p <0.001). Internal validation using the bootstrapped models showed the model was robust, with an AUC of 0.73. Using sensitivity analysis, 80% of the model's predictions were correct. By comparison SpineSage had an AUC of 0.71, and in decision curve analysis the novel model showed greater expected benefit at all thresholds of risk. NZSpine is a novel risk assessment tool for patients undergoing acute and elective spine surgery and may help inform clinicians and patients of their prognosis. Use of an objective tool may help to provide uniformity between DHBs when completing the “clinician assessment of risk” section of the national prioritization tool


The Bone & Joint Journal
Vol. 104-B, Issue 1 | Pages 97 - 102
1 Jan 2022
Hijikata Y Kamitani T Nakahara M Kumamoto S Sakai T Itaya T Yamazaki H Ogawa Y Kusumegi A Inoue T Yoshida T Furue N Fukuhara S Yamamoto Y

Aims. To develop and internally validate a preoperative clinical prediction model for acute adjacent vertebral fracture (AVF) after vertebral augmentation to support preoperative decision-making, named the after vertebral augmentation (AVA) score. Methods. In this prognostic study, a multicentre, retrospective single-level vertebral augmentation cohort of 377 patients from six Japanese hospitals was used to derive an AVF prediction model. Backward stepwise selection (p < 0.05) was used to select preoperative clinical and imaging predictors for acute AVF after vertebral augmentation for up to one month, from 14 predictors. We assigned a score to each selected variable based on the regression coefficient and developed the AVA scoring system. We evaluated sensitivity and specificity for each cut-off, area under the curve (AUC), and calibration as diagnostic performance. Internal validation was conducted using bootstrapping to correct the optimism. Results. Of the 377 patients used for model derivation, 58 (15%) had an acute AVF postoperatively. The following preoperative measures on multivariable analysis were summarized in the five-point AVA score: intravertebral instability (≥ 5 mm), focal kyphosis (≥ 10°), duration of symptoms (≥ 30 days), intravertebral cleft, and previous history of vertebral fracture. Internal validation showed a mean optimism of 0.019 with a corrected AUC of 0.77. A cut-off of ≤ one point was chosen to classify a low risk of AVF, for which only four of 137 patients (3%) had AVF with 92.5% sensitivity and 45.6% specificity. A cut-off of ≥ four points was chosen to classify a high risk of AVF, for which 22 of 38 (58%) had AVF with 41.5% sensitivity and 94.5% specificity. Conclusion. In this study, the AVA score was found to be a simple preoperative method for the identification of patients at low and high risk of postoperative acute AVF. This model could be applied to individual patients and could aid in the decision-making before vertebral augmentation. Cite this article: Bone Joint J 2022;104-B(1):97–102