Advertisement for orthosearch.org.uk
Results 1 - 14 of 14
Results per page:
Bone & Joint Open
Vol. 3, Issue 10 | Pages 786 - 794
12 Oct 2022
Harrison CJ Plummer OR Dawson J Jenkinson C Hunt A Rodrigues JN

Aims. The aim of this study was to develop and evaluate machine-learning-based computerized adaptive tests (CATs) for the Oxford Hip Score (OHS), Oxford Knee Score (OKS), Oxford Shoulder Score (OSS), and the Oxford Elbow Score (OES) and its subscales. Methods. We developed CAT algorithms for the OHS, OKS, OSS, overall OES, and each of the OES subscales, using responses to the full-length questionnaires and a machine-learning technique called regression tree learning. The algorithms were evaluated through a series of simulation studies, in which they aimed to predict respondents’ full-length questionnaire scores from only a selection of their item responses. In each case, the total number of items used by the CAT algorithm was recorded and CAT scores were compared to full-length questionnaire scores by mean, SD, score distribution plots, Pearson’s correlation coefficient, intraclass correlation (ICC), and the Bland-Altman method. Differences between CAT scores and full-length questionnaire scores were contextualized through comparison to the instruments’ minimal clinically important difference (MCID). Results. The CAT algorithms accurately estimated 12-item questionnaire scores from between four and nine items. Scores followed a very similar distribution between CAT and full-length assessments, with the mean score difference ranging from 0.03 to 0.26 out of 48 points. Pearson’s correlation coefficient and ICC were 0.98 for each 12-item scale and 0.95 or higher for the OES subscales. In over 95% of cases, a patient’s CAT score was within five points of the full-length questionnaire score for each 12-item questionnaire. Conclusion. Oxford Hip Score, Oxford Knee Score, Oxford Shoulder Score, and Oxford Elbow Score (including separate subscale scores) CATs all markedly reduce the burden of items to be completed without sacrificing score accuracy. Cite this article: Bone Jt Open 2022;3(10):786–794


Orthopaedic Proceedings
Vol. 104-B, Issue SUPP_13 | Pages 60 - 60
1 Dec 2022
Martin RK Wastvedt S Pareek A Persson A Visnes H Fenstad AM Moatshe G Wolfson J Lind M Engebretsen L
Full Access

External validation of machine learning predictive models is achieved through evaluation of model performance on different groups of patients than were used for algorithm development. This important step is uncommonly performed, inhibiting clinical translation of newly developed models. Recently, machine learning was used to develop a tool that can quantify revision risk for a patient undergoing primary anterior cruciate ligament (ACL) reconstruction (https://swastvedt.shinyapps.io/calculator_rev/). The source of data included nearly 25,000 patients with primary ACL reconstruction recorded in the Norwegian Knee Ligament Register (NKLR). The result was a well-calibrated tool capable of predicting revision risk one, two, and five years after primary ACL reconstruction with moderate accuracy. The purpose of this study was to determine the external validity of the NKLR model by assessing algorithm performance when applied to patients from the Danish Knee Ligament Registry (DKLR).

The primary outcome measure of the NKLR model was probability of revision ACL reconstruction within 1, 2, and/or 5 years. For the index study, 24 total predictor variables in the NKLR were included and the models eliminated variables which did not significantly improve prediction ability - without sacrificing accuracy. The result was a well calibrated algorithm developed using the Cox Lasso model that only required five variables (out of the original 24) for outcome prediction. For this external validation study, all DKLR patients with complete data for the five variables required for NKLR prediction were included. The five variables were: graft choice, femur fixation device, Knee Injury and Osteoarthritis Outcome Score (KOOS) Quality of Life subscale score at surgery, years from injury to surgery, and age at surgery. Predicted revision probabilities were calculated for all DKLR patients. The model performance was assessed using the same metrics as the NKLR study: concordance and calibration.

In total, 10,922 DKLR patients were included for analysis. Average follow-up time or time-to-revision was 8.4 (±4.3) years and overall revision rate was 6.9%. Surgical technique trends (i.e., graft choice and fixation devices) and injury characteristics (i.e., concomitant meniscus and cartilage pathology) were dissimilar between registries. The model produced similar concordance when applied to the DKLR population compared to the original NKLR test data (DKLR: 0.68; NKLR: 0.68-0.69). Calibration was poorer for the DKLR population at one and five years post primary surgery but similar to the NKLR at two years.

The NKLR machine learning algorithm demonstrated similar performance when applied to patients from the DKLR, suggesting that it is valid for application outside of the initial patient population. This represents the first machine learning model for predicting revision ACL reconstruction that has been externally validated. Clinicians can use this in-clinic calculator to estimate revision risk at a patient specific level when discussing outcome expectations pre-operatively. While encouraging, it should be noted that the performance of the model on patients undergoing ACL reconstruction outside of Scandinavia remains unknown.


Orthopaedic Proceedings
Vol. 103-B, Issue SUPP_1 | Pages 11 - 11
1 Feb 2021
Bartolo M Accardi M Dini D Amis A
Full Access

Objectives

Articular cartilage damage is a primary outcome of pre-clinical and clinical studies evaluating meniscal and cartilage repair or replacement techniques. Recent studies have quantitatively characterized India Ink stained cartilage damage through light reflectance and the application of local or global thresholds. We develop a method for the quantitative characterisation of inked cartilage damage with improved generalisation capability, and compare its performance to the threshold-based baseline approach against gold standard labels.

Methods

The Trainable WEKA Segmentation (TWS) tool (Arganda-Carreras et al., 2017) available in Fiji (Rueden et al., 2017) was used to train two separate Random Forest classifiers to automatically segment cartilage damage on ink stained cadaveric ovine stifle joints. Gold standard labels were manually annotated for the training, validation and test datasets for each of the femoral and tibial classifiers. Each dataset included a sample of medial and lateral femoral condyles and tibial plateaus from various stifle joints, selected to ensure no overlap across datasets according to ovine identifier. Training was performed on the training data with the TWS tool using edge, texture and noise reduction filters selected for their suitability and performance. The two trained classifiers were then applied to the validation data to output damage probability maps, on which a threshold value was calibrated. Model predictions on the unseen test set were evaluated against the gold standard labels using the Dice Similarity Coefficient (DSC) – an overlap-based metric, and compared with results for the baseline global threshold approach applied in Fiji as shown in Figures 1 and 2.


Orthopaedic Proceedings
Vol. 104-B, Issue SUPP_12 | Pages 90 - 90
1 Dec 2022
Abbas A Toor J Du JT Versteeg A Yee N Finkelstein J Abouali J Nousiainen M Kreder H Hall J Whyne C Larouche J
Full Access

Excessive resident duty hours (RDH) are a recognized issue with implications for physician well-being and patient safety. A major component of the RDH concern is on-call duty. While considerable work has been done to reduce resident call workload, there is a paucity of research in optimizing resident call scheduling. Call coverage is scheduled manually rather than demand-based, which generally leads to over-scheduling to prevent a service gap. Machine learning (ML) has been widely applied in other industries to prevent such issues of a supply-demand mismatch. However, the healthcare field has been slow to adopt these innovations. As such, the aim of this study was to use ML models to 1) predict demand on orthopaedic surgery residents at a level I trauma centre and 2) identify variables key to demand prediction.

Daily surgical handover emails over an eight year (2012-2019) period at a level I trauma centre were collected. The following data was used to calculate demand: spine call coverage, date, and number of operating rooms (ORs), traumas, admissions and consults completed. Various ML models (linear, tree-based and neural networks) were trained to predict the workload, with their results compared to the current scheduling approach. Quality of models was determined by using the area under the receiver operator curve (AUC) and accuracy of the predictions. The top ten most important variables were extracted from the most successful model.

During training, the model with the highest AUC and accuracy was the multivariate adaptive regression splines (MARS) model, with an AUC of 0.78±0.03 and accuracy of 71.7%±3.1%. During testing, the model with the highest AUC and accuracy was the neural network model, with an AUC of 0.81 and accuracy of 73.7%. All models were better than the current approach, which had an AUC of 0.50 and accuracy of 50.1%. Key variables used by the neural network model were (descending order): spine call duty, year, weekday/weekend, month, and day of the week.

This was the first study attempting to use ML to predict the service demand on orthopaedic surgery residents at a major level I trauma centre. Multiple ML models were shown to be more appropriate and accurate at predicting the demand on surgical residents as compared to the current scheduling approach. Future work should look to incorporate predictive models with optimization strategies to match scheduling with demand in order to improve resident well being and patient care.


Orthopaedic Proceedings
Vol. 104-B, Issue SUPP_12 | Pages 33 - 33
1 Dec 2022
Abbas A Lex J Toor J Mosseri J Khalil E Ravi B Whyne C
Full Access

Total knee and hip arthroplasty (TKA and THA) are two of the highest volume and resource intensive surgical procedures. Key drivers of the cost of surgical care are duration of surgery (DOS) and postoperative inpatient length of stay (LOS). The ability to predict TKA and THA DOS and LOS has substantial implications for hospital finances, scheduling and resource allocation. The goal of this study was to predict DOS and LOS for elective unilateral TKAs and THAs using machine learning models (MLMs) constructed on preoperative patient factors using a large North American database.

The American College of Surgeons (ACS) National Surgical and Quality Improvement (NSQIP) database was queried for elective unilateral TKA and THA procedures from 2014-2019. The dataset was split into training, validation and testing based on year. Multiple conventional and deep MLMs such as linear, tree-based and multilayer perceptrons (MLPs) were constructed. The models with best performance on the validation set were evaluated on the testing set. Models were evaluated according to 1) mean squared error (MSE), 2) buffer accuracy (the number of times the predicted target was within a predesignated buffer of the actual target), and 3) classification accuracy (the number of times the correct class was predicted by the models). To ensure useful predictions, the results of the models were compared to a mean regressor.

A total of 499,432 patients (TKA 302,490; THA 196,942) were included. The MLP models had the best MSEs and accuracy across both TKA and THA patients. During testing, the TKA MSEs for DOS and LOS were 0.893 and 0.688 while the THA MSEs for DOS and LOS were 0.895 and 0.691. The TKA DOS 30-minute buffer accuracy and ≤120 min, >120 min classification accuracy were 78.8% and 88.3%, while the TKA LOS 1-day buffer accuracy and ≤2 days, >2 days classification accuracy were 75.2% and 76.1%. The THA DOS 30-minute buffer accuracy and ≤120 min, >120 min classification accuracy were 81.6% and 91.4%, while the THA LOS 1-day buffer accuracy and ≤2 days, >2 days classification accuracy were 78.3% and 80.4%. All models across both TKA and THA patients were more accurate than the mean regressors for both DOS and LOS predictions across both buffer and classification accuracies.

Conventional and deep MLMs have been effectively implemented to predict the DOS and LOS of elective unilateral TKA and THA patients based on preoperative patient factors using a large North American database with a high level of accuracy. Future work should include using operational factors to further refine these models and improve predictive accuracy. Results of this work will allow institutions to optimize their resource allocation, reduce costs and improve surgical scheduling.

Acknowledgements:

The American College of Surgeons National Surgical Quality Improvement Program and the hospitals participating in the ACS NSQIP are the source of the data used herein; they have not verified and are not responsible for the statistical validity of the data analysis or the conclusions derived by the authors.


Orthopaedic Proceedings
Vol. 102-B, Issue SUPP_1 | Pages 76 - 76
1 Feb 2020
Roche C Simovitch R Flurin P Wright T Zuckerman J Routman H
Full Access

Introduction

Machine learning is a relatively novel method to orthopaedics which can be used to evaluate complex associations and patterns in outcomes and healthcare data. The purpose of this study is to utilize 3 different supervised machine learning algorithms to evaluate outcomes from a multi-center international database of a single shoulder prosthesis to evaluate the accuracy of each model to predict post-operative outcomes of both aTSA and rTSA.

Methods

Data from a multi-center international database consisting of 6485 patients who received primary total shoulder arthroplasty using a single shoulder prosthesis (Equinoxe, Exactech, Inc) were analyzed from 19,796 patient visits in this study. Specifically, demographic, comorbidity, implant type and implant size, surgical technique, pre-operative PROMs and ROM measures, post-operative PROMs and ROM measures, pre-operative and post-operative radiographic data, and also adverse event and complication data were obtained for 2367 primary aTSA patients from 8042 visits at an average follow-up of 22 months and 4118 primary rTSA from 11,754 visits at an average follow-up of 16 months were analyzed to create a predictive model using 3 different supervised machine learning techniques: 1) linear regression, 2) random forest, and 3) XGBoost. Each of these 3 different machine learning techniques evaluated the pre-operative parameters and created a predictive model which targeted the post-operative composite score, which was a 100 point score consisting of 50% post-operative composite outcome score (calculated from 33.3% ASES + 33.3% UCLA + 33.3% Constant) and 50% post-operative composite ROM score (calculated from S curves weighted by 70% active forward flexion + 15% internal rotation score + 15% active external rotation). 3 additional predictive models were created to control for the time required for patient improvement after surgery, to do this, each primary aTSA and primary rTSA cohort was subdivided to only include patient data follow-up visits >20 months after surgery, this yielded 1317 primary aTSA patients from 2962 visits at an average follow-up of 50 months and 1593 primary rTSA from 3144 visits at an average follow-up of 42 months. Each of these 6 predictive models were trained using a random selection of 80% of each cohort, then each model predicted the outcomes of the remaining 20% of the data based upon the demographic, comorbidity, implant type and implant size, surgical technique, pre-operative PROMs and ROM measures inputs of each 20% cohort. The error of all 6 predictive models was calculated from the root mean square error (RMSE) between the actual and predicted post-op composite score. The accuracy of each model was determined by subtracting the percent difference of each RMSE value from the average composite score associated with each cohort.


Orthopaedic Proceedings
Vol. 99-B, Issue SUPP_20 | Pages 49 - 49
1 Dec 2017
Zakeri V Fabri F Karasawa M Hodgson AJ
Full Access

Bone drilling is conducted in many surgical disciplines such as orthopedics, maxillofacial, and spine surgery. Most of these procedures involve drilling of different bone materials including hard (cortical) and soft (cancellous) tissues. Identifying these tissues is essential for surgeons to minimise damage to underlying nerves and vessels.

The sound signal generated during drilling is a valuable source of information that could potentially be employed. Such sounds can be captured readily and easily through non-contact sensors. Therefore, our goal in this preliminary study is to investigate whether drilling sounds can enable us to distinguish between cortical and cancellous tissues.

A bovine tibial bone was drilled, and the cortical and cancellous drilling sounds were captured. Each sound record was divided into small windows with a length of 50 ms and a 50% overlap. The window length was selected small, because our intended longer-term application is to provide the surgeon with near-real-time feedback. Short time Fourier Transform (STFT) coefficients were extracted from each window and were averaged accordingly to obtain p features. A support vector machine (SVM) algorithm was used for classification, and its accuracy was evaluated for different number of features (p). Two training/testing scenarios were considered, atlas (ATL) and leave- one-out (LOO).

The total accuracies for ATL and LOO were 100% and 93.8% respectively obtained for p=128. Our study on a single specimen demonstrated that it is possible to discriminate between cortical and cancellous bones based on relatively short 50 ms windows of drilling sounds.


Orthopaedic Proceedings
Vol. 103-B, Issue SUPP_9 | Pages 19 - 19
1 Jun 2021
Desai P
Full Access

Problem. The identification of unknown orthopaedic implants is a crucial step in the pre-operative planning for revision joint arthroplasty. Compatibility of implant components and instrumentation for implant removal is specific based on the manufacturer and model of the implant. The inability to identify an implant correctly can lead to increased case complexity, procedure time, procedure cost and bone loss for the patient. The number of revision joint arthroplasty cases worldwide and the number implants available on the market are growing rapidly, leading to greater difficulty in identifying unknown implants. Solution. The solution is a machine-learning based mobile platform which allows for instant identification of the manufacturer and model of any implant based only on the x-ray image. As more surgeons and implant representatives use the platform, the model should continue to improve in accuracy and number of implants recognized until the algorithm reaches its theoretical maximum of 99% accuracy. Market. Multiple organizations have created small libraries of implant images to assist surgeons with manual identification of unknown implants based on the x-ray, however no automated implant identification system exists to date. One of the most financially successful implant identification tools on the market is a textbook of hip implants which sells for a per unit cost of $200. Several free web-based resources also act as libraries for the manual identification of a limited number of arthroplasty implants. A number of academic and private organizations are working on the development of an automated system for implant identification, however none are available to the public. Product. Implant Identifier is mobile application which uses machine-learning to instantly detect the model and manufacturer of any common arthroplasty implant, based only on x-ray. The beta version offers a large library of implants for manual identification and is currently available for free download on iOS and Android. Its purpose is to further develop the model to its maximal theoretical accuracy, prior to official release. The beta version of the application currently has over 15,000 registered users worldwide and has the largest publicly available arthroplasty library available on the market. Over 200,000 implant images have been submitted by users to date. Timescales. The product was initially released in the form of a closed beta which became available to invited guests around 18 months ago. The current version is an open beta which can be downloaded and used by any individual. It was released roughly 12 months ago. The final rendition of the application will allow for free manual identification using the implant library, as well as subscription-based automated implant identification. The implementation, testing and release of this final subscription product is projected to be completed by Q3 2022. Funding. A small number of early investors have funded the initial research and development of the beta product; however, another round of investment will be beneficial in the final evolution of the product. This additional investment round will allow for completion of development of the identification algorithm, product dissemination, customer support, and lasting sustainability of the venture


Orthopaedic Proceedings
Vol. 103-B, Issue SUPP_15 | Pages 31 - 31
1 Dec 2021
Goswami K Parvizi J
Full Access

Aim. The clinical relevance of microbial DNA detected via next-generation sequencing (NGS) remains unknown. This multicenter study was conceived to: 1) identify species on NGS that may predict periprosthetic joint infection (PJI), then 2) build a predictive model for PJI in a developmental cohort, and 3) validate predictive utility of the model in a separate multi-institutional cohort. Method. Fifteen institutions prospectively collected samples from 194 revision TKA and 184 revision THA between 2017–2019. Synovial fluid, tissue and swabs were obtained intraoperatively and sent to MicrogenDx (Lubbock, TX) for NGS analysis. Reimplantations were excluded. Patients were classified per the 2018 ICM definition of PJI. DNA analysis of community similarities (ANCOM) was used to identify 17 bacterial species of 294 (W-value>50) for differentiating infected vs. noninfected cases. Logistic regression with LASSO selection and random-forest algorithms were then used to build a model for predicting PJI. ICM classification was the response variable (gold-standard) and species identified through ANCOM were predictors. Patients were randomly allocated 1:1 into training and validation sets. Using the training set, a model for PJI diagnosis was generated. The entire model-building procedure and validation was iterated 1000 times. Results. The model's assignment accuracy was 75.9%. There was high accuracy in true-negative and false-negative classification using this model, which has previously been a criticism of NGS. Specificity was 97.1%, PPV 75.0% and NPV 76.2%. On comparison of abundance between ICM-positive and ICM-negative patients, Staphylococcus aureus was the strongest contributor (F=0.99) to model predictive power. In contrast, Cutibacterium acnes was less predictive (F=0.309) and abundant across infected and noninfected revisions. Discussion. This is the first study to utilize predictive algorithms on a large multicenter dataset to transform analytic NGS data into a clinically relevant diagnostic model. Our collaborative findings suggest NGS may be an independent adjunct for PJI diagnosis, while also facilitating pathogen identification. Future work applying machine-learning will improve accuracy and utility of NGS


Bone & Joint Open
Vol. 4, Issue 4 | Pages 250 - 261
7 Apr 2023
Sharma VJ Adegoke JA Afara IO Stok K Poon E Gordon CL Wood BR Raman J

Aims

Disorders of bone integrity carry a high global disease burden, frequently requiring intervention, but there is a paucity of methods capable of noninvasive real-time assessment. Here we show that miniaturized handheld near-infrared spectroscopy (NIRS) scans, operated via a smartphone, can assess structural human bone properties in under three seconds.

Methods

A hand-held NIR spectrometer was used to scan bone samples from 20 patients and predict: bone volume fraction (BV/TV); and trabecular (Tb) and cortical (Ct) thickness (Th), porosity (Po), and spacing (Sp).


Orthopaedic Proceedings
Vol. 97-B, Issue SUPP_8 | Pages 2 - 2
1 Jun 2015
Mossadegh S He S Parker P
Full Access

Various injury severity scores exist for trauma; it is known that they do not correlate accurately to military injuries. A promising anatomical scoring system for blast pelvic and perineal injury led to the development of an improved scoring system using machine-learning techniques. An unbiased genetic algorithm selected optimal anatomical and physiological parameters from 118 military cases. A Naïve Bayesian (NB) model was built using the proposed parameters to predict the probability of survival. Ten-fold cross validation was employed to evaluate its performance. Our model significantly out-performed Injury Severity Score (ISS), Trauma ISS, New ISS and the Revised Trauma Score in virtually all areas; Positive Predictive Value 0.8941, Specificity 0.9027, Accuracy 0.9056 and Area Under Curve 0.9059. A two-sample t-test showed that the predictive performance of the proposed scoring system was significantly better than the other systems (p<0.001). With limited resources and the simplest of Bayesian methodologies we have demonstrated that the Naïve Bayesian model performed significantly better in virtually all areas assessed by current scoring systems used for trauma. This is encouraging and highlights that more can be done to improve trauma systems not only for the military, but also in civilian trauma


Bone & Joint Research
Vol. 13, Issue 8 | Pages 411 - 426
28 Aug 2024
Liu D Wang K Wang J Cao F Tao L

Aims

This study explored the shared genetic traits and molecular interactions between postmenopausal osteoporosis (POMP) and sarcopenia, both of which substantially degrade elderly health and quality of life. We hypothesized that these motor system diseases overlap in pathophysiology and regulatory mechanisms.

Methods

We analyzed microarray data from the Gene Expression Omnibus (GEO) database using weighted gene co-expression network analysis (WGCNA), machine learning, and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis to identify common genetic factors between POMP and sarcopenia. Further validation was done via differential gene expression in a new cohort. Single-cell analysis identified high expression cell subsets, with mononuclear macrophages in osteoporosis and muscle stem cells in sarcopenia, among others. A competitive endogenous RNA network suggested regulatory elements for these genes.


Orthopaedic Proceedings
Vol. 95-B, Issue SUPP_27 | Pages 20 - 20
1 Jul 2013
Kampanakis S Jain N Kemp S Hayward P
Full Access

In professional football a key factor regarding injury is the time to return to play. Accurate prediction of this would aid planning by the club in the event of injury. It would also aid the club medical staff. Gaussian processes may be used for machine learning tasks such as regression and classification. This study determines whether machine-learning methods may be used for predicting how many days a player is unavailable to play. A database of injuries at one English Premier League Professional Football Club was reviewed for a number of factors for each injury. Twenty-five variables were recorded for each injury, including time to return to play. This was determined to be the response variable. We used a Gaussian process model with a Laplacian kernel to determine whether the return to play could be predicted from the other variables. The root mean square error was 13.186 days (S.D.: 8.073), the mean absolute error was 8.192 days (S.D.:13.106) and the mean relative error 171.97% (S.D.:75.56%). A linear trend was observed and the model demonstrated high accuracy with greater errors being observed for cases where the value of the response variable was higher, i.e. in those cases where the time to return to play was lengthy. This is the first step in attempting to design a computer-based model that will accurately predict the time for a professional footballer to return to play. The model is extremely accurate for most cases, with errors increasing as the severity of the case increases too


Bone & Joint 360
Vol. 6, Issue 4 | Pages 38 - 39
1 Aug 2017
Khan T