Aims. Systematic reviews of randomized controlled trials (RCTs) are the highest level of evidence used to inform patient care. However, it has been suggested that the quality of randomization in RCTs in orthopaedic surgery may be low. This study aims to describe the quality of randomization in trials included in systematic reviews in orthopaedic surgery.
Methods. Systematic reviews of RCTs testing orthopaedic procedures published in 2022 were extracted from PubMed, Embase, and the Cochrane Library. A random sample of 100 systematic reviews was selected, and all included RCTs were retrieved. To be eligible for inclusion, systematic reviews must have tested an orthopaedic procedure as the primary intervention, included at least one study identified as an RCT, been published in 2022 in English, and included human clinical trials. The Cochrane Risk of Bias-2 Tool was used to assess random sequence generation as ‘adequate’, ‘inadequate’, or ‘no information’; we then calculated the proportion of trials in each category. We also collected data to test the association between these categories and characteristics of the RCTs and systematic reviews.
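The proportion of trials in each random sequence generation category can be tallied as in the minimal Python sketch below. The function name `category_proportions` and the example ratings are hypothetical and are not data from the study.

```python
from collections import Counter
from typing import Dict, Sequence


def category_proportions(ratings: Sequence[str]) -> Dict[str, float]:
    """Return the share of trials falling in each rating category."""
    if not ratings:
        raise ValueError("at least one rating is required")
    counts = Counter(ratings)
    total = len(ratings)
    return {category: n / total for category, n in counts.items()}


# Hypothetical ratings for ten trials (illustration only, not study data).
example_ratings = (
    ["adequate"] * 5 + ["inadequate"] * 2 + ["no information"] * 3
)
proportions = category_proportions(example_ratings)
```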
Aims. This systematic review aims to identify 3D predictors derived from biplanar reconstruction, and to describe current methods for improving curve prediction in patients with mild adolescent idiopathic scoliosis.
Methods. A comprehensive search was conducted by three independent investigators on MEDLINE, PubMed, Web of Science, and Cochrane Library. Search terms included “adolescent idiopathic scoliosis”, “3D”, and “progression”. The inclusion and exclusion criteria were carefully defined to include clinical studies. Risk of bias was assessed with the Quality in Prognostic Studies tool (QUIPS) and Appraisal tool for Cross-Sectional Studies (AXIS), and level of evidence for each predictor was rated with the Grading of Recommendations, Assessment, Development, and Evaluations (GRADE) approach. In all, 915 publications were identified, with 377 articles subjected to full-text screening; overall, 31 articles were included.
Aims. While internet search engines have been the primary information source for patients’ questions, artificial intelligence large language models like ChatGPT are trending towards becoming the new primary source. The purpose of this study was to determine if ChatGPT can answer patient questions about total hip (THA) and knee arthroplasty (TKA) with consistent accuracy, comprehensiveness, and easy readability.
Methods. We posed the 20 most Google-searched questions about THA and TKA, plus ten additional postoperative questions, to ChatGPT. Each question was asked twice to evaluate for consistency in quality. Following each response, we responded with, “Please explain so it is easier to understand,” to evaluate ChatGPT’s ability to reduce response reading grade level, measured as Flesch-Kincaid Grade Level (FKGL). Five resident physicians rated the 120 responses on 1 to 5 accuracy and comprehensiveness scales. Additionally, they answered a “yes” or “no” question regarding acceptability. Mean scores were calculated for each question, and responses were deemed acceptable if ≥ four raters answered “yes.”
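The two scoring rules in this abstract can be sketched in Python. The FKGL formula is the standard one, 0.39 × (words per sentence) + 11.8 × (syllables per word) − 15.59. The syllable counter is a rough vowel-group heuristic assumed here for illustration; the abstract does not say which tool the authors used. All function names are hypothetical.

```python
import re
from typing import Sequence


def count_syllables(word: str) -> int:
    """Approximate syllables as runs of vowels (a heuristic assumption)."""
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))


def fkgl(text: str) -> float:
    """Flesch-Kincaid Grade Level of a text, using the standard formula."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one word")
    syllables = sum(count_syllables(w) for w in words)
    return (0.39 * len(words) / len(sentences)
            + 11.8 * syllables / len(words)
            - 15.59)


def is_acceptable(votes: Sequence[bool], threshold: int = 4) -> bool:
    """A response is acceptable if at least `threshold` raters said yes."""
    return sum(votes) >= threshold


grade = fkgl("The cat sat. The dog ran.")
accepted = is_acceptable([True, True, True, True, False])
```

Very simple text can yield a negative grade level, which is a known property of the formula rather than an error in the sketch.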
Aims. Machine-learning (ML) prediction models in orthopaedic trauma hold great promise in assisting clinicians in various tasks, such as personalized risk stratification. However, an overview of current applications and a critical appraisal against peer-reviewed guidelines are lacking. The objectives of this study are to 1) provide an overview of current ML prediction models in orthopaedic trauma; 2) evaluate the completeness of reporting following the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) statement; and 3) assess the risk of bias following the Prediction model Risk Of Bias Assessment Tool (PROBAST).
Methods. A systematic search screening 3,252 studies identified 45 ML-based prediction models in orthopaedic trauma up to January 2023. Transparent reporting was assessed with the TRIPOD statement, and risk of bias with the PROBAST tool.
Aims. The principles of evidence-based medicine (EBM) are the foundation of modern medical practice. Surgeons are familiar with the commonly used statistical techniques to test hypotheses, summarize findings, and provide answers within a specified range of probability. Based on this knowledge, they are able to critically evaluate research before deciding whether or not to adopt the findings into practice. Recently, there has been an increased use of artificial intelligence (AI) to analyze information and derive findings in orthopaedic research. These techniques use a set of statistical tools that are increasingly complex and may be unfamiliar to the orthopaedic surgeon. It is unclear if this shift towards less familiar techniques is widely accepted in the orthopaedic community. This study aimed to explore the understanding and acceptance of AI use in research among orthopaedic surgeons.
Methods. Semi-structured in-depth interviews were carried out with a sample of 12 orthopaedic surgeons. Inductive thematic analysis was used to identify key themes.
Aims. The modern prevalence of primary tumours causing metastatic bone disease is ill-defined in the oncological literature. Therefore, the purpose of this study is to identify the prevalence of primary tumours in the setting of metastatic bone disease, as well as reported rates of pathological fracture, postoperative complications, 90-day mortality, and 360-day mortality for each primary tumour subtype.
Methods. The Premier Healthcare Database was queried to identify all patients who were diagnosed with metastatic bone disease from January 2015 to December 2020. The prevalence of all primary tumour subtypes was tabulated. Rates of long bone pathological fracture, 90-day mortality, and 360-day mortality following surgical treatment of pathological fracture were assessed for each primary tumour subtype. Patient characteristics and postoperative outcomes were analyzed according to whether patients had impending fractures that were treated prophylactically or completed fractures that were treated.
Aims. To identify variables independently associated with same-day discharge (SDD) of patients following revision total knee arthroplasty (rTKA) and to develop
Aims. To map literature on prognostic factors related to outcomes of revision total knee arthroplasty (rTKA), to identify extensively studied factors and to guide future research into what domains need further exploration.
Methods. We performed a systematic literature search in MEDLINE, Embase, and Web of Science. The search string included multiple synonyms of the following keywords: "revision TKA", "outcome" and "prognostic factor". We searched for studies assessing the association between at least one prognostic factor and at least one outcome measure after rTKA surgery. Data on sample size, study design, prognostic factors, outcomes, and the direction of the association were extracted and included in an evidence map.
Aims. Disorders of bone integrity carry a high global disease burden, frequently requiring intervention, but there is a paucity of methods capable of noninvasive real-time assessment. Here we show that miniaturized handheld near-infrared spectroscopy (NIRS) scans, operated via a smartphone, can assess structural human bone properties in under three seconds.
Methods. A handheld NIR spectrometer was used to scan bone samples from 20 patients and predict: bone volume fraction (BV/TV); and trabecular (Tb) and cortical (Ct) thickness (Th), porosity (Po), and spacing (Sp).
Aims. To develop prediction models using machine-learning (ML) algorithms for 90-day and one-year mortality prediction in femoral neck fracture (FNF) patients aged 50 years or older based on the Hip fracture Evaluation with Alternatives of Total Hip arthroplasty versus Hemiarthroplasty (HEALTH) and Fixation using Alternative Implants for the Treatment of Hip fractures (FAITH) trials.
Methods. This study included 2,388 patients from the HEALTH and FAITH trials, with 90-day and one-year mortality proportions of 3.0% (71/2,388) and 6.4% (153/2,388), respectively. The mean age was 75.9 years (SD 10.8) and 65.9% of patients (1,574/2,388) were female. The algorithms included patient and injury characteristics. Six algorithms were developed, internally validated and evaluated across discrimination (c-statistic; discriminative ability between those with risk of mortality and those without), calibration (observed outcome compared to the predicted probability), and the Brier score (composite of discrimination and calibration).
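Two of the evaluation metrics named here, the c-statistic and the Brier score, can be computed from binary outcomes and predicted probabilities as in the minimal Python sketch below. The function names and the four-patient example are hypothetical and are not drawn from the HEALTH or FAITH data. The c-statistic is calculated as the proportion of concordant event/non-event pairs, with ties counted as one half.

```python
from typing import Sequence


def brier_score(y_true: Sequence[int], y_prob: Sequence[float]) -> float:
    """Mean squared difference between predicted probability and outcome."""
    if not y_true or len(y_true) != len(y_prob):
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def c_statistic(y_true: Sequence[int], y_prob: Sequence[float]) -> float:
    """Probability that a random event case is ranked above a non-event case."""
    events = [p for y, p in zip(y_true, y_prob) if y == 1]
    non_events = [p for y, p in zip(y_true, y_prob) if y == 0]
    if not events or not non_events:
        raise ValueError("both outcome classes must be present")
    concordant = 0.0
    for p_event in events:
        for p_non in non_events:
            if p_event > p_non:
                concordant += 1.0
            elif p_event == p_non:
                concordant += 0.5  # ties count as half-concordant
    return concordant / (len(events) * len(non_events))


# Hypothetical outcomes (1 = died) and predicted risks, for illustration only.
outcomes = [0, 0, 1, 1]
predicted = [0.1, 0.4, 0.35, 0.8]
brier = brier_score(outcomes, predicted)
c_stat = c_statistic(outcomes, predicted)
```

A lower Brier score and a c-statistic closer to 1 both indicate better performance; a c-statistic of 0.5 corresponds to chance-level discrimination.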
Aims. The aim of this study was to develop and evaluate machine-learning-based computerized adaptive tests (CATs) for the Oxford Hip Score (OHS), Oxford Knee Score (OKS), Oxford Shoulder Score (OSS), and the Oxford Elbow Score (OES) and its subscales.
Methods. We developed CAT algorithms for the OHS, OKS, OSS, overall OES, and each of the OES subscales, using responses to the full-length questionnaires and a machine-learning technique called regression tree learning. The algorithms were evaluated through a series of simulation studies, in which they aimed to predict respondents’ full-length questionnaire scores from only a selection of their item responses. In each case, the total number of items used by the CAT algorithm was recorded and CAT scores were compared to full-length questionnaire scores by mean, SD, score distribution plots, Pearson’s correlation coefficient, intraclass correlation (ICC), and the Bland-Altman method. Differences between CAT scores and full-length questionnaire scores were contextualized through comparison to the instruments’ minimal clinically important difference (MCID).
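The Bland-Altman comparison mentioned here reduces to the mean difference between paired scores (the bias) and its 95% limits of agreement, bias ± 1.96 × SD of the differences. The Python sketch below uses hypothetical paired scores and a hypothetical function name; it is not the authors' implementation.

```python
from statistics import mean, stdev
from typing import Sequence, Tuple


def bland_altman(
    scores_a: Sequence[float], scores_b: Sequence[float]
) -> Tuple[float, float, float]:
    """Return (bias, lower limit, upper limit) for paired measurements."""
    if len(scores_a) != len(scores_b) or len(scores_a) < 2:
        raise ValueError("need at least two paired measurements")
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    bias = mean(diffs)
    sd = stdev(diffs)  # sample standard deviation of the differences
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Hypothetical CAT and full-length scores for five respondents.
cat_scores = [40, 35, 28, 44, 19]
full_scores = [41, 33, 28, 45, 20]
bias, lower, upper = bland_altman(cat_scores, full_scores)
```

Limits of agreement that fall within the instrument's MCID would suggest that the two scoring methods differ by less than a clinically meaningful amount.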
Artificial intelligence and machine-learning analytics have gained extensive popularity in recent years due to their clinically relevant applications. A wide range of proof-of-concept studies have demonstrated the ability of these analyses to personalize risk prediction, detect implant specifics from imaging, and monitor and assess patient movement and recovery. Though these applications are exciting and could potentially influence practice, it is imperative to understand when these analyses are indicated and where the data are derived from, prior to investing resources and confidence into the results and conclusions. In this article, we review the current benefits and potential limitations of machine-learning for the orthopaedic surgeon with a specific emphasis on data quality.
Aims. The number of convolutional neural networks (CNN) available for fracture detection and classification is rapidly increasing. External validation of a CNN on a temporally separate (separated by time) or geographically separate (separated by location) dataset is crucial to assess generalizability of the CNN before application to clinical practice in other institutions. We aimed to answer the following questions: are current CNNs for fracture recognition externally valid?; which methods are applied for external validation (EV)?; and, what are reported performances of the EV sets compared to the internal validation (IV) sets of these CNNs?
Methods. The PubMed and Embase databases were systematically searched from January 2010 to October 2020 according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement. The type of EV, characteristics of the external dataset, and diagnostic performance characteristics on the IV and EV datasets were collected and compared. Quality assessment was conducted using a seven-item checklist based on a modified Methodologic Index for NOn-Randomized Studies instrument (MINORS).
Aims. The aim of this study was to describe a quantitative 3D CT method to measure rotator cuff muscle volume, atrophy, and balance in healthy controls and in three pathological shoulder cohorts.
Methods. In all, 102 CT scans were included in the analysis: 46 healthy, 21 cuff tear arthropathy (CTA), 18 irreparable rotator cuff tear (IRCT), and 17 primary osteoarthritis (OA). The four rotator cuff muscles were manually segmented and their volume, including intramuscular fat, was calculated. The normalized volume (NV) of each muscle was calculated by dividing muscle volume by the patient’s scapular bone volume. Muscle volume and percentage of muscle atrophy were compared between muscles and between cohorts.
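The normalized volume described here is a simple ratio of muscle volume to scapular bone volume. A minimal Python sketch follows; the function name, the units, and the example values are assumptions for illustration and are not measurements from the study.

```python
def normalized_volume(muscle_volume_cm3: float, scapula_volume_cm3: float) -> float:
    """Muscle volume divided by the same patient's scapular bone volume."""
    if scapula_volume_cm3 <= 0:
        raise ValueError("scapular bone volume must be positive")
    return muscle_volume_cm3 / scapula_volume_cm3


# Hypothetical volumes in cubic centimetres (illustration only).
nv_example = normalized_volume(120.0, 80.0)
```

Because both volumes share the same unit, NV is dimensionless, which allows comparison between patients of different body size.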
Aims. The use of technology to assess balance and alignment during total knee surgery can provide an overload of numerical data to the surgeon. Meanwhile, this quantification holds the potential to clarify and guide the surgeon through the surgical decision process when selecting the appropriate bone recut or soft tissue adjustment when balancing a total knee. Therefore, this paper evaluates the potential of deploying supervised