The principles of evidence-based medicine (EBM) are the foundation of modern medical practice. Surgeons are familiar with the commonly used statistical techniques to test hypotheses, summarize findings, and provide answers within a specified range of probability. Based on this knowledge, they are able to critically evaluate research before deciding whether or not to adopt the findings into practice. Recently, there has been an increased use of artificial intelligence (AI) to analyze information and derive findings in orthopaedic research. These techniques use a set of statistical tools that are increasingly complex and may be unfamiliar to the orthopaedic surgeon. It is unclear if this shift towards less familiar techniques is widely accepted in the orthopaedic community. This study aimed to provide an exploration of understanding and acceptance of AI use in research among orthopaedic surgeons. Semi-structured in-depth interviews were carried out on a sample of 12 orthopaedic surgeons. Inductive thematic analysis was used to identify key themes.Aims
Methods
The April 2023 Wrist & Hand Roundup360 looks at: MRI-based classification for acute scaphoid injuries: the OxSMART; Deep learning for detection of scaphoid fractures?; Ulnar shortening osteotomy in adolescents; Cost-utility analysis of thumb carpometacarpal resection arthroplasty; Arthritis of the wrist following scaphoid fracture nonunion; Extensor hood injuries in elite boxers; Risk factors for reoperation after flexor tendon repair; Nonoperative versus operative treatment for displaced finger metacarpal shaft fractures.
To examine whether natural language processing (NLP) using a clinically based large language model (LLM) could be used to predict patient selection for total hip or total knee arthroplasty (THA/TKA) from routinely available free-text radiology reports. Data pre-processing and analyses were conducted according to the Artificial intelligence to Revolutionize the patient Care pathway in Hip and knEe aRthroplastY (ARCHERY) project protocol. This included use of de-identified Scottish regional clinical data of patients referred for consideration of THA/TKA, held in a secure data environment designed for artificial intelligence (AI) inference. Only preoperative radiology reports were included. NLP algorithms were based on the freely available GatorTron model, a LLM trained on over 82 billion words of de-identified clinical text. Two inference tasks were performed: assessment after model-fine tuning (50 Epochs and three cycles of k-fold cross validation), and external validation.Aims
Methods
While internet search engines have been the primary information source for patients’ questions, artificial intelligence large language models like ChatGPT are trending towards becoming the new primary source. The purpose of this study was to determine if ChatGPT can answer patient questions about total hip (THA) and knee arthroplasty (TKA) with consistent accuracy, comprehensiveness, and easy readability. We posed the 20 most Google-searched questions about THA and TKA, plus ten additional postoperative questions, to ChatGPT. Each question was asked twice to evaluate for consistency in quality. Following each response, we responded with, “Please explain so it is easier to understand,” to evaluate ChatGPT’s ability to reduce response reading grade level, measured as Flesch-Kincaid Grade Level (FKGL). Five resident physicians rated the 120 responses on 1 to 5 accuracy and comprehensiveness scales. Additionally, they answered a “yes” or “no” question regarding acceptability. Mean scores were calculated for each question, and responses were deemed acceptable if ≥ four raters answered “yes.”Aims
Methods
Understanding spinopelvic mechanics is important for the success of total hip arthroplasty (THA). Despite significant advancements in appreciating spinopelvic balance, numerous challenges remain. It is crucial to recognize the individual variability and postoperative changes in spinopelvic parameters and their consequential impact on prosthetic component positioning to mitigate the risk of dislocation and enhance postoperative outcomes. This review describes the integration of advanced diagnostic approaches, enhanced technology, implant considerations, and surgical planning, all tailored to the unique anatomy and biomechanics of each patient. It underscores the importance of accurately predicting postoperative spinopelvic mechanics, selecting suitable imaging techniques, establishing a consistent nomenclature for spinopelvic stiffness, and considering implant-specific strategies. Furthermore, it highlights the potential of artificial intelligence to personalize care. Cite this article:
Disorders of bone integrity carry a high global disease burden, frequently requiring intervention, but there is a paucity of methods capable of noninvasive real-time assessment. Here we show that miniaturized handheld near-infrared spectroscopy (NIRS) scans, operated via a smartphone, can assess structural human bone properties in under three seconds. A hand-held NIR spectrometer was used to scan bone samples from 20 patients and predict: bone volume fraction (BV/TV); and trabecular (Tb) and cortical (Ct) thickness (Th), porosity (Po), and spacing (Sp).Aims
Methods
We aim to explore the potential technologies for monitoring and assessment of patients undergoing arthroplasty by examining selected literature focusing on the technology currently available and reflecting on possible future development and application. The reviewed literature indicates a large variety of different hardware and software, widely available and used in a limited manner, to assess patients’ performance. There are extensive opportunities to enhance and integrate the systems which are already in existence to develop patient-specific pathways for rehabilitation. Cite this article:
The modern prevalence of primary tumours causing metastatic bone disease is ill-defined in the oncological literature. Therefore, the purpose of this study is to identify the prevalence of primary tumours in the setting of metastatic bone disease, as well as reported rates of pathological fracture, postoperative complications, 90-day mortality, and 360-day mortality for each primary tumour subtype. The Premier Healthcare Database was queried to identify all patients who were diagnosed with metastatic bone disease from January 2015 to December 2020. The prevalence of all primary tumour subtypes was tabulated. Rates of long bone pathological fracture, 90-day mortality, and 360-day mortality following surgical treatment of pathological fracture were assessed for each primary tumour subtype. Patient characteristics and postoperative outcomes were analyzed based upon whether patients had impending fractures treated prophylactically versus treated completed fractures.Aims
Methods
The aim of this study was to develop and evaluate machine-learning-based computerized adaptive tests (CATs) for the Oxford Hip Score (OHS), Oxford Knee Score (OKS), Oxford Shoulder Score (OSS), and the Oxford Elbow Score (OES) and its subscales. We developed CAT algorithms for the OHS, OKS, OSS, overall OES, and each of the OES subscales, using responses to the full-length questionnaires and a machine-learning technique called regression tree learning. The algorithms were evaluated through a series of simulation studies, in which they aimed to predict respondents’ full-length questionnaire scores from only a selection of their item responses. In each case, the total number of items used by the CAT algorithm was recorded and CAT scores were compared to full-length questionnaire scores by mean, SD, score distribution plots, Pearson’s correlation coefficient, intraclass correlation (ICC), and the Bland-Altman method. Differences between CAT scores and full-length questionnaire scores were contextualized through comparison to the instruments’ minimal clinically important difference (MCID).Aims
Methods
This study aimed to explore the biological and clinical importance of dysregulated key genes in osteoarthritis (OA) patients at the cartilage level to find potential biomarkers and targets for diagnosing and treating OA. Six sets of gene expression profiles were obtained from the Gene Expression Omnibus database. Differential expression analysis, weighted gene coexpression network analysis (WGCNA), and multiple machine-learning algorithms were used to screen crucial genes in osteoarthritic cartilage, and genome enrichment and functional annotation analyses were used to decipher the related categories of gene function. Single-sample gene set enrichment analysis was performed to analyze immune cell infiltration. Correlation analysis was used to explore the relationship among the hub genes and immune cells, as well as markers related to articular cartilage degradation and bone mineralization.Aims
Methods
The aim of this study was to estimate the 90-day periprosthetic joint infection (PJI) rates following total knee arthroplasty (TKA) and total hip arthroplasty (THA) for osteoarthritis (OA). This was a data linkage study using the New South Wales (NSW) Admitted Patient Data Collection (APDC) and the Australian Orthopaedic Association National Joint Replacement Registry (AOANJRR), which collect data from all public and private hospitals in NSW, Australia. Patients who underwent a TKA or THA for OA between 1 January 2002 and 31 December 2017 were included. The main outcome measures were 90-day incidence rates of hospital readmission for: revision arthroplasty for PJI as recorded in the AOANJRR; conservative definition of PJI, defined by T84.5, the PJI diagnosis code in the APDC; and extended definition of PJI, defined by the presence of either T84.5, or combinations of diagnosis and procedure code groups derived from recursive binary partitioning in the APDC.Aims
Methods
To map literature on prognostic factors related to outcomes of revision total knee arthroplasty (rTKA), to identify extensively studied factors and to guide future research into what domains need further exploration. We performed a systematic literature search in MEDLINE, Embase, and Web of Science. The search string included multiple synonyms of the following keywords: "revision TKA", "outcome" and "prognostic factor". We searched for studies assessing the association between at least one prognostic factor and at least one outcome measure after rTKA surgery. Data on sample size, study design, prognostic factors, outcomes, and the direction of the association was extracted and included in an evidence map.Aims
Methods
Artificial intelligence and machine-learning analytics have gained extensive popularity in recent years due to their clinically relevant applications. A wide range of proof-of-concept studies have demonstrated the ability of these analyses to personalize risk prediction, detect implant specifics from imaging, and monitor and assess patient movement and recovery. Though these applications are exciting and could potentially influence practice, it is imperative to understand when these analyses are indicated and where the data are derived from, prior to investing resources and confidence into the results and conclusions. In this article, we review the current benefits and potential limitations of machine-learning for the orthopaedic surgeon with a specific emphasis on data quality.
This study used an artificial neural network (ANN) model to determine the most important pre- and perioperative variables to predict same-day discharge in patients undergoing total knee arthroplasty (TKA). Data for this study were collected from the National Surgery Quality Improvement Program (NSQIP) database from the year 2018. Patients who received a primary, elective, unilateral TKA with a diagnosis of primary osteoarthritis were included. Demographic, preoperative, and intraoperative variables were analyzed. The ANN model was compared to a logistic regression model, which is a conventional machine-learning algorithm. Variables collected from 28,742 patients were analyzed based on their contribution to hospital length of stay.Aims
Methods
The number of convolutional neural networks (CNN) available for fracture detection and classification is rapidly increasing. External validation of a CNN on a temporally separate (separated by time) or geographically separate (separated by location) dataset is crucial to assess generalizability of the CNN before application to clinical practice in other institutions. We aimed to answer the following questions: are current CNNs for fracture recognition externally valid?; which methods are applied for external validation (EV)?; and, what are reported performances of the EV sets compared to the internal validation (IV) sets of these CNNs? The PubMed and Embase databases were systematically searched from January 2010 to October 2020 according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement. The type of EV, characteristics of the external dataset, and diagnostic performance characteristics on the IV and EV datasets were collected and compared. Quality assessment was conducted using a seven-item checklist based on a modified Methodologic Index for NOn-Randomized Studies instrument (MINORS).Aims
Methods
Natural Language Processing (NLP) offers an automated method to extract data from unstructured free text fields for arthroplasty registry participation. Our objective was to investigate how accurately NLP can be used to extract structured clinical data from unstructured clinical notes when compared with manual data extraction. A group of 1,000 randomly selected clinical and hospital notes from eight different surgeons were collected for patients undergoing primary arthroplasty between 2012 and 2018. In all, 19 preoperative, 17 operative, and two postoperative variables of interest were manually extracted from these notes. A NLP algorithm was created to automatically extract these variables from a training sample of these notes, and the algorithm was tested on a random test sample of notes. Performance of the NLP algorithm was measured in Statistical Analysis System (SAS) by calculating the accuracy of the variables collected, the ability of the algorithm to collect the correct information when it was indeed in the note (sensitivity), and the ability of the algorithm to not collect a certain data element when it was not in the note (specificity).Aims
Methods
The aim of this study was to describe a quantitative 3D CT method to measure rotator cuff muscle volume, atrophy, and balance in healthy controls and in three pathological shoulder cohorts. In all, 102 CT scans were included in the analysis: 46 healthy, 21 cuff tear arthropathy (CTA), 18 irreparable rotator cuff tear (IRCT), and 17 primary osteoarthritis (OA). The four rotator cuff muscles were manually segmented and their volume, including intramuscular fat, was calculated. The normalized volume (NV) of each muscle was calculated by dividing muscle volume to the patient’s scapular bone volume. Muscle volume and percentage of muscle atrophy were compared between muscles and between cohorts.Aims
Methods