Advertisement for orthosearch.org.uk
Results 1 - 3 of 3
Results per page:
Bone & Joint Open
Vol. 2, Issue 10 | Pages 879 - 885
20 Oct 2021
Oliveira e Carmo L van den Merkhof A Olczak J Gordon M Jutte PC Jaarsma RL IJpma FFA Doornberg JN Prijs J

Aims. The number of convolutional neural networks (CNN) available for fracture detection and classification is rapidly increasing. External validation of a CNN on a temporally separate (separated by time) or geographically separate (separated by location) dataset is crucial to assess generalizability of the CNN before application to clinical practice in other institutions. We aimed to answer the following questions: are current CNNs for fracture recognition externally valid?; which methods are applied for external validation (EV)?; and, what are reported performances of the EV sets compared to the internal validation (IV) sets of these CNNs?. Methods. The PubMed and Embase databases were systematically searched from January 2010 to October 2020 according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement. The type of EV, characteristics of the external dataset, and diagnostic performance characteristics on the IV and EV datasets were collected and compared. Quality assessment was conducted using a seven-item checklist based on a modified Methodologic Index for NOn-Randomized Studies instrument (MINORS). Results. Out of 1,349 studies, 36 reported development of a CNN for fracture detection and/or classification. Of these, only four (11%) reported a form of EV. One study used temporal EV, one conducted both temporal and geographical EV, and two used geographical EV. When comparing the CNN’s performance on the IV set versus the EV set, the following were found: AUCs of 0.967 (IV) versus 0.975 (EV), 0.976 (IV) versus 0.985 to 0.992 (EV), 0.93 to 0.96 (IV) versus 0.80 to 0.89 (EV), and F1-scores of 0.856 to 0.863 (IV) versus 0.757 to 0.840 (EV). Conclusion. The number of externally validated CNNs in orthopaedic trauma for fracture recognition is still scarce. This greatly limits the potential for transfer of these CNNs from the developing institute to another hospital to achieve similar diagnostic performance. We recommend the use of geographical EV and statements such as the Consolidated Standards of Reporting Trials–Artificial Intelligence (CONSORT-AI), the Standard Protocol Items: Recommendations for Interventional Trials–Artificial Intelligence (SPIRIT-AI) and the Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis–Machine Learning (TRIPOD-ML) to critically appraise performance of CNNs and improve methodological rigor, quality of future models, and facilitate eventual implementation in clinical practice. Cite this article: Bone Jt Open 2021;2(10):879–885


Bone & Joint Open
Vol. 3, Issue 1 | Pages 93 - 97
10 Jan 2022
Kunze KN Orr M Krebs V Bhandari M Piuzzi NS

Artificial intelligence and machine-learning analytics have gained extensive popularity in recent years due to their clinically relevant applications. A wide range of proof-of-concept studies have demonstrated the ability of these analyses to personalize risk prediction, detect implant specifics from imaging, and monitor and assess patient movement and recovery. Though these applications are exciting and could potentially influence practice, it is imperative to understand when these analyses are indicated and where the data are derived from, prior to investing resources and confidence into the results and conclusions. In this article, we review the current benefits and potential limitations of machine-learning for the orthopaedic surgeon with a specific emphasis on data quality.


Bone & Joint Open
Vol. 1, Issue 12 | Pages 737 - 742
1 Dec 2020
Mihalič R Zdovc J Brumat P Trebše R

Aims

Synovial fluid white blood cell (WBC) count and percentage of polymorphonuclear cells (%PMN) are elevated at periprosthetic joint infection (PJI). Leucocytes produce different interleukins (IL), including IL-6, so we hypothesized that synovial fluid IL-6 could be a more accurate predictor of PJI than synovial fluid WBC count and %PMN. The main aim of our study was to compare the predictive performance of all three diagnostic tests in the detection of PJI.

Methods

Patients undergoing total hip or knee revision surgery were included. In the perioperative assessment phase, synovial fluid WBC count, %PMN, and IL-6 concentration were measured. Patients were labeled as positive or negative according to the predefined cut-off values for IL-6 and WBC count with %PMN. Intraoperative samples for microbiological and histopathological analysis were obtained. PJI was defined as the presence of sinus tract, inflammation in histopathological samples, and growth of the same microorganism in a minimum of two or more samples out of at least four taken.