header advert
Results 361 - 380 of 8584
Results per page:
The Journal of Bone & Joint Surgery British Volume
Vol. 72-B, Issue 2 | Pages 225 - 230
1 Mar 1990
Staubli H Jakob R

We evaluated the accuracy of six clinical tests for posterior instability in 24 knees with acute surgically-proven posterior cruciate ligament injuries and intact anterior cruciate ligaments. We also performed stress radiography under anaesthesia. The gravity sign and the posterior drawer test in near extension and its passive reduction were diagnostic in 20 of the 24 knees, and the active reduction of posterior subluxation was diagnostic in 18. The reversed pivot shift sign helped to diagnose severe posterior and posterolateral subluxations, but the external rotation recurvatum test was negative in all 24 knees. Stress radiography in near extension revealed a highly significant increase in posterior tibial subluxation in the injured knees.


The Journal of Bone & Joint Surgery British Volume
Vol. 61-B, Issue 1 | Pages 18 - 25
1 Feb 1979
Hall D Harrison M Burwell R

This paper reports a high incidence of minor congenital anomalies in boys and girls with Perthes' disease compared with that in a control population. There is a similarity of the incidence of minor anomalies in the children with Perthes' disease to that in babies with a single major congenital defect. Multiple major defects were more numerous and more severe than in the control children. It is speculated that there may be a congenital abnormality affecting skeletal development which in some way makes the hip susceptible to Perthes' disease at a later date.





Bone & Joint Open
Vol. 3, Issue 10 | Pages 841 - 849
27 Oct 2022
Knight R Keene DJ Dutton SJ Handley R Willett K

Aims. The rationale for exacting restoration of skeletal anatomy after unstable ankle fracture is to improve outcomes by reducing complications from malunion; however, current definitions of malunion lack confirmatory clinical evidence. Methods. Radiological (absolute radiological measurements aided by computer software) and clinical (clinical interpretation of radiographs) definitions of malunion were compared within the Ankle Injury Management (AIM) trial cohort, including people aged ≥ 60 years with an unstable ankle fracture. Linear regressions were used to explore the relationship between radiological malunion (RM) at six months and changes in function at three years. Function was assessed with the Olerud-Molander Ankle Score (OMAS), with a minimal clinically important difference set as six points, as per the AIM trial. Piecewise linear models were used to investigate new radiological thresholds which better explain symptom impact on ankle function. Results. Previously described measures of RM and surgeon opinion of clinically significant malunion (CSM) were shown to be related but with important differences. CSM was more strongly related to outcome (-13.9 points on the OMAS; 95% confidence interval (CI) -21.9 to -5.4) than RM (-5.5 points; 95% CI -9.8 to -1.2). Existing malunion thresholds for talar tilt and tibiofibular clear space were shown to be slightly conservative; new thresholds which better explain function were identified (talar tilt > 2.4°; tibiofibular clear space > 6 mm). Based on this new definition the presence of RM had an impact on function, which was statistically significant, but the clinical significance was uncertain (-9.1 points; 95% CI -13.8 to -4.4). In subsequent analysis, RM of a posterior malleolar fracture was shown to have a statistically significant impact on OMAS change scores, but the clinical significance was uncertain (-11.6 points; 95% CI -21.9 to -0.6). Conclusion. These results provide clinical evidence which supports the previously accepted definitions. Further research to investigate more conservative clinical thresholds for malunion is indicated. Cite this article: Bone Jt Open 2022;3(10):841–849


The Bone & Joint Journal
Vol. 105-B, Issue 3 | Pages 247 - 253
1 Mar 2023
Pakarinen O Ponkilainen V Uimonen M Haapanen M Helenius I Kuitunen I

Aims. To analyze whether the addition of risk-based criteria to clinical examination-based selective ultrasound screening would increase the rates of early detected cases of developmental dysplasia of the hip (DDH) and decrease the rate of late detected cases. Methods. A systematic review with meta-analysis was performed. The initial search was performed in the PubMed, Scopus, and Web of Science databases in November 2021. The following search terms were used: (hip) AND (ultrasound) AND (luxation or dysplasia) AND (newborn or neonate or congenital). Results. A total of 25 studies were included. In 19 studies, newborns were selected for ultrasound based on both risk factors and clinical examination. In six studies, newborns were selected for ultrasound based on only clinical examination. We did not find evidence indicating that there are differences in the incidence of early- and late-detected DDH, or in the incidence of nonoperatively treated DDH between the risk-based and clinical examination-based groups. The pooled incidence of operatively treated DDH was slightly lower in the risk-based group (0.5 (95% confidence interval (CI) 0.3 to 0.7)) compared with the clinical examination group (0.9 per 1,000 newborns, (95% CI 0.7 to 1.0)). Conclusion. The use of risk factors in conjunction with clinical examination in the selective ultrasound screening of DDH might lead to fewer operatively treated cases of DDH. However, more studies are needed before stronger conclusions can be drawn. Cite this article: Bone Joint J 2023;105-B(3):247–253


The Bone & Joint Journal
Vol. 106-B, Issue 5 | Pages 450 - 459
1 May 2024
Clement ND Galloway S Baron J Smith K Weir DJ Deehan DJ

Aims. The aim was to assess whether robotic-assisted total knee arthroplasty (rTKA) had greater knee-specific outcomes, improved fulfilment of expectations, health-related quality of life (HRQoL), and patient satisfaction when compared with manual TKA (mTKA). Methods. A randomized controlled trial was undertaken (May 2019 to December 2021), and patients were allocated to either mTKA or rTKA. A total of 100 patients were randomized, 50 to each group, of whom 43 rTKA and 38 mTKA patients were available for review at 12 months following surgery. There were no statistically significant preoperative differences between the groups. The minimal clinically important difference in the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) pain score was defined as 7.5 points. Results. There were no clinically or statistically significant differences between the knee-specific measures (WOMAC, Oxford Knee Score (OKS), Forgotten Joint Score (FJS)) or HRQoL measures (EuroQol five-dimension questionnaire (EQ-5D) and EuroQol visual analogue scale (EQ-VAS)) at 12 months between the groups. However, the rTKA group had significantly (p = 0.029) greater improvements in the WOMAC pain component (mean difference 9.7, 95% confidence interval (CI) 1.0 to 18.4) over the postoperative period (two, six, and 12 months), which was clinically meaningful. This was not observed for function (p = 0.248) or total (p = 0.147) WOMAC scores. The rTKA group was significantly (p = 0.039) more likely to have expectation of ‘Relief of daytime pain in the joint’ when compared with the mTKA group. There were no other significant differences in expectations met between the groups. There was no significant difference in patient satisfaction with their knee (p = 0.464), return to work (p = 0.464), activities (p = 0.293), or pain (p = 0.701). Conclusion. Patients undergoing rTKA had a clinically meaningful greater improvement in their knee pain over the first 12 months, and were more likely to have fulfilment of their expectation of daytime pain relief compared with patients undergoing mTKA. However, rTKA was not associated with a clinically significant greater knee-specific function or HRQoL, according to current definitions. Cite this article: Bone Joint J 2024;106-B(5):450–459



Aims. The primary objective of this study was to compare the five-year tibial component migration and wear between highly crosslinked polyethylene (HXLPE) inserts and conventional polyethylene (PE) inserts of the uncemented Triathlon fixed insert cruciate-retaining total knee arthroplasty (TKA). Secondary objectives included clinical outcomes and patient-reported outcome measures (PROMs). Methods. A double-blinded, randomized study was conducted including 96 TKAs. Tibial component migration and insert wear were measured with radiostereometric analysis (RSA) at three, six, 12, 24, and 60 months postoperatively. PROMS were collected preoperatively and at all follow-up timepoints. Results. There was no clinically relevant difference in terms of tibial component migration, insert wear, and PROMs between the HXLPE and PE groups. The mean difference in tibial component migration (maximal total point migration (MTPM)) was 0.02 mm (95% confidence interval (CI) -0.07 to 0.11), which is below the value of 0.2 mm considered to be clinically relevant. Wear after five years for HXLPE was 0.16 mm (95% CI 0.05 to 0.27), and for PE was 0.23 mm (95% CI 0.12 to 0.35). The mean difference in wear rate was 0.01 mm/year (95% CI -0.02 to 0.05) in favour of the HXLPE group. Wear is mainly present on the medial side of the insert. Conclusion. There is no clinically relevant difference in tibial component migration and insert wear for up to five years between the HXLPE conventional PE inserts. For the implant studied, the potential advantages of a HXLPE insert remain to be proven under clinical conditions at longer-term follow-up. Cite this article: Bone Joint J 2023;105-B(5):518–525


The Bone & Joint Journal
Vol. 106-B, Issue 4 | Pages 412 - 418
1 Apr 2024
Alqarni AG Nightingale J Norrish A Gladman JRF Ollivere B

Aims. Frailty greatly increases the risk of adverse outcome of trauma in older people. Frailty detection tools appear to be unsuitable for use in traumatically injured older patients. We therefore aimed to develop a method for detecting frailty in older people sustaining trauma using routinely collected clinical data. Methods. We analyzed prospectively collected registry data from 2,108 patients aged ≥ 65 years who were admitted to a single major trauma centre over five years (1 October 2015 to 31 July 2020). We divided the sample equally into two, creating derivation and validation samples. In the derivation sample, we performed univariate analyses followed by multivariate regression, starting with 27 clinical variables in the registry to predict Clinical Frailty Scale (CFS; range 1 to 9) scores. Bland-Altman analyses were performed in the validation cohort to evaluate any biases between the Nottingham Trauma Frailty Index (NTFI) and the CFS. Results. In the derivation cohort, five of the 27 variables were strongly predictive of the CFS (regression coefficient B = 6.383 (95% confidence interval 5.03 to 7.74), p < 0.001): age, Abbreviated Mental Test score, admission haemoglobin concentration (g/l), pre-admission mobility (needs assistance or not), and mechanism of injury (falls from standing height). In the validation cohort, there was strong agreement between the NTFI and the CFS (mean difference 0.02) with no apparent systematic bias. Conclusion. We have developed a clinically applicable tool using easily and routinely measured physiological and functional parameters, which clinicians and researchers can use to guide patient care and to stratify the analysis of quality improvement and research projects. Cite this article: Bone Joint J 2024;106-B(4):412–418


The Bone & Joint Journal
Vol. 105-B, Issue 9 | Pages 1007 - 1012
1 Sep 2023
Hoeritzauer I Paterson M Jamjoom AAB Srikandarajah N Soleiman H Poon MTC Copley PC Graves C MacKay S Duong C Leung AHC Eames N Statham PFX Darwish S Sell PJ Thorpe P Shekhar H Roy H Woodfield J

Aims. Patients with cauda equina syndrome (CES) require emergency imaging and surgical decompression. The severity and type of symptoms may influence the timing of imaging and surgery, and help predict the patient’s prognosis. Categories of CES attempt to group patients for management and prognostication purposes. We aimed in this study to assess the inter-rater reliability of dividing patients with CES into categories to assess whether they can be reliably applied in clinical practice and in research. Methods. A literature review was undertaken to identify published descriptions of categories of CES. A total of 100 real anonymized clinical vignettes of patients diagnosed with CES from the Understanding Cauda Equina Syndrome (UCES) study were reviewed by consultant spinal surgeons, neurosurgical registrars, and medical students. All were provided with published category definitions and asked to decide whether each patient had ‘suspected CES’; ‘early CES’; ‘incomplete CES’; or ‘CES with urinary retention’. Inter-rater agreement was assessed for all categories, for all raters, and for each group of raters using Fleiss’s kappa. Results. Each of the 100 participants were rated by four medical students, five neurosurgical registrars, and four consultant spinal surgeons. No groups achieved reasonable inter-rater agreement for any of the categories. CES with retention versus all other categories had the highest inter-rater agreement (kappa 0.34 (95% confidence interval 0.27 to 0.31); minimal agreement). There was no improvement in inter-rater agreement with clinical experience. Across all categories, registrars agreed with each other most often (kappa 0.41), followed by medical students (kappa 0.39). Consultant spinal surgeons had the lowest inter-rater agreement (kappa 0.17). Conclusion. Inter-rater agreement for categorizing CES is low among clinicians who regularly manage these patients. CES categories should be used with caution in clinical practice and research studies, as groups may be heterogenous and not comparable. Cite this article: Bone Joint J 2023;105-B(9):1007–1012


The Bone & Joint Journal
Vol. 106-B, Issue 3 | Pages 256 - 261
1 Mar 2024
Goodall R Borsky K Harrison CJ Welck M Malhotra K Rodrigues JN

Aims. The Manchester-Oxford Foot Questionnaire (MOxFQ) is an anatomically specific patient-reported outcome measure (PROM) currently used to assess a wide variety of foot and ankle pathology. It consists of 16 items across three subscales measuring distinct but related traits: walking/standing ability, pain, and social interaction. It is the most used foot and ankle PROM in the UK. Initial MOxFQ validation involved analysis of 100 individuals undergoing hallux valgus surgery. This project aimed to establish whether an individual’s response to the MOxFQ varies with anatomical region of disease (measurement invariance), and to explore structural validity of the factor structure (subscale items) of the MOxFQ. Methods. This was a single-centre, prospective cohort study involving 6,637 patients (mean age 52 years (SD 17.79)) presenting with a wide range of foot and ankle pathologies between January 2013 and December 2021. To assess whether the MOxFQ responses vary by anatomical region of foot and ankle disease, we performed multigroup confirmatory factor analysis. To assess the structural validity of the subscale items, exploratory and confirmatory factor analyses were performed. Results. Measurement invariance by pathology was confirmed, suggesting the same model can be used across all foot and ankle anatomical regions. Exploratory factor analysis demonstrated a two- to three-factor model, and suggested that item 13 (inability to carry out work/everyday activities) and item 14 (inability to undertake social/recreational activities) loaded more positively onto the “walking/standing” subscale than their original “social interaction” subscale. Conclusion. This large cohort study supports the current widespread use of the MOxFQ across a broad range of foot and ankle pathologies. Our analyses found indications that could support alterations to the original factor structure (items 13 and 14 might be moved from the “social interaction” to the “walking/standing” subscale). However, this requires further work to confirm. Cite this article: Bone Joint J 2024;106-B(3):256–261


The Bone & Joint Journal
Vol. 105-B, Issue 3 | Pages 231 - 238
1 Mar 2023
Holme TJ Crate G Trompeter AJ Monsell FP Bridgens A Gelfer Y

Aims. The ‘pink, pulseless hand’ is often used to describe the clinical situation in which a child with a supracondylar fracture of the humerus has normal distal perfusion in the absence of a palpable peripheral pulse. The management guidelines are based on the assessment of perfusion, which is difficult to undertake and poorly evaluated objectively. The aim of this study was to review the available literature in order to explore the techniques available for the preoperative clinical assessment of perfusion in these patients and to evaluate the clinical implications. Methods. A systematic literature review was conducted using the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines and registered prospectively with the International Prospective Register of Systematic Reviews. Databases were explored in June 2022 with the search terms (pulseless OR dysvascular OR ischaemic OR perfused OR vascular injury) AND supracondylar AND (fracture OR fractures). Results. A total of 573 papers were identified as being suitable for further study, and 25 met the inclusion criteria for detailed analysis. These studies included a total of 504 patients with a perfused, pulseless limb associated with a supracondylar humeral fracture. Clinical examination included skin colour (23 studies (92%)), temperature (16 studies (64%)), and capillary refill time (13 studies (52%)). Other investigations included peripheral oxygen saturation (SaO2) (six studies (24%)), ultrasound (US) (14 (56%)), and CT angiogram (two studies (8.0%)). The parameters of ‘normal perfusion’ were often not objectively defined. The time to surgery ranged from 1.5 to 12 hours. A total of 412 patients (82%) were definitively treated with closed or open reduction and fixation, and 92 (18%) required vascular intervention, ranging from simple release of entrapped vessels to vascular grafts. Conclusion. The description of the vascular assessment of the patient with a supracondylar humeral fracture and a pulseless limb in the literature is variable, with few objective criteria being used to define perfusion. The evidence base for decision-making is limited, and further research is required. We were able, however, to make some recommendations about objective criteria for the assessment of these patients, and we suggest that these are performed frequently to allow the detection of any deterioration of perfusion. Cite this article: Bone Joint J 2023;105-B(3):231–238


Bone & Joint Open
Vol. 4, Issue 2 | Pages 53 - 61
1 Feb 2023
Faraj S de Windt TS van Hooff ML van Hellemondt GG Spruit M

Aims. The aim of this study was to assess the clinical and radiological results of patients who were revised using a custom-made triflange acetabular component (CTAC) for component loosening and pelvic discontinuity (PD) after previous total hip arthroplasty (THA). Methods. Data were extracted from a single centre prospective database of patients with PD who were treated with a CTAC. Patients were included if they had a follow-up of two years. The Hip Disability and Osteoarthritis Outcome Score (HOOS), modified Oxford Hip Score (mOHS), EurQol EuroQoL five-dimension three-level (EQ-5D-3L) utility, and Numeric Rating Scale (NRS), including visual analogue score (VAS) for pain, were gathered at baseline, and at one- and two-year follow-up. Reasons for revision, and radiological and clinical complications were registered. Trends over time are described and tested for significance and clinical relevance. Results. A total of 18 females with 22 CTACs who had a mean age of 73.5 years (SD 7.7) were included. A significant improvement was found in HOOS (p < 0.0001), mOHS (p < 0.0001), EQ-5D-3L utility (p = 0.003), EQ-5D-3L NRS (p = 0.013), VAS pain rest (p = 0.008), and VAS pain activity (p < 0.0001) between baseline and final follow-up. Minimal clinically important improvement in mOHS and the HOOS Physical Function Short Form (HOOS-PS) was observed in 16 patients (73%) and 14 patients (64%), respectively. Definite healing of the PD was observed in 19 hips (86%). Complications included six cases with broken screws (27%), four cases (18%) with bony fractures, and one case (4.5%) with sciatic nerve paresthesia. One patient with concurrent bilateral PD had revision surgery due to recurrent dislocations. No revision surgery was performed for screw failure or implant breakage. Conclusion. CTAC in patients with THA acetabular loosening and PD can result in stable constructs and significant improvement in functioning and health-related quality of life at two years' follow-up. Further follow-up is necessary to determine the mid- to long-term outcome. Cite this article: Bone Jt Open 2023;4(2):53–61


The Bone & Joint Journal
Vol. 106-B, Issue 7 | Pages 688 - 695
1 Jul 2024
Farrow L Zhong M Anderson L

Aims. To examine whether natural language processing (NLP) using a clinically based large language model (LLM) could be used to predict patient selection for total hip or total knee arthroplasty (THA/TKA) from routinely available free-text radiology reports. Methods. Data pre-processing and analyses were conducted according to the Artificial intelligence to Revolutionize the patient Care pathway in Hip and knEe aRthroplastY (ARCHERY) project protocol. This included use of de-identified Scottish regional clinical data of patients referred for consideration of THA/TKA, held in a secure data environment designed for artificial intelligence (AI) inference. Only preoperative radiology reports were included. NLP algorithms were based on the freely available GatorTron model, a LLM trained on over 82 billion words of de-identified clinical text. Two inference tasks were performed: assessment after model-fine tuning (50 Epochs and three cycles of k-fold cross validation), and external validation. Results. For THA, there were 5,558 patient radiology reports included, of which 4,137 were used for model training and testing, and 1,421 for external validation. Following training, model performance demonstrated average (mean across three folds) accuracy, F1 score, and area under the receiver operating curve (AUROC) values of 0.850 (95% confidence interval (CI) 0.833 to 0.867), 0.813 (95% CI 0.785 to 0.841), and 0.847 (95% CI 0.822 to 0.872), respectively. For TKA, 7,457 patient radiology reports were included, with 3,478 used for model training and testing, and 3,152 for external validation. Performance metrics included accuracy, F1 score, and AUROC values of 0.757 (95% CI 0.702 to 0.811), 0.543 (95% CI 0.479 to 0.607), and 0.717 (95% CI 0.657 to 0.778) respectively. There was a notable deterioration in performance on external validation in both cohorts. Conclusion. The use of routinely available preoperative radiology reports provides promising potential to help screen suitable candidates for THA, but not for TKA. The external validation results demonstrate the importance of further model testing and training when confronted with new clinical cohorts. Cite this article: Bone Joint J 2024;106-B(7):688–695


The Bone & Joint Journal
Vol. 106-B, Issue 8 | Pages 842 - 848
1 Aug 2024
Kriechling P Whitefield R Makaram NS Brown IDM Mackenzie SP Robinson CM

Aims. Vascular compromise due to arterial injury is a rare but serious complication of a proximal humeral fracture. The aims of this study were to report its incidence in a large urban population, and to identify clinical and radiological factors which are associated with this complication. We also evaluated the results of the use of our protocol for the management of these injuries. Methods. A total of 3,497 adult patients with a proximal humeral fracture were managed between January 2015 and December 2022 in a single tertiary trauma centre. Their mean age was 66.7 years (18 to 103) and 2,510 (72%) were female. We compared the demographic data, clinical features, and configuration of those whose fracture was complicated by vascular compromise with those of the remaining patients. The incidence of vascular compromise was calculated from national population data, and predictive factors for its occurrence were investigated using univariate analysis. Results. A total of 18 patients (0.5%) had a proximal humeral fracture and clinical evidence of vascular compromise, giving an annual incidence of 0.29 per 100,000 of the population. Their mean age was 68.7 years (45 to 92) and ten (56%) were female. Evidence of a mixed pattern neurological deficit (brachial plexus palsy) (odds ratio (OR) 380.6 (95% CI 85.9 to 1,685.8); p < 0.001), complete separation of the proximal shaft from the humeral head with medial displacement (OR 39.5 (95% CI 14.0 to 111.8); p < 0.001), and a fracture-dislocation (OR 5.0 (95% CI 1.6 to 15.3); p = 0.015) were all associated with an increased risk of associated vascular compromise. A policy of reduction and fixation of the fracture prior to vascular surgical intervention had favourable outcomes without vascular sequelae. Conclusion. The classic signs of distal ischaemia are often absent in patients with proximal injuries to major vessels. We were able to identify specific clinical and radiological ‘red flags’ which, particularly when present in combination, should increase the suspicion of a fracture with an associated vascular injury, and facilitate early diagnosis and appropriate combined orthopaedic and vascular intervention. Cite this article: Bone Joint J 2024;106-B(8):842–848



The Bone & Joint Journal
Vol. 105-B, Issue 3 | Pages 227 - 229
1 Mar 2023
Theologis T Brady MA Hartshorn S Faust SN Offiah AC

Acute bone and joint infections in children are serious, and misdiagnosis can threaten limb and life. Most young children who present acutely with pain, limping, and/or loss of function have transient synovitis, which will resolve spontaneously within a few days. A minority will have a bone or joint infection. Clinicians are faced with a diagnostic challenge: children with transient synovitis can safely be sent home, but children with bone and joint infection require urgent treatment to avoid complications. Clinicians often respond to this challenge by using a series of rudimentary decision support tools, based on clinical, haematological, and biochemical parameters, to differentiate childhood osteoarticular infection from other diagnoses. However, these tools were developed without methodological expertise in diagnostic accuracy and do not consider the importance of imaging (ultrasound scan and MRI). There is wide variation in clinical practice with regard to the indications, choice, sequence, and timing of imaging. This variation is most likely due to the lack of evidence concerning the role of imaging in acute bone and joint infection in children. We describe the first steps of a large UK multicentre study, funded by the National Institute for Health Research, which seeks to integrate definitively the role of imaging into a decision support tool, developed with the assistance of individuals with expertise in the development of clinical prediction tools. Cite this article: Bone Joint J 2023;105-B(3):227–229


Bone & Joint Research
Vol. 13, Issue 1 | Pages 19 - 27
5 Jan 2024
Baertl S Rupp M Kerschbaum M Morgenstern M Baumann F Pfeifer C Worlicek M Popp D Amanatullah DF Alt V

Aims. This study aimed to evaluate the clinical application of the PJI-TNM classification for periprosthetic joint infection (PJI) by determining intraobserver and interobserver reliability. To facilitate its use in clinical practice, an educational app was subsequently developed and evaluated. Methods. A total of ten orthopaedic surgeons classified 20 cases of PJI based on the PJI-TNM classification. Subsequently, the classification was re-evaluated using the PJI-TNM app. Classification accuracy was calculated separately for each subcategory (reinfection, tissue and implant condition, non-human cells, and morbidity of the patient). Fleiss’ kappa and Cohen’s kappa were calculated for interobserver and intraobserver reliability, respectively. Results. Overall, interobserver and intraobserver agreements were substantial across the 20 classified cases. Analyses for the variable ‘reinfection’ revealed an almost perfect interobserver and intraobserver agreement with a classification accuracy of 94.8%. The category 'tissue and implant conditions' showed moderate interobserver and substantial intraobserver reliability, while the classification accuracy was 70.8%. For 'non-human cells,' accuracy was 81.0% and interobserver agreement was moderate with an almost perfect intraobserver reliability. The classification accuracy of the variable 'morbidity of the patient' reached 73.5% with a moderate interobserver agreement, whereas the intraobserver agreement was substantial. The application of the app yielded comparable results across all subgroups. Conclusion. The PJI-TNM classification system captures the heterogeneity of PJI and can be applied with substantial inter- and intraobserver reliability. The PJI-TNM educational app aims to facilitate application in clinical practice. A major limitation was the correct assessment of the implant situation. To eliminate this, a re-evaluation according to intraoperative findings is strongly recommended. Cite this article: Bone Joint Res 2024;13(1):19–27