Aims. The aim of this study was to compare the preinjury functional scores with the postinjury preoperative score and postoperative outcome scores following anterior cruciate ligament (ACL) reconstruction surgery (ACLR). Methods. We performed a prospective study on patients who underwent primary ACLR by a single surgeon at a single centre between October 2010 and January 2018. Preoperative preinjury scores were collected at time of first assessment after the index injury. Preoperative (pre- and post-injury), one-year, and two-year postoperative functional outcomes were assessed by using the Knee injury and Osteoarthritis Outcome Score (KOOS), Lysholm Knee Score, and Tegner Activity Scale. Results. We enrolled 308 males and 263 females of mean age 27 years (19 to 46). The mean preinjury and preoperative post-injury Lysholm Knee Scores were 94 (73 to 100) and 63 (25 to 85), respectively, while the respective mean scores at one and two years postoperatively were 84 (71 to 100) and 89 (71 to 100; p < 0.001). The mean Tegner preinjury and preoperative post-injury scores were 7 (3 to 9) and 3 (0 to 6), respectively, while the respective mean scores at one and two years postoperatively were 6 (1 to 8) and 6 (1 to 9) (p < 0.001). The mean KOOS scores at preinjury versus two years postoperatively were: symptoms (96 vs 84); pain (94 vs 87); activities of daily living (97 vs 91), sports and recreation function (84 vs 71), and quality of life (82 vs 69), respectively (p < 0.001). Conclusion. Functional scores improved following ACLR surgery at two years in comparison to preoperative post-injury scores. However, at two-year follow-up, the majority of patients failed to achieve their preinjury scores. The evaluation of ACLR outcomes needs to consider the preinjury scores rather than the immediate preoperative score that is usually collected. Cite this article: Bone Jt Open 2023;4(1):46–52


Bone & Joint Open
Vol. 4, Issue 3 | Pages 129 - 137
1 Mar 2023
Patel A Edwards TC Jones G Liddle AD Cobb J Garner A

Aims. The metabolic equivalent of task (MET) score examines patient performance in relation to energy expenditure before and after knee arthroplasty. This study assesses its use in a knee arthroplasty population in comparison with the widely used Oxford Knee Score (OKS) and EuroQol five-dimension index (EQ-5D), which are reported to be limited by ceiling effects. Methods. A total of 116 patients with OKS, EQ-5D, and MET scores before, and at least six months following, unilateral primary knee arthroplasty were identified from a database. Procedures were performed by a single surgeon between 2014 and 2019 consecutively. Scores were analyzed for normality, skewness, kurtosis, and the presence of ceiling/floor effects. Concurrent validity between the MET score, OKS, and EQ-5D was assessed using Spearman’s rank. Results. Postoperatively the OKS and EQ-5D demonstrated negative skews in distribution, with high kurtosis at six months and one year. The OKS demonstrated a ceiling effect at one year (15.7%) postoperatively. The EQ-5D demonstrated a ceiling effect at six months (30.2%) and one year (39.8%) postoperatively. The MET score did not demonstrate a skewed distribution or ceiling effect either at six months or one year postoperatively. Weak-moderate correlations were noted between the MET score and conventional scores at six months and one year postoperatively. Conclusion. In contrast to the OKS and EQ-5D, the MET score was normally distributed postoperatively with no ceiling effect. It is worth consideration as an arthroplasty outcome measure, particularly for patients with high expectations. Cite this article: Bone Jt Open 2023;4(3):129–137
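The ceiling-effect percentages quoted in this abstract describe the share of patients at the top of a scale. A minimal Python sketch of that calculation follows, assuming the ceiling is defined as scoring exactly the scale maximum (48 for the OKS); the abstract does not state the definition used, and the scores below are made up for illustration, not study data.

```python
def ceiling_effect_pct(scores, max_score):
    """Percentage of respondents scoring exactly the scale maximum."""
    if not scores:
        raise ValueError("scores must not be empty")
    at_ceiling = sum(1 for s in scores if s == max_score)
    return 100.0 * at_ceiling / len(scores)


# Hypothetical OKS values (0 to 48), for illustration only.
example_oks = [48, 45, 48, 39, 30, 48, 44, 41, 47, 36]
print(ceiling_effect_pct(example_oks, 48))  # 3 of 10 patients at the maximum
```

A floor effect can be estimated the same way by passing the scale minimum instead of the maximum.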


Bone & Joint Research
Vol. 11, Issue 5 | Pages 317 - 326
23 May 2022
Edwards TC Guest B Garner A Logishetty K Liddle AD Cobb JP

Aims. This study investigates the use of the metabolic equivalent of task (MET) score in a young hip arthroplasty population, and its ability to capture additional benefit beyond the ceiling effect of conventional patient-reported outcome measures. Methods. From our electronic database of 751 hip arthroplasty procedures, 221 patients were included. Patients were excluded if they had revision surgery, an alternative hip procedure, or incomplete data either preoperatively or at one-year follow-up. Included patients had a mean age of 59.4 years (SD 11.3) and 54.3% were male, incorporating 117 primary total hip and 104 hip resurfacing arthroplasty operations. Oxford Hip Score (OHS), EuroQol five-dimension questionnaire (EQ-5D), and the MET were recorded preoperatively and at one-year follow-up. The distribution was examined reporting the presence of ceiling and floor effects. Validity was assessed correlating the MET with the other scores using Spearman’s rank correlation coefficient and determining responsiveness. A subgroup of 93 patients scoring 48/48 on the OHS were analyzed by age, sex, BMI, and preoperative MET using the other metrics to determine if differences could be established despite scoring identically on the OHS. Results. Postoperatively the OHS and EQ-5D demonstrate considerable negatively skewed distributions with ceiling effects of 41.6% and 53.8%, respectively. The MET was normally distributed postoperatively with no relevant ceiling effect. Weak-to-moderate significant correlations were found between the MET and the other two metrics. In the 48/48 subgroup, no differences were found comparing groups with the EQ-5D; however, significantly higher mean MET scores were demonstrated for patients aged < 60 years (12.7 (SD 4.7) vs 10.6 (SD 2.4), p = 0.008), male patients (12.5 (SD 4.5) vs 10.8 (SD 2.8), p = 0.024), and those with preoperative MET scores > 6 (12.6 (SD 4.2) vs 11.0 (SD 3.3), p = 0.040). Conclusion. The MET is normally distributed in patients following hip arthroplasty, recording levels of activity which are undetectable using the OHS. Cite this article: Bone Joint Res 2022;11(5):317–326


Bone & Joint Open
Vol. 3, Issue 10 | Pages 786 - 794
12 Oct 2022
Harrison CJ Plummer OR Dawson J Jenkinson C Hunt A Rodrigues JN

Aims. The aim of this study was to develop and evaluate machine-learning-based computerized adaptive tests (CATs) for the Oxford Hip Score (OHS), Oxford Knee Score (OKS), Oxford Shoulder Score (OSS), and the Oxford Elbow Score (OES) and its subscales. Methods. We developed CAT algorithms for the OHS, OKS, OSS, overall OES, and each of the OES subscales, using responses to the full-length questionnaires and a machine-learning technique called regression tree learning. The algorithms were evaluated through a series of simulation studies, in which they aimed to predict respondents’ full-length questionnaire scores from only a selection of their item responses. In each case, the total number of items used by the CAT algorithm was recorded and CAT scores were compared to full-length questionnaire scores by mean, SD, score distribution plots, Pearson’s correlation coefficient, intraclass correlation (ICC), and the Bland-Altman method. Differences between CAT scores and full-length questionnaire scores were contextualized through comparison to the instruments’ minimal clinically important difference (MCID). Results. The CAT algorithms accurately estimated 12-item questionnaire scores from between four and nine items. Scores followed a very similar distribution between CAT and full-length assessments, with the mean score difference ranging from 0.03 to 0.26 out of 48 points. Pearson’s correlation coefficient and ICC were 0.98 for each 12-item scale and 0.95 or higher for the OES subscales. In over 95% of cases, a patient’s CAT score was within five points of the full-length questionnaire score for each 12-item questionnaire. Conclusion. Oxford Hip Score, Oxford Knee Score, Oxford Shoulder Score, and Oxford Elbow Score (including separate subscale scores) CATs all markedly reduce the burden of items to be completed without sacrificing score accuracy. Cite this article: Bone Jt Open 2022;3(10):786–794
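The Bland-Altman comparison mentioned in this abstract summarizes agreement between two scoring methods as a mean difference (bias) with 95% limits of agreement. The Python sketch below illustrates the usual calculation, assuming limits of bias ± 1.96 × SD of the paired differences; the function name and the paired scores are hypothetical and are not taken from the study.

```python
from statistics import mean, stdev


def bland_altman(a, b):
    """Return (bias, lower_loa, upper_loa) for paired measurements a and b."""
    if len(a) != len(b) or len(a) < 2:
        raise ValueError("need two equal-length samples with at least 2 pairs")
    diffs = [x - y for x, y in zip(a, b)]
    bias = mean(diffs)
    spread = 1.96 * stdev(diffs)  # sample SD of the paired differences
    return bias, bias - spread, bias + spread


# Hypothetical CAT and full-length scores (0 to 48) for five respondents.
cat_scores = [40, 35, 22, 47, 30]
full_scores = [41, 34, 22, 46, 31]
bias, lower, upper = bland_altman(cat_scores, full_scores)
print(bias, lower, upper)
```

Comparing the width of the limits of agreement with the instrument's MCID is one way to judge whether the disagreement is clinically relevant.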


Bone & Joint Open
Vol. 3, Issue 4 | Pages 307 - 313
7 Apr 2022
Singh V Bieganowski T Huang S Karia R Davidovitch RI Schwarzkopf R

Aims. The Forgotten Joint Score-12 (FJS-12) is a validated patient-reported outcome measure (PROM) tool designed to assess artificial prosthesis awareness during daily activities following total hip arthroplasty (THA). The patient-acceptable symptom state (PASS) is the minimum cut-off value that corresponds to a patient’s satisfactory state-of-health. Despite the validity and reliability of the FJS-12 having been previously demonstrated, the PASS has yet to be clearly defined. This study aims to define the PASS of the FJS-12 following primary THA. Methods. We retrospectively reviewed all patients who underwent primary elective THA from 2019 to 2020, and answered both the FJS-12 and the Hip Disability and Osteoarthritis Outcome Score, Joint Replacement (HOOS, JR) questionnaires one-year postoperatively. HOOS, JR score was used as the anchor to estimate the PASS of FJS-12. Two statistical methods were employed: the receiver operating characteristic (ROC) curve point, which maximized the Youden index; and 75th percentile of the cumulative percentage curve of patients who had the HOOS, JR score difference larger than the cut-off value. Results. This study included 780 patients. The mean one-year FJS-12 score was 65.42 (SD 28.59). The mean one-year HOOS, JR score was 82.70 (SD 16.57). A high positive correlation between FJS-12 and HOOS, JR was found (r = 0.74; p<0.001), making the HOOS, JR a valid external anchor. The threshold score of the FJS-12 that maximized the sensitivity and specificity for detecting a PASS was 66.68 (area under the curve = 0.8). The cut-off score value computed with the 75th percentile approach was 92.20. Conclusion. The PASS threshold for the FJS-12 at one year following primary THA was 66.68 and 92.20 using the ROC curve and 75th percentile approaches, respectively. These values can be used to achieve consensus about meaningful postoperative improvement to maximize the utility of the FJS-12 to evaluate and counsel patients undergoing THA. Cite this article: Bone Jt Open 2022;3(4):307–313
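The ROC-based cut-off in this abstract is the score that maximizes the Youden index, J = sensitivity + specificity − 1. The Python sketch below shows how such a threshold could be located. It assumes the anchor has already been dichotomized into a yes/no label per patient, which the abstract does not detail; the function name and data are hypothetical.

```python
def youden_threshold(scores, is_positive):
    """Return (threshold, J) maximizing J = sensitivity + specificity - 1.

    A case is predicted positive when its score is >= threshold.
    Ties are resolved in favour of the lowest threshold.
    """
    n_pos = sum(is_positive)
    n_neg = len(is_positive) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one positive and one negative case")
    best_t, best_j = None, -1.0
    for t in sorted(set(scores)):
        tp = sum(1 for s, p in zip(scores, is_positive) if p and s >= t)
        tn = sum(1 for s, p in zip(scores, is_positive) if not p and s < t)
        j = tp / n_pos + tn / n_neg - 1.0
        if j > best_j:
            best_t, best_j = t, j
    return best_t, best_j


# Hypothetical FJS-12 scores and anchor-derived "acceptable state" labels.
fjs = [20, 35, 50, 60, 70, 80, 90, 95]
acceptable = [False, False, False, True, False, True, True, True]
print(youden_threshold(fjs, acceptable))
```

This brute-force scan is adequate for small samples; a dedicated ROC routine from a statistics library would be the more usual choice for a cohort of several hundred patients.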


Bone & Joint Open
Vol. 3, Issue 7 | Pages 573 - 581
1 Jul 2022
Clement ND Afzal I Peacock CJH MacDonald D Macpherson GJ Patton JT Asopa V Sochart DH Kader DF

Aims. The aims of this study were to assess mapping models to predict the three-level version of EuroQoL five-dimension utility index (EQ-5D-3L) from the Oxford Knee Score (OKS) and validate these before and after total knee arthroplasty (TKA). Methods. A retrospective cohort of 5,857 patients was used to create the prediction models, and a second cohort of 721 patients from a different centre was used to validate the models, all of whom underwent TKA. Patient characteristics, BMI, OKS, and EQ-5D-3L were collected preoperatively and one year postoperatively. Generalized linear regression was used to formulate the prediction models. Results. There were significant correlations between the OKS and EQ-5D-3L preoperatively (r = 0.68; p < 0.001) and postoperatively (r = 0.77; p < 0.001) and for the change in the scores (r = 0.61; p < 0.001). Three different models (preoperative, postoperative, and change) were created. There were no significant differences between the actual and predicted mean EQ-5D-3L utilities at any timepoint or for change in the scores (p > 0.090) in the validation cohort. There was a significant correlation between the actual and predicted EQ-5D-3L utilities preoperatively (r = 0.63; p < 0.001) and postoperatively (r = 0.77; p < 0.001) and for the change in the scores (r = 0.56; p < 0.001). Bland-Altman plots demonstrated that a lower utility was overestimated, and higher utility was underestimated. The individual predicted EQ-5D-3L that was within ± 0.05 and ± 0.010 (minimal clinically important difference (MCID)) of the actual EQ-5D-3L varied from 13% to 35% and from 26% to 64%, respectively, according to timepoint assessed and change in the scores, but was not significantly different between the modelling and validation cohorts (p ≥ 0.148). Conclusion. The OKS can be used to estimate EQ-5D-3L. Predicted individual patient utility error beyond the MCID varied from one-third to two-thirds depending on timepoint assessed, but the mean for a cohort did not differ and could be employed for this purpose. Cite this article: Bone Jt Open 2022;3(7):573–581


Bone & Joint Open
Vol. 2, Issue 9 | Pages 765 - 772
14 Sep 2021
Silitonga J Djaja YP Dilogo IH Pontoh LAP

Aims. The aim of this study was to perform a cross-cultural adaptation of Oxford Hip Score (OHS) to Indonesian, and to evaluate its psychometric properties. Methods. We performed a cross-cultural adaptation of Oxford Hip Score into the Indonesian language (OHS-ID) and determined its internal consistency, test-retest reliability, measurement error, floor-ceiling effect, responsiveness, and construct validity by hypotheses testing of its correlation with Harris Hip Score (HHS), visual analogue scale (VAS), and Short Form-36 (SF-36). Adults (> 17 years old) with chronic hip pain (osteoarthritis or osteonecrosis) were included. Results. A total of 125 patients were included, including 50 total hip arthroplasty (THA) patients with six months follow-up. The OHS questionnaire was translated into Indonesian and showed good internal consistency (Cronbach’s alpha = 0.89) and good reliability (intraclass correlation = 0.98). The standard error of measurement value of 2.11 resulted in minimal detectable change score of 5.8. Ten out of ten (100%) a priori hypotheses were met, confirming the construct validity. A strong correlation was found with two subscales of SF-36 (pain and physical function), HHS (0.94), and VAS (-0.83). OHS-ID also showed good responsiveness for post-THA series. Floor and ceiling effect was not found. Conclusion. The Indonesian version of OHS showed similar reliability and validity to the original OHS. This questionnaire will be suitable to assess chronic hip pain in Indonesian-speaking patients. Cite this article: Bone Jt Open 2021;2(9):765–772
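The step from a standard error of measurement of 2.11 to a minimal detectable change of 5.8 can be reproduced with the commonly used formula MDC95 = 1.96 × √2 × SEM. The abstract does not state which formula was applied, so the sketch below is an assumption that happens to match the reported figure.

```python
import math


def mdc95(sem):
    """Minimal detectable change at 95% confidence, assuming 1.96 * sqrt(2) * SEM."""
    return 1.96 * math.sqrt(2) * sem


print(round(mdc95(2.11), 1))  # 5.8, matching the value reported in the abstract
```

The factor √2 reflects that a change score combines the measurement error of two separate assessments.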


Bone & Joint Open
Vol. 4, Issue 3 | Pages 138 - 145
1 Mar 2023
Clark JO Razii N Lee SWJ Grant SJ Davison MJ Bailey O

Aims. The COVID-19 pandemic has caused unprecedented disruption to elective orthopaedic services. The primary objective of this study was to examine changes in functional scores in patients awaiting total hip arthroplasty (THA), total knee arthroplasty (TKA), and unicompartmental knee arthroplasty (UKA). Secondary objectives were to investigate differences between these groups and identify those in a health state ‘worse than death’ (WTD). Methods. In this prospective cohort study, preoperative Oxford hip and knee scores (OHS/OKS) were recorded for patients added to a waiting list for THA, TKA, or UKA, during the initial eight months of the COVID-19 pandemic, and repeated at 14 months into the pandemic (mean interval nine months (SD 2.84)). EuroQoL five-dimension five-level health questionnaire (EQ-5D-5L) index scores were also calculated at this point in time, with a negative score representing a state WTD. OHS/OKS were analyzed over time and in relation to the EQ-5D-5L. Results. A total of 174 patients (58 THA, 74 TKA, 42 UKA) were eligible, after 27 were excluded (one died, seven underwent surgery, 19 non-responders). The overall mean OHS/OKS deteriorated from 15.43 (SD 6.92), when patients were added to the waiting list, to 11.77 (SD 6.45) during the pandemic (p < 0.001). There were significantly worse EQ-5D-5L index scores in the THA group (p = 0.005), with 22 of these patients (38%) in a health state WTD, than either the TKA group (20 patients; 27% WTD), or the UKA group (nine patients; 21% WTD). A strong positive correlation between the EQ-5D-5L index score and OHS/OKS was observed (r = 0.818; p < 0.001). Receiver operating characteristic analysis revealed that an OHS/OKS lower than nine predicted a health state WTD (88% sensitivity and 73% specificity). Conclusion. OHS/OKS deteriorated significantly among patients awaiting lower limb arthroplasty during the COVID-19 pandemic. Overall, 51 patients were in a health state WTD, representing 29% of our entire cohort, which is considerably worse than existing pre-pandemic data. Cite this article: Bone Jt Open 2023;4(3):138–145
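The group counts in this abstract can be cross-checked against the overall figure of 51 patients (29%) in a health state WTD. The short Python sketch below redoes that arithmetic using only the numbers reported above; the variable and function names are illustrative.

```python
# (patients in group, patients in a health state WTD), as reported in the abstract.
groups = {"THA": (58, 22), "TKA": (74, 20), "UKA": (42, 9)}


def wtd_percent(n_patients, n_wtd):
    """Percentage of a group in a health state 'worse than death'."""
    return 100.0 * n_wtd / n_patients


total_patients = sum(n for n, _ in groups.values())
total_wtd = sum(w for _, w in groups.values())

for name, (n, w) in groups.items():
    print(f"{name}: {w}/{n} = {wtd_percent(n, w):.0f}%")
print(f"All: {total_wtd}/{total_patients} = {wtd_percent(total_patients, total_wtd):.0f}%")
```

The per-group percentages round to 38%, 27%, and 21%, and the pooled figure to 29% of 174 patients, consistent with the abstract.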


Bone & Joint Open
Vol. 1, Issue 2 | Pages 3 - 7
5 Feb 2020
Widnall J Capstick T Wijesekera M Messahel S Perry DC

Aims. This study sought to estimate the clinical outcomes and describe the nationwide variation in practice, as part of the feasibility workup for a National Institute for Health and Care Excellence (NICE) recommended randomized clinical trial to determine the optimal treatment of torus fractures of the distal radius in children. Methods. Data were collected prospectively on torus fractures presenting to our emergency department. Patient consent and study information, including a copy of the Wong-Baker Faces pain score, was issued at the first patient contact. An automated text message service recorded pain scores at days 0, 3, 7, 21, and 42 postinjury. A cross-sectional survey of current accident and emergency practice in the UK was also undertaken to gauge current practice following the publication of NICE guidance. Results. In all, 30 patients with a mean age of 8.9 years were enrolled over a six-week period. Of the 150 potential data points, data was captured in 146, making the data 97.3% complete. Pain scores were recorded at day 0 (mean 6.5 (95% confidence interval (CI) 5.7 to 7.3)), day 3 (4.4 (95% CI 3.5 to 5.2)), day 7 (3.0 (95% CI 2.3 to 3.6)), day 21 (1.2 (95% CI 0.7 to 1.7)) and day 42 (0.4 (95% CI 0.1 to 0.7)). Of the 100 units who participated in the nationwide survey, 38% were unaware of any local or national protocols regarding torus fractures, 41% treated torus fractures with cast immobilization, and over 60% of patients had follow-up arranged, both contradictory to national guidelines. Conclusion. We have demonstrated the severity, recovery trajectory, and variation in pain scores among children with torus fractures. We demonstrate excellent follow-up of patient outcomes using text messages. Despite national guidelines, there is significant variation in practice. This data directly informed the development of an ongoing nationwide randomized clinical trial – the FORearm Fracture Recovery in Children Evaluation (FORCE) study.


Bone & Joint Research
Vol. 5, Issue 4 | Pages 116 - 121
1 Apr 2016
Leow JM Clement ND Tawonsawatruk T Simpson CJ Simpson AHRW

Objectives. The radiographic union score for tibial (RUST) fractures was developed by Whelan et al to assess the healing of tibial fractures following intramedullary nailing. In the current study, the repeatability and reliability of the RUST score was evaluated in an independent centre (a) using the original description, (b) after further interpretation of the description of the score, and (c) with the immediate post-operative radiograph available for comparison. Methods. A total of 15 radiographs of tibial shaft fractures treated by intramedullary nailing (IM) were scored by three observers using the RUST system. Following discussion on how the criteria of the RUST system should be implemented, 45 sets (i.e. AP and lateral) of radiographs of IM nailed tibial fractures were scored by five observers. Finally, these 45 sets of radiographs were rescored with the baseline post-operative radiograph available for comparison. Results. The initial intraclass correlation (ICC) on the first 15 sets of radiographs was 0.67 (95% CI 0.63 to 0.71). However, the original description was being interpreted in different ways. After agreeing on the interpretation, the ICC on the second cohort improved to 0.75. The ICC improved even further to 0.79, when the baseline post-operative radiographs were available for comparison. Conclusion. This study demonstrates that the RUST scoring system is a reliable and repeatable outcome measure for assessing tibial fracture healing. Further improvement in the reliability of the scoring system can be obtained if the radiographs are compared with the baseline post-operative radiographs. Cite this article: Mr J.M. Leow. The radiographic union scale in tibial (RUST) fractures: Reliability of the outcome measure at an independent centre. Bone Joint Res 2016;5:116–121. DOI: 10.1302/2046-3758.54.2000628


Bone & Joint Open
Vol. 2, Issue 12 | Pages 1089 - 1095
21 Dec 2021
Luo W Ali MS Limb R Cornforth C Perry DC

Aims. The Patient-Reported Outcomes Measurement Information System (PROMIS) has demonstrated faster administration, lower burden of data capture and reduced floor and ceiling effects compared to traditional Patient Reported Outcomes Measurements (PROMs). We investigated the suitability of PROMIS Mobility score in assessing physical function in the sequelae of childhood hip disease. Methods. In all, 266 adolescents (aged ≥ 12 years) and adults were identified with a prior diagnosis of childhood hip disease (either Perthes’ disease (n = 232 (87.2%)) or Slipped Capital Femoral Epiphysis (n = 34 (12.8%))), with a mean age of 27.73 years (SD 12.24). Participants completed the PROMIS Mobility Computer Adaptive Test, the Non-Arthritic Hip Score (NAHS), EuroQol five-dimension five-level questionnaire, and the Numeric Pain Rating Scale. We investigated the correlation between the PROMIS Mobility and other tools to assess use in this population and any clustering of outcome scores. Results. There was a strong correlation between the PROMIS Mobility and other established PROMs; NAHS (rs = 0.79; p < 0.001). There was notable clustering in PROMIS at the upper end of the distribution score (42.5%), with less seen in the NAHS (20.3%). However, the clustering was broadly similar between PROMIS Mobility and the comparable domains of the NAHS; function (53.6%), and activity (35.0%). Conclusion. PROMIS Mobility strongly correlated with other tools demonstrating convergent construct validity. There was clustering of physical function scores at the upper end of the distributions, which may reflect truncation of the data caused by participants having excellent outcomes. There were elements of disease not captured within PROMIS Mobility alone, and difficulties in differentiating those with the highest levels of function. Cite this article: Bone Jt Open 2021;2(12):1089–1095


Aims. The purpose of this study was to assess the reliability and responsiveness to hip surgery of a four-point modified Care and Comfort Hypertonicity Questionnaire (mCCHQ) scoring tool in children with cerebral palsy (CP) in Gross Motor Function Classification System (GMFCS) levels IV and V. Methods. This was a population-based cohort study in children with CP from a national surveillance programme. Reliability was assessed from 20 caregivers who completed the mCCHQ questionnaire on two occasions three weeks apart. Test-retest reliability of the mCCHQ was calculated, and responsiveness before and after surgery for a displaced hip was evaluated in a cohort of children. Results. Test-retest reliability for the overall mCCHQ score was good (intraclass correlation coefficient 0.78), and no dimension demonstrated poor reliability. The surgical intervention cohort comprised ten children who had preoperative and postoperative mCCHQ scores at a minimum of six months postoperatively. The mCCHQ tool demonstrated a significant improvement in overall score from preoperative assessment to six-month postoperative follow-up assessment (p < 0.001). Conclusion. The mCCHQ demonstrated responsiveness to intervention and good test-retest reliability. The mCCHQ is proposed as an outcome tool for use within a national surveillance programme for children with CP. Cite this article: Bone Jt Open 2023;4(8):580–583


Bone & Joint Research
Vol. 3, Issue 11 | Pages 305 - 309
1 Nov 2014
Harris KK Price AJ Beard DJ Fitzpatrick R Jenkinson C Dawson J

Objective. The objective of this study was to explore dimensionality of the Oxford Hip Score (OHS) and examine whether self-reported pain and functioning can be distinguished in the form of subscales. Methods. This was a secondary data analysis of the UK NHS hospital episode statistics/patient-reported outcome measures dataset containing pre-operative OHS scores on 97 487 patients who were undergoing hip replacement surgery. Results. The proposed number of factors to extract depended on the method of extraction employed. Velicer’s Minimum Average Partial test and the Parallel Analysis suggested one factor, the Cattell’s scree test and Kaiser-over-1 rule suggested two factors. Exploratory factor analysis demonstrated that the two-factor OHS had most of the items saliently loading either of the two factors. These factors were named ‘Pain’ and ‘Function’ and their respective subscales were created. There was some cross-loading of items: 8 (pain on standing up from a chair) and 11 (pain during work). These items were assigned to the ‘Pain’ subscale. The final ‘Pain’ subscale consisted of items 1, 8, 9, 10, 11 and 12. The ‘Function’ subscale consisted of items 2, 3, 4, 5, 6 and 7, with the recommended scoring of the subscales being from 0 (worst) to 100 (best). Cronbach’s alpha was 0.855 for the ‘Pain’ subscale and 0.861 for the ‘Function’ subscale. A confirmatory factor analysis demonstrated that the two-factor model of the OHS had a better fit. However, none of the one-factor or two-factor models was rejected. Conclusion. Factor analyses demonstrated that, in addition to current usage as a single summary scale, separate information on pain and self-reported function can be extracted from the OHS in a meaningful way in the form of subscales. Cite this article: Bone Joint Res 2014;3:305–9
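The item allocation described in this abstract can be turned into subscale scores on a 0 (worst) to 100 (best) range. The Python sketch below assumes each item is scored 0 to 4 with 4 as the best response, and that the 0 to 100 score is a simple linear rescaling of the item sum; neither assumption is stated in the abstract, and the function name and responses are hypothetical.

```python
# Item allocation as reported in the abstract.
PAIN_ITEMS = (1, 8, 9, 10, 11, 12)
FUNCTION_ITEMS = (2, 3, 4, 5, 6, 7)


def subscale_score(responses, items, max_item=4):
    """Rescale the summed item scores to 0 (worst) to 100 (best).

    `responses` maps item number (1 to 12) to an item score of 0..max_item.
    The linear rescaling is an assumption, not taken from the abstract.
    """
    raw = sum(responses[i] for i in items)
    return 100.0 * raw / (max_item * len(items))


# Hypothetical patient: pain items answered 1, function items answered 3.
responses = {i: 1 for i in PAIN_ITEMS}
responses.update({i: 3 for i in FUNCTION_ITEMS})
print(subscale_score(responses, PAIN_ITEMS))      # 25.0
print(subscale_score(responses, FUNCTION_ITEMS))  # 75.0
```

This hypothetical patient would have a summary OHS of 24/48, while the two subscales show that pain is the more affected dimension.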


Bone & Joint Research
Vol. 1, Issue 9 | Pages 225 - 233
1 Sep 2012
Paulsen A Odgaard A Overgaard S

Objectives. The Oxford hip score (OHS) is a 12-item questionnaire designed and developed to assess function and pain from the perspective of patients who are undergoing total hip replacement (THR). The OHS has been shown to be consistent, reliable, valid and sensitive to clinical change following THR. It has been translated into different languages, but no adequately translated, adapted and validated Danish language version exists. Methods. The OHS was translated and cross-culturally adapted into Danish from the original English version, using methods based on best-practice guidelines. The translation was tested for psychometric quality in patients drawn from a cohort from the Danish Hip Arthroplasty Register (DHR). Results. The Danish OHS had a response rate of 87.4%, no floor effect and a 19.9% ceiling effect (as expected in post-operative patients). Only 1.2% of patients had too many items missing to calculate a sum score. Construct validity was adequate and 80% of our predefined hypotheses regarding the correlation between scores on the Danish OHS and the other questionnaires were confirmed. The intraclass correlation (ICC) of the different items ranged from 0.80 to 0.95 and the average limits of agreement (LOA) ranged from -0.05 to 0.06. The Danish OHS had a high internal consistency with a Cronbach’s alpha of 0.99 and an average inter-item correlation of 0.88. Conclusions. This Danish version of the OHS is a valid and reliable patient-reported outcome measurement instrument (PROM) with similar qualities to the original English language version.


Bone & Joint Research
Vol. 4, Issue 8 | Pages 137 - 144
1 Aug 2015
Hamilton DF Giesinger JM Patton JT MacDonald DJ Simpson AHRW Howie CR Giesinger K

Objectives. The Oxford Hip and Knee Scores (OHS, OKS) have been demonstrated to vary according to age and gender, making it difficult to compare results in cohorts with different demographics. The aim of this paper was to calculate reference values for different patient groups and highlight the concept of normative reference data to contextualise an individual’s outcome. Methods. We accessed prospectively collected OHS and OKS data for patients undergoing lower limb joint arthroplasty at a single orthopaedic teaching hospital during a five-year period. T-scores were calculated based on the OHS and OKS distributions. Results. Data were obtained from 3203 total hip arthroplasty (THA) patients and 2742 total knee arthroplasty (TKA) patients. The mean age of the patients was 68.0 years (SD 11.3; 58.4% women) in the THA group and 70.2 years (SD 9.4; 57.5% women) in the TKA group. T-scores were calculated for age and gender subgroups by operation. Different T-score thresholds are seen at different time points pre and post surgery. Values are further stratified by operation (THA/TKA), age, and gender. Conclusions. Normative data interpretation requires a fundamental shift in the thinking as to the use of the Oxford Scores. Instead of reporting actual score points, the patient is rated by their relative position within the group of all patients undergoing the same procedure. It is proposed that this form of transformation is beneficial (a) for more appropriately comparing different patient cohorts and (b) informing an individual patient how they are progressing compared with others of their age and gender. Cite this article: Bone Joint Res 2015;4:137–144
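A T-score expresses a raw score relative to a reference group, conventionally on a scale with mean 50 and SD 10. The Python sketch below shows that conventional transformation, assuming a linear z-score conversion; the abstract does not say exactly how its T-scores were derived, and the reference mean and SD used here are hypothetical, not values from the study.

```python
def t_score(raw, ref_mean, ref_sd):
    """Convert a raw score to a T-score (reference mean 50, SD 10)."""
    if ref_sd <= 0:
        raise ValueError("ref_sd must be positive")
    return 50.0 + 10.0 * (raw - ref_mean) / ref_sd


# Hypothetical reference values for one age/gender/operation subgroup.
print(t_score(40, ref_mean=35, ref_sd=10))  # 55.0: half an SD above the subgroup mean
```

With subgroup-specific reference values, the same raw Oxford score can map to different T-scores for patients of different age or gender, which is the point the abstract makes about relative position.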


Bone & Joint Research
Vol. 12, Issue 3 | Pages 155 - 164
1 Mar 2023
McCarty CP Nazif MA Sangiorgio SN Ebramzadeh E Park S

Aims. Taper corrosion has been widely reported to be problematic for modular total hip arthroplasty implants. A simple and systematic method to evaluate taper damage with sufficient resolution is needed. We introduce a semiquantitative grading system for modular femoral tapers to characterize taper corrosion damage. Methods. After examining a unique collection of retrieved cobalt-chromium (CoCr) taper sleeves (n = 465) using the widely-used Goldberg system, we developed an expanded six-point visual grading system (VGS) intended to characterize the severity, visible material loss, and absence of direct component contact due to corrosion. Female taper sleeve damage was evaluated by three blinded observers using the Goldberg scoring system and the expanded system. A subset (n = 85) was then re-evaluated following destructive cleaning, using both scoring systems. Material loss for this subset was quantified using metrology and correlated with both scoring systems. Results. There was substantial agreement in grading among all three observers with uncleaned (n = 465) and with the subset of cleaned (n = 85) implants. The expanded scoring criteria provided a wider distribution of scores which ultimately correlated well with corrosion material loss. Cleaning changed the average scores marginally using the Goldberg criteria (p = 0.290); however, using the VGS, approximately 40% of the scores for all observers changed, increasing the average score from 4.24 to 4.35 (p = 0.002). There was a strong correlation between measured material loss and new grading scores. Conclusion. The expanded scoring criteria provided a wider distribution of scores which ultimately correlated well with corrosion material loss. This system provides potential advantages for assessing taper damage without requiring specialized imaging devices. Cite this article: Bone Joint Res 2023;12(3):155–164


The Bone & Joint Journal
Vol. 106-B, Issue 4 | Pages 394 - 400
1 Apr 2024
Kjærvik C Gjertsen J Stensland E Dybvik EH Soereide O

Aims. The aims of this study were to assess quality of life after hip fractures, to characterize respondents to patient-reported outcome measures (PROMs), and to describe the recovery trajectory of hip fracture patients. Methods. Data on 35,206 hip fractures (2014 to 2018; 67.2% female) in the Norwegian Hip Fracture Register were linked to data from the Norwegian Patient Registry and Statistics Norway. PROMs data were collected using the EuroQol five-dimension three-level questionnaire (EQ-5D-3L) scoring instrument and living patients were invited to respond at four, 12, and 36 months post fracture. Multiple imputation procedures were performed as a model to substitute missing PROM data. Differences in response rates between categories of covariates were analyzed using chi-squared test statistics. The association between patient and socioeconomic characteristics and the reported EQ-5D-3L scores was analyzed using linear regression. Results. The median age was 83 years (interquartile range 76 to 90), and 3,561 (10%) lived in a healthcare facility. Observed mean pre-fracture EQ-5D-3L index score was 0.81 (95% confidence interval 0.803 to 0.810), which decreased to 0.66 at four months, to 0.70 at 12 months, and to 0.73 at 36 months. In the imputed datasets, the reduction from pre-fracture was similar (0.15 points) but an improvement up to 36 months was modest (0.01 to 0.03 points). Patients with higher age, male sex, severe comorbidity, cognitive impairment, lower income, lower education, and those in residential care facilities had a lower proportion of respondents, and systematically reported a lower health-related quality of life (HRQoL). The response pattern of patients influenced scores significantly, and the highest scores are found in patients reporting scores at all observation times. Conclusion. Hip fracture leads to a persistent reduction in measured HRQoL, up to 36 months. The patients’ health and socioeconomic status were associated with the proportion of patients returning PROM data for analysis, and affected the results reported. Observed EQ-5D-3L scores are affected by attrition and selection bias mechanisms and motivate the use of statistical modelling for adjustment. Cite this article: Bone Joint J 2024;106-B(4):394–400


The Bone & Joint Journal
Vol. 105-B, Issue 6 | Pages 711 - 716
1 Jun 2023
Ali MS Khattak M Metcalfe D Perry DC

Aims. This study aimed to evaluate the relationship between hip shape and mid-term function in Perthes’ disease. It also explored whether the modified three-group Stulberg classification can offer similar prognostic information to the five-group system. Methods. A total of 136 individuals aged 12 years or older who had Perthes’ disease in childhood completed the Patient-Reported Outcomes Measurement Information System (PROMIS) Mobility score (function), Nonarthritic Hip Score (NAHS) (function), EuroQol five-dimension five-level questionnaire (EQ-5D-5L) score (quality of life), and the numeric rating scale for pain (NRS). The Stulberg class of the participants’ hip radiographs was evaluated by three fellowship-trained paediatric orthopaedic surgeons. Hip shape and Stulberg class were compared to PROM scores. Results. A spherical hip was associated with the highest function and quality of life, and lowest pain. Conversely, aspherical hips exhibited the lowest functional scores and highest pain. The association between worsening Stulberg class (i.e. greater deviation from sphericity) and worse outcome persisted after adjustment for age and sex in relation to PROMIS (predicted mean difference -1.77 (95% confidence interval (CI) -2.70 to -0.83)), NAHS (-5.68 (95% CI -8.45 to -2.90)), and NRS (0.61 (95% CI 0.14 to 1.08)), but not EQ-5D-5L (-0.03 (95% CI -0.72 to 0.11)). Conclusion. Patient-reported outcomes identify lower function, quality of life, and higher pain in aspherical hips. The magnitude of symptoms deteriorated with time. Hip sphericity (i.e. the modified three-group classification of spherical, oval, and aspherical) appeared to offer similar levels of detail to the five-group Stulberg classification. Cite this article: Bone Joint J 2023;105-B(6):711–716


Bone & Joint Open
Vol. 4, Issue 12 | Pages 957 - 963
18 Dec 2023
van den Heuvel S Penning D Sanders F van Veen R Sosef N van Dijkman B Schepers T

Aims. The primary aim of this study was to present the mid-term follow-up of a multicentre randomized controlled trial (RCT) which compared the functional outcome following routine removal (RR) to the outcome following on-demand removal (ODR) of the syndesmotic screw (SS). Methods. All patients included in the ‘ROutine vs on DEmand removal Of the syndesmotic screw’ (RODEO) trial received the Olerud-Molander Ankle Score (OMAS), American Orthopaedic Foot and Ankle Hindfoot Score (AOFAS), Foot and Ankle Outcome Score (FAOS), and EuroQol five-dimension questionnaire (EQ-5D). Out of the 152 patients, 109 (71.7%) completed the mid-term follow-up questionnaire and were included in this study (53 treated with RR and 56 with ODR). Median follow-up was 50 months (interquartile range 43.0 to 56.0) since the initial surgical treatment of the acute syndesmotic injury. The primary outcome of this study consisted of the OMAS scores of the two groups. Results. The median OMAS score was 85.0 for patients treated with RR, and 90.0 for patients treated with ODR (p = 0.384), indicating no significant difference between ODR and RR. The secondary outcome measures included the AOFAS (88.0 in the RR group and 90.0 for ODR; p = 0.722), FAOS (87.5 in the RR group and 92.9 for ODR; p = 0.399), and EQ-5D (0.87 in the RR group and 0.96 for ODR; p = 0.092). Conclusion. This study demonstrated no functional difference between ODR and RR in syndesmotic injuries at four-year follow-up, which supports the results of the primary RODEO trial. ODR should be the standard practice after syndesmotic screw fixation. Cite this article: Bone Jt Open 2023;4(12):957–963


Bone & Joint Open
Vol. 5, Issue 3 | Pages 202 - 209
11 Mar 2024
Lewin AM Cashman K Harries D Ackerman IN Naylor JM Harris IA

Aims. The aim of this study was to describe and compare joint-specific and generic health-related quality of life outcomes of the first versus second knee in patients undergoing staged bilateral total knee arthroplasty (BTKA) for osteoarthritis. Methods. This retrospective cohort study used Australian national arthroplasty registry data from January 2013 to January 2021 to identify participants who underwent elective staged BTKA with six to 24 months between procedures. The primary outcome was Oxford Knee Score (OKS) at six months postoperatively for the first TKA compared to the second TKA, adjusted for age and sex. Secondary outcomes compared six-month EuroQol five-dimension five-level (EQ-5D-5L) domain scores, EQ-5D index scores, and the EQ visual analogue scale (EQ-VAS) between knees at six months postoperatively. Results. The cohort included 635 participants (1,270 primary procedures). Preoperative scores were worse in the first knee compared to the second for all instruments; however, comparing the first knee at six months postoperatively with the second knee at six months postoperatively, the mean between-knee difference was minimal for OKS (-0.8 points; 95% confidence interval (CI) -1.4 to -0.2), EQ-VAS (3.3; 95% CI 1.9 to 4.7), and EQ-5D index (0.09 points; 95% CI 0.07 to 0.12). Outcomes for the EQ-5D-5L domains ‘mobility’, ‘usual activities’, and ‘pain/discomfort’ were better following the second TKA. Conclusion. At six months postoperatively, there were no clinically meaningful differences between the first and second TKA in either the joint-specific or overall generic health-related quality of life outcomes. However, individual domain scores assessing mobility, pain, and usual activities were notably higher after the second TKA, likely reflecting the cumulative improvement in quality of life after both knees have been replaced. Cite this article: Bone Jt Open 2024;5(3):202–209