
Aims. The aim of this study was to compare the preinjury functional scores with the postinjury preoperative score and postoperative outcome scores following anterior cruciate ligament (ACL) reconstruction surgery (ACLR). Methods. We performed a prospective study on patients who underwent primary ACLR by a single surgeon at a single centre between October 2010 and January 2018. Preoperative preinjury scores were collected at time of first assessment after the index injury. Preoperative (pre- and post-injury), one-year, and two-year postoperative functional outcomes were assessed by using the Knee injury and Osteoarthritis Outcome Score (KOOS), Lysholm Knee Score, and Tegner Activity Scale. Results. We enrolled 308 males and 263 females of mean age 27 years (19 to 46). The mean preinjury and preoperative post-injury Lysholm Knee Scores were 94 (73 to 100) and 63 (25 to 85), respectively, while the respective mean scores at one and two years postoperatively were 84 (71 to 100) and 89 (71 to 100; p < 0.001). The mean Tegner preinjury and preoperative post-injury scores were 7 (3 to 9) and 3 (0 to 6), respectively, while the respective mean scores at one and two years postoperatively were 6 (1 to 8) and 6 (1 to 9) (p < 0.001). The mean KOOS scores at preinjury versus two years postoperatively were: symptoms (96 vs 84); pain (94 vs 87); activities of daily living (97 vs 91), sports and recreation function (84 vs 71), and quality of life (82 vs 69), respectively (p < 0.001). Conclusion. Functional scores improved following ACLR surgery at two years in comparison to preoperative post-injury scores. However, at two-year follow-up, the majority of patients failed to achieve their preinjury scores. The evaluation of ACLR outcomes needs to consider the preinjury scores rather than the immediate preoperative score that is usually collected. Cite this article: Bone Jt Open 2023;4(1):46–52


Bone & Joint Open
Vol. 4, Issue 3 | Pages 129 - 137
1 Mar 2023
Patel A Edwards TC Jones G Liddle AD Cobb J Garner A

Aims. The metabolic equivalent of task (MET) score examines patient performance in relation to energy expenditure before and after knee arthroplasty. This study assesses its use in a knee arthroplasty population in comparison with the widely used Oxford Knee Score (OKS) and EuroQol five-dimension index (EQ-5D), which are reported to be limited by ceiling effects. Methods. A total of 116 patients with OKS, EQ-5D, and MET scores before, and at least six months following, unilateral primary knee arthroplasty were identified from a database. Procedures were performed by a single surgeon between 2014 and 2019 consecutively. Scores were analyzed for normality, skewness, kurtosis, and the presence of ceiling/floor effects. Concurrent validity between the MET score, OKS, and EQ-5D was assessed using Spearman’s rank. Results. Postoperatively the OKS and EQ-5D demonstrated negative skews in distribution, with high kurtosis at six months and one year. The OKS demonstrated a ceiling effect at one year (15.7%) postoperatively. The EQ-5D demonstrated a ceiling effect at six months (30.2%) and one year (39.8%) postoperatively. The MET score did not demonstrate a skewed distribution or ceiling effect either at six months or one year postoperatively. Weak-moderate correlations were noted between the MET score and conventional scores at six months and one year postoperatively. Conclusion. In contrast to the OKS and EQ-5D, the MET score was normally distributed postoperatively with no ceiling effect. It is worth consideration as an arthroplasty outcome measure, particularly for patients with high expectations. Cite this article: Bone Jt Open 2023;4(3):129–137
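The ceiling-effect figures quoted above can be read as the percentage of respondents who achieve the best possible score. The following is a minimal sketch of that calculation, not the authors' analysis code. The function name, the example scores, and the 15% threshold (a commonly cited convention that the abstract does not state) are assumptions made for illustration.

```python
from typing import Dict, Sequence


def floor_ceiling_effects(
    scores: Sequence[float], min_score: float, max_score: float
) -> Dict[str, float]:
    """Return the percentage of respondents at the worst and best possible score."""
    n = len(scores)
    if n == 0:
        raise ValueError("scores must not be empty")
    floor_pct = 100.0 * sum(1 for s in scores if s == min_score) / n
    ceiling_pct = 100.0 * sum(1 for s in scores if s == max_score) / n
    return {"floor_pct": floor_pct, "ceiling_pct": ceiling_pct}


# Hypothetical Oxford Knee Scores (0 = worst, 48 = best); not data from the study.
example_scores = [48, 48, 45, 40, 39, 36, 30, 48, 22, 41]
effects = floor_ceiling_effects(example_scores, min_score=0, max_score=48)

# Assumed convention: flag an effect when more than 15% of respondents
# sit at the extreme of the scale.
has_ceiling_effect = effects["ceiling_pct"] > 15.0
```

In this made-up sample, three of ten respondents score 48/48, so the scale cannot distinguish between them, which is the limitation the abstract attributes to the OKS and EQ-5D.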


Bone & Joint Research
Vol. 11, Issue 5 | Pages 317 - 326
23 May 2022
Edwards TC Guest B Garner A Logishetty K Liddle AD Cobb JP

Aims. This study investigates the use of the metabolic equivalent of task (MET) score in a young hip arthroplasty population, and its ability to capture additional benefit beyond the ceiling effect of conventional patient-reported outcome measures. Methods. From our electronic database of 751 hip arthroplasty procedures, 221 patients were included. Patients were excluded if they had revision surgery, an alternative hip procedure, or incomplete data either preoperatively or at one-year follow-up. Included patients had a mean age of 59.4 years (SD 11.3) and 54.3% were male, incorporating 117 primary total hip and 104 hip resurfacing arthroplasty operations. Oxford Hip Score (OHS), EuroQol five-dimension questionnaire (EQ-5D), and the MET were recorded preoperatively and at one-year follow-up. The distribution was examined reporting the presence of ceiling and floor effects. Validity was assessed correlating the MET with the other scores using Spearman’s rank correlation coefficient and determining responsiveness. A subgroup of 93 patients scoring 48/48 on the OHS were analyzed by age, sex, BMI, and preoperative MET using the other metrics to determine if differences could be established despite scoring identically on the OHS. Results. Postoperatively the OHS and EQ-5D demonstrate considerable negatively skewed distributions with ceiling effects of 41.6% and 53.8%, respectively. The MET was normally distributed postoperatively with no relevant ceiling effect. Weak-to-moderate significant correlations were found between the MET and the other two metrics. In the 48/48 subgroup, no differences were found comparing groups with the EQ-5D; however, significantly higher mean MET scores were demonstrated for patients aged < 60 years (12.7 (SD 4.7) vs 10.6 (SD 2.4), p = 0.008), male patients (12.5 (SD 4.5) vs 10.8 (SD 2.8), p = 0.024), and those with preoperative MET scores > 6 (12.6 (SD 4.2) vs 11.0 (SD 3.3), p = 0.040). Conclusion. The MET is normally distributed in patients following hip arthroplasty, recording levels of activity which are undetectable using the OHS. Cite this article: Bone Joint Res 2022;11(5):317–326


Bone & Joint Open
Vol. 3, Issue 10 | Pages 786 - 794
12 Oct 2022
Harrison CJ Plummer OR Dawson J Jenkinson C Hunt A Rodrigues JN

Aims. The aim of this study was to develop and evaluate machine-learning-based computerized adaptive tests (CATs) for the Oxford Hip Score (OHS), Oxford Knee Score (OKS), Oxford Shoulder Score (OSS), and the Oxford Elbow Score (OES) and its subscales. Methods. We developed CAT algorithms for the OHS, OKS, OSS, overall OES, and each of the OES subscales, using responses to the full-length questionnaires and a machine-learning technique called regression tree learning. The algorithms were evaluated through a series of simulation studies, in which they aimed to predict respondents’ full-length questionnaire scores from only a selection of their item responses. In each case, the total number of items used by the CAT algorithm was recorded and CAT scores were compared to full-length questionnaire scores by mean, SD, score distribution plots, Pearson’s correlation coefficient, intraclass correlation (ICC), and the Bland-Altman method. Differences between CAT scores and full-length questionnaire scores were contextualized through comparison to the instruments’ minimal clinically important difference (MCID). Results. The CAT algorithms accurately estimated 12-item questionnaire scores from between four and nine items. Scores followed a very similar distribution between CAT and full-length assessments, with the mean score difference ranging from 0.03 to 0.26 out of 48 points. Pearson’s correlation coefficient and ICC were 0.98 for each 12-item scale and 0.95 or higher for the OES subscales. In over 95% of cases, a patient’s CAT score was within five points of the full-length questionnaire score for each 12-item questionnaire. Conclusion. Oxford Hip Score, Oxford Knee Score, Oxford Shoulder Score, and Oxford Elbow Score (including separate subscale scores) CATs all markedly reduce the burden of items to be completed without sacrificing score accuracy. Cite this article: Bone Jt Open 2022;3(10):786–794


Bone & Joint Open
Vol. 3, Issue 4 | Pages 307 - 313
7 Apr 2022
Singh V Bieganowski T Huang S Karia R Davidovitch RI Schwarzkopf R

Aims. The Forgotten Joint Score-12 (FJS-12) is a validated patient-reported outcome measure (PROM) tool designed to assess artificial prosthesis awareness during daily activities following total hip arthroplasty (THA). The patient-acceptable symptom state (PASS) is the minimum cut-off value that corresponds to a patient’s satisfactory state-of-health. Despite the validity and reliability of the FJS-12 having been previously demonstrated, the PASS has yet to be clearly defined. This study aims to define the PASS of the FJS-12 following primary THA. Methods. We retrospectively reviewed all patients who underwent primary elective THA from 2019 to 2020, and answered both the FJS-12 and the Hip Disability and Osteoarthritis Outcome Score, Joint Replacement (HOOS, JR) questionnaires one-year postoperatively. HOOS, JR score was used as the anchor to estimate the PASS of FJS-12. Two statistical methods were employed: the receiver operating characteristic (ROC) curve point, which maximized the Youden index; and 75th percentile of the cumulative percentage curve of patients who had the HOOS, JR score difference larger than the cut-off value. Results. This study included 780 patients. The mean one-year FJS-12 score was 65.42 (SD 28.59). The mean one-year HOOS, JR score was 82.70 (SD 16.57). A high positive correlation between FJS-12 and HOOS, JR was found (r = 0.74; p<0.001), making the HOOS, JR a valid external anchor. The threshold score of the FJS-12 that maximized the sensitivity and specificity for detecting a PASS was 66.68 (area under the curve = 0.8). The cut-off score value computed with the 75th percentile approach was 92.20. Conclusion. The PASS threshold for the FJS-12 at one year following primary THA was 66.68 and 92.20 using the ROC curve and 75th percentile approaches, respectively. These values can be used to achieve consensus about meaningful postoperative improvement to maximize the utility of the FJS-12 to evaluate and counsel patients undergoing THA. 
Cite this article: Bone Jt Open 2022;3(4):307–313
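The ROC-based approach described above selects the cut-off that maximizes the Youden index, J = sensitivity + specificity − 1. The sketch below illustrates that selection in plain Python. It is not the study's code: the function name, the example scores, and the way the anchor is reduced to a satisfied/not-satisfied label are assumptions, since the abstract does not say how the HOOS, JR anchor was dichotomized.

```python
from typing import Sequence, Tuple


def youden_cutoff(
    scores: Sequence[float], satisfied: Sequence[bool]
) -> Tuple[float, float]:
    """Return (cut-off, J) maximizing J = sensitivity + specificity - 1.

    A patient is classified as 'satisfied' when score >= cut-off.
    Ties are resolved in favour of the lowest cut-off.
    """
    positives = sum(1 for y in satisfied if y)
    negatives = len(satisfied) - positives
    if positives == 0 or negatives == 0:
        raise ValueError("need both satisfied and unsatisfied patients")

    best_cutoff, best_j = float("nan"), -1.0
    for cutoff in sorted(set(scores)):
        true_pos = sum(1 for s, y in zip(scores, satisfied) if y and s >= cutoff)
        true_neg = sum(1 for s, y in zip(scores, satisfied) if not y and s < cutoff)
        j = true_pos / positives + true_neg / negatives - 1.0
        if j > best_j:
            best_cutoff, best_j = cutoff, j
    return best_cutoff, best_j


# Hypothetical FJS-12 scores with anchor-derived labels; not data from the study.
fjs = [20, 35, 50, 60, 70, 75, 85, 95]
ok = [False, False, False, True, False, True, True, True]
cutoff, j = youden_cutoff(fjs, ok)
```

Because this criterion balances sensitivity against specificity while the 75th percentile approach does not, the two methods can produce very different thresholds, as the 66.68 versus 92.20 result in the abstract shows.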


Bone & Joint Open
Vol. 5, Issue 11 | Pages 962 - 970
4 Nov 2024
Suter C Mattila H Ibounig T Sumrein BO Launonen A Järvinen TLN Lähdeoja T Rämö L

Aims. Though most humeral shaft fractures heal nonoperatively, up to one-third may lead to nonunion with inferior outcomes. The Radiographic Union Score for HUmeral Fractures (RUSHU) was created to identify high-risk patients for nonunion. Our study evaluated the RUSHU’s prognostic performance at six and 12 weeks in discriminating nonunion within a significantly larger cohort than before. Methods. Our study included 226 nonoperatively treated humeral shaft fractures. We evaluated the interobserver reliability and intraobserver reproducibility of RUSHU scoring using intraclass correlation coefficients (ICCs). Additionally, we determined the optimal cut-off thresholds for predicting nonunion using the receiver operating characteristic (ROC) method. Results. The RUSHU demonstrated good interobserver reliability with an ICC of 0.78 (95% CI 0.72 to 0.83) at six weeks and 0.77 (95% CI 0.71 to 0.82) at 12 weeks. Intraobserver reproducibility was good or excellent for all analyses. Area under the curve in the ROC analysis was 0.83 (95% CI 0.77 to 0.88) at six weeks and 0.89 (95% CI 0.84 to 0.93) at 12 weeks, indicating excellent discrimination. The optimal cut-off values for predicting nonunion were ≤ eight points at six weeks and ≤ nine points at 12 weeks, providing the best specificity-sensitivity trade-off. Conclusion. The RUSHU proves to be a reliable and reproducible radiological scoring system that aids in identifying patients at risk of nonunion at both six and 12 weeks post-injury during non-surgical treatment of humeral shaft fractures. The statistically optimal cut-off values for predicting nonunion are ≤ eight at six weeks and ≤ nine points at 12 weeks post-injury


Bone & Joint Research
Vol. 5, Issue 4 | Pages 116 - 121
1 Apr 2016
Leow JM Clement ND Tawonsawatruk T Simpson CJ Simpson AHRW

Objectives. The radiographic union score for tibial (RUST) fractures was developed by Whelan et al to assess the healing of tibial fractures following intramedullary nailing. In the current study, the repeatability and reliability of the RUST score was evaluated in an independent centre (a) using the original description, (b) after further interpretation of the description of the score, and (c) with the immediate post-operative radiograph available for comparison. Methods. A total of 15 radiographs of tibial shaft fractures treated by intramedullary nailing (IM) were scored by three observers using the RUST system. Following discussion on how the criteria of the RUST system should be implemented, 45 sets (i.e. AP and lateral) of radiographs of IM nailed tibial fractures were scored by five observers. Finally, these 45 sets of radiographs were rescored with the baseline post-operative radiograph available for comparison. Results. The initial intraclass correlation (ICC) on the first 15 sets of radiographs was 0.67 (95% CI 0.63 to 0.71). However, the original description was being interpreted in different ways. After agreeing on the interpretation, the ICC on the second cohort improved to 0.75. The ICC improved even further to 0.79, when the baseline post-operative radiographs were available for comparison. Conclusion. This study demonstrates that the RUST scoring system is a reliable and repeatable outcome measure for assessing tibial fracture healing. Further improvement in the reliability of the scoring system can be obtained if the radiographs are compared with the baseline post-operative radiographs. Cite this article: Mr J.M. Leow. The radiographic union scale in tibial (RUST) fractures: Reliability of the outcome measure at an independent centre. Bone Joint Res 2016;5:116–121. DOI: 10.1302/2046-3758.54.2000628


Bone & Joint Open
Vol. 4, Issue 3 | Pages 138 - 145
1 Mar 2023
Clark JO Razii N Lee SWJ Grant SJ Davison MJ Bailey O

Aims. The COVID-19 pandemic has caused unprecedented disruption to elective orthopaedic services. The primary objective of this study was to examine changes in functional scores in patients awaiting total hip arthroplasty (THA), total knee arthroplasty (TKA), and unicompartmental knee arthroplasty (UKA). Secondary objectives were to investigate differences between these groups and identify those in a health state ‘worse than death’ (WTD). Methods. In this prospective cohort study, preoperative Oxford hip and knee scores (OHS/OKS) were recorded for patients added to a waiting list for THA, TKA, or UKA, during the initial eight months of the COVID-19 pandemic, and repeated at 14 months into the pandemic (mean interval nine months (SD 2.84)). EuroQoL five-dimension five-level health questionnaire (EQ-5D-5L) index scores were also calculated at this point in time, with a negative score representing a state WTD. OHS/OKS were analyzed over time and in relation to the EQ-5D-5L. Results. A total of 174 patients (58 THA, 74 TKA, 42 UKA) were eligible, after 27 were excluded (one died, seven underwent surgery, 19 non-responders). The overall mean OHS/OKS deteriorated from 15.43 (SD 6.92), when patients were added to the waiting list, to 11.77 (SD 6.45) during the pandemic (p < 0.001). There were significantly worse EQ-5D-5L index scores in the THA group (p = 0.005), with 22 of these patients (38%) in a health state WTD, than either the TKA group (20 patients; 27% WTD), or the UKA group (nine patients; 21% WTD). A strong positive correlation between the EQ-5D-5L index score and OHS/OKS was observed (r = 0.818; p < 0.001). Receiver operating characteristic analysis revealed that an OHS/OKS lower than nine predicted a health state WTD (88% sensitivity and 73% specificity). Conclusion. OHS/OKS deteriorated significantly among patients awaiting lower limb arthroplasty during the COVID-19 pandemic. Overall, 51 patients were in a health state WTD, representing 29% of our entire cohort, which is considerably worse than existing pre-pandemic data. Cite this article: Bone Jt Open 2023;4(3):138–145


Bone & Joint Open
Vol. 3, Issue 7 | Pages 573 - 581
1 Jul 2022
Clement ND Afzal I Peacock CJH MacDonald D Macpherson GJ Patton JT Asopa V Sochart DH Kader DF

Aims. The aims of this study were to assess mapping models to predict the three-level version of EuroQoL five-dimension utility index (EQ-5D-3L) from the Oxford Knee Score (OKS) and validate these before and after total knee arthroplasty (TKA). Methods. A retrospective cohort of 5,857 patients was used to create the prediction models, and a second cohort of 721 patients from a different centre was used to validate the models, all of whom underwent TKA. Patient characteristics, BMI, OKS, and EQ-5D-3L were collected preoperatively and one year postoperatively. Generalized linear regression was used to formulate the prediction models. Results. There were significant correlations between the OKS and EQ-5D-3L preoperatively (r = 0.68; p < 0.001) and postoperatively (r = 0.77; p < 0.001) and for the change in the scores (r = 0.61; p < 0.001). Three different models (preoperative, postoperative, and change) were created. There were no significant differences between the actual and predicted mean EQ-5D-3L utilities at any timepoint or for change in the scores (p > 0.090) in the validation cohort. There was a significant correlation between the actual and predicted EQ-5D-3L utilities preoperatively (r = 0.63; p < 0.001) and postoperatively (r = 0.77; p < 0.001) and for the change in the scores (r = 0.56; p < 0.001). Bland-Altman plots demonstrated that a lower utility was overestimated, and higher utility was underestimated. The individual predicted EQ-5D-3L that was within ± 0.05 and ± 0.010 (minimal clinically important difference (MCID)) of the actual EQ-5D-3L varied between 13% to 35% and 26% to 64%, respectively, according to timepoint assessed and change in the scores, but was not significantly different between the modelling and validation cohorts (p ≥ 0.148). Conclusion. The OKS can be used to estimate EQ-5D-3L. Predicted individual patient utility error beyond the MCID varied from one-third to two-thirds depending on timepoint assessed, but the mean for a cohort did not differ and could be employed for this purpose. Cite this article: Bone Jt Open 2022;3(7):573–581
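The agreement statistics in this abstract rest on two simple quantities: the Bland-Altman bias with its 95% limits of agreement, and the share of individual predictions that fall within a tolerance of the actual value. The sketch below shows both under stated assumptions. The function names and the example utilities are hypothetical, and the limits use the conventional bias ± 1.96 × SD of the differences, which the abstract does not spell out.

```python
from statistics import mean, stdev
from typing import Sequence, Tuple


def bland_altman(
    actual: Sequence[float], predicted: Sequence[float]
) -> Tuple[float, float, float]:
    """Return (bias, lower limit, upper limit) of agreement for predicted - actual."""
    if len(actual) != len(predicted) or len(actual) < 2:
        raise ValueError("need two equally sized samples with at least two values")
    diffs = [p - a for a, p in zip(actual, predicted)]
    bias = mean(diffs)
    spread = 1.96 * stdev(diffs)  # conventional 95% limits, assumed here
    return bias, bias - spread, bias + spread


def share_within(
    actual: Sequence[float], predicted: Sequence[float], tolerance: float
) -> float:
    """Percentage of individual predictions within +/- tolerance of the actual value."""
    hits = sum(1 for a, p in zip(actual, predicted) if abs(p - a) <= tolerance)
    return 100.0 * hits / len(actual)


# Hypothetical EQ-5D-3L utilities; not data from the study. Low utilities are
# overestimated and high utilities underestimated, mirroring the reported pattern.
actual_utility = [0.20, 0.50, 0.70, 0.90]
predicted_utility = [0.35, 0.53, 0.68, 0.80]

bias, lower, upper = bland_altman(actual_utility, predicted_utility)
within_005 = share_within(actual_utility, predicted_utility, tolerance=0.05)
```

The example has a small mean bias while half of the individual predictions miss by more than 0.05, which mirrors the abstract's finding that cohort means agree even though individual estimates often do not.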


Bone & Joint Open
Vol. 2, Issue 9 | Pages 765 - 772
14 Sep 2021
Silitonga J Djaja YP Dilogo IH Pontoh LAP

Aims. The aim of this study was to perform a cross-cultural adaptation of Oxford Hip Score (OHS) to Indonesian, and to evaluate its psychometric properties. Methods. We performed a cross-cultural adaptation of Oxford Hip Score into the Indonesian language (OHS-ID) and determined its internal consistency, test-retest reliability, measurement error, floor-ceiling effect, responsiveness, and construct validity by hypotheses testing of its correlation with Harris Hip Score (HHS), visual analogue scale (VAS), and Short Form-36 (SF-36). Adults (> 17 years old) with chronic hip pain (osteoarthritis or osteonecrosis) were included. Results. A total of 125 patients were included, including 50 total hip arthroplasty (THA) patients with six months follow-up. The OHS questionnaire was translated into Indonesian and showed good internal consistency (Cronbach’s alpha = 0.89) and good reliability (intraclass correlation = 0.98). The standard error of measurement value of 2.11 resulted in a minimal detectable change score of 5.8. Ten out of ten (100%) a priori hypotheses were met, confirming the construct validity. A strong correlation was found with two subscales of SF-36 (pain and physical function), HHS (0.94), and VAS (-0.83). OHS-ID also showed good responsiveness for post-THA series. No floor or ceiling effect was found. Conclusion. The Indonesian version of OHS showed similar reliability and validity to the original OHS. This questionnaire will be suitable to assess chronic hip pain in Indonesian-speaking patients. Cite this article: Bone Jt Open 2021;2(9):765–772
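The step from a standard error of measurement of 2.11 to a minimal detectable change of 5.8 is consistent with the widely used formula MDC95 = 1.96 × √2 × SEM. The abstract does not state which formula was applied, so the sketch below is an assumption that happens to reproduce the reported figure. The function name is hypothetical.

```python
import math


def minimal_detectable_change(sem: float, z: float = 1.96) -> float:
    """MDC = z * sqrt(2) * SEM.

    The sqrt(2) term reflects measurement error in both the baseline
    and the follow-up score; z = 1.96 gives the 95% confidence level.
    """
    return z * math.sqrt(2.0) * sem


# SEM of 2.11 is the value reported in the abstract.
mdc95 = minimal_detectable_change(2.11)
mdc95_rounded = round(mdc95, 1)
```

Under this formula, a change in an individual patient's OHS-ID of less than about six points cannot be distinguished from measurement error.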


Bone & Joint Open
Vol. 1, Issue 2 | Pages 3 - 7
5 Feb 2020
Widnall J Capstick T Wijesekera M Messahel S Perry DC

Aims. This study sought to estimate the clinical outcomes and describe the nationwide variation in practice, as part of the feasibility workup for a National Institute for Health and Care Excellence (NICE) recommended randomized clinical trial to determine the optimal treatment of torus fractures of the distal radius in children. Methods. Prospective data collection on torus fractures presenting to our emergency department. Patient consent and study information, including a copy of the Wong-Baker Faces pain score, was issued at the first patient contact. An automated text message service recorded pain scores at days 0, 3, 7, 21, and 42 postinjury. A cross-sectional survey of current accident and emergency practice in the UK was also undertaken to gauge current practice following the publication of NICE guidance. Results. In all, 30 patients with a mean age of 8.9 years were enrolled over a six-week period. Of the 150 potential data points, data was captured in 146, making the data 97.3% complete. Pain scores were recorded at day 0 (mean 6.5 (95% confidence interval (CI) 5.7 to 7.3)), day 3 (4.4 (95% CI 3.5 to 5.2)), day 7 (3.0 (95% CI 2.3 to 3.6)), day 21 (1.2 (95% CI 0.7 to 1.7)) and day 42 (0.4 (95% CI 0.1 to 0.7)). Of the 100 units who participated in the nationwide survey, 38% were unaware of any local or national protocols regarding torus fractures, 41% treated torus fractures with cast immobilization, and over 60% of patients had follow-up arranged, both contradictory to national guidelines. Conclusion. We have demonstrated the severity, recovery trajectory, and variation in pain scores among children with torus fractures. We demonstrate excellent follow-up of patient outcomes using text messages. Despite national guidelines, there is significant variation in practice. This data directly informed the development of an ongoing nationwide randomized clinical trial – the FORearm Fracture Recovery in Children Evaluation (FORCE) study


Bone & Joint Open
Vol. 2, Issue 12 | Pages 1089 - 1095
21 Dec 2021
Luo W Ali MS Limb R Cornforth C Perry DC

Aims. The Patient-Reported Outcomes Measurement Information System (PROMIS) has demonstrated faster administration, lower burden of data capture and reduced floor and ceiling effects compared to traditional Patient Reported Outcomes Measurements (PROMs). We investigated the suitability of PROMIS Mobility score in assessing physical function in the sequelae of childhood hip disease. Methods. In all, 266 adolescents (aged ≥ 12 years) and adults were identified with a prior diagnosis of childhood hip disease (either Perthes’ disease (n = 232 (87.2%)) or Slipped Capital Femoral Epiphysis (n = 34 (12.8%))) with a mean age of 27.73 years (SD 12.24). Participants completed the PROMIS Mobility Computer Adaptive Test, the Non-Arthritic Hip Score (NAHS), EuroQol five-dimension five-level questionnaire, and the Numeric Pain Rating Scale. We investigated the correlation between the PROMIS Mobility and other tools to assess use in this population and any clustering of outcome scores. Results. There was a strong correlation between the PROMIS Mobility and other established PROMs; NAHS (rs = 0.79; p < 0.001). There was notable clustering in PROMIS at the upper end of the distribution score (42.5%), with less seen in the NAHS (20.3%). However, the clustering was broadly similar between PROMIS Mobility and the comparable domains of the NAHS; function (53.6%), and activity (35.0%). Conclusion. PROMIS Mobility strongly correlated with other tools demonstrating convergent construct validity. There was clustering of physical function scores at the upper end of the distributions, which may reflect truncation of the data caused by participants having excellent outcomes. There were elements of disease not captured within PROMIS Mobility alone, and difficulties in differentiating those with the highest levels of function. Cite this article: Bone Jt Open 2021;2(12):1089–1095


Aims. The purpose of this study was to assess the reliability and responsiveness to hip surgery of a four-point modified Care and Comfort Hypertonicity Questionnaire (mCCHQ) scoring tool in children with cerebral palsy (CP) in Gross Motor Function Classification System (GMFCS) levels IV and V. Methods. This was a population-based cohort study in children with CP from a national surveillance programme. Reliability was assessed from 20 caregivers who completed the mCCHQ questionnaire on two occasions three weeks apart. Test-retest reliability of the mCCHQ was calculated, and responsiveness before and after surgery for a displaced hip was evaluated in a cohort of children. Results. Test-retest reliability for the overall mCCHQ score was good (intraclass correlation coefficient 0.78), and no dimension demonstrated poor reliability. The surgical intervention cohort comprised ten children who had preoperative and postoperative mCCHQ scores at a minimum of six months postoperatively. The mCCHQ tool demonstrated a significant improvement in overall score from preoperative assessment to six-month postoperative follow-up assessment (p < 0.001). Conclusion. The mCCHQ demonstrated responsiveness to intervention and good test-retest reliability. The mCCHQ is proposed as an outcome tool for use within a national surveillance programme for children with CP. Cite this article: Bone Jt Open 2023;4(8):580–583


Bone & Joint Research
Vol. 3, Issue 11 | Pages 305 - 309
1 Nov 2014
Harris KK Price AJ Beard DJ Fitzpatrick R Jenkinson C Dawson J

Objective. The objective of this study was to explore dimensionality of the Oxford Hip Score (OHS) and examine whether self-reported pain and functioning can be distinguished in the form of subscales. Methods. This was a secondary data analysis of the UK NHS hospital episode statistics/patient-reported outcome measures dataset containing pre-operative OHS scores on 97 487 patients who were undergoing hip replacement surgery. Results. The proposed number of factors to extract depended on the method of extraction employed. Velicer’s Minimum Average Partial test and the Parallel Analysis suggested one factor, the Cattell’s scree test and Kaiser-over-1 rule suggested two factors. Exploratory factor analysis demonstrated that the two-factor OHS had most of the items saliently loading either of the two factors. These factors were named ‘Pain’ and ‘Function’ and their respective subscales were created. There was some cross-loading of items: 8 (pain on standing up from a chair) and 11 (pain during work). These items were assigned to the ‘Pain’ subscale. The final ‘Pain’ subscale consisted of items 1, 8, 9, 10, 11 and 12. The ‘Function’ subscale consisted of items 2, 3, 4, 5, 6 and 7, with the recommended scoring of the subscales being from 0 (worst) to 100 (best). Cronbach’s alpha was 0.855 for the ‘Pain’ subscale and 0.861 for the ‘Function’ subscale. A confirmatory factor analysis demonstrated that the two-factor model of the OHS had a better fit. However, none of the one-factor or two-factor models was rejected. Conclusion. Factor analyses demonstrated that, in addition to current usage as a single summary scale, separate information on pain and self-reported function can be extracted from the OHS in a meaningful way in the form of subscales. Cite this article: Bone Joint Res 2014;3:305–9


Bone & Joint Research
Vol. 1, Issue 9 | Pages 225 - 233
1 Sep 2012
Paulsen A Odgaard A Overgaard S

Objectives. The Oxford hip score (OHS) is a 12-item questionnaire designed and developed to assess function and pain from the perspective of patients who are undergoing total hip replacement (THR). The OHS has been shown to be consistent, reliable, valid and sensitive to clinical change following THR. It has been translated into different languages, but no adequately translated, adapted and validated Danish language version exists. Methods. The OHS was translated and cross-culturally adapted into Danish from the original English version, using methods based on best-practice guidelines. The translation was tested for psychometric quality in patients drawn from a cohort from the Danish Hip Arthroplasty Register (DHR). Results. The Danish OHS had a response rate of 87.4%, no floor effect and a 19.9% ceiling effect (as expected in post-operative patients). Only 1.2% of patients had too many items missing to calculate a sum score. Construct validity was adequate and 80% of our predefined hypotheses regarding the correlation between scores on the Danish OHS and the other questionnaires were confirmed. The intraclass correlation (ICC) of the different items ranged from 0.80 to 0.95 and the average limits of agreement (LOA) ranged from -0.05 to 0.06. The Danish OHS had a high internal consistency with a Cronbach’s alpha of 0.99 and an average inter-item correlation of 0.88. Conclusions. This Danish version of the OHS is a valid and reliable patient-reported outcome measurement instrument (PROM) with similar qualities to the original English language version.


Bone & Joint Research
Vol. 4, Issue 8 | Pages 137 - 144
1 Aug 2015
Hamilton DF Giesinger JM Patton JT MacDonald DJ Simpson AHRW Howie CR Giesinger K

Objectives. The Oxford Hip and Knee Scores (OHS, OKS) have been demonstrated to vary according to age and gender, making it difficult to compare results in cohorts with different demographics. The aim of this paper was to calculate reference values for different patient groups and highlight the concept of normative reference data to contextualise an individual’s outcome. Methods. We accessed prospectively collected OHS and OKS data for patients undergoing lower limb joint arthroplasty at a single orthopaedic teaching hospital during a five-year period. T-scores were calculated based on the OHS and OKS distributions. Results. Data were obtained from 3203 total hip arthroplasty (THA) patients and 2742 total knee arthroplasty (TKA) patients. The mean age of the patients was 68.0 years (SD 11.3; 58.4% women) in the THA group and 70.2 years (SD 9.4; 57.5% women) in the TKA group. T-scores were calculated for age and gender subgroups by operation. Different T-score thresholds are seen at different time points pre and post surgery. Values are further stratified by operation (THA/TKA), age, and gender. Conclusions. Normative data interpretation requires a fundamental shift in the thinking as to the use of the Oxford Scores. Instead of reporting actual score points, the patient is rated by their relative position within the group of all patients undergoing the same procedure. It is proposed that this form of transformation is beneficial (a) for more appropriately comparing different patient cohorts and (b) informing an individual patient how they are progressing compared with others of their age and gender. Cite this article: Bone Joint Res 2015;4:137–144
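A T-score expresses a raw score relative to a reference group on a scale with a mean of 50 and a standard deviation of 10. The sketch below applies that standard linear definition to a reference subgroup. It is an illustration only: the abstract does not describe the exact transformation the authors used, and the function name and reference values are hypothetical.

```python
from statistics import mean, stdev
from typing import List, Sequence


def t_scores(reference: Sequence[float], values: Sequence[float]) -> List[float]:
    """Convert raw scores to T-scores (mean 50, SD 10) against a reference group."""
    if len(reference) < 2:
        raise ValueError("reference group needs at least two scores")
    ref_mean = mean(reference)
    ref_sd = stdev(reference)
    if ref_sd == 0:
        raise ValueError("reference scores have no variation")
    return [50.0 + 10.0 * (v - ref_mean) / ref_sd for v in values]


# Hypothetical postoperative OHS values for one age and gender subgroup;
# not data from the study.
reference_group = [20, 30, 40]
patient_t = t_scores(reference_group, [40])[0]
```

Here a raw score of 40 sits one standard deviation above the subgroup mean, so it maps to a T-score of 60, which places the patient relative to peers who had the same procedure rather than on the raw 0 to 48 scale.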


Bone & Joint Research
Vol. 12, Issue 3 | Pages 155 - 164
1 Mar 2023
McCarty CP Nazif MA Sangiorgio SN Ebramzadeh E Park S

Aims. Taper corrosion has been widely reported to be problematic for modular total hip arthroplasty implants. A simple and systematic method to evaluate taper damage with sufficient resolution is needed. We introduce a semiquantitative grading system for modular femoral tapers to characterize taper corrosion damage. Methods. After examining a unique collection of retrieved cobalt-chromium (CoCr) taper sleeves (n = 465) using the widely-used Goldberg system, we developed an expanded six-point visual grading system intended to characterize the severity, visible material loss, and absence of direct component contact due to corrosion. Female taper sleeve damage was evaluated by three blinded observers using the Goldberg scoring system and the expanded system. A subset (n = 85) was then re-evaluated following destructive cleaning, using both scoring systems. Material loss for this subset was quantified using metrology and correlated with both scoring systems. Results. There was substantial agreement in grading among all three observers with uncleaned (n = 465) and with the subset of cleaned (n = 85) implants. The expanded scoring criteria provided a wider distribution of scores which ultimately correlated well with corrosion material loss. Cleaning changed the average scores marginally using the Goldberg criteria (p = 0.290); however, using the VGS, approximately 40% of the scores for all observers changed, increasing the average score from 4.24 to 4.35 (p = 0.002). There was a strong correlation between measured material loss and new grading scores. Conclusion. The expanded scoring criteria provided a wider distribution of scores which ultimately correlated well with corrosion material loss. This system provides potential advantages for assessing taper damage without requiring specialized imaging devices. Cite this article: Bone Joint Res 2023;12(3):155–164


The Bone & Joint Journal
Vol. 106-B, Issue 4 | Pages 394 - 400
1 Apr 2024
Kjærvik C Gjertsen J Stensland E Dybvik EH Soereide O

Aims. The aims of this study were to assess quality of life after hip fractures, to characterize respondents to patient-reported outcome measures (PROMs), and to describe the recovery trajectory of hip fracture patients. Methods. Data on 35,206 hip fractures (2014 to 2018; 67.2% female) in the Norwegian Hip Fracture Register were linked to data from the Norwegian Patient Registry and Statistics Norway. PROMs data were collected using the EuroQol five-dimension three-level questionnaire (EQ-5D-3L) scoring instrument and living patients were invited to respond at four, 12, and 36 months post fracture. Multiple imputation procedures were performed as a model to substitute missing PROM data. Differences in response rates between categories of covariates were analyzed using chi-squared test statistics. The association between patient and socioeconomic characteristics and the reported EQ-5D-3L scores was analyzed using linear regression. Results. The median age was 83 years (interquartile range 76 to 90), and 3,561 (10%) lived in a healthcare facility. Observed mean pre-fracture EQ-5D-3L index score was 0.81 (95% confidence interval 0.803 to 0.810), which decreased to 0.66 at four months, to 0.70 at 12 months, and to 0.73 at 36 months. In the imputed datasets, the reduction from pre-fracture was similar (0.15 points) but an improvement up to 36 months was modest (0.01 to 0.03 points). Patients with higher age, male sex, severe comorbidity, cognitive impairment, lower income, lower education, and those in residential care facilities had a lower proportion of respondents, and systematically reported a lower health-related quality of life (HRQoL). The response pattern of patients influenced scores significantly, and the highest scores are found in patients reporting scores at all observation times. Conclusion. Hip fracture leads to a persistent reduction in measured HRQoL, up to 36 months. The patients’ health and socioeconomic status were associated with the proportion of patients returning PROM data for analysis, and affected the results reported. Observed EQ-5D-3L scores are affected by attrition and selection bias mechanisms and motivate the use of statistical modelling for adjustment. Cite this article: Bone Joint J 2024;106-B(4):394–400


Bone & Joint Open
Vol. 5, Issue 10 | Pages 904 - 910
18 Oct 2024
Bergman EM Mulligan EP Patel RM Wells J

Aims. The Single Assessment Numerical Evaluation (SANE) score is a pragmatic alternative to longer patient-reported outcome measures (PROMs). The purpose of this study was to investigate the concurrent validity of the SANE and hip-specific PROMs in a generalized population of patients with hip pain at a single timepoint upon initial visit with an orthopaedic surgeon who is a hip preservation specialist. We hypothesized that SANE would have a strong correlation with the 12-question International Hip Outcome Tool (iHOT)-12, the Hip Outcome Score (HOS), and the Hip disability and Osteoarthritis Outcome Score (HOOS), providing evidence for concurrent validity of the SANE and hip-specific outcome measures in patients with hip pain. Methods. This study was a cross-sectional retrospective database analysis at a single timepoint. Data were collected from 2,782 patients at initial evaluation with a hip preservation specialist using the iHOT-12, HOS, HOOS, and SANE. Outcome scores were retrospectively analyzed using Pearson correlation coefficients. Results. Mean raw scores were iHOT-12 67.01 (SD 29.52), HOS 58.42 (SD 26.26), HOOS 86.85 (SD 32.94), and SANE 49.60 (SD 27.92). SANE was moderately correlated with the iHOT-12 (r = -0.4; 95% CI -0.35 to -0.44; p < 0.001), HOS (r = 0.57; 95% CI 0.53 to 0.60; p < 0.001), and HOOS (r = -0.55; 95% CI -0.51 to -0.58; p < 0.001). The iHOT-12 and HOOS were recorded as a lower score, indicating better function, which accounts for the negative r values. Conclusion. This study was the first to investigate the relationship between the SANE and the iHOT-12, HOS, and HOOS in a population of patients with hip pain at the initial evaluation with an orthopaedic surgeon, and found moderate correlation between SANE and the iHOT-12, HOS, and HOOS. The SANE may be a pragmatic alternative for clinical benchmarking in a general population of patients with hip pain. The construct validity of the SANE should be questioned compared to legacy measures whose content validity has been more rigorously investigated. Cite this article: Bone Jt Open 2024;5(10):904–910
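
The abstract above reports Pearson correlation coefficients with 95% confidence intervals but does not say how the intervals were obtained. As an illustration only, the sketch below computes Pearson's r from paired scores and an approximate interval using the Fisher z-transformation. The choice of Fisher's method and the function names `pearson_r` and `fisher_ci` are assumptions made for this example, not details taken from the study.

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient for two equal-length score lists."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squared deviations about the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def fisher_ci(r, n, z_crit=1.959964):
    """Approximate 95% CI for r via the Fisher z-transformation.

    Assumes roughly bivariate normal data; this is an illustrative
    choice, not necessarily the method used in the abstract.
    """
    if n <= 3 or not -1.0 < r < 1.0:
        raise ValueError("requires n > 3 and -1 < r < 1")
    z = math.atanh(r)                 # transform r to the z scale
    se = 1.0 / math.sqrt(n - 3)       # standard error on the z scale
    return math.tanh(z - z_crit * se), math.tanh(z + z_crit * se)

# Example with the reported HOS correlation and the enrolled sample size.
low, high = fisher_ci(0.57, 2782)
print(f"r = 0.57, approximate 95% CI {low:.2f} to {high:.2f}")
```

With r = 0.57 and n = 2,782, this approach gives an interval of roughly 0.54 to 0.59, close to but slightly narrower than the reported 0.53 to 0.60. The gap may reflect a different interval method or fewer complete score pairs than the full sample.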


Bone & Joint Open
Vol. 5, Issue 9 | Pages 799 - 805
24 Sep 2024
Fletcher WR Collins T Fox A Pillai A

Aims. The Cartiva synthetic cartilage implant (SCI) entered mainstream use in the management of first metatarsophalangeal joint (MTPJ) arthritis following the positive results of large trials in 2016. Limited information is available on the longer-term outcomes of this implant within the literature, particularly when independent from the originator. This single-centre cohort study investigates the efficacy of the Cartiva SCI at up to five years. Methods. First MTPJ arthritis was radiologically graded according to the Hattrup and Johnson (HJ) classification. Preoperative and sequential postoperative patient-reported outcome measures (PROMs) were evaluated using the Manchester-Oxford Foot Questionnaire (MOXFQ), and the activities of daily living (ADL) sub-section of the Foot and Ankle Ability Measure (FAAM). Results. Patients were followed up for a mean of 66 months (SD 7.1). Of an initial 66 cases, 16 did not return PROM questionnaires. A total of six failures were noted, with survival of 82%. Overall, significant improvement in both objective scores (MOXFQ and FAAM ADL) was maintained versus preoperatively: 18.2 versus 58.0 (p > 0.001) and 86.2 versus 41.1 (p > 0.001), respectively. The improvement was noted to be less pronounced in males. Subjective scores had deteriorated since early follow-up, with an interval decrease in patient satisfaction from 89% to 68%. Furthermore, a subset of cases demonstrated clinically important interval deterioration in objective scores. However, no specific patient factors were found to be associated with outcomes following analysis. Conclusion. This study represents the longest-term independent follow-up in the literature. It shows reassuring mid-term efficacy of the Cartiva SCI with better-than-expected survival. However, deterioration in scores for a subset of patients and lower satisfaction may predict ongoing failure in this group of patients. Additionally, males were noted to have a lower degree of improvement in scores than females. As such, ongoing observation of the SCI to assess durability and survivability, and identify predictive factors, is key to improving patient selection. Cite this article: Bone Jt Open 2024;5(9):799–805