header advert
Orthopaedic Proceedings Logo

Receive monthly Table of Contents alerts from Orthopaedic Proceedings

Comprehensive article alerts can be set up and managed through your account settings

View my account settings

Visit Orthopaedic Proceedings at:

Loading...

Loading...

Full Access

Hip

USING NATURAL LANGUAGE PROCESSING TO COLLECT CODED INFORMATION FROM PATIENTS UNDERGOING TOTAL HIP ARTHROPLASTY

The Hip Society (THS) 2019 Summer Meeting, Kohler, WI, USA, 25–27 September 2019.



Abstract

Background

80% of health data is recorded as free text and not easily accessible for use in research and QI. Natural Language Processing (NLP) could be used as a method to abstract data easier than manual methods. Our objectives were to investigate whether NLP can be used to abstract structured clinical data from notes for total joint arthroplasty (TJA).

Methods

Clinical and hospital notes were collected for every patient undergoing a primary TJA. Human annotators reviewed a random training sample(n=400) and test sample(n=600) of notes from 6 different surgeons and manually abstracted historical, physical exam, operative, and outcomes data to create a gold standard dataset. Historical data collected included pain information and the various treatments tried (medications, injections, physical therapy). Physical exam information collected included ROM and the presence of deformity. Operative information included the angle of tibial slope, angle of tibial and femoral cuts, and patellar tracking for TKAs and approach and repair of external rotators for THAs. In addition, information on implant brand/type/size, sutures, and drains were collected for all TJAs. Finally, the occurrence of complications was collected. We then trained and tested our NLP system to automatically collect the respective variables. Finally, we assessed our automated approach by comparing system-generated findings against the gold standard.

Results

Overall, the NLP algorithm performed well at abstracting all variables in our random test dataset (accuracy=96.3%, sensitivity=95.2%, specificity=97.4%). It performed better at abstracting historical information (accuracy=97.0%), physical exam information (accuracy=98.8%), and information on complications (accuracy=96.8%) compared to operative information (accuracy=94.8%), but it performed well with a sensitivity and specificity >90.0% for all variables.

Discussion

The NLP system achieved good performance on a subset of randomly selected notes with querying information about TJA patients. Automated algorithms like the one developed here can help orthopedic practices collect information for registries and help guide QI without increased time-burden.

For any tables or figures, please contact the authors directly.