Differences
This shows you the differences between two versions of the page.
Next revision | |||
— | natural_language_processing [2022/07/01 01:01] – created - external edit 127.0.0.1 | ||
---|---|---|---|
Line 1: | Line 1: | ||
+ | ====== Natural language processing ====== | ||
+ | |||
+ | Natural [[language]] processing (NLP) is a subfield of computer science, information engineering, | ||
+ | ---- | ||
+ | A significant portion of [[data]] in [[Electronic Health Record]]s is only available as unstructured [[text]], such as surgical or finding [[report]]s, | ||
+ | ((Scheible R, Caliskan D, Fischer P, Thomczyk F, Zabka S, Schneider H, Boeker M, Schulz S, Prokosch HU, Gulden C. AHD2FHIR: A Tool for Mapping of Natural Language Annotations to Fast Healthcare Interoperability Resources - A Technical Case Report. Stud Health Technol Inform. 2022 Jun 6; | ||
+ | |||
+ | ---- | ||
+ | Noncommercial tools for [[natural language processing]] are currently provided by a number of [[platform]]s. | ||
+ | |||
+ | [[Text Blob module]] for [[sentiment analysis]], is based on the [[Natural Language ToolKit]] (NLTK) | ||
+ | ((Natural Language Toolkit, NLTK 3.5. documentation. | ||
+ | https:// | ||
+ | |||
+ | ---- | ||
+ | [[Surgical site infection]]s are a major driver of [[morbidity]] and increased [[cost]]s in the [[postoperative]] period after [[spine surgery]]. Current [[tool]]s for [[surveillance]] of these [[adverse event]]s rely on [[prospective]] clinical tracking, manual [[retrospective]] chart review, or administrative procedural and diagnosis codes. | ||
+ | |||
+ | The purpose of a study was to develop [[natural language processing]] (NLP) algorithms for automated reporting of postoperative wound infection requiring reoperation after lumbar discectomy. | ||
+ | |||
+ | Adult patients undergoing discectomy at two academic and three community medical centers between January 1st, 2000 and July 31st, 2019 for lumbar disc herniation. | ||
+ | |||
+ | Reoperation for wound infection within 90-days after surgery METHODS: Free-text notes of patients who underwent surgery from January 1st, 2000 to December 31st, 2015 were used for algorithm training. Free-text notes of patients who underwent surgery after January 1st, 2016 were used for algorithm testing. Manual chart review was used to label which patients had reoperation for wound infection. An extreme gradient-boosting NLP algorithm was developed to detect reoperation for postoperative wound infection. | ||
+ | |||
+ | Overall, 5860 patients were included in this study and 62 (1.1%) had a reoperation for wound infection. In patients who underwent surgery after January 1st, 2016 (n = 1377), | ||
+ | |||
+ | Temporal validation of the algorithm developed in this study demonstrates a proof-of-concept application of NLP for automated reporting of adverse events after spine surgery. Adapting this methodology for other procedures and outcomes in spine and orthopaedics has the potential to dramatically improve and automatize quality and safety reporting | ||
+ | ((Karhade AV, Bongers MER, Groot OQ, Cha TD, Doorly TP, Fogel HA, Hershman SH, | ||
+ | Tobert DG, Schoenfeld AJ, Kang JD, Harris MB, Bono CM, Schwab JH. Can natural | ||
+ | language processing provide accurate, automated reporting of wound infection | ||
+ | requiring reoperation after lumbar discectomy? Spine J. 2020 Mar 4. pii: | ||
+ | S1529-9430(20)30088-7. doi: 10.1016/ | ||
+ | PubMed PMID: 32145358. | ||
+ | )). | ||
+ | ---- | ||
+ | Accurate prediction of [[outcome]]s among [[patient]]s in [[intensive care unit]]s (ICUs) is important for clinical [[research]] and [[monitoring]] [[care]] [[quality]]. Most existing prediction models do not take full advantage of the [[electronic health record]], using only the single worst value of [[laboratory test]]s and [[vital signs]] and largely ignoring information present in free-text notes. Whether capturing more of the available [[data]] and applying [[machine learning]] and [[natural language processing]] (NLP) can improve and automate the prediction of outcomes among patients in the ICU remains unknown. | ||
+ | |||
+ | To evaluate the change in power for a mortality prediction model among patients in the ICU achieved by incorporating measures of clinical trajectory together with NLP of clinical text and to assess the generalizability of this approach. | ||
+ | |||
+ | This retrospective cohort study included 101 196 patients with a first-time admission to the ICU and a length of stay of at least 4 hours. Twenty ICUs at 2 academic medical centers (University of California, San Francisco [UCSF], and Beth Israel Deaconess Medical Center [BIDMC], Boston, Massachusetts) and 1 community hospital (Mills-Peninsula Medical Center [MPMC], Burlingame, California) contributed data from January 1, 2001, through June 1, 2017. Data were analyzed from July 1, 2017, through August 1, 2018. | ||
+ | : | ||
+ | In-hospital mortality and model discrimination as assessed by the area under the receiver operating characteristic curve (AUC) and model calibration as assessed by the modified Hosmer-Lemeshow statistic. | ||
+ | |||
+ | Among 101 196 patients included in the analysis, 51.3% (n = 51 899) were male, with a mean (SD) age of 61.3 (17.1) years; their in-hospital mortality rate was 10.4% (n = 10 505). A baseline model using only the highest and lowest observed values for each laboratory test result or vital sign achieved a cross-validated AUC of 0.831 (95% CI, 0.830-0.832). In contrast, that model augmented with measures of clinical trajectory achieved an AUC of 0.899 (95% CI, 0.896-0.902; | ||
+ | |||
+ | Intensive care unit mortality prediction models incorporating measures of clinical trajectory and NLP-derived terms yielded excellent predictive performance and generalized well in this sample of hospitals. The role of these automated algorithms, particularly those using [[unstructured data]] from notes and other sources, in clinical research and quality improvement seems to merit additional investigation | ||
+ | ((Marafino BJ, Park M, Davies JM, Thombley R, Luft HS, Sing DC, Kazi DS, DeJong | ||
+ | C, Boscardin WJ, Dean ML, Dudley RA. Validation of Prediction Models for Critical | ||
+ | Care Outcomes Using Natural Language Processing of Electronic Health Record Data. | ||
+ | JAMA Netw Open. 2018 Dec 7; | ||
+ | PubMed PMID: 30646310. | ||
+ | )). | ||
+ | |||
+ | ===== References ===== | ||