
Select an Action

Incorporating EMR and Genomic Data Using NLP and Machine Learning to Refine Cancer Treatment
Title:
Incorporating EMR and Genomic Data Using NLP and Machine Learning to Refine Cancer Treatment
Author:
Guan, Meijian, author.
ISBN:
9780355987386
Personal Author:
Physical Description:
1 electronic resource (71 pages)
General Note:
Source: Masters Abstracts International, Volume: 57-06M(E).
Advisors: Samuel Cho Committee members: Grey Ballard; David John; Umit Topaloglu.
Abstract:
Electronic medical records (EMR) have collected vast amounts of clinical data, including genomic testing results. In contrast to numerical data, majority of EMR are unstructured free text and not easy to be processed by computers. In this study, we explored how natural language processing (NLP) and machine learning can help to evaluate their impact on the clinical practice using free-text progress reports of cancer patients. We obtained 5,889 de-identified progress reports for 755 cancer patients from Wake Forest Baptist Health Comprehensive Cancer Center for our data analyses. An NLP system was implemented to process the free-text data and extract NGS-related information. Three types of recurrent neural network (RNN), including gated recurrent unit (GRU), long-short term memory (LSTM), and bidirectional LSTM (LSTM_Bi), were applied to classify documents to treatment-change group and no-treatment-change group. The performances of RNNs was compared to five machine learning algorithms including Naive Bayes (NB), K-nearest Neighbor (KNN), Support Vector Machine (SVM), Random Forest (RF), and Logistic Regression (LR). Our results suggested that, overall, RNNs outperformed traditional machine learning algorithms, and LSTM_Bi showed the best performance among RNNs. In addition, pre-trained word embedding can improve the results of RNNs and reduce their training time. Our findings demonstrated that RNN-based algorithms have advantages in unstructured clinical progress reports classification.
Local Note:
School code: 0248
Added Corporate Author:
Available:*
Shelf Number | Item Barcode | Shelf Location | Status |
|---|---|---|---|
| XX(692483.1) | 692483-1001 | Proquest E-Thesis Collection | Searching... |
On Order
Select a list
Make this your default list.
The following items were successfully added.
There was an error while adding the following items. Please try again.
:
Select An Item
Data usage warning: You will receive one text message for each title you selected.
Standard text messaging rates apply.


