University of Illinois at Chicago

File(s) under embargo





until file(s) become available

Text Modeling and Mining for Healthcare Using Deep Learning

posted on 2021-12-01, 00:00 authored by Shaika Chowdhury
Clinical texts are generated in an ever-increasing manner from sources such as EHR, medical forums and social networks. This data is information rich and being able to distill the relevant knowledge can facilitate various learning and prediction tasks in the healthcare domain. However, working with clinical texts is non-trivial and poses the following challenges: diverse expressions, heterogeneity, polysemy, data scarcity and irregular structure. This thesis focuses on effectively modeling and mining from the textual data for medical applications, so as to tackle the aforementioned challenges using deep learning techniques. To detect the diverse mentions related to pharmacovigilance from social media posts, we design a multi-task framework that benefits from joint learning of three related tasks. To extract useful patient knowledge from the heterogeneous EHR into a meaningful encoded representation, we model the data to concept graphs and fuse them using meta-embedding learning. To mine context-aware domain knowledge that is able to address the limited labeled data and polysemy issues in medical natural language inference (NLI), we supplement the medical ontology with other external resources. To mine the structured section information from the medical reports for efficient information extraction, we tackle the irregular section ordering issue by encoding both the semantic and topical dependencies of the sections using a dual sequential encoding model. Lastly, to extract the clinically-relevant information from patient-doctor conversations, we use a span-based model that helps to perform comprehensive extraction including diverse and overlapping entity mentions, and combine it with a noteworthy utterance prediction model for enhanced performance.



S. Yu, Philip


S. Yu, Philip


Computer Science

Degree Grantor

University of Illinois at Chicago

Degree Level

  • Doctoral

Degree name

PhD, Doctor of Philosophy

Committee Member

Marai, G. Elisabeta Parde, Natalie Tang, Wei Luo, Yuan

Submitted date

December 2021

Thesis type



  • en

Usage metrics


    No categories selected