Deep Representation Learning for Biomedical Text Mining

Date
2021
Authors
Ebadi, Nima
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract

The rapidly evolving curation of biomedical publications has resulted in an information crisis, such that researchers and other individuals working in biomedical domain require specialized text mining tools to keep track of the ever-evolving literature landscape. With the advances in natural language processing (NLP) models, especially those that are based on deep representation learning, gaining valuable insights from large-scale biomedical corpora have become extremely popular among researchers. Inspired by the state-of-the-art models proposed in general NLP settings, many studies have tried to address the common biomedical natural language processing (BioNLP) tasks, such as named-entity recognition NER, information/relation extraction (IE/RE), and question answering. However, directly applying the advancements of general NLP field to biomedical text mining often yields unsatisfactory results due to scarcity of labeled biomedical documents, ambiguous relations of biomedical concepts–with other domains or with general English words–continuous distribution shift of word semantics, multiple synonyms across different domains, and reliability challenges of deep learning models in general. In this dissertation, we aim to address these challenges using deep representation learning algorithms customized for one or multiple common tasks in BioNLP domain. We study the under-investigate aspects of BioNLP problems such as interpretability and reliability in addition to challenges of scale, scarcity of labeled data, and changes of word distributions. This dissertation approach deep representation learning as a tool to address the requirements of the biomedical research community, rather than models that achieve competitive "accuracies". Furthermore, as the aforementioned challenges become more severe in the context of a pandemic, we constantly provide case studies of the current pandemic and provide exclusive solutions from which the literature could substantially benefit if such an emergency situation recur in the future (God forbid!).

Description
This item is available only to currently enrolled UTSA students, faculty or staff.
Keywords
Biomedical Natural Language Processing, Biomedical Text Mining, Deep Learning, Deep Representation Learning, Natural Language Processing, Text Mining
Citation
Department
Electrical and Computer Engineering