Optimal Dynamic Treatment Regime by Reinforcement Learning in Clinical Medicine

dc.contributor.author: Song, Mina
dc.contributor.author: Han, David
dc.date.accessioned: 2021-02-06T23:27:43Z
dc.date.available: 2021-02-06T23:27:43Z
dc.date.issued: 2020-12
dc.description.abstract: Precision medicine tailors treatment regimes to patients with distinct clinical histories and characteristics. A dynamic treatment regime uses a reinforcement learning algorithm to produce the optimal personalized treatment regime in clinical medicine. Reinforcement learning applies when an agent takes actions in response to an environment that changes over time. Q-learning is one of the popular methods for developing the optimal dynamic treatment regime; it fits linear outcome models recursively. Despite its ease of implementation and its interpretability for domain experts, Q-learning is limited by the risk of misspecifying the linear outcome model. Recently, algorithms that are more robust to model misspecification have been developed. For example, the inverse probability weighted estimator overcomes this problem by estimating the mean outcome nonparametrically, assigning different weights to the observed outcomes. The augmented inverse probability weighted estimator, in turn, combines information from both the propensity model and the mean outcome model. Current statistical methods for producing the optimal dynamic treatment regime, however, allow only a binary action space. In clinical practice, some combinations of treatments are required, giving rise to a multi-dimensional action space. This study develops and demonstrates a practical way to accommodate a multi-level action space, using currently available computational methods for the practice of precision medicine. [en_US]
dc.description.department: Management Science and Statistics
dc.identifier.issn: 2470-3958
dc.identifier.uri: https://hdl.handle.net/20.500.12588/248
dc.language.iso: en_US [en_US]
dc.publisher: UTSA Office of Undergraduate Research [en_US]
dc.relation.ispartofseries: The UTSA Journal of Undergraduate Research and Scholarly Work;Volume 7
dc.subject: undergraduate student works [en_US]
dc.subject: dynamic treatment regime [en_US]
dc.subject: precision medicine [en_US]
dc.subject: Q-learning algorithm [en_US]
dc.subject: reinforcement learning [en_US]
dc.title: Optimal Dynamic Treatment Regime by Reinforcement Learning in Clinical Medicine [en_US]
dc.type: Poster [en_US]
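
Illustrative sketch (not part of the deposited record): the abstract describes Q-learning as fitting linear outcome models recursively and calls for an action space with more than two levels. The Python sketch below shows one way that backward recursion might look for two decision stages and three treatment levels. It is not the authors' implementation. The simulated data, the single covariate per stage, the choice of three levels, and every function name are assumptions made for illustration only.

```python
import numpy as np

N_LEVELS = 3  # assumption: three treatment options at each stage


def design(x, a, n_levels=N_LEVELS):
    """Features of a linear Q-model: intercept, covariate, and a main effect
    plus covariate interaction for each non-reference treatment level."""
    x = np.asarray(x, dtype=float)
    onehot = np.eye(n_levels)[np.asarray(a, dtype=int)][:, 1:]
    return np.column_stack([np.ones_like(x), x, onehot, onehot * x[:, None]])


def fit_q(x, a, y, n_levels=N_LEVELS):
    """Ordinary least squares fit of the linear Q-model."""
    beta, *_ = np.linalg.lstsq(design(x, a, n_levels), y, rcond=None)
    return beta


def q_table(beta, x, n_levels=N_LEVELS):
    """Predicted Q-value of every treatment level (columns) for every patient (rows)."""
    x = np.asarray(x, dtype=float)
    cols = [design(x, np.full(x.shape[0], k), n_levels) @ beta
            for k in range(n_levels)]
    return np.column_stack(cols)


def recommend(beta, x, n_levels=N_LEVELS):
    """Treatment level with the largest predicted Q-value for each patient."""
    return q_table(beta, x, n_levels).argmax(axis=1)


def two_stage_q_learning(x1, a1, x2, a2, y):
    """Backward recursion: fit stage 2 on the final outcome, then fit stage 1
    on the best achievable stage-2 prediction (the pseudo-outcome)."""
    beta2 = fit_q(x2, a2, y)
    pseudo = q_table(beta2, x2).max(axis=1)
    beta1 = fit_q(x1, a1, pseudo)
    return beta1, beta2


def simulate(n=2000, seed=0):
    """Hypothetical randomized two-stage trial; none of this is real data."""
    rng = np.random.default_rng(seed)
    x1 = rng.normal(size=n)
    a1 = rng.integers(0, N_LEVELS, size=n)
    x2 = x1 + 1.0 * (a1 == 2) + 0.1 * rng.normal(size=n)
    a2 = rng.integers(0, N_LEVELS, size=n)
    y = 1.0 + x2 + 2.0 * (a2 == 1) - x2 * (a2 == 2) + 0.1 * rng.normal(size=n)
    return x1, a1, x2, a2, y


beta1, beta2 = two_stage_q_learning(*simulate())
print("stage-2 recommendation at x2 = 0 and x2 = -3:", recommend(beta2, [0.0, -3.0]))
print("stage-1 recommendation at x1 = 0:", recommend(beta1, [0.0]))
```

In this simulation the stage-2 model is correctly specified, but the stage-1 pseudo-outcome is a maximum of linear functions and is therefore not linear in the covariate. That is a small instance of the misspecification risk the abstract attributes to Q-learning.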

Files

Original bundle

Number 13a.pdf (347.28 KB, Adobe Portable Document Format): Abstract
Number 13b.pdf (2.05 MB, Adobe Portable Document Format): Poster

License bundle

license.txt (1.86 KB, Item-specific license agreed upon to submission)