Methods for Reinforcement Learning in Clinical Decision Support

Prasad, Niranjani

Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp018s45qc694

Title:	Methods for Reinforcement Learning in Clinical Decision Support
Authors:	Prasad, Niranjani
Advisors:	Engelhardt, Barbara E
Contributors:	Computer Science Department
Keywords:	Clinical Decision Support Healthcare Machine Learning Reinforcement Leaning
Subjects:	Computer science
Issue Date:	2020
Publisher:	Princeton, NJ : Princeton University
Abstract:	The administration of routine interventions, from breathing support to pain management, constitutes a major part of inpatient care. Thoughtful treatment is crucial to improving patient outcomes and minimizing costs, but these interventions are often poorly understood, and clinical opinion on best protocols can vary significantly. Through a series of case studies of key critical care interventions, this thesis develops a framework for clinician-in-loop decision support. The first of these explores the weaning of patients from mechanical ventilation: admissions are modelled as Markov decision processes (MDPs), and model-free batch reinforcement learning algorithms are employed to learn personalized regimes of sedation and ventilator support, that show promise in improving outcomes when assessed against current clinical practice. The second part of this thesis is directed towards effective reward design when formulating clinical decisions as a reinforcement learning task. In tackling the problem of redundant testing in critical care, methods for Pareto-optimal reinforcement learning are integrated with known procedural constraints in order to consolidate multiple, often conflicting, clinical goals and produce a flexible optimized ordering policy. The challenges here are probed further to examine how decisions by care providers, as observed in available data, can be used to restrict the possible convex combinations of objectives in the reward function, to those that yield policies reflecting what we implicitly know from the data about reasonable behaviour for a task, and that allow for high-confidence off-policy evaluation. The proposed approach to reward design is demonstrated through synthetic domains as well as in planning in critical care. The final case study considers the task of electrolyte repletion, describing how this task can be optimized using the MDP framework and analysing current clinical behaviour through the lens of reinforcement learning, before going on to outline the steps necessary in enabling the adoption of these tools in current healthcare systems.
URI:	http://arks.princeton.edu/ark:/88435/dsp018s45qc694
Alternate format:	The Mudd Manuscript Library retains one bound copy of each dissertation. Search for these copies in the library's main catalog: catalog.princeton.edu
Type of Material:	Academic dissertations (Ph.D.)
Language:	en
Appears in Collections:	Computer Science

Files in This Item:

File	Description	Size	Format
Prasad_princeton_0181D_13412.pdf		3.59 MB	Adobe PDF	View/Download

Show full item record

Search

Browse