Risk-averse control of undiscounted transient Markov models

Cavus, Ozlem

doi:10.7282/T3MC8XTF

RUcore: Rutgers University Community Repository


Simple citation

Cavus, Ozlem. Risk-averse control of undiscounted transient Markov models. Retrieved from https://doi.org/doi:10.7282/T3MC8XTF

Description

Title: Risk-averse control of undiscounted transient Markov models

Name: Cavus, Ozlem (author); Ben-Israel, Adi (chair); Ruszczynski, Andrzej (internal member); Boros, Endre (internal member); Alizadeh, Farid (internal member); Katehakis, Michael N. (outside member); Dentcheva, Darinka (outside member); Rutgers University; Graduate School - New Brunswick

Date Created: 2012

Other Date: 2012-10 (degree)

Subject: Operations Research, Markov processes, Risk assessment

Extent: ix, 79 p. : ill.

Description: The classical optimal control problems for discrete-time, transient Markov processes are infinite horizon, undiscounted expected total cost or reward models. Examples of these models include optimal stopping problems and stochastic shortest or longest path problems, which may have applications in health care, finance, and maintenance. However, such expected value models implicitly assume that the decision maker is risk-neutral, so they may not be appropriate for several real-life problems. In this study, we use Markov risk measures to formulate a risk-averse version of the optimal control problem for transient Markov processes with general state and compact control spaces. We derive risk-averse dynamic programming equations and show that they have a unique solution, which is also the optimal value of the Markov control problem. Furthermore, we show that a randomized policy may be strictly better than deterministic policies when risk measures are employed. We suggest two algorithms, value iteration and policy iteration, for solving the dynamic programming equations and show their convergence. In general, each policy evaluation step of the policy iteration algorithm requires solving a system of nonsmooth equations. We use a version of the nonsmooth Newton method to solve these equations and show its global convergence. We further consider a risk-averse finite horizon Markov control problem under randomized policies and derive a value iteration method for its solution. Finally, we work on asset selling, organ transplant, and credit card examples to illustrate the theory for the infinite horizon problem, and present numerical results.
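The value iteration idea mentioned in the description can be illustrated with a minimal sketch on a small finite model. This is not the thesis's algorithm or data: the thesis treats general state spaces, while the sketch assumes a finite chain with one cost-free absorbing state, deterministic policies only, and a first-order mean-upper-semideviation as the one-step risk mapping (the description does not name a specific risk measure). The function names, `kappa`, and all numbers in `costs` and `kernels` are hypothetical.

```python
import numpy as np


def mean_semideviation(values, probs, kappa):
    """One-step risk mapping (assumed for illustration):
    E[v] + kappa * E[(v - E[v])_+], with 0 <= kappa <= 1.
    kappa = 0 recovers the risk-neutral expectation."""
    mean = probs @ values
    return mean + kappa * (probs @ np.maximum(values - mean, 0.0))


def risk_averse_value_iteration(costs, kernels, kappa, tol=1e-10, max_iter=10_000):
    """Iterate v(x) <- min_a { c(x, a) + sigma(v, P(. | x, a)) }.

    costs   : array (n_states, n_actions), one-step costs in transient states
    kernels : array (n_states, n_actions, n_states + 1); the last column is
              the probability of moving to the absorbing, cost-free state
    Returns the value of each transient state and a greedy action per state.
    """
    n_states, n_actions = costs.shape
    v = np.zeros(n_states + 1)  # last entry: absorbing state, value fixed at 0
    for _ in range(max_iter):
        q = np.array([
            [costs[x, a] + mean_semideviation(v, kernels[x, a], kappa)
             for a in range(n_actions)]
            for x in range(n_states)
        ])
        v_new = np.append(q.min(axis=1), 0.0)
        if np.max(np.abs(v_new - v)) < tol:
            return v_new[:-1], q.argmin(axis=1)
        v = v_new
    raise RuntimeError("value iteration did not converge")


# Hypothetical two-state, two-action example; every (state, action) pair
# reaches the absorbing state with probability at least 0.2.
costs = np.array([[1.0, 2.0],
                  [1.5, 0.5]])
kernels = np.array([
    [[0.5, 0.3, 0.2], [0.0, 0.1, 0.9]],
    [[0.2, 0.2, 0.6], [0.4, 0.4, 0.2]],
])

v_neutral, pi_neutral = risk_averse_value_iteration(costs, kernels, kappa=0.0)
v_averse, pi_averse = risk_averse_value_iteration(costs, kernels, kappa=0.5)
```

Because the semideviation term is nonnegative, the risk-averse values in this sketch can never fall below the risk-neutral ones; comparing `v_neutral` with `v_averse` shows how much the risk adjustment adds in each state.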

Note: Ph.D.

Note: Includes bibliographical references

Note: Includes vita

Note: by Ozlem Cavus

Genre: theses, ETD doctoral

Persistent URL: https://doi.org/doi:10.7282/T3MC8XTF

Language: eng

Collection: Graduate School - New Brunswick Electronic Theses and Dissertations

Organization Name: Rutgers, The State University of New Jersey

Rights: The author owns the copyright to this work.
