One of the significant challenges in solving optimization problems is handling possibly inaccurate or inconsistent function evaluations. Surprisingly, this problem is far from trivial even in one of the most basic settings: deciding which of two options is better when the value of each option is a random variable (a stochastic dilemma). Problems in this space have long been studied in the statistics, operations-research, and computer-science communities under the name of "multi-armed bandits". While most previous work has focused on handling noise in an online setting, this dissertation focuses on offline optimization, where the goal is to return a good solution with high probability using a finite number of samples. I will discuss a set of problem settings of increasing complexity that allow one to derive formal algorithmic bounds. I will also point out and discuss interesting connections between stochastic optimization and noisy data annotation, the problem of identifying the label of an object from a series of noisy evaluations. As a first contribution, I will introduce and formally analyze a set of novel algorithms that improve on the state of the art and provide new insights for solving the stochastic-optimization and noisy-data-annotation problems. I will then formally prove a novel result: that a widely used derivative-free optimization algorithm, the cross-entropy method, optimizes for quantiles rather than the expectation in stochastic optimization settings. I will support the theoretical claims on the optimization side with experimental results in a set of non-trivial planning and reinforcement-learning domains. Finally, I will discuss the application of the above algorithms to noisy data-annotation problems in a setting involving real crowdsourcing experiments.