Diuk Wasser, Carlos Gregorio. An object-oriented representation for efficient reinforcement learning. Retrieved from https://doi.org/doi:10.7282/T3H70FK5
Description: Agents (humans, mice, computers) need to make decisions constantly to survive and thrive in their environments. In the reinforcement-learning problem, an agent must learn to maximize its long-term expected reward through direct interaction with the world. To achieve this goal, the agent needs to build some form of internal representation of the relationship between its actions, the state of the world, and the reward it expects to obtain. In this work, I show that the way an agent represents state and models the world plays a key role in its ability to learn effectively. I introduce a new representation, based on objects and their interactions, and show how it enables learning that is several orders of magnitude faster on a large class of problems. I claim that this representation is a natural way of modeling state and that it bridges the gap between generality and tractability in a broad and interesting class of domains, namely those that are relational in nature. I present a set of learning algorithms that make use of this representation in both deterministic and stochastic environments, together with polynomial bounds that establish their efficiency in terms of learning complexity.
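To make the abstract's central idea concrete, here is a minimal sketch of what an object-based state representation might look like. This is not the algorithm developed in the thesis; it is a generic illustration, and the `Obj` class, the `touching` predicate, and the grid objects below are all hypothetical names chosen for the example. The key point it illustrates is that the learner can condition on relational predicates between objects rather than on raw coordinates, which is what allows experience to generalize across configurations.

```python
from dataclasses import dataclass

# Hypothetical sketch: each object has typed attributes, and a state is
# simply a collection of objects. Relations between objects (such as
# "touching") are derived predicates computed from those attributes.
@dataclass(frozen=True)
class Obj:
    name: str
    x: int
    y: int

def touching(a: Obj, b: Obj) -> bool:
    """True if two grid objects occupy horizontally or vertically
    adjacent cells (Manhattan distance of exactly 1)."""
    return abs(a.x - b.x) + abs(a.y - b.y) == 1

# A toy state with three objects. A learner using this representation
# would model how actions change relational predicates like touching(),
# instead of memorizing transitions for every raw (x, y) combination.
taxi = Obj("taxi", 2, 3)
wall = Obj("wall", 2, 4)
passenger = Obj("passenger", 0, 0)

print(touching(taxi, wall))        # True  (adjacent cells)
print(touching(taxi, passenger))   # False (far apart)
```

Because the same predicate evaluates identically wherever the objects happen to sit on the grid, a rule learned in one part of the state space transfers directly to every other configuration where the relation holds, which is the intuition behind the speedups the abstract claims.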