Optimal transport in reinforcement learning

Givchi, Arash

doi:doi:10.7282/t3-684x-8p09

RUcore: Rutgers University Community Repository

Search
- All
- Text
- Images
- Audio
- Video
Advanced Search | Help

Search all content in all RUcore collections.
Services
Collections

Help Contact Us My Account

Home

Resource

Optimal transport in reinforcement learning

PDF

PDF format is widely accepted and good for printing.

Plug-in required

PDF-1(1.56 MB)

Citation & Export

View Usage Statistics

Staff View

Citation & Export
Hide

Simple citation

Givchi, Arash. Optimal transport in reinforcement learning. Retrieved from https://doi.org/doi:10.7282/t3-684x-8p09

Export

Click here for information about Citation Management Tools at Rutgers.

Statistics
Hide

Description

TitleOptimal transport in reinforcement learning

NameGivchi, Arash (author); Shafto, Patrick (chair); Shafto, Patrick (member); Loftin, John (member); Fei, Teng (member); Sarwate, Anand (member); Rutgers University; Graduate School - Newark

Date Created2021

Other Date2021-10 (degree)

SubjectArtificial intelligence, Mathematics, Computer science, Reinforcement learning, Optimal transport

Extent1 online resource (vi, 43 pages) : illustrations

DescriptionWe consider constrained policy optimization in Reinforcement Learning (RL), where the constraints are in form of marginals on state visitations and global action executions. Given these distributions, we formulate policy optimization as unbalanced optimal transport over the set of occupancy measures. We propose a general purpose RL objective based on Bregman divergence and optimize it using Dykstra's algorithm. The approach admits a large scale algorithm for when the state or action space is large and only samples from the marginals are available. We discuss applications of our approach and provide demonstrations to show the effectiveness of our algorithm.

NotePh.D.

NoteIncludes bibliographical references

Genretheses

Persistent URLhttps://doi.org/doi:10.7282/t3-684x-8p09

LanguageEnglish

CollectionGraduate School - Newark Electronic Theses and Dissertations

Organization NameRutgers, The State University of New Jersey

RightsThe author owns the copyright to this work.

Version 8.5.5

Citation & ExportHide

Simple citation

Export

StatisticsHide

Description

Citation & Export
Hide

Statistics
Hide