Learning compositional robotic manipulation tasks from unlabeled data

Liang, Junchi

doi:doi:10.7282/t3-sb47-3w21

RUcore: Rutgers University Community Repository

Search
- All
- Text
- Images
- Audio
- Video
Advanced Search | Help

Search all content in all RUcore collections.
Services
Collections

Help Contact Us My Account

Home

Resource

Learning compositional robotic manipulation tasks from unlabeled data

PDF

PDF format is widely accepted and good for printing.

Plug-in required

PDF-1(40.43 MB)

Citation & Export

View Usage Statistics

Staff View

Citation & Export
Hide

Simple citation

Liang, Junchi. Learning compositional robotic manipulation tasks from unlabeled data. Retrieved from https://doi.org/doi:10.7282/t3-sb47-3w21

Export

Click here for information about Citation Management Tools at Rutgers.

Statistics
Hide

Description

TitleLearning compositional robotic manipulation tasks from unlabeled data

NameLiang, Junchi (author); Boularias, Abdeslam (chair); Yu, Jingjin (member); Aanjaneya, Mridul (member); Kroemer, Oliver (member); Rutgers University; School of Graduate Studies

Date Created2022

Other Date2022-10 (degree)

SubjectRobotics, Robots -- Control systems, Machine learning, Automatic programming (Computer science)

Extent1 online resource (134 pages) : illustrations

DescriptionResearchers have been seeking intelligent robotic systems that can accomplish complex tasks autonomously with very little human effort. With the recent progress from both planning algorithms and learning based methods, many low-level primitives can be fulfilled with remarkable quality. This thesis aims at the next step, compositional manipulation tasks. A compositional manipulation task consists of multiple sub-tasks and each sub-tasks can be completed by low-level manipulation, it requires a high-level reasoning to schedule sub-tasks.

In order to reduce labeling effort, we concentrate on self-supervised learning methods. To be specific, we study reinforcement learning algorithms and imitation learning algorithms. Reinforcement learning algorithms explore in the environment and findoptimal behavior based on rewards. As the only feedback is a reward and there is no direct information about sub-tasks provided, we utilize first causality in training predictive model. Another proposed method is to construct a finite-state machine for describing transitions between sub-tasks and include it in the policy input.

Another line of works covered by this thesis are imitation learning methods where the robot is given a collection of human demonstrations rather than the reward. These demonstrations always present a complete task rather than sub-tasks, labels on sub-tasks switching are never provided. We propose a decomposition of the policy, and maximization of demonstration trajectory likelihood based on this decomposition learns the sub-tasks switching autonomously. Finally, we investigate generalizing this manipulation algorithm to unseen objects by removing the requirement of semantic labels on objects. The proposed method describes objects by feature vectors including appearance and shape information, so similar objects share similar features.

NotePh.D.

NoteIncludes bibliographical references

Genretheses

Persistent URLhttps://doi.org/doi:10.7282/t3-sb47-3w21

LanguageEnglish

CollectionSchool of Graduate Studies Electronic Theses and Dissertations

Organization NameRutgers, The State University of New Jersey

RightsThe author owns the copyright to this work.

Version 8.5.5

Citation & ExportHide

Simple citation

Export

StatisticsHide

Description

Citation & Export
Hide

Statistics
Hide