Deep Neural Networks (DNNs) have achieved great success in many fields. However, many DNN models are both deep and large, causing high storage and energy consumption during both the training and inference phases. As the size of DNNs continues to grow, it is critical to improve computational and energy efficiency while maintaining model performance. Various methods have been proposed for compressing DNN models; they can be categorized into three levels: model level, structure level, and weight level. This thesis focuses on structure-enforcing compression algorithms and an embedding quantization method, which aim at: i) lower storage and computational complexity, ii) easier hardware implementation owing to structured memory access patterns, and iii) embedding binarization oriented toward natural language processing.

Chapter 1 introduces the motivation of this dissertation in detail. Chapter 2 reviews the background and related work on compressing deep neural networks. Chapters 3, 4, and 5 present the proposed compression methods for fully connected layers, convolutional layers, and embedding layers, respectively. Finally, Chapter 6 discusses possible future directions for this research.
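To give a concrete sense of the embedding binarization mentioned above, the sketch below shows a generic sign-plus-scale scheme (in the spirit of XNOR-Net-style binarization), where each float32 embedding row is reduced to 1 bit per weight plus one per-row scaling factor. This is only an illustrative sketch under those assumptions, not the method proposed in this thesis; the function names binarize_embeddings and dequantize are hypothetical.

import numpy as np

def binarize_embeddings(emb: np.ndarray):
    """Illustrative sketch, not the thesis's algorithm.
    emb: (vocab_size, dim) float matrix.
    Returns (signs, scales): signs holds +/-1 in int8 (1 bit of
    information per weight), scales restores per-row magnitude."""
    scales = np.abs(emb).mean(axis=1, keepdims=True)   # per-row L1 scale
    signs = np.where(emb >= 0, 1.0, -1.0).astype(np.int8)
    return signs, scales

def dequantize(signs: np.ndarray, scales: np.ndarray) -> np.ndarray:
    """Reconstruct an approximate real-valued embedding table."""
    return signs.astype(np.float32) * scales

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    emb = rng.normal(size=(10000, 300)).astype(np.float32)
    signs, scales = binarize_embeddings(emb)
    approx = dequantize(signs, scales)
    # Roughly 32x fewer bits per weight; the reconstruction error
    # depends on how concentrated each row's magnitudes are.
    print("mean abs error:", np.abs(emb - approx).mean())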