Programming and managing data-driven applications between the edge and the cloud

Gibert Renart, Eduard

doi:doi:10.7282/t3-g4vd-km15

RUcore: Rutgers University Community Repository

Search
- All
- Text
- Images
- Audio
- Video
Advanced Search | Help

Search all content in all RUcore collections.
Services
Collections

Help Contact Us My Account

Home

Resource

Programming and managing data-driven applications between the edge and the cloud

PDF

PDF format is widely accepted and good for printing.

Plug-in required

PDF-1(2.62 MB)

Citation & Export

View Usage Statistics

Staff View

Citation & Export
Hide

Simple citation

Gibert Renart, Eduard. Programming and managing data-driven applications between the edge and the cloud. Retrieved from https://doi.org/doi:10.7282/t3-g4vd-km15

Export

Click here for information about Citation Management Tools at Rutgers.

Statistics
Hide

Description

TitleProgramming and managing data-driven applications between the edge and the cloud

NameGibert Renart, Eduard (author); Parashar, Manish (chair); Narayana Ganapathy, Srinivas (internal member); Kremer, Ulrich (internal member); Anshus, Otto (outside member); Rutgers University; School of Graduate Studies

Date Created2020

Other Date2020-05 (degree)

SubjectCloud computing, Computer Science

Extent1 online resource (xiv, 122 pages) : illustrations

DescriptionDue to the proliferation of the Internet of Things (IoT), the number of devices connected to the Internet is growing. These devices are generating large volumes of data at the edge of the infrastructure. According to International Data Corporation (IDC) predictions by 2025 the worldwide data will reach 180 zettabytes (ZB), and more than half of that data will come from IoT sensors. Although the generated data provides great potential for science and society, identifying and processing relevant data points hidden in streams of unimportant data, and doing this in near real-time, remains a significant challenge. The prevalent model of moving data from the edge to the cloud of the network is becoming unsustainable, resulting in an impact on latency, network congestion, storage cost and privacy.

These observations can be leveraged to design hybrid architectures that can leverage both the edge and the cloud resources to process the data in a timely manner. Although the cloud is better suited to perform heavier (resource intensive) analysis, such as processing historical events and very large datasets, edge devices can support real-time analytics that consider the temporal and spatial characteristics of IoT data. While edge processing can benefit IoT applications, edge resources are typically constrained in their capabilities. In addition integrating edge computing can also add complexity to applications, especially when they need to include policies that govern what kind of data is processed and analyzed at the edge and what is sent to cloud.

To address these challenges, this dissertation presents an IoT Edge Framework, called R-Pulsar, that extends cloud capabilities to local devices and provides a programming model for deciding what, when, where and how data get collected and processed. This thesis makes the following contributions: (1) A content- and location-based programming abstraction for specifying what data gets collected and where the data gets analyzed. (2) A rule-based programming abstraction for specifying when to trigger data-processing tasks based on data observations. (3) A programming abstraction for specifying how to split a given dataflow and place operators across edge and cloud resources. (4) An operator placement strategy that aims to minimize an aggregate cost which covers the end-to-end latency (time for an event to traverse the entire dataflow), the data transfer rate (amount of data transferred between the edge and the cloud) and the messaging cost (number of messages transferred between edge and the cloud). (5) Performance optimizations on the data-processing pipeline in order to achieve real-time performance on constrained devices. The applicability of this work to real-world IoT applications is validated through a series of experiments in which shows that R-Pulsar can reduce the bandwidth consumption
between the edge and the cloud by up to 82% and obtain results 40% faster than the traditional approach of moving all the data to the cloud.

NotePh.D.

NoteIncludes bibliographical references

Genretheses, ETD doctoral

Persistent URLhttps://doi.org/doi:10.7282/t3-g4vd-km15

LanguageEnglish

CollectionSchool of Graduate Studies Electronic Theses and Dissertations

Organization NameRutgers, The State University of New Jersey

RightsThe author owns the copyright to this work.

Version 8.5.5

Citation & ExportHide

Simple citation

Export

StatisticsHide

Description

Citation & Export
Hide

Statistics
Hide