Data analysts use outlier analysis to discover non-conforming patterns in data and to generate actionable insights. It is an incredibly useful approach, but like all data-driven approaches, it raises serious privacy-related ethical and legal concerns when the data concerns people's information. Is it possible to accurately analyze data for outliers while protecting the privacy of the people whose data we analyze? In this dissertation, we explicate methods to answer this question for the most practically relevant case, where outliers are defined in a data-dependent way and current privacy methods such as differential privacy fail to achieve practically meaningful utility.
To define what it means to protect privacy in outlier analysis, we conceptualize sensitive privacy, a notion that not only admits efficient algorithmic constructions but is also amenable to analysis. We introduce novel constructions to develop sensitively private mechanisms that accurately identify outliers, and to compile low-accuracy differentially private mechanisms into high-accuracy sensitively private mechanisms. Furthermore, to address the lack of a principled approach to private outlier analysis, we provide a framework that helps a data analyst identify the right problem specification and a practical solution for her application.
Finally, we develop mechanisms that guarantee both privacy and practically meaningful utility to identify (β,r)-anomalies as well as COVID-19 hotspots (an outlying event). An extensive empirical evaluation of these private mechanisms over a range of real-world datasets and use cases overwhelmingly supports the effectiveness of our approach.
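To make the (β,r)-anomaly concrete, the following minimal sketch flags a point as anomalous when at most β other points lie within distance r of it. This is an illustrative, non-private version of the detection task only; the function name, the use of Euclidean distance, and the exact "at most β neighbors" convention are assumptions for illustration, and the dissertation's formal definition and private mechanisms may differ.

```python
import numpy as np

def beta_r_anomalies(points, beta, r):
    """Flag each point as a (beta, r)-anomaly if at most `beta` other
    points lie within Euclidean distance `r` of it.

    Illustrative, non-private sketch; the formal definition used in the
    dissertation may differ in convention.
    """
    pts = np.asarray(points, dtype=float)
    # Pairwise Euclidean distances via broadcasting
    diffs = pts[:, None, :] - pts[None, :, :]
    dists = np.sqrt((diffs ** 2).sum(axis=-1))
    # Count neighbors within radius r, excluding the point itself
    neighbor_counts = (dists <= r).sum(axis=1) - 1
    return neighbor_counts <= beta

# Example: a dense cluster plus one isolated point
data = [(0, 0), (0.1, 0), (0, 0.1), (0.1, 0.1), (5, 5)]
flags = beta_r_anomalies(data, beta=1, r=1.0)
# Only the isolated point (5, 5) is flagged
```

Note that because the anomaly label depends on where the other points fall, the definition is data-dependent, which is precisely the regime where standard differentially private mechanisms lose practical utility.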