Staff View
New nonparametric approaches for multivariate and functional data analysis in outlier detection, construction of tolerance tubes, and clustering

Descriptive

TitleInfo
Title
New nonparametric approaches for multivariate and functional data analysis in outlier detection, construction of tolerance tubes, and clustering
Name (type = personal)
NamePart (type = family)
Fan
NamePart (type = given)
Yi
NamePart (type = date)
1990-
DisplayForm
Yi Fan
Role
RoleTerm (authority = RULIB)
author
Name (type = personal)
NamePart (type = family)
Liu
NamePart (type = given)
Regina
DisplayForm
Regina Liu
Affiliation
Advisory Committee
Role
RoleTerm (authority = RULIB)
chair
Name (type = personal)
NamePart (type = family)
Zhang
NamePart (type = given)
Cun-Hui
DisplayForm
Cun-Hui Zhang
Affiliation
Advisory Committee
Role
RoleTerm (authority = RULIB)
internal member
Name (type = personal)
NamePart (type = family)
Xie
NamePart (type = given)
Minge
DisplayForm
Minge Xie
Affiliation
Advisory Committee
Role
RoleTerm (authority = RULIB)
internal member
Name (type = personal)
NamePart (type = family)
Cheng
NamePart (type = given)
Andrew
DisplayForm
Andrew Cheng
Affiliation
Advisory Committee
Role
RoleTerm (authority = RULIB)
outside member
Name (type = corporate)
NamePart
Rutgers University
Role
RoleTerm (authority = RULIB)
degree grantor
Name (type = corporate)
NamePart
Graduate School - New Brunswick
Role
RoleTerm (authority = RULIB)
school
TypeOfResource
Text
Genre (authority = marcgt)
theses
OriginInfo
DateCreated (qualifier = exact)
2016
DateOther (qualifier = exact); (type = degree)
2016-05
CopyrightDate (encoding = w3cdtf); (qualifier = exact)
2016
Place
PlaceTerm (type = code)
xx
Language
LanguageTerm (authority = ISO639-2b); (type = code)
eng
Abstract (type = abstract)
Recent advances of powerful computing and data acquisition technologies have made large complex datasets ever-present, including high-dimensional or functional data. Most existing statistical approaches for multivariate or functional data rely on parametric assumptions such as normality. In reality, such assumptions are either difficult to justify or verify. The goal of this dissertation is to develop general nonparametric statistical approaches for outlier detection, tolerance tubes construction, and clustering for multivariate and functional data. i) In Chapter 3, we propose a general approach named Antipodal Reflection Depth (ARD), to refine any existing function of depth (henceforth base depth) to form a class of new depth functions. ARD has the advantage over its base depth in capturing the relative magnitude of deviation from all data points to the deepest one. This desirable property is key in making ARD particularly useful in many applications. Here, we focus primarily on its utility in outlier detection. ii) In Chapter 4, we introduce tolerance tubes, which can be viewed as generalizations of tolerance intervals/regions to functional settings. A tolerance tube ensures a specified portion of the functional dataset be contained within the tolerance limits with some level of confidence. In addition to extending the commonly accepted definitions of $beta-$content and $beta-$expectation, we introduce modifications by incorporating an exempt level $alpha$. The latter relaxes the definitions by allowing $alpha$ portion of each functional to be exempt from the requirements and is thus particularly useful to offset allowable occasional aberrations. iii) In Chapter 5, we propose a new clustering approach named K-means on Pairwise Distance (KMPD), and show it to be effective in detecting clusters with different sizes. Moreover, KMPD has the capability of grouping anomalous sample points into a single cluster, and therefore is an effective approach for outlier detection as well. All these approaches are completely non-parametric and data-driven, and thus can be broadly applicable. Relevant theoretical properties are investigated and justified. These approaches are also illustrated and tested using data from both simulations and a real application on a medical study of continuous glucose monitoring.
Subject (authority = RUETD)
Topic
Statistics and Biostatistics
Subject (authority = ETD-LCSH)
Topic
Outliers (Statistics)
RelatedItem (type = host)
TitleInfo
Title
Rutgers University Electronic Theses and Dissertations
Identifier (type = RULIB)
ETD
Identifier
ETD_7244
PhysicalDescription
Form (authority = gmd)
electronic resource
InternetMediaType
application/pdf
InternetMediaType
text/xml
Extent
1 online resource (xiv, 98 p. : ill.)
Note (type = degree)
Ph.D.
Note (type = bibliography)
Includes bibliographical references
Note (type = statement of responsibility)
by Yi Fan
RelatedItem (type = host)
TitleInfo
Title
Graduate School - New Brunswick Electronic Theses and Dissertations
Identifier (type = local)
rucore19991600001
Location
PhysicalLocation (authority = marcorg); (displayLabel = Rutgers, The State University of New Jersey)
NjNbRU
Identifier (type = doi)
doi:10.7282/T3HH6N76
Genre (authority = ExL-Esploro)
ETD doctoral
Back to the top

Rights

RightsDeclaration (ID = rulibRdec0006)
The author owns the copyright to this work.
RightsHolder (type = personal)
Name
FamilyName
Fan
GivenName
Yi
Role
Copyright Holder
RightsEvent
Type
Permission or license
DateTime (encoding = w3cdtf); (qualifier = exact); (point = start)
2016-04-14 17:06:20
AssociatedEntity
Name
Yi Fan
Role
Copyright holder
Affiliation
Rutgers University. Graduate School - New Brunswick
AssociatedObject
Type
License
Name
Author Agreement License
Detail
I hereby grant to the Rutgers University Libraries and to my school the non-exclusive right to archive, reproduce and distribute my thesis or dissertation, in whole or in part, and/or my abstract, in whole or in part, in and from an electronic format, subject to the release date subsequently stipulated in this submittal form and approved by my school. I represent and stipulate that the thesis or dissertation and its abstract are my original work, that they do not infringe or violate any rights of others, and that I make these grants as the sole owner of the rights to my thesis or dissertation and its abstract. I represent that I have obtained written permissions, when necessary, from the owner(s) of each third party copyrighted matter to be included in my thesis or dissertation and will supply copies of such upon request by my school. I acknowledge that RU ETD and my school will not distribute my thesis or dissertation or its abstract if, in their reasonable judgment, they believe all such rights have not been secured. I acknowledge that I retain ownership rights to the copyright of my work. I also retain the right to use all or part of this thesis or dissertation in future works, such as articles or books.
RightsEvent
DateTime (encoding = w3cdtf); (qualifier = exact); (point = start)
2016-05-31
DateTime (encoding = w3cdtf); (qualifier = exact); (point = end)
2017-05-31
Type
Embargo
Detail
Access to this PDF has been restricted at the author's request. It will be publicly available after May 31st, 2017.
Copyright
Status
Copyright protected
Availability
Status
Open
Reason
Permission or license
Back to the top

Technical

RULTechMD (ID = TECHNICAL1)
ContentModel
ETD
OperatingSystem (VERSION = 5.1)
windows xp
CreatingApplication
Version
1.5
ApplicationName
pdfTeX-1.40.14
DateCreated (point = end); (encoding = w3cdtf); (qualifier = exact)
2016-04-14T17:03:06
DateCreated (point = end); (encoding = w3cdtf); (qualifier = exact)
2016-04-14T17:03:06
Back to the top
Version 8.5.5
Rutgers University Libraries - Copyright ©2024