Staff View
The strong role of contour geometry in Structure-from-motion and its implications for projective consistency

Descriptive

TitleInfo
Title
The strong role of contour geometry in Structure-from-motion and its implications for projective consistency
Name (type = personal)
NamePart (type = family)
He
NamePart (type = given)
Xiaoli
NamePart (type = date)
1990-
DisplayForm
Xiaoli He
Role
RoleTerm (authority = RULIB)
author
Name (type = personal)
NamePart (type = family)
Singh
NamePart (type = given)
Manish
DisplayForm
Manish Singh
Affiliation
Advisory Committee
Role
RoleTerm (authority = RULIB)
chair
Name (type = personal)
NamePart (type = family)
Feldman
NamePart (type = given)
Jacob
DisplayForm
Jacob Feldman
Affiliation
Advisory Committee
Role
RoleTerm (authority = RULIB)
internal member
Name (type = personal)
NamePart (type = family)
Papathomas
NamePart (type = given)
Thomas
DisplayForm
Thomas Papathomas
Affiliation
Advisory Committee
Role
RoleTerm (authority = RULIB)
internal member
Name (type = personal)
NamePart (type = family)
Elgammal
NamePart (type = given)
Ahmed
DisplayForm
Ahmed Elgammal
Affiliation
Advisory Committee
Role
RoleTerm (authority = RULIB)
outside member
Name (type = corporate)
NamePart
Rutgers University
Role
RoleTerm (authority = RULIB)
degree grantor
Name (type = corporate)
NamePart
School of Graduate Studies
Role
RoleTerm (authority = RULIB)
school
TypeOfResource
Text
Genre (authority = marcgt)
theses
OriginInfo
DateCreated (encoding = w3cdtf); (keyDate = yes); (qualifier = exact)
2020
DateOther (encoding = w3cdtf); (qualifier = exact); (type = degree)
2020-05
CopyrightDate (encoding = w3cdtf); (qualifier = exact)
2020
Language
LanguageTerm (authority = ISO 639-3:2007); (type = text)
English
Abstract (type = abstract)
Structure from motion (SFM) studies have shown that observers can perceive 3D structure in dynamic dot displays that are projected from rotating 3D objects. The SFM literature has focused on explaining how human observers ‘reconstruct’ the 3D structure from dynamic dot displays. Some analyses are based on the positions of dots, others are based on the velocity field. For a long time, there have been two implicit assumptions in SFM research: first, the perceived 3D structure should be projectively consistent with the 2D display; second, the computation of 3D percept does not depend on contour geometry, only on the motion profile of the dots. However, there have been some suggestions in the literature that neither of the two assumptions is correct (Ramachandran, Cobb, & Rogers-Ramachandran, 1988; Thompson, Kersten, & Knecht, 1992; Froyen, Feldman, & Singh, 2013; Tanrıkulu, Froyen, Feldman, & Singh, 2016). Under certain conditions, the perceived 3D structure is not projectively consistent with the 2D motion, and in those cases, the 3D percept is generally dominated by the contour geometry (despite the inconsistency with the motion). Nevertheless, the role of con- tour geometry in SFM remains largely ignored, and none of the existing SFM models (in both Psychology and Computer Science) incorporate its role. The current dissertation aims to pro- vide more direct evidence against these two assumptions and investigate possible connections between them.

In chapter 1, we overview the literature in the field of human structure-from-motion research. The studies on SFM have been either based on the position of individual dots, or on the analysis of motion profile. Both types of studies assume that the perceived 3D structure should be projectively consistent with the 2D display. Also, they seem to ignore the role of contour geometry in the determined 3D structures. We show that some studies have provided evidence that questions the above two assumptions. The aim of current dissertation is to explore both topics, and its potential connections. Our hypothesis is that both motion and contour geometry play important roles in SFM, and when the two sources of information conflict with each other, the influence of contour geometry can be strong enough to override the inconsistency in motion profile. Therefore, this may provide an explanation for why projective consistency is not a necessary requirement in SFM.

In chapter 2, we manipulate the dots’ 2D speed profile in symmetric asymmetric and parallel shapes using a method introduced in He (2016), which allows us to systematically control the degree of projective consistency in the displays. The latter two types of shape contain projective inconsistency both in the speed profile and contour geometry. However, our results show that the volumetric percept with asymmetric displays is still vivid, providing strong evidence that projective consistency is not a necessary requirement in SFM. The similar strength of the 3D percept between symmetric and asymmetric displays also suggests that the global contour symmetry may not be as important as we thought in SFM. In these displays, the speed profile is locally consistent with contour geometry along each horizontal cross-section, so the zero-speed at the boundary suggests a slant of 90◦, which may serve as a cue to be an occluding contour of a 3D object. It suggests that instead of global geometry, we should probably focus on the local presence of 2D contour, which may itself provide a cue to the occluding contour of a 3D object, even if the speed profile does not go down to 0 there.

In chapter 3, we investigate the role of contour geometry by manipulating it separately from the speed profile of the dots. Specifically, we manipulate the shape of the aperture through which the same image motion is shown. Because of the absence of any occlusion cues, the shape of the aperture essentially defines the bounding contour of the SFM display. We start with a ‘motion region’ (such as rectangle, trapezoid, hexagon, barrel, diamond, ellipse) containing dot motion consistent with 3D rotation, and then transform the shape of this motion region to define a smaller aperture. In many conditions, rather than looking like dots moving behind an aperture, the contour captures and defines the perceived 3D shape. The question is how much the aperture shape can be transformed and still determine the 3D percept (i.e. contour-defined percept which is now projectively inconsistent with the image motion). We used Method of Constant Stimuli to find the thresholds for the contour-defined percept. In Experiment 1, we generated smaller apertures by compressing the horizontal width of the motion region. We found that when the horizontal width ratio ω was as low as 0.2 (ellipse) or 0.6 (rectangle), the SFM percept was still dominated by the contour shape. This shows the importance of contour geometry in SFM. In Experiments 2 and 3, we broke coaxiality (the aperture shape and motion region no longer shared the same axis) by either translating or rotating the aperture within the motion region. We found that observers were quite sensitive to the deviation from co-axiality. In Experiment 4, we introduced variation of horizontal width ratio ∆ω within the same display (such as a rotating sphere display shown through a triangle). We found no systematical effect of ∆ω. However, the results did show systematic effects of convexity and contour smoothness.

Overall: 1) These studies provide strong evidence that projective consistency is not a necessary requirement in SFM. Observers can readily perceive a 3D shape even when it is not projectively consistent with the SFM display; 2) These studies document the prominent role played by contour geometry in SFM, as well as some of the ways in which contour geometry and image motion interact in SFM. The prominent influence of contour geometry also in part explains why SFM percepts can fail to be projectively consistent with image motion. Rather than ignore contour geometry (as the SFM literature has largely done), the current study highlights the need for further systematic investigations into how contour geometry and image motion combine in SFM to generate a 3D percept. This in turn would make it possible to develop mathematical models that incorporate both of these cues in SFM.
Subject (authority = local)
Topic
Structure from motion
Subject (authority = LCSH)
Topic
Motion perception (Vision)
Subject (authority = RUETD)
Topic
Psychology
RelatedItem (type = host)
TitleInfo
Title
Rutgers University Electronic Theses and Dissertations
Identifier (type = RULIB)
ETD
Identifier
ETD_10649
PhysicalDescription
Form (authority = gmd)
InternetMediaType
application/pdf
InternetMediaType
text/xml
Extent
1 online resource (xiii, 103 pages) : illustrations
Note (type = degree)
Ph.D.
Note (type = bibliography)
Includes bibliographical references
RelatedItem (type = host)
TitleInfo
Title
School of Graduate Studies Electronic Theses and Dissertations
Identifier (type = local)
rucore10001600001
Location
PhysicalLocation (authority = marcorg); (displayLabel = Rutgers, The State University of New Jersey)
NjNbRU
Identifier (type = doi)
doi:10.7282/t3-0g55-pq96
Genre (authority = ExL-Esploro)
ETD doctoral
Back to the top

Rights

RightsDeclaration (ID = rulibRdec0006)
The author owns the copyright to this work.
RightsHolder (type = personal)
Name
FamilyName
He
GivenName
Xiaoli
Role
Copyright Holder
RightsEvent
Type
Permission or license
DateTime (encoding = w3cdtf); (qualifier = exact); (point = start)
2020-03-26 17:04:30
AssociatedEntity
Name
Xiaoli He
Role
Copyright holder
Affiliation
Rutgers University. School of Graduate Studies
AssociatedObject
Type
License
Name
Author Agreement License
Detail
I hereby grant to the Rutgers University Libraries and to my school the non-exclusive right to archive, reproduce and distribute my thesis or dissertation, in whole or in part, and/or my abstract, in whole or in part, in and from an electronic format, subject to the release date subsequently stipulated in this submittal form and approved by my school. I represent and stipulate that the thesis or dissertation and its abstract are my original work, that they do not infringe or violate any rights of others, and that I make these grants as the sole owner of the rights to my thesis or dissertation and its abstract. I represent that I have obtained written permissions, when necessary, from the owner(s) of each third party copyrighted matter to be included in my thesis or dissertation and will supply copies of such upon request by my school. I acknowledge that RU ETD and my school will not distribute my thesis or dissertation or its abstract if, in their reasonable judgment, they believe all such rights have not been secured. I acknowledge that I retain ownership rights to the copyright of my work. I also retain the right to use all or part of this thesis or dissertation in future works, such as articles or books.
RightsEvent
Type
Embargo
DateTime (encoding = w3cdtf); (qualifier = exact); (point = start)
2020-05-31
DateTime (encoding = w3cdtf); (qualifier = exact); (point = end)
2022-05-31
Detail
Access to this PDF has been restricted at the author's request. It will be publicly available after May 31st, 2022.
Copyright
Status
Copyright protected
Availability
Status
Open
Reason
Permission or license
Back to the top

Technical

RULTechMD (ID = TECHNICAL1)
ContentModel
ETD
OperatingSystem (VERSION = 5.1)
windows xp
CreatingApplication
Version
1.7
DateCreated (point = end); (encoding = w3cdtf); (qualifier = exact)
2020-03-27T14:11:21
DateCreated (point = end); (encoding = w3cdtf); (qualifier = exact)
2020-03-27T14:13:12
ApplicationName
Acrobat Pro DC 20.6.20042
Back to the top
Version 8.4.8
Rutgers University Libraries - Copyright ©2023