Staff View
Scalable algorithm and workload execution for geo locating satellite imagery

Descriptive

TitleInfo
Title
Scalable algorithm and workload execution for geo locating satellite imagery
Name (type = personal)
NamePart (type = family)
Al-Saadi
NamePart (type = given)
Aymen
NamePart (type = date)
1992-
DisplayForm
Aymen Al-Saadi
Role
RoleTerm (authority = RULIB)
author
Name (type = personal)
NamePart (type = family)
Jha
NamePart (type = given)
Shantenu
DisplayForm
Shantenu Jha
Affiliation
Advisory Committee
Role
RoleTerm (authority = RULIB)
chair
Name (type = personal)
NamePart (type = family)
Turilli
NamePart (type = given)
Matteo
DisplayForm
Matteo Turilli
Affiliation
Advisory Committee
Role
RoleTerm (authority = RULIB)
co-chair
Name (type = personal)
NamePart (type = family)
Soljanin
NamePart (type = given)
Emina
DisplayForm
Emina Soljanin
Affiliation
Advisory Committee
Role
RoleTerm (authority = RULIB)
internal member
Name (type = personal)
NamePart (type = family)
Ortiz
NamePart (type = given)
Jorge
DisplayForm
Jorge Ortiz
Affiliation
Advisory Committee
Role
RoleTerm (authority = RULIB)
internal member
Name (type = personal)
NamePart (type = family)
Lynch
NamePart (type = given)
Heather J
DisplayForm
Heather J Lynch
Affiliation
Advisory Committee
Role
RoleTerm (authority = RULIB)
outside member
Name (type = corporate)
NamePart
Rutgers University
Role
RoleTerm (authority = RULIB)
degree grantor
Name (type = corporate)
NamePart
School of Graduate Studies
Role
RoleTerm (authority = RULIB)
school
TypeOfResource
Text
Genre (authority = marcgt)
theses
OriginInfo
DateCreated (encoding = w3cdtf); (keyDate = yes); (qualifier = exact)
2020
DateOther (encoding = w3cdtf); (qualifier = exact); (type = degree)
2020-01
Language
LanguageTerm (authority = ISO 639-3:2007); (type = text)
English
Abstract (type = abstract)
Climate change is having an impact on the polar regions, causing the retreat of sea ice and mountain glaciers as well as mass loss from the Greenland and Antarctic ice sheets. A better and deeper understanding of these changes requires processing large volumes of imagery data. Efficient analysis of these increasingly large volumes of data requires scalable computing, advanced and efficiently implemented algorithms and petabytes of dedicated storage on high-performance computing (HPC) platforms.

Image geolocation is the process of estimating the geographic location of a single image using previously geolocated aerial and satellite imagery. The Scale-invariant feature transform (SIFT) algorithm is used to geolocate images by finding the similarities with other images in the database. The SIFT algorithm is invariant to image resizing and rotation, and partially invariant to changes in brightness and camera viewpoint.

In Chapter 3 we evaluate the accuracy in terms of the number of matched key points of two SIFT implementations: (a) CPU-SIFT, and (b) CUDA-SIFT. CPU-SIFT is an implementation of the SIFT algorithm on CPU; CUDA-SIFT is an implementation of the SIFT algorithm on GPU using CUDA parallel framework.

A performance characterization showed a number of limitations for CUDA-SIFT, such as low matching accuracy and run one CUDA kernels on a single GPU device while it is possible to run two. We addressed these limitations by increasing the size of image tiles that can be processed, the number of kernels of the CUDA implementation that can be concurrently run on the same GPU device, and improved the number and type of image features that can be concurrently evaluated.

We also evaluated the throughput of CPU-SIFT and CUDA-SIFT in terms of image size in Megabytes per second (MB/s) and their memory consumption. Chapter 3 showed the performance evaluation of both implementations in matching multispectral and satellite imagery. In our evaluation, geolocating two 1 GB satellite images with no geolocation information took 15 minutes using the CPU-SIFT implementation with an accuracy level of 89% and 4 GB of memory on one CPU (Intel Xeon E5-8860 v3). CUDA-SIFT matched the same images, taking 3.5 minutes on one GPU (NVIDIA Tesla P100), with an accuracy of 56.81%, and 7.1 GB of GPU memory on the XSEDE Bridges supercomputer.

An analysis of the current CUDA-SIFT showed that both accuracy and throughput could be improved. Motivated by possible enhancements, we implemented GPU-SIFT, which is an implementation of the SIFT algorithm, for GPUs implemented and based on the previous implementation of CUDA-SIFT. GPU-SIFT offers three major improvements over the previous implementation: (i) we implemented the MPS CUDA parallel framework to enable concurrent matching of multiple image tiles via multiple GPUs; (ii) we doubled the amount of dynamic memory that can be allocated per GPU from 2 to 4 GB. In this way, we increased the size of the images that can be tiled and processed from a maximum of 3000 x 3000 to 5000 x 5000 pixels; and (iii) we implemented an adaptive contrast enhancement, increasing the number of extracted and matched keypoints for each image. GPU-SIFT offers a significant increase in the number of extracted features with an accuracy of 74.18% by applying histogram contrast enhancement.

Finally, we compared the tradeoffs between the accuracy of CPU-SIFT, CUDA-SIFT and GPU-SIFT in terms of the number of matches between a pair of images and the execution time for the entire matching process. The tradeoffs between SIFT matching accuracy and SIFT execution time can create performance variance and can lead to different resource requirements. Characterizing the performance and understanding the performance tradeoffs between GPU-SIFT, CUDA-SIFT and CPU-SIFT can help to predict the performance of these algorithms and their applications with large-scale workloads.

Chapter 4 presents a workflow to run hundreds of image geolocating workflows concurrently on HPC using algorithms, and software tools that support the scalable and automated computation of image geolocation. We present a scalable image geolocating workflow with two different kernels (CPU or GPU) and evaluates the requirement of large-scale ensemble-based technique to run image geolocating pipelines.

The contributions of this thesis are: (i) Compare and characterize the performance between CPU-SIFT, CUDA-SIFT and implement GPU-SIFT as an improvement to serve the image geolocating use case; (ii) The design, implementation and concurrent execution of scalable image geolocating workflows as a set of pipelines to enable concurrent execution for ~200 images with near-linear scalability on 224/8 CPUs/GPUs.

This thesis is a part of the Imagery Cyberinfrastructure and Extensible Building-Blocks to Enhance Research in the Geosciences (ICEBERG) project. Our work has an impact on the domain scientist that will help move geoscience studies of image analysis to a new infrastructure linking scientists, satellite imagery, and high-performance computers. This new imagery-computing superhighway will make it easier for scientists to study processes at much larger spatial scales than has been previously possible.
Subject (authority = RUETD)
Topic
Electrical and Computer Engineering
RelatedItem (type = host)
TitleInfo
Title
Rutgers University Electronic Theses and Dissertations
Identifier (type = RULIB)
ETD
Identifier
ETD_10548
PhysicalDescription
Form (authority = gmd)
InternetMediaType
application/pdf
InternetMediaType
text/xml
Extent
1 online resource (ix, 55 pages) : illustrations
Note (type = degree)
M.S.
Note (type = bibliography)
Includes bibliographical references
Subject (authority = local)
Topic
Image matching
Subject (authority = LCSH)
Topic
Remote sensing
RelatedItem (type = host)
TitleInfo
Title
School of Graduate Studies Electronic Theses and Dissertations
Identifier (type = local)
rucore10001600001
Location
PhysicalLocation (authority = marcorg); (displayLabel = Rutgers, The State University of New Jersey)
NjNbRU
Identifier (type = doi)
doi:10.7282/t3-z0w1-sg59
Genre (authority = ExL-Esploro)
ETD graduate
Back to the top

Rights

RightsDeclaration (ID = rulibRdec0006)
The author owns the copyright to this work.
RightsHolder (type = personal)
Name
FamilyName
AL-Saadi
GivenName
Aymen
Role
Copyright Holder
RightsEvent
Type
Permission or license
DateTime (encoding = w3cdtf); (qualifier = exact); (point = start)
2020-01-13 10:54:30
AssociatedEntity
Name
Aymen AL-Saadi
Role
Copyright holder
Affiliation
Rutgers University. School of Graduate Studies
AssociatedObject
Type
License
Name
Author Agreement License
Detail
I hereby grant to the Rutgers University Libraries and to my school the non-exclusive right to archive, reproduce and distribute my thesis or dissertation, in whole or in part, and/or my abstract, in whole or in part, in and from an electronic format, subject to the release date subsequently stipulated in this submittal form and approved by my school. I represent and stipulate that the thesis or dissertation and its abstract are my original work, that they do not infringe or violate any rights of others, and that I make these grants as the sole owner of the rights to my thesis or dissertation and its abstract. I represent that I have obtained written permissions, when necessary, from the owner(s) of each third party copyrighted matter to be included in my thesis or dissertation and will supply copies of such upon request by my school. I acknowledge that RU ETD and my school will not distribute my thesis or dissertation or its abstract if, in their reasonable judgment, they believe all such rights have not been secured. I acknowledge that I retain ownership rights to the copyright of my work. I also retain the right to use all or part of this thesis or dissertation in future works, such as articles or books.
RightsEvent
Type
Embargo
DateTime (encoding = w3cdtf); (qualifier = exact); (point = start)
2020-01-31
DateTime (encoding = w3cdtf); (qualifier = exact); (point = end)
2020-08-01
Detail
Access to this PDF has been restricted at the author's request. It will be publicly available after August 1st, 2020.
Copyright
Status
Copyright protected
Availability
Status
Open
Reason
Permission or license
Back to the top

Technical

RULTechMD (ID = TECHNICAL1)
ContentModel
ETD
OperatingSystem (VERSION = 5.1)
windows xp
CreatingApplication
Version
1.5
DateCreated (point = end); (encoding = w3cdtf); (qualifier = exact)
2020-01-30T16:16:26
DateCreated (point = end); (encoding = w3cdtf); (qualifier = exact)
2020-01-30T16:16:26
ApplicationName
pdfTeX-1.40.19
Back to the top
Version 8.5.5
Rutgers University Libraries - Copyright ©2024