Recently, machine learning, as a branch of artificial intelligence, has played an increasingly significant role in academia and industry, particularly in image classification, object detection, and video analytics. Recent reports of bias in multimedia algorithms (e.g., lower face-detection accuracy for women and persons of color) have underscored the urgent need to devise approaches that work equally well for different demographic groups. Hence, we posit that ensuring fairness in multimodal processing (e.g., equal performance irrespective of the gender of the user) is an important research challenge.
This dissertation makes three novel contributions to the literature on fairness in multimedia processing. We first focus on the problem of face matching (i.e., matching low-resolution and high-resolution images of a person). We describe how an adversarial deep learning approach allows the model to maintain face-matching accuracy while reducing demographic disparities compared to a non-adversarial deep learning baseline. The results motivate and pave the way for more accurate and fair face-matching algorithms.
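To make the adversarial idea concrete, the following is a minimal sketch, not the dissertation's implementation: a linear encoder is trained so that a task head can predict the match label while a gradient-reversal-style penalty prevents an adversary head from recovering a demographic attribute from the encoding. All names, dimensions, and hyperparameters here are illustrative assumptions on synthetic data.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def train_adversarial(lam=1.0, lr=0.5, epochs=500, seed=0):
    """Toy adversarial debiasing: keep the task signal, suppress the attribute signal."""
    rng = np.random.default_rng(seed)
    n = 1000
    y = rng.integers(0, 2, n).astype(float)   # task label (e.g., match / non-match)
    a = rng.integers(0, 2, n).astype(float)   # protected attribute (independent of y)
    # Feature 0 carries the task signal, feature 1 carries the attribute signal.
    X = np.column_stack([2 * y - 1 + 0.3 * rng.standard_normal(n),
                         2 * a - 1 + 0.3 * rng.standard_normal(n),
                         rng.standard_normal((n, 2))])
    d, k = X.shape[1], 2
    W = 0.1 * rng.standard_normal((d, k))     # encoder
    u = 0.1 * rng.standard_normal(k)          # task head
    v = 0.1 * rng.standard_normal(k)          # adversary head
    for _ in range(epochs):
        Z = X @ W
        p_y, p_a = sigmoid(Z @ u), sigmoid(Z @ v)
        # Head gradients (binary cross-entropy).
        g_u = Z.T @ (p_y - y) / n
        g_v = Z.T @ (p_a - a) / n             # adversary minimises its own loss
        # Encoder: follow the task gradient, reverse the adversary's (gradient reversal).
        g_W = X.T @ ((p_y - y)[:, None] * u - lam * (p_a - a)[:, None] * v) / n
        u -= lr * g_u
        v -= lr * g_v
        W -= lr * g_W
    Z = X @ W
    task_acc = np.mean((sigmoid(Z @ u) > 0.5) == (y > 0.5))
    adv_acc = np.mean((sigmoid(Z @ v) > 0.5) == (a > 0.5))
    return task_acc, adv_acc
```

On this synthetic data, the encoder learns to retain the match-relevant direction while attenuating the attribute-bearing one, so task accuracy stays high while the adversary's accuracy drifts toward chance. The dissertation's models are deep networks rather than linear maps, but the minimax mechanics are the same in spirit.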
Second, we consider multimodal cyberbullying detection and propose a fairness-aware fusion framework that ensures both fairness and accuracy when combining data from multiple modalities. This Bayesian fusion framework is cognizant of the confidence level associated with each feature, the inter-dependencies between features, and, importantly, the fairness potential of each feature. Results of applying the framework to a multimodal (visual and textual) cyberbullying detection problem demonstrate its value in ensuring both accuracy and fairness.
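To illustrate the kind of weighting such a framework performs, the sketch below combines per-modality probabilities in log-odds space, weighting each modality by both a confidence score and a fairness score. This is a simplified assumption of ours for illustration; the actual framework is Bayesian and additionally models inter-feature dependencies, which this one-line weighting does not.

```python
import math

def logit(p):
    return math.log(p / (1.0 - p))

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def fairness_aware_fuse(probs, confidences, fairness_scores):
    """Fuse per-modality probabilities, down-weighting uncertain or unfair modalities.

    probs           -- per-modality probability of the positive class (e.g., bullying)
    confidences     -- reliability of each modality's score, in (0, 1]
    fairness_scores -- how evenly each modality performs across groups, in (0, 1]
    """
    weights = [c * f for c, f in zip(confidences, fairness_scores)]
    total = sum(weights)
    weights = [w / total for w in weights]  # normalise so the weights sum to 1
    fused_logit = sum(w * logit(p) for w, p in zip(weights, probs))
    return sigmoid(fused_logit)
```

With equal weights this reduces to averaging log-odds; a modality that is confident overall but performs unevenly across demographic groups (low fairness score) contributes little to the fused decision, which is the intuition behind fairness-aware fusion.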
Our third contribution revisits the problem of fairness in face matching and proposes a generative AI framework that can counter multiple kinds of bias (e.g., gender bias and age bias) simultaneously. The framework consists of two major components: a variational auto-encoder (VAE) that maps images to a more generic underlying representation, and a neural network that uses these representations for multi-label classification. A generative approach helps the system learn the underlying (latent) structure of the data, improving generalizability and reducing bias. The approach is tested on a public image dataset and found to be effective at reducing bias while maintaining high accuracy.
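The VAE component rests on two standard ingredients that can be sketched concisely: the reparameterization trick, which lets gradients flow through the latent sampling step, and the KL-divergence term that regularizes the latent space toward a standard normal prior. The code below is a generic sketch of these two pieces only; the dissertation's encoder/decoder architecture and downstream classifier are not reproduced here.

```python
import numpy as np

def reparameterize(mu, logvar, rng=None):
    """Sample z = mu + sigma * eps with eps ~ N(0, I), differentiably in mu/logvar."""
    rng = rng or np.random.default_rng(0)
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * logvar) * eps

def kl_to_standard_normal(mu, logvar):
    """KL( N(mu, diag(exp(logvar))) || N(0, I) ), summed over latent dimensions."""
    return -0.5 * np.sum(1.0 + logvar - mu ** 2 - np.exp(logvar))
```

A downstream classifier then consumes the latent codes rather than raw pixels, the intuition being that nuisance demographic attributes entangled in pixel space are less directly available once the data is compressed to its latent structure.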
In effect, the three contributions pave the way for fairer multimedia information processing, enabling security and personalization applications to provide equal opportunities to different demographic groups.