COMPACT REPRESENTATIVES OF DATABASES AND RESPONSIBLE DATA MANAGEMENT

Shetiya, Suraj

dc.contributor.advisor	Das, Gautam
dc.creator	Shetiya, Suraj
dc.date.accessioned	2023-09-27T16:30:06Z
dc.date.available	2023-09-27T16:30:06Z
dc.date.created	2023-08
dc.date.issued	2023-08-08
dc.date.submitted	August 2023
dc.identifier.uri	http://hdl.handle.net/10106/31725
dc.description.abstract	With the advent of advanced computational models, we are being constantly judged by AI systems, complex algorithmic systems based on data that has been collected about us. These analysis are critical as they span many wide spread areas of our lives. For instance, these systems have been shown to find effective ways to fight back and make informed decisions during the COVID-19 pandemic. The wide spread use of these models naturally give rise to a few questions regarding explaining decisions that these systems have made, fairness questions in various parts of these systems. In this dissertation, we present three important problems pertaining to interpret-ability and explain-ability in interactive black-box systems. User preference is often complex to express and quantify. In our first work in the dissertation, we present the problem of finding a small set of items from a data-set. These set of items minimize the user's regret of missing out on the top item from the whole data-set. Such a model also presents an opportunity for interactively understanding the user's preference and providing the user with the top item in the data-set. Our work presents the novel framework to deal with a class of regret measures. Use of data which is riddled with representative biases leads to unfair outcomes. In our second word, we present the problem of integrating fairness into range queries. More importantly, our goal is to eliminate representative biases from the output data which often is consumed by complex systems which can make critical decisions. In our work, we present a innovative index structure to deal with the single predicate range queries and a neighborhood based search to deal with multi-predicate queries. In our third work in this dissertation, we present the problem of explain-ability in black box matching systems. Our approach relies on Shapley values based methods for explanations. To speed the slow nature of Shapley algorithm, we present a sampling based approximate approach with theoretical guarantees. We present a few common problems that arise in these bipartite black-box matching systems. For each of these problems, we illustrate the design of the Shapley value function. For each of these problems, we discuss our approach to the problem, analyze its behavior and empirically evaluate our procedure. Our extensive experimental results show the efficacy and efficiency of our techniques.
dc.format.mimetype	application/pdf
dc.language.iso	en_US
dc.subject	Fairness
dc.subject	Responsible data management
dc.title	COMPACT REPRESENTATIVES OF DATABASES AND RESPONSIBLE DATA MANAGEMENT
dc.type	Thesis
dc.date.updated	2023-09-27T16:30:07Z
thesis.degree.department	Computer Science and Engineering
thesis.degree.grantor	The University of Texas at Arlington
thesis.degree.level	Doctoral
thesis.degree.name	Doctor of Philosophy in Computer Science
dc.type.material	text
dc.creator.orcid	0000-0001-9166-2365

Files in this item

Name:: SHETIYA-DISSERTATION-2023.pdf
Size:: 2.235Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Show simple item record