Show simple item record

dc.contributor.advisorDas, Gautam
dc.creatorShetiya, Suraj
dc.date.accessioned2023-09-27T16:30:06Z
dc.date.available2023-09-27T16:30:06Z
dc.date.created2023-08
dc.date.issued2023-08-08
dc.date.submittedAugust 2023
dc.identifier.urihttp://hdl.handle.net/10106/31725
dc.description.abstractWith the advent of advanced computational models, we are being constantly judged by AI systems, complex algorithmic systems based on data that has been collected about us. These analysis are critical as they span many wide spread areas of our lives. For instance, these systems have been shown to find effective ways to fight back and make informed decisions during the COVID-19 pandemic. The wide spread use of these models naturally give rise to a few questions regarding explaining decisions that these systems have made, fairness questions in various parts of these systems. In this dissertation, we present three important problems pertaining to interpret-ability and explain-ability in interactive black-box systems. User preference is often complex to express and quantify. In our first work in the dissertation, we present the problem of finding a small set of items from a data-set. These set of items minimize the user's regret of missing out on the top item from the whole data-set. Such a model also presents an opportunity for interactively understanding the user's preference and providing the user with the top item in the data-set. Our work presents the novel framework to deal with a class of regret measures. Use of data which is riddled with representative biases leads to unfair outcomes. In our second word, we present the problem of integrating fairness into range queries. More importantly, our goal is to eliminate representative biases from the output data which often is consumed by complex systems which can make critical decisions. In our work, we present a innovative index structure to deal with the single predicate range queries and a neighborhood based search to deal with multi-predicate queries. In our third work in this dissertation, we present the problem of explain-ability in black box matching systems. Our approach relies on Shapley values based methods for explanations. To speed the slow nature of Shapley algorithm, we present a sampling based approximate approach with theoretical guarantees. We present a few common problems that arise in these bipartite black-box matching systems. For each of these problems, we illustrate the design of the Shapley value function. For each of these problems, we discuss our approach to the problem, analyze its behavior and empirically evaluate our procedure. Our extensive experimental results show the efficacy and efficiency of our techniques.
dc.format.mimetypeapplication/pdf
dc.language.isoen_US
dc.subjectFairness
dc.subjectResponsible data management
dc.titleCOMPACT REPRESENTATIVES OF DATABASES AND RESPONSIBLE DATA MANAGEMENT
dc.typeThesis
dc.date.updated2023-09-27T16:30:07Z
thesis.degree.departmentComputer Science and Engineering
thesis.degree.grantorThe University of Texas at Arlington
thesis.degree.levelDoctoral
thesis.degree.nameDoctor of Philosophy in Computer Science
dc.type.materialtext
dc.creator.orcid0000-0001-9166-2365


Files in this item

Thumbnail


This item appears in the following Collection(s)

Show simple item record