ATTENTION: The works hosted here are being migrated to a new repository that will consolidate resources, improve discoverability, and better show UTA's research impact on the global community. We will update authors as the migration progresses. Please see MavMatrix for more information.
Show simple item record
dc.contributor.advisor | Gao, Jean | |
dc.creator | Alipourfard, Borzou | |
dc.date.accessioned | 2023-06-28T17:39:05Z | |
dc.date.available | 2023-06-28T17:39:05Z | |
dc.date.created | 2022-05 | |
dc.date.issued | 2022-03-14 | |
dc.date.submitted | May 2022 | |
dc.identifier.uri | http://hdl.handle.net/10106/31406 | |
dc.description.abstract | Discovering causal relations is a fundamental goal of science. Randomized controlled experiments were often considered to be the only reliable method for tackling this task. However, in recent years, various causal discovery methods have been proposed that are capable of identifying causal relations from purely observational data. While these causal discovery methods provide a theoretical framework for bridging the gap from statistical relations to causal conclusions, causal discovery remains a challenging task in practice; this challenge arises because many of the assumptions made in obtaining these theoretical results are often not met in practice. This is especially true when one considers causal discovery in the landscape of genomic data. The critical challenge in learning causal relations in genomic data concerns a marked contrast between the sample size requirements of the aforementioned causal discovery algorithms and the size of the samples obtained through genomic experiments. These causal discovery tools require at least thousands of samples for identifying small causal networks in domains with five to ten variables; in genomic data, we often face networks with hundreds of nodes while the available sample is limited to thousands at best. Besides this factor, in genomic studies, we have to deal with measurement errors, averaging effects, and feedback loops, all of which undermine the theoretical assumptions that come at the core of any typical causal discovery algorithms. In this work, we propose a series of improvements to the available causal discovery algorithms and propose new causal discovery tools that overcome all the aforementioned challenges allowing one to learn and uncover relations of causal nature in gene expression measurements. | |
dc.format.mimetype | application/pdf | |
dc.language.iso | en_US | |
dc.subject | Causal discovery | |
dc.subject | Gene expression analysis | |
dc.subject | Marginal dependence | |
dc.subject | Sink graph | |
dc.title | LEARNING CAUSAL BOUNDS USING MARGINAL INDEPENDENCE INFORMATION WITH APPLICATIONS TO GENE EXPRESSION ANALYSIS | |
dc.type | Thesis | |
dc.date.updated | 2023-06-28T17:39:05Z | |
thesis.degree.department | Computer Science and Engineering | |
thesis.degree.grantor | The University of Texas at Arlington | |
thesis.degree.level | Doctoral | |
thesis.degree.name | Doctor of Philosophy in Computer Science | |
dc.type.material | text | |
local.embargo.terms | 2024-05-01 | |
local.embargo.lift | 2024-05-01 | |
Files in this item
- Name:
- ALIPOURFARD-DISSERTATION-2022.pdf
- Size:
- 1.496Mb
- Format:
- PDF
- Name:
- Dissertation_2.zip
- Size:
- 4.139Mb
- Format:
- Unknown
- Name:
- Dissertation_1.zip
- Size:
- 4.139Mb
- Format:
- Unknown
- Name:
- Dissertation.zip
- Size:
- 5.437Mb
- Format:
- Unknown
This item appears in the following Collection(s)
Show simple item record