M.Sc Thesis

M.Sc StudentBen-Elazar Shay
SubjectComputational Methods for Analyzing Gene Regulation in
Model Organisms
DepartmentDepartment of Computer Science
Supervisor ASSOCIATE PROF. Zohar Yakhini
Full Thesis textFull thesis text - English Version


This dissertation embodies two separate research projects with a common goal - exploring gene regulation.  In Biology, gene regulation encompasses a broad field which attempts to describe the molecular interactions between various cellular factors that conspire to silence or activate the machinery in charge of compiling a gene from its source code - the DNA, to an executable thread - Protein, which in turn works in cohort with other active machinery in the cell to determine the organism’s phenotype. In the first project, we examine the environment’s’ effect on gene regulation through the lens of evolution, comparing gene expression of 5 strains of the nematode C. elegans grown in 5 different mediums. We use robust statistical methods to show that highly regulated genes, as distinguished by intergenic lengths, motif concentration, and expression levels, are particularly biased towards genotype-environment interactions. Sequencing these strains, we find that genes with expression variation across genotypes are enriched for promoter SNPs, as expected. However, genes with genotype-environment interactions do not significantly differ from background in terms of their promoter SNPs. Collectively, these results suggest that the highly-regulated nature of particular genes predispose them for exhibiting genotype-environment interaction as a consequence of changes to upstream regulators. This observation may provide a deeper understanding into the origin of the extraordinary gene expression diversity present in even closely related species.

In the second project, we take a pragmatic approach and provide an analytical framework of exploring both the structure of DNA and of detecting spatial co-localization of genomic markers. We go on to deploy this framework and provide a 3D structural model of the Saccharomyces Cerevisae genome, and use it to provide evidence of widespread co-localization of the targets of cellular factors, termed Transcription Factors (TFs). We also describe additional work aimed at exploring the space of structural conformations of the genome in an attempt to cluster chromatin conformations.