Identification of Transcription Factor Binding Regions
| Michael Snyder (PI) | Yale University | |
|---|---|---|
| Mark Gerstein | Yale University | |
| Anthony Hyman | Max Planck Institute | |
| Stuart Kim | Stanford University | |
| Valerie Reinke | Yale University | |
| Robert Waterston | University of Washington |
C. elegans has a modest-sized and completely sequenced genome, a completely defined somatic cell lineage, and a small number of cells; thus it provides an ideal system to map the regulatory information encoded in its genome. We propose to identify the DNA binding sites for 300 transcription factors in C. elegans. Transcription factor genes will be tagged and their expression examined throughout the C. elegans life cycle. The binding sites of each factor will be determined using chromatin immunoprecipitation followed by DNA microarray analysis (ChIP-chip). This binding information will be compared to other genome annotation and made publicly available. Specifically, we will: 1) Prepare constructs in which all transcription factor genes are fused to coding sequences for fluorescent reporter genes. (Hyman) 2) Generate a series of C. elegans lines that express each fusion construct and obtain a preliminary evaluation of the temporal and spatial expression of each factor throughout the life cycle of the animal. (Waterston, Kim) 3) Based on the expression results, perform ChIP-chip of each factor at one or more key time points. (Snyder, Reinke) 4) Validate the binding results using quantitative measures to measure binding. (Snyder, Reinke). 5) Implement high-throughput data processing pipelines and store the results of these in a web-accessible database for use by the scientific community. (Gerstein) 6) Perform a limited number of bioinformatic analyses to identify regulatory motifs, analyze the association of binding sites between factors and their distribution in the genome (Gerstein) We expect this project to contribute significant information to the identification of regulatory elements in the C. elegans genome; additionally it will serve as a model for parallel studies in human systems.
Information Resources to be Generated
- Binding site locations throughoput the genome for ~300 transcrption factor throughout the genome at one or more time points.
- Raw data for itme 2
- Possibly preliminary data on expression timing.
Contact Information
- PI: Mike Snyder (michael.snyder@yale.edu)
- Wet lab: Mei Zhong (mei.zhong@yale.edu)
- Informatics: Ashish Agarwal (ashish.agarwal@yale.edu)



