Colloquium - Justin Kinney (Cold Springs Harbor Lab) - Massively Parallel Assays, Machine Learning and the BioPhysics of Gene Regulation

Justin Kinney (Cold Springs Harbor Lab) 1/21/20 colloquium speaker
January 21, 2020
3:45PM - 4:45PM
1080 Physics Research Building, Smith Seminar room - reception at 3:30pm in front of the SSR

Date Range
2020-01-21 15:45:00 2020-01-21 16:45:00 Colloquium - Justin Kinney (Cold Springs Harbor Lab) - Massively Parallel Assays, Machine Learning and the BioPhysics of Gene Regulation Gene expression in all organisms is controlled by short DNA and RNA sequences called cis-regulatory elements (CREs). Proteins in the cellular milieu bind to nucleic acid sequences present within CREs, interact with one another, and thus form macromolecular complexes that modulate the expression of nearby genes. My lab uses a combination of experiments and mathematical modeling to study the biophysics of these regulatory processes in living cells. Our experiments are based on massively parallel reporter assays, which leverage the power of ultra-high-throughput DNA sequencing to measure the effects that thousands to millions of different CRE sequence perturbations have on gene expression. Our mathematical modeling efforts aim to infer biophysical models from the large DNA sequence datasets that these experiments yield. This machine learning task has proven to be remarkably fertile from a theoretical standpoint, tying together ideas in Bayesian field theory, information theory, dimensionality reduction, and deep learning. 1080 Physics Research Building, Smith Seminar room - reception at 3:30pm in front of the SSR America/New_York public

Gene expression in all organisms is controlled by short DNA and RNA sequences called cis-regulatory elements (CREs). Proteins in the cellular milieu bind to nucleic acid sequences present within CREs, interact with one another, and thus form macromolecular complexes that modulate the expression of nearby genes. My lab uses a combination of experiments and mathematical modeling to study the biophysics of these regulatory processes in living cells. Our experiments are based on massively parallel reporter assays, which leverage the power of ultra-high-throughput DNA sequencing to measure the effects that thousands to millions of different CRE sequence perturbations have on gene expression. Our mathematical modeling efforts aim to infer biophysical models from the large DNA sequence datasets that these experiments yield. This machine learning task has proven to be remarkably fertile from a theoretical standpoint, tying together ideas in Bayesian field theory, information theory, dimensionality reduction, and deep learning.