RNA splicing, the cellular process by which "junk" intronic regions are removed from initially transcribed RNA, is tightly regulated in healthy human development but frequently dysregulated in disease. This complex process involves hundreds of proteins and non-coding RNAs, so that predicting the effects of mutations on outcome is beyond current physical models. We instead take a data-driven approach: leveraging large-scale RNA-seq and massively parallel reporter assay data to train deep neural networks to predict splicing directly from gene sequence. This project will extend and apply such models to better understand the genetic basis of ALS and/or autism using large-scale whole-genome and RNA sequencing data we have access to through collaborators at the New York Genome Center.
Lab: Knowles Lab
Direct Supervisor: David Knowles
Position Dates: 5/17/2021 - 8/20/2021
Hours per Week: 40
Paid Position: Yes
Number of positions: 1
Qualifications: Machine learning, computational biology (ideally would have taken my class COMS4762).
Eligibility: Freshman, Sophomore, Junior, Senior, Master's (SEAS only)