Unravelling the genetic basis of neurological disorders with machine learning

Location of Research: Remote 

RNA splicing, the cellular process by which "junk" intronic regions are removed from initially transcribed RNA, is tightly regulated in healthy human development but frequently dysregulated in disease. This complex process involves hundreds of proteins and non-coding RNAs, so that predicting the effects of mutations on outcome is beyond current physical models. We instead take a data-driven approach: leveraging large-scale RNA-seq and massively parallel reporter assay data to train deep neural networks to predict splicing directly from gene sequence. This project will extend and apply such models to better understand the genetic basis of ALS and/or autism using large-scale whole-genome and RNA sequencing data we have access to through collaborators at the New York Genome Center.

Lab: Knowles Lab

Direct Supervisor: David Knowles

Position Dates: 5/17/2021 - 8/20/2021

Hours per Week: 40

Paid Position: Yes

Credit: No

Number of positions: 1

Qualifications: Machine learning, computational biology (ideally would have taken my class COMS4762).

Eligibility: Freshman, Sophomore, Junior, Senior, Master's (SEAS only)

David Knowles, [email protected]