I studied computer science and math during my undergrad at the University of Connecticut. I'm now a PhD student in the Systems, Synthetic, and Quantitative Biology PhD program at Harvard, working in Michael Desai's lab. I like working on a wide range of problems but my interests are mainly in protein fitness landscapes, antibodies, and Bayesian statistics.
Drug Discov. Today
The AUC and AUPR are metrics commonly used to evaluate models that predict the side effects of drugs using their molecular features. These models predict a matrix of drugs' associated side effects. However, the baseline AUC and AUPR can be high depending on the statistical properties of this matrix. We analyze this dependence, take it into account, and ask: to what degree do models actually benefit from molecular fingerprints?
Bioinformatics
Studies have genotyped and measured the gene expression levels of many people. Using this data, one can investigate how genotype influences gene expression, which could be useful for understanding the genetic basis of complex traits such as disease. We attempt to model gene expression, accounting for interactions among genetic markers. By doing so, we more accurately predict the expression of a large subset of genes.
Forecasting
Thunderstorms are complex phenomena that cause substantial power outages in a short period. Predicting these outages is challenging using models that summarize the weather over the entire course of the storm. Instead, we develop a framework designed for models to learn the dynamics of thunderstorm-caused outages directly from hourly weather forecasts.