Gene expression
Modeling gene regulation from DNA sequence
I study how sequence-to-function models can learn regulatory logic, predict expression across biological conditions, and reveal which parts of DNA shape molecular phenotypes.
What I work on
I develop computational methods for reading, organizing, and comparing genomes — from models of gene regulation and RNA splicing to algorithms for annotation, assembly, and graph indexing.
Gene expression
I study how sequence-to-function models can learn regulatory logic, predict expression across biological conditions, and reveal which parts of DNA shape molecular phenotypes.
RNA splicing
I develop computational models of splice sites and junctions, with an emphasis on sequence context, cross-species transfer, variant effects, and reliable evidence for transcript reconstruction.
Genome annotation
I develop LiftOn, a genome annotation lift-over algorithm that combines DNA and protein alignments to transfer accurate gene models across assemblies and species.
Genome assembly
I study how complete and diverse reference genomes become biologically useful, from assembling and annotating Han1 to annotating haplotype-resolved diploid genomes.
Pangenome indexing
I study graph representations and indexing algorithms that make collections of diverse genomes compact, searchable, and computationally practical.