<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"><channel><title>Kuan-Hao Chao — Posts</title><description>Research summaries and opinions from Kuan-Hao Chao.</description><link>https://khchao.com/</link><language>en-us</language><item><title>Why we built Shorkie: reading the regulatory code of yeast</title><link>https://khchao.com/posts/shorkie/</link><guid isPermaLink="true">https://khchao.com/posts/shorkie/</guid><description>The story behind Shorkie — a fungal DNA language model that learns the regulatory grammar of budding yeast and predicts how its genes turn on and off over time.</description><pubDate>Fri, 19 Sep 2025 00:00:00 GMT</pubDate><category>DNA language models</category><category>Gene expression</category><category>Yeast</category></item><item><title>Why we rebuilt SpliceAI: an open, retrainable splice-site predictor</title><link>https://khchao.com/posts/openspliceai/</link><guid isPermaLink="true">https://khchao.com/posts/openspliceai/</guid><description>The story behind OpenSpliceAI — a faithful, fast PyTorch reimplementation of SpliceAI that anyone can retrain on any species, and use to read the splicing impact of genetic variants.</description><pubDate>Sun, 01 Jun 2025 00:00:00 GMT</pubDate><category>RNA splicing</category><category>Deep learning</category><category>Open source</category></item><item><title>Why we built LiftOn: better genome annotation by combining DNA and protein</title><link>https://khchao.com/posts/lifton/</link><guid isPermaLink="true">https://khchao.com/posts/lifton/</guid><description>The story behind LiftOn — a genome-annotation tool that fuses DNA alignment (Liftoff) and protein alignment (miniprot) so you can carry accurate gene annotations onto a new assembly, even across species.</description><pubDate>Sat, 01 Feb 2025 00:00:00 GMT</pubDate><category>Genome annotation</category><category>Comparative genomics</category><category>Open source</category></item><item><title>Why we built Splam: a splice-junction model that cleans up RNA-seq</title><link>https://khchao.com/posts/splam/</link><guid isPermaLink="true">https://khchao.com/posts/splam/</guid><description>The story behind Splam — a compact deep-learning model that scores a splice junction from its local sequence, more accurately than SpliceAI, and uses it to strip spurious alignments out of RNA-seq.</description><pubDate>Thu, 15 Aug 2024 00:00:00 GMT</pubDate><category>RNA splicing</category><category>Deep learning</category><category>RNA-seq</category></item><item><title>Why we built WGT: tools for Wheeler graphs, the backbone of pangenome indexes</title><link>https://khchao.com/posts/wgt/</link><guid isPermaLink="true">https://khchao.com/posts/wgt/</guid><description>The story behind the Wheeler Graph Toolkit — software to generate, recognize, and visualize Wheeler graphs, the elegant class of graphs that can be indexed and searched like a BWT, using a fast heuristic and an SMT solver.</description><pubDate>Fri, 14 Jul 2023 00:00:00 GMT</pubDate><category>Pangenomics</category><category>Algorithms</category><category>Open source</category></item><item><title>Why we built Han1: a complete, annotated genome for the world&apos;s largest ethnic group</title><link>https://khchao.com/posts/han1/</link><guid isPermaLink="true">https://khchao.com/posts/han1/</guid><description>The story behind Han1 — the first gapless, reference-quality, fully annotated human genome from a Han Chinese individual, and what comparing two finished human genomes revealed.</description><pubDate>Wed, 01 Mar 2023 00:00:00 GMT</pubDate><category>Genome assembly</category><category>Human genome</category><category>Comparative genomics</category></item><item><title>Why we built sangeranalyseR: painless Sanger sequencing analysis in R</title><link>https://khchao.com/posts/sangeranalyser/</link><guid isPermaLink="true">https://khchao.com/posts/sangeranalyser/</guid><description>The story behind sangeranalyseR — a free, open-source R/Bioconductor package that takes you from raw .ab1 chromatograms to aligned consensus sequences in about four lines of code, with an interactive Shiny GUI.</description><pubDate>Mon, 01 Mar 2021 00:00:00 GMT</pubDate><category>Sanger sequencing</category><category>R / Bioconductor</category><category>Open source</category></item></channel></rss>