Publications
A collection of my research publications and preprints.
EvoLen: Evolution-Guided Tokenization for DNA Language Models
Nan Huang, Xingyu Zhou, Jiaqi Cui, Michelle Tapia-Pacheco, Tiffany Amariuta, Yang E. Li, Jingbo Shang
Under review at COLM 2026
A novel evolution-guided tokenization approach for DNA language models that captures evolutionary constraints in genomic sequences.
Simulating Organized Group Behavior: New Framework, Benchmark, and Analysis
Xinhao Zou, Yifei Huang, Zijian Wu, Jiarui Sha, Nan Huang, Lingfeng Yun, Jingbo Shang, Liangcai Peng
Under review at COLM 2026
A new framework and benchmark for simulating and analyzing organized group behavior.
Integrated Genetic and Transcriptomic Risk Prediction for Neonatal Asthma
Nan Huang, Matthew F. Ragsac, Brian K. Pham, Kelan G. Tantisira, Tiffany Amariuta
In preparation, 2026
Integrating polygenic risk scores and transcriptomic data for biologically informed neonatal asthma risk prediction.