Publications

A collection of my research publications and preprints.

EvoLen: Evolution-Guided Tokenization for DNA Language Models

Nan Huang, Xingyu Zhou, Jiaqi Cui, Michelle Tapia-Pacheco, Tiffany Amariuta, Yang E. Li, Jingbo Shang

Under review at COLM 2026

A novel evolution-guided tokenization approach for DNA language models that captures evolutionary constraints in genomic sequences.

Simulating Organized Group Behavior: New Framework, Benchmark, and Analysis

Xinhao Zou, Yifei Huang, Zijian Wu, Jiarui Sha, Nan Huang, Lingfeng Yun, Jingbo Shang, Liangcai Peng

Under review at COLM 2026

A new framework and benchmark for simulating and analyzing organized group behavior.

Integrated Genetic and Transcriptomic Risk Prediction for Neonatal Asthma

Nan Huang, Matthew F. Ragsac, Brian K. Pham, Kelan G. Tantisira, Tiffany Amariuta

In preparation, 2026

Integrating polygenic risk scores and transcriptomic data for biologically informed neonatal asthma risk prediction.