Publications
A collection of my research publications and preprints.
EvoLen: Evolution-Guided Tokenization for DNA Language Models
Huang, N., Zhou, X., Cui, J., Tapia-Pacheco, M., Amariuta, T., Li, Y. E., Shang, J.
Under review at COLM 2026 2026
A novel evolution-guided tokenization approach for DNA language models that captures evolutionary constraints in genomic sequences.
Simulating Organized Group Behavior: New Framework, Benchmark, and Analysis
Zou, X., Huang, Y., Wu, Z., Sha, J., Huang, N., Yun, L., Shang, J., Peng, L.
Under review at COLM 2026 2026
A new framework and benchmark for simulating and analyzing organized group behavior.
Integrated Genetic and Transcriptomic Risk Prediction for Neonatal Asthma
Huang, N., Ragsac, M. F., Pham, B. K., Tantisira, K. G., Amariuta, T.
In Preparation 2026
Integrating polygenic risk scores and transcriptomic data for biologically informed neonatal asthma risk prediction.