Wang, Yiping, Jing Wang, Junhao Zhu, Fengyao Zhai, Hu Zhu, Ziwei Dai, Zengru Di, Da Zhou, and Yu Liu. “Compression-Based Tokenization Improves Language Modeling of Hierarchical Genomic Structure”. LangTaoSha Preprint Server, December 11, 2025. Accessed December 20, 2025. https://langtaosha.org.cn/index.php/lts/preprint/view/51.