(1) Wang, Y.; Wang, J.; Zhu, J.; Zhai, F.; Zhu, H.; Dai, Z.; Di, Z.; Zhou, D.; Liu, Y. Compression-Based Tokenization Improves Language Modeling of Hierarchical Genomic Structure. LangTaoSha Preprint Server. December 11, 2025. https://doi.org/10.65215/2qt5jb81.