Wang, Y., Wang, J., Zhu, J., Zhai, F., Zhu, H., Dai, Z., Di, Z., Zhou, D., & Liu, Y. (2025). Compression-Based Tokenization Improves Language Modeling of Hierarchical Genomic Structure. LangTaoSha Preprint Server. https://doi.org/10.65215/2qt5jb81