[1]
Y. Wang, “Compression-Based Tokenization Improves Language Modeling of Hierarchical Genomic Structure”, LangTaoSha Preprint Server. Dec. 11, 2025. doi: 10.65215/2qt5jb81.