Computational Biology and Bioinformatics

2 items

All items

  • Authors: Yiping Wang, Jing Wang, Junhao Zhu, Fengyao Zhai, Hu Zhu, Ziwei Dai, Zengru Di, Da Zhou, Yu Liu
    Abstract:

    Tokenization is a critical design choice in genomic language modeling. Widely used schemes (character-level encoding, fixed-length $k$-mers, and greedy subword algorithms such as BPE) show intrinsic limitations on DNA that are magnified by the small four-letter alphabet. To address this, we adapt Ladderpath, an Algorithmic Information Theory method that identifies nested and hierarchical...

    Keywords: Tokenization; Algorithmic Information Theory; Language Model
    Published: First published 2025-12-11 | Last updated 2025-12-11
    Metrics: Views: 195  |  Downloads: 89  |  Favorites: 0
  • Authors: Han Liu, Wenkang Qu, Da Liang, Xin Yu, Yuejie Lin, Haoyu Zou, Siyuan Han, Zhijun Zhao, Ying Lin, Xiaoyin Zhang, Jinyong Tao, Wenbin Li, Huiping Zhao, Yibin Zhang, Gongning Luo, Ningyi Jiang, Qiyu Peng
    Abstract:

    Background. Positron Emission Tomography (PET) is a vital medical imaging tool for studying in vivo metabolism. However, conventional PET systems cannot image subjects during free movement. Therefore, we have developed a wearable brain PET system for real-time imaging, called SmartBrain. Methods. The...

    Keywords: Wearable PET; Human brain imaging; NEMA; SmartBrain
    Published: First published 2025-12-04 | Last updated 2025-12-04
    Metrics: Views: 281  |  Downloads: 131  |  Favorites: 0