High-throughput cryo-EM characterization and automated model building of glycofibrils via CryoSeek
摘要
With CryoSeek, a structure-first paradigm for discovery, we have determined high resolution 3D structures of a number of glycofibrils, in which well-ordered glycans either form a thick shell coating various protein cores or constitute the entire fibril. To improve the throughput of CryoSeek, we hereby report two methods. The recursive bisection clustering (RBC) strategy has been designed to enable high-throughput cryo-EM data processing of fibrils. EModelG is an AI-facilitated algorithm for automated model building of glycans. Using the RBC method, we have established a high-throughput workflow for CryoSeek and have reconstructed 3D EM maps for hundreds of fibrils that can be automatically modelled in EModelG. Based on their molecular compositions and structural features, we tentatively proposed a unified nomenclature scheme for the fibrils discovered via CryoSeek. These structures will lay the foundation for decoding the principles of glycan folding. Furthermore, to adapt to the high volume of cryo-EM structures quickly obtained with the CryoSeek strategy, we have established a namesake database for data archiving and sharing.
指标
DOI:
Submission ID:
下载次数
已发布
如何引用
利益冲突声明
作者声明无任何需要披露的利益冲突。
Copyright
本预印本的版权持有者为作者/资助方。
保留所有权利。本作品受版权保护。未经版权所有者事先书面许可,不得以任何形式或方式复制、分发或传播本作品的任何部分。