Preprint / Version 1

SPIN-dvEvo: Exploration of vast functional sequence space by directed virtual evolution from a local sequence cluster

This article is a preprint and has not been certified by peer review.

Keywords
Directed evolution; Protein engineering; Protein Language Model

Abstract

Both natural and directed evolution are powerful at improving protein function, but they are slow to explore the nearly endless sequence space. Here, we present SPIN-dvEvo, which couples few-shot low-rank adaptation (LoRA) of an ESM-2 protein language model with a genetic algorithm to quickly evolve functional remote homologs from a local cluster of highly homologous, binary-labeled sequences. We experimentally tested SPIN-dvEvo on an enzyme (TadA, the core deaminase component of adenine base editors) and an intrinsically disordered protein (the antitoxin CcdA). For TadA, virtually evolved sequences with low sequence identity to the starting sequences achieved a 38% success rate (23/60) in the first round. In the second round, for which SPIN-dvEvo was retrained on first-round labels, the success rate rose to 51% and enzymatic activity improved by one order of magnitude. Virtual evolution of the disordered protein CcdA was also successful, albeit at a low success rate of 2.6%. Thus, SPIN-dvEvo can simulate billions of years of evolution in just minutes, rapidly creating new functional clusters.
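
To make the coupling of a learned scorer with a genetic algorithm concrete, here is a minimal sketch in Python. It is not the authors' implementation: the abstract gives no algorithmic details, so the selection scheme, crossover, mutation rate, and every function name below are assumptions chosen for illustration. In particular, `toy_score` is a hypothetical stand-in for the LoRA-adapted ESM-2 classifier and simply measures the fraction of positions matching an arbitrary reference string.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def toy_score(seq, reference):
    """Hypothetical stand-in for a fine-tuned model's predicted
    probability of function: fraction of positions matching `reference`."""
    matches = sum(a == b for a, b in zip(seq, reference))
    return matches / len(reference)


def mutate(seq, rate, rng):
    """Replace each residue with a random amino acid with probability `rate`."""
    return "".join(
        rng.choice(AMINO_ACIDS) if rng.random() < rate else aa for aa in seq
    )


def crossover(a, b, rng):
    """Single-point crossover between two equal-length sequences."""
    point = rng.randrange(1, len(a))
    return a[:point] + b[point:]


def evolve(seeds, score, generations=50, pop_size=40, rate=0.05, seed=0):
    """Generic elitist genetic algorithm (assumed design, not from the paper).

    `seeds` must contain at least two equal-length sequences; `score`
    maps a sequence to a number, higher meaning more likely functional.
    """
    rng = random.Random(seed)
    population = list(seeds)
    for _ in range(generations):
        ranked = sorted(population, key=score, reverse=True)
        parents = ranked[: max(2, pop_size // 4)]  # keep the top scorers
        children = []
        while len(children) < pop_size - len(parents):
            a, b = rng.sample(parents, 2)
            children.append(mutate(crossover(a, b, rng), rate, rng))
        population = parents + children
    return max(population, key=score)
```

In a setting like the one described, `score` would wrap the adapted language model's prediction for each candidate, and the top-ranked sequences would go on to experimental testing; the labels from that round could then be used to retrain the scorer before the next round.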

Posted

2026-01-30

How to Cite

Chen, Z., Tang, J., Zhang, T., Zhang, X., Nie, Q., Zhan, J., & Zhou, Y. (2026). SPIN-dvEvo: Exploration of vast functional sequence space by directed virtual evolution from a local sequence cluster. LangTaoSha Preprint Server. https://doi.org/10.65215/LTSpreprints.2026.01.29.000103

Declaration of Competing Interests

The authors declare no competing interests.