Preprint / Version 1

HALink: A VAE-Based and Hybrid GCN–GAT Architecture for Inferring Single-Cell Gene Regulatory Networks

This article is a preprint and has not been certified by peer review.

Authors

    Chaowang Lan,  
    Chaowang Lan
    Yulong Yuan,  
    Yulong Yuan
    Jingxin Wu,  
    Jingxin Wu
    Huiwu Zhang,  
    Huiwu Zhang
    Xingyu Ji,  
    Xingyu Ji
    Caihua Liu
    Caihua Liu
Categories
Keywords
Gene regulatory network; Variational Autoencoder; sparsification strategy; hybrid architecture

Abstract

Gene regulatory networks (GRNs) describe the complex interactions that control gene expression and are key indicators of cellular function and disease progression. Although numerous methods have been developed for inferring GRNs from single-cell transcriptomic data, current methods have two limitations: sensitivity to data sparsity and technical noise in single-cell RNA sequencing, and inadequate integration of both global structural information and local dependencies within GRNs.
To address these limitations, we propose HALink. This method integrating a variational autoencoder-based model and Top-K sparsification strategy to address the data sparsity, reduce technical noise, and insufficient prior knowledge. Furthermore, it ulitizes a hybrid Graph Convolutional Network–Graph Attention Network (GCN–GAT) architecture that simultaneously captures both global structural information and local dependencies within GRNs, enabling more comprehensive and accurate GRN construction.
Experiments results demonstrate that both the VAE-based model with sparsification strategy and the hybrid GCN–GAT architecture are able to improve the prediction performance, while their integrated implementation yields synergistic improvements. Compared with the state-of-art methods, HALink achieves superior performance on most experimental cases across four benchmark datasets. Our method facilitates the identification of key gene regulatory networks underlying life processes and enables the discovery of biologically meaningful insights into disease pathogenesis.

References

1. Alberto de la Fuente. What are gene regulatory networks?. In Handbook of research

on computational methodologies in gene regulatory networks, pages 1–27. IGI Global,

2010.

2. Daniele Mercatelli, Laura Scalambra, Luca Triboli, Forest Ray, and Federico M.

Giorgi. Gene regulatory network inference resources: A practical overview. Biochim-

ica et Biophysica Acta (BBA)-Gene Regulatory Mechanisms, 1863(6):194430, 2020.

Elsevier.

3. Eric H. Davidson and Douglas H. Erwin. Gene regulatory networks and the evolution

of animal body plans. Science, 311(5762):796–800, 2006.

4. Douglas H. Erwin and Eric H. Davidson. The evolution of hierarchical gene regu-

latory networks. Nature Reviews Genetics, 10(2):141–148, 2009. Nature Publishing

Group UK London.

5. Junlin Xu, Changcheng Lu, Shuting Jin, Yajie Meng, Xiangzheng Fu, Xiangxiang

Zeng, Ruth Nussinov, and Feixiong Cheng. Deep learning-based cell-specific gene

regulatory networks inferred from single-cell multiome data. Nucleic Acids Research,

53(5):gkaf138, 2025. Oxford University Press.

6. Frank Emmert-Streib, Matthias Dehmer, and Benjamin Haibe-Kains. Gene regu-

latory networks and their applications: understanding biological and medical prob-

lems in terms of networks. Frontiers in Cell and Developmental Biology, 2:38, 2014.

Frontiers Media SA.

7. Benzhe Su, Weiwei Wang, Xiaohui Lin, Shenglan Liu, and Xin Huang. Identifying

the potential miRNA biomarkers based on multi-view networks and reinforcement

learning for diseases. Briefings in Bioinformatics, 25(1):bbad427, 2024.

8. Mengyuan Zhao, Wenying He, Jijun Tang, Quan Zou, and Fei Guo. A comprehensive

overview and critical evaluation of gene regulatory network inference technologies.

Briefings in Bioinformatics, 22(5):bbab009, 2021.

9. Guo Mao, Ruigeng Zeng, Jintao Peng, Ke Zuo, Zhengbin Pang, and Jie Liu. Recon-

structing gene regulatory networks of biological function using differential equations

of multilayer perceptrons. BMC Bioinformatics, 23(1):503, 2022.

10. Weixu Wang, Yichen Wang, Ruiqi Lyu, and Dominic Grün. Scalable identification

of lineage-specific gene regulatory networks from metacells with NetID. Genome

Biology, 25(1):275, 2024.

11. Yang Li, Anjun Ma, Yizhong Wang, Qi Guo, Cankun Wang, Hongjun Fu,

Bingqiang Liu, and Qin Ma. Enhancer-driven gene regulatory networks infer-

ence from single-cell RNA-seq and ATAC-seq data. Briefings in Bioinformatics,

25(5):bbae369, 2024.

12. Eva Hedlund and Qiaolin Deng. Single-cell RNA sequencing: technical advance-

ments and biological applications. Molecular Aspects of Medicine, 59:36–46, 2018.

13. Riet De Smet and Kathleen Marchal. Advantages and limitations of current net-

work inference methods. Nature Reviews Microbiology, 8(10):717–729, 2010.

14. Alicia T. Specht and Jun Li. LEAP: constructing gene co-expression networks

for single-cell RNA-sequencing data using pseudotime ordering. Bioinformatics,

33(5):764–766, 2017.

15. Thalia E. Chan, Michael P. H. Stumpf, and Ann C. Babtie. Gene regulatory

network inference from single-cell data using multivariate information measures.

Cell Systems, 5(3):251–267, 2017.

16. Jiayi Dong, Jiahao Li, and Fei Wang. Deep learning in gene regulatory network

inference: a survey. IEEE/ACM Transactions on Computational Biology and Bioin-

formatics, 2024.

17. Mengyuan Zhao, Wenying He, Jijun Tang, Quan Zou, and Fei Guo. A hybrid deep

learning framework for gene regulatory network inference from single-cell transcrip-

tomic data. Briefings in Bioinformatics, 23(2):bbab568, 2022.

18. Guo Mao, Zhengbin Pang, Ke Zuo, Qinglin Wang, Xiangdong Pei, Xinhai Chen,

and Jie Liu. Predicting gene regulatory links from single-cell RNA-seq data using

graph neural networks. Briefings in Bioinformatics, 24(6):bbad414, 2023.

19. Lin Yuan, Ling Zhao, Yufeng Jiang, Zhen Shen, Qinhu Zhang, Ming Zhang, Chun-

Hou Zheng, and De-Shuang Huang. scMGATGRN: a multiview graph attention

network–based method for inferring gene regulatory networks from single-cell tran-

scriptomic data. Briefings in Bioinformatics, 25(6):bbae526, 2024.

20. Weiming Yu, Zerun Lin, Miaofang Lan, and Le Ou-Yang. GCLink: a graph con-

trastive link prediction framework for gene regulatory network inference. Bioinfor-

matics, 41(3):btaf074, 2025.

21. Jiacheng Wang, Yaojia Chen, and Quan Zou. Inferring gene regulatory network

from single-cell transcriptomes with graph autoencoder model. PLOS Genetics,

19(9):e1010942, 2023.

22. Guangyi Chen and Zhi-Ping Liu. Graph attention network for link predic-

tion of gene regulations from single-cell RNA-sequencing data. Bioinformatics,

38(19):4522–4529, 2022.

23. Muhan Zhang and Yixin Chen. Link prediction based on graph neural networks.

Advances in Neural Information Processing Systems, 31, 2018.

24. Steffen Albrecht, Tommaso Andreani, Miguel A Andrade-Navarro, and Jean Fred

Fontaine. Single-cell specific and interpretable machine learning models for sparse

scChIP-seq data imputation. Plos One, 17(7):e0270043, 2022.

25. HaiYun Wang, JianPing Zhao, ChunHou Zheng, and YanSen Su. scDSSC: deep

sparse subspace clustering for scRNA-seq data. PLOS Computational Biology,

18(12):e1010772, 2022.

26. Aditya Pratapa, Amogh P. Jalihal, Jeffrey N. Law, Aditya Bharadwaj, and T.

M. Murali. Benchmarking algorithms for gene regulatory network inference from

single-cell transcriptomic data. Nature Methods, 17(2):147–154, 2020.

27. Kishan Kc, Rui Li, Feng Cui, Qi Yu, and Anne R. Haake. GNE: a deep learning

framework for gene network inference by aggregating biological information. BMC

Systems Biology, 13:1–14, 2019.

28. J. Gray Camp, Keisuke Sekine, Tobias Gerber, Henry Loeffler-Wirth, Hans Binder,

Malgorzata Gac, Sabina Kanton, Jorge Kageyama, Georg Damm, Daniel Seehofer,

et al. Multilineage communication regulates human liver bud development from

pluripotency. Nature, 546(7659):533–538, 2017.

29. Tetsutaro Hayashi, Haruka Ozaki, Yohei Sasagawa, Mana Umeda, Hiroki Danno,

and Itoshi Nikaido. Single-cell full-length total RNA sequencing uncovers dynamics

of recursive splicing and enhancer RNAs. Nature Communications, 9(1):619, 2018.

30. Alex K. Shalek, Rahul Satija, Joe Shuga, John J. Trombetta, Dave Gennert, Di-

ana Lu, Peilin Chen, Rona S. Gertner, Jellert T. Gaublomme, Nir Yosef, et al.

Single-cell RNA-seq reveals dynamic paracrine control of cellular variation. Nature,

510(7505):363–369, 2014.

31. Huilei Xu, Caroline Baroukh, Ruth Dannenfelser, Edward Y. Chen, Christopher

M. Tan, Yan Kou, Yujin E. Kim, Ihor R. Lemischka, and Avi Ma’ayan. ESCAPE:

database for integrating high-content published data collected from human and

mouse embryonic stem cells. Database, 2013:bat045, 2013.

32. Damian Szklarczyk, Annika L. Gable, David Lyon, Alexander Junge, Stefan Wyder,

Jaime Huerta-Cepas, Milan Simonovic, Nadezhda T. Doncheva, John H. Morris,

Peer Bork, et al. STRING v11: protein–protein association networks with increased

coverage, supporting functional discovery in genome-wide experimental datasets.

Nucleic Acids Research, 47(D1):D607–D613, 2019.

33. Luz Garcia-Alonso, Christian H. Holland, Mahmoud M. Ibrahim, Denes Turei, and

Julio Saez-Rodriguez. Benchmark and integration of resources for the estimation of

human transcription factor activities. Genome Research, 29(8):1363–1375, 2019.

34. Zhi-Ping Liu, Canglin Wu, Hongyu Miao, and Hulin Wu. RegNetwork: an inte-

grated database of transcriptional and post-transcriptional regulatory networks in

human and mouse. Database, 2015:bav095, 2015.

35. Heonjong Han, Jae-Won Cho, Sangyoung Lee, Ayoung Yun, Hyojin Kim, Dasom

Bae, Sunmo Yang, Chan Yeong Kim, Muyoung Lee, Eunbeen Kim, et al. TRRUST

v2: an expanded reference database of human and mouse transcriptional regulatory

interactions. Nucleic Acids Research, 46(D1):D380–D386, 2018.

36. Jill E. Moore, Michael J. Purcaro, Henry E. Pratt, Charles B. Epstein, Noam

Shoresh, Jessika Adrian, Trupti Kawli, Carrie A. Davis, Alexander Dobin, et al.

Expanded encyclopaedias of DNA elements in the human and mouse genomes. Na-

ture, 583(7818):699–710, 2020.

37. Shinya Oki, Tazro Ohta, Go Shioi, Hideki Hatanaka, Osamu Ogasawara, Yoshihiro

Okuda, Hideya Kawaji, Ryo Nakaki, Jun Sese, and Chikara Meno. ChIP-Atlas:

a data-mining suite powered by full integration of public ChIP-seq data. EMBO

Reports, 19(12):e46255, 2018.

38. Sonia Nestorowa, Fiona K. Hamey, Blanca Pijuan Sala, Evangelia Diamanti, Mairi

Shepherd, Elisa Laurenti, Nicola K. Wilson, David G. Kent, and Berthold Göttgens.

A single-cell resolution map of mouse hematopoietic stem and progenitor cell differ-

entiation. Blood, The Journal of the American Society of Hematology, 128(8):e20–

e31, 2016.

Metrics

Views: 38
Downloads: 12

Downloads

Additional Files

Supplemental File(s)

Posted

2026-05-11

How to Cite

Lan, C., Yuan, Y., Wu, J., Zhang, H., Ji, X., & Liu, C. (2026). HALink: A VAE-Based and Hybrid GCN–GAT Architecture for Inferring Single-Cell Gene Regulatory Networks. LangTaoSha Preprint Server. https://doi.org/10.65215/LTSpreprints.2026.05.11.000236

Download Citation

Declaration of Competing Interests

The authors declare no competing interests to disclose.