Abstract
Graph and image are two common representations of Hi-C cis-contact maps. Existing computational tools have only adopted Hi-C data modeled as unitary data structures but neglected the potential advantages of synergizing the information of different views. Here we propose GILoop, a dual-branch neural network that learns from both representations to identify genome-wide CTCF-mediated loops. With GILoop, we explore the combined strength of integrating the two view representations of Hi-C data and corroborate the complementary relationship between the views. In particular, the model outperforms the state-of-the-art loop calling framework and is also more robust against low-quality Hi-C libraries. We also uncover distinct preferences for matrix density by graph-based and image-based models, revealing interesting insights into Hi-C data elucidation. Finally, along with multiple transfer-learning case studies, we demonstrate that GILoop can accurately model the organizational and functional patterns of CTCF-mediated looping across different cell lines.
Original language | English |
---|---|
Article number | 105535 |
Journal | iScience |
Volume | 25 |
Issue number | 12 |
Online published | 10 Nov 2022 |
DOIs | |
Publication status | Published - 22 Dec 2022 |
Research Keywords
- Computational bioinformatics
- Genomic analysis
- Neural networks
Publisher's Copyright Statement
- This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/
Fingerprint
Dive into the research topics of 'GILoop: Robust chromatin loop calling across multiple sequencing depths on Hi-C data'. Together they form a unique fingerprint.
Projects
- 1 Finished
- HMRF: Development of Big Data Tools for High-Throughput Sequencing Data with Applications to Colorectal Cancer Genomes
WONG, K. C. (Principal Investigator / Project Coordinator) & WANG, X. (Co-Investigator)
1/09/20 → 13/11/23
Project: Research