Abstract
The cooperativity of transcription factors (TFs) is a widespread phenomenon in the gene regulation system. However, the interaction patterns between TF binding motifs remain elusive. The recent high-throughput assays, CAP-SELEX, have identified over 600 composite DNA sites (i.e. heterodimeric motifs) bound by cooperative TF pairs. However, there are over 25 000 inferentially effective heterodimeric TFs in the human cells. It is not practically feasible to validate all heterodimeric motifs due to cost and labor. We introduce DeepMotifSyn, a deep learning-based tool for synthesizing heterodimeric motifs from monomeric motif pairs. Specifically, DeepMotifSyn is composed of heterodimeric motif generator and evaluator. The generator is a U-Net-based neural network that can synthesize heterodimeric motifs from aligned motif pairs. The evaluator is a machine learning-based model that can score the generated heterodimeric motif candidates based on the motif sequence features. Systematic evaluations on CAP-SELEX data illustrate that DeepMotifSyn significantly outperforms the current state-of-the-art predictors. In addition, DeepMotifSyn can synthesize multiple heterodimeric motifs with different orientation and spacing settings. Such a feature can address the shortcomings of previous models. We believe DeepMotifSyn is a more practical and reliable model than current predictors on heterodimeric motif synthesis.
| Original language | English |
|---|---|
| Article number | bbab334 |
| Number of pages | 9 |
| Journal | Briefings in Bioinformatics |
| Volume | 23 |
| Issue number | 1 |
| Online published | 14 Sept 2021 |
| DOIs | |
| Publication status | Published - Jan 2022 |
Research Keywords
- transcription factor cooperativity
- heterodimeric motif synthesis
- deep learning
RGC Funding Information
- RGC-funded
Fingerprint
Dive into the research topics of 'DeepMotifSyn: a deep learning approach to synthesize heterodimeric DNA motifs'. Together they form a unique fingerprint.Projects
- 2 Finished
-
HMRF: Development of Big Data Tools for High-Throughput Sequencing Data with Applications to Colorectal Cancer Genomes
WONG, K. C. (Principal Investigator / Project Coordinator) & WANG, X. (Co-Investigator)
1/09/20 → 13/11/23
Project: Research
-
GRF: Heterodimeric DNA Motif Synthesis and Validations
WONG, K. C. (Principal Investigator / Project Coordinator) & SONG, Y. Q. (Co-Investigator)
1/12/18 → 29/11/22
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver