Logo image
A novel dual-attention deep neural network with multi-scale fusion feature processing for predicting transcription factor binding sites
Journal article   Peer reviewed

A novel dual-attention deep neural network with multi-scale fusion feature processing for predicting transcription factor binding sites

Yuechuan Dai, Xianjun Shen, Weizhong Zhao and Xiaohua Hu
IEEE journal of biomedical and health informatics, v 36(11), pp 1-11
01 Sep 2025
PMID: 40889326

Abstract

System software Protocols Libraries Synchronization Standards Peer-to-peer computing Parallel programming Message passing Semantics Accuracy High-performance computing (HPC) message passing interface (MPI) MPI startup MPI termination
Transcription functions as a pivotal biological process in cell biology, which is required to complete the binding of transcription factors (TFs) to transcription factor binding sites (TFBSs) on the DNA. Accurate prediction of TFBSs can provide great potential to regulate the expression of interested genes, which can facilitate exploration of new drugs and treatment for diseases. Although many deep learning-based models have been proposed for predicting TFBSs, existing models still have problems, including the use of convolutional processing of DNA sequences that loses information about the DNA double helix structure and fails to adequately account for the stereoscopic structure of DNA shape data in three dimensions. In this paper, we propose a novel model called DeepCTMS, in which both sequence features and shape features of DNA slices are effectively fused to derive high-quality representations for the task of TFBS prediction. A sequence feature processing module is first used to extract the DNA double helix structure features of DNA slices. The three-dimensional features of DNA shape data are extracted by employing a convolutional triple attention (CTA) module for the shape data of a DNA slice. Finally, a multi-scale fusion feature processing (MSFFP) module is used to fuse sequence features and shape features to obtain representations with significantly aligned semantics of both features. Ablation experiments, t-SNE visual analysis, and cross-cell line validation results demonstrate that DeepCTMS consistently outperforms benchmark models on prediction performance and generalization ability on 165 ChIP-seq datasets.

Metrics

Details

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Web of Science research areas
Computer Science, Theory & Methods
Engineering, Electrical & Electronic
Logo image