Cross-species modeling of plant genomes at single-nucleotide resolution using a pretrained DNA language model

A Milestone in Cross-Species Modeling of Plant Genomes: Creation and Breakthrough Application of the PlantCaduceus DNA Language Model I. Academic Background and Research Motivation In the past two decades, with the rapid development of high-throughput sequencing technology, over 1,000 plant genomes have been published, and this number is expected t...

Bias-Aware Training and Evaluation of Link Prediction Algorithms in Network Biology

Revealing “Rich Node” Bias in Link Prediction Algorithms and Countermeasures — An Interpretation of “Bias-aware Training and Evaluation of Link Prediction Algorithms in Network Biology” 1. Academic Background and Motivation Over the past decade, network biology has played an increasingly important role in uncovering associations and functions of bi...

DeepRNA-Twist: Language-Model-Guided RNA Torsion Angle Prediction with Attention-Inception Network

1. Academic Background and Research Motivation With the rapid development of life sciences and bioinformatics, research into RNA molecular structure and function has become a hot topic. RNA is not merely a carrier of genetic information, but also plays critical roles in regulation, catalysis, and various physiological processes. The three-dimension...

Chrombus-XMBD: A Graph Convolution Model Predicting 3D-Genome from Chromatin Features

Research Background and Disciplinary Significance In eukaryotic cells, the three-dimensional (3D) spatial structure of chromatin plays a crucial role in gene expression regulation. Through complex folding, looping, and local spatial reconfiguration of DNA, different genetic elements (such as promoters and enhancers) are brought into spatial proximi...

Ensemble Learning Based on Matrix Completion Improves Microbe-Disease Association Prediction

Academic Background and Research Problem Microorganisms, as one of the most widely distributed forms of life on Earth, are closely related to oceans, soil, and the human body. The human body contains approximately 350 trillion microbial cells, which are intricately linked to human health and the onset and progression of diseases. In recent years, w...

HSSPPI: Hierarchical and Spatial-Sequential Modeling for PPIs Prediction

Background: Unveiling the Bottlenecks and Opportunities in Protein Interaction Prediction Proteins serve as the core molecules for life activities, participating in almost all biological processes and cellular functions, including gene expression, RNA transcription, DNA synthesis, immune response, and more. Protein-protein interactions (PPI), as we...

MAEST: Accurate Spatial Domain Detection in Spatial Transcriptomics with Graph Masked Autoencoder

Spatial Transcriptomics: Cutting-Edge Technology for Deciphering Spatial Heterogeneity in Tissues Spatial transcriptomics (ST) is an emerging sequencing technology that has rapidly developed in recent years. Its core advantage lies in the simultaneous acquisition of gene expression and spatial location information at the tissue section level, provi...

TopoQA: A Topological Deep Learning-Based Approach for Protein Complex Structure Interface Quality Assessment

Academic Background The elucidation of protein complex 3D structures is a central topic in modern structural biology, molecular mechanism studies, drug design, and even artificial protein design. The function of a protein is often determined by its structure, and many biological processes involve complex interactions between proteins. Although trad...

Inferring Gene Regulatory Networks from Time-Series scRNA-Seq Data via Granger Causal Recurrent Autoencoders

1. Academic Background and Research Motivation In recent years, single-cell RNA sequencing (scRNA-seq) has become one of the most groundbreaking technologies in life sciences and medical research, enabling researchers to capture subtle differences in transcript levels among numerous cells at the resolution of individual cells. This technology has g...

Cox-SAGE: Enhancing Cox Proportional Hazards Model with Interpretable Graph Neural Networks for Cancer Prognosis

1. Research Background and Disciplinary Frontiers Cancer prognosis analysis has always been a core research direction in the medical field. In recent years, with the widespread application of high-throughput sequencing technologies, scientists have been able to delve deeper into exploring molecular biomarkers and clinical characteristics of cancer ...