DeepES: Deep Learning-Based Enzyme Screening for Identifying Orphan Enzyme Genes

Academic Background With the rapid advancement of sequencing technology, scientists have been able to obtain a vast amount of protein sequence data, including many enzyme sequences. However, despite the establishment of large enzyme databases such as the Kyoto Encyclopedia of Genes and Genomes (KEGG) and BRENDA, sequence information for many enzyme...

MostPlas: A Self-Correction Multi-Label Learning Model for Plasmid Host Range Prediction

Plasmids are small, circular, double-stranded DNA molecules that exist independently of chromosomal DNA in bacteria. They facilitate horizontal gene transfer, enabling host bacteria to acquire beneficial traits such as antibiotic resistance and metal resistance. Some plasmids can transfer, replicate, or persist in multiple microorganisms, and these...

Sequence Analysis: DNA Sequence Alignment Using Transformer Models

Academic Background DNA sequence alignment is a core task in genomics, aiming to map short DNA fragments (reads) to the most probable locations on a reference genome. Traditional methods typically involve two steps: first, indexing the genome, followed by efficient searching to locate potential positions for the reads. However, with the exponential...

SCICONE: Single-Cell Copy Number Calling and Event History Reconstruction

During tumor development, copy number alterations (CNAs) are key drivers of tumor heterogeneity and evolution. Understanding these variations is crucial for developing personalized cancer diagnostics and therapies. Single-cell sequencing technology offers the highest resolution for copy number analysis, down to the individual cell level. However, l...

FlowPacker: Protein Side-Chain Packing with Torsional Flow Matching

The three-dimensional structure of a protein is determined by its amino acid sequence, and the function of the protein is highly dependent on its three-dimensional structure. The side-chain conformations of proteins play a crucial role in protein folding, protein-protein interactions, and de novo protein design. Accurate prediction of protein side-...