DeepSRAMP
deepSRAMP is a accurate deep learning model for predicting m6A sites by fusing the sequence and genomic position features
Install / Use
/learn @zhfanrui/DeepSRAMPREADME
deepSRAMP
SRAMP is a popular mammalian m6A site predictor we previously developed (Nucleic Acids Res 2016). SRAMP has been totally cited by more than 570 papers (Google Scholar, 4-16, 2024) and represents the mostly used algorithm in this field. A large number of m6A sites were identified by the helps of SRAMP.
After ~8 years after its development, Now we released deepSRAMP (www.cuilab.cn/deepsramp) , which is designed based on a combined framework of transformer neural network and recurrent neural network by fusing the sequence and genomic position features. The results showed that SRAMP2 greatly outperforms its predecessor SRAMP with 15.0% increase of AUROC and 30.9% increase of AUPRC, and greatly outperforms other state-of-the-art m6A predictors (WHISTLE and DeepPromise) with average 16.1% and 18.3% increase of AUC and 43.9% and 46.4% increase of AUPRC, respectively.

Requirements
- torch
- pandas
- scikit-learn
- joblib
- tqdm
- matplotlib
- seaborn
- shap
Installation
- Install
condaand create a virtual enviroument namedsrampwithpythoninstalled.
conda create -y -n sramp python
conda activate sramp
- Clone this repo.
git clone https://github.com/zhfanrui/deepSRAMP.git
- Install this package and dependencies.
cd deepSRAMP
pip install .
Usage
Inference
- Download
BLASTandMuscle.
sh setup_inference.sh
- Prepare your query fasta and run
deepsramp predict \
--fasta /path/to/fasta.fa \
--db /path/to/database/hg38_mature \
--blast /path/to/blast/bin \
--model /path/to/model/full_400_mature.model \
--out ./result.csv
Train
- Download
GTFandFASTAfiles for training;
sh download.sh
- See tutorials for training.
Pretrained Models
Model names follow the rule of {mode}_{half_window_size}_{target}_{extra}.model.
| Model Name | Model Mode | Half Window Size | Window Size | Target | Extra | | :---: | :---: | :---: | :---: | :---: | :---: | | Main Models | | full_400_ythdf.model | Full Model | 400 | 801 | YTHDF1/2 | - | | full_400_mature.model | Full Model | 400 | 801 | Mature transcripts | - | | full_400_full.model | Full Model | 400 | 801 | Full transcripts | - | | Other Model Examples | | seqonly_400_mature.model | Sequence Features Only Model | 400 | 801 | Mature transcripts | - | | genomeonly_400_mature.model | Genome Features Only Model | 400 | 801 | Mature transcripts | - | | full_100_mature.model | Full Model | 100 | 201 | Mature transcripts | - | | full_600_mature.model | Full Model | 600 | 1201 | Mature transcripts | - | | full_400_mature_for_time.model | Full Model | 400 | 801 | Mature transcripts | For training time estimation | | Single Model Examples | | full_400_a549_single.model | Full Model | 400 | 801 | A549 | deepSRAMP$_{single}$ | | full_400_a549_dp.model | - | 500 | 1001 | A549 | DeepPromise | | full_400_a549_single_random.model | Full Model | 500 | 1001 | A549 | Selecting random transcript for training instead of max length trascript |
Tutorials
The reproduction of figures in the paper can be found in ipynb files.
Citation
[paper]
Performance
Cross Validation
Test on YTHDF1/2
Test on m6Aatlas
Related Skills
YC-Killer
2.7kA library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.
best-practices-researcher
The most comprehensive Claude Code skills registry | Web Search: https://skills-registry-web.vercel.app
groundhog
398Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).
isf-agent
a repo for an agent that helps researchers apply for isf funding
