Balrog
Bacterial Annotation by Learned Representation of Genes
Install / Use
/learn @salzberg-lab/BalrogREADME
Bacterial Annotation by Learned Representation Of Genes
<!-- badges: start --> <!--badges: end -->Overview
Balrog is a prokaryotic gene finder based on a Temporal Convolutional Network. We took a data-driven approach to prokaryotic gene finding, relying on the large and diverse collection of already-sequenced genomes. By training a single, universal model of bacterial genes on protein sequences from many different species, we were able to match the sensitivity of current gene finders while reducing the overall number of gene predictions. Balrog does not need to be refit on any new genome.
Published in PLOS Computational Biology here.

Getting started
Click the "Open in Colab" button above or click here to get started.
Press the play button on the left side of each cell to run it. Alternatively, hold shift or ctrl and press enter to run cells. Double click the top of a cell to inspect the code inside and change things. Double click the right side of the cell to hide the code. Have fun!
Because Balrog uses a complex gene model and performs alignment-based search with mmseqs2, each genome takes ~10-15 minutes to process. Feel free to open a GitHub issue if you run into problems or would like a command line version of Balrog.
Related Skills
node-connect
352.2kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
111.1kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
352.2kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
352.2kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
