SkillAgentSearch skills...

Taxonkit

A Practical and Efficient NCBI Taxonomy Toolkit, also supports creating NCBI-style taxdump files for custom taxonomies like GTDB/ICTV

Install / Use

/learn @shenwei356/Taxonkit
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

TaxonKit - A Practical and Efficient NCBI Taxonomy Toolkit

Related projects:

  • Taxid-Changelog: Tracking all changes of TaxIds, including deletion, new adding, merge, reuse, and rank/name changes.
  • GTDB taxdump: GTDB taxonomy taxdump files with trackable TaxIds.
  • ICTV taxdump: NCBI-style taxdump files for International Committee on Taxonomy of Viruses (ICTV)

Table of Contents

<!-- START doctoc generated TOC please keep comment here to allow auto update --> <!-- DON'T EDIT THIS SECTION, INSTEAD RE-RUN doctoc TO UPDATE --> <!-- END doctoc generated TOC please keep comment here to allow auto update -->

Features

Subcommands

Subcommand |Function :-----------------------------------------------------------------------------|:---------------------------------------------- list |List taxonomic subtrees (TaxIds) bellow given TaxIds lineage |Query taxonomic lineage of given TaxIds reformat |Reformat lineage in canonical ranks reformat2<sup></sup>|Reformat lineage in chosen ranks, allowing more ranks than 'reformat' name2taxid |Convert taxon names to TaxIds filter |Filter TaxIds by taxonomic rank range lca |Compute lowest common ancestor (LCA) for TaxIds taxid-changelog|Create TaxId changelog from dump archives profile2cami<sup></sup> |Convert metagenomic profile table to CAMI format cami-filter<sup></sup> |Remove taxa of given TaxIds and their descendants in CAMI metagenomic profile create-taxdump<sup></sup> |Create NCBI-style taxdump files for custom taxonomy, e.g., GTDB and ICTV

Note: <sup>*</sup>New commands since the publication.

<img src="taxonkit.jpg" alt="taxonkit" width="700" align="center" />

Benchmark

  1. Getting complete lineage for given TaxIds (this plot is very old).

    <img src="bench/bench.get_lineage.reformat.tsv.png" alt="" width="600" align="center" />

    Versions: ETE=3.1.2, taxopy=0.5.0 (faster since 0.6.0), TaxonKit=0.7.2.

Dataset

  1. Download and uncompress taxdump.tar.gz: https://ftp.ncbi.nih.gov/pub/taxonomy/taxdump.tar.gz
  2. Copy names.dmp, nodes.dmp, delnodes.dmp and merged.dmp to data directory: $HOME/.taxonkit, e.g., /home/shenwei/.taxonkit ,
  3. Optionally copy to some other directories, and later you can refer to using flag --data-dir, or environment variable TAXONKIT_DB.

All-in-one command:

wget -c https://ftp.ncbi.nih.gov/pub/taxonomy/taxdump.tar.gz 
tar -zxvf taxdump.tar.gz

mkdir -p $HOME/.taxonkit
cp names.dmp nodes.dmp delnodes.dmp merged.dmp $HOME/.taxonkit

Update dataset: Simply re-download the taxdump files, uncompress and override old ones.

Installation

Go to Download Page for more download options and changelogs.

TaxonKit is implemented in Go programming language, executable binary files for most popular operating systems are freely available in release page.

Method 1: Download binaries (latest stable/dev version)

Just download compressed executable file of your operating system, and uncompress it with tar -zxvf *.tar.gz command or other tools. And then:

  1. For Linux-like systems

    1. If you have root privilege simply copy it to /usr/local/bin:

       sudo cp taxonkit /usr/local/bin/
      
    2. Or copy to anywhere in the environment variable PATH:

       mkdir -p $HOME/bin/; cp taxonkit $HOME/bin/
      
  2. For Windows, just copy taxonkit.exe to C:\WINDOWS\system32.

Method 2: Install via conda (latest stable version) Install-with-conda Anaconda Cloud downloads

conda install -c bioconda taxonkit

Method 3: Install via homebrew (out of date)

brew install brewsci/bio/taxonkit

Method 4: Compile from source (latest stable/dev version)

  1. Install go

     wget https://go.dev/dl/go1.24.1.linux-amd64.tar.gz
    
     tar -zxf go1.24.1.linux-amd64.tar.gz -C $HOME/
    
     # or 
     #   echo "export PATH=$PATH:$HOME/go/bin" >> ~/.bashrc
     #   source ~/.bashrc
     export PATH=$PATH:$HOME/go/bin
    
  2. Compile TaxonKit

     # ------------- the latest stable version -------------
    
     go get -v -u github.com/shenwei356/taxonkit/taxonkit
    
     # The executable binary file is located in:
     #   ~/go/bin/taxonkit
     # You can also move it to anywhere in the $PATH
     mkdir -p $HOME/bin
     cp ~/go/bin/taxonkit $HOME/bin/
    
    
     # --------------- the development version --------------
    
     git clone https://github.com/shenwei356/taxonkit
     cd taxonkit/taxonkit/
     go build
    
     # The executable binary file is located in:
     #   ./taxonkit
     # You can also move it to anywhere in the $PATH
     mkdir -p $HOME/bin
     cp ./taxonkit $HOME/bin/
    

Bash-completion

Supported shell: bash|zsh|fish|powershell

Bash:

# generate completion shell
taxonkit gena

Related Skills

View on GitHub
GitHub Stars466
CategoryCustomer
Updated3d ago
Forks30

Languages

Go

Security Score

100/100

Audited on Mar 27, 2026

No findings