SkillAgentSearch skills...

Kmermaid

k-mer similarity analysis pipeline

Install / Use

/learn @nf-core/Kmermaid
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

<h1> <picture> <source media="(prefers-color-scheme: dark)" srcset="docs/images/nf-core-kmermaid_logo_dark.png"> <img alt="nf-core/kmermaid" src="docs/images/nf-core-kmermaid_logo_light.png"> </picture> </h1>[![GitHub Actions CI Status](https://github.com/nf-core/kmermaid/actions/workflows/ci.yml/badge.svg)](https://github.com/nf-core/kmermaid/actions/workflows/ci.yml) [![GitHub Actions Linting Status](https://github.com/nf-core/kmermaid/actions/workflows/linting.yml/badge.svg)](https://github.com/nf-core/kmermaid/actions/workflows/linting.yml)[![AWS CI](https://img.shields.io/badge/CI%20tests-full%20size-FF9900?labelColor=000000&logo=Amazon%20AWS)](https://nf-co.re/kmermaid/results)[![Cite with Zenodo](http://img.shields.io/badge/DOI-10.5281/zenodo.4143939-1073c8?labelColor=000000)](https://doi.org/10.5281/zenodo.4143939) [![nf-test](https://img.shields.io/badge/unit_tests-nf--test-337ab7.svg)](https://www.nf-test.com)

Nextflow run with conda run with docker run with singularity Launch on Seqera Platform

Get help on SlackFollow on TwitterFollow on MastodonWatch on YouTube

Introduction

nf-core/kmermaid is a bioinformatics pipeline that performs comparative analysis of *omes using k-mer based methods. It supports various reference and sequencing input formats, and provides statistics files along with a MultiQC report as output. It provides pre-processing methods for reads and alignments.

<!-- TODO nf-core: Include a figure that guides the user through the major workflow steps. Many nf-core workflows use the "tube map" design for that. See https://nf-co.re/docs/contributing/design_guidelines#examples for examples. -->

In the outline below, every step except for the main analysis is optional and might be input-dependent.

Optional – BAM preprocessing

  1. Extract BAM from 10X archive (tar)

  2. Extract FASTQ reads (samtools)

  3. Split reads per cell (grep)

  4. Count UMIs per cell (pbtk)

  5. Download SRA experiment () [optional]

Optional – read preprocessing

  1. Trim reads (fastp)
  2. Read QC (FastQC)
  3. Remove rRNA (sortmerna)
  4. Translate to protein (orpheum)

k-mer analysis per method

  1. Create sketch

  2. Calculate distances

  3. Present the results (MultiQC)

[!NOTE] If you are new to Nextflow and nf-core, please refer to this page on how to set-up Nextflow.Make sure to test your setup with -profile test before running the workflow on actual data.

Usage

With a samples.csv file

nextflow run nf-core/kmermaid --outdir s3://bucket/sub-bucket --samples samples.csv

With R1, R2 read pairs

nextflow run nf-core/kmermaid --outdir s3://olgabot-maca/nf-kmer-similarity/ \
  --read_pairs 's3://bucket/sub-bucket/*{R1,R2}*.fastq.gz,s3://bucket/sub-bucket2/*{1,2}.fastq.gz'

With SRA ids

nextflow run nf-core/kmermaid --outdir s3://bucket/sub-bucket --sra SRP016501

With fasta files

nextflow run nf-core/kmermaid --outdir s3://bucket/sub-bucket \
  --fastas '*.fasta'

With bam file

nextflow run nf-core/kmermaid  --outdir s3://bucket/sub-bucket \
  --bam 'possorted_genome_bam.bam'

With split kmer

nextflow run nf-core/kmermaid --outdir s3://bucket/sub-bucket --samples samples.csv --split_kmer --subsample 1000

Credits

nf-core/kmermaid was originally written by Olga Botvinnik. The DSL2 port is done by Igor Trujnara.

We thank the following people for their extensive assistance in the development of this pipeline:

<!-- TODO nf-core: If applicable, make list of people who have also contributed -->

Contributions and Support

If you would like to contribute to this pipeline, please see the contributing guidelines.

For further information or help, don't hesitate to get in touch on the Slack #kmermaid channel (you can join with this invite).

Citations

<!-- TODO nf-core: Add citation for pipeline after first release. Uncomment lines below and update Zenodo doi and badge at the top of this file. --> <!-- If you use nf-core/kmermaid for your analysis, please cite it using the following doi: [10.5281/zenodo.XXXXXX](https://doi.org/10.5281/zenodo.XXXXXX) --><!-- TODO nf-core: Add bibliography of tools and data used in your pipeline -->

An extensive list of references for the tools used by the pipeline can be found in the CITATIONS.md file.

You can cite the nf-core publication as follows:

The nf-core framework for community-curated bioinformatics pipelines.

Philip Ewels, Alexander Peltzer, Sven Fillinger, Harshil Patel, Johannes Alneberg, Andreas Wilm, Maxime Ulysse Garcia, Paolo Di Tommaso & Sven Nahnsen.

Nat Biotechnol. 2020 Feb 13. doi: 10.1038/s41587-020-0439-x.

Related Skills

View on GitHub
GitHub Stars22
CategoryDevelopment
Updated22d ago
Forks13

Languages

Nextflow

Security Score

95/100

Audited on Mar 17, 2026

No findings