5 skills found
natapol / Kitsunea toolkit for evaluation of the lenght of k-mer in a given genome dataset for alignment-free phylogenimic analysis.
olgabot / Kvectorkvector is a small utility for converting motifs to kmer vectors to compare motifs of different lengths
davidrequena / MultifastatsMultifastats: Multi-Fasta Sequence Stats. Free python-based program that, from a set of a set of 'fasta' sequences (as group or individually), produces basic outputs and statistics (N50, %GC, molecular weight, cut in kmers and concatenate sequences using defined spacers). And also allows to apply some filters and select subgroups by length and/or type. This runs in Windows, Linux and Mac OS.
suchapalaver / Fastas2kmersA naive implementation of a k-mer counter in Python, takes fasta files and k-mer length requests and outputs all k-mers of length k, their reverse complement, and their frequency within their respective sequence for each sequence in the file.
jstjohn / JellyfishFork of the jellyfish kmer counter. Here is the description copied from their site: JELLYFISH is a tool for fast, memory-efficient counting of k-mers in DNA. A k-mer is a substring of length k, and counting the occurrences of all such substrings is a central step in many analyses of DNA sequence. JELLYFISH can count k-mers using an order of magnitude less memory and an order of magnitude faster than other k-mer counting packages by using an efficient encoding of a hash table and by exploiting the "compare-and-swap" CPU instruction to increase parallelism. JELLYFISH is a command-line program that reads FASTA and multi-FASTA files containing DNA sequences. It outputs its k-mer counts in an binary format, which can be translated into a human-readable text format using the "jellyfish stats" command. See the documentation below for more details.