SkillAgentSearch skills...

Copycat

Copycat is a simple script to capture and bin the read coverage across a genome from a bam file of read alignments. It relies on bedtools to get coverage for each individual nucleotide in the genome using bedtools genomecov, then bins and organizes these coverage values into 10kb bins and outputs the coverage information in a .csv format (for upload to SplitThreader) and in a .seg format (for viewing the copy number profile in IGV)

Install / Use

/learn @MariaNattestad/Copycat
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

Copycat

Copycat is a simple script to capture and bin the read coverage across a genome from a bam file of read alignments.

It relies on bedtools to get coverage for each individual nucleotide in the genome using bedtools genomecov, then bins and organizes these coverage values into 10kb bins and outputs the coverage information in a .csv format (for upload to SplitThreader) and in a .seg format (for viewing the copy number profile in IGV)

Usage: copycat alignments.sorted.bam genome_file output_prefix

  • alignments.sorted.bam:

    • Bam file must be sorted.
    • Remember to filter for mapping quality first if desired: \n\tsamtools view -q -b BAM.bam > FILTERED_BAM.bam
  • genome_file:

    • The genome file should be structured as chromosome_name<tab>chromosome_size
    • For example:
chr1    249250621
chr2    243199373
  • output_prefix:
    • Prefix for all output files. Output files will be output_prefix.coverage.10kb, output_prefix.coverage.10kb.csv, output_prefix.coverage.10kb.for_IGV.seg

Related Skills

View on GitHub
GitHub Stars17
CategoryDevelopment
Updated1y ago
Forks7

Languages

Shell

Security Score

60/100

Audited on Mar 28, 2024

No findings