SkillAgentSearch skills...

16SProcessingPipelinesWithQIIME2

No description available

Install / Use

/learn @BEEGlab/16SProcessingPipelinesWithQIIME2
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

16SProcessingPipelinesWithQIIME2: Mock community analysis scripts

The scripts in this folder are associated with the following manuscript: Dacey, D.P., Chain, F.J.J. Concatenation of paired-end reads improves taxonomic classification of amplicons for profiling microbial communities. BMC Bioinformatics 22, 493 (2021). https://doi.org/10.1186/s12859-021-04410-2

1. Check raw quality scores, trim, merge, and concatenate sequences before Qiime 2 submission.

Calls on Check_sequence_quality.sh

source ./Check_sequence_quality_MultiJobSubmission.sh

2. Remove primers, trims, merges, merges/concats with cutadapt and pandaseq on paired end sequences.

Calls on TrimPrimersLengthQual_MergeConcat_PE.sh

source ./TrimPrimersLengthQual_MergeConcat_PE_MultiJobSubmission.sh

3. From the above primer removed and trimmed PE sequences perform full concatenation. No merging.

Calls on TrimPrimersLengthQual_ALLCONCAT.sh

source ./TrimPrimersLengthQual_ALLCONCAT_MultiJobSubmission.sh

4. FASTQ manifest creation - tells Qiime 2 what sequences to use.

Calls on fastq_manifest_filecreate_singleend.sh, fastq_manifest_filecreate_pairedend.sh, fastq_manifest_singleend.sh, and fastq_manifest_pairedend.sh.

source ./fastq_manifest_creator_MultiJobSubmission.sh

5. Metadata creation - links metadata to sequences.

Calls on Metadata_filecreate.sh and Metadata.sh scripts.

source ./Metadata_MultiJobSubmission.sh

Now that all input files are generated, submit to Qiime 2.

Classifiers for SILVA and Greengenes:

silva-132-99-nb-classifier.qza from https://data.qiime2.org/2020.2/common/silva-132-99-nb-classifier.qza

gg-13-8-99-nb-classifier.qza from https://data.qiime2.org/2020.2/common/gg-13-8-99-nb-classifier.qza

6. Runs submitted to DADA2 as single-end. May be considered single-end either because paired-end sequences were merged previously or they are true single-end (ie just R1 or just R2 sequences).

Calls on Qiime2_quality_dada2_tree_tax_biom_SE.sh.

source ./Qiime2_quality_dada2_tree_tax_biom_SE_MultiJobSubmission.sh

7. Runs submitted to DADA2 as paired-end, meaning these sequences were not merged beforehand and will have DADA2 perform merging. For some runs DADA2 may also perform trimming.

Calls on Qiime2_quality_dada2_tree_tax_biom_PE.sh and Qiime2_PL10_quality_dada2_tree_tax_biom_PE.sh.

source ./Qiime2_quality_dada2_tree_tax_biom_PE_MultiJobSubmission.sh

8. Modify and combine DADA2 stats tables from multiple pipelines. Need to manually modify header if don't want default dada2 stat headers.

Calls on Qiime2_export_tsv_dada2_stats.sh.

source ./Qiime2_export_tsv_dada2_stats_MultiJobSubmission.sh

View on GitHub
GitHub Stars4
CategoryDevelopment
Updated11mo ago
Forks3

Languages

Shell

Security Score

57/100

Audited on Apr 6, 2025

No findings