SkillAgentSearch skills...

MultiAssayExperiment.TCGA

The pipeline to produce TCGA data as MultiAssayExperiment objects

Install / Use

/learn @waldronlab/MultiAssayExperiment.TCGA
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

MultiAssayExperiment.TCGA

MultiAssayExperiment.TCGA is the pipeline package for building and uploading MultiAssayExperiment datasets from the GDAC Firehose pipeline as obtained from RTCGAToolbox.

There are several steps to rebuild datasets for 33 cancer types.

Generally, users should use the packaged product of the pipeline:

curatedTCGAData

For those looking to rebuild the pipeline, there are several steps that need to be followed:

  1. Create all data directories required (dataDirectories)
  2. Obtain all clinical and assay data from RTCGAToolbox (saveRTCGAdata)
  3. Introduce additional clinical variables to all clinical datasets
  4. Download and integrate subtype curation data from Dropbox
  5. Generate and serialize data maps, providing relationships between samples and patients
  6. Update metadata and upload to ExperimentHub (buildMultiAssayExperiments)

These functions can be found in the data-raw, inst/scripts, and R folders.

NOTE. Include AWS CLI authentication credentials in the ~/.Renviron file. It should include three key:value pairs, AWS_SESSION_TOKEN, AWS_SECRET_ACCESS_KEY, and AWS_ACCESS_KEY_ID

Related Skills

View on GitHub
GitHub Stars7
CategoryDevelopment
Updated10mo ago
Forks4

Languages

R

Security Score

67/100

Audited on May 14, 2025

No findings