Skip to the content.

MultiRepMacsChIPSeq - Examples

Home Overview Usage Variations Examples Applications Install

Examples

These are example pipelines for a variety of situations.

There are associated shell scripts for each scenario described below, found in the examples directory.

There are sample Bam files in the data subdirectory. These are heavily sub-sampled and used here only as working files to test that the pipeline is installed correctly and works as expected, i.e. DO NOT expect to interpret the results. Each script should take about 1-3 minutes to complete.

NOTE: The shell scripts included here will not run until the project is built and/or installed. Numerous accessory applications and R packages also need to be installed. See the Install document for details.

To build the project and run the first example pipeline script, run the following:

perl Build.PL
./Build
cd examples
bash ./run_se.sh

NOTE: The shell scripts include additional parameters that are excluded below for simplicity. However, they can certainly be used as templates for actual analysis situations if edited appropriately.

Example ChIPSeq Pipelines

Bam files

The example Bam files are derived from the publication Parnell et al, 2020, PLoS Genetics. The raw Fastq data is archived in GEO Accession GSE158180.

These are two ChIPSeq experiments from Saccharomyces cerevisiae of two different chromatin factors, a corepressor (Tup1) and a histone deacetylase (Rpd3). There are three biological replicates and one Input reference per experiment.

FileName SampleName GEO ID
Tup1_Ch1.bam Tup1-V5_rep1 GSM4794781
Tup1_Ch2.bam Tup1-V5_rep2 GSM4794782
Tup1_Ch3.bam Tup1-V5_rep3 GSM4794783
Rpd3_Ch1.bam Rpd3-V5_rep1 GSM4794784
Rpd3_Ch2.bam Rpd3-V5_rep2 GSM4794785
Rpd3_Ch3.bam Rpd3-V5_rep3 GSM4794786
Tup1_Input.bam Tup1-V5_Input GSM4794788
Rpd3_Input.bam Rpd3-V5_Input GSM4794789

Example Bam files were aligned to SacCer3. Properly paired alignments on chrI were extracted and subsampled to < 5%. Read sequence, qualities, and CIGAR string were condensed to the first aligned base. Read names were condensed to tile coordinates only. Extraneous SAM attributes were excluded.

Example Alignment Scripts

These are example alignment script templates for aligning raw Fastq files to generate alignment Bam files suitable for use in the pipeline. They utilize Novocraft novoAlign for alignment, and run a few metrics applications to determine duplication rate and fragment size.

You can combine the metrics with the script combine_replicate_data.pl. It will look for and read stdout.txt and stderr.txt files.