MultiRepMacsChIPSeq - run_DESeq2
| Home | Overview | Usage | Variations | Applications | Install |
run_DESeq2.R
This script will run a basic DESeq2 differential analysis between two conditions to identify differential (or enriched in the case of ChIP and Input) ChIPseq regions.
It requires an input text file with chromosome, start, and stop columns (or a coordinate name string), along with columns of fragment counts for both ChIP1 and ChIP2 (or Input reference) sample replicates. DESeq2 requires at least 2 (ideally more) replicates per condition to estimate variance for normalization and significance testing.
Count data may be already normalized to a uniform genomic depth, in which case it should indicated as such and SizeFactors will be set to 1. Otherwise counts will be normalized by DESeq2. WARNING: Default normalization by DESeq2 may potentially erase any enrichment signal.
A sample condition file is required, consisting of two columns,
unique sample identifiers and group names (conditions). If desired,
a third column could be added as an additional batch factor. Only
two groups are used in the contrast, defined with the --first and
--second options; any additional groups and samples are ignored.
Results are filtered at the indicated threshold or alpha level (False Discovery Rate). Significant regions are written to a text file and respective bed files. Log2 Fold Changes may be shrunken to reduce the effect of low count peaks. If additional filtering on Log2FC will be performed, the values should be shrunk.
USAGE:
run_DESeq2.R [options]
OPTIONS:
-c COUNT, --count=COUNT
Input file containing count data
-a SAMPLE, --sample=SAMPLE
Sample condition file containing identifiers and conditions
-o OUTPUT, --output=OUTPUT
Output file basename, default 'first_second' names
-f FIRST, --first=FIRST
Name of first ChIP condition
-s SECOND, --second=SECOND
Name of second ChIP condition or Input reference
-b, --batch
Use third column in customized sample table as batch factor
-t THRESHOLD, --threshold=THRESHOLD
Threshold adjusted p-value (alpha) for significance (FDR), default 0.1
--norm
Input counts are already normalized
--all
Report all windows, not just significant
--shrink
Shrink Log2FC values for low count peaks
-h, --help
Show this help message and exit