Postprocessing- ChIP quality assessment¶
The quality of a ChIP experiment largely depends on the specificity of the antibody and the degree of enrichment achieved in the affinity precipitation step. In this section, we will use ChIPQC R package to calculate cross-correlation and FRiP score to assess quality of our ChIP data. FRiP stands for Fraction of Reads in Peaks. As per ENCODE guidelines, FRiP% values around 5% or higher generally reflect successful enrichment (Landt et al., 2012).
Input Data:
Input | Description | Location |
---|---|---|
Alignment file | alignment file in bam format | iplantcollaborative > example_data > chipseq_webinar -> bowtie_output |
Identified peaks | MACS2 output | iplantcollaborative > example_data > chipseq_webinar -> macs2_output |
Run Rstudio-chipqc app in CyVerse DE
- CLick on “Apps” tab in the Discovery Enviornment and search for “rstudio-chipqc”.
- Click on the app icon.
- Change the name of the analysis and output folder as needed or leave for defaults.
- Under “Input” provide the path for the folder for your input files. This will make your input dataset available in rstudio workspace.
- Under “Resource Requirements” request resources as needed or leave for defaults
- Click Launch Analysis. You will receive a notification that the job has been submitted and running. Click on ‘Access your analysis here’ link.
- Once the analysis is launched, provide username (rstudio) and password (rstudio1) for rstudio. Analysis may take few minutes to launch, depends on the size of your input data.
- Use the following script to calculate FRiP percentage and cross-correlation for one sample using ChIPQCsample function. Check ChIPQC package documentation here for more details.
library(ChIPQC)
bamFiles <- 'bowtie_chip_sorted.bam'
mypeaks <- read.delim("ecoli_peaks.narrowPeak", header=F)
exampleExp <- ChIPQCsample(bamFiles,peaks=mypeaks)
QCmetrics(exampleExp)
plotFrip(exampleExp)
frip(exampleExp)
plotCC(exampleExp)
Note
ChIPQC package accepts sorted BAM files as input. A sorted BAM file is a compressed binary version of a SAM file that has reads sorted by coordinates. Reads from the beginning of the first chromosome are found first in the coordinate sorted alignment file. You can use Samtools SAM to sorted BAM App in the DE to convert SAM to sorted BAM files.
Fix or improve this documentation
- Search for an answer: CyVerse Learning Center
- Ask us for help: click on the lower right-hand side of the page
- Report an issue or submit a change: Github Repo Link
- Send feedback: Tutorials@CyVerse.org