CyVerse_logo

Home_Icon Learning Center Home

Postprocessing- ChIP quality assessment

The quality of a ChIP experiment largely depends on the specificity of the antibody and the degree of enrichment achieved in the affinity precipitation step. In this section, we will use ChIPQC R package to calculate cross-correlation and FRiP score to assess quality of our ChIP data. FRiP stands for Fraction of Reads in Peaks. As per ENCODE guidelines, FRiP% values around 5% or higher generally reflect successful enrichment (Landt et al., 2012).


Input Data:

Input Description Location
Alignment file alignment file in bam format iplantcollaborative > example_data > chipseq_webinar -> bowtie_output
Identified peaks MACS2 output iplantcollaborative > example_data > chipseq_webinar -> macs2_output

Run Rstudio-chipqc app in CyVerse DE

  1. CLick on “Apps” tab in the Discovery Enviornment and search for “rstudio-chipqc”.
  2. Click on the app icon.

chipqc_icon

  1. Change the name of the analysis and output folder as needed or leave for defaults.
  2. Under “Input” provide the path for the folder for your input files. This will make your input dataset available in rstudio workspace.
  3. Under “Resource Requirements” request resources as needed or leave for defaults
  4. Click Launch Analysis. You will receive a notification that the job has been submitted and running. Click on ‘Access your analysis here’ link.
  5. Once the analysis is launched, provide username (rstudio) and password (rstudio1) for rstudio. Analysis may take few minutes to launch, depends on the size of your input data.
  6. Use the following script to calculate FRiP percentage and cross-correlation for one sample using ChIPQCsample function. Check ChIPQC package documentation here for more details.
library(ChIPQC)

bamFiles <- 'bowtie_chip_sorted.bam'
mypeaks <- read.delim("ecoli_peaks.narrowPeak", header=F)
exampleExp <- ChIPQCsample(bamFiles,peaks=mypeaks)

QCmetrics(exampleExp)
plotFrip(exampleExp)
frip(exampleExp)
plotCC(exampleExp)

Note

ChIPQC package accepts sorted BAM files as input. A sorted BAM file is a compressed binary version of a SAM file that has reads sorted by coordinates. Reads from the beginning of the first chromosome are found first in the coordinate sorted alignment file. You can use Samtools SAM to sorted BAM App in the DE to convert SAM to sorted BAM files.


Fix or improve this documentation


Home_Icon Learning Center Home