Skip to main content

deepTools

Tools to process and analyze deep sequencing data

http://deeptools.readthedocs.io

deepTools addresses the challenge of handling the large amounts of data that are now routinelygenerated from DNA sequencing centers. deepTools contains useful modules to process the mapped reads data for multiple quality checks, creating normalized coverage files in standard bedGraph and bigWig file formats, that allow comparison between different files (for example, treatment and control). Finally, using such normalized and standardized files, deepTools can create many publication-ready visualizations to identify enrichments and for functional annotations of the genome.

The module for deepTools parses a number of the text files that deepTools can produce. In particular, the following are supported:

  • bamPEFragmentSize --table
  • bamPEFragmentSize --outRawFragmentLengths
  • estimateReadFiltering
  • plotCoverage ---outRawCounts (as well as the content written normally to the console)
  • plotEnrichment --outRawCounts
  • plotFingerprint --outQualityMetrics --outRawCounts
  • plotPCA --outFileNameData
  • plotCorrelation --outFileCorMatrix
  • plotProfile --outFileNameData

Please be aware that some tools (namely, plotFingerprint --outRawCounts and plotCoverage --outRawCounts) are only supported as of deepTools version 2.6. For earlier output from plotCoverage --outRawCounts, you can use #'chr' 'start' 'end' in search_patterns.yaml (see here for more details). Also for these types of files, you may need to increase the maximum file size supported by MultiQC (log_filesize_limit in the MultiQC configuration file). You can find details regarding the configuration file location here.

Note that sample names are parsed from the text files themselves, they are not derived from file names.

File search patterns

deeptools/bamPEFragmentSizeDistribution:
contents: "#bamPEFragmentSize"
num_lines: 1
deeptools/bamPEFragmentSizeTable:
contents: "\tFrag. Sampled\tFrag. Len. Min.\tFrag. Len. 1st. Qu.\tFrag. Len. Mean\t\
Frag. Len. Median\tFrag. Len. 3rd Qu."
num_lines: 1
deeptools/estimateReadFiltering:
contents: "Sample\tTotal Reads\tMapped Reads\tAlignments in blacklisted regions\t\
Estimated mapped reads"
num_lines: 1
deeptools/plotCorrelationData:
contents: "#plotCorrelation --outFileCorMatrix"
num_lines: 1
deeptools/plotCoverageOutRawCounts:
contents: "#plotCoverage --outRawCounts"
num_lines: 1
deeptools/plotCoverageStdout:
contents: "sample\tmean\tstd\tmin\t25%\t50%\t75%\tmax"
num_lines: 1
deeptools/plotEnrichment:
contents: "file\tfeatureType\tpercent\tfeatureReadCount\ttotalReadCount"
num_lines: 1
deeptools/plotFingerprintOutQualityMetrics:
contents: "Sample\tAUC\tSynthetic AUC\tX-intercept\tSynthetic X-intercept\tElbow\
\ Point\tSynthetic Elbow Point"
num_lines: 1
deeptools/plotFingerprintOutRawCounts:
contents: "#plotFingerprint --outRawCounts"
num_lines: 1
deeptools/plotPCAData:
contents: "#plotPCA --outFileNameData"
num_lines: 1
deeptools/plotProfile:
contents: bin labels
num_lines: 1