Flexible read trimming tool for illumina ngs data
Paired End Mode:
TrimmomaticPE [-threads threads] [-phred33 | -phred64] [-trimlog logFile] paired output 1 unpaired output 1 paired output 2 unpaired output 2 step 1 ...
Single End Mode:
TrimmomaticSE [-threads threads] [-phred33 | -phred64] [-trimlog logFile] output step 1 ...
Trimmomatic performs a variety of useful trimming tasks for illumina paired-end and single ended data.The selection of trimming steps and their associated parameters are supplied on the command line.
If no quality score is specified, phred-64 is the default.
Specifying a trimlog file creates a log of all read trimmings, indicating the following details:
the read name
the surviving sequence length
the location of the first surviving base, aka. the amount trimmed from the start
the location of the last surviving base in the original read
the amount trimmed from the end
Multiple steps can be specified as required, by using additional arguments at the end.
Most steps take one or more settings, delimited by \':\' (a colon)
ILLUMINACLIP:<fastaWithAdaptersEtc>:<seed mismatches>:<palindrome clip threshold>:<simple clip threshold> fastaWithAdaptersEtc: specifies the path to a fasta file containing all the adapters, PCR sequences etc. The naming of the various sequences within this file determines how they are used. See below. seedMismatches: specifies the maximum mismatch count which will still allow a full match to be performed palindromeClipThreshold: specifies how accurate the match between the two \'adapter ligated\' reads must be for PE palindrome read alignment. simpleClipThreshold: specifies how accurate the match between any adapter etc. sequence must be against a read. . The adapters are installed on the Debian system at /usr/share/trimmomatic/. SLIDINGWINDOW:<windowSize>:<requiredQuality> windowSize: specifies the number of bases to average across requiredQuality: specifies the average quality required. LEADING:<quality> quality: Specifies the minimum quality required to keep a base. TRAILING:<quality> quality: Specifies the minimum quality required to keep a base. CROP:<length> length: The number of bases to keep, from the start of the read. HEADCROP:<length> length: The number of bases to remove from the start of the read. MINLENGTH:<length> length: Specifies the minimum length of reads to be kept.
Trimming occurs in the order which the steps are specified on the command line. It is recommended in most cases that adapter clipping, if required, is done as early as possible.
TrimmomaticPE s_1_1_sequence.txt.gz s_1_2_sequence.txt.gz lane1_forward_paired.fq.gz lane1_forward_unpaired.fq.gz lane1_reverse_paired.fq.gz lane1_reverse_unpaired.fq.gz ILLUMINACLIP:/usr/share/trimmomatic/illuminaClipping.fa:2:40:15 LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36
This will perform the following:
Remove adapters Remove leading low quality or N bases (below quality 3) Remove trailing low quality or N bases (below quality 3) Scan the read with a 4-base wide sliding window, cutting when the average quality per base drops below 15 Drop reads below the 36 bases long
The equivalent for single-ended reads is:
TrimmomaticSE s_1_1_sequence.txt.gz lane1_forward.fq.gz ILLUMINACLIP:/usr/share/trimmomatic/illuminaClipping.fa:2:40:15 LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36