Trim paired- or single-end Illumina reads with Trimmomatic.
This App runs Trimmomatic: A flexible read trimming tool for Illumina NGS data. Trimmomatic is written by Anthony Bolger from the Bjorn Usadel Lab.
Trimmomatic version: 0.36 binary, source, and manual. Trimmomatic performs a variety of useful quality control tasks for Illumina paired-end and single-end reads. These tasks are performed as a series of steps. At least one step must be specified, and steps are run in the following order.
This step will remove Illumina adapters from the reads. You need to select one of the predefined adapter sets and set parameters for match criteria. Suggested adapter sequences are provided for TruSeq2 (as used in GAII machines) and TruSeq3 (as used by HiSeq and MiSeq machines), for both single-end and paired-end mode. You can find more information on the adapters in the Trimmomatic manual. If your reads were sequenced on other platforms, you may consider using Cutadapt - v1.18 to manually specify the adapters to remove.
- Adapters: Select one of the predefined adapter sets.
- Seed mismatches: The maximum number of mismatches that will allow a full match to be performed. To speed up the search, short sections of each adapter (upto 16bp) are tested at all possible positions to find "seeds" that trigger a full alignment. This Seed mismatch parameter specifies the allowable mismatches for a seed.
- Simple clip threshold: Minimum score threshold for the adapter to align to the read for clipping to take place. Suggested values are 7-15 depending on the length of the Score; increase 0.6 per match, decrease by Q/10 per mismatch where Q is the quality score.
- Palindrome clip threshold: Specifies how accurate the match between the two 'adapter ligated' reads must be for paired-end palindrome read alignment. Using the same scoring system as above, the pair of reads is aligned and scored. Suggested value around 30.
Removes bases, regardless of their quality, from the end of the read, so that the read has the specified length. Steps performed after Crop might further shorten the read.
- Crop length: Number of bp to keep, from the start of the read.
Removes the specified number of bases, regardless of quality, from the beginning of the read.
- Head crop length: Number of bp of reads to remove from the start of a read.
Leading minimum quality:
Remove low quality bases from the beginning, i.e. bases with a quality score below this threshold.
- Leading minimum quality: Minimum quality score.
Trailing minimum quality:
Remove low quality bases from the end. As long as a base has a quality score below this threshold, the base is removed and the next base is investigated. This approach can be used for removing the special Illumina 'low quality segment' regions (which have a quality score of 2), but we recommend Sliding window for that.
- Trailing minimum quality: Minimum quality score.
Performs sliding window trimming once the average quality within the window falls below specified threshold. By considering multiple bases, a single poor quality base will not cause the removal of high quality data later in the read.
- Sliding window size: Length of window in bp.
- Sliding window minimum quality: The average quality required.
Minimum read length:
This removes reads that fall below the specified minimum length. Reads removed by this step are included in the 'dropped reads' count.
- Minimum read length: Length in bp.
This App creates a new reads library or library set object(s), with the name given by the user. In the case of paired-end reads libraries, separate forward and reverse single end objects are created. All created objects are reported in a table.
- Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30: 2114 2120. doi:10.1093/bioinformatics/btu170 , http://www.ncbi.nlm.nih.gov/pubmed/24695404
Module Commit: af04de41b31471b14e040859d89e617933f07b4a