Generated May 29, 2024

Thirteen draft genome assemblies of Bacillus spp. isolated from HLB-impacted citrus trees

Abstract

We report the draft genome assembly, annotation and phylogenetic placement of 13 Bacillus spp. isolated from citrus groves under high (Florida) or low (California) Huanglongbing (HLB) disease pressure.

Introduction

Bacillus is a well-studied bacterial genus of Gram-positive and spore-forming bacteria, commonly soil-borne and plant-associated [1-3]. Many Bacillus species produce a range of secondary metabolites and volatile organic compounds that are beneficial to plants through various mechanisms, such as pathogen suppression or plant growth promotion [4-7]. Genome availability is the initial step in unlocking the beneficial metabolic potential of plant-associated Bacillus spp. by enabling the identification of bioactive compound-producing gene clusters. Here we announce the draft genome assemblies of thirteen Bacillus spp. isolated either from the root, leaf or stem tissue of citrus trees in areas of high (Florida) or low (California) Huanglongbing (HLB) disease pressure.

Publication

The publication by Vieira et al. can be found here: in preparation for submission to Microbiology Resource Announcements.

External Data Availability

The Sequence Read Archive and genomic data has been deposited in GenBank BioProject under no. PRJNA1046128 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA1046128). The version described in this paper is the first version.

Table of Contents

  1. Background and Experimental Methods
  2. Import
  3. Genome Assembly
  4. Genome Annotation
  5. Taxonomic Identification
  6. References
Narrative created by: Flavia Campos Vieira

1. Background and Experimental Methods

Sample Collection

Sampling of plant material and bacterial isolation are described in [8] as part of a bioprospecting effort designed to generate a citrus-associated microbial repository. Briefly, leaves, stems, and roots from citrus trees were collected from citrus orchards in Florida and in California. Each sample was macerated with 1X PBS and 100 ul of the slurry was plated on tryptic soy agar (TSA) and potato dextrose agar plates with 0.1 g/liter tetracycline hydrochloride (PDA). Plates were incubated at 28C for 4 days. The microbial consortia on each plate were scrapped after adding 1 ml of 1X PBS to generate bulk culture tubes. Isolates were recovered from bulk culture tubes by plating on TSA and PDA and incubating plates at 28C for up to 5 days. Single colony of bacterial isolates were streaked onto TSA plates until pure cultures were obtained. Long-term stocks of each isolate were made in 15% glycerol (final concentration) and stored at -80C.

Isolation

For each isolate, DNA was extracted from a single colony grown overnight at 28C in tryptic soy broth (TSB), using the Wizard® Genomic DNA Purification Kit (Promega Corporation, Madison, Wisconsin, USA), following manufacturer’s instructions for Gram-positive bacteria.

Genome Sequencing

Library preparation and sequencing were performed by SeqCenter (Pittsburgh, Pennsylvania, USA). Sample libraries were prepared using the Illumina DNA Prep kit and IDT 10 bp unique dual indexing (UDI) indices, and sequenced on an Illumina NovaSeq 6000, producing 2x151 bp reads. Demultiplexing, quality control and adapter trimming was performed with bcl-convert (v4.0.3) [9]. The total of paired-end raw reads recovered ranged from 2,560,758 to 13,949,264.

2. Import

Raw reads were imported through the staging area into the narrative as non-interleaved reads.

from biokbase.narrative.jobs.appmanager import AppManager
AppManager().run_app_batch(
    [{
        "app_id": "kb_uploadmethods/import_fastq_noninterleaved_as_reads_from_staging",
        "tag": "release",
        "version": "5b9346463df88a422ff5d4f4cba421679f63c73f",
        "params": [{
            "fastq_fwd_staging_file_name": "CB0069_S156_R1_001.fastq.gz",
            "fastq_rev_staging_file_name": "CB0069_S156_R2_001.fastq.gz",
            "name": "CB0069v2_reads"
        }, {
            "fastq_fwd_staging_file_name": "CB0742_S157_R1_001.fastq.gz",
            "fastq_rev_staging_file_name": "CB0742_S157_R2_001.fastq.gz",
            "name": "CB0742_reads"
        }],
        "shared_params": {
            "sequencing_tech": "Illumina",
            "single_genome": 1,
            "read_orientation_outward": 0,
            "insert_size_std_dev": None,
            "insert_size_mean": None
        }
    }],
    cell_id="e77909e3-3e87-4be5-b112-54d13dfcae65",
    run_id="531891a7-d530-43d0-aa32-4c30da6b3fd7"
)
from biokbase.narrative.jobs.appmanager import AppManager
AppManager().run_app_batch(
    [{
        "app_id": "kb_uploadmethods/import_fastq_noninterleaved_as_reads_from_staging",
        "tag": "release",
        "version": "5b9346463df88a422ff5d4f4cba421679f63c73f",
        "params": [{
            "fastq_fwd_staging_file_name": "CB0069_1.fq.gz",
            "fastq_rev_staging_file_name": "CB0069_2.fq.gz",
            "name": "CB0069_reads"
        }],
        "shared_params": {
            "sequencing_tech": "Illumina",
            "single_genome": 1,
            "read_orientation_outward": 0,
            "insert_size_std_dev": None,
            "insert_size_mean": None
        }
    }],
    cell_id="ade04936-e295-489b-87b2-a5a9fd34bf2d",
    run_id="918efa19-54e2-47c1-a347-d6fb677a4e45"
)
from biokbase.narrative.jobs.appmanager import AppManager
AppManager().run_app_batch(
    [{
        "app_id": "kb_uploadmethods/import_fastq_noninterleaved_as_reads_from_staging",
        "tag": "release",
        "version": "5b9346463df88a422ff5d4f4cba421679f63c73f",
        "params": [{
            "fastq_fwd_staging_file_name": "893_S98_R1_001.fastq.gz",
            "fastq_rev_staging_file_name": "893_S98_R2_001.fastq.gz",
            "name": "CB893_reads"
        }, {
            "fastq_fwd_staging_file_name": "676_S96_R1_001.fastq.gz",
            "fastq_rev_staging_file_name": "676_S96_R2_001.fastq.gz",
            "name": "CB676_reads"
        }],
        "shared_params": {
            "sequencing_tech": "Illumina",
            "single_genome": 1,
            "read_orientation_outward": 0,
            "insert_size_std_dev": None,
            "insert_size_mean": None
        }
    }],
    cell_id="81509afe-ef90-4bd5-b7c8-0da15afcac08",
    run_id="72bf7f52-a547-4210-bf69-8385bff2b906"
)
from biokbase.narrative.jobs.appmanager import AppManager
AppManager().run_app_batch(
    [{
        "app_id": "kb_uploadmethods/import_fastq_noninterleaved_as_reads_from_staging",
        "tag": "release",
        "version": "5b9346463df88a422ff5d4f4cba421679f63c73f",
        "params": [{
            "fastq_fwd_staging_file_name": "Copy of cb0027_1.fq.gz",
            "fastq_rev_staging_file_name": "Copy of cb0027_2.fq.gz",
            "name": "CB0027_reads"
        }, {
            "fastq_fwd_staging_file_name": "Copy of cb0021_1.fq.gz",
            "fastq_rev_staging_file_name": "Copy of cb0021_2.fq.gz",
            "name": "CB0021_reads"
        }, {
            "fastq_fwd_staging_file_name": "Copy of cb0019_1.fq.gz",
            "fastq_rev_staging_file_name": "Copy of cb0019_2.fq.gz",
            "name": "CB0019_reads"
        }],
        "shared_params": {
            "sequencing_tech": "Illumina",
            "single_genome": 1,
            "read_orientation_outward": 0,
            "insert_size_std_dev": None,
            "insert_size_mean": None
        }
    }],
    cell_id="3a92803f-718b-4ab2-9950-7f66ee3cadbd",
    run_id="594621e9-628f-432a-b0b0-268dfaaa24ef"
)
from biokbase.narrative.jobs.appmanager import AppManager
AppManager().run_app_batch(
    [{
        "app_id": "kb_uploadmethods/import_fastq_noninterleaved_as_reads_from_staging",
        "tag": "release",
        "version": "5b9346463df88a422ff5d4f4cba421679f63c73f",
        "params": [{
            "fastq_fwd_staging_file_name": "CB909_S144_R1_001.fastq.gz",
            "fastq_rev_staging_file_name": "CB909_S144_R2_001.fastq.gz",
            "name": "CB909"
        }, {
            "fastq_fwd_staging_file_name": "CB687_S152_R1_001.fastq.gz",
            "fastq_rev_staging_file_name": "CB687_S152_R2_001.fastq.gz",
            "name": "CB687"
        }, {
            "fastq_fwd_staging_file_name": "CB912_S156_R1_001.fastq.gz",
            "fastq_rev_staging_file_name": "CB912_S156_R2_001.fastq.gz",
            "name": "CB912"
        }],
        "shared_params": {
            "sequencing_tech": "Illumina",
            "single_genome": 1,
            "read_orientation_outward": 0,
            "insert_size_std_dev": None,
            "insert_size_mean": None
        }
    }],
    cell_id="538b7113-c0a0-4bae-811d-fbbde312b5fc",
    run_id="ca75ce03-d77d-4918-9de0-a83f872c4612"
)
from biokbase.narrative.jobs.appmanager import AppManager
AppManager().run_app_batch(
    [{
        "app_id": "kb_uploadmethods/import_fastq_noninterleaved_as_reads_from_staging",
        "tag": "release",
        "version": "5b9346463df88a422ff5d4f4cba421679f63c73f",
        "params": [{
            "fastq_fwd_staging_file_name": "CB729_S153_R1_001.fastq.gz",
            "fastq_rev_staging_file_name": "CB729_S153_R2_001.fastq.gz",
            "name": "CB729_reads"
        }, {
            "fastq_fwd_staging_file_name": "CB902_S142_R1_001.fastq.gz",
            "fastq_rev_staging_file_name": "CB902_S142_R2_001.fastq.gz",
            "name": "CB902_reads"
        }, {
            "fastq_fwd_staging_file_name": "CB904_S143_R1_001.fastq.gz",
            "fastq_rev_staging_file_name": "CB904_S143_R2_001.fastq.gz",
            "name": "CB904_reads"
        }],
        "shared_params": {
            "sequencing_tech": "Illumina",
            "single_genome": 1,
            "read_orientation_outward": 0,
            "insert_size_std_dev": None,
            "insert_size_mean": None
        }
    }],
    cell_id="d9a39030-1c57-400b-9577-b1dd20961451",
    run_id="5cc84383-2a8d-40ae-8b2e-efd135c67424"
)

3. Genome Assembly

QC

Sequence analysis from quality control to annotation was performed on the KBase web service [10] and the publicly available narrative containing all analyses and data can be found at https://narrative.kbase.us/narrative/157793. Default parameters were used except where otherwise noted. Quality of raw reads were assessed using FastQC (v0.12.1) [11] and quality control was performed using JGI RQCFilter pipeline BBTools (v38.22) [12] and PRINSEQ (v0.20.4) [13].

Assembly

Genomes were assembled using SPAdes (v3.15.3) [14] with k-mer sizes set for 21, 33, 55, 77, 91, 111. The quality of assemblies was assessed with QUAST (v4.4) [15] and CheckM (v1.0.18) [16], demonstrating that the number of contigs in the resulting assemblies ranged from 14 to 87 and the N50 value ranged from 164,068 to 1,085,110. The completeness of the genomes was between 99.38 to 99.59 and the GC content percentage ranged from 34.95 to 46.26.

A quality control application for high throughput sequence data.
This app completed without errors in 2m 31s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB0069_reads_157793_31_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB0069_reads_157793_31_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1m 36s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB893_reads_157793_29_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB893_reads_157793_29_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1m 14s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB676_reads_157793_27_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB676_reads_157793_27_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 3m 1s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB0027_reads_157793_20_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB0027_reads_157793_20_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 2m 23s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB0019_reads_157793_17_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB0019_reads_157793_17_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 2m 16s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB0021_reads_157793_16_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB0021_reads_157793_16_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1m 17s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB902_reads_157793_12_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB902_reads_157793_12_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1m 12s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB729_reads_157793_10_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB729_reads_157793_10_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1m 18s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB904_reads_157793_8_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB904_reads_157793_8_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1m 16s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB912_reads_157793_7_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB912_reads_157793_7_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1m 15s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB909_reads_157793_5_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB909_reads_157793_5_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1m 12s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB687_reads_157793_3_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB687_reads_157793_3_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1m 4s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB0742_reads_157793_292_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB0742_reads_157793_292_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 58s.
Summary
====/kb/module/work/tmp/5238229f-4196-44e0-9233-2e68e1418752.inter.fastq==== reads 13465950 len 150 len mean 150.0000 len stdev 0.0000 len min 150 phred 33 window-size 2000000 cycle-max 35 dups 1407904 %dup 10.4553 unique-dup seq 383834 min dup count 2 dup seq 1 1213 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 181 CTTCGATTTCCTGCCCCATGTTATTCATACGAAGA dup seq 3 71 CTTTGTCTTCTTTTCAGCGTGATAGAAAACCTTTG dup seq 4 70 GAAGAACCTTACCAGGTCTTGACATCCTCTGACAA dup seq 5 70 CAAAGGTTTTCTATCACGCTGAAAAGAAGACAAAG dup seq 6 64 AAAGGGCGCACGGTGGATGCCTTGGCACTAGGAGC dup seq 7 64 GAAGGCGACTCTCTGGTCTGTAACTGACGCTGAGG dup seq 8 63 AGCTTCGGTGATACGTTTAGCCCCGGTACATTTTC dup seq 9 60 TAACGCATTAAGCACTCCGCCTGGGGAGTACGGTC dup seq 10 59 CCTTTGTCTTCTTTTCAGCGTGATAGAAAACCTTT dup mean 4.6680 dup stddev 3.9646 qual min 2 qual max 37 qual mean 36.3173 qual stdev 3.5160 %A 29.9312 %C 20.4289 %G 20.3714 %T 29.2670 %N 0.0015 total bases 2019892500
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 37s.
Summary
====/kb/module/work/tmp/04d69eff-ef69-43b7-8695-339e8207575f.inter.fastq==== reads 5694468 len 149 len mean 122.4740 len stdev 31.8765 len min 35 phred 33 window-size 2000000 cycle-max 35 dups 1701301 %dup 29.8764 unique-dup seq 379544 min dup count 2 dup seq 1 2121 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 269 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN dup seq 3 213 GTTGTACGGTGATCTAAATAGAAAAAACCTTCTGG dup seq 4 157 CCATAGAGGTATCCGATAAACATCATTTTAAATAA dup seq 5 156 GGATATGAGAAGGCAGACCCGGGGAACTGAAACAT dup seq 6 154 ATATAAGACCTTGGGTCTTATAAACAGAACGTTCC dup seq 7 146 CCTCTATGGTATCCGTTCAGAAAGACAACTTGAAA dup seq 8 144 CCTTAAAGCCAATGCCAACAGGCATAAATACACAA dup seq 9 136 GTAGTAAGGTTTAACCTTTTCAATGATGAAAGAGA dup seq 10 134 GGCATCCACCGTGCGCCCTTTCTAACTTAACCGTT dup mean 5.4825 dup stddev 5.4147 qual min 2 qual max 36 qual mean 34.4302 qual stdev 4.2649 %A 28.4702 %C 21.4489 %G 21.6440 %T 28.3917 %N 0.0451 total bases 697424392
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 26s.
Summary
====/kb/module/work/tmp/d4b2b297-7458-4288-bf85-a9139d42aaed.inter.fastq==== reads 3236434 len 149 len mean 130.5361 len stdev 27.3310 len min 35 phred 33 window-size 2000000 cycle-max 35 dups 1083619 %dup 33.4819 unique-dup seq 332997 min dup count 2 dup seq 1 1619 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 1180 GTATATAGAGTGTGTGTAGTTGTTTGTCTTCAATG dup seq 3 761 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN dup seq 4 525 ATCATATACACTCGTCCTCTATTTGTTATATAGAA dup seq 5 525 GTATAGAAGGATTGGTTGACACTACTTACAACATG dup seq 6 490 CTATCACTCTCTATCTATTACATACCCTCTCATTA dup seq 7 481 CTCTATATACAAGCCACCTGTCACAGACCATACAC dup seq 8 420 ATCTATTACATACCCTCTCATTACTCACACCTTCC dup seq 9 402 GACTAACCCTGAGCGGACGAGCCTTCCTCAGGAAA dup seq 10 398 GTATACTCCTTTAGGTGTATGGTCTGTGACAGGTG dup mean 4.2541 dup stddev 8.1357 qual min 2 qual max 36 qual mean 34.5472 qual stdev 4.0442 %A 31.4911 %C 18.3685 %G 18.4237 %T 31.6638 %N 0.0528 total bases 422471356
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 58s.
Summary
====/kb/module/work/tmp/0e8cfffe-0e14-48e5-b332-0ced99248d2a.inter.fastq==== reads 13949264 len 150 len mean 150.0000 len stdev 0.0000 len min 150 phred 33 window-size 2000000 cycle-max 35 dups 1423068 %dup 10.2017 unique-dup seq 378659 min dup count 2 dup seq 1 1332 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 117 GAAGCCGTAAGGCGAAGTATAGGGGCTGACGCCTG dup seq 3 91 AAGGGGCGGAAACCCCCTAACACTTAGCACTCATC dup seq 4 88 GACCGTACTCCCCAGGCGGAGTGCTTAATGCGTTA dup seq 5 85 TAGGAGGTAACAGCCCTGTAGTCAAAACTTCGTTC dup seq 6 82 TAGGGAAGAACAAGTGCCGTTCAAATAGGGCGGCA dup seq 7 82 GGAAACCCCCTAACACTTAGCACTCATCGTTTACG dup seq 8 82 TGGGCTACACACGTGCTACAATGGACAGAACAAAG dup seq 9 79 GAAGGCGACTCTCTGGTCTGTAACTGACGCTGAGG dup seq 10 79 TAGGAGTCTGGGCCGTGTCTCAGTCCCAGTGTGGC dup mean 4.7582 dup stddev 4.6951 qual min 2 qual max 37 qual mean 36.2592 qual stdev 3.6760 %A 27.3602 %C 22.9416 %G 23.0180 %T 26.6787 %N 0.0015 total bases 2092389600
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 57s.
Summary
====/kb/module/work/tmp/493f1412-e263-4be5-a4ae-d49097d08506.inter.fastq==== reads 12000690 len 150 len mean 150.0000 len stdev 0.0000 len min 150 phred 33 window-size 2000000 cycle-max 35 dups 1037760 %dup 8.6475 unique-dup seq 333057 min dup count 2 dup seq 1 931 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 80 GAAGCCGTAAGGCGAAGTATAGGGGCTGACGCCTG dup seq 3 62 GACCGTACTCCCCAGGCGGAGTGCTTAATGCGTTA dup seq 4 57 GAAGAGGTCACACCCGTTCCCATGCCGAACACGGA dup seq 5 55 AGGGACGAAAGTCGGGCTTAGTGATCCGGTGGTTC dup seq 6 55 TAGGACCGTTATAGTTACGGCCGCCGTTTACTGGG dup seq 7 53 TGACGACAACCATGCACCACCTGTCACTCTGCCCC dup seq 8 53 TAGCACTCATCGTTTACGGCGTGGACTACCAGGGT dup seq 9 52 GAATGATTTCCAACCATTCTGAGGGAACCTTTGGG dup seq 10 52 GGAGCCGCAGCGAAAGCGAGTCTGAATAGGGCGAA dup mean 4.1159 dup stddev 3.3307 qual min 2 qual max 37 qual mean 36.2038 qual stdev 3.7935 %A 27.2887 %C 22.9167 %G 23.0087 %T 26.7832 %N 0.0027 total bases 1800103500
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 55s.
Summary
====/kb/module/work/tmp/75c8ffeb-46d9-482c-a4cc-ecbfac81e4a6.inter.fastq==== reads 10508320 len 150 len mean 150.0000 len stdev 0.0000 len min 150 phred 33 window-size 2000000 cycle-max 35 dups 1108378 %dup 10.5476 unique-dup seq 374426 min dup count 2 dup seq 1 1124 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 88 TAGGAGGTAACAGCCCTGTAGTCAAAACTTCGTTC dup seq 3 71 GAAGCCGTAAGGCGAAGTATAGGGGCTGACGCCTG dup seq 4 64 TAGGGGGTTTCCGCCCCTTAGTGCTGCAGCTAACG dup seq 5 63 AAGGGGCGGAAACCCCCTAACACTTAGCACTCATC dup seq 6 62 TGGAAGGGCCATCGCTCAACGGATAAAAGCTACCC dup seq 7 59 AGGGGCATGATGATTTGACGTCATCCCCACCTTCC dup seq 8 58 TAGCGAGGTGAGAGGTGCCCGTACCGCAAACCGAC dup seq 9 57 TGGGTAGATCACCTGGTTTCGGGTCTACGACCACG dup seq 10 56 TGGAGCTGACAGATACTAATCGATCGAGGACTTAA dup mean 3.9602 dup stddev 3.2640 qual min 2 qual max 37 qual mean 36.3024 qual stdev 3.5652 %A 27.4332 %C 22.8422 %G 22.9091 %T 26.8140 %N 0.0015 total bases 1576248000
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 26s.
Summary
====/kb/module/work/tmp/bc385bd8-8e7d-4c09-bf77-3d8d64383931.inter.fastq==== reads 3495930 len 151 len mean 147.0869 len stdev 13.9141 len min 35 phred 33 window-size 2000000 cycle-max 35 dups 1693623 %dup 48.4456 unique-dup seq 442171 min dup count 2 dup seq 1 2387 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 185 GCATATGGCTGCCCATTCTTCTTGCGCTTTTTCAC dup seq 3 184 GTGTAAAGGCACAAGGGAGCTTGACTGCGAGACCT dup seq 4 173 CCTTAGGATTCTCTCCTCGCCTACCTGTGTCGGTT dup seq 5 171 CCTTTACACTCTGCGAATGATTTCCAACCATTCTG dup seq 6 170 GGCATCCACCGTGCGCCCTTTCTAACTTAACCGTT dup seq 7 162 GGATTAGATACCCTGGTAGTCCACGCCGTAAACGA dup seq 8 157 GTGGATGCCTTGGCACTAGGAGCCGATGAAGGACG dup seq 9 143 ATCCTAAGGTGATCGAGAGAACTCTCGTTAAGGAA dup seq 10 142 ATCTAATCCTGTTCGCTCCCCACGCTTTCGCTCCT dup mean 4.8302 dup stddev 5.4391 qual min 2 qual max 34 qual mean 33.3724 qual stdev 3.2522 %A 26.3869 %C 23.2506 %G 23.4724 %T 26.8310 %N 0.0591 total bases 514205429
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 25s.
Summary
====/kb/module/work/tmp/546bb049-f165-4ddf-ad2c-704d6128790e.inter.fastq==== reads 2560758 len 149 len mean 138.9649 len stdev 22.0908 len min 35 phred 33 window-size 2000000 cycle-max 35 dups 966860 %dup 37.7568 unique-dup seq 384253 min dup count 2 dup seq 1 1174 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 223 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN dup seq 3 118 GAGTATGAGAAGGCATACCCGGGGAACTGAAACAT dup seq 4 108 TCCTTGAACCATGCGGTTCAAGGAACTATCCGGTA dup seq 5 108 CTCATACTCTATGTATTCAAGTATGGATACTACTC dup seq 6 84 ATTCAGACTCGCTTTCGCTGCGGCTCCGCATCTTC dup seq 7 84 GTGGATGCCTTGGCACTAGGAGCCGATGAAGGACG dup seq 8 82 CTCTTTAAATGGTGGCTGCTTCTAAGCCAACATCC dup seq 9 77 GGATTAGATACCCTGGTAGTCCACGCCGTAAACGA dup seq 10 76 ATATAATATAGATAGTCAATTTGTCATGAGGAGTG dup mean 3.5162 dup stddev 3.2272 qual min 2 qual max 36 qual mean 34.9483 qual stdev 3.1081 %A 29.2954 %C 20.5136 %G 20.7801 %T 29.3806 %N 0.0302 total bases 355855442
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 25s.
Summary
====/kb/module/work/tmp/b2cf33a6-4faf-42aa-a751-2f89e7205c56.inter.fastq==== reads 3654122 len 151 len mean 147.0456 len stdev 13.9311 len min 35 phred 33 window-size 2000000 cycle-max 35 dups 1818880 %dup 49.7761 unique-dup seq 435678 min dup count 2 dup seq 1 2177 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 164 TGTATATACAGAGTAAACTTAAAATGTTTTTGAAC dup seq 3 147 GCATATGGCTGCCCATTCTTCTTGCGCTTTTTCAC dup seq 4 144 GGATTAGATACCCTGGTAGTCCACGCCGTAAACGA dup seq 5 141 GTGTAAAGGCACAAGGGAGCTTGACTGCGAGACCT dup seq 6 141 GTATATACATGTCTGATTTTAAAAAGAATTCTTTA dup seq 7 137 CCTTAGGATTCTCTCCTCGCCTACCTGTGTCGGTT dup seq 8 135 GTGGATGCCTTGGCACTAGGAGCCGATGAAGGACG dup seq 9 127 ATGTACACCTCCTTTCCTATAACTCTATTATAGCA dup seq 10 127 CCCTAACACTTAGCACTCATCGTTTACGGCGTGGA dup mean 5.1748 dup stddev 5.9814 qual min 2 qual max 34 qual mean 33.3982 qual stdev 3.1827 %A 26.8339 %C 22.8202 %G 23.0050 %T 27.2835 %N 0.0574 total bases 537322388
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 25s.
Summary
====/kb/module/work/tmp/ae6fffe3-6064-46e6-8291-81600d034862.inter.fastq==== reads 2631398 len 149 len mean 138.0884 len stdev 23.4147 len min 35 phred 33 window-size 2000000 cycle-max 35 dups 964886 %dup 36.6682 unique-dup seq 368621 min dup count 2 dup seq 1 1252 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 263 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN dup seq 3 160 ATATTAGGGTTATGGTAACACAAAATTTCAGTTTT dup seq 4 152 ATACAGGCTTGCTATATGCTTTACCGTTTCGTCGA dup seq 5 134 GGATATGAGAAGGCAGACCCGGGGAACTGAAACAT dup seq 6 132 CCCTAATATAGTGGTATGTATTGGGGGGGTTCGTG dup seq 7 132 TAATTATAGCATCCTTGAGGACAAGACCGCAACAG dup seq 8 131 ATATAAGACCTTGGGTCTTATAAACAGAACGTTCC dup seq 9 124 GTATAGAGCCGGTGACTGTCCGCAAGTATGCCAGT dup seq 10 122 GTACATAGATACTCTAAAGTCTTTACGTCATCCTC dup mean 3.6176 dup stddev 3.7671 qual min 2 qual max 36 qual mean 34.9347 qual stdev 3.1445 %A 27.7066 %C 22.0902 %G 22.3146 %T 27.8559 %N 0.0327 total bases 363365583
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 26s.
Summary
====/kb/module/work/tmp/c7ac6b5d-d4ff-4d14-b932-430f10758d36.inter.fastq==== reads 3614876 len 149 len mean 131.6950 len stdev 27.5933 len min 35 phred 33 window-size 2000000 cycle-max 35 dups 1258925 %dup 34.8262 unique-dup seq 380454 min dup count 2 dup seq 1 1308 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 167 ATATAAGACCTTGGGTCTTATAAACAGAACGTTCC dup seq 3 153 GGATATGAGAAGGCAGACCCGGGGAACTGAAACAT dup seq 4 146 CTCATATCCTATGTATTCAGATATGGATACCACTC dup seq 5 141 ATTCAGACTCGCTTTCGCTGCGGCTCCGCATCTTC dup seq 6 127 GTCTTATATTCCGTTAAAATCCTTAGAAAGGAGGT dup seq 7 123 GTATTAGGCACGCCGCCAGCGTTCGTCCTGAGCCA dup seq 8 121 CTTATAAACAGAACGTTCCCTGTCTTGTTTAGTTT dup seq 9 120 TTTAAAGAGTGCGTAATAGCTCACTGGTCGAGTGA dup seq 10 118 TTCTATGGCAGTGCGTTCTACGATGGCGGTCGTAG dup mean 4.3090 dup stddev 3.9880 qual min 2 qual max 36 qual mean 34.6247 qual stdev 3.8621 %A 27.8748 %C 22.0549 %G 22.2451 %T 27.8158 %N 0.0094 total bases 476060936
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 25s.
Summary
====/kb/module/work/tmp/80b0b25d-d20c-4dd4-90dc-c5fcd0b2bf22.inter.fastq==== reads 3303372 len 151 len mean 147.3595 len stdev 13.1746 len min 35 phred 33 window-size 2000000 cycle-max 35 dups 1591229 %dup 48.1698 unique-dup seq 448814 min dup count 2 dup seq 1 1068 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 153 GCATATGGCTGCCCATTCTTCTTGCGCTTTTTCAC dup seq 3 144 GGCATCCACCGTGCGCCCTTTCTAACTTAACCGTT dup seq 4 142 GTCCTAAGGTAGCGAAATTCCTTGTCGGGTAAGTT dup seq 5 131 GTGGATGCCTTGGCACTAGGAGCCGATGAAGGACG dup seq 6 130 CCTTAGGATTCTCTCCTCGCCTACCTGTGTCGGTT dup seq 7 128 GGATTAGATACCCTGGTAGTCCACGCCGTAAACGA dup seq 8 127 GCCATATGCTGTGGCCGAGAATATATTGGATCGAA dup seq 9 126 CTCCTACCACTGTTCGAAGAACAGTCCGCAGCTTC dup seq 10 118 CTCTAAGGTGACTGCCGGTGACAAACCGGAGGAAG dup mean 4.5454 dup stddev 3.9541 qual min 2 qual max 34 qual mean 33.2891 qual stdev 3.4169 %A 26.3070 %C 23.3518 %G 23.5295 %T 26.7560 %N 0.0556 total bases 486783147
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 24s.
Summary
====/kb/module/work/tmp/33489393-7d20-4872-8d21-81b7b601cad6.inter.fastq==== reads 2845570 len 151 len mean 145.4961 len stdev 18.2899 len min 35 phred 33 window-size 2000000 cycle-max 35 dups 1266248 %dup 44.4989 unique-dup seq 415023 min dup count 2 dup seq 1 17924 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 163 CCTCTATGGTATCCGTTCAGAAAGACAACTTGAAA dup seq 3 150 GGATTAGATACCCTGGTAGTCCACGCCGTAAACGA dup seq 4 150 GTGTAAAGGCACAAGGGAGCTTGACTGCGAGACCT dup seq 5 141 GTGGATGCCTTGGCACTAGGAGCCGATGAAGGACG dup seq 6 130 CCTTAGGATTCTCTCCTCGCCTACCTGTGTCGGTT dup seq 7 128 CCATAGAGGTATCCGATAAACATCATTTTAAATAA dup seq 8 124 GAATAGGCCCAGGCGACTGTTTAGCAAAAACACAG dup seq 9 124 ATTCAGACTCGCTTTCGCTGCGGCTCCGCATCTTC dup seq 10 124 GTTGTACGGTGATCTAAATAGAAAAAACCTTCTGG dup mean 4.0510 dup stddev 28.0020 qual min 2 qual max 37 qual mean 35.9405 qual stdev 4.7507 %A 27.1488 %C 22.1586 %G 22.9670 %T 27.6790 %N 0.0466 total bases 414019325
Runs the JGI reads data preprocessing pipeline
This app completed without errors in 1h 35m 45s.
Objects
Created Object Name Type Description
CB0069_RQCfilter_100923 PairedEndLibrary Filtered reads library
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • rqcfilter_report.zip - RQCFilter report files
Runs the JGI reads data preprocessing pipeline
This app completed without errors in 50m 3s.
Objects
Created Object Name Type Description
CB0893_RQCfilter_100923 PairedEndLibrary Filtered reads library
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • rqcfilter_report.zip - RQCFilter report files
Runs the JGI reads data preprocessing pipeline
This app completed without errors in 29m 27s.
Objects
Created Object Name Type Description
CB0676_RQCfilter_100923 PairedEndLibrary Filtered reads library
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • rqcfilter_report.zip - RQCFilter report files
Runs the JGI reads data preprocessing pipeline
This app completed without errors in 1h 47m 21s.
Objects
Created Object Name Type Description
CB0027_RQCfilter_100923 PairedEndLibrary Filtered reads library
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • rqcfilter_report.zip - RQCFilter report files
Runs the JGI reads data preprocessing pipeline
This app completed without errors in 2h 16m 39s.
Objects
Created Object Name Type Description
CB0019_RQCfilter_100923 PairedEndLibrary Filtered reads library
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • rqcfilter_report.zip - RQCFilter report files
Runs the JGI reads data preprocessing pipeline
This app completed without errors in 1h 56m 16s.
Objects
Created Object Name Type Description
CB0021_RQCfilter_100923 PairedEndLibrary Filtered reads library
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • rqcfilter_report.zip - RQCFilter report files
Runs the JGI reads data preprocessing pipeline
This app completed without errors in 15m 32s.
Objects
Created Object Name Type Description
CB0902_RQCfilter_100923 PairedEndLibrary Filtered reads library
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • rqcfilter_report.zip - RQCFilter report files
Runs the JGI reads data preprocessing pipeline
This app completed without errors in 13m 3s.
Objects
Created Object Name Type Description
CB0729_RQCfilter_100923 PairedEndLibrary Filtered reads library
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • rqcfilter_report.zip - RQCFilter report files
Runs the JGI reads data preprocessing pipeline
This app completed without errors in 27m 41s.
Objects
Created Object Name Type Description
CB0904_RQCfilter_100923 PairedEndLibrary Filtered reads library
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • rqcfilter_report.zip - RQCFilter report files
Runs the JGI reads data preprocessing pipeline
This app completed without errors in 18m 60s.
Objects
Created Object Name Type Description
CB0912_RQCfilter_100923 PairedEndLibrary Filtered reads library
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • rqcfilter_report.zip - RQCFilter report files
Runs the JGI reads data preprocessing pipeline
This app completed without errors in 15m 42s.
Objects
Created Object Name Type Description
CB0909_RQCfilter_100923 PairedEndLibrary Filtered reads library
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • rqcfilter_report.zip - RQCFilter report files
Runs the JGI reads data preprocessing pipeline
This app completed without errors in 20m 35s.
Objects
Created Object Name Type Description
CB0687_RQCfilter_100923 PairedEndLibrary Filtered reads library
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • rqcfilter_report.zip - RQCFilter report files
Runs the JGI reads data preprocessing pipeline
This app completed without errors in 12m 55s.
Objects
Created Object Name Type Description
CB0742_RQCfilter_102323 PairedEndLibrary Filtered reads library
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • rqcfilter_report.zip - RQCFilter report files
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 26s.
Summary
====/kb/module/work/tmp/12040c34-0d76-402c-b0a6-1b8ff3baa1a5.inter.fastq==== reads 3225432 len 145 len mean 127.6709 len stdev 26.9287 len min 26 phred 33 window-size 2000000 cycle-max 35 dups 1080940 %dup 33.5130 unique-dup seq 333251 min dup count 2 dup seq 1 1594 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 1176 GTATATAGAGTGTGTGTAGTTGTTTGTCTTCAATG dup seq 3 524 GTATAGAAGGATTGGTTGACACTACTTACAACATG dup seq 4 524 ATCATATACACTCGTCCTCTATTTGTTATATAGAA dup seq 5 488 CTATCACTCTCTATCTATTACATACCCTCTCATTA dup seq 6 478 CTCTATATACAAGCCACCTGTCACAGACCATACAC dup seq 7 419 ATCTATTACATACCCTCTCATTACTCACACCTTCC dup seq 8 402 GACTAACCCTGAGCGGACGAGCCTTCCTCAGGAAA dup seq 9 398 GTATACTCCTTTAGGTGTATGGTCTGTGACAGGTG dup seq 10 396 GTATATGATATCACAACGACTTCCTTTCTAGAAGT dup mean 4.2436 dup stddev 7.9928 qual min 14 qual max 36 qual mean 34.5655 qual stdev 3.9734 %A 31.5087 %C 18.3773 %G 18.4324 %T 31.6817 %N 0.0000 total bases 411793653
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 25s.
Summary
====/kb/module/work/tmp/b9e98e8f-24af-481a-b92a-e6fa63f865eb.inter.fastq==== reads 2557920 len 145 len mean 136.0102 len stdev 21.7174 len min 27 phred 33 window-size 2000000 cycle-max 35 dups 966151 %dup 37.7710 unique-dup seq 384371 min dup count 2 dup seq 1 1102 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 118 GAGTATGAGAAGGCATACCCGGGGAACTGAAACAT dup seq 3 108 TCCTTGAACCATGCGGTTCAAGGAACTATCCGGTA dup seq 4 108 CTCATACTCTATGTATTCAAGTATGGATACTACTC dup seq 5 84 ATTCAGACTCGCTTTCGCTGCGGCTCCGCATCTTC dup seq 6 83 GTGGATGCCTTGGCACTAGGAGCCGATGAAGGACG dup seq 7 82 CTCTTTAAATGGTGGCTGCTTCTAAGCCAACATCC dup seq 8 77 GGATTAGATACCCTGGTAGTCCACGCCGTAAACGA dup seq 9 76 ATATAATATAGATAGTCAATTTGTCATGAGGAGTG dup seq 10 75 CCTTAGGATTCTCTCCTCGCCTACCTGTGTCGGTT dup mean 3.5136 dup stddev 3.1383 qual min 14 qual max 36 qual mean 34.9586 qual stdev 3.0550 %A 29.3053 %C 20.5198 %G 20.7840 %T 29.3908 %N 0.0000 total bases 347903235
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 25s.
Summary
====/kb/module/work/tmp/8338ff0e-c240-4715-bbb8-1719aaa08b32.inter.fastq==== reads 3116664 len 150 len mean 145.8560 len stdev 14.7186 len min 25 phred 33 window-size 2000000 cycle-max 35 dups 1565358 %dup 50.2254 unique-dup seq 443854 min dup count 2 dup seq 1 2140 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 175 GCATATGGCTGCCCATTCTTCTTGCGCTTTTTCAC dup seq 3 166 GTGTAAAGGCACAAGGGAGCTTGACTGCGAGACCT dup seq 4 157 GGCATCCACCGTGCGCCCTTTCTAACTTAACCGTT dup seq 5 152 CCTTTACACTCTGCGAATGATTTCCAACCATTCTG dup seq 6 151 CCTTAGGATTCTCTCCTCGCCTACCTGTGTCGGTT dup seq 7 142 GGATTAGATACCCTGGTAGTCCACGCCGTAAACGA dup seq 8 137 GTGGATGCCTTGGCACTAGGAGCCGATGAAGGACG dup seq 9 130 ATCTAATCCTGTTCGCTCCCCACGCTTTCGCTCCT dup seq 10 128 GCCATATGCTGTGGCCGAGAATATATTGGATCGAA dup mean 4.5267 dup stddev 4.9029 qual min 12 qual max 34 qual mean 33.3837 qual stdev 3.1805 %A 26.4030 %C 23.2652 %G 23.4833 %T 26.8485 %N 0.0000 total bases 454584014
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 38s.
Summary
====/kb/module/work/tmp/27a95b12-e63a-4c98-8444-bf8ed32d1f0b.inter.fastq==== reads 5665026 len 145 len mean 119.6287 len stdev 31.4859 len min 25 phred 33 window-size 2000000 cycle-max 35 dups 1698000 %dup 29.9734 unique-dup seq 379961 min dup count 2 dup seq 1 2067 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 212 GTTGTACGGTGATCTAAATAGAAAAAACCTTCTGG dup seq 3 157 CCATAGAGGTATCCGATAAACATCATTTTAAATAA dup seq 4 156 GGATATGAGAAGGCAGACCCGGGGAACTGAAACAT dup seq 5 154 ATATAAGACCTTGGGTCTTATAAACAGAACGTTCC dup seq 6 146 CCTCTATGGTATCCGTTCAGAAAGACAACTTGAAA dup seq 7 144 CCTTAAAGCCAATGCCAACAGGCATAAATACACAA dup seq 8 136 GTAGTAAGGTTTAACCTTTTCAATGATGAAAGAGA dup seq 9 134 GGCATCCACCGTGCGCCCTTTCTAACTTAACCGTT dup seq 10 132 GTGGATGCCTTGGCACTAGGAGCCGATGAAGGACG dup mean 5.4689 dup stddev 5.3323 qual min 14 qual max 36 qual mean 34.4473 qual stdev 4.2050 %A 28.4842 %C 21.4571 %G 21.6517 %T 28.4070 %N 0.0000 total bases 677699682
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 25s.
Summary
====/kb/module/work/tmp/7fed71ee-4676-4ea3-9d2a-eedf36e6552a.inter.fastq==== reads 2627288 len 145 len mean 135.1225 len stdev 23.0244 len min 25 phred 33 window-size 2000000 cycle-max 35 dups 963886 %dup 36.6875 unique-dup seq 368799 min dup count 2 dup seq 1 1124 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 160 ATATTAGGGTTATGGTAACACAAAATTTCAGTTTT dup seq 3 152 ATACAGGCTTGCTATATGCTTTACCGTTTCGTCGA dup seq 4 134 GGATATGAGAAGGCAGACCCGGGGAACTGAAACAT dup seq 5 132 CCCTAATATAGTGGTATGTATTGGGGGGGTTCGTG dup seq 6 132 TAATTATAGCATCCTTGAGGACAAGACCGCAACAG dup seq 7 131 ATATAAGACCTTGGGTCTTATAAACAGAACGTTCC dup seq 8 124 GTATAGAGCCGGTGACTGTCCGCAAGTATGCCAGT dup seq 9 122 GTACATAGATACTCTAAAGTCTTTACGTCATCCTC dup seq 10 118 ATCCAATACCGCGCTTACCCTATCCTCCTGCGTCC dup mean 3.6136 dup stddev 3.6256 qual min 14 qual max 36 qual mean 34.9460 qual stdev 3.0871 %A 27.7170 %C 22.0979 %G 22.3188 %T 27.8663 %N 0.0000 total bases 355005652
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 26s.
Summary
====/kb/module/work/tmp/aa721d4e-3cef-459b-813c-977c7d19e19f.inter.fastq==== reads 3262856 len 150 len mean 145.8111 len stdev 14.7355 len min 26 phred 33 window-size 2000000 cycle-max 35 dups 1681583 %dup 51.5372 unique-dup seq 436634 min dup count 2 dup seq 1 1956 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 142 TGTATATACAGAGTAAACTTAAAATGTTTTTGAAC dup seq 3 135 GCATATGGCTGCCCATTCTTCTTGCGCTTTTTCAC dup seq 4 131 GGATTAGATACCCTGGTAGTCCACGCCGTAAACGA dup seq 5 126 CCTTAGGATTCTCTCCTCGCCTACCTGTGTCGGTT dup seq 6 125 GTATATACATGTCTGATTTTAAAAAGAATTCTTTA dup seq 7 125 GTGTAAAGGCACAAGGGAGCTTGACTGCGAGACCT dup seq 8 125 GTGGATGCCTTGGCACTAGGAGCCGATGAAGGACG dup seq 9 119 ATGTACACCTCCTTTCCTATAACTCTATTATAGCA dup seq 10 117 CCTTTACACTCTGCGAATGATTTCCAACCATTCTG dup mean 4.8512 dup stddev 5.4128 qual min 12 qual max 34 qual mean 33.4092 qual stdev 3.1114 %A 26.8514 %C 22.8321 %G 23.0167 %T 27.2998 %N 0.0000 total bases 475760594
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 27s.
Summary
====/kb/module/work/tmp/a70b16fa-7557-49ea-baef-d1a3654ec2c8.inter.fastq==== reads 3609504 len 145 len mean 128.8214 len stdev 27.1950 len min 25 phred 33 window-size 2000000 cycle-max 35 dups 1258362 %dup 34.8625 unique-dup seq 380700 min dup count 2 dup seq 1 1305 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 167 ATATAAGACCTTGGGTCTTATAAACAGAACGTTCC dup seq 3 152 GGATATGAGAAGGCAGACCCGGGGAACTGAAACAT dup seq 4 146 CTCATATCCTATGTATTCAGATATGGATACCACTC dup seq 5 141 ATTCAGACTCGCTTTCGCTGCGGCTCCGCATCTTC dup seq 6 127 GTCTTATATTCCGTTAAAATCCTTAGAAAGGAGGT dup seq 7 123 GTATTAGGCACGCCGCCAGCGTTCGTCCTGAGCCA dup seq 8 121 CTTATAAACAGAACGTTCCCTGTCTTGTTTAGTTT dup seq 9 120 TTTAAAGAGTGCGTAATAGCTCACTGGTCGAGTGA dup seq 10 118 TTCTATGACAGTGCGTTCTACGATGGCAGTCGTAG dup mean 4.3054 dup stddev 3.9809 qual min 14 qual max 36 qual mean 34.6284 qual stdev 3.8481 %A 27.8773 %C 22.0570 %G 22.2474 %T 27.8184 %N 0.0000 total bases 464981347
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 58s.
Summary
====/kb/module/work/tmp/92d8bdc1-a04a-45a2-8cc4-ad5b21b6176e.inter.fastq==== reads 13452436 len 150 len mean 149.9710 len stdev 1.0083 len min 49 phred 33 window-size 2000000 cycle-max 35 dups 1408475 %dup 10.4700 unique-dup seq 384005 min dup count 2 dup seq 1 1206 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 181 CTTCGATTTCCTGCCCCATGTTATTCATACGAAGA dup seq 3 71 CTTTGTCTTCTTTTCAGCGTGATAGAAAACCTTTG dup seq 4 70 GAAGAACCTTACCAGGTCTTGACATCCTCTGACAA dup seq 5 70 CAAAGGTTTTCTATCACGCTGAAAAGAAGACAAAG dup seq 6 64 GAAGGCGACTCTCTGGTCTGTAACTGACGCTGAGG dup seq 7 64 AAAGGGCGCACGGTGGATGCCTTGGCACTAGGAGC dup seq 8 63 AGCTTCGGTGATACGTTTAGCCCCGGTACATTTTC dup seq 9 60 TAACGCATTAAGCACTCCGCCTGGGGAGTACGGTC dup seq 10 59 AAAGGGCTAAAATAAAGATCATTTTAGCCCTTTGT dup mean 4.6679 dup stddev 3.9589 qual min 11 qual max 37 qual mean 36.3178 qual stdev 3.5134 %A 29.9320 %C 20.4288 %G 20.3711 %T 29.2681 %N 0.0000 total bases 2017475275
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 1m 21s.
Summary
====/kb/module/work/tmp/e86b5dfb-dab4-4c15-b6c0-76a003c1b82f.inter.fastq==== reads 13931330 len 150 len mean 149.9760 len stdev 0.6828 len min 49 phred 33 window-size 2000000 cycle-max 35 dups 1423745 %dup 10.2197 unique-dup seq 378875 min dup count 2 dup seq 1 1323 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 117 GAAGCCGTAAGGCGAAGTATAGGGGCTGACGCCTG dup seq 3 91 AAGGGGCGGAAACCCCCTAACACTTAGCACTCATC dup seq 4 88 GACCGTACTCCCCAGGCGGAGTGCTTAATGCGTTA dup seq 5 85 TAGGAGGTAACAGCCCTGTAGTCAAAACTTCGTTC dup seq 6 82 GGAAACCCCCTAACACTTAGCACTCATCGTTTACG dup seq 7 82 TAGGGAAGAACAAGTGCCGTTCAAATAGGGCGGCA dup seq 8 82 TGGGCTACACACGTGCTACAATGGACAGAACAAAG dup seq 9 79 TAGGAGTCTGGGCCGTGTCTCAGTCCCAGTGTGGC dup seq 10 79 GAAGGCGACTCTCTGGTCTGTAACTGACGCTGAGG dup mean 4.7578 dup stddev 4.6883 qual min 11 qual max 37 qual mean 36.2598 qual stdev 3.6736 %A 27.3600 %C 22.9431 %G 23.0189 %T 26.6780 %N 0.0000 total bases 2089364659
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 56s.
Summary
====/kb/module/work/tmp/dc338e08-c7d0-494d-80fe-a0039dbcd491.inter.fastq==== reads 10495344 len 150 len mean 149.9746 len stdev 0.7045 len min 51 phred 33 window-size 2000000 cycle-max 35 dups 1108904 %dup 10.5657 unique-dup seq 374605 min dup count 2 dup seq 1 1117 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 88 TAGGAGGTAACAGCCCTGTAGTCAAAACTTCGTTC dup seq 3 71 GAAGCCGTAAGGCGAAGTATAGGGGCTGACGCCTG dup seq 4 64 TAGGGGGTTTCCGCCCCTTAGTGCTGCAGCTAACG dup seq 5 63 AAGGGGCGGAAACCCCCTAACACTTAGCACTCATC dup seq 6 62 TGGAAGGGCCATCGCTCAACGGATAAAAGCTACCC dup seq 7 59 AGGGGCATGATGATTTGACGTCATCCCCACCTTCC dup seq 8 58 TAGCGAGGTGAGAGGTGCCCGTACCGCAAACCGAC dup seq 9 57 TGGGTAGATCACCTGGTTTCGGGTCTACGACCACG dup seq 10 56 TGGAGCTGACAGATACTAATCGATCGAGGACTTAA dup mean 3.9602 dup stddev 3.2577 qual min 11 qual max 37 qual mean 36.3029 qual stdev 3.5629 %A 27.4331 %C 22.8435 %G 22.9101 %T 26.8133 %N 0.0000 total bases 1574035482
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 58s.
Summary
====/kb/module/work/tmp/297fba02-87a6-41ca-ac27-8e78132ed348.inter.fastq==== reads 11987086 len 150 len mean 149.9751 len stdev 0.6799 len min 49 phred 33 window-size 2000000 cycle-max 35 dups 1039020 %dup 8.6678 unique-dup seq 333408 min dup count 2 dup seq 1 916 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 80 GAAGCCGTAAGGCGAAGTATAGGGGCTGACGCCTG dup seq 3 62 GACCGTACTCCCCAGGCGGAGTGCTTAATGCGTTA dup seq 4 58 GAAGAGGTCACACCCGTTCCCATGCCGAACACGGA dup seq 5 55 TAGGACCGTTATAGTTACGGCCGCCGTTTACTGGG dup seq 6 55 AGGGACGAAAGTCGGGCTTAGTGATCCGGTGGTTC dup seq 7 53 TAGCACTCATCGTTTACGGCGTGGACTACCAGGGT dup seq 8 53 TGACGACAACCATGCACCACCTGTCACTCTGCCCC dup seq 9 52 GGAGCCGCAGCGAAAGCGAGTCTGAATAGGGCGAA dup seq 10 52 GAATGATTTCCAACCATTCTGAGGGAACCTTTGGG dup mean 4.1164 dup stddev 3.3181 qual min 11 qual max 37 qual mean 36.2047 qual stdev 3.7893 %A 27.2888 %C 22.9184 %G 23.0100 %T 26.7828 %N 0.0000 total bases 1797764791
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 25s.
Summary
====/kb/module/work/tmp/e6800b02-80d9-47b5-bbc1-10d96445b3e3.inter.fastq==== reads 2944544 len 150 len mean 146.1515 len stdev 13.9436 len min 25 phred 33 window-size 2000000 cycle-max 35 dups 1471464 %dup 49.9726 unique-dup seq 450442 min dup count 2 dup seq 1 949 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 142 GCATATGGCTGCCCATTCTTCTTGCGCTTTTTCAC dup seq 3 133 GGCATCCACCGTGCGCCCTTTCTAACTTAACCGTT dup seq 4 128 GTCCTAAGGTAGCGAAATTCCTTGTCGGGTAAGTT dup seq 5 120 GTGGATGCCTTGGCACTAGGAGCCGATGAAGGACG dup seq 6 116 CCTTAGGATTCTCTCCTCGCCTACCTGTGTCGGTT dup seq 7 116 CTCCTACCACTGTTCGAAGAACAGTCCGCAGCTTC dup seq 8 113 GGATTAGATACCCTGGTAGTCCACGCCGTAAACGA dup seq 9 108 GCCATATGCTGTGGCCGAGAATATATTGGATCGAA dup seq 10 106 GCCCAACCCTTGGGACCGACTACAGCCCCAGGATG dup mean 4.2667 dup stddev 3.5742 qual min 12 qual max 34 qual mean 33.2993 qual stdev 3.3544 %A 26.3214 %C 23.3663 %G 23.5392 %T 26.7732 %N 0.0000 total bases 430349561
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 25s.
Summary
====/kb/module/work/tmp/4652b813-efaa-4a69-86eb-ca076a0b24e1.inter.fastq==== reads 2772772 len 150 len mean 144.3393 len stdev 18.6769 len min 25 phred 33 window-size 2000000 cycle-max 35 dups 1246254 %dup 44.9461 unique-dup seq 415629 min dup count 2 dup seq 1 17671 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG dup seq 2 160 CCTCTATGGTATCCGTTCAGAAAGACAACTTGAAA dup seq 3 148 GGATTAGATACCCTGGTAGTCCACGCCGTAAACGA dup seq 4 143 GTGTAAAGGCACAAGGGAGCTTGACTGCGAGACCT dup seq 5 138 GTGGATGCCTTGGCACTAGGAGCCGATGAAGGACG dup seq 6 128 CCTTAGGATTCTCTCCTCGCCTACCTGTGTCGGTT dup seq 7 128 CCATAGAGGTATCCGATAAACATCATTTTAAATAA dup seq 8 124 GTTGTACGGTGATCTAAATAGAAAAAACCTTCTGG dup seq 9 123 ATTCAGACTCGCTTTCGCTGCGGCTCCGCATCTTC dup seq 10 122 GAATAGGCCCAGGCGACTGTTTAGCAAAAACACAG dup mean 3.9985 dup stddev 27.5856 qual min 12 qual max 37 qual mean 35.9611 qual stdev 4.6844 %A 27.1629 %C 22.1683 %G 22.9742 %T 27.6945 %N 0.0000 total bases 400220013
Filter out low-complexity paired- or single-end reads with PRINSEQ.
This app completed without errors in 24m 15s.
Objects
Created Object Name Type Description
CB0069_RQC_prinseq_100923 PairedEndLibrary Filtered Paired End Reads
CB0069_RQC_prinseq_100923_fwd_singletons SingleEndLibrary Filtered Forward Unpaired End Reads
CB0069_RQC_prinseq_100923_rev_singletons SingleEndLibrary Filtered Reverse Unpaired End Reads
Summary
Input and filter stats: Input sequences (file 1): 6,726,218 Input bases (file 1): 1,008,732,088 Input mean length (file 1): 149.97 Input sequences (file 2): 6,726,218 Input bases (file 2): 1,008,743,187 Input mean length (file 2): 149.97 Good sequences (pairs): 6,717,033 Good bases (pairs): 2,014,722,366 Good mean length (pairs): 299.94 Good sequences (singletons file 1): 7,567 (0.11%) Good bases (singletons file 1): 1,134,750 Good mean length (singletons file 1): 149.96 Good sequences (singletons file 2): 1,468 (0.02%) Good bases (singletons file 2): 219,553 Good mean length (singletons file 2): 149.56 Bad sequences (file 1): 1,618 (0.02%) Bad bases (file 1): 241,701 Bad mean length (file 1): 149.38 Bad sequences (file 2): 7,717 (0.11%) Bad bases (file 2): 1,156,905 Bad mean length (file 2): 149.92 Sequences filtered by specified parameters: lc_method: 9335 OBJECTS CREATED : 157793/111/1 : Filtered Paired End Reads 157793/114/1 : Filtered Forward Unpaired End Reads 157793/116/1 : Filtered Reverse Unpaired End Reads
Filter out low-complexity paired- or single-end reads with PRINSEQ.
This app completed without errors in 22m 45s.
Objects
Created Object Name Type Description
CB0019_RQC_prinseq_100923 PairedEndLibrary Filtered Paired End Reads
CB0019_RQC_prinseq_100923_fwd_singletons SingleEndLibrary Filtered Forward Unpaired End Reads
CB0019_RQC_prinseq_100923_rev_singletons SingleEndLibrary Filtered Reverse Unpaired End Reads
Summary
Input and filter stats: Input sequences (file 1): 5,993,543 Input bases (file 1): 898,880,947 Input mean length (file 1): 149.97 Input sequences (file 2): 5,993,543 Input bases (file 2): 898,883,844 Input mean length (file 2): 149.98 Good sequences (pairs): 5,987,423 Good bases (pairs): 1,795,929,208 Good mean length (pairs): 299.95 Good sequences (singletons file 1): 5,368 (0.09%) Good bases (singletons file 1): 805,116 Good mean length (singletons file 1): 149.98 Good sequences (singletons file 2): 724 (0.01%) Good bases (singletons file 2): 108,565 Good mean length (singletons file 2): 149.95 Bad sequences (file 1): 752 (0.01%) Bad bases (file 1): 112,674 Bad mean length (file 1): 149.83 Bad sequences (file 2): 5,396 (0.09%) Bad bases (file 2): 809,228 Bad mean length (file 2): 149.97 Sequences filtered by specified parameters: lc_method: 6148 OBJECTS CREATED : 157793/110/1 : Filtered Paired End Reads 157793/112/1 : Filtered Forward Unpaired End Reads 157793/113/1 : Filtered Reverse Unpaired End Reads
Filter out low-complexity paired- or single-end reads with PRINSEQ.
This app completed without errors in 19m 6s.
Objects
Created Object Name Type Description
CB0021_RQC_prinseq_100923 PairedEndLibrary Filtered Paired End Reads
CB0021_RQC_prinseq_100923_fwd_singletons SingleEndLibrary Filtered Forward Unpaired End Reads
CB0021_RQC_prinseq_100923_rev_singletons SingleEndLibrary Filtered Reverse Unpaired End Reads
Summary
Input and filter stats: Input sequences (file 1): 5,247,672 Input bases (file 1): 787,013,473 Input mean length (file 1): 149.97 Input sequences (file 2): 5,247,672 Input bases (file 2): 787,022,009 Input mean length (file 2): 149.98 Good sequences (pairs): 5,241,556 Good bases (pairs): 1,572,201,842 Good mean length (pairs): 299.95 Good sequences (singletons file 1): 5,278 (0.10%) Good bases (singletons file 1): 791,519 Good mean length (singletons file 1): 149.97 Good sequences (singletons file 2): 766 (0.01%) Good bases (singletons file 2): 114,647 Good mean length (singletons file 2): 149.67 Bad sequences (file 1): 838 (0.02%) Bad bases (file 1): 125,298 Bad mean length (file 1): 149.52 Bad sequences (file 2): 5,350 (0.10%) Bad bases (file 2): 802,176 Bad mean length (file 2): 149.94 Sequences filtered by specified parameters: lc_method: 6188 OBJECTS CREATED : 157793/103/1 : Filtered Paired End Reads 157793/107/1 : Filtered Forward Unpaired End Reads 157793/108/1 : Filtered Reverse Unpaired End Reads
Filter out low-complexity paired- or single-end reads with PRINSEQ.
This app completed without errors in 25m 58s.
Objects
Created Object Name Type Description
CB0027_RQC_prinseq_100923 PairedEndLibrary Filtered Paired End Reads
CB0027_RQC_prinseq_100923_fwd_singletons SingleEndLibrary Filtered Forward Unpaired End Reads
CB0027_RQC_prinseq_100923_rev_singletons SingleEndLibrary Filtered Reverse Unpaired End Reads
Summary
Input and filter stats: Input sequences (file 1): 6,965,665 Input bases (file 1): 1,044,676,724 Input mean length (file 1): 149.98 Input sequences (file 2): 6,965,665 Input bases (file 2): 1,044,687,935 Input mean length (file 2): 149.98 Good sequences (pairs): 6,957,823 Good bases (pairs): 2,087,013,294 Good mean length (pairs): 299.95 Good sequences (singletons file 1): 6,785 (0.10%) Good bases (singletons file 1): 1,017,381 Good mean length (singletons file 1): 149.95 Good sequences (singletons file 2): 988 (0.01%) Good bases (singletons file 2): 148,082 Good mean length (singletons file 2): 149.88 Bad sequences (file 1): 1,057 (0.02%) Bad bases (file 1): 158,299 Bad mean length (file 1): 149.76 Bad sequences (file 2): 6,854 (0.10%) Bad bases (file 2): 1,027,603 Bad mean length (file 2): 149.93 Sequences filtered by specified parameters: lc_method: 7911 OBJECTS CREATED : 157793/118/1 : Filtered Paired End Reads 157793/119/1 : Filtered Forward Unpaired End Reads 157793/120/1 : Filtered Reverse Unpaired End Reads
Filter out low-complexity paired- or single-end reads with PRINSEQ.
This app completed without errors in 5m 54s.
Objects
Created Object Name Type Description
CB0687_RQC_prinseq_100923 PairedEndLibrary Filtered Paired End Reads
CB0687_RQC_prinseq_100923_fwd_singletons SingleEndLibrary Filtered Forward Unpaired End Reads
CB0687_RQC_prinseq_100923_rev_singletons SingleEndLibrary Filtered Reverse Unpaired End Reads
Summary
Input and filter stats: Input sequences (file 1): 1,313,644 Input bases (file 1): 177,500,568 Input mean length (file 1): 135.12 Input sequences (file 2): 1,313,644 Input bases (file 2): 177,505,084 Input mean length (file 2): 135.12 Good sequences (pairs): 1,311,545 Good bases (pairs): 354,424,697 Good mean length (pairs): 270.23 Good sequences (singletons file 1): 1,821 (0.14%) Good bases (singletons file 1): 244,230 Good mean length (singletons file 1): 134.12 Good sequences (singletons file 2): 235 (0.02%) Good bases (singletons file 2): 31,412 Good mean length (singletons file 2): 133.67 Bad sequences (file 1): 278 (0.02%) Bad bases (file 1): 38,210 Bad mean length (file 1): 137.45 Bad sequences (file 2): 1,864 (0.14%) Bad bases (file 2): 267,103 Bad mean length (file 2): 143.30 Sequences filtered by specified parameters: lc_method: 2142 OBJECTS CREATED : 157793/94/1 : Filtered Paired End Reads 157793/95/1 : Filtered Forward Unpaired End Reads 157793/96/1 : Filtered Reverse Unpaired End Reads
Filter out low-complexity paired- or single-end reads with PRINSEQ.
This app completed without errors in 6m 43s.
Objects
Created Object Name Type Description
CB0902_RQC_prinseq_100923 PairedEndLibrary Filtered Paired End Reads
CB0902_RQC_prinseq_100923_fwd_singletons SingleEndLibrary Filtered Forward Unpaired End Reads
CB0902_RQC_prinseq_100923_rev_singletons SingleEndLibrary Filtered Reverse Unpaired End Reads
Summary
Input and filter stats: Input sequences (file 1): 1,558,332 Input bases (file 1): 227,255,671 Input mean length (file 1): 145.83 Input sequences (file 2): 1,558,332 Input bases (file 2): 227,328,343 Input mean length (file 2): 145.88 Good sequences (pairs): 1,554,884 Good bases (pairs): 453,562,695 Good mean length (pairs): 291.70 Good sequences (singletons file 1): 2,966 (0.19%) Good bases (singletons file 1): 438,039 Good mean length (singletons file 1): 147.69 Good sequences (singletons file 2): 438 (0.03%) Good bases (singletons file 2): 62,487 Good mean length (singletons file 2): 142.66 Bad sequences (file 1): 482 (0.03%) Bad bases (file 1): 71,130 Bad mean length (file 1): 147.57 Bad sequences (file 2): 3,010 (0.19%) Bad bases (file 2): 449,663 Bad mean length (file 2): 149.39 Sequences filtered by specified parameters: lc_method: 3492 OBJECTS CREATED : 157793/98/1 : Filtered Paired End Reads 157793/99/1 : Filtered Forward Unpaired End Reads 157793/100/1 : Filtered Reverse Unpaired End Reads
Filter out low-complexity paired- or single-end reads with PRINSEQ.
This app completed without errors in 6m 52s.
Objects
Created Object Name Type Description
CB0904_RQC_prinseq_100923 PairedEndLibrary Filtered Paired End Reads
CB0904_RQC_prinseq_100923_fwd_singletons SingleEndLibrary Filtered Forward Unpaired End Reads
CB0904_RQC_prinseq_100923_rev_singletons SingleEndLibrary Filtered Reverse Unpaired End Reads
Summary
Input and filter stats: Input sequences (file 1): 1,631,428 Input bases (file 1): 237,843,112 Input mean length (file 1): 145.79 Input sequences (file 2): 1,631,428 Input bases (file 2): 237,917,482 Input mean length (file 2): 145.83 Good sequences (pairs): 1,628,244 Good bases (pairs): 474,817,784 Good mean length (pairs): 291.61 Good sequences (singletons file 1): 2,696 (0.17%) Good bases (singletons file 1): 398,142 Good mean length (singletons file 1): 147.68 Good sequences (singletons file 2): 437 (0.03%) Good bases (singletons file 2): 64,085 Good mean length (singletons file 2): 146.65 Bad sequences (file 1): 488 (0.03%) Bad bases (file 1): 71,241 Bad mean length (file 1): 145.99 Bad sequences (file 2): 2,747 (0.17%) Bad bases (file 2): 409,342 Bad mean length (file 2): 149.01 Sequences filtered by specified parameters: lc_method: 3235 OBJECTS CREATED : 157793/102/1 : Filtered Paired End Reads 157793/104/1 : Filtered Forward Unpaired End Reads 157793/105/1 : Filtered Reverse Unpaired End Reads
Filter out low-complexity paired- or single-end reads with PRINSEQ.
This app completed without errors in 9m 46s.
Objects
Created Object Name Type Description
CB0893_RQC_prinseq_100923 PairedEndLibrary Filtered Paired End Reads
CB0893_RQC_prinseq_100923_fwd_singletons SingleEndLibrary Filtered Forward Unpaired End Reads
CB0893_RQC_prinseq_100923_rev_singletons SingleEndLibrary Filtered Reverse Unpaired End Reads
Summary
Input and filter stats: Input sequences (file 1): 2,832,513 Input bases (file 1): 338,756,562 Input mean length (file 1): 119.60 Input sequences (file 2): 2,832,513 Input bases (file 2): 338,943,120 Input mean length (file 2): 119.66 Good sequences (pairs): 2,827,438 Good bases (pairs): 676,388,632 Good mean length (pairs): 239.22 Good sequences (singletons file 1): 4,336 (0.15%) Good bases (singletons file 1): 514,977 Good mean length (singletons file 1): 118.77 Good sequences (singletons file 2): 547 (0.02%) Good bases (singletons file 2): 65,625 Good mean length (singletons file 2): 119.97 Bad sequences (file 1): 739 (0.03%) Bad bases (file 1): 93,024 Bad mean length (file 1): 125.88 Bad sequences (file 2): 4,528 (0.16%) Bad bases (file 2): 637,424 Bad mean length (file 2): 140.77 Sequences filtered by specified parameters: lc_method: 5267 OBJECTS CREATED : 157793/127/1 : Filtered Paired End Reads 157793/128/1 : Filtered Forward Unpaired End Reads 157793/129/1 : Filtered Reverse Unpaired End Reads
Filter out low-complexity paired- or single-end reads with PRINSEQ.
This app completed without errors in 6m 23s.
Objects
Created Object Name Type Description
CB0676_RQC_prinseq_101023 PairedEndLibrary Filtered Paired End Reads
CB0676_RQC_prinseq_101023_fwd_singletons SingleEndLibrary Filtered Forward Unpaired End Reads
CB0676_RQC_prinseq_101023_rev_singletons SingleEndLibrary Filtered Reverse Unpaired End Reads
Summary
Input and filter stats: Input sequences (file 1): 1,612,716 Input bases (file 1): 205,856,705 Input mean length (file 1): 127.65 Input sequences (file 2): 1,612,716 Input bases (file 2): 205,936,948 Input mean length (file 2): 127.70 Good sequences (pairs): 1,608,128 Good bases (pairs): 410,561,381 Good mean length (pairs): 255.30 Good sequences (singletons file 1): 3,392 (0.21%) Good bases (singletons file 1): 439,592 Good mean length (singletons file 1): 129.60 Good sequences (singletons file 2): 589 (0.04%) Good bases (singletons file 2): 77,448 Good mean length (singletons file 2): 131.49 Bad sequences (file 1): 1,196 (0.07%) Bad bases (file 1): 154,220 Bad mean length (file 1): 128.95 Bad sequences (file 2): 3,999 (0.25%) Bad bases (file 2): 561,012 Bad mean length (file 2): 140.29 Sequences filtered by specified parameters: lc_method: 5195 OBJECTS CREATED : 157793/132/1 : Filtered Paired End Reads 157793/133/1 : Filtered Forward Unpaired End Reads 157793/135/1 : Filtered Reverse Unpaired End Reads
Filter out low-complexity paired- or single-end reads with PRINSEQ.
This app completed without errors in 5m 53s.
Objects
Created Object Name Type Description
CB0729_RQC_prinseq_101023 PairedEndLibrary Filtered Paired End Reads
CB0729_RQC_prinseq_101023_fwd_singletons SingleEndLibrary Filtered Forward Unpaired End Reads
CB0729_RQC_prinseq_101023_rev_singletons SingleEndLibrary Filtered Reverse Unpaired End Reads
Summary
Input and filter stats: Input sequences (file 1): 1,278,960 Input bases (file 1): 173,954,355 Input mean length (file 1): 136.01 Input sequences (file 2): 1,278,960 Input bases (file 2): 173,948,880 Input mean length (file 2): 136.01 Good sequences (pairs): 1,276,923 Good bases (pairs): 347,332,644 Good mean length (pairs): 272.01 Good sequences (singletons file 1): 1,833 (0.14%) Good bases (singletons file 1): 251,064 Good mean length (singletons file 1): 136.97 Good sequences (singletons file 2): 190 (0.01%) Good bases (singletons file 2): 25,293 Good mean length (singletons file 2): 133.12 Bad sequences (file 1): 204 (0.02%) Bad bases (file 1): 28,274 Bad mean length (file 1): 138.60 Bad sequences (file 2): 1,847 (0.14%) Bad bases (file 2): 265,960 Bad mean length (file 2): 144.00 Sequences filtered by specified parameters: lc_method: 2051 OBJECTS CREATED : 157793/131/1 : Filtered Paired End Reads 157793/134/1 : Filtered Forward Unpaired End Reads 157793/136/1 : Filtered Reverse Unpaired End Reads
Filter out low-complexity paired- or single-end reads with PRINSEQ.
This app completed without errors in 6m 31s.
Objects
Created Object Name Type Description
CB0909_RQC_prinseq_101023 PairedEndLibrary Filtered Paired End Reads
CB0909_RQC_prinseq_101023_fwd_singletons SingleEndLibrary Filtered Forward Unpaired End Reads
CB0909_RQC_prinseq_101023_rev_singletons SingleEndLibrary Filtered Reverse Unpaired End Reads
Summary
Input and filter stats: Input sequences (file 1): 1,472,272 Input bases (file 1): 215,134,438 Input mean length (file 1): 146.12 Input sequences (file 2): 1,472,272 Input bases (file 2): 215,215,123 Input mean length (file 2): 146.18 Good sequences (pairs): 1,470,434 Good bases (pairs): 429,807,891 Good mean length (pairs): 292.30 Good sequences (singletons file 1): 1,438 (0.10%) Good bases (singletons file 1): 210,437 Good mean length (singletons file 1): 146.34 Good sequences (singletons file 2): 372 (0.03%) Good bases (singletons file 2): 54,450 Good mean length (singletons file 2): 146.37 Bad sequences (file 1): 400 (0.03%) Bad bases (file 1): 58,800 Bad mean length (file 1): 147.00 Bad sequences (file 2): 1,466 (0.10%) Bad bases (file 2): 217,983 Bad mean length (file 2): 148.69 Sequences filtered by specified parameters: lc_method: 1866 OBJECTS CREATED : 157793/139/1 : Filtered Paired End Reads 157793/140/1 : Filtered Forward Unpaired End Reads 157793/141/1 : Filtered Reverse Unpaired End Reads
Filter out low-complexity paired- or single-end reads with PRINSEQ.
This app completed without errors in 6m 60s.
Objects
Created Object Name Type Description
CB0912_RQC_prinseq_101023 PairedEndLibrary Filtered Paired End Reads
CB0912_RQC_prinseq_101023_fwd_singletons SingleEndLibrary Filtered Forward Unpaired End Reads
CB0912_RQC_prinseq_101023_rev_singletons SingleEndLibrary Filtered Reverse Unpaired End Reads
Summary
Input and filter stats: Input sequences (file 1): 1,804,752 Input bases (file 1): 232,504,029 Input mean length (file 1): 128.83 Input sequences (file 2): 1,804,752 Input bases (file 2): 232,477,318 Input mean length (file 2): 128.81 Good sequences (pairs): 1,801,455 Good bases (pairs): 464,087,227 Good mean length (pairs): 257.62 Good sequences (singletons file 1): 2,793 (0.15%) Good bases (singletons file 1): 360,107 Good mean length (singletons file 1): 128.93 Good sequences (singletons file 2): 433 (0.02%) Good bases (singletons file 2): 55,453 Good mean length (singletons file 2): 128.07 Bad sequences (file 1): 504 (0.03%) Bad bases (file 1): 68,667 Bad mean length (file 1): 136.24 Bad sequences (file 2): 2,864 (0.16%) Bad bases (file 2): 409,893 Bad mean length (file 2): 143.12 Sequences filtered by specified parameters: lc_method: 3368 OBJECTS CREATED : 157793/143/1 : Filtered Paired End Reads 157793/144/1 : Filtered Forward Unpaired End Reads 157793/145/1 : Filtered Reverse Unpaired End Reads
Filter out low-complexity paired- or single-end reads with PRINSEQ.
This app completed without errors in 5m 37s.
Objects
Created Object Name Type Description
CB0742_RQCfilter_prinseq_102423 PairedEndLibrary Filtered Paired End Reads
CB0742_RQCfilter_prinseq_102423_fwd_singletons SingleEndLibrary Filtered Forward Unpaired End Reads
CB0742_RQCfilter_prinseq_102423_rev_singletons SingleEndLibrary Filtered Reverse Unpaired End Reads
Summary
Input and filter stats: Input sequences (file 1): 1,386,386 Input bases (file 1): 200,103,332 Input mean length (file 1): 144.33 Input sequences (file 2): 1,386,386 Input bases (file 2): 200,116,681 Input mean length (file 2): 144.34 Good sequences (pairs): 1,365,367 Good bases (pairs): 393,948,124 Good mean length (pairs): 288.53 Good sequences (singletons file 1): 20,761 (1.50%) Good bases (singletons file 1): 3,093,037 Good mean length (singletons file 1): 148.98 Good sequences (singletons file 2): 180 (0.01%) Good bases (singletons file 2): 24,784 Good mean length (singletons file 2): 137.69 Bad sequences (file 1): 258 (0.02%) Bad bases (file 1): 35,300 Bad mean length (file 1): 136.82 Bad sequences (file 2): 20,839 (1.50%) Bad bases (file 2): 3,118,768 Bad mean length (file 2): 149.66 Sequences filtered by specified parameters: lc_method: 21097 OBJECTS CREATED : 157793/306/1 : Filtered Paired End Reads 157793/307/1 : Filtered Forward Unpaired End Reads 157793/308/1 : Filtered Reverse Unpaired End Reads
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 1m 1s.
Summary
====/kb/module/work/tmp/1148e8b8-13b8-40e3-ae6e-9ecf729f3825.inter.fastq==== reads 11974846 len 150 len mean 149.9751 len stdev 0.6793 len min 49 phred 33 window-size 2000000 cycle-max 35 dups 1038149 %dup 8.6694 unique-dup seq 333580 min dup count 2 dup seq 1 80 GAAGCCGTAAGGCGAAGTATAGGGGCTGACGCCTG dup seq 2 62 GACCGTACTCCCCAGGCGGAGTGCTTAATGCGTTA dup seq 3 58 GAAGAGGTCACACCCGTTCCCATGCCGAACACGGA dup seq 4 55 AGGGACGAAAGTCGGGCTTAGTGATCCGGTGGTTC dup seq 5 55 TAGGACCGTTATAGTTACGGCCGCCGTTTACTGGG dup seq 6 53 TGACGACAACCATGCACCACCTGTCACTCTGCCCC dup seq 7 53 TAGCACTCATCGTTTACGGCGTGGACTACCAGGGT dup seq 8 52 GAATGATTTCCAACCATTCTGAGGGAACCTTTGGG dup seq 9 52 AAGGGGAGTGAAAGAGATCCTGAAACCGTGTGCCT dup seq 10 52 GGAGCCGCAGCGAAAGCGAGTCTGAATAGGGCGAA dup mean 4.1121 dup stddev 2.9162 qual min 11 qual max 37 qual mean 36.2096 qual stdev 3.7748 %A 27.2832 %C 22.9240 %G 23.0045 %T 26.7883 %N 0.0000 total bases 1795929208
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 57s.
Summary
====/kb/module/work/tmp/23677ebc-42cd-4a34-b657-dbf2dec27c1c.inter.fastq==== reads 10483112 len 150 len mean 149.9747 len stdev 0.7000 len min 52 phred 33 window-size 2000000 cycle-max 35 dups 1108047 %dup 10.5698 unique-dup seq 374841 min dup count 2 dup seq 1 88 TAGGAGGTAACAGCCCTGTAGTCAAAACTTCGTTC dup seq 2 71 GAAGCCGTAAGGCGAAGTATAGGGGCTGACGCCTG dup seq 3 64 TAGGGGGTTTCCGCCCCTTAGTGCTGCAGCTAACG dup seq 4 63 AAGGGGCGGAAACCCCCTAACACTTAGCACTCATC dup seq 5 62 TGGAAGGGCCATCGCTCAACGGATAAAAGCTACCC dup seq 6 59 AGGGGCATGATGATTTGACGTCATCCCCACCTTCC dup seq 7 58 TAGCGAGGTGAGAGGTGCCCGTACCGCAAACCGAC dup seq 8 57 TGGGTAGATCACCTGGTTTCGGGTCTACGACCACG dup seq 9 56 TGACCTGGGCTACACACGTGCTACAATGGACAGAA dup seq 10 56 TGGAGCTGACAGATACTAATCGATCGAGGACTTAA dup mean 3.9560 dup stddev 2.7004 qual min 11 qual max 37 qual mean 36.3086 qual stdev 3.5447 %A 27.4275 %C 22.8501 %G 22.9025 %T 26.8199 %N 0.0000 total bases 1572201842
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 1m 20s.
Summary
====/kb/module/work/tmp/e04e67d8-0e8b-4fd6-aa6c-f102bebac0c7.inter.fastq==== reads 13915646 len 150 len mean 149.9760 len stdev 0.6803 len min 49 phred 33 window-size 2000000 cycle-max 35 dups 1422354 %dup 10.2213 unique-dup seq 379035 min dup count 2 dup seq 1 117 GAAGCCGTAAGGCGAAGTATAGGGGCTGACGCCTG dup seq 2 91 AAGGGGCGGAAACCCCCTAACACTTAGCACTCATC dup seq 3 88 GACCGTACTCCCCAGGCGGAGTGCTTAATGCGTTA dup seq 4 85 TAGGAGGTAACAGCCCTGTAGTCAAAACTTCGTTC dup seq 5 82 TGGGCTACACACGTGCTACAATGGACAGAACAAAG dup seq 6 82 GGAAACCCCCTAACACTTAGCACTCATCGTTTACG dup seq 7 82 TAGGGAAGAACAAGTGCCGTTCAAATAGGGCGGCA dup seq 8 79 GAAGGCGACTCTCTGGTCTGTAACTGACGCTGAGG dup seq 9 79 TAGGAGTCTGGGCCGTGTCTCAGTCCCAGTGTGGC dup seq 10 78 TAGGGCGGCACCTTGACGGTACCTAACCAGAAAGC dup mean 4.7526 dup stddev 4.1674 qual min 11 qual max 37 qual mean 36.2654 qual stdev 3.6562 %A 27.3544 %C 22.9493 %G 23.0120 %T 26.6843 %N 0.0000 total bases 2087013294
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 1m 18s.
Summary
====/kb/module/work/tmp/47c644e3-73e1-469b-a8a4-0daee485015f.inter.fastq==== reads 13434066 len 150 len mean 149.9712 len stdev 1.0027 len min 49 phred 33 window-size 2000000 cycle-max 35 dups 1407420 %dup 10.4765 unique-dup seq 384214 min dup count 2 dup seq 1 181 CTTCGATTTCCTGCCCCATGTTATTCATACGAAGA dup seq 2 71 CTTTGTCTTCTTTTCAGCGTGATAGAAAACCTTTG dup seq 3 70 CAAAGGTTTTCTATCACGCTGAAAAGAAGACAAAG dup seq 4 70 GAAGAACCTTACCAGGTCTTGACATCCTCTGACAA dup seq 5 64 AAAGGGCGCACGGTGGATGCCTTGGCACTAGGAGC dup seq 6 64 GAAGGCGACTCTCTGGTCTGTAACTGACGCTGAGG dup seq 7 63 AGCTTCGGTGATACGTTTAGCCCCGGTACATTTTC dup seq 8 60 TAACGCATTAAGCACTCCGCCTGGGGAGTACGGTC dup seq 9 59 CCTTTGTCTTCTTTTCAGCGTGATAGAAAACCTTT dup seq 10 59 AAAGGGCTAAAATAAAGATCATTTTAGCCCTTTGT dup mean 4.6631 dup stddev 3.4490 qual min 11 qual max 37 qual mean 36.3244 qual stdev 3.4921 %A 29.9255 %C 20.4353 %G 20.3635 %T 29.2757 %N 0.0000 total bases 2014722366
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 35s.
Summary
====/kb/module/work/tmp/411cb924-4618-490f-bd9f-808dad7d7210.inter.fastq==== reads 3216256 len 145 len mean 127.6520 len stdev 26.9370 len min 26 phred 33 window-size 2000000 cycle-max 35 dups 1078400 %dup 33.5297 unique-dup seq 333648 min dup count 2 dup seq 1 1175 GTATATAGAGTGTGTGTAGTTGTTTGTCTTCAATG dup seq 2 524 ATCATATACACTCGTCCTCTATTTGTTATATAGAA dup seq 3 524 GTATAGAAGGATTGGTTGACACTACTTACAACATG dup seq 4 488 CTATCACTCTCTATCTATTACATACCCTCTCATTA dup seq 5 478 CTCTATATACAAGCCACCTGTCACAGACCATACAC dup seq 6 419 ATCTATTACATACCCTCTCATTACTCACACCTTCC dup seq 7 402 GACTAACCCTGAGCGGACGAGCCTTCCTCAGGAAA dup seq 8 398 GTATACTCCTTTAGGTGTATGGTCTGTGACAGGTG dup seq 9 396 GTATATGATATCACAACGACTTCCTTTCTAGAAGT dup seq 10 393 TCCTTATATTGACTATGATTGGGAGCGAAATAGCT dup mean 4.2321 dup stddev 7.4904 qual min 14 qual max 36 qual mean 34.5720 qual stdev 3.9582 %A 31.5189 %C 18.3916 %G 18.3877 %T 31.7018 %N 0.0000 total bases 410561381
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 39s.
Summary
====/kb/module/work/tmp/e9c34f55-ec12-4328-ae17-0c1948876577.inter.fastq==== reads 2623090 len 145 len mean 135.1172 len stdev 23.0293 len min 25 phred 33 window-size 2000000 cycle-max 35 dups 962334 %dup 36.6870 unique-dup seq 369082 min dup count 2 dup seq 1 160 ATATTAGGGTTATGGTAACACAAAATTTCAGTTTT dup seq 2 152 ATACAGGCTTGCTATATGCTTTACCGTTTCGTCGA dup seq 3 134 GGATATGAGAAGGCAGACCCGGGGAACTGAAACAT dup seq 4 132 TAATTATAGCATCCTTGAGGACAAGACCGCAACAG dup seq 5 132 CCCTAATATAGTGGTATGTATTGGGGGGGTTCGTG dup seq 6 131 ATATAAGACCTTGGGTCTTATAAACAGAACGTTCC dup seq 7 124 GTATAGAGCCGGTGACTGTCCGCAAGTATGCCAGT dup seq 8 121 GTACATAGATACTCTAAAGTCTTTACGTCATCCTC dup seq 9 118 ATCCAATACCGCGCTTACCCTATCCTCCTGCGTCC dup seq 10 117 ATATAGAAACAACAAGATTTTAAAAAATCACGCTC dup mean 3.6074 dup stddev 3.1159 qual min 14 qual max 36 qual mean 34.9500 qual stdev 3.0745 %A 27.7260 %C 22.1110 %G 22.2819 %T 27.8810 %N 0.0000 total bases 354424697
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 25s.
Summary
====/kb/module/work/tmp/9b6bfb35-4d29-428c-be8c-29717646c9ae.inter.fastq==== reads 2553846 len 145 len mean 136.0038 len stdev 21.7235 len min 27 phred 33 window-size 2000000 cycle-max 35 dups 964700 %dup 37.7744 unique-dup seq 384674 min dup count 2 dup seq 1 117 GAGTATGAGAAGGCATACCCGGGGAACTGAAACAT dup seq 2 108 CTCATACTCTATGTATTCAAGTATGGATACTACTC dup seq 3 108 TCCTTGAACCATGCGGTTCAAGGAACTATCCGGTA dup seq 4 84 ATTCAGACTCGCTTTCGCTGCGGCTCCGCATCTTC dup seq 5 83 GTGGATGCCTTGGCACTAGGAGCCGATGAAGGACG dup seq 6 82 CTCTTTAAATGGTGGCTGCTTCTAAGCCAACATCC dup seq 7 77 GGATTAGATACCCTGGTAGTCCACGCCGTAAACGA dup seq 8 76 ATATAATATAGATAGTCAATTTGTCATGAGGAGTG dup seq 9 75 CCTTAGGATTCTCTCCTCGCCTACCTGTGTCGGTT dup seq 10 74 CTTTAAATGGTGGCTGCTTCTAAGCCAACATCCTG dup mean 3.5078 dup stddev 2.5862 qual min 14 qual max 36 qual mean 34.9634 qual stdev 3.0400 %A 29.3140 %C 20.5331 %G 20.7465 %T 29.4064 %N 0.0000 total bases 347332644
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 38s.
Summary
====/kb/module/work/tmp/3f82097a-6fae-4f0f-9596-f311aca2f8a5.inter.fastq==== reads 5654876 len 145 len mean 119.6116 len stdev 31.4888 len min 25 phred 33 window-size 2000000 cycle-max 35 dups 1695503 %dup 29.9830 unique-dup seq 380315 min dup count 2 dup seq 1 212 GTTGTACGGTGATCTAAATAGAAAAAACCTTCTGG dup seq 2 157 CCATAGAGGTATCCGATAAACATCATTTTAAATAA dup seq 3 156 GGATATGAGAAGGCAGACCCGGGGAACTGAAACAT dup seq 4 153 ATATAAGACCTTGGGTCTTATAAACAGAACGTTCC dup seq 5 146 CCTCTATGGTATCCGTTCAGAAAGACAACTTGAAA dup seq 6 144 CCTTAAAGCCAATGCCAACAGGCATAAATACACAA dup seq 7 136 GTAGTAAGGTTTAACCTTTTCAATGATGAAAGAGA dup seq 8 134 GGCATCCACCGTGCGCCCTTTCTAACTTAACCGTT dup seq 9 132 GTGGATGCCTTGGCACTAGGAGCCGATGAAGGACG dup seq 10 129 GTCTTGTTCACTGTCATAATGAAATTTCCATTTTG dup mean 5.4582 dup stddev 4.1470 qual min 14 qual max 36 qual mean 34.4528 qual stdev 4.1931 %A 28.4888 %C 21.4701 %G 21.6205 %T 28.4206 %N 0.0000 total bases 676388632
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 25s.
Summary
====/kb/module/work/tmp/131d46de-95f9-4a2f-bb6a-c62c5c8e4715.inter.fastq==== reads 3109768 len 150 len mean 145.8510 len stdev 14.7268 len min 25 phred 33 window-size 2000000 cycle-max 35 dups 1562029 %dup 50.2298 unique-dup seq 444087 min dup count 2 dup seq 1 175 GCATATGGCTGCCCATTCTTCTTGCGCTTTTTCAC dup seq 2 166 GTGTAAAGGCACAAGGGAGCTTGACTGCGAGACCT dup seq 3 157 GGCATCCACCGTGCGCCCTTTCTAACTTAACCGTT dup seq 4 152 CCTTTACACTCTGCGAATGATTTCCAACCATTCTG dup seq 5 151 CCTTAGGATTCTCTCCTCGCCTACCTGTGTCGGTT dup seq 6 142 GGATTAGATACCCTGGTAGTCCACGCCGTAAACGA dup seq 7 137 GTGGATGCCTTGGCACTAGGAGCCGATGAAGGACG dup seq 8 130 ATCTAATCCTGTTCGCTCCCCACGCTTTCGCTCCT dup seq 9 128 GCCATATGCTGTGGCCGAGAATATATTGGATCGAA dup seq 10 125 ATCCTAAGGTGATCGAGAGAACTCTCGTTAAGGAA dup mean 4.5174 dup stddev 3.7041 qual min 12 qual max 34 qual mean 33.3919 qual stdev 3.1564 %A 26.4220 %C 23.2847 %G 23.4239 %T 26.8694 %N 0.0000 total bases 453562695
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 25s.
Summary
====/kb/module/work/tmp/4f1af612-350e-4b55-8fa5-91b361d302ec.inter.fastq==== reads 3256488 len 150 len mean 145.8067 len stdev 14.7426 len min 26 phred 33 window-size 2000000 cycle-max 35 dups 1678776 %dup 51.5517 unique-dup seq 436866 min dup count 2 dup seq 1 142 TGTATATACAGAGTAAACTTAAAATGTTTTTGAAC dup seq 2 135 GCATATGGCTGCCCATTCTTCTTGCGCTTTTTCAC dup seq 3 131 GGATTAGATACCCTGGTAGTCCACGCCGTAAACGA dup seq 4 125 CCTTAGGATTCTCTCCTCGCCTACCTGTGTCGGTT dup seq 5 125 GTGGATGCCTTGGCACTAGGAGCCGATGAAGGACG dup seq 6 125 GTATATACATGTCTGATTTTAAAAAGAATTCTTTA dup seq 7 125 GTGTAAAGGCACAAGGGAGCTTGACTGCGAGACCT dup seq 8 119 ATGTACACCTCCTTTCCTATAACTCTATTATAGCA dup seq 9 117 CCTTTACACTCTGCGAATGATTTCCAACCATTCTG dup seq 10 116 CCCTAACACTTAGCACTCATCGTTTACGGCGTGGA dup mean 4.8428 dup stddev 4.5307 qual min 12 qual max 34 qual mean 33.4165 qual stdev 3.0891 %A 26.8679 %C 22.8486 %G 22.9648 %T 27.3186 %N 0.0000 total bases 474817784
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 25s.
Summary
====/kb/module/work/tmp/fc838e22-f8ca-4c37-b664-2a1358c17a15.inter.fastq==== reads 2940868 len 150 len mean 146.1500 len stdev 13.9461 len min 25 phred 33 window-size 2000000 cycle-max 35 dups 1469947 %dup 49.9834 unique-dup seq 450592 min dup count 2 dup seq 1 142 GCATATGGCTGCCCATTCTTCTTGCGCTTTTTCAC dup seq 2 133 GGCATCCACCGTGCGCCCTTTCTAACTTAACCGTT dup seq 3 128 GTCCTAAGGTAGCGAAATTCCTTGTCGGGTAAGTT dup seq 4 120 GTGGATGCCTTGGCACTAGGAGCCGATGAAGGACG dup seq 5 116 CCTTAGGATTCTCTCCTCGCCTACCTGTGTCGGTT dup seq 6 116 CTCCTACCACTGTTCGAAGAACAGTCCGCAGCTTC dup seq 7 113 GGATTAGATACCCTGGTAGTCCACGCCGTAAACGA dup seq 8 108 GCCATATGCTGTGGCCGAGAATATATTGGATCGAA dup seq 9 106 CTCTAAGGTGACTGCCGGTGACAAACCGGAGGAAG dup seq 10 106 GCCCAACCCTTGGGACCGACTACAGCCCCAGGATG dup mean 4.2623 dup stddev 3.2825 qual min 12 qual max 34 qual mean 33.3024 qual stdev 3.3456 %A 26.3296 %C 23.3758 %G 23.5124 %T 26.7823 %N 0.0000 total bases 429807891
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 27s.
Summary
====/kb/module/work/tmp/eec19307-fc64-44c8-b657-d5bb917b0784.inter.fastq==== reads 3602910 len 145 len mean 128.8090 len stdev 27.2010 len min 25 phred 33 window-size 2000000 cycle-max 35 dups 1256509 %dup 34.8748 unique-dup seq 381022 min dup count 2 dup seq 1 167 ATATAAGACCTTGGGTCTTATAAACAGAACGTTCC dup seq 2 151 GGATATGAGAAGGCAGACCCGGGGAACTGAAACAT dup seq 3 146 CTCATATCCTATGTATTCAGATATGGATACCACTC dup seq 4 141 ATTCAGACTCGCTTTCGCTGCGGCTCCGCATCTTC dup seq 5 127 GTCTTATATTCCGTTAAAATCCTTAGAAAGGAGGT dup seq 6 123 GTATTAGGCACGCCGCCAGCGTTCGTCCTGAGCCA dup seq 7 121 CTTATAAACAGAACGTTCCCTGTCTTGTTTAGTTT dup seq 8 120 TTTAAAGAGTGCGTAATAGCTCACTGGTCGAGTGA dup seq 9 118 TTCTATGGCAGTGCGTTCTACGATGGCGGTCGTAG dup seq 10 117 TTCTATGACAGTGCGTTCTACGATGGCAGTCGTAG dup mean 4.2977 dup stddev 3.3715 qual min 14 qual max 36 qual mean 34.6342 qual stdev 3.8341 %A 27.8820 %C 22.0702 %G 22.2151 %T 27.8327 %N 0.0000 total bases 464087227
Run the EA Utils program fastq-stats to compute and print basic summary stats.
This app completed without errors in 25s.
Summary
====/kb/module/work/tmp/33cd5398-fe30-4558-b99c-18e15e5c580e.inter.fastq==== reads 2730734 len 150 len mean 144.2646 len stdev 18.7953 len min 25 phred 33 window-size 2000000 cycle-max 35 dups 1222654 %dup 44.7738 unique-dup seq 418457 min dup count 2 dup seq 1 160 CCTCTATGGTATCCGTTCAGAAAGACAACTTGAAA dup seq 2 148 GGATTAGATACCCTGGTAGTCCACGCCGTAAACGA dup seq 3 140 GTGTAAAGGCACAAGGGAGCTTGACTGCGAGACCT dup seq 4 133 GTGGATGCCTTGGCACTAGGAGCCGATGAAGGACG dup seq 5 128 CCTTAGGATTCTCTCCTCGCCTACCTGTGTCGGTT dup seq 6 124 CCATAGAGGTATCCGATAAACATCATTTTAAATAA dup seq 7 124 GTTGTACGGTGATCTAAATAGAAAAAACCTTCTGG dup seq 8 121 GAATAGGCCCAGGCGACTGTTTAGCAAAAACACAG dup seq 9 121 ATTCAGACTCGCTTTCGCTGCGGCTCCGCATCTTC dup seq 10 117 GGCATCCACCGTGCGCCCTTTCTAACTTAACCGTT dup mean 3.9218 dup stddev 3.1018 qual min 12 qual max 37 qual mean 36.0698 qual stdev 4.4391 %A 27.3605 %C 22.3158 %G 22.4344 %T 27.8892 %N 0.0000 total bases 393948124
A quality control application for high throughput sequence data.
This app completed without errors in 2m 26s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB0019_RQC_prinseq_100923_157793_110_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB0019_RQC_prinseq_100923_157793_110_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 2m 18s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB0021_RQC_prinseq_100923_157793_103_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB0021_RQC_prinseq_100923_157793_103_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 3m 7s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB0027_RQC_prinseq_100923_157793_118_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB0027_RQC_prinseq_100923_157793_118_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 2m 57s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB0069_RQC_prinseq_100923_157793_111_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB0069_RQC_prinseq_100923_157793_111_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1m 36s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB0676_RQC_prinseq_101023_157793_132_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB0676_RQC_prinseq_101023_157793_132_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1m 14s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB0687_RQC_prinseq_100923_157793_94_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB0687_RQC_prinseq_100923_157793_94_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1m 13s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB0729_RQC_prinseq_101023_157793_131_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB0729_RQC_prinseq_101023_157793_131_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1m 37s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB0893_RQC_prinseq_100923_157793_127_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB0893_RQC_prinseq_100923_157793_127_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1m 17s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB0902_RQC_prinseq_100923_157793_98_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB0902_RQC_prinseq_100923_157793_98_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1m 18s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB0904_RQC_prinseq_100923_157793_102_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB0904_RQC_prinseq_100923_157793_102_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1m 15s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB0909_RQC_prinseq_101023_157793_139_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB0909_RQC_prinseq_101023_157793_139_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1m 16s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB0912_RQC_prinseq_101023_157793_143_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB0912_RQC_prinseq_101023_157793_143_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1m 14s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CB0742_RQCfilter_prinseq_102423_157793_306_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • CB0742_RQCfilter_prinseq_102423_157793_306_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
Assemble reads using the SPAdes assembler.
This app completed without errors in 7m 19s.
Objects
Created Object Name Type Description
CB0742_SPAdes.Assembly_kmer Assembly Assembled contigs
Summary
Assembly saved to: fvieira:narrative_1695752967822/CB0742_SPAdes.Assembly_kmer Assembled into 36 contigs. Avg Length: 117204.97222222222 bp. Contig Length Distribution (# of contigs -- min to max basepairs): 23 -- 506.0 to 72237.1 bp 2 -- 72237.1 to 143968.2 bp 2 -- 143968.2 to 215699.30000000002 bp 4 -- 215699.30000000002 to 287430.4 bp 3 -- 287430.4 to 359161.5 bp 0 -- 359161.5 to 430892.60000000003 bp 1 -- 430892.60000000003 to 502623.70000000007 bp 0 -- 502623.70000000007 to 574354.8 bp 0 -- 574354.8 to 646085.9 bp 1 -- 646085.9 to 717817.0 bp
Links
Assemble reads using the SPAdes assembler.
This app completed without errors in 55m 13s.
Objects
Created Object Name Type Description
CB0019_SPAdes.Assembly_kmer Assembly Assembled contigs
Summary
Assembly saved to: fvieira:narrative_1695752967822/CB0019_SPAdes.Assembly_kmer Assembled into 21 contigs. Avg Length: 199928.09523809524 bp. Contig Length Distribution (# of contigs -- min to max basepairs): 15 -- 626.0 to 114224.8 bp 0 -- 114224.8 to 227823.6 bp 1 -- 227823.6 to 341422.4 bp 1 -- 341422.4 to 455021.2 bp 1 -- 455021.2 to 568620.0 bp 1 -- 568620.0 to 682218.8 bp 0 -- 682218.8 to 795817.6 bp 0 -- 795817.6 to 909416.4 bp 0 -- 909416.4 to 1023015.2000000001 bp 2 -- 1023015.2000000001 to 1136614.0 bp
Links
Assemble reads using the SPAdes assembler.
This app completed without errors in 19m 49s.
Objects
Created Object Name Type Description
CB0676_SPAdes.Assembly_kmer Assembly Assembled contigs
Summary
Assembly saved to: fvieira:narrative_1695752967822/CB0676_SPAdes.Assembly_kmer Assembled into 87 contigs. Avg Length: 63811.35632183908 bp. Contig Length Distribution (# of contigs -- min to max basepairs): 66 -- 504.0 to 99817.0 bp 12 -- 99817.0 to 199130.0 bp 4 -- 199130.0 to 298443.0 bp 2 -- 298443.0 to 397756.0 bp 2 -- 397756.0 to 497069.0 bp 0 -- 497069.0 to 596382.0 bp 0 -- 596382.0 to 695695.0 bp 0 -- 695695.0 to 795008.0 bp 0 -- 795008.0 to 894321.0 bp 1 -- 894321.0 to 993634.0 bp
Links
Assemble reads using the SPAdes assembler.
This app completed without errors in 25m 3s.
Objects
Created Object Name Type Description
CB021_SPAdes.Assembly_kmer Assembly Assembled contigs
Summary
Assembly saved to: fvieira:narrative_1695752967822/CB021_SPAdes.Assembly_kmer Assembled into 21 contigs. Avg Length: 199920.95238095237 bp. Contig Length Distribution (# of contigs -- min to max basepairs): 15 -- 626.0 to 114224.8 bp 0 -- 114224.8 to 227823.6 bp 1 -- 227823.6 to 341422.4 bp 1 -- 341422.4 to 455021.2 bp 1 -- 455021.2 to 568620.0 bp 1 -- 568620.0 to 682218.8 bp 0 -- 682218.8 to 795817.6 bp 0 -- 795817.6 to 909416.4 bp 0 -- 909416.4 to 1023015.2000000001 bp 2 -- 1023015.2000000001 to 1136614.0 bp
Links
Assemble reads using the SPAdes assembler.
This app completed without errors in 27m 14s.
Objects
Created Object Name Type Description
CB27_SPAdes.Assembly_kmer Assembly Assembled contigs
Summary
Assembly saved to: fvieira:narrative_1695752967822/CB27_SPAdes.Assembly_kmer Assembled into 22 contigs. Avg Length: 190693.13636363635 bp. Contig Length Distribution (# of contigs -- min to max basepairs): 16 -- 626.0 to 114224.8 bp 0 -- 114224.8 to 227823.6 bp 1 -- 227823.6 to 341422.4 bp 1 -- 341422.4 to 455021.2 bp 1 -- 455021.2 to 568620.0 bp 1 -- 568620.0 to 682218.8 bp 0 -- 682218.8 to 795817.6 bp 0 -- 795817.6 to 909416.4 bp 0 -- 909416.4 to 1023015.2000000001 bp 2 -- 1023015.2000000001 to 1136614.0 bp
Links
Assemble reads using the SPAdes assembler.
This app completed without errors in 30m 2s.
Objects
Created Object Name Type Description
CB069_SPAdes.Assembly_kmer Assembly Assembled contigs
Summary
Assembly saved to: fvieira:narrative_1695752967822/CB069_SPAdes.Assembly_kmer Assembled into 59 contigs. Avg Length: 66215.93220338984 bp. Contig Length Distribution (# of contigs -- min to max basepairs): 34 -- 504.0 to 34950.1 bp 4 -- 34950.1 to 69396.2 bp 6 -- 69396.2 to 103842.29999999999 bp 3 -- 103842.29999999999 to 138288.4 bp 5 -- 138288.4 to 172734.5 bp 1 -- 172734.5 to 207180.59999999998 bp 1 -- 207180.59999999998 to 241626.69999999998 bp 4 -- 241626.69999999998 to 276072.8 bp 0 -- 276072.8 to 310518.89999999997 bp 1 -- 310518.89999999997 to 344965.0 bp
Links
Assemble reads using the SPAdes assembler.
This app completed without errors in 8m 11s.
Objects
Created Object Name Type Description
CB0687_SPAdes.Assembly_kmer Assembly Assembled contigs
Summary
Assembly saved to: fvieira:narrative_1695752967822/CB0687_SPAdes.Assembly_kmer Assembled into 14 contigs. Avg Length: 292298.78571428574 bp. Contig Length Distribution (# of contigs -- min to max basepairs): 9 -- 945.0 to 112878.3 bp 0 -- 112878.3 to 224811.6 bp 1 -- 224811.6 to 336744.9 bp 1 -- 336744.9 to 448678.2 bp 0 -- 448678.2 to 560611.5 bp 0 -- 560611.5 to 672544.8 bp 0 -- 672544.8 to 784478.1 bp 0 -- 784478.1 to 896411.4 bp 1 -- 896411.4 to 1008344.7000000001 bp 2 -- 1008344.7000000001 to 1120278.0 bp
Links
Assemble reads using the SPAdes assembler.
This app completed without errors in 7m 50s.
Objects
Created Object Name Type Description
CB0729_SPAdes.Assembly_kmer Assembly Assembled contigs
Summary
Assembly saved to: fvieira:narrative_1695752967822/CB0729_SPAdes.Assembly_kmer Assembled into 19 contigs. Avg Length: 193532.52631578947 bp. Contig Length Distribution (# of contigs -- min to max basepairs): 14 -- 516.0 to 164496.9 bp 3 -- 164496.9 to 328477.8 bp 0 -- 328477.8 to 492458.69999999995 bp 0 -- 492458.69999999995 to 656439.6 bp 0 -- 656439.6 to 820420.5 bp 1 -- 820420.5 to 984401.3999999999 bp 0 -- 984401.3999999999 to 1148382.3 bp 0 -- 1148382.3 to 1312363.2 bp 0 -- 1312363.2 to 1476344.0999999999 bp 1 -- 1476344.0999999999 to 1640325.0 bp
Links
Assemble reads using the SPAdes assembler.
This app completed without errors in 13m 10s.
Objects
Created Object Name Type Description
CB0893_SPAdes.Assembly_kmer Assembly Assembled contigs
Summary
Assembly saved to: fvieira:narrative_1695752967822/CB0893_SPAdes.Assembly_kmer Assembled into 42 contigs. Avg Length: 100373.19047619047 bp. Contig Length Distribution (# of contigs -- min to max basepairs): 31 -- 562.0 to 107564.2 bp 4 -- 107564.2 to 214566.4 bp 4 -- 214566.4 to 321568.6 bp 2 -- 321568.6 to 428570.8 bp 0 -- 428570.8 to 535573.0 bp 0 -- 535573.0 to 642575.2 bp 0 -- 642575.2 to 749577.4 bp 0 -- 749577.4 to 856579.6 bp 0 -- 856579.6 to 963581.7999999999 bp 1 -- 963581.7999999999 to 1070584.0 bp
Links
Assemble reads using the SPAdes assembler.
This app completed without errors in 8m 35s.
Objects
Created Object Name Type Description
CB902_SPAdes.Assembly_kmer Assembly Assembled contigs
Summary
Assembly saved to: fvieira:narrative_1695752967822/CB902_SPAdes.Assembly_kmer Assembled into 40 contigs. Avg Length: 101534.425 bp. Contig Length Distribution (# of contigs -- min to max basepairs): 25 -- 590.0 to 56787.7 bp 1 -- 56787.7 to 112985.4 bp 4 -- 112985.4 to 169183.09999999998 bp 1 -- 169183.09999999998 to 225380.8 bp 5 -- 225380.8 to 281578.5 bp 1 -- 281578.5 to 337776.19999999995 bp 1 -- 337776.19999999995 to 393973.89999999997 bp 1 -- 393973.89999999997 to 450171.6 bp 0 -- 450171.6 to 506369.3 bp 1 -- 506369.3 to 562567.0 bp
Links
Assemble reads using the SPAdes assembler.
This app completed without errors in 9m 12s.
Objects
Created Object Name Type Description
CB0904_SPAdes.Assembly_kmer Assembly Assembled contigs
Summary
Assembly saved to: fvieira:narrative_1695752967822/CB0904_SPAdes.Assembly_kmer Assembled into 43 contigs. Avg Length: 94444.44186046511 bp. Contig Length Distribution (# of contigs -- min to max basepairs): 28 -- 590.0 to 56787.7 bp 1 -- 56787.7 to 112985.4 bp 4 -- 112985.4 to 169183.09999999998 bp 2 -- 169183.09999999998 to 225380.8 bp 4 -- 225380.8 to 281578.5 bp 1 -- 281578.5 to 337776.19999999995 bp 1 -- 337776.19999999995 to 393973.89999999997 bp 1 -- 393973.89999999997 to 450171.6 bp 0 -- 450171.6 to 506369.3 bp 1 -- 506369.3 to 562567.0 bp
Links
Assemble reads using the SPAdes assembler.
This app completed without errors in 8m 8s.
Objects
Created Object Name Type Description
CB0909_SPAdes.Assembly_kmer Assembly Assembled contigs
Summary
Assembly saved to: fvieira:narrative_1695752967822/CB0909_SPAdes.Assembly_kmer Assembled into 34 contigs. Avg Length: 119508.23529411765 bp. Contig Length Distribution (# of contigs -- min to max basepairs): 21 -- 590.0 to 56787.7 bp 1 -- 56787.7 to 112985.4 bp 2 -- 112985.4 to 169183.09999999998 bp 1 -- 169183.09999999998 to 225380.8 bp 4 -- 225380.8 to 281578.5 bp 1 -- 281578.5 to 337776.19999999995 bp 1 -- 337776.19999999995 to 393973.89999999997 bp 1 -- 393973.89999999997 to 450171.6 bp 0 -- 450171.6 to 506369.3 bp 2 -- 506369.3 to 562567.0 bp
Links
Assemble reads using the SPAdes assembler.
This app completed without errors in 9m 8s.
Objects
Created Object Name Type Description
CB0912_SPAdes.Assembly_kmer Assembly Assembled contigs
Summary
Assembly saved to: fvieira:narrative_1695752967822/CB0912_SPAdes.Assembly_kmer Assembled into 39 contigs. Avg Length: 104194.43589743589 bp. Contig Length Distribution (# of contigs -- min to max basepairs): 20 -- 590.0 to 45368.8 bp 5 -- 45368.8 to 90147.6 bp 2 -- 90147.6 to 134926.40000000002 bp 2 -- 134926.40000000002 to 179705.2 bp 1 -- 179705.2 to 224484.0 bp 4 -- 224484.0 to 269262.80000000005 bp 1 -- 269262.80000000005 to 314041.60000000003 bp 0 -- 314041.60000000003 to 358820.4 bp 3 -- 358820.4 to 403599.2 bp 1 -- 403599.2 to 448378.0 bp
Links
Run QUAST (QUality ASsessment Tool) on a set of Assemblies to assess their quality.
This app completed without errors in 2m 7s.
Summary
All statistics are based on contigs of size >= 500 bp, unless otherwise noted (e.g., "# contigs (>= 0 bp)" and "Total length (>= 0 bp)" include all contigs). Assembly CB0019_SPAdes.Assembly_kmer CB021_SPAdes.Assembly_kmer CB27_SPAdes.Assembly_kmer CB069_SPAdes.Assembly_kmer CB0676_SPAdes.Assembly_kmer CB0687_SPAdes.Assembly_kmer CB0729_SPAdes.Assembly_kmer CB0893_SPAdes.Assembly_kmer CB902_SPAdes.Assembly_kmer CB0904_SPAdes.Assembly_kmer CB0909_SPAdes.Assembly_kmer CB0912_SPAdes.Assembly_kmer # contigs (>= 0 bp) 21 21 22 59 87 14 19 42 40 43 34 39 # contigs (>= 1000 bp) 18 18 19 48 68 13 15 37 34 36 30 34 # contigs (>= 10000 bp) 12 12 13 36 38 10 13 28 22 25 20 23 # contigs (>= 100000 bp) 6 6 6 16 21 5 6 11 14 14 12 14 # contigs (>= 1000000 bp) 2 2 2 0 0 3 1 1 0 0 0 0 Total length (>= 0 bp) 4198490 4198340 4195249 3906740 5551588 4092183 3677118 4215674 4061377 4061111 4063280 4063583 Total length (>= 1000 bp) 4196153 4196003 4192912 3899160 5538852 4091238 3674091 4211804 4056997 4056064 4060566 4060462 Total length (>= 10000 bp) 4179078 4179075 4175795 3854693 5444025 4076539 3671860 4185265 4011091 4018289 4031858 4028523 Total length (>= 100000 bp) 3992093 3992093 3955283 3022204 4911170 3893651 3390582 3457569 3684394 3617330 3705609 3666061 Total length (>= 1000000 bp) 2221724 2221724 2221724 0 0 3158798 1640325 1070584 0 0 0 0 # contigs 21 21 22 59 87 14 19 42 40 43 34 39 Largest contig 1136614 1136614 1136614 344965 993634 1120278 1640325 1070584 562567 562567 562567 448378 Total length 4198490 4198340 4195249 3906740 5551588 4092183 3677118 4215674 4061377 4061111 4063280 4063583 GC (%) 45.84 45.84 45.84 40.88 34.95 46.26 41.68 43.36 46.07 46.07 46.08 46.07 N50 1085110 1085110 1085110 164068 209859 1030512 871191 271969 260551 260568 295515 261384 N75 488275 488275 488275 102780 139059 1008008 308264 149653 206392 206392 225393 206392 L50 2 2 2 8 7 2 2 5 6 6 5 6 L75 4 4 4 16 15 3 3 9 10 10 9 10 # N's per 100 kbp 2.26 2.19 2.24 9.75 16.19 4.81 3.48 4.58 5.05 7.83 5.37 5.04
Links
Runs the CheckM lineage workflow to assess the genome quality of isolates, single cells, or genome bins from metagenome assemblies through comparison to an existing database of genomes.
This app completed without errors in 5m 2s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CheckM_summary_table.tsv.zip - TSV Summary Table from CheckM
  • full_output.zip - Full output of CheckM
  • plots.zip - Output plots from CheckM
Runs the CheckM lineage workflow to assess the genome quality of isolates, single cells, or genome bins from metagenome assemblies through comparison to an existing database of genomes.
This app completed without errors in 5m 10s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CheckM_summary_table.tsv.zip - TSV Summary Table from CheckM
  • full_output.zip - Full output of CheckM
  • plots.zip - Output plots from CheckM
Runs the CheckM lineage workflow to assess the genome quality of isolates, single cells, or genome bins from metagenome assemblies through comparison to an existing database of genomes.
This app completed without errors in 5m 20s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CheckM_summary_table.tsv.zip - TSV Summary Table from CheckM
  • full_output.zip - Full output of CheckM
  • plots.zip - Output plots from CheckM
Runs the CheckM lineage workflow to assess the genome quality of isolates, single cells, or genome bins from metagenome assemblies through comparison to an existing database of genomes.
This app completed without errors in 5m 7s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CheckM_summary_table.tsv.zip - TSV Summary Table from CheckM
  • full_output.zip - Full output of CheckM
  • plots.zip - Output plots from CheckM
Runs the CheckM lineage workflow to assess the genome quality of isolates, single cells, or genome bins from metagenome assemblies through comparison to an existing database of genomes.
This app completed without errors in 4m 42s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CheckM_summary_table.tsv.zip - TSV Summary Table from CheckM
  • full_output.zip - Full output of CheckM
  • plots.zip - Output plots from CheckM
Runs the CheckM lineage workflow to assess the genome quality of isolates, single cells, or genome bins from metagenome assemblies through comparison to an existing database of genomes.
This app completed without errors in 5m 1s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CheckM_summary_table.tsv.zip - TSV Summary Table from CheckM
  • full_output.zip - Full output of CheckM
  • plots.zip - Output plots from CheckM
Runs the CheckM lineage workflow to assess the genome quality of isolates, single cells, or genome bins from metagenome assemblies through comparison to an existing database of genomes.
This app completed without errors in 4m 51s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CheckM_summary_table.tsv.zip - TSV Summary Table from CheckM
  • full_output.zip - Full output of CheckM
  • plots.zip - Output plots from CheckM
Runs the CheckM lineage workflow to assess the genome quality of isolates, single cells, or genome bins from metagenome assemblies through comparison to an existing database of genomes.
This app completed without errors in 5m 23s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CheckM_summary_table.tsv.zip - TSV Summary Table from CheckM
  • full_output.zip - Full output of CheckM
  • plots.zip - Output plots from CheckM
Runs the CheckM lineage workflow to assess the genome quality of isolates, single cells, or genome bins from metagenome assemblies through comparison to an existing database of genomes.
This app completed without errors in 5m 11s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CheckM_summary_table.tsv.zip - TSV Summary Table from CheckM
  • full_output.zip - Full output of CheckM
  • plots.zip - Output plots from CheckM
Runs the CheckM lineage workflow to assess the genome quality of isolates, single cells, or genome bins from metagenome assemblies through comparison to an existing database of genomes.
This app completed without errors in 5m 0s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CheckM_summary_table.tsv.zip - TSV Summary Table from CheckM
  • full_output.zip - Full output of CheckM
  • plots.zip - Output plots from CheckM
Runs the CheckM lineage workflow to assess the genome quality of isolates, single cells, or genome bins from metagenome assemblies through comparison to an existing database of genomes.
This app completed without errors in 5m 19s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CheckM_summary_table.tsv.zip - TSV Summary Table from CheckM
  • full_output.zip - Full output of CheckM
  • plots.zip - Output plots from CheckM
Runs the CheckM lineage workflow to assess the genome quality of isolates, single cells, or genome bins from metagenome assemblies through comparison to an existing database of genomes.
This app completed without errors in 5m 7s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CheckM_summary_table.tsv.zip - TSV Summary Table from CheckM
  • full_output.zip - Full output of CheckM
  • plots.zip - Output plots from CheckM
Runs the CheckM lineage workflow to assess the genome quality of isolates, single cells, or genome bins from metagenome assemblies through comparison to an existing database of genomes.
This app completed without errors in 5m 32s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • CheckM_summary_table.tsv.zip - TSV Summary Table from CheckM
  • full_output.zip - Full output of CheckM
  • plots.zip - Output plots from CheckM

4. Genome Annotation

The genomes were annotated using Prokka (v1.14.5) [17], revealing the largest genome size of 5,551,588 bp from Bacillus cereus, and the smallest genome size of 3,677,118 bp from Bacillus safensis. The number of predicted genes among the genomes ranged from 3,742 to 5,644.

Annotate Assembly and Re-annotate Genomes with Prokka annotation pipeline.
This app completed without errors in 3m 42s.
Objects
Created Object Name Type Description
CB0019_genome_101323 Genome Annotated Genome
Summary
Annotated Genome saved to: fvieira:narrative_1695752967822/CB0019_genome Number of genes predicted: 4226 Number of protein coding genes: 4133 Number of genes with non-hypothetical function: 2744 Number of genes with EC-number: 995 Number of genes with Seed Subsystem Ontology: 0 Average protein length: 300 aa.
Annotate Assembly and Re-annotate Genomes with Prokka annotation pipeline.
This app completed without errors in 2m 17s.
Objects
Created Object Name Type Description
CB0021_genome_101323 Genome Annotated Genome
Summary
Annotated Genome saved to: fvieira:narrative_1695752967822/CB0021_genome Number of genes predicted: 4222 Number of protein coding genes: 4129 Number of genes with non-hypothetical function: 2744 Number of genes with EC-number: 995 Number of genes with Seed Subsystem Ontology: 0 Average protein length: 300 aa.
Annotate Assembly and Re-annotate Genomes with Prokka annotation pipeline.
This app completed without errors in 2m 21s.
Objects
Created Object Name Type Description
CB0027_genome_101323 Genome Annotated Genome
Summary
Annotated Genome saved to: fvieira:narrative_1695752967822/CB0027_genome Number of genes predicted: 4222 Number of protein coding genes: 4129 Number of genes with non-hypothetical function: 2743 Number of genes with EC-number: 995 Number of genes with Seed Subsystem Ontology: 0 Average protein length: 300 aa.
Annotate Assembly and Re-annotate Genomes with Prokka annotation pipeline.
This app completed without errors in 2m 19s.
Objects
Created Object Name Type Description
CB0069_genome_101323 Genome Annotated Genome
Summary
Annotated Genome saved to: fvieira:narrative_1695752967822/CB0069_g Number of genes predicted: 4103 Number of protein coding genes: 4022 Number of genes with non-hypothetical function: 2588 Number of genes with EC-number: 956 Number of genes with Seed Subsystem Ontology: 0 Average protein length: 283 aa.
Annotate Assembly and Re-annotate Genomes with Prokka annotation pipeline.
This app completed without errors in 2m 49s.
Objects
Created Object Name Type Description
CB0676_genome_101323 Genome Annotated Genome
Summary
Annotated Genome saved to: fvieira:narrative_1695752967822/CB0676_genome Number of genes predicted: 5644 Number of protein coding genes: 5538 Number of genes with non-hypothetical function: 3090 Number of genes with EC-number: 1180 Number of genes with Seed Subsystem Ontology: 0 Average protein length: 276 aa.
Annotate Assembly and Re-annotate Genomes with Prokka annotation pipeline.
This app completed without errors in 3m 36s.
Objects
Created Object Name Type Description
CB0687_genome_101323 Genome Annotated Genome
Summary
Annotated Genome saved to: fvieira:narrative_1695752967822/CB0687_genome Number of genes predicted: 4097 Number of protein coding genes: 4007 Number of genes with non-hypothetical function: 2713 Number of genes with EC-number: 984 Number of genes with Seed Subsystem Ontology: 0 Average protein length: 302 aa.
Annotate Assembly and Re-annotate Genomes with Prokka annotation pipeline.
This app completed without errors in 2m 14s.
Objects
Created Object Name Type Description
CB0729_genome_101323 Genome Annotated Genome
Summary
Annotated Genome saved to: fvieira:narrative_1695752967822/CB0729_genome_101323 Number of genes predicted: 3742 Number of protein coding genes: 3666 Number of genes with non-hypothetical function: 2585 Number of genes with EC-number: 970 Number of genes with Seed Subsystem Ontology: 0 Average protein length: 295 aa.
Annotate Assembly and Re-annotate Genomes with Prokka annotation pipeline.
This app completed without errors in 2m 18s.
Objects
Created Object Name Type Description
CB0893_genome_101323 Genome Annotated Genome
Summary
Annotated Genome saved to: fvieira:narrative_1695752967822/CB0893_genome_101323 Number of genes predicted: 4349 Number of protein coding genes: 4255 Number of genes with non-hypothetical function: 2907 Number of genes with EC-number: 1034 Number of genes with Seed Subsystem Ontology: 0 Average protein length: 289 aa.
Annotate Assembly and Re-annotate Genomes with Prokka annotation pipeline.
This app completed without errors in 2m 15s.
Objects
Created Object Name Type Description
CB0902_genome_101323 Genome Annotated Genome
Summary
Annotated Genome saved to: fvieira:narrative_1695752967822/CB0902_genome_1013 Number of genes predicted: 4032 Number of protein coding genes: 3942 Number of genes with non-hypothetical function: 2704 Number of genes with EC-number: 978 Number of genes with Seed Subsystem Ontology: 0 Average protein length: 302 aa.
Annotate Assembly and Re-annotate Genomes with Prokka annotation pipeline.
This app completed without errors in 3m 37s.
Objects
Created Object Name Type Description
CB0904_genome_101323 Genome Annotated Genome
Summary
Annotated Genome saved to: fvieira:narrative_1695752967822/CB0904_genome_101323 Number of genes predicted: 4031 Number of protein coding genes: 3941 Number of genes with non-hypothetical function: 2704 Number of genes with EC-number: 978 Number of genes with Seed Subsystem Ontology: 0 Average protein length: 302 aa.
Annotate Assembly and Re-annotate Genomes with Prokka annotation pipeline.
This app completed without errors in 2m 15s.
Objects
Created Object Name Type Description
CB0909_genome_101323 Genome Annotated Genome
Summary
Annotated Genome saved to: fvieira:narrative_1695752967822/CB0909_genome_101323 Number of genes predicted: 4032 Number of protein coding genes: 3942 Number of genes with non-hypothetical function: 2705 Number of genes with EC-number: 978 Number of genes with Seed Subsystem Ontology: 0 Average protein length: 303 aa.
Annotate Assembly and Re-annotate Genomes with Prokka annotation pipeline.
This app completed without errors in 2m 10s.
Objects
Created Object Name Type Description
CB0912_genome_101323 Genome Annotated Genome
Summary
Annotated Genome saved to: fvieira:narrative_1695752967822/CB0912_genome_101323 Number of genes predicted: 4035 Number of protein coding genes: 3945 Number of genes with non-hypothetical function: 2706 Number of genes with EC-number: 978 Number of genes with Seed Subsystem Ontology: 0 Average protein length: 302 aa.
Annotate Assembly and Re-annotate Genomes with Prokka annotation pipeline.
This app completed without errors in 2m 21s.
Objects
Created Object Name Type Description
CB0742_genome Genome Annotated Genome
Summary
Annotated Genome saved to: fvieira:narrative_1695752967822/CB0742_genome Number of genes predicted: 4353 Number of protein coding genes: 4258 Number of genes with non-hypothetical function: 2909 Number of genes with EC-number: 1035 Number of genes with Seed Subsystem Ontology: 0 Average protein length: 291 aa.

5. Taxonomic Identification

Taxonomic identification was performed on GTDB-Tk (v2.3.2) [18]. A Bacillus phylogenetic tree with closely related species was constructed using FastTree2 (v2.1.11) [19] through SpeciesTree (v2.2.0), on KBase, and annotated on iTOL (v6.8.1) [20]

Obtain objective taxonomic assignments for bacterial and archaeal genomes based on the Genome Taxonomy Database (GTDB)
This app completed without errors in 44m 40s.
Objects
Created Object Name Type Description
GCF_000007825.1 Genome Taxonomy unchanged, taxon_assignment added GTDB
GCF_000007845.1 Genome Taxonomy unchanged, taxon_assignment added GTDB
GCF_000712595.1 Genome Taxonomy unchanged, taxon_assignment added GTDB
GCF_000742675.1 Genome Taxonomy unchanged, taxon_assignment added GTDB
GCF_000746135.1 Genome Taxonomy unchanged, taxon_assignment added GTDB
GCF_000832605.1 Genome Taxonomy unchanged, taxon_assignment added GTDB
GCF_000948175.1 Genome Taxonomy unchanged, taxon_assignment added GTDB
GCF_001278705.1 Genome Taxonomy unchanged, taxon_assignment added GTDB
GCF_001420855.1 Genome Taxonomy unchanged, taxon_assignment added GTDB
GCF_001578205.1 Genome Taxonomy unchanged, taxon_assignment added GTDB
GCF_001591565.1 Genome Taxonomy unchanged, taxon_assignment added GTDB
GCF_900093775.1 Genome Taxonomy unchanged, taxon_assignment added GTDB
GCF_000008165.1 Genome Taxonomy unchanged, taxon_assignment added GTDB
CB0019_genome_101323 Genome Taxonomy unchanged, taxon_assignment added GTDB
CB0909_genome_101323 Genome Taxonomy unchanged, taxon_assignment added GTDB
CB0904_genome_101323 Genome Taxonomy unchanged, taxon_assignment added GTDB
CB0912_genome_101323 Genome Taxonomy unchanged, taxon_assignment added GTDB
CB0742_genome Genome Taxonomy and taxon_assignment updated with GTDB
CB0021_genome_101323 Genome Taxonomy unchanged, taxon_assignment added GTDB
CB0027_genome_101323 Genome Taxonomy unchanged, taxon_assignment added GTDB
CB0069_genome_101323 Genome Taxonomy unchanged, taxon_assignment added GTDB
CB0676_genome_101323 Genome Taxonomy unchanged, taxon_assignment added GTDB
CB0687_genome_101323 Genome Taxonomy unchanged, taxon_assignment added GTDB
GCF_000008505.1 Genome Taxonomy unchanged, taxon_assignment added GTDB
CB0729_genome_101323 Genome Taxonomy unchanged, taxon_assignment added GTDB
CB0893_genome_101323 Genome Taxonomy unchanged, taxon_assignment added GTDB
CB0902_genome_101323 Genome Taxonomy unchanged, taxon_assignment added GTDB
GCF_000009045.1 Genome Taxonomy unchanged, taxon_assignment added GTDB
GCF_000011645.1 Genome Taxonomy unchanged, taxon_assignment added GTDB
GCF_000017425.1 Genome Taxonomy unchanged, taxon_assignment added GTDB
GCF_000196735.1 Genome Taxonomy unchanged, taxon_assignment added GTDB
GCF_000430785.1 Genome Taxonomy unchanged, taxon_assignment added GTDB
GCF_000504165.1 Genome Taxonomy unchanged, taxon_assignment added GTDB
Genomeset_speciestree_102423 GenomeSet Taxonomy and taxon_assignment updated with GTDB
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • gtdbtk.backbone.bac120.classify.tree - gtdbtk.backbone.bac120.classify.tree - whole tree GTDB formatted Newick
  • gtdbtk.backbone.bac120.classify-ITOL.tree - gtdbtk.backbone.bac120.classify-ITOL.tree - whole tree ITOL formatted Newick
  • gtdbtk.bac120.classify.tree.6.tree - gtdbtk.bac120.classify.tree.6.tree - whole tree GTDB formatted Newick
  • gtdbtk.bac120.classify.tree.6-ITOL.tree - gtdbtk.bac120.classify.tree.6-ITOL.tree - whole tree ITOL formatted Newick
  • gtdbtk.backbone.bac120.classify-proximals.tree - gtdbtk.backbone.bac120.classify-proximals.tree - Newick
  • gtdbtk.backbone.bac120.classify-trimmed.tree - gtdbtk.backbone.bac120.classify-trimmed.tree - Newick
  • gtdbtk.backbone.bac120.classify-lineages.map - gtdbtk.backbone.bac120.classify-lineages.map - GTDB lineage
  • gtdbtk.backbone.bac120.classify-trimmed.tree-rectangle.PNG - gtdbtk.backbone.bac120.classify-trimmed.tree - Image
  • gtdbtk.backbone.bac120.classify-trimmed.tree-rectangle.PDF - gtdbtk.backbone.bac120.classify-trimmed.tree - Image
  • gtdbtk.backbone.bac120.classify-trimmed.tree-circle.PNG - gtdbtk.backbone.bac120.classify-trimmed.tree - Image
  • gtdbtk.backbone.bac120.classify-trimmed.tree-circle.PDF - gtdbtk.backbone.bac120.classify-trimmed.tree - Image
  • gtdbtk.backbone.bac120.classify-trimmed.tree-circle-ultrametric.PNG - gtdbtk.backbone.bac120.classify-trimmed.tree - Image
  • gtdbtk.backbone.bac120.classify-trimmed.tree-circle-ultrametric.PDF - gtdbtk.backbone.bac120.classify-trimmed.tree - Image
  • gtdbtk.bac120.classify.tree.6-proximals.tree - gtdbtk.bac120.classify.tree.6-proximals.tree - Newick
  • gtdbtk.bac120.classify.tree.6-trimmed.tree - gtdbtk.bac120.classify.tree.6-trimmed.tree - Newick
  • gtdbtk.bac120.classify.tree.6-lineages.map - gtdbtk.bac120.classify.tree.6-lineages.map - GTDB lineage
  • gtdbtk.bac120.classify.tree.6-trimmed.tree-rectangle.PNG - gtdbtk.bac120.classify.tree.6-trimmed.tree - Image
  • gtdbtk.bac120.classify.tree.6-trimmed.tree-rectangle.PDF - gtdbtk.bac120.classify.tree.6-trimmed.tree - Image
  • gtdbtk.bac120.classify.tree.6-trimmed.tree-circle.PNG - gtdbtk.bac120.classify.tree.6-trimmed.tree - Image
  • gtdbtk.bac120.classify.tree.6-trimmed.tree-circle.PDF - gtdbtk.bac120.classify.tree.6-trimmed.tree - Image
  • gtdbtk.bac120.classify.tree.6-trimmed.tree-circle-ultrametric.PNG - gtdbtk.bac120.classify.tree.6-trimmed.tree - Image
  • gtdbtk.bac120.classify.tree.6-trimmed.tree-circle-ultrametric.PDF - gtdbtk.bac120.classify.tree.6-trimmed.tree - Image
  • GTDB-Tk_classify_wf.zip - GTDB-Tk Classify WF output
Add one or more Genomes to a KBase SpeciesTree.
This app completed without errors in 5m 19s.
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/157793
  • Bacillus_species_tree_102423.newick
  • Bacillus_species_tree_102423-labels.newick
  • Bacillus_species_tree_102423.png
  • Bacillus_species_tree_102423.pdf

6. References

  1. Andrić S, Meyer T, Ongena M. 2020. Bacillus Responses to Plant-Associated Fungal and Bacterial Communities. Frontiers in Microbiology 11.
  2. Caulier S, Gillis A, Colau G, Licciardi F, Liépin M, Desoignies N, Modrie P, Legrève A, Mahillon J, Bragard C. 2018. Versatile Antagonistic Activities of Soil-Borne Bacillus spp. and Pseudomonas spp. against Phytophthora infestans and Other Potato Pathogens. Front Microbiol 9:143.
  3. Andrić S, Rigolet A, Argüelles Arias A, Steels S, Hoff G, Balleux G, Ongena L, Höfte M, Meyer T, Ongena M. 2023. Plant-associated Bacillus mobilizes its secondary metabolites upon perception of the siderophore pyochelin produced by a Pseudomonas competitor. ISME J 17:263–275.
  4. Radhakrishnan R, Hashem A, Abd_Allah EF. 2017. Bacillus: A Biological Tool for Crop Improvement through Bio-Molecular Changes in Adverse Environments. Frontiers in Physiology 8.
  5. Chaabouni I, Guesmi A, Cherif A. 2012. Secondary Metabolites of Bacillus: Potentials in Biotechnology, p. 347–366. In Sansinenea, E (ed.), Bacillus thuringiensis Biotechnology. Springer Netherlands, Dordrecht.
  6. Sansinenea E, Ortiz A. 2011. Secondary metabolites of soil Bacillus spp. Biotechnol Lett 33:1523–1538.
  7. Mondol MAM, Shin HJ, Islam MT. 2013. Diversity of Secondary Metabolites from Marine Bacillus Species: Chemistry and Biological Activity. Marine Drugs 11:2846–2872.
  8. Blacutt A, Ginnan N, Dang T, Bodaghi S, Vidalakis G, Ruegger P, Peacock B, Viravathana P, Vieira FC, Drozd C, Jablonska B, Borneman J, McCollum G, Cordoza J, Meloch J, Berry V, Salazar LL, Maloney KN, Rolshausen PE, Roper MC. 2020. An In Vitro Pipeline for Screening and Selection of Citrus-Associated Microbiota with Potential Anti-“ Candidatus Liberibacter asiaticus” Properties. Appl Environ Microbiol 86:e02883-19.
  9. BCL Convert. https://support-docs.illumina.com/SW/BCL_Convert/Content/SW/FrontPages/BCL_Convert.htm. Retrieved 2 November 2023.
  10. Arkin AP, Cottingham RW, Henry CS, Harris NL, Stevens RL, Maslov S, Dehal P, Ware D, Perez F, Canon S, Sneddon MW, Henderson ML, Riehl WJ, Murphy-Olson D, Chan SY, Kamimura RT, Kumari S, Drake MM, Brettin TS, Glass EM, Chivian D, Gunter D, Weston DJ, Allen BH, Baumohl J, Best AA, Bowen B, Brenner SE, Bun CC, Chandonia J-M, Chia J-M, Colasanti R, Conrad N, Davis JJ, Davison BH, DeJongh M, Devoid S, Dietrich E, Dubchak I, Edirisinghe JN, Fang G, Faria JP, Frybarger PM, Gerlach W, Gerstein M, Greiner A, Gurtowski J, Haun HL, He F, Jain R, Joachimiak MP, Keegan KP, Kondo S, Kumar V, Land ML, Meyer F, Mills M, Novichkov PS, Oh T, Olsen GJ, Olson R, Parrello B, Pasternak S, Pearson E, Poon SS, Price GA, Ramakrishnan S, Ranjan P, Ronald PC, Schatz MC, Seaver SMD, Shukla M, Sutormin RA, Syed MH, Thomason J, Tintle NL, Wang D, Xia F, Yoo H, Yoo S, Yu D. 2018. KBase: The United States Department of Energy Systems Biology Knowledgebase. Nat Biotechnol 36:566–569.
  11. FASTQC. A quality control tool for high throughput sequence data | BibSonomy. https://www.bibsonomy.org/bibtex/f230a919c34360709aa298734d63dca3. Retrieved 20 May 2024.
  12. Bushnell B. 2014. BBMap: A Fast, Accurate, Splice-Aware Aligner. LBNL-7065E. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States).
  13. Schmieder R, Edwards R. 2011. Quality control and preprocessing of metagenomic datasets. Bioinformatics 27:863–864.
  14. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. 2012. SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing. J Comput Biol 19:455–477.
  15. Gurevich A, Saveliev V, Vyahhi N, Tesler G. 2013. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075.
  16. Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. 2015. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 25:1043–1055.
  17. Seemann T. 2014. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30:2068–2069.
  18. Chaumeil P-A, Mussig AJ, Hugenholtz P, Parks DH. 2020. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics 36:1925–1927.
  19. Price MN, Dehal PS, Arkin AP. 2010. FastTree 2 – Approximately Maximum-Likelihood Trees for Large Alignments. PLOS ONE 5:e9490.
  20. Letunic I, Bork P. 2021. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Research 49:W293–W296.

Apps

  1. Annotate Assembly and Re-annotate Genomes with Prokka - v1.14.5
    • Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30: 2068 2069. doi:10.1093/bioinformatics/btu153
  2. Assemble Reads with SPAdes - v3.15.3
    • Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, et al. SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing. Journal of Computational Biology. 2012;19: 455-477. doi: 10.1089/cmb.2012.0021
    • Prjibelski A, Antipov D, Meleshko D, Lapidus A, Korobeynikov A. Using SPAdes De Novo Assembler. Curr Protoc Bioinformatics. 2020 Jun;70(1):e102. doi: 10.1002/cpbi.102.
  3. Assess Genome Quality with CheckM - v1.0.18
    • Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25: 1043 1055. doi:10.1101/gr.186072.114
    • CheckM source:
    • Additional info:
  4. Assess Quality of Assemblies with QUAST - v4.4
    • [1] Gurevich A, Saveliev V, Vyahhi N, Tesler G. QUAST: quality assessment tool for genome assemblies. Bioinformatics. 2013;29: 1072 1075. doi:10.1093/bioinformatics/btt086
    • [2] Mikheenko A, Valin G, Prjibelski A, Saveliev V, Gurevich A. Icarus: visualizer for de novo assembly evaluation. Bioinformatics. 2016;32: 3321 3323. doi:10.1093/bioinformatics/btw379
  5. Assess Read Quality with FastQC - v0.12.1
    • FastQC source: Bioinformatics Group at the Babraham Institute, UK.
  6. Classify Microbes with GTDB-Tk - v2.3.2
    • Pierre-Alain Chaumeil, Aaron J Mussig, Philip Hugenholtz, Donovan H Parks. GTDB-Tk v2: memory friendly classification with the genome taxonomy database. Bioinformatics, Volume 38, Issue 23, 1 December 2022, Pages 5315 5316. DOI: https://doi.org/10.1093/bioinformatics/btac672
    • Pierre-Alain Chaumeil, Aaron J Mussig, Philip Hugenholtz, Donovan H Parks, GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database, Bioinformatics, Volume 36, Issue 6, 15 March 2020, Pages 1925 1927. DOI: https://doi.org/10.1093/bioinformatics/btz848
    • Donovan H Parks, Maria Chuvochina, Christian Rinke, Aaron J Mussig, Pierre-Alain Chaumeil, Philip Hugenholtz. GTDB: an ongoing census of bacterial and archaeal diversity through a phylogenetically consistent, rank normalized and complete genome-based taxonomy. Nucleic Acids Research, Volume 50, Issue D1, 7 January 2022, Pages D785 D794. DOI: https://doi.org/10.1093/nar/gkab776
    • Parks, D., Chuvochina, M., Waite, D. et al. A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life. Nat Biotechnol 36, 996 1004 (2018). DOI: https://doi.org/10.1038/nbt.4229
    • Parks DH, Chuvochina M, Chaumeil PA, Rinke C, Mussig AJ, Hugenholtz P. A complete domain-to-species taxonomy for Bacteria and Archaea. Nat Biotechnol. 2020;10.1038/s41587-020-0501-8. DOI:10.1038/s41587-020-0501-8
    • Rinke C, Chuvochina M, Mussig AJ, Chaumeil PA, Dav n AA, Waite DW, Whitman WB, Parks DH, and Hugenholtz P. A standardized archaeal taxonomy for the Genome Taxonomy Database. Nat Microbiol. 2021 Jul;6(7):946-959. DOI:10.1038/s41564-021-00918-8
    • Chivian D, Jungbluth SP, Dehal PS, Wood-Charlson EM, Canon RS, Allen BH, Clark MM, Gu T, Land ML, Price GA, Riehl WJ, Sneddon MW, Sutormin R, Zhang Q, Cottingham RW, Henry CS, Arkin AP. Metagenome-assembled genome extraction and analysis from microbiomes using KBase. Nat Protoc. 2023 Jan;18(1):208-238. doi: 10.1038/s41596-022-00747-x
    • Matsen FA, Kodner RB, Armbrust EV. pplacer: linear time maximum-likelihood and Bayesian phylogenetic placement of sequences onto a fixed reference tree. BMC Bioinformatics. 2010;11:538. Published 2010 Oct 30. doi:10.1186/1471-2105-11-538
    • Jain C, Rodriguez-R LM, Phillippy AM, Konstantinidis KT, Aluru S. High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. Nat Commun. 2018;9(1):5114. Published 2018 Nov 30. DOI:10.1038/s41467-018-07641-9
    • Hyatt D, Chen GL, Locascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010;11:119. Published 2010 Mar 8. DOI:10.1186/1471-2105-11-119
    • Price MN, Dehal PS, Arkin AP. FastTree 2--approximately maximum-likelihood trees for large alignments. PLoS One. 2010;5(3):e9490. Published 2010 Mar 10. DOI:10.1371/journal.pone.0009490 link: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2835736/
    • Eddy SR. Accelerated Profile HMM Searches. PLoS Comput Biol. 2011;7(10):e1002195. DOI:10.1371/journal.pcbi.1002195
    • Ondov BD, Treangen TJ, Melsted P, Mallonee AB, Bergman NH, Koren S, Phillippy AM. Mash: fast genome and metagenome distance estimation using MinHash. Genome Biol. 2016 Jun 20;17(1):132. DOI: 10.1186/s13059-016-0997-x
  7. Compute Simple Read Library Stats
    • Arkin AP, Cottingham RW, Henry CS, Harris NL, Stevens RL, Maslov S, et al. KBase: The United States Department of Energy Systems Biology Knowledgebase. Nature Biotechnology. 2018;36: 566. doi: 10.1038/nbt.4163
    • EA Utils package source:
  8. Filter Out Low-Complexity Reads with PRINSEQ - v0.20.4
    • Schmieder R, Edwards R. Quality control and preprocessing of metagenomic datasets. Bioinformatics. 2011;27: 863 864. doi:10.1093/bioinformatics/btr026
    • PRINSEQ source:
  9. Insert Genome Into SpeciesTree - v2.2.0
    • Price MN, Dehal PS, Arkin AP. FastTree 2 Approximately Maximum-Likelihood Trees for Large Alignments. PLoS One. 2010;5. doi:10.1371/journal.pone.0009490
  10. Run the JGI RQCFilter pipeline (BBTools v38.22)
    no citations