Generated July 28, 2022

Genome-resolved metagenomics reveals novel archaeal and bacterial genomes from Amazonian forest and pasture soils

Andressa M. Venturini

Introduction

Genome-resolved metagenomics was applied to recover archaeal and bacterial metagenome-assembled genomes (MAGs) from forest and pasture soils of the Brazilian Eastern Amazon, which were subjected to different soil moisture levels (Venturini et al., 2022).

Brief sample description

The sampling was carried out in July 2015 in a pristine forest of the Tapajós National Forest (3°17'44.4"S 54°57'46.7"W) and an active cattle pasture (3°18'46.7"S 54°54'34.8"W) in the Brazilian Eastern Amazon. Three soil samples from 0 to 10 cm depth were collected at each site. Samples from each land use were combined, sieved, and subjected to a microcosm experiment under increasing soil moisture levels for 30 days at 25 °C. Moisture treatments were established in triplicates for each land use using 1.5 L glass jars filled with 350 g of soil each. These treatments included the original soil moisture of each site (22% for forest and 24% for pasture) and 100% of moisture at field capacity (FC, 50% for forest and 63% for pasture). Forest and pasture soils from these both moisture treatments were DNA-extracted in duplicate using the PowerLyzer PowerSoil DNA Isolation Kit, totaling 12 DNA samples. DNA samples were checked using 1% agarose gel electrophoresis and a Nanodrop 2000c spectrophotometer. Paired-end shotgun metagenomic sequencing (2 x 150 bp) was performed on an Illumina HiSeq platform at Novogene Co. Ltd. using the NEBNext Ultra II DNA Library Prep Kit for Illumina for library construction. Detailed information about the sites, their soil physicochemical properties, the design of the microcosm experiment, DNA extraction and quantification, and the metagenomic sequencing was previously reported (Lemos et al., 2020; Venturini et al., 2022).

Sample table:

ID Land use Treatment
A1 Forest Original
A2 Forest Original
A3 Forest Original
A10 Forest 100% FC
A11 Forest 100% FC
A12 Forest 100% FC
A13 Pasture Original
A14 Pasture Original
A15 Pasture Original
A22 Pasture 100% FC
A23 Pasture 100% FC
A24 Pasture 100% FC

1. Data trimming and filtering.

1.1. Evaluate paired-end reads using FastQC.

1.2. Trim and filter paired-end reads using Trimmomatic.

Changes in Trimmomatic: adapters (TruSeq3-PE-2)/sliding window minimum quality (20)/head crop length (10)/minimum read length (50).

1.3. Re-evaluate paired-end reads using FastQC.

A quality control application for high throughput sequence data.
This app completed without errors in 57m 22s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A1.fq_reads_65084_66_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A1.fq_reads_65084_66_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 48m 53s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A2.fq_reads_65084_57_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A2.fq_reads_65084_57_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 56m 0s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A3.fq_reads_65084_54_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A3.fq_reads_65084_54_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 47m 7s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A10.fq_reads_65084_50_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A10.fq_reads_65084_50_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 53m 6s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A11.fq_reads_65084_60_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A11.fq_reads_65084_60_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1h 21m 57s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A12.fq_reads_65084_68_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A12.fq_reads_65084_68_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1h 46m 33s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A13.fq_reads_65084_56_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A13.fq_reads_65084_56_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1h 52m 39s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A14.fq_reads_65084_72_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A14.fq_reads_65084_72_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1h 47m 26s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A15.fq_reads_65084_70_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A15.fq_reads_65084_70_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1h 53m 40s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A22.fq_reads_65084_63_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A22.fq_reads_65084_63_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 43m 55s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A23.fq_reads_65084_52_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A23.fq_reads_65084_52_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1h 6m 45s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A24.fq_reads_65084_62_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A24.fq_reads_65084_62_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
Trim paired- or single-end Illumina reads with Trimmomatic.
This app completed without errors in 4h 48m 38s.
Objects
Created Object Name Type Description
A1_trimmed_paired PairedEndLibrary Trimmed Reads
A1_trimmed_unpaired_fwd SingleEndLibrary Trimmed Unpaired Forward Reads
A1_trimmed_unpaired_rev SingleEndLibrary Trimmed Unpaired Reverse Reads
Trim paired- or single-end Illumina reads with Trimmomatic.
This app completed without errors in 2h 18m 50s.
Objects
Created Object Name Type Description
A2_trimmed_paired PairedEndLibrary Trimmed Reads
A2_trimmed_unpaired_fwd SingleEndLibrary Trimmed Unpaired Forward Reads
A2_trimmed_unpaired_rev SingleEndLibrary Trimmed Unpaired Reverse Reads
Trim paired- or single-end Illumina reads with Trimmomatic.
This app completed without errors in 3h 10m 56s.
Objects
Created Object Name Type Description
A3_trimmed_paired PairedEndLibrary Trimmed Reads
A3_trimmed_unpaired_fwd SingleEndLibrary Trimmed Unpaired Forward Reads
A3_trimmed_unpaired_rev SingleEndLibrary Trimmed Unpaired Reverse Reads
Trim paired- or single-end Illumina reads with Trimmomatic.
This app completed without errors in 6h 51m 37s.
Objects
Created Object Name Type Description
A10_trimmed_paired PairedEndLibrary Trimmed Reads
A10_trimmed_unpaired_fwd SingleEndLibrary Trimmed Unpaired Forward Reads
A10_trimmed_unpaired_rev SingleEndLibrary Trimmed Unpaired Reverse Reads
Trim paired- or single-end Illumina reads with Trimmomatic.
This app completed without errors in 7h 16m 55s.
Objects
Created Object Name Type Description
A11_trimmed_paired PairedEndLibrary Trimmed Reads
A11_trimmed_unpaired_fwd SingleEndLibrary Trimmed Unpaired Forward Reads
A11_trimmed_unpaired_rev SingleEndLibrary Trimmed Unpaired Reverse Reads
Trim paired- or single-end Illumina reads with Trimmomatic.
This app completed without errors in 11h 23m 9s.
Objects
Created Object Name Type Description
A12_trimmed_paired PairedEndLibrary Trimmed Reads
A12_trimmed_unpaired_fwd SingleEndLibrary Trimmed Unpaired Forward Reads
A12_trimmed_unpaired_rev SingleEndLibrary Trimmed Unpaired Reverse Reads
Trim paired- or single-end Illumina reads with Trimmomatic.
This app completed without errors in 1h 36m 19s.
Objects
Created Object Name Type Description
A13_trimmed_paired PairedEndLibrary Trimmed Reads
A13_trimmed_unpaired_fwd SingleEndLibrary Trimmed Unpaired Forward Reads
A13_trimmed_unpaired_rev SingleEndLibrary Trimmed Unpaired Reverse Reads
Trim paired- or single-end Illumina reads with Trimmomatic.
This app completed without errors in 3h 44m 46s.
Objects
Created Object Name Type Description
A14_trimmed_paired PairedEndLibrary Trimmed Reads
A14_trimmed_unpaired_fwd SingleEndLibrary Trimmed Unpaired Forward Reads
A14_trimmed_unpaired_rev SingleEndLibrary Trimmed Unpaired Reverse Reads
Trim paired- or single-end Illumina reads with Trimmomatic.
This app completed without errors in 3h 31m 40s.
Objects
Created Object Name Type Description
A15_trimmed_paired PairedEndLibrary Trimmed Reads
A15_trimmed_unpaired_fwd SingleEndLibrary Trimmed Unpaired Forward Reads
A15_trimmed_unpaired_rev SingleEndLibrary Trimmed Unpaired Reverse Reads
Trim paired- or single-end Illumina reads with Trimmomatic.
This app completed without errors in 1h 54m 58s.
Objects
Created Object Name Type Description
A22_trimmed_paired PairedEndLibrary Trimmed Reads
A22_trimmed_unpaired_fwd SingleEndLibrary Trimmed Unpaired Forward Reads
A22_trimmed_unpaired_rev SingleEndLibrary Trimmed Unpaired Reverse Reads
Trim paired- or single-end Illumina reads with Trimmomatic.
This app completed without errors in 2h 33m 32s.
Objects
Created Object Name Type Description
A23_trimmed_paired PairedEndLibrary Trimmed Reads
A23_trimmed_unpaired_fwd SingleEndLibrary Trimmed Unpaired Forward Reads
A23_trimmed_unpaired_rev SingleEndLibrary Trimmed Unpaired Reverse Reads
Trim paired- or single-end Illumina reads with Trimmomatic.
This app completed without errors in 2h 40m 6s.
Objects
Created Object Name Type Description
A24_trimmed_paired PairedEndLibrary Trimmed Reads
A24_trimmed_unpaired_fwd SingleEndLibrary Trimmed Unpaired Forward Reads
A24_trimmed_unpaired_rev SingleEndLibrary Trimmed Unpaired Reverse Reads
A quality control application for high throughput sequence data.
This app completed without errors in 48m 57s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A1_trimmed_paired_65084_134_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A1_trimmed_paired_65084_134_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 40m 19s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A2_trimmed_paired_65084_114_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A2_trimmed_paired_65084_114_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1h 36m 25s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A3_trimmed_paired_65084_118_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A3_trimmed_paired_65084_118_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 38m 46s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A10_trimmed_paired_65084_142_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A10_trimmed_paired_65084_142_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1h 22m 21s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A11_trimmed_paired_65084_143_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A11_trimmed_paired_65084_143_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 2h 51m 48s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A12_trimmed_paired_65084_150_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A12_trimmed_paired_65084_150_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1h 35m 9s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A13_trimmed_paired_65084_106_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A13_trimmed_paired_65084_106_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1h 13m 45s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A14_trimmed_paired_65084_126_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A14_trimmed_paired_65084_126_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 40m 58s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A15_trimmed_paired_65084_122_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A15_trimmed_paired_65084_122_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 53m 45s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A22_trimmed_paired_65084_110_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A22_trimmed_paired_65084_110_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 40m 4s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A23_trimmed_paired_65084_130_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A23_trimmed_paired_65084_130_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
A quality control application for high throughput sequence data.
This app completed without errors in 1h 21m 10s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • A24_trimmed_paired_65084_135_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • A24_trimmed_paired_65084_135_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report

2. Merge libraries.

2.1. Merge libraries of each land use to serve as inputs for the assemblers using Merge Reads Libraries.

Merge multiple Reads Libraries and/or ReadsSets into one Reads Library object.
This app completed without errors in 1d 1h 43m 38s.
Objects
Created Object Name Type Description
A1-12_trimmed_paired_merged PairedEndLibrary Merge trimmed paired-end libraries (from A1 to A12).
Summary
NUM READS LIBRARIES COMBINED INTO ONE READS LIBRARY: 6
Merge multiple Reads Libraries and/or ReadsSets into one Reads Library object.
This app completed without errors in 23h 52m 20s.
Objects
Created Object Name Type Description
A13-24_trimmed_paired_merged PairedEndLibrary Merge trimmed paired-end libraries (from A13 to A24).
Summary
NUM READS LIBRARIES COMBINED INTO ONE READS LIBRARY: 6
Output from Merge Reads Libraries - v1.0.1
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/91640
Output from Merge Reads Libraries - v1.0.1
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/91640
v1 - KBaseFile.PairedEndLibrary-2.1
The viewer for the data in this Cell is available at the original Narrative here: https://narrative.kbase.us/narrative/91640
v2 - KBaseFile.PairedEndLibrary-2.1
The viewer for the data in this Cell is available at the original Narrative here: https://narrative.kbase.us/narrative/91640

3. Assemble reads.

3.1. Assemble paired-end reads using MEGAHIT.

Changes in MEGAHIT: parameter preset (meta-large).

Assemble metagenomic reads using the MEGAHIT assembler.
This app completed without errors in 4d 5h 35m 16s.
Objects
Created Object Name Type Description
A1-12_MEGAHIT_large.assembly Assembly Assembled contigs
Summary
ContigSet saved to: andressamv:narrative_1592508457102/A1-12_MEGAHIT_large.assembly Assembled into 367892 contigs. Avg Length: 3588.6768263512117 bp. Contig Length Distribution (# of contigs -- min to max basepairs): 367501 -- 2000.0 to 39227.3 bp 305 -- 39227.3 to 76454.6 bp 46 -- 76454.6 to 113681.90000000001 bp 22 -- 113681.90000000001 to 150909.2 bp 6 -- 150909.2 to 188136.5 bp 7 -- 188136.5 to 225363.80000000002 bp 2 -- 225363.80000000002 to 262591.10000000003 bp 2 -- 262591.10000000003 to 299818.4 bp 0 -- 299818.4 to 337045.7 bp 1 -- 337045.7 to 374273.0 bp
Links
Assemble metagenomic reads using the MEGAHIT assembler.
This app completed without errors in 3d 17h 33m 4s.
Objects
Created Object Name Type Description
A13-24_MEGAHIT_large.assembly Assembly Assembled contigs
Summary
ContigSet saved to: andressamv:narrative_1592508457102/A13-24_MEGAHIT_large.assembly Assembled into 366749 contigs. Avg Length: 4121.073682545828 bp. Contig Length Distribution (# of contigs -- min to max basepairs): 366251 -- 2000.0 to 53356.2 bp 410 -- 53356.2 to 104712.4 bp 63 -- 104712.4 to 156068.59999999998 bp 15 -- 156068.59999999998 to 207424.8 bp 6 -- 207424.8 to 258781.0 bp 2 -- 258781.0 to 310137.19999999995 bp 0 -- 310137.19999999995 to 361493.39999999997 bp 0 -- 361493.39999999997 to 412849.6 bp 1 -- 412849.6 to 464205.8 bp 1 -- 464205.8 to 515562.0 bp
Links

4. Binning and optimization.

4.1. Cluster the assembled contigs in bins that correspond to putative population genomes using MetaBAT2.

4.2. Cluster the assembled contigs in bins that correspond to putative population genomes using MaxBin2.

Changes in MaxBin2: marker set (107 and 40)/minimum contig length (2500)/plot markers per contig (Y).

4.3. Cluster the assembled contigs in bins that correspond to putative population genomes using CONCOCT.

4.4. Bin optimization approach using DAS Tool.

Bin metagenomic contigs
This app completed without errors in 14h 1m 2s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • metabat_result.zip - Files generated by MetaBAT2 App
Bin metagenomic contigs
This app completed without errors in 14h 6m 48s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • metabat_result.zip - Files generated by MetaBAT2 App
Output from MetaBAT2 Contig Binning - v1.7
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/91640
Output from MetaBAT2 Contig Binning - v1.7
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/91640
Group assembled metagenomic contigs into lineages (Bins) using depth-of-coverage, nucleotide composition, and marker genes.
This app completed without errors in 20h 15m 18s.
Objects
Created Object Name Type Description
A1-12_MaxBin2_107m_bins BinnedContigs BinnedContigs from MaxBin2
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • maxbin_result.zip - File(s) generated by MaxBin2 App
  • Bin.marker.pdf - Visualization of the marker by MaxBin2 App
Group assembled metagenomic contigs into lineages (Bins) using depth-of-coverage, nucleotide composition, and marker genes.
This app completed without errors in 20h 52m 2s.
Objects
Created Object Name Type Description
A1-12_MaxBin2_40m_bins BinnedContigs BinnedContigs from MaxBin2
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • maxbin_result.zip - File(s) generated by MaxBin2 App
  • Bin.marker.pdf - Visualization of the marker by MaxBin2 App
Group assembled metagenomic contigs into lineages (Bins) using depth-of-coverage, nucleotide composition, and marker genes.
This app completed without errors in 23h 26m 45s.
Objects
Created Object Name Type Description
A13-24_MaxBin2_107m_bins BinnedContigs BinnedContigs from MaxBin2
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • maxbin_result.zip - File(s) generated by MaxBin2 App
  • Bin.marker.pdf - Visualization of the marker by MaxBin2 App
Group assembled metagenomic contigs into lineages (Bins) using depth-of-coverage, nucleotide composition, and marker genes.
This app completed without errors in 1d 0h 55m 41s.
Objects
Created Object Name Type Description
A13-24_MaxBin2_40m_bins BinnedContigs BinnedContigs from MaxBin2
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • maxbin_result.zip - File(s) generated by MaxBin2 App
  • Bin.marker.pdf - Visualization of the marker by MaxBin2 App
Output from Bin Contigs using MaxBin2 - v2.2.4
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/91640
Output from Bin Contigs using MaxBin2 - v2.2.4
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/91640
Output from Bin Contigs using MaxBin2 - v2.2.4
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/91640
Output from Bin Contigs using MaxBin2 - v2.2.4
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/91640
Group assembled metagenomic contigs into lineages (Bins) using depth-of-coverage and nucleotide composition
This app completed without errors in 21h 41m 23s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • concoct_result.zip - Files generated by CONCOCT App
Group assembled metagenomic contigs into lineages (Bins) using depth-of-coverage and nucleotide composition
This app completed without errors in 18h 50m 10s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • concoct_result.zip - Files generated by CONCOCT App
Optimize bacterial or archaeal genome bins using a dereplication, aggregation and scoring strategy
This app completed without errors in 1h 36m 58s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • das_tool_result.zip - Files generated by kb_das_tool App
Optimize bacterial or archaeal genome bins using a dereplication, aggregation and scoring strategy
This app completed without errors in 1h 36m 57s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • das_tool_result.zip - Files generated by kb_das_tool App

5. Bin quality assessment and filtering.

5.1. Assign bin completeness and contamination scores using CheckM.

Changes in CheckM: reference tree (full tree).

5.2. Filter bins using CheckM.

Changes in CheckM for ≥ 50% of completeness and ≤ 10% of contamination: reference tree (full tree)/completeness (50)/contamination (10).

Runs the CheckM lineage workflow to assess the genome quality of isolates, single cells, or genome bins from metagenome assemblies through comparison to an existing database of genomes.
This app completed without errors in 35m 11s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • CheckM_summary_table.tsv.zip - TSV Summary Table from CheckM
  • full_output.zip - Full output of CheckM
  • plots.zip - Output plots from CheckM
Runs the CheckM lineage workflow to assess the genome quality of isolates, single cells, or genome bins from metagenome assemblies through comparison to an existing database of genomes.
This app completed without errors in 1h 3m 14s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • CheckM_summary_table.tsv.zip - TSV Summary Table from CheckM
  • full_output.zip - Full output of CheckM
  • plots.zip - Output plots from CheckM
Runs the CheckM lineage workflow to assess the genome quality of isolates, single cells, or genome bins from metagenome assemblies through comparison to an existing database of genomes. Creates a new BinnedContigs object with High Quality bins that pass user-defined thresholds for Completeness and Contamination.
This app completed without errors in 36m 11s.
Objects
Created Object Name Type Description
A1-12_CheckM_bins50.BinnedContigs BinnedContigs HQ BinnedContigs A1-12_CheckM_bins50.BinnedContigs
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • CheckM_summary_table.tsv.zip - TSV Summary Table from CheckM
  • full_output.zip - Full output of CheckM
  • plots.zip - Output plots from CheckM
Runs the CheckM lineage workflow to assess the genome quality of isolates, single cells, or genome bins from metagenome assemblies through comparison to an existing database of genomes. Creates a new BinnedContigs object with High Quality bins that pass user-defined thresholds for Completeness and Contamination.
This app completed without errors in 1h 4m 32s.
Objects
Created Object Name Type Description
A13-24_CheckM_bins50.BinnedContigs BinnedContigs HQ BinnedContigs A13-24_CheckM_bins50.BinnedContigs
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • CheckM_summary_table.tsv.zip - TSV Summary Table from CheckM
  • full_output.zip - Full output of CheckM
  • plots.zip - Output plots from CheckM
v2 - KBaseMetagenomes.BinnedContigs-1.0
The viewer for the data in this Cell is available at the original Narrative here: https://narrative.kbase.us/narrative/91640
v2 - KBaseMetagenomes.BinnedContigs-1.0
The viewer for the data in this Cell is available at the original Narrative here: https://narrative.kbase.us/narrative/91640

6. Bin taxonomy.

6.1. Provide taxonomic assignments for medium-quality bacterial and archaeal bins based on the Genome Taxonomy Database using GTDB-Tk.

Obtain objective taxonomic assignments for bacterial and archaeal genomes based on the Genome Taxonomy Database (GTDB) ver R06-RS202
This app completed without errors in 39m 22s.
Links
Obtain objective taxonomic assignments for bacterial and archaeal genomes based on the Genome Taxonomy Database (GTDB) ver R06-RS202
This app completed without errors in 44m 4s.
Links

7. Bin extraction.

7.1. Extract selected bacterial and archaeal bins (high-quality bins plus bins of environmental relevance) as individual assemblies using Extract Bins as Assemblies from BinnedContigs.

Forest: Bin.003/Bin.004/Bin.013.

Pasture: Bin.001/Bin.002/Bin.005/Bin.006/Bin.020/Bin.027/Bin.031/Bin.034/Bin.035.

Extract a bin as an Assembly from a BinnedContig dataset
This app completed without errors in 4m 58s.
Objects
Created Object Name Type Description
Forest_H-QBs.AssemblySet AssemblySet Assembly set of extracted assemblies
Bin.003.fasta_forest_assembly Assembly Assembly object of extracted contigs
Bin.004.fasta_forest_assembly Assembly Assembly object of extracted contigs
Bin.013.fasta_forest_assembly Assembly Assembly object of extracted contigs
Summary
Job Finished Generated Assembly Reference: 91640/960/2, 91640/962/2, 91640/964/2 Generated Assembly Set: 91640/965/2
Extract a bin as an Assembly from a BinnedContig dataset
This app completed without errors in 9m 5s.
Objects
Created Object Name Type Description
Pasture_H-QBs.AssemblySet AssemblySet Assembly set of extracted assemblies
Bin.001.fasta_pasture_assembly Assembly Assembly object of extracted contigs
Bin.002.fasta_pasture_assembly Assembly Assembly object of extracted contigs
Bin.005.fasta_pasture_assembly Assembly Assembly object of extracted contigs
Bin.006.fasta_pasture_assembly Assembly Assembly object of extracted contigs
Bin.020.fasta_pasture_assembly Assembly Assembly object of extracted contigs
Bin.027.fasta_pasture_assembly Assembly Assembly object of extracted contigs
Bin.031.fasta_pasture_assembly Assembly Assembly object of extracted contigs
Bin.034.fasta_pasture_assembly Assembly Assembly object of extracted contigs
Bin.035.fasta_pasture_assembly Assembly Assembly object of extracted contigs
Summary
Job Finished Generated Assembly Reference: 91640/961/2, 91640/963/2, 91640/966/2, 91640/968/2, 91640/969/2, 91640/970/2, 91640/971/2, 91640/972/2, 91640/973/2 Generated Assembly Set: 91640/974/2

8. Bin abundance.

8.1. Calculate the abundance of each selected bin in the merged metagenomes using Align Reads using Bowtie2.

Changes in Bowtie2: preset (very-sensitive).

Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 18h 46m 38s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 21h 56m 10s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 18h 49m 49s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 21h 50m 7s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 21h 31m 42s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 18h 40m 19s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 19h 29m 45s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 21h 9m 58s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 23h 6m 13s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 18h 3m 48s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 19h 9m 53s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 17h 48m 51s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 19h 34m 7s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 19h 48m 28s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 23h 22m 32s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 18h 2m 3s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 19h 36m 42s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 18h 3m 26s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 23h 16m 20s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 18h 5m 42s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 18h 16m 53s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 16h 50m 25s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 18h 40m 5s.
No output found.
Align sequencing reads to long reference prokaryotic genome sequences using Bowtie2.
This app completed without errors in 16h 53m 29s.
No output found.

9. Genome annotation.

9.1. Create AssemblySet with the selected bins using Build AssemblySet.

Forest: Bin.003/Bin.004/Bin.013.

Pasture: Bin.001/Bin.002/Bin.005/Bin.006/Bin.020/Bin.027/Bin.031/Bin.034/Bin.035.

9.2. Annotate the selected bins using Annotate and Distill Assemblies with DRAM.

Allows users to create an AssemblySet object.
This app completed without errors in 9s.
Objects
Created Object Name Type Description
H-QBs.AssemblySet AssemblySet KButil_Build_AssemblySet
Summary
assembly objs in output set H-QBs.AssemblySet: 12
Annotate your assembly with DRAM. Annotations will then be distilled to create an interactive functional summary per assembly.
This app completed without errors in 8h 16m 14s.
Objects
Created Object Name Type Description
Bin.001.fasta_pasture_assembly_DRAM Genome Annotated Genome
Bin.002.fasta_pasture_assembly_DRAM Genome Annotated Genome
Bin.003.fasta_forest_assembly_DRAM Genome Annotated Genome
Bin.004.fasta_forest_assembly_DRAM Genome Annotated Genome
Bin.005.fasta_pasture_assembly_DRAM Genome Annotated Genome
Bin.006.fasta_pasture_assembly_DRAM Genome Annotated Genome
Bin.013.fasta_forest_assembly_DRAM Genome Annotated Genome
Bin.020.fasta_pasture_assembly_DRAM Genome Annotated Genome
Bin.027.fasta_pasture_assembly_DRAM Genome Annotated Genome
Bin.031.fasta_pasture_assembly_DRAM Genome Annotated Genome
Bin.034.fasta_pasture_assembly_DRAM Genome Annotated Genome
Bin.035.fasta_pasture_assembly_DRAM Genome Annotated Genome
H-QBs.GenomeSet GenomeSet H-QBs from forest and pasture
Summary
Here are the results from your DRAM run.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/91640
  • annotations.tsv - DRAM annotations in a tab separate table format
  • genes.fna - Genes as nucleotides predicted by DRAM with brief annotations
  • genes.faa - Genes as amino acids predicted by DRAM with brief annotations
  • genes.gff - GFF file of all DRAM annotations
  • rrnas.tsv - Tab separated table of rRNAs as detected by barrnap
  • trnas.tsv - Tab separated table of tRNAs as detected by tRNAscan-SE
  • genbank.tar.gz - Compressed folder of output genbank files
  • product.tsv - DRAM product in tabular format
  • metabolism_summary.xlsx - DRAM metabolism summary tables
  • genome_stats.tsv - DRAM genome statistics table

10. Patescibacteria comparison.

10.1. Download genomes from NCBI public links using Upload File to Staging from Web.

The Patescibacterial genomes were previously recovered by Lemos et al. (2020) using the metagenomic data from the high-moisture (100% FC) pasture soils.

WARW00000000: https://sra-download.ncbi.nlm.nih.gov/traces/wgs01/wgs_aux/WA/RW/WARW01/WARW01.1.gbff.gz.

WARV00000000: https://sra-download.ncbi.nlm.nih.gov/traces/wgs01/wgs_aux/WA/RV/WARV01/WARV01.1.gbff.gz.

10.2. Import GenBank files as genomes using Import GenBank File as Genome from Staging Area.

Changes in Import GenBank File as Genome from Staging Area: genome type (metagenome-assembled genome (MAG))/source of the GenBank file (other).

10.3. Extract bins of Patescibacteria as individual assemblies using Extract Bins as Assemblies from BinnedContigs.

Pasture: Bin.013/Bin.036/Bin.038.

10.4. Compare genomes using Compute ANI with FastANI.

Upload a data file (which may be compressed) from a web URL to your staging area.
This app completed without errors in 35s.
Summary
Uploaded Files: 2 /WARW01.1.gbff /WARV01.1.gbff
Import a GenBank file from your staging area into your Narrative as a Genome data object
This app completed without errors in 1m 43s.
Objects
Created Object Name Type Description
WARW01.1.gbff_genome Genome Imported Genome
Links
Output from Import GenBank File as Genome from Staging Area
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/91640
Import a GenBank file from your staging area into your Narrative as a Genome data object
This app completed without errors in 2m 16s.
Objects
Created Object Name Type Description
WARV01.1.gbff_genome Genome Imported Genome
Links
Output from Import GenBank File as Genome from Staging Area
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/91640
Extract a bin as an Assembly from a BinnedContig dataset
This app completed without errors in 5m 20s.
Objects
Created Object Name Type Description
Pasture_Patescibacteria.AssemblySet AssemblySet Assembly set of extracted assemblies
Bin.036.fasta_pasture_Patescibacteria_assembly Assembly Assembly object of extracted contigs
Bin.038.fasta_pasture_Patescibacteria_assembly Assembly Assembly object of extracted contigs
Bin.013.fasta_pasture_Patescibacteria_assembly Assembly Assembly object of extracted contigs
Summary
Job Finished Generated Assembly Reference: 91640/1102/2, 91640/1103/2, 91640/1165/1 Generated Assembly Set: 91640/1104/2
Allows users to compute fast whole-genome Average Nucleotide Identity (ANI) estimation.
This app completed without errors in 4m 42s.
Links

11. References.

Venturini, A.M., Gontijo, J.B., Mandro, J.A., Paula, F.S., Yoshiura, C.A., da França, A.G., Tsai, S.M., 2022. Genome-resolved metagenomics reveals novel archaeal and bacterial genomes from Amazonian forest and pasture soils. Microbial Genomics 8, 7. https://doi.org/10.1099/mgen.0.000853.

Venturini, A.M., Dias, N.M.S., Gontijo, J.B., Yoshiura, C.A., Paula, F.S., Meyer, K.M., Nakamura, F.M., da França, A.G., Borges, C.D., Barlow, J., Berenguer, E., Nüsslein, K., Rodrigues, J.L.M., Bohannan, B.J.M., Tsai, S.M., 2022. Increased soil moisture intensifies the impacts of forest-to-pasture conversion on methane emissions and methane-cycling communities in the Eastern Amazon. Environmental Research 212, 113139. https://doi.org/10.1016/j.envres.2022.113139

Lemos, L.N., Manoharan, L., Mendes, L.W., Venturini, A.M., Pylro, V.S., Tsai, S.M., 2020. Metagenome assembled‐genomes reveal similar functional profiles of CPR /Patescibacteria phyla in soils. Environmental Microbiology Reports 12, 651–655. https://doi.org/10.1111/1758-2229.12880

Released Apps

  1. Align Reads using Bowtie2 - v2.3.2
    • Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9: 357 359. doi:10.1038/nmeth.1923
    • Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009;10: R25. doi:10.1186/gb-2009-10-3-r25
  2. Assemble Reads with MEGAHIT v1.2.9
    • Li D, Liu C-M, Luo R, Sadakane K, Lam T-W. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics. 2015;31: 1674 1676. doi:10.1093/bioinformatics/btv033
  3. Assess Genome Quality with CheckM - v1.0.18
    • Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25: 1043 1055. doi:10.1101/gr.186072.114
    • CheckM source:
    • Additional info:
  4. Assess Read Quality with FastQC - v0.11.9
    • FastQC source: Bioinformatics Group at the Babraham Institute, UK.
  5. Bin Contigs using CONCOCT - v1.1
    • Alneberg J, Bjarnason BS, de Bruijn I, Schirmer M, Quick J, Ijaz UZ, Lahti L, Loman NJ, Andersson AF, Quince C. Binning metagenomic contigs by coverage and composition. Nature Methods. 2014;11: 1144-1146. doi:10.1038/nmeth.3103
    • CONCOCT source:
  6. Bin Contigs using MaxBin2 - v2.2.4
    • Wu Y-W, Simmons BA, Singer SW. MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets. Bioinformatics. 2016;32: 605 607. doi:10.1093/bioinformatics/btv638 (2) 1. Wu Y-W, Tang Y-H, Tringe SG, Simmons BA, Singer SW. MaxBin: an automated binning method to recover individual genomes from metagenomes using an expectation-maximization algorithm. Microbiome. 2014;2: 26. doi:10.1186/2049-2618-2-26
    • Wu Y-W, Tang Y-H, Tringe SG, Simmons BA, Singer SW. MaxBin: an automated binning method to recover individual genomes from metagenomes using an expectation-maximization algorithm. Microbiome. 2014;2: 26. doi:10.1186/2049-2618-2-26
    • Maxbin2 source:
    • Maxbin source:
  7. Build AssemblySet - v1.0.1
    • Arkin AP, Cottingham RW, Henry CS, Harris NL, Stevens RL, Maslov S, et al. KBase: The United States Department of Energy Systems Biology Knowledgebase. Nature Biotechnology. 2018;36: 566. doi: 10.1038/nbt.4163
  8. Classify Microbes with GTDB-Tk - v1.7.0
    • Pierre-Alain Chaumeil, Aaron J Mussig, Philip Hugenholtz, Donovan H Parks, GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database, Bioinformatics, Volume 36, Issue 6, 15 March 2020, Pages 1925 1927. DOI: https://doi.org/10.1093/bioinformatics/btz848
    • Parks, D., Chuvochina, M., Waite, D. et al. A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life. Nat Biotechnol 36, 996 1004 (2018). DOI: https://doi.org/10.1038/nbt.4229
    • Parks DH, Chuvochina M, Chaumeil PA, Rinke C, Mussig AJ, Hugenholtz P. A complete domain-to-species taxonomy for Bacteria and Archaea. Nat Biotechnol. 2020;10.1038/s41587-020-0501-8. DOI:10.1038/s41587-020-0501-8
    • Rinke C, Chuvochina M, Mussig AJ, Chaumeil PA, Dav n AA, Waite DW, Whitman WB, Parks DH, and Hugenholtz P. A standardized archaeal taxonomy for the Genome Taxonomy Database. Nat Microbiol. 2021 Jul;6(7):946-959. DOI:10.1038/s41564-021-00918-8
    • Matsen FA, Kodner RB, Armbrust EV. pplacer: linear time maximum-likelihood and Bayesian phylogenetic placement of sequences onto a fixed reference tree. BMC Bioinformatics. 2010;11:538. Published 2010 Oct 30. doi:10.1186/1471-2105-11-538
    • Jain C, Rodriguez-R LM, Phillippy AM, Konstantinidis KT, Aluru S. High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. Nat Commun. 2018;9(1):5114. Published 2018 Nov 30. DOI:10.1038/s41467-018-07641-9
    • Hyatt D, Chen GL, Locascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010;11:119. Published 2010 Mar 8. DOI:10.1186/1471-2105-11-119
    • Price MN, Dehal PS, Arkin AP. FastTree 2--approximately maximum-likelihood trees for large alignments. PLoS One. 2010;5(3):e9490. Published 2010 Mar 10. DOI:10.1371/journal.pone.0009490 link: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2835736/
    • Eddy SR. Accelerated Profile HMM Searches. PLoS Comput Biol. 2011;7(10):e1002195. DOI:10.1371/journal.pcbi.1002195
  9. Compute ANI with FastANI
    • [1] Jain C, Rodriguez-R LM, Phillippy AM, Konstantinidis KT, Aluru S. High-throughput ANI Analysis of 90K Prokaryotic Genomes Reveals Clear Species Boundaries. 2017; doi:10.1101/225342
    • [2] Goris J, Konstantinidis KT, Klappenbach JA, Coenye T, Vandamme P, Tiedje JM. DNA-DNA hybridization values and their relationship to whole-genome sequence similarities. Int J Syst Evol Microbiol. 2007;57: 81 91. doi:10.1099/ijs.0.64483-0
    • FastANI module and source code:
  10. Extract Bins as Assemblies from BinnedContigs - v1.0.2
    • Arkin AP, Cottingham RW, Henry CS, Harris NL, Stevens RL, Maslov S, et al. KBase: The United States Department of Energy Systems Biology Knowledgebase. Nature Biotechnology. 2018;36: 566. doi: 10.1038/nbt.4163
  11. Filter Bins by Quality with CheckM - v1.0.18
    • Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25: 1043 1055. doi:10.1101/gr.186072.114
    • CheckM source:
    • Additional info:
  12. Import GenBank File as Genome from Staging Area
    no citations
  13. Merge Reads Libraries - v1.0.1
    • Arkin AP, Cottingham RW, Henry CS, Harris NL, Stevens RL, Maslov S, et al. KBase: The United States Department of Energy Systems Biology Knowledgebase. Nature Biotechnology. 2018;36: 566. doi: 10.1038/nbt.4163
  14. MetaBAT2 Contig Binning - v1.7
    • Kang DD, Froula J, Egan R, Wang Z. MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities. PeerJ. 2015;3: e1165. doi:10.7717/peerj.1165
    • MetaBAT2 source:
  15. Trim Reads with Trimmomatic - v0.36
    • Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30: 2114 2120. doi:10.1093/bioinformatics/btu170
  16. Upload File to Staging from Web - v1.0.12
    • Arkin AP, Cottingham RW, Henry CS, Harris NL, Stevens RL, Maslov S, et al. KBase: The United States Department of Energy Systems Biology Knowledgebase. Nature Biotechnology. 2018;36: 566. doi: 10.1038/nbt.4163

Apps in Beta

  1. Annotate and Distill Assemblies with DRAM
    • DRAM source code
    • DRAM documentation
    • DRAM publication
  2. Optimize Bacterial or Archaeal Binned Contigs using DAS Tool - v1.1.2
    • Sieber CMK, Probst AJ, Sharrar A, Thomas BC, Hess M, Tringe SG, Banfield JF. Recovery of genomes from metagenomes via a dereplication, aggregation and scoring strategy. 2018; 3(7): 836-843. doi:10.1038/s41564-018-0171-1
    • DAS_Tool source:
    • Hyatt D, Chen G-L, LoCascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010;11: 119. doi:10.1186/1471-2105-11-119
    • Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. BLAST+: architecture and applications. BMC Bioinformatics. 2009;10: 421. doi:10.1186/1471-2105-10-421
    • Buchfink B, Xie C, Huson DH. Fast and sensitive protein alignment using DIAMOND. Nature Methods. 2015;12: 59-60. doi:10.1038/nmeth.3176
    • Pullseq:
    • R: A Language and Environment for Statistical Computing:
    • Ruby: A Programmers Best Friend: