MetaBAT2 Contig Binning - v1.7

Bin metagenomic contigs

MetaBAT2 clusters metagenomic contigs into different "bins", each of which should correspond to a putative genome.

MetaBAT2 uses nucleotide composition information and source strain abundance (measured by depth-of-coverage by aligning the reads to the contigs) to perform binning.

Implemented for KBase by Jeff Froula([email protected])

MetaBAT2 takes a metagenome assembly and the reads that produced the assembly and organizes the contigs into putative genomes, called "bins".

Configuration:

Assembly Object: The Assembly object is a collection of assembled genome fragments, called "contigs". These are the items that MetaBAT2 will bin. Currently only a single Assembly object is accepted by the MetaBAT2 App.

BinnedContig Object Name: The BinnedContig Object represents the directory of binned contigs created by MetaBAT2. This object can be used for downstream analysis

Read Library Object: The read libraries are aligned to the assembly using bbmap, and provide the abundance information for each contig that roughly follows the species abundance.

Minimum Contig Length: Contigs that are too short may slow down analysis and not give statistically meaningful nucleotide composition profiles. A value of 1000 - 2500 bp is a reasonable cutoff.

Output:

Output Object:The BinnedContig Object represents the directory of binned contigs created by MetaBAT2. This object can be used for downstream analysis.

Output Bin Summary Report:The number of bins produced, the number of contigs that were binned and the total number of contigs in the assembly.

Downloadable files: The enitre output of the MetaBAT2 run may be downloaded as a zip file. This zip file also contains a table of read-depth coverage per contig ("*.depth.txt")

column definitions for *.depth.txt file

1. contig name

2. contig length

3. total average depth (all libraries)

4. library 1 depth

5. library 1 variance

6. next library ... etc

Related Publications

Kang DD, Froula J, Egan R, Wang Z. MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities. PeerJ. 2015;3: e1165. doi:10.7717/peerj.1165 , https://doi.org/10.7717/peerj.1165

MetaBAT2 source: , https://bitbucket.org/berkeleylab/metabat

App Specification:
https://gitlab.com/jfroula/kbase-metabat/tree/caf4e3c0c415602f3e11f1473734a7b682c6fdfb/ui/narrative/methods/run_metabat
Module Commit: caf4e3c0c415602f3e11f1473734a7b682c6fdfb