KBase provides users with a single comprehensive resource for analyzing a wide range of public bioinformatics data together with the data generated from their own experiments. The KBase data model integrates diverse biological data types and describes the relationships between them.
The data in KBase ranges from thousands of genomes and metagenomes and their annotations, to expression and protein-protein interaction data, to inferred models of organismal and community metabolism and gene regulation, and even to geographical information about populations. The table below lists, for each data type, the total number of data objects imported from public repositories or created by KBase analysis pipelines in the combined KBase datastores as of February 20, 2016.
Please see this page for KBase’s Data Policy and the sources of our public reference data.
In the table below, searchable data types can be clicked to go to the search page for that data type. We will be releasing a new version of data search soon, with an expanded set of searchable types.
|Category||Data Type||Total number|
|Annotation Classes||Genome Features||111,557,369|
|Annotation Classes||Ontological Aliases and Synonyms||132,383,718|
|Biochemistry||Biochemical Species (Compounds)||27,838|
|Functional Data||Expression Series||625|
|Functional Data||Expression Replicate Groups||553|
|Functional Data||Phenotype Data Sets||463|
|Functional Data||Protein-Protein Interaction Network Datasets||13|
|Functional Data||Pairwise Protein-Protein Interactions||231,220|
|Derived Data||Correlation Networks||245|
|Derived Data||Co-expressed Gene Pairs||93,018,189|
|Derived Data||Expression Data Biclusters||382|
|Derived Data||Fitness Data Biclusters||114|