Generated February 24, 2020

KBase Logo

Reconstruction and analysis of central metabolism in microbes

Cite this protocol as:

Edirisinghe J.N., Faria J.P., Harris N.L., Allen B.H., Henry C.S. (2018) Reconstruction and Analysis of Central Metabolism in Microbes. In: Fondi M. (eds) Metabolic Network Reconstruction and Modeling. Methods in Molecular Biology, vol 1716. Humana Press, New York, NY 

DOI https://doi.org/10.1007/978-1-4939-7528-0_5 
Print ISBN 978-1-4939-7527-3
Online ISBN 978-1-4939-7528-0


Authors and affiliations

Janaka N. Edirisinghe1,2,*, José P. Faria1, Nomi L. Harris3, Benjamin H. Allen 4, Christopher S. Henry1,2,

* Corresponding authors: JNE : ([email protected]), CSH : ([email protected])

  1. Computation Institute, University of Chicago, Chicago, Illinois, USA
  2. Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, Illinois, USA
  3. Environmental Genomics and Systems Biology Division, E. O. Lawrence Berkeley National Laboratory, Berkeley, California, USA
  4. Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee, USA

NOTE: This tutorial is view-only, allowing you to see, but not alter, the input and output of the KBase apps used in this workflow. To run the steps yourself in a new Narrative using your own data or different parameters, copy this Narrative using the "copy" button at the top right. If you just want to read this Narrative (without copying it), you still can see the data objects generated in the workflow by using the “Controls” link at the top left. For more information, please see the Narrative Interface User Guide.

Narrative Overview

Central carbon metabolism is a key component in the metabolic network of living organisms as these pathways harbor many of the most important mechanisms for energy biosynthesis, as well as producing the precursor compounds for most essential biomass building blocks. The energy production strategies defined in the central metabolic pathways have a significant impact on the behavior and growth conditions of microorganisms, thus playing a crucial role in the quantitative prediction of biomass and energy yields [1,2]. Energy production strategies in microbes are highly diversified, unlike those in higher eukaryotes. These strategies primarily depend on environmental factors such as: (i) carbon source utilization; (ii) ability to respire by reducing numerous electron acceptors; and (iii) fermentation capabilities.

It continues to be challenging to make accurate computational predictions based on metabolic models and in silico simulations interpreting complex microbial behavior. Tools for automated metabolic model reconstruction such as ModelSEED [3-5] can rapidly generate draft genome-scale metabolic models from annotated genome sequences [6]. However, these draft models, and in some cases even curated published models, can lack accuracy in predicting growth yields, ATP production yields, and central carbon flux profiles. This poor accuracy stems primarily from three common problems: (i) poor representation of energy biosynthesis pathways; (ii) a lack of diverse electron transport chain (ETC) variations; and (iii) addition of extensive gapfilling reactions that can sometimes misrepresent an organism’s behavior [7].

Many of these problems can be avoided by using a simplified model comprised of only the most confidently annotated and biologically critical pathways for energy biosynthesis [8] (Fig. 1). We define these models as Core Metabolic Models (CMM), and they consist primarily of the sugar oxidation pathways, the fermentation pathways (Fig. 2), and the ETC variations. We previously developed an approach for the reconstruction and analysis of CMMs based on annotated genome sequences [9], which we implemented as a pipeline in the DOE Systems Biology Knowledgebase (KBase). In this chapter, we demonstrate how this analysis workflow can be run in KBase. The complete workflow, including example data and commentary are displayed here in this Narrative. The pipeline is comprised of four main steps: (i) genome annotation by RAST [10]; (ii) CMM reconstruction [9]; (iii) gapfilling [7]; and (iv) flux balance analysis (FBA) [11]. We also discuss methods for exploring metabolic diversity by studying the variations in central metabolic pathways in a phylogenetic context.

Narrative Contents

  • [Core metabolic model (CMM) construction pipeline](#Core-metabolic-model-construction-pipeline)
  • [Gene annotations of _Escherichia coli_ K12](#Gene-annotations-of-Escherichia-coli-K12)
  • [Build a draft Core Metabolic Model and gapfill in minimal media aerobically](#Build-a-draft-Core-Metabolic-Model-and-gapfill-in-minimal-media-aerobically)
  • [Core Model of _Escherichia coli_ K12](#Core-Model-of-Escherichia-coli-K12)
  • [Core Model of _Paracoccous denitrificans_ PD1222](#Core-Model-of-Paracoccus-denitrificans-PD1222)
  • [Run Flux Balance Analysis of _Escherichia coli_ K12 on Glucose-aerobic Minimal Media under aerobic conditions](#Run-Flux-Balance-Analysis-of-Escherichia-coli-K12-on-Glucose-aerobic-Minimal-Media-under-aerobic-conditions)
  • [Run Flux Balance Analysis on Glucose-anaerobic Minimal Media, simulating the the growth of the _Escherichia coli_ K12 under anaerobic conditions](#Run-Flux-Balance-Analysis-on-Glucose-anaerobic-Minimal-Media,-simulating-the-the-growth-of-the-Escherichia-coli-K12-under-anaerobic-conditions)
  • [Run Flux Balance Analysis of _Escherichia coli_ K12 on Glucose-anaerobic-Nitrate minimal media under anaerobic conditions](#Run-Flux-Balance-Analysis-of-Escherichia-coli-K12-on-Glucose-anaerobic-Nitrate-minimal-media-under-anaerobic-conditions)
  • [Compare two models _Escherichia coli_ and _Paracoccous denitrificans_](#Compare-two-models-Escherichia-coli-and-Paracoccous-denitrificans)
  • [Flux Distribution comparison : growth of _Eshcherichi coli_ aerobic vs anaerobic](#Flux-Distribution-comparison-:-growth-of-Eshcherichi-coli-aerobic-vs-anaerobic)
  • [ATP yield predictions of core models under aerobic and anerobic conditions](#ATP-yield-predictions-of-core-models-under-aerobic-and-anerobic-conditions)
  • [Run Flux Balance Analysis on glucose minimal media aerobically using biomass as the objective function](#Run-Flux-Balance-Analysis-on-glucose-minimal-media-aerobically-using-biomass-as-the-objective-function)
  • [Distribution of gapfill reactions in core models](#Distribution-of-gapfill-reactions-in-core-models)
  • [References](#References)

Core metabolic model construction pipeline

The pipeline starts with an assembled genome with gene annotations assigned by the RAST annotation pipeline. Next, the CMMs are constructed based on a manually curated CMT that consists of GPR mappings derived from a phylogenetically diverse set of model organisms including Escherichia coli, Bacillus subtilis, Pseudomonas aeroginosa, Clostridium acetobutylicum, and Paracococcus denitrificans. As an optional step, CMMs could be gapfilled; however, most of the core models do not require any gapfilling. In the final step, FBA is performed, optimizing the biomass or ATP hydrolysis as the objective function. The pipeline also supports the comparison of the CMMs and metabolic flux distributions. Rectangles with dotted borders show the name(s) of the apps for each step.

figure1

Return to Narrative contents

Gene annotations of Escherichia coli K12

We start by assigning gene annotations to the assembled E. coli genome, using the RAST annotation pipeline.

Return to Narrative contents

Data Viewer
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/18065

Above, the Escherichia coli K12 genome is shown in a genome viewer. This viewer provides a concise, text-based overview of the genome as well as its contigs and genes.

In the Contigs and Genes tabs, each entry is clickable, opening either a browser for the contig or another tab with expanded information about the gene.

You can sort these entries by clicking on a column header to sort by that field (e.g., Length). Clicking the same column header again will reverse the sort order.

This Escherichia coli genome is faily complete and has a single contig: click on the contig to see neighboring genes and potential operons in this species.

To further explore this genome, click the genome name at the top of the viewer. This will open a Landing Page for the genome in a new tab in your browser. The Landing Page provides more details about the organism, its genome, and annotations.

Annotate or re-annotate bacterial or archaeal genome using RASTtk.
This app completed without errors in 15m 19s.
Objects
Created Object Name Type Description
RAST_Ecoli_Reannotated Genome Annotated genome
Report
Summary
The RAST algorithm was applied to annotating an existing genome: Escherichia coli K12. The sequence for this genome is comprised of 1 contigs containing 4639221 nucleotides. The input genome has 5328 existing features. The genome features were functionally annotated using the following algorithm(s): Kmers V2; Kmers V1; protein similarity. In addition to the original 5328 features, 0 new features were called. Of the original features, 0 were re-annotated by RAST with new functions. Overall, a total of 1995 genes are now annotated with 4788 distinct functions. Of these functions, 1822 are a match for the SEED annotation ontology.
Output from Annotate Microbial Genome
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/18065
v1 - KBaseGenomes.Genome-8.1
The viewer for the data in this Cell is available at the original Narrative here: https://narrative.kbase.us/narrative/18065
Annotate or re-annotate bacterial or archaeal genome using RASTtk.
This app completed without errors in 24m 48s.
Objects
Created Object Name Type Description
E_coli_K12_NCBI_Reannotated Genome Annotated genome
Report
Summary
The RAST algorithm was applied to annotating an existing genome: Escherichia coli str. K-12 substr. MG1655. The sequence for this genome is comprised of 1 contigs containing 4641652 nucleotides. The input genome has 8637 existing features. The genome features were functionally annotated using the following algorithm(s): Kmers V2; Kmers V1; protein similarity. In addition to the original 8637 features, 0 new features were called. Of the original features, 0 were re-annotated by RAST with new functions. Overall, a total of 3889 genes are now annotated with 3797 distinct functions. Of these functions, 1804 are a match for the SEED annotation ontology.
Output from Annotate Microbial Genome
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/18065

Reannotation of Paracoccous denitrificans PD1222 Genome

Return to Narrative contents

Annotate or re-annotate bacterial or archaeal genome using RASTtk.
This app completed without errors in 13m 2s.
Objects
Created Object Name Type Description
Paracoccus_dentrificans_PD1222_Re-Annotated Genome Annotated genome
Report
Summary
The RAST algorithm was applied to annotating an existing genome: Paracoccus denitrificans PD1222. The sequence for this genome is comprised of 1 contigs containing 2852282 nucleotides. The input genome has 5672 existing features. The genome features were functionally annotated using the following algorithm(s): Kmers V2; Kmers V1; protein similarity. In addition to the original 5672 features, 0 new features were called. Of the original features, 0 were re-annotated by RAST with new functions. Overall, a total of 2992 genes are now annotated with 1943 distinct functions. Of these functions, 925 are a match for the SEED annotation ontology.
Output from Annotate Microbial Genome
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/18065

Build a draft Core Metabolic Model and gapfill in minimal media aerobically

Metabolic models generally require an objective function (OF) that is optimized during flux balance analysis to predict flux profiles. However, in our Core Metabolic Models, we explored two OFs: a biomass biosynthesis objective function and an ATP hydrolysis objective function. While CMMs do not include the amino acids, nucleotides, lipids, and cofactors that are typically included in the biomass biosynthesis objective function of genome-scale models, they do include the central carbon precursor metabolites for these compounds. Thus the biomass biosynthesis OF for our CMMs was constructed based on the biomass precursor stoichiometry derived by Varma and Parlsson and used in one of the earliest models of E. coli. When analyzing CMMs using the biomass biosynthesis OF, we found that occasionally gapfilling was required to enable synthesis of all essential biomass precursors . To permit a focused study of energy biosynthesis in our models without gapfilling, we developed a second OF for our CMMs consisting only of the ATP hydrolysis reaction: ATP + H2O -> ADP + Pi + H+. Using this OF, we computed ATP production yields in all models without any gapfilling; hence, these computations were based solely on reactions derived from existing RAST annotations

First, we will use the Build Metabolic Model app to build an initial draft metabolic core model based on the gene annotations in the Escherichia coli K12 genome. We chose Core metabolism as the model template listed under the field Template for reconstruction. This app has two steps; when the first step (Build Metabolic Model) finishes, the second step (Gapfill Metabolic Model) starts automatically.

The gapfill step lets you specify a media condition (i.e., the metabolites available in the environment in which you want to analyze your organism’s growth). If you leave the Media field blank, "complete" media will be used by default. Complete media is a special type of media that does not include an exact list of compounds. Instead, complete media consists of all metabolites for which a transporter is available in the KBase biochemistry database. (Transporters are reactions that move metabolites across cell membranes.) In the case of core models we use a minimal media for our simulations (e.g., Glucose minimimal media or Glycerol minimal media).

In addition to the media formulations available in KBase, you can upload your own custom media. In this example, Escherichia coli K12 was tested for growth in a minimal media condition called Glucose-aerobic.

We are making a preliminary assertion that a model cannot make all required biomass components from the sources in the minimal media, however, core models are desigined based on highly curated template that many of the core models inculding the one based on Escherichia coli K12 does not require any gapfilling reactions added to the model (see Table 2) in order to proudce its biomass when using Glucose minimal media as the sole carbon source.

Below, you will see the input cells for running the Build Metabolic Model app on our annotated E. coli genome.

Return to Narrative contents

Core model pathway map

Core model pathway map displays sugar oxidation (glycolysis, gluconeogenesis, Enter-Doudoroff, pentose phosphate), TCA cycle and fermentation pathways. Central metabolic pathway metabolites produce key precursors that lead to production of all essential structural and functional components of the required for cell growth and maintenance. We have used biomass biosynthesis equation (Varma and Palsson 1993) in analyzing core metabolic model’s ability to produce these key metabolites in central metabolism; those compounds are colored in green. Fermentation pathway end products are displayed in squares with blue color borders.

figure1

Return to Narrative contents

Generate a draft metabolic model based on an annotated genome.
This app completed without errors in 3m 55s.
Objects
Created Object Name Type Description
Eschrichia_coli_K12_CoreModel FBAModel FBAModel-11 Eschrichia_coli_K12_CoreModel
Report
Summary
A new draft genome-scale metabolic model was constructed based on the annotations in the genome E_coli_K12_NCBI_Reannotated. No gapfilling was performed on the model. It is expected that the model will not be capable of producing biomass on any growth condition until gapfilling is run. Model was saved with the name Eschrichia_coli_K12_CoreModel. The final model includes 158 reactions, 168 compounds, and 478 genes.
Output from Build Metabolic Model
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/18065
Generate a draft metabolic model based on an annotated genome.
This app completed without errors in 1m 24s.
Objects
Created Object Name Type Description
RAST_E-Coli_Model FBAModel FBAModel-11 RAST_E-Coli_Model
Report
Summary
A new draft genome-scale metabolic model was constructed based on the annotations in the genome RAST_Ecoli. No gapfilling was performed on the model. It is expected that the model will not be capable of producing biomass on any growth condition until gapfilling is run. Model was saved with the name RAST_E-Coli_Model. The final model includes 156 reactions, 167 compounds, and 250 genes.
Output from Build Metabolic Model
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/18065
Use flux balance analysis to predict metabolic fluxes in a metabolic model of an organism grown on a given media.
This app completed without errors in 1m 16s.
Objects
Created Object Name Type Description
RAST_E-coli_GlucoseAerobic_FBA FBA FBA-13 RAST_E-coli_GlucoseAerobic_FBA
Report
Summary
A flux balance analysis (FBA) was performed on the metabolic model Ecoli_Glucose_Aerobic growing in Glucose-aerobic media.
Output from Run Flux Balance Analysis
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/18065

Core Model of Escherichia coli K12

An initial draft model of _Model of Escherichia coli K12 is produced based on RAST annotations. The model was not gapfilled, as the gapfilling option was not selected.

Return to Narrative contents

Data Viewer
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/18065

Above is the core model for Escherichia coli K12.

There are seven tabs for browsing the data in the model: Overview, Reactions, Compounds, Genes, Compartments, Biomass, Gapfilling and Pathways. The contents of these tabs are as follows:

  • Overview — Summary of key information about the model, including the associated genome, number of reactions, and number of compounds.
  • Reactions - Reaction information in detail, including reaction ID, enzyme name, the biochemical equation, and the associated gene IDs.
  • Compounds - Information about compounds in the model, including the chemical formula and charge.
  • Genes - Gene IDs and associated reaction IDs.
  • Compartments - The subcellular localization of the compounds and enzymes. Typically, there are three types of compartments in microbes: Cytosol (c0), Periplasm (p0) and Extracellular (e0). Reactions and compounds belonging to each compartment are identified using compartment notation, e.g., rxn00001[c0], cpd00001[c0].
  • Biomass — The biomass composition of the model. Typically, biomass is represented in the model as an equation where biomass compounds and ATP would make one gram of biomass. The coefficients of each biomass component are listed in the Coefficient column. Negative coefficients represent the compounds at the left side of the biomass equation and the positive coefficients represent the compounds at the right side of the equation.
  • Gapfilling - The reactions that were added to fill metabolic gaps in the model. These metabolic gaps occur as a result of missing or inconsistent annotations. During the gapfilling process, an optimization algorithm adds a minimal number of reactions and compounds to make the biochemical network generate its biomass. In this case the model was not gapfilled.
  • Pathways - KEGG maps that represent the metabolic network of the model. You can click on the name of a map (e.g., TCA cycle) to see the presence or absence of the reactions (colored in blue).

Building Core Model of Paracoccus denitrificans PD1222

Generate a draft metabolic model based on an annotated genome.
This app completed without errors in 2m 41s.
Objects
Created Object Name Type Description
Paracoccus_denitrificans_PD1222_CoreModel FBAModel FBAModel-11 Paracoccus_denitrificans_PD1222_CoreModel
Report
Summary
A new draft genome-scale metabolic model was constructed based on the annotations in the genome Paracoccus_denitrificans_PD1222. No gapfilling was performed on the model. It is expected that the model will not be capable of producing biomass on any growth condition until gapfilling is run. Model was saved with the name Paracoccus_denitrificans_PD1222_CoreModel. The final model includes 54 reactions, 94 compounds, and 42 genes.

Core Model of Paracoccus denitrificans PD1222

An initial draft model of _Model of Paracoccus denitrificans PD1222 is produced based on RAST annotations. The model was not gapfilled, as the gapfilling option was not selected.

Return to Narrative contents

Output from Build Metabolic Model
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/18065

Above is the core model for Paracoccous denitrificans PD1222.

There are seven tabs for browsing the data in the model: Overview, Reactions, Compounds, Genes, Compartments, Biomass, Gapfilling and Pathways. The contents of these tabs are as follows:

  • Overview — Summary of key information about the model, including the associated genome, number of reactions, and number of compounds.
  • Reactions - Reaction information in detail, including reaction ID, enzyme name, the biochemical equation, and the associated gene IDs.
  • Compounds - Information about compounds in the model, including the chemical formula and charge.
  • Genes - Gene IDs and associated reaction IDs.
  • Compartments - The subcellular localization of the compounds and enzymes. Typically, there are three types of compartments in microbes: Cytosol (c0), Periplasm (p0) and Extracellular (e0). Reactions and compounds belonging to each compartment are identified using compartment notation, e.g., rxn00001[c0], cpd00001[c0].
  • Biomass — The biomass composition of the model. Typically, biomass is represented in the model as an equation where biomass compounds and ATP would make one gram of biomass. The coefficients of each biomass component are listed in the Coefficient column. Negative coefficients represent the compounds at the left side of the biomass equation and the positive coefficients represent the compounds at the right side of the equation.
  • Gapfilling - The reactions that were added to fill metabolic gaps in the model. These metabolic gaps occur as a result of missing or inconsistent annotations. During the gapfilling process, an optimization algorithm adds a minimal number of reactions and compounds to make the biochemical network generate its biomass. In this case the model was not gapfilled.
  • Pathways - KEGG maps that represent the metabolic network of the model. You can click on the name of a map (e.g., TCA cycle) to see the presence or absence of the reactions (colored in blue).
Use flux balance analysis to predict metabolic fluxes in a metabolic model of an organism grown on a given media.
This app is new, and hasn't been started.
No output found.
Output from Run Flux Balance Analysis
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/18065

Run Flux Balance Analysis of Escherichia coli K12 on Glucose-aerobic Minimal Media under aerobic conditions

We have built a core metabolic model of Escherichia coli K12 ; now we can use the Run Flux Balance Analysis method to perform FBA to calculate the flow of metabolites through our model. FBA results can be used to predict the growth rate of an organism under certain conditions or the production rates for particular metabolites of interest. In this case we have used ATP hydrolysis (ATP+H2O -> ADP +Pi + H+) as the objective function.

To perform FBA, you must specify a media condition that you want to investigate using your metabolic model. In this example, we select the Glucose-aerobic minimal media, implying the organism grows on Glucose minimal media under aerobic conditions.

Return to Narrative contents

Flux balance analyis resutls (below) are organized into a table with six tabs: Overview, Reaction fluxes, Exchange fluxes, Genes, Biomass, and Pathways. You can see the objective value as 26.5 mmol of ATP/mmol of Glucose. We get this ATP yield as the organism undergoes oxidative phosphorylation by utilizing aerobic electron transport chains. Under the aerobic condition, glucose is fully oxidized into CO2, H2O and energy.

  • Overview — Among the summary information in this tab is the objective value (growth of the model), which is important because it represents the maximum achievable flux through the biomass reaction of the metabolic model. An objective value of 0 or something very close to 0 means that the model did not grow on the specified media. This tab also lists other information, including the genome, media formulation, number of reactions, and number of compounds associated with the FBA.
  • Reaction fluxes — Numerical flux values, minimum and maximum flux bounds, biochemical equations, and associated genes for each reaction in the model. This information represents the fluxes through all internal reactions that allow for growth and byproduct creation. These fluxes can be further broken down into biological pathways of interest (see Pathways tab). A user may ask, for example, “How much fatty acid is being produced?” or “What are the high flux reactions or pathways?”
  • Exchange fluxes — These fluxes describe the rates at which nutrients are taken in and byproducts are secreted. Positive exchange flux values represent the uptake of compounds, and negative exchange flux values represent the excretion of compounds.
  • Genes — This tab displays the gene knockout information, if any. Because this example uses the wildtype strain of Eshcerichia coli K12, no gene knockout information is available to display.
  • Biomass — We use the (bio2) ATP hydrolysis as the objective function in this case.
  • Pathways — This tab displays KEGG maps that represent the metabolic network of the model. Click on the name of a map (e.g., TCA cycle) to see the presence or absence of reactions (blue) and fluxes (positive fluxes are shades of red; negative fluxes are shades of green).

For more information on the Run Flux Balance Analysis method, see:

Run Flux Balance Analysis on Glucose-anaerobic Minimal Media, simulating the the growth of the Escherichia coli K12 under anaerobic conditions

We now run FBA on Ecoli_Glucose using Glucose minimal media under anaerobic conditions (without the presense of oxygen). We select the Glucose-anaerobic media formulation.

Return to Narrative contents

The viewer for the data in this Cell is available at the original Narrative here: https://narrative.kbase.us/narrative/18065
Use flux balance analysis to predict metabolic fluxes in a metabolic model of an organism grown on a given media.
This app completed without errors in 44s.
Objects
Created Object Name Type Description
Ecoli_K12_biomass_FBA FBA FBA-13 Ecoli_K12_biomass_FBA
Report
Summary
A flux balance analysis (FBA) was performed on the metabolic model Eschrichia_coli_K12_CoreModel growing in Glucose-aerobic media.
Output from Run Flux Balance Analysis
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/18065
Use flux balance analysis to predict metabolic fluxes in a metabolic model of an organism grown on a given media.
This app is new, and hasn't been started.
No output found.

Notice the objective value is now 2.75 mmol of ATP/mmol of glucose when simulated under the anaerobic condition. Compared to the objective value 26.5 mmol of ATP/mmol of glucose under the aerobic condition, it is significantly less. This is because there is no oxygen present in the media. As a result, oxidative phosphorylation is not active, electron transport chains are not utilized to produce energy. Under this condition, the organism produces energy solely from the fermentation process.

Data Viewer
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/18065

Run Flux Balance Analysis of Escherichia coli K12 on Glucose-anaerobic-Nitrate minimal media under anaerobic conditions

Facultative anerobic organisms like Escherichia coli can grow both in aerobic and anaerobic conditons, as shown before. They are able to reduce a number of anaerobic electron acceptors such as nitrate (NO3), dimethyl solfuxide (DMSO) and trimethyl amineoxide (TMAO) during anerobic respiration. If anaerobic electron acceptors are not present in the medium, these organisms are still able to grow solely using the fermentation process (as shown above). Now we run FBA on our model 'Ecoli_Glucose' anerobically with nitrate (NO3) present as an anaerobic electron acceptor.

Return to Narrative contents

Use Flux Balance Analysis (FBA) to predict metabolic fluxes in a metabolic model of an organism grown on a given media.
This app is new, and hasn't been started.
No output found.
Data Viewer
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/18065

Run Flux Balance Analysis of Paracoccous denitrificans PD1222 on Glucose-anaerobic-Nitrate minimal media under anaerobic conditions

Paracoccous denitrificans PD1222 can grow both in aerobic and anaerobic conditons, as shown before. They are able to reduce a number of anaerobic electron acceptors such as nitrate (NO3), dimethyl solfuxide (DMSO) and trimethyl amineoxide (TMAO) during anerobic respiration. If anaerobic electron acceptors are not present in the medium, these organisms are still able to grow solely using the fermentation process (as shown above). Now we run FBA on our model 'Ecoli_Glucose' anerobically with nitrate (NO3) present as an anaerobic electron acceptor.

Return to Narrative contents

Use flux balance analysis to predict metabolic fluxes in a metabolic model of an organism grown on a given media.
This app completed without errors in 40s.
Objects
Created Object Name Type Description
Paracoccus_denitrificans_PD1222_FBA_anaerobic_nitrate FBA FBA-13 Paracoccus_denitrificans_PD1222_FBA_anaerobic_nitrate
Report
Summary
A flux balance analysis (FBA) was performed on the metabolic model Paracoccus_denitrificans_PD1222_CoreModel growing in Glucose-anaerobic-Nitrate media.
Output from Run Flux Balance Analysis
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/18065

Diversity and complexity of the Electron Transport Chains of bacteria

Unlike the electron transport chains of higher eukaryotes, bacterial ETCs are highly diversified. As a result, they are able to grow in a variety of aerobic and anaerobic environments reducing anaerobic electron acceptors such as nitrate, nitrite, fumarate, dimethyl sulfoxide(DMSO) and trimethylamine N-oxide (TMAO). For instance, Escherichia coli (below) can respire aerobically and anaerobically reducing nitrate, fumarate, TMAO and DMSO. Paracoccus denitrificans (below) is also able to grow aerobically and able to reduce multiple nitrogen based compounds anaerobically including nitrate, nitrite, nitrous oxide and nitric oxide. Better annotation of ETCs helps us identify complex respiration types and make accurate energy yield predictions. In our CMMs, we have focused on adding these diverse ETC reactions across the bacterial tree of life that are derived from consistently assigned gene annotations.

figure1

Return to Narrative contents

Compare two models Escherichia coli and Paracoccous denitrificans

We are able to compare the two models of Escherichia coli and Paracoccous denitrificans

Compare FBA Models based on reactions, compounds, biomass and proteins
This app completed without errors in 60s.
Objects
Created Object Name Type Description
coli_denitrificans_model_comparison ModelComparison ModelComparison-4 coli_denitrificans_model_comparison
Report
Summary
The compouds, reactions, genes, and biomass compositions in the following 2 models were compared:Eschrichia_coli_K12_CoreModel; Paracoccus_denitrificans_PD1222_CoreModel. All models shared a common set of 83 compounds, 50 reactions, and 16 biomass compounds.
Output from Compare Models
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/18065
Compute pangenome of a set of individual genomes
This app completed without errors in 1m 9s.
Objects
Created Object Name Type Description
Ecoli_Pdenitrificans_PanGenome Pangenome Pangenome
Summary
Pangenome saved to janakakbase:1480449272898/Ecoli_Pdenitrificans_PanGenome
Output from Compute Pangenome
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/18065

Flux Distribution comparison : growth of Eshcherichi coli aerobic vs anaerobic

For each FBA solution, compare objective values, reaction fluxes, and metabolite uptake and excretion.
This app completed without errors in 46s.
Objects
Created Object Name Type Description
Ecoli_aerobic_Vs_anaerobic FBAComparison FBAComparison-5 Ecoli_aerobic_Vs_anaerobic
Output from Compare FBA Solutions
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/18065

ATP yield predictions of core models under aerobic and anaerobic conditions

Return to Narrative contents

As seen under Flux Balance Analysis tab -> Objective column, we can see aerobic growth yield 26.5 ATP verses anaerobic growth (without any electron acceptors) yield is much lower at 2.75. You can find ATP yield data on number of model organisms here https://narrative.kbase.us/narrative/ws.15253.obj.1#ATP-yield-predictions-of-core-models-under-aerobic-and-anerobic-conditions

Run Flux Balance Analysis on glucose minimal media aerobically using biomass as the objective function

As explained earlier, Core Models have two objective functions, ATP hydrolysis and biomass biosynthesis. About 41% of core models (3415), including the core model of Escherichia coli, do not need any gapfilling reactions addedd in order to produce the essential biomass precursors in the OF. However, some core models (see Table 1) require gapfilling reactions to be added in order to satisfy the biomass objective function. Now we run FBA, selecting biomass (bio2) as the objective function against glucose minimal media without the gapfilling option.

Return to Narrative contents

Use flux balance analysis to predict metabolic fluxes in a metabolic model of an organism grown on a given media.
This app is new, and hasn't been started.
No output found.
Data Viewer
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/18065

Distribution of gapfill reactions in core models

Return to Narrative contents

We can see the Escherichia coli core model was able to grow (0.12 objective value) without any gapfilling reactions added to the model. However, as some models do require gapfilling in order to produce biomass precursors, we have run an analysis identifying the distribution of number of gapfilling reactions needed by each model and organized them by phylogeny.

Figure 3. Number of gapfilled reactions that are required in CMMs in order to produce all biomass precursors, with CMMS organized by phylogenetic group. The blue bars represent the gene-associated reactions and the red bars represent the gapfilled reactions for all CMMs used in this study. The height of the bars represents the number of reactions. CMMs are grouped according to taxonomy.

figure1

References

  1. Gottschalk G (1988) Bacterial Metabolism. Springer-Verlag, New York
  2. Gottschalk G (1989) How Escherichia coli synthesizes ATP during aerobic growth of glucose. In: Bacterial Metabolism. Springer Vera, New York, pp 13-35
  3. Henry CS, DeJongh M, Best AA, Frybarger PM, Linsay B, Stevens RL (2010) High-throughput generation, optimization and analysis of genome-scale metabolic models. Nature biotechnology 28 (9):977-982. doi:10.1038/nbt.1672
  4. Karp PD, Paley S, Romero P (2002) The Pathway Tools software. Bioinformatics 18 Suppl 1:S225-232
  5. Becker SA, Feist AM, Mo ML, Hannum G, Palsson BO, Herrgard MJ (2007) Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox. Nat Protoc 2 (3):727-738
  6. Monk J, Palsson BO (2014) Genetics. Predicting microbial growth. Science (New York, NY 344 (6191):1448-1449. doi:10.1126/science.1253388
  7. Kumar VS, Dasika MS, Maranas CD (2007) Optimization based automated curation of metabolic reconstructions. BMC Bioinformatics 8:212
  8. Varma A, Palsson BO (1994) Stoichiometric flux balance models quantitatively predict growth and metabolic by-product secretion in wild-type Escherichia-coli W3110. Applied and environmental microbiology 60 (10):3724-3731
  9. Edirisinghe JN, Weisenhorn P, Conrad N, Xia F, Overbeek R, Stevens RL, Henry CS (2016) Modeling central metabolism and energy biosynthesis across microbial life. BMC genomics 17:568. doi:10.1186/s12864-016-2887-8
  10. Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman AL, Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch GD, Reich C, Stevens R, Vassieva O, Vonstein V, Wilke A, Zagnitko O (2008) The RAST Server: rapid annotations using subsystems technology. BMC genomics 9:75. doi:10.1186/1471-2164-9-75

Return to Narrative contents

Apps

  1. Annotate Microbial Genome
    • [1] Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, et al. The RAST Server: Rapid Annotations using Subsystems Technology. BMC Genomics. 2008;9: 75. doi:10.1186/1471-2164-9-75
    • [2] Overbeek R, Olson R, Pusch GD, Olsen GJ, Davis JJ, Disz T, et al. The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST). Nucleic Acids Res. 2014;42: D206 D214. doi:10.1093/nar/gkt1226
    • [3] Brettin T, Davis JJ, Disz T, Edwards RA, Gerdes S, Olsen GJ, et al. RASTtk: A modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes. Sci Rep. 2015;5. doi:10.1038/srep08365
    • [4] Kent WJ. BLAT The BLAST-Like Alignment Tool. Genome Res. 2002;12: 656 664. doi:10.1101/gr.229202
    • [5] Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, et al. BLAST+: architecture and applications. BMC Bioinformatics. 2009;10: 421. doi:10.1186/1471-2105-10-421
    • [6] Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25: 955 964.
    • [7] Cobucci-Ponzano B, Rossi M, Moracci M. Translational recoding in archaea. Extremophiles. 2012;16: 793 803. doi:10.1007/s00792-012-0482-8
    • [8] Siguier P, Perochon J, Lestrade L, Mahillon J, Chandler M. ISfinder: the reference centre for bacterial insertion sequences. Nucleic Acids Res. 2006;34: D32 D36. doi:10.1093/nar/gkj014
    • [9] van Belkum A, Sluijuter M, de Groot R, Verbrugh H, Hermans PW. Novel BOX repeat PCR assay for high-resolution typing of Streptococcus pneumoniae strains. J Clin Microbiol. 1996;34: 1176 1179.
    • [10] Croucher NJ, Vernikos GS, Parkhill J, Bentley SD. Identification, variation and transcription of pneumococcal repeat sequences. BMC Genomics. 2011;12: 120. doi:10.1186/1471-2164-12-120
    • [11] Hyatt D, Chen G-L, Locascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010;11: 119. doi:10.1186/1471-2105-11-119
    • [12] Delcher AL, Bratke KA, Powers EC, Salzberg SL. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics. 2007;23: 673 679. doi:10.1093/bioinformatics/btm009
    • [13] Akhter S, Aziz RK, Edwards RA. PhiSpy: a novel algorithm for finding prophages in bacterial genomes that combines similarity- and composition-based strategies. Nucleic Acids Res. 2012;40: e126. doi:10.1093/nar/gks406
  2. Build Metabolic Model
    • [1] Henry CS, DeJongh M, Best AA, Frybarger PM, Linsay B, Stevens RL. High-throughput generation, optimization and analysis of genome-scale metabolic models. Nat Biotechnol. 2010;28: 977 982. doi:10.1038/nbt.1672
    • [2] Overbeek R, Olson R, Pusch GD, Olsen GJ, Davis JJ, Disz T, et al. The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST). Nucleic Acids Res. 2014;42: D206 D214. doi:10.1093/nar/gkt1226
    • [3] Latendresse M. Efficiently gap-filling reaction networks. BMC Bioinformatics. 2014;15: 225. doi:10.1186/1471-2105-15-225
    • [4] Dreyfuss JM, Zucker JD, Hood HM, Ocasio LR, Sachs MS, Galagan JE. Reconstruction and Validation of a Genome-Scale Metabolic Model for the Filamentous Fungus Neurospora crassa Using FARM. PLOS Computational Biology. 2013;9: e1003126. doi:10.1371/journal.pcbi.1003126
    • [5] Mahadevan R, Schilling CH. The effects of alternate optimal solutions in constraint-based genome-scale metabolic models. Metab Eng. 2003;5: 264 276.
  3. Compare FBA Solutions
    no citations
  4. Compare Models
    no citations
  5. Compute Pangenome
    • Arkin AP, Cottingham RW, Henry CS, Harris NL, Stevens RL, Maslov S, et al. KBase: The United States Department of Energy Systems Biology Knowledgebase. Nature Biotechnology. 2018;36: 566. doi: 10.1038/nbt.4163
  6. Run Flux Balance Analysis
    • Henry CS, DeJongh M, Best AA, Frybarger PM, Linsay B, Stevens RL. High-throughput generation, optimization and analysis of genome-scale metabolic models. Nat Biotechnol. 2010;28: 977 982. doi:10.1038/nbt.1672
    • Orth JD, Thiele I, Palsson B . What is flux balance analysis? Nature Biotechnology. 2010;28: 245 248. doi:10.1038/nbt.1614