App Catalog
Sign Up Sign In
Reconstruct Plant Metabolism


By: seaver


Reconstruct the metabolic network of a plant based on an annotated genome.

The PlantSEED pipeline [1-3] was implemented within KBase to enable users to reconstruct genome-scale metabolic networks of plant primary metabolism using data they have imported or generated with other tools in the system. This overview of the PlantSEED pipeline details the steps for automated reconstruction of plant primary metabolism using KBase.

Step 1 Re-annotating Imported Genomes

Genomes imported into KBase, whether by the user, or copied from the Phytozome Genomes in the publicly available data, must be re-annotated using the PlantSEED functional ontology. This step is necessary because the functional annotations curated by the PlantSEED project are linked directly to the biochemical reactions in the ModelSEED biochemistry, which is used by KBase for metabolic modeling.

There are two approaches to annotate plant genomes, the first is to use a set of signature k-mers that were trained by the PlantSEED project to be unique for each functional annotation. The app for annotating the sequences using k-mers is Annotate Plant Transcripts with Metabolic Functions; this app takes 5-10 minutes to run. The second app (Annotate Plant Enzymes with OrthoFinder) uses a set of protein families generated by OrthoFinder [4]. This app will insert the users' sequences into the families, and cluster them accordingly to the functional annotations; this app takes 6-8 hours to run, but the resulting annotation will have a higher precision.

Step 2 - Reconstruction

Once a genome has been annotated using the PlantSEED functional ontology, it can be fed into this app for the reconstruction of plant primary metabolism, wherein the PlantSEED annotations are used to link the users' sequences to biochemical reactions in the resulting metabolic network, in the form of gene-protein-reaction (GPR) associations. The reconstruction process also adds a plant-specific biomass reaction, curated for the leaf.

The GPR associations, representing the link between the biochemical reactions and the PlantSEED functional annotations, allows the pipeline to differentiate between cases where protein products from multiple genes form a complex to catalyze a reaction, and cases where protein products from multiple genes can independently catalyze the same reaction. The reconstruction includes all reactions that have been curated to be part of plant primary metabolism, and will include the reactions even if a GPR assocation has not been formed. Notably, the reconstruction does not yet include secondary metabolism. Additionally, spontaneous reactions are added during this step.

Step 3 Flux Balance Analysis

Once model reconstruction is complete, the Flux Balance Analysis (FBA) can be applied to assess the capacity of reactions to carry flux and reaction essentiality. The Run FBA method uses Flux Variability Analysis (FVA) [9] to classify the reactions in the KBase models as essential, active or blocked. Reactions that must carry flux for growth to occur are classified as essential; reactions that only optionally carry flux are classified as active; and reactions that are unable to carry flux are classified as blocked. Genes catalyzing reactions that were classified as essential were subsequently classified as essential, as long as alternative isozymes did not exist for these genes. Essentially, the PlantSEED project provides two publicly available media formulations to be used with the reconstructions of plant primary metabolism: PlantHeterotrophicMedia and PlantAutotrophicMedia. These are so named because they use sucrose and carbon dioxide as the respective sources of carbon.

Related Publications

App Specification:

Module Commit: a8031aa0f4320c7cd96f78170d7644f696614f8d