Generated April 22, 2021

Identification of essential β-oxidation genes and corresponding metabolites for estrogen degradation by actinobacteria

Introduction

This narrative was used for a comparative transcriptomic analysis of an actinobacterium Rhodococcus sp. strain B50 incubated under three different steroid conditions. Via this analysis, the candidate genes and gene clusters of aerobic estrogen degradation in Strain B50 could be revealed.

RNA sample collection, purification and sequencing

RNA Sample Collection and purification

Strain B50 was incubated in the chemically defined mineral medium supplemented with 1 mM of estrone, testosterone, or cholesterol as the sole carbon source. Bacterial cells was harvested when substrate was consumed. Cell pellet was resuspended in 200 μL lysozyme buffer contained 20 mM Tris-HCl, 2 mM EDTA, 1% TritonX-100 and 0.8 mg lysozyme. After incubation at 37°C for 1 hour, 600 μL of TRI reagent® (Sigma-Aldrich, St. Louis, MO, USA) was added into the sample and kept in -80°C until further RNA extraction. Total RNA was extracted using Direct-zolTM RNA MiniPrep kit (Zymo Research, Irvine, CA, USA), and residual DNA was removed with the TURBO DNA-freeTM Kit (Thermo Fisher Scientific, Waltham, MA, USA). The quality and quantity of the resulting RNA samples were determined using BioAnalyzer 2100 (Agilent, Santa Clara, CA, USA) and Qubit RNA HS assay kit (Thermo Fisher Scientific, Waltham, MA, USA), respectively, while the DNA-removing efficacy was evaluated using Qubit 1x dsDNA HS assay kit (Thermo Fisher Scientific, Waltham, MA, USA).

RNA Sequencing

The QC-passed RNA samples were depleted for rRNA using the Ribo-Zeroâ„¢ Magnetic Kit for Bacteria (EPICENTRE Biotechnologies, Madison, WI, USA). Further library construction, including cDNA synthesis, adaptor ligation, and enrichment was performed following the instruction of TruSeq Stranded mRNA Library Prep (Illumina, San Diego, CA, USA). The constructed libraries were sequenced as pair-end reads (with a 301-bp read length) on the Illumina MiSeq system (Illumina, San Diego, CA, USA). Finally, we obtained 19,493,886, 20,592,482 and 19,110,700 raw reads from estrone, testosterone and cholesterol treatments, respectively.

Data Import, Trimming, Alignment and Assembling Transcripts

  1. The Strain B50 genome was downloaded in the GenBank format from the NCBI. Later, it was imported using default parameters through the Import GenBank File as Genome from Staging Area.
  2. Three different pairs of transcriptomic raw reads were imported using default parameters through the Import FASTQ/SRA File as Reads from Staging Area.
  3. All these transcriptomic raw reads were combined into a sample set through the Create RNA-seq Sample set.
  4. After quality check using Assess Read Quality with FastQC, the RNA-seq sample set was trimmed using default parameters through the Trim Reads with Trimmomatic.
  5. The trimmed RNA-seq reads were aligned to Strain B50 genome using default parameters through the Align Reads using HISAT2.
  6. Finally, each gene expression strength under different growth conditions was evaulated using default parameters through the Assemble Transcripts using StringTie.
Import a GenBank file from your staging area into your Narrative as a Genome data object
This app completed without errors in 3m 28s.
Objects
Created Object Name Type Description
B50.gbk_genome Genome Imported Genome
Links
Output from Import GenBank File as Genome from Staging Area
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/89341
Import a FASTQ/SRA file into your Narrative as a Reads data object
This app completed without errors in 32m 58s.
Objects
Created Object Name Type Description
Testosterone PairedEndLibrary Imported Reads
Links
Import a FASTQ/SRA file into your Narrative as a Reads data object
This app completed without errors in 11m 49s.
Objects
Created Object Name Type Description
Estrone PairedEndLibrary Imported Reads
Links
Import a FASTQ/SRA file into your Narrative as a Reads data object
This app completed without errors in 22m 44s.
Objects
Created Object Name Type Description
Cholesterol PairedEndLibrary Imported Reads
Links
Allows users to provide RNA-seq reads and the corresponding metadata to create an RNASeqSampleSet data object.
This app completed without errors in 4s.
No output found.
Output from Create RNA-seq Sample Set
The viewer for the output created by this App is available at the original Narrative here: https://narrative.kbase.us/narrative/89341
A quality control application for high throughput sequence data.
This app completed without errors in 26m 16s.
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/89341
  • Cholesterol_52796_5_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • Testosterone_52796_7_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • Estrone_52796_8_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • Cholesterol_52796_5_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • Testosterone_52796_7_1.rev_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
  • Estrone_52796_8_1.fwd_fastqc.zip - Zip file generated by fastqc that contains original images seen in the report
Trim paired- or single-end Illumina reads with Trimmomatic.
This app completed without errors in 2h 11m 8s.
Objects
Created Object Name Type Description
B50_trimmed_RNA_sampleset_trimm_paired ReadsSet Trimmed Reads
B50_trimmed_RNA_sampleset_trimm_paired_SampleSet RNASeqSampleSet Trimmed Reads
B50_trimmed_RNA_sampleset_trimm_unpaired_fwd ReadsSet Trimmed Unpaired Forward Reads
B50_trimmed_RNA_sampleset_trimm_unpaired_fwd_SampleSet RNASeqSampleSet Trimmed Unpaired Forward Reads
B50_trimmed_RNA_sampleset_trimm_unpaired_rev ReadsSet Trimmed Unpaired Reverse Reads
B50_trimmed_RNA_sampleset_trimm_unpaired_rev_SampleSet RNASeqSampleSet Trimmed Unpaired Reverse Reads
Align sequencing reads to long reference sequences using HISAT2.
This app completed without errors in 1h 53m 5s.
Objects
Created Object Name Type Description
Estrone_trimm_paired_alignment RNASeqAlignment Reads 89341/102/1;89341/16/4 aligned to Genome 89341/100/1
Cholesterol_trimm_paired_alignment RNASeqAlignment Reads 89341/102/1;89341/19/4 aligned to Genome 89341/100/1
Testosterone_trimm_paired_alignment RNASeqAlignment Reads 89341/102/1;89341/13/4 aligned to Genome 89341/100/1
B50_trimmed_RNA_sampleset_trimm_paired_alignment_set ReadsAlignmentSet Set of all new alignments
Summary
Created 3 alignments from the given alignment set.
Links
Assemble the transcripts from RNA-seq read alignments using StringTie.
This app completed without errors in 8m 59s.
Objects
Created Object Name Type Description
B50_trimmed_RNA_sampleset_trimm_paired_expression_set ExpressionSet ExpressionSet generated by StringTie
Testosterone_trimm_paired_expression RNASeqExpression Expression generated by StringTie
Estrone_trimm_paired_expression RNASeqExpression Expression generated by StringTie
Cholesterol_trimm_paired_expression RNASeqExpression Expression generated by StringTie
B50_trimmed_RNA_sampleset_trimm_paired_FPKM_ExpressionMatrix ExpressionMatrix FPKM ExpressionMatrix generated by StringTie
B50_trimmed_RNA_sampleset_trimm_paired_TPM_ExpressionMatrix ExpressionMatrix TPM ExpressionMatrix generated by StringTie
Links
Files
These are only available in the live Narrative: https://narrative.kbase.us/narrative/89341
  • stringtie_result.zip - File(s) generated by StringTie App

Apps

  1. Align Reads using HISAT2 - v2.1.0
    • Kim D, Langmead B, Salzberg SL. HISAT: a fast spliced aligner with low memory requirements. Nature Methods. 2015;12: 357 360. doi:10.1038/nmeth.3317
    • Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biology. 2013;14: R36. doi:10.1186/gb-2013-14-4-r36
  2. Assemble Transcripts using StringTie - v1.3.3b
    • Frazee AC, Pertea G, Jaffe AE, Langmead B, Salzberg SL, Leek JT. Ballgown bridges the gap between transcriptome assembly and expression analysis. Nat Biotechnol. 2015;33: 243 246. doi:10.1038/nbt.3172
    • https://www.nature.com/articles/nmeth.3317
    • Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biology. 2013;14: R36. doi:10.1186/gb-2013-14-4-r36
    • Pertea M, Pertea GM, Antonescu CM, Chang T-C, Mendell JT, Salzberg SL. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nature Biotechnology. 2015;33: 290 295. doi:10.1038/nbt.3122
    • Trapnell C, Pachter L, Salzberg SL. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics. 2009;25: 1105 1111. doi:10.1093/bioinformatics/btp120
    • Trapnell C, Roberts A, Goff L, Pertea G, Kim D, Kelley DR, et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat Protoc. 2012;7: 562 578. doi:10.1038/nprot.2012.016
  3. Assess Read Quality with FastQC - v0.11.5
    • FastQC source: Bioinformatics Group at the Babraham Institute, UK.
  4. Create RNA-seq Sample Set
    • Arkin AP, Cottingham RW, Henry CS, Harris NL, Stevens RL, Maslov S, et al. KBase: The United States Department of Energy Systems Biology Knowledgebase. Nature Biotechnology. 2018;36: 566. doi: 10.1038/nbt.4163
  5. Import FASTQ/SRA File as Reads from Staging Area
    no citations
  6. Import GenBank File as Genome from Staging Area
    no citations
  7. Trim Reads with Trimmomatic - v0.36
    • Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30: 2114 2120. doi:10.1093/bioinformatics/btu170