Accompanying Data for: "Genome sequence of Methylobacterium sp. strain C14, isolated from concrete"
The C14 strain was isolated from a concrete cylinder that had weathered outside for 16 months as part of a long-term study on the microbiome of weathered concrete (Kiledal et al. 2021). The cylinder was cut into ~2-cm slices using a saw with an alcohol-sterilized blade, and the slices were then broken with a hammer. Small (less than 2 cm^3) concrete pieces were placed in 30 mL phosphate-buffered saline (ThermoFisher; 137 mM NaCl, 2.7 mM KCl, 8 mM Na2HPO4, and 2 mM KH2PO4) with 0.5% TWEEN-20, sonicated in a bath sonicator for 5 minutes, shaken in an incubator at 25°C and ~180 rpm for 2 hours, and then plated. The medium was 10% standard BG-11 (Allen & Stanier 1968) solidified with gellan gum and 8 mM CaCl2, amended with 2.5 g/L each of peptone, yeast extract, and glucose.
Strain C14 was isolated from a concrete cylinder that had weathered outside for 15 months as part of a long-term study on the microbiome of weathered concrete (1). Because other Methylobacterium spp. are genetically tractable (2), we sequenced its genome as part of an effort to develop genetic tools for bacterial isolates from concrete. Prior to genome sequencing, strain C14 was revived from the freezer stock on LB medium (1.5% agar) and incubated at room temperature until colonies appeared. A single colony was inoculated into 10 mL LB and incubated with shaking at room temperature until stationary phase. DNA was extracted using a phenol-chloroform extraction protocol optimized for Gram-positive bacteria (4). A single-molecule real-time (SMRT) library was barcoded and prepared using the PacBio SMRTbell Express template preparation kit version 2.0. DNA fragments larger than 6 kb were size selected using BluePippin (Sage Science). The average library fragment size was 12 kb, as measured by a fragment analyzer (Advanced Analytical Technologies, Inc.). Sequencing was completed on a PacBio Sequel IIe single-molecule sequencer in one 1M version 3 LR SMRT Cell with a 30-h movie. Samples were demultiplexed using PacBio SMRT Link version 11. The PacBio sequencing yielded 55,332 raw reads with an N50 of 12,235 bp.
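The read N50 quoted above can be illustrated with a short sketch. It assumes the usual definition (the length L such that reads of length L or longer contain at least half of all sequenced bases). The function name `n50` and the toy read lengths are hypothetical and are not taken from the sequencing run.

```python
def n50(lengths):
    """Return the N50 of a collection of read (or contig) lengths.

    Assumed definition: the largest length L such that reads of
    length >= L together contain at least half of all bases.
    """
    if not lengths:
        raise ValueError("no lengths given")
    total = sum(lengths)
    running = 0
    # Walk from the longest read down, accumulating bases.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Toy, made-up read lengths (not the C14 data).
toy_lengths = [2, 3, 4, 5, 6]
print(n50(toy_lengths))
```

Applied to the lengths of all reads in a FASTQ file, the same function would give the read N50 for that file.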
from biokbase.narrative.jobs.appmanager import AppManager

AppManager().run_app_batch(
    [{
        "app_id": "kb_uploadmethods/import_fastq_noninterleaved_as_reads_from_staging",
        "tag": "release",
        "version": "5b9346463df88a422ff5d4f4cba421679f63c73f",
        "params": [{
            "fastq_fwd_staging_file_name": "demultiplex.C14.hifi_reads.fastq",
            "fastq_rev_staging_file_name": None,
            "name": "C14_reads"
        }],
        "shared_params": {
            "sequencing_tech": "PacBio CCS",
            "single_genome": 1,
            "read_orientation_outward": 0,
            "insert_size_std_dev": None,
            "insert_size_mean": None
        }
    }],
    cell_id="6744313f-f7cc-48e5-8d22-c9adeb926dde",
    run_id="f585d956-99dd-469f-bd9d-e48a5cfb6881"
)
A summary of the geNomad results is shown below:

contig | length (bp) | coordinates | virus score |
---|---|---|---|
contig_1 (provirus) | 65,472 | 70-65,541 | 0.979 |
contig_1 (provirus) | 55,857 | 5,715,213-5,771,069 | 0.953 |
contig_1 (provirus) | 56,316 | 6,161,344-6,217,659 | 0.952 |
contig_3 | 33,746 | NA | 0.907 |

contig | length (bp) | plasmid score |
---|---|---|
contig_9 | 54,537 | 0.988 |
contig_8 | 85,428 | 0.987 |
contig_7 | 76,184 | 0.986 |
contig_2 | 62,136 | 0.982 |
contig_5 | 51,201 | 0.979 |
contig_6 | 39,253 | 0.976 |
contig_4 | 35,452 | 0.965 |
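As a quick consistency check on the geNomad tables above, the following sketch recomputes each provirus length from its coordinates and sums the lengths of the contigs with plasmid scores. It assumes the coordinates are 1-based and inclusive, which the tables do not state. The variable and function names are illustrative only.

```python
# Provirus rows copied from the table: (contig, length_bp, start, end).
proviruses = [
    ("contig_1", 65_472, 70, 65_541),
    ("contig_1", 55_857, 5_715_213, 5_771_069),
    ("contig_1", 56_316, 6_161_344, 6_217_659),
]

# Plasmid rows copied from the table: contig -> length in bp.
plasmid_lengths = {
    "contig_9": 54_537,
    "contig_8": 85_428,
    "contig_7": 76_184,
    "contig_2": 62_136,
    "contig_5": 51_201,
    "contig_6": 39_253,
    "contig_4": 35_452,
}


def span_length(start, end):
    """Length of a region, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Rows whose reported length disagrees with their coordinates.
mismatches = [
    row for row in proviruses if span_length(row[2], row[3]) != row[1]
]
total_plasmid_bp = sum(plasmid_lengths.values())

print("provirus rows with inconsistent lengths:", mismatches)
print("total length of plasmid-scored contigs (bp):", total_plasmid_bp)
```

Under the inclusive-coordinate assumption, all three provirus lengths agree with their coordinates, and the seven plasmid-scored contigs total 404,191 bp.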
Strain C14 is most closely related to Methylobacterium fujisawaense (98.19% ANI, calculated by GTDB-Tk v. 1.7.0), in the class Alphaproteobacteria.
Kiledal EA, Keffer JL, Maresca JA. 2021. Bacterial Communities in Concrete Reflect Its Composite Nature and Change with Weathering. mSystems 6:10.1128/msystems.01153-20.
Marx CJ, Lidstrom ME. 2001. Development of improved versatile broad-host-range vectors for use in methylotrophs and other Gram-negative bacteria. Microbiology 147:2065-2075.
Allen MM, Stanier RY. 1968. Growth and Division of Some Unicellular Blue-green Algae. Microbiology 51:199-202.
Kiledal E, Maresca JA. 2021. Chromosomal DNA extraction from Gram-positive bacteria. protocols.io.
Arkin AP, Cottingham RW, Henry CS, Harris NL, Stevens RL, Maslov S, Dehal P, Ware D, Perez F, Canon S, Sneddon MW, Henderson ML, Riehl WJ, Murphy-Olson D, Chan SY, Kamimura RT, Kumari S, Drake MM, Brettin TS, Glass EM, Chivian D, Gunter D, Weston DJ, Allen BH, Baumohl J, Best AA, Bowen B, Brenner SE, Bun CC, Chandonia J-M, Chia J-M, Colasanti R, Conrad N, Davis JJ, Davison BH, DeJongh M, Devoid S, Dietrich E, Dubchak I, Edirisinghe JN, Fang G, Faria JP, Frybarger PM, Gerlach W, Gerstein M, Greiner A, Gurtowski J, Haun HL, He F, Jain R, et al. 2018. KBase: The United States Department of Energy Systems Biology Knowledgebase. Nat Biotechnol 36:566-569.
Wick R. Filtlong. https://github.com/rrwick/Filtlong.
Kolmogorov M, Yuan J, Lin Y, Pevzner PA. 2019. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol 37:540-546.
Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. 2015. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 25:1043-1055.
Camargo AP, Roux S, Schulz F, Babinski M, Xu Y, Hu B, Chain PS, Nayfach S, Kyrpides NC. 2023. Identification of mobile genetic elements with geNomad. Nat Biotechnol:1-10.
Eloe-Fadrosh EA, Ahmed F, Babinski M, Baumes J, Borkum M, Bramer L, Canon S, Christianson DS, Corilo YE, Davenport KW. 2022. The National Microbiome Data Collaborative Data Portal: an integrated multi-omics microbiome data resource. Nucleic Acids Res 50:D828-D836.
Brettin T, Davis JJ, Disz T, Edwards RA, Gerdes S, Olsen GJ, Olson R, Overbeek R, Parrello B, Pusch GD, Shukla M, Thomason JA, Stevens R, Vonstein V, Wattam AR, Xia F. 2015. RASTtk: A modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes. Sci Rep 5:8365.
Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res 44:6614-6624.
Chaumeil P-A, Mussig AJ, Hugenholtz P, Parks DH. 2020. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics 36:1925-1927.
Green P, Bousfield I, Hood D. 1988. Three new Methylobacterium species: M. rhodesianum sp. nov., M. zatmanii sp. nov., and M. fujisawaense sp. nov. Int J Syst Evol Microbiol 38:124-127.