Draft Genome Sequence of Lactobacillus sp. Strain TCF032-E4, Isolated from Fermented Radish Yuejian Mao,a Meng Chen,a Philippe Horvathb DuPont Nutrition & Health, Shanghai, Chinaa; DuPont Nutrition & Health, Dangé-Saint-Romain, Franceb
Here, we report the draft genome sequence of Lactobacillus sp. strain TCF032-E4 (ⴝ CCTCC AB2015090 ⴝ DSM 100358), isolated from a Chinese fermented radish. The total length of the 57 contigs is about 2.9 Mb, with a GⴙC content of 43.5 mol% and 2,797 predicted coding sequences (CDSs). Received 17 June 2015 Accepted 24 June 2015 Published 30 July 2015 Citation Mao Y, Chen M, Horvath P. 2015. Draft genome sequence of Lactobacillus sp. strain TCF032-E4, isolated from fermented radish. Genome Announc 3(4):e00821-15. doi:10.1128/genomeA.00821-15. Copyright © 2015 Mao et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 Unported license. Address correspondence to Yuejian Mao, [email protected]
actobacillus sp. strain TCF032-E4 was isolated in 2013 from a homemade fermented white radish (brine pH 3.7) originating from Guilin, Guangxi Province, China. The 16S rRNA gene sequence of this strain shared high similarity with that of Lactobacillus plantarum subsp. plantarum ATCC 14917T (99.2%), Lactobacillus pentosus NCDO 363T (99.2%), and Lactobacillus paraplantarum CNRZ 1885T (99.1%). However, an abnormal amplification pattern (no product) was obtained with the recA multiplex PCR, a method described by Torriani et al. (1) for the differentiation of L. plantarum, L. paraplantarum, and L. pentosus species, which share highly similar 16S rRNA genes. TCF032-E4 also showed a different growth temperature range (no growth at temperatures of ⬎32°C) and sugar fermentation pattern (e.g., no sucrose utilization). These distinguishing phenotypic characters motivated a deeper analysis of TCF032-E4 through wholegenome sequencing. The genome of Lactobacillus sp. TCF032-E4 was sequenced by Majorbio Biopharm Technology Co., Ltd. (Shanghai, China) using an Illumina HiSeq 2500 platform. A paired-end library with a DNA insert length of 300 bp was generated, and 11,896,210 reads with a mean length of 101 nucleotides (nt) were obtained, representing ~413-fold genome coverage. Velvet 1.2.10 (2) was used to assemble the reads using a k-mer value of 31 (-ins_length 300, ⫺cov_cutoff 50, -exp_cov 410, -min_contig_lgth 500). The resulting draft genome sequence comprised 57 contigs ⬎500 bp in length, with a maximum contig length of 279,565 bp. The total length of all contigs is 2,904,660 bp, with a G⫹C content of 43.5 mol%. The contigs were ordered with Mauve 2.3.1 (3) using the complete genome of L. plantarum WCFS1 (4) (accession no. AL935263) as the reference, which shared 99% 16S rRNA gene similarity with TCF032-E4. The reordered draft genome sequence of TCF032-E4 was annotated with RAST (5), a fully automated online service for genome annotation. In total, 2,797 proteincoding sequences (CDS) were predicted, with an average length of 867 bp, representing a coding density of 83.5%. A total of 792 CDSs (28.3%) were identified to code for hypothetical proteins, while 1,225 CDS (44%) were classified into the RAST subsystems. Fifty-four tRNAs and one putative bacteriocin
July/August 2015 Volume 3 Issue 4 e00821-15
gene were predicted. Two intact prophages (45.3 kb and 34.4 kb) were detected by PHAST (6). No clustered regularly interspaced short palindromic repeat (CRISPR) array was detected using CRISPRFinder (7). Nucleotide sequence accession numbers. The strain is publicly available from two collections under references CCTCC AB2015090 and DSM 100358. This whole-genome shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession no. LFEE00000000. The version described in this paper is version LFEE01000000. ACKNOWLEDGMENT This work was funded by DuPont Nutrition & Health.
REFERENCES 1. Torriani S, Felis GE, Dellaglio F. 2001. Differentiation of Lactobacillus plantarum, L. pentosus, and L. paraplantarum by recA gene sequence analysis and multiplex PCR assay with recA gene-derived primers. Appl Environ Microbiol 67:3450 –3454. http://dx.doi.org/10.1128/AEM.67.8.3450 -3454.2001. 2. Zerbino DR, Birney E. 2008. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res 18:821– 829. http:// dx.doi.org/10.1101/gr.074492.107. 3. Darling AE, Mau B, Perna NT. 2010. progressiveMauve: multiple genome alignment with gene gain, loss, and rearrangement. PLoS One 5:e11147. http://dx.doi.org/10.1371/journal.pone.0011147. 4. Kleerebezem M, Boekhorst J, van Kranenburg R, Molenaar D, Kuipers OP, Leer R, Tarchini R, Peters SA, Sandbrink HM, Fiers MW, Stiekema W, Lankhorst RM, Bron PA, Hoffer SM, Groot MN, Kerkhoven R, de Vries M, Ursing B, de Vos WM, Siezen RJ. 2003. Complete genome sequence of Lactobacillus plantarum WCFS1. Proc Natl Acad Sci U S A 100:1990 –1995. http://dx.doi.org/10.1073/pnas.0337704100. 5. Overbeek R, Olson R, Pusch GD, Olsen GJ, Davis JJ, Disz T, Edwards RA, Gerdes S, Parrello B, Shukla M, Vonstein V, Wattam AR, Xia F, Stevens R. 2014. The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST). Nucleic Acids Res 42: D206 –D214. http://dx.doi.org/10.1093/nar/gkt1226. 6. Zhou Y, Liang Y, Lynch KH, Dennis JJ, Wishart DS. 2011. PHAST: a fast phage search tool. Nucleic Acids Res 39:W347–W352. http://dx.doi.org/ 10.1093/nar/gkr485. 7. Grissa I, Vergnaud G, Pourcel C. 2007. CRISPRFinder: a Web tool to identify clustered regularly interspaced short palindromic repeats. Nucleic Acids Res 35:W52–W57. http://dx.doi.org/10.1093/nar/gkm360.