Comparative whole genome analysis of six diagnostic brucellaphages Jason Farlow, Andrey A. Filippov, Kirill V. Sergueev, Jun Hang, Adam Kotorashvili, Mikeljon P. Nikolich PII: DOI: Reference:
S0378-1119(14)00039-0 doi: 10.1016/j.gene.2014.01.018 GENE 39388
To appear in:
Gene
Received date: Revised date: Accepted date:
15 July 2013 4 November 2013 7 January 2014
Please cite this article as: Farlow, Jason, Filippov, Andrey A., Sergueev, Kirill V., Hang, Jun, Kotorashvili, Adam, Nikolich, Mikeljon P., Comparative whole genome analysis of six diagnostic brucellaphages, Gene (2014), doi: 10.1016/j.gene.2014.01.018
This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
ACCEPTED MANUSCRIPT Comparative whole genome analysis of six diagnostic brucellaphages Jason Farlow a, Andrey A. Filippov b, Kirill V. Sergueev b, Jun Hang c, Adam Kotorashvili and Mikeljon P. Nikolich b*
a
Ilia State University, Tbilisi, Georgia
b
Department of Emerging Bacterial Infections, Bacterial Diseases Branch, Walter Reed
SC
RI
PT
d
c
NU
Army Institute of Research, Silver Spring, Maryland, USA
Viral Diseases Branch, Walter Reed Army Institute of Research, Silver Spring,
Richard G. Lugar Center for Public Health Research, Tbilisi, Georgia
*corresponding author:
TE
D
d
MA
Maryland, USA
AC CE P
Department of Emerging Bacterial Infections, Bacterial Diseases Branch Walter Reed Army Institute of Research 503 Robert Grant Avenue
Silver Spring, MD 20910 Tel: +1-301-319-9469
Mobile: +1-301-640-1471 email:
[email protected] [Sequence data for this article have been deposited in GenBank under Accession numbers KC556897, KC556896, KC556898, KC556893, KC556895 and KC556894.]
1
ACCEPTED MANUSCRIPT Abstract Whole genome sequencing of six diagnostic brucellaphages, Tbilisi (Tb), Firenze
PT
(Fz), Weybridge (Wb), S708, Berkeley (Bk) and R/C, was followed with genomic
RI
comparisons including recently described genomes of the Tb phage from Mexico (TbM) and Pr phage to elucidate genomic diversity and candidate host range determinants.
SC
Comparative whole genome analysis revealed high sequence homogeneity among these
NU
brucellaphage genomes and resolved three genetic groups consistent with defined host range phenotypes. Group I was comprised of Tb and Fz phages that are predominantly
MA
lytic for Brucella abortus and Brucella neotomae; Group II included Bk, R/C, and Pr phages that are lytic mainly for B. abortus, Brucella melitensis and Brucella suis; Group
TE
D
III was comprised of Wb and S708 phage that are lytic for B. suis, B. abortus and B. neotomae. We found that the putative phage collar protein is a variable locus with
AC CE P
features that may be contributing to the host specificities exhibited by different brucellaphage groups. The presence of several candidate host range determinants is illustrated herein for future dissection of the differential host specificity observed among these phages.
1. Introduction Brucellosis is a common global zoonotic disease caused by members of the Brucella genus with severe animal and human health impacts worldwide [1]. The genus is composed of multiple species based on specificity to a primary mammalian host, including the six classical species Brucella melitensis (sheep and goats), Brucella suis
2
ACCEPTED MANUSCRIPT (hogs), Brucella abortus (cattle), Brucella ovis (sheep), Brucella canis (dogs), Brucella neotomae (wood rats) [2], as well as marine mammalian species Brucella pinnipedialis
PT
and Brucella ceti, and more recently identified atypical clades. Traditional typing of
RI
Brucella is based on colony morphology, CO2 requirement, H2S production, growth on serum dextrose agar (SDA) dye plates, serum agglutination and brucellaphage lysis at
SC
routine test dilution (RTD). Lytic bacteriophages infecting Brucella have been described
NU
for over half a century and are routinely used in diagnostic applications to confirm the identity of Brucella species [2, 3]. Brucellaphages are linear nonenveloped double-
MA
stranded DNA viruses currently classified as Podoviridae members (C1 Group morphology) with a 50-80 nm diameter icosahedral head and a 10-15 × 7-9 nm long
TE
D
noncontractile tail [4, 5]. To date all reported brucellaphages display a host range restricted to the genus Brucella. Most brucellaphages were isolated from various
AC CE P
Brucella strains possessing smooth lipopolysaccharide (LPS), including strains of B. abortus, B. suis and B. melitensis [6-11]. The brucellaphages that are active on rough LPS host strains were derived in the laboratory from phage lines lytic for smooth LPS Brucella hosts [12, 13]. The classical typing scheme used globally includes six groups of lytic brucellaphages represented by Tb, Fz, Wb, Bk, rough-specific phages R/O and R/C, and Iz. This brucellaphage typing set allows the identification of smooth and rough strains belonging to six species: B. abortus, B. melitensis, B. suis, B. canis, B. ovis and B. neotomae [3]. The biology of brucellaphage-host interaction is rarely reported in the current literature, and the genetic basis of and molecular determinants for observed host range patterns have not yet been characterized. In B. abortus, phage resistance has previously
3
ACCEPTED MANUSCRIPT been suggested to involve enzyme-resistant cell wall modifications preventing phage penetration [14] though this observation was not experimentally validated. Historically,
PT
brucellaphages have been noted to cause significant phenotypic changes in the colony
RI
morphology of Brucella cultures [7,15,16], and though phage exposure can lead to alterations in biochemical and metabolic phenotypes [17], specific genes and mechanisms
SC
involved in these processes remain unknown. Brucella spp. are genetically homogenous
NU
[2] and phage resistance systems such as CRISPRs are notably absent in Brucella genomes with the exception of minor eroded genetic structures in select B. melitensis and
MA
B. pinnipedialis strains [http://crispr.u-psud.fr/crispr/]. Recently the whole genomic sequences of a Tb phage utilized in Mexico (TbM)
TE
D
and an additional brucellaphage (Pr) were reported [18], providing the first whole genome data for this bacteriophage group. These authors observed high sequence
AC CE P
similarity between these brucellaphage genomes, Tb and Pr, and found two prominent major InDel differences between the two phages. They also noted a coding sequence (ORF) for a putative DnaA-like protein and an oriC-like origin of replication in both genomes, a finding previously unreported for bacteriophage genomes. Over 40 brucellaphages have been isolated over the past 60 years [19]. However, the population structure and fine scale molecular diversity of different brucellaphages with differential host specificity remains unknown. Thus, a comparative whole genome analysis of an expanded collection of commonly used typing brucellaphages is needed for better understanding of the phylogenetic structure of this phage group and to identify candidate loci involved in differential host range phenotypes. In this study we sequenced the complete genomes of six brucellaphages (see Table 1), including a reference Tb phage
4
ACCEPTED MANUSCRIPT along with Fz, Wb, S708, Bk and R/C. All these brucellaphages were acquired from the Félix d'Hérelle Reference Center for bacterial viruses (Université Laval, Canada) and
PT
propagated and analyzed at Walter Reed Army Institute of Research (herein the WRAIR-
RI
propagated Tb phage is designated TbW). These phage genomes were compared to the only two available sequences, those reported by Flores et al [18]: a Tb phage from
SC
Mexico (designated TbM herein) and the Pr phage. The bacteriophages sequenced and
NU
analyzed in this study are important diagnostic brucellaphages representing five out of the six typing groups [3]. Our data confirmed that the two major genome insertions
MA
or deletions (+ InDels and – InDels) that were observed previously are conserved among all the brucellaphage genomes represented in the present study. Comparative whole
TE
D
genome analysis in our expanded survey revealed extensive sequence homogeneity among these brucellaphage genomes and resolved three phylogenetic groups that were
AC CE P
consistent with defined host range phenotypes. We also noted a small but significant level of sequence divergence of the TbM phage from TbW and the other brucellaphages that were analyzed.
2. Results
2.1. Assessment of the phage lytic activity on different Brucella species
We acquired six important brucellaphages that are used globally for Brucella species determination from the Felix d’Herelle collection at Université Laval. These belong to five out of the six known typing groups, as all except the Iz group were
5
ACCEPTED MANUSCRIPT represented: Tb phage (Tb group), Fz (Fz group), Wb and S708 (Wb group), Bk (Bk group) and R/C (rough-specific phage group) [3]. The Izatnagar phage (Iz) was not
PT
available for this study. This set of typing brucellaphages allows the definitive
RI
identification of smooth and rough strains of B. abortus, B. suis, B. melitensis, B. neotomae, B. canis and B. ovis even without use of a representative of the Iz phage group
SC
(Table 1). In order to evaluate specific lytic activities, the ability of these phages to form
NU
plaques and provide confluent lysis was tested on different Brucella indicator strains commonly employed as controls in confirmatory testing regimes. Attenuated animal
MA
vaccine strain B. abortus S19 and three virulent reference Brucella strains were used: B. melitensis strain 16M, B. abortus strain 2308 and B. suis strain 23444 (also known as
D
strain 3130). Both routine test dilutions (RTD, defined as the minimal bacteriophage titer
TE
able to produce a confluent spot of lysis) and 10,000 routine test dilutions (104 × RTD)
AC CE P
were used in assays (Table 2). All the brucellaphages tested produced clear and large plaques on attenuated B. abortus S19 but formed smaller and turbid plaques on virulent B. abortus 2308. We also observed confluent lysis of B. suis 23444 by bacteriophages S708, Wb, Bk and R/C. The plaque morphology of the brucellaphages grown on B. suis 23444 differed: S708 and Wb phages produced turbid plaques, while Bk and R/C formed clear plaques. Additionally, none of the bacteriophages tested except for Bk phage were able to form clear plaques on B. melitensis 16M. Bk phage produced very small clear plaques at 10,000 RTD. These results were in agreement with the host range and specificity data published previously (see Table 1) and confirmed the authenticity of these brucellaphages acquired from Université Laval.
6
ACCEPTED MANUSCRIPT 2.2. Genomic comparison and phylogenetic structure
PT
Genomic sequences of each phage were determined and deposited in GenBank
RI
with Nucleotide Sequence Accession Numbers as follows: TbW (KC556897), S708 (KC556896), Wb (KC556898), Bk (KC556893), R/C (KC556895) and Fz (KC556894).
SC
We performed whole genome de novo assembly as well as assembly mapped to the
NU
previously reported TbM and Pr genomic sequences [18]. Similar genome sizes and GC content were observed across the phages in this study: TbW (41.142 Kb, GC content =
MA
48.2%), Fz (41.142 Kb, GC content = 48.2%), Wb (38.253 Kb, GC content = 48.2%), S708 (38.253 Kb, GC content = 48.2%), Bk (38.253 Kb, GC content = 48.2%), and R/C
TE
D
(38.253 Kb, GC content = 48.2%). Genome alignment comparison revealed pairwise nucleotide sequence identity of 96.1% or greater with strong synteny across all genomes,
AC CE P
consistent gene number (N=57 ORFs) and complete conservation of gene order. Despite the high sequence similarity observed across the phage genomes in this expanded collection of geographically and temporally diverse strains, three major evolutionary groups were evident based on our whole genome phylogenies. We noted the consistent separation of two of the major groups defined by single nucleotide polymorphisms (SNPs) and InDels (Figure 1A, 1B). Unrooted phylogenetic reconstructions using both whole genome alignments containing the two major deletions (Figure 2B) and the SNP data alone (Figure 2A) allowed us to detect this phylogenetic structure with significant bootstrap confidence. The brucellaphage genomes were consequently separated into three groups which well correlated to the distinctive lytic phenotypes: Group I was comprised of Tb and Fz phages that are predominantly lytic for
7
ACCEPTED MANUSCRIPT B. abortus and B. neotomae; Group II included Bk, R/C, and Pr phages that are lytic mainly for B. abortus, B. melitensis and B. suis; Group III was comprised of Wb and
PT
S708 phage that both originated from B. suis hosts and are lytic for B. suis, B. abortus
RI
and B. neotomae. Significant bootstrap confidence (> 0.50) support was observed for each of these major genetic groups: Group I phages TbM-TbW-Fz phages (95%), the TbW-
NU
Wb (99.4%) and the R/C-Pr subgroup (58%).
SC
Fz subgroup (85.4%), the Group II phages Bk-R/C-Pr (66.4%), Group III phages S708-
Within Group I phages (TbM, TbW and Fz), the TbW and Fz sequences differed by
MA
only a single nucleotide (SNP 22,365; TbW=T, Fz=G). In contrast, the TbM sequence had an overall nucleotide identity of 99.93% with TbW and Fz but was divergent from both in
TE
D
several intergenic (IG) regions and an insertion in ORF 21 (Figure 2A, 2B, Table 3). Only two IG loci discriminated other phage strains in this comparative study: SNP 40,644
AC CE P
shared by only Pr and TbM and SNP 40,845 that distinguished the Group I strains. In addition, genetic relationships observed from the SNP data (Figure 1A) are consistent with the distinction of the Group I clade based on whole genome comparisons (Figure 1B).
2.3. Candidate loci for host specificity
To guide future experimental identification of phage loci responsible for host ranges, we performed a comparative bioinformatic analysis of putative brucellaphage genes that might mediate host binding and specificity. In our analysis, the three major evolutionary groups identified here exhibited diversity within several putative genes with
8
ACCEPTED MANUSCRIPT sequence similarity to genes involved in host attachment and adhesion (Table 3). In Tb phage, the first major InDel region extends ORF 21 that encodes a structural protein that
PT
exhibits significant orthology to hypothetical proteins in other diverse phage and in some
RI
cases limited orthology to the neck proteins of Prochlorococcus phages. The second major InDel region both extends ORF 28 that encodes the tail fiber protein and contains
SC
an additional ORF (ORF29) that exhibits significant similarity to a pectin lyase-like
NU
domain containing carbohydrate binding proteins. It is possible the altered length of the tail fibers distinguishing Group I and Group II/III phages may reflect adaptations to
MA
distinct requisites in the receptor structure of their original bacterial hosts. ORF 14 in the TbM phage genome encodes a hypothetical protein of 137 residues
TE
D
(AEY69683) with similarity to phage head fiber proteins at C-terminal residues 86-136. An insertion at nucleotide position G96 in this gene (genomic position 9,189) is present
AC CE P
in all Group II and III members (Table 3) and results in the truncation of 17 C-terminal amino acid residues and the generation of a unique 13 residue N-terminus. ORF 29 and ORF 30 were previously shown to encode putative peptidoglycan binding and DUF847 domains that normally function in lytic activity and peptidoglycan degradation [18]. ORFs 29 and 30 showed no sequence diversity across the phage genomes analyzed in the present study. Our analysis revealed that TbM has an insertion in ORF 21 relative to all other brucellaphages. Residues G726C and A697V in the protein encoded by ORF 21 distinguished Group I members, including TbW and Fz. In addition, all Group I members including TbM were distinguished by residue E196 of ORF 43, which encodes a putative DNA methyltransferase. A substantial amount of diversity was observed within ORF 27 that encodes a putative tail collar protein (Pfam acc. no. PF07484). Seven amino acid
9
ACCEPTED MANUSCRIPT substitutions were observed in this gene, with residue 239 (TbM numbering) exhibiting extensive polymorphism across individual brucellaphage groups (Table 3). We tested for
PT
the presence of positive (diversifying) selection acting on the collar protein gene using
RI
the dN/dS ratio. Results were significant for positive selection with an overall dN/dS=2.4 (P = 0.009). ORF 27 terminates 106 bp upstream of the second deletion region that
SC
truncates ORF 28 and deletes ORF 29 in Group II brucellaphages. ORF 28 shows
NU
homology to a putative enzymatic tail fiber protein that encodes a putative GDSL/SGNH hydrolase. Enzymatic tail fiber proteins often function in the degradation of the host cell
MA
membrane. Group I brucellaphages contain the full amino acid complement of ORF 28 while Group II and III members encode a C-terminal truncation of the tail fiber gene
TE
D
leaving only an 36 N-terminal domain (Figure 3) that terminates at Pro36. ORF 29 is present only in the Group I brucellaphages and encodes a putative carbohydrate binding
AC CE P
tail spike protein containing pectin-lyase domains similar to phage P22. The P22 phage tail spike protein functions as an enzymatic viral adhesion protein that mediates binding to the bacterial host cell surface via pectin-lyase-like structural motifs (http://www.ebi.ac.uk/interpro/IEntry?ac =IPR012332). Overall, ORF 28 and 29 both appear to be large enzymatic tail fiber proteins that could be important in host cell adhesion and penetration. ORF 23 encodes a putative structural protein with a C-terminal peptidase S74 motif as well as a SNP at position 16,678 (K7E) and the InDel at positions 17,184-17,189 (A175/V176) relative to the published TbW and Pr genome comparisons (18). In our study, an additional SNP was identified at position 16,952 in ORF 23 resulting in a R98Q mutation in genomes of Group I members TbW and Fz. BLAST analysis using default search settings detected moderate similarity to the N-terminal
10
ACCEPTED MANUSCRIPT peptidase domains of the bacterial phage-related tail fiber protein of Pedobacter spp. (ZP_01885447) and the phage tail fiber protein of Burkholderia phage DC-1
PT
(YP_006589987). The Burkholderia phage DC1 tail fiber protein peptidase domain
RI
exhibited similarity to ORF 23 both in positional topology at the N-terminus and amino
SC
acid sequence identity (38%) suggesting this gene may encode a tail fiber protein.
NU
2.4. Discussion
MA
In an effort to characterize the genetic structure of bacteriophage specific for Brucella more broadly, and to identify the potential genetic requisites for differential host
TE
D
specificity, we sequenced the genomes of six diagnostic brucellaphage strains that belong to five of the six known phage typing groups and compared their whole genomic
AC CE P
sequences. The strong sequence homogeneity that was observed across these brucellaphages of differing origin suggested a lack of diversifying selection within the genetically homogenous Brucella host species. This high synteny and complete colinearity indicates that the brucellaphage group evolved in a close and perhaps clonal evolutionary history. Brucellaphages originally isolated in Georgia (Tb), England (Wb), Mexico (Pr) and Italy (Fz) were highly homogenous, consistent with the homogenous restriction profiles observed previously [22, 24] and recent genome comparisons of the TbM and Pr phages [18]. The presence of the ORF encoding a DnaA-like protein and the putative oriC-like origin of replication was conserved in the genomes of all the brucellaphages analyzed (Table 3). The oriC-like region was conserved in all the phage strains that were analyzed. Some strains have highly similar sequence and genomic
11
ACCEPTED MANUSCRIPT structures therefore are very likely to have originated from the same lineage. For example, phage Bk was isolated after incubation of a mixture of B. melitensis culture and
PT
phage Wb [23] and thus appears to be a host range mutant of Wb. R/C was derived from
RI
the R/O phage, whose ancestor was isolated from a mixture of three brucellaphages including Wb [12]. The highly homogenous genetic structure of brucellaphages observed
SC
even in our present expanded survey may be the result of hyperadaptation to the bacterial
NU
host, as suggested by Flores et al [18]. Alternatively, this feature may be the result of a genetic bottleneck in the brucellaphage population. Lack of genetic change through
MA
recombination or the absence of selective pressure for such recombination to occur could also account for this great genetic homogeneity. This may be influenced by host factors
TE
D
such as niche isolation and the observed lack of natural conjugative genetic elements in the host genus. It has also been suggested that lytic phages may exhibit more pronounced
AC CE P
genomic homogeneity and lower phylogenetic congruence with their bacterial hosts due to a lack of dependency on the host replication systems and lower frequencies of lateral genetic exchange [25]. In any case, this pronounced genetic similarity across the sequenced brucellaphages warrants formal designation as a single taxonomic species and suggests that host range selection does not impart a significant lasting structure on the population, at least among the brucellaphages that have been sequenced to date with this study. The recent discovery of atypical Brucella species that are divergent from the six classical species may provide new sources of bacteriophages specific for the genus with greater potential genetic diversity.
12
ACCEPTED MANUSCRIPT Future sequencing efforts could include other diverse brucellaphage such as Np and Iz as well as other novel strains isolated from the environment and not subject to
PT
extensive laboratory passage. Such efforts may confirm whether the genetic structure
RI
currently observed is a characteristic feature of genomes within the brucellaphage family. Genome-to-phenotype connections can also be further refined by including wider range
SC
of definitive assay results in the analysis. For instance, Fz and TbW show a high genetic
NU
similarity consistent with their shared host range; however previous studies suggest that Fz exhibits a distinct plaque morphology which enables the differentiation of B. suis
MA
biotype 4, as well as differences in physical and chemical stability [10, 26]. Similarly, the phylogenetic placement of the Wb and S708 phages had a close association (Figure
TE
D
2A, 2B) consistent with their shared origin from B. suis hosts.
AC CE P
The significance of the differences observed between the different Tb phage strains that have been sequenced is still speculative at this stage. Despite being derived from the same Georgian phage isolate, the TbM and TbW phage strains were more recently acquired from different sources, the Gamaleya Scientific Research Institute of Epidemiology and Microbiology, Moscow, Russian Federation [18] and the Félix d'Hérelle Reference Center for Bacterial Viruses of the Université Laval, Québec, Canada, respectively. The greater genetic distance observed between the TbM phage strain and other within-group brucellaphages TbW and Fz suggests that TbM may have been subjected to some diversifying selection and/or genetic drift in the laboratory, perhaps through the use of non-standard propagation host strains. In our study, lytic specificities were determined for all six brucellaphages prior to purification and genome
13
ACCEPTED MANUSCRIPT sequencing (Table 3) and all were propagated on attenuated vaccine strain B. abortus S19. In any case, confirmation of the phenotypic properties of TbM is warranted to
RI
PT
compare its combined lytic profile with other Tb reference strains including TbW.
The phenotypic characteristics of the Tb phage have been studied extensively [21,
SC
4]. Tb shows the highest efficiencies of adsorption and infection on smooth LPS B.
NU
abortus cultures and moderate and mild lytic activity towards B. neotomae and B. suis, respectively [27]. B. melitensis shows low susceptibility to the Tb phage, while cultures
MA
of naturally rough LPS B. ovis and B. canis are resistant to Tb [27]. In general, the phenotypes exhibited by Tb phage are also shared by Fz phage [28]. Wb phage is highly
TE
D
lytic for B. suis while Bk phage is primarily lytic for B. abortus and mildly lytic for B. suis and B. melitensis [28]. To date, the host receptor(s) utilized by brucellaphages
AC CE P
across their bacterial hosts have not been identified. Bacteriophage receptor ligands are specified by the distinct structure and composition of the bacterial cell wall architecture [29]. In Brucella, the structure of the outer envelope that distinguishes individual phage strain host range is characterized by the rough or smooth phenotype determined by the presence of the LPS O-polysaccharide, deeper LPS structure and the exposure of outer membrane proteins (OMPs). It has been noted that phage adsorption on smooth LPS Brucella occurs regardless of host strain-specific variations in O-polysaccharide structure, suggesting that fine scale differences in LPS profiles alone may not determine phage host specificity [2]. Indeed, previous studies of B. neotomae showed an association of the phage receptor with a protein-phospholipid-LPS complex [30]. Previously authors have postulated that because of their highly similar restriction profiles,
14
ACCEPTED MANUSCRIPT differences in brucellaphage host ranges are caused primarily by mutations in phage tail fiber receptors, or by host receptor accessibility since higher multiplicities of infection
RI
PT
can induce lysis of phage resistant hosts [22].
Our comparative analysis of multiple closely related brucellaphages led to the
SC
identification of an array of genetic differences among the brucellaphages analyzed,
NU
revealing SNPs and InDels in genes encoding enzymes and structural proteins, including in putative tail fiber proteins, DNA polymerase, the major head protein and a highly
MA
variable locus within the putative phage collar protein. Based on the differences observed across these genomes, the putative tail collar protein encoded by ORF 27 could
TE
D
be the phage receptor-binding protein, as it appears to be under some diversifying selective pressure. Changes in this gene and corresponding conformational alterations of
AC CE P
the protein could be responsible in part for the variety of brucellaphage host range phenotypes that have been described, but this will need to be tested. In structurally similar phages such as T7, conformational changes in tail collar proteins have been proposed to mediate complex interactions for penetration of the bacterial outer membrane and structural transitions in the channel for DNA injection [31, 32]. As with other Podoviridae members, brucellaphages may encode multiple tail fiber proteins, so it is possible that in addition to ORF 28 and 29, ORF 23 may also function in mediating host recognition. The truncation or absence of these proteins in the Group II brucellaphages may impart selective differences in host range phenotype.
15
ACCEPTED MANUSCRIPT Taken together, these observed genetic differences may indicate loci that are under adaptive selection which may contribute to the differential host specificity
PT
observed among these bacteriophage strains. Differences in bacterial cell surface
RI
receptors across Brucella species may be driving this observed variation in putative brucellaphage receptor-binding proteins. These relationships deserve further exploration.
SC
The molecular diversity and speculation of attributes for these putative host attachment
NU
and adhesion genes is provided here to inform future genetic interrogation and
TE
3. Materials and Methods
D
MA
experimental dissection of brucellaphage host range determinants.
AC CE P
3.1. Bacterial strains, bacteriophages and media used in this work
The characteristics of the brucellaphages used in this work are presented in Table 1. The phages were purchased from the bacteriophage collection of the Félix d'Hérelle Reference Center for Bacterial Viruses of the Université Laval (Laval, Québec, Canada). Attenuated vaccine strain B. abortus S19 and virulent strain B. abortus 2308 were both used for bacteriophage propagation. Virulent strain B. melitensis 16M and strain B. suis 23444 (also known as strain 3130) were used for phage characterization. BBL ™ Brucella Broth and BBL ™ Brucella Agar (BD Biosciences, San Jose, California USA) were used for all Brucella and brucellaphage growth procedures.
16
ACCEPTED MANUSCRIPT
PT
3.2. Bacteriophage propagation and lysis assays
RI
Bacteriophages were propagated on attenuated vaccine strain B. abortus S19 as described in [27] with a few modifications. Bacteriophage stock lysates were added to
SC
150 ml of early logarithmic phase broth bacterial cultures at a multiplicity of infection
NU
(MOI) of 0.01 and incubated at 37°C for 48 h in 1-L plastic erlenmeyer flasks with moderate shaking. Phage lysates were allowed to stand undisturbed at 4°C for another 24
MA
h and were treated with chloroform added to final concentration of 10% in order to destroy all remaining live bacteria. Bacterial debris was removed by 15 min of
TE
D
centrifugation at 5,000 g. The resulting lysates were filtered through sterile 0.22-µm membranes. Final titers of bacteriophage lysates were determined by the mixing of
AC CE P
phage lysate dilutions with a semi-solid (0.7%) agar overlay containing a B. abortus S19 suspension. Results were scored after 48-72 hours of growth at 37°C in the presence of 5% CO2.
3.3. Bacteriophage DNA isolation
Brucellaphage lysates were centrifuged at 13,000 g for 4 h and resuspended in SM buffer [33] in 1/100 of the original volume. Concentrated lysates were treated for 1 h at 37°C with RNaseA (60 µg/ml) and DNaseI (20 µg/ml) to remove residual Brucella host nucleic acids. After nuclease treatment, concentrated bacteriophage particles were incubated with proteinase K (final concentration 50 µg/ml) and SDS (0.5%) at 56°C for 1
17
ACCEPTED MANUSCRIPT h. DNA was purified by phenol:chloroform extraction as described [32] followed by overnight DNA precipitation with 70% ethanol at -20°C. The resulting DNA was re-
PT
suspended in nuclease-free water. The purity of the bacteriophage genomic DNA
RI
preparations was confirmed by restriction analysis with BamHI, EcoRI and XbaI endonucleases (New England BioLabs Inc, Ipswich, Massachusetts, USA) using standard
NU
SC
agarose gel electrophoresis.
MA
3.4. Whole genome pyrosequencing and genome assembly
Whole genome sequencing of the six brucellaphages was performed by using
D
Roche/454 GS FLX Titanium next-generation high throughput pyrosequencing system
TE
with Roche’s rapid shotgun genomic DNA library preparation method (Roche 454 Life
AC CE P
Sciences, Branford, CT, USA). Sequencing data were subjected to de novo assembly using Roche GSAssembler software (Newbler) version 2.5.3 and reference mapping assembly against the TbM (JN939331.1) and the Pr (JN939332.1) genomic sequences using the Geneious software package version 6.0.5 [34] with medium sensitivity and fine tuning at 5X iterations using trimmed sequences (yielded >100-fold coverage of each genome). Annotations were performed using Geneious and Sequin with the TbM and Pr strain genomes as references. Geneious software was used for whole genome alignments and to analyze the physical organization of reference genomic sequences including ORF content and synteny.
3.5. Comparative genomics, phylogenetic structure and selection
18
ACCEPTED MANUSCRIPT
Phylogenies were reconstructed using whole genome alignment data of TbW, TbM,
PT
Fz, Wb, S708, Bk and Pr genomic sequences using Geneious software. Unrooted
RI
Neighbor-Joining phylogenies were performed based on Tamura-Nei model with bootstrap performed with 1000 replicates. MEGA 5 software [35] was used to determine
SC
phylogenetic relationships among brucellaphage strains using unrooted Maximum-
NU
Likelihood parameters on SNP variants from Table 3 based on a Tamura-Nei model (Figure 2A). Branch lengths were measured in number of substitutions per site with
MA
bootstrap (BS) performed using 1000 replicates. The significance of dN/dS ratios in testing for the presence of positive/adaptive selection was determined by the two tailed Z-
AC CE P
Acknowledgements
TE
D
test (P