JVI Accepts, published online ahead of print on 26 November 2014 J. Virol. doi:10.1128/JVI.02968-14 Copyright © 2014, American Society for Microbiology. All Rights Reserved.
1
Title:
2
Human Noroviruses’ Fondness of Histo-Blood Group Antigens
3 4
Authors:
5
Bishal K. Singh1,2, Mila M. Leuthold1,2, and Grant S. Hansman1,2,*
6 7
Affiliations:
8
1
9
Heidelberg 69120
Schaller Research Group at the University of Heidelberg and the DKFZ, Germany,
10
2
11
Heidelberg 69120
Department of Infectious Diseases, Virology, University of Heidelberg, Germany,
12 13
*Corresponding author
14
CHS Foundation, University of Heidelberg, and German Cancer Research Center.
15
Norovirus Study Group.
16
Im Neuenheimer Feld 242, Heidelberg 69120, Germany.
17
Email:
[email protected] 18
Phone: +49 (0) 6221-1520
19 20
Abstract:
21
Human noroviruses are the dominant cause of outbreaks of gastroenteritis around the
22
world. Human norovirus interacts with the polymorphic human histo-blood group
23
antigens (HBGAs) and this interaction is thought to be important for infection.
24
Indeed, synthetic HBGAs or HBGA-expressing enteric bacteria were shown to
25
enhance norovirus infection in B cells. A number of studies have found a possible
1
26
relationship between HBGA type and norovirus susceptibility. The genogroup II
27
genotype 4 (GII.4) noroviruses are the dominant cluster, evolve every other year, and
28
are thought to modify their binding interactions to different HBGA types. Here, we
29
show the high-resolution X-ray crystal structures of the capsid protruding (P) domains
30
from epidemic GII.4 variants in 2004, 2006, and 2012, co-crystallized with a panel of
31
HBGA types (H type 2, Lewis Y, Lewis B, Lewis A, Lewis X, A-type, and B-type).
32
Many of the HBGA binding interactions were found to be complex, involving capsid
33
loop movements, alternative HBGA conformations, and HBGA rotations. We showed
34
that a loop (residues 391-395) was elegantly repositioned in order to allow for Lewis
35
Y binding. This loop was also slightly shifted to provide direct hydrogen and water-
36
mediated bonds with Lewis B. We considered that the flexible loop modulated Lewis
37
HBGA binding. The GII.4 noroviruses have dominated outbreaks over the past
38
decade, which may be explained by their exquisite HBGA binding mechanisms, their
39
fondness for Lewis HBGAs, and their temporal amino acid modifications.
40 41
Importance:
42
Our data provides a comprehensive picture of GII.4 P domain and HBGA binding
43
interactions. The exceptionally high resolutions of our X-ray crystal structures
44
allowed us to accurately recognize novel GII.4 P domain interactions with numerous
45
HBGA types. We showed that the GII.4 P domain HBGA interactions involved
46
complex binding mechanisms that were not previously observed in norovirus
47
structural studies. Many of the GII.4 P domain HBGA interactions we identified were
48
negative in earlier ELISA-based studies. Altogether, we showed that the GII.4
49
norovirus P domains could accommodate numerous HBGA types.
50
2
51
One Sentence Summary:
52
Norovirus GII.4 P domains have elaborate HBGA binding mechanisms.
53 54
Introduction:
55
Human norovirus is responsible for most epidemic outbreaks of gastroenteritis. There
56
are still no antivirals or vaccines approved, despite their discovery over four decades
57
ago (1). Noroviruses are genetically and antigenically diverse (2), yet a single genetic
58
cluster (genogroup II genotype 4, GII.4) has dominated over the past decade. (3). The
59
GII.4 noroviruses evolve ~5% every year and are believed to have a mechanism that
60
allows the virus to evade the immune system or alter receptor binding profiles (4-6).
61
However, immunity to norovirus is still poorly understood (7).
62 63
Human norovirus interacts with histo-blood group antigens (HBGAs), which is
64
thought to be important for viral infections (8-11). A recent report showed, for the
65
first time, that human norovirus infects B cells and that HBGAs (synthetic or HBGA-
66
expressing enteric bacteria) can enhance the infection (12). HBGAs are also found as
67
soluble antigens in saliva and are expressed on epithelial cells. Genetic
68
polymorphisms in genes that controls their synthesis is known to provide an
69
intraspecies diversity (13). To date, based on the ABH- and Lewis-HBGA types, at
70
least nine different HBGAs were found to interact with human norovirus. Individuals
71
expressing the O type are thought to have a significantly higher infection rate than
72
other blood types (11). The GII noroviruses are thought to have a preference to
73
HBGAs in a strain dependent manner (14-19).
74
3
75
Expression of the norovirus capsid protein in insect cells results in the formation of
76
virus-like particles (VLPs) that are antigenically similar to native virions. The X-ray
77
crystal structure of the prototype (GI.1) norovirus VLP identified two domains, shell
78
(S) and protruding (P) domain (20). The S domain forms a scaffold surrounding the
79
viral RNA, whereas the P domain is thought to contain the determinants for cell
80
attachment and strain diversity. The P domain can be further subdivided into P1 and
81
P2 subdomains and each subdomain likely has unique functions. In this study, we
82
determined the X-ray crystal structures of P domains from three epidemic GII.4
83
variants in 2004, 2006, and 2012 in complex with a panel of HBGAs in order to
84
elucidate HBGA binding mechanisms. Our data showed that the GII.4 norovirus
85
bound numerous HBGA types and that binding involved complex interactions,
86
including P domain loop movements and alternative HBGA conformations.
87
Importantly, many of our new findings challenge previous ELISA-based studies and
88
revealed interactions that have not been recognized so far (4-6). Altogether, we
89
showed that the GII.4 noroviruses were capable of binding diverse HBGA types,
90
which may correlate with a larger proportion of the human population susceptible to
91
GII.4 infections.
92 93
Materials and methods:
94
Sequence analysis and expression and purification of P domain
95
The P domain amino acid sequences from four GII.4 variants in 1998, 2004, 2006,
96
and 2012 [termed VA387-1998 (PDBID 2OBS), Farm-2004 (Genbank accession
97
number
98
respectively, were aligned using Clustal X. Farm-2004, Saga-2006, and NSW-2012 P
99
domains (residues 224 to 538) were expressed in E. coli and purified as previously
JQ478408),
Saga-2006
(AB447457),
and
NSW-2012
(JX459908),
4
100
described (21). Briefly, the codon optimized P domains were cloned into a modified
101
expression vector pMal-c2X and transformed into BL21 cells for protein expression.
102
Transformed cells were grown in LB medium supplemented with 100 μg/ml
103
ampicillin for 4 hours at 37°C. Expression was induced with IPTG (0.75 mM) at an
104
OD600 of 0.7 for 18 h at 22°C. Cells were harvested by centrifugation at 6000 rpm for
105
15 min and disrupted by sonication on ice. A His-tagged fusion-P domain protein was
106
purified from a Ni- column (Qiagen) and digested with HRV-3C protease (Novagen)
107
overnight at 4°C. Cleaved P domains were separated on the Ni-column and dialyzed
108
in gel filtration buffer (25 mM Tris-HCl and 300 mM NaCl) overnight at 4°C. The P
109
domains were purified by size exclusion chromatography, concentrated to 3-7 mg/ml,
110
and stored in gel filtration buffer at 4°C.
111 112
Crystallization of norovirus P domains
113
Crystals were grown in a 1:1 mixture of protein sample and mother liquor for 2-6
114
days at 18°C. Farm-2004 P domain crystals were grown in 20% PEG3350 and 0.2 M
115
magnesium formate; Saga-2006 crystals were grown in 3 M sodium acetate (pH 6.9);
116
and NSW-2012 crystals were grown in 20% PEG3350 and 0.2 M sodium formate.
117
Farm-2004, Saga-2006, and NSW-2012 formed long rod-shaped crystals, diamond-
118
shaped crystals, or both diamond- and plate-shaped crystals, respectively. For the P
119
domain and HBGA complexes, we co-crystallized 30-60 molar excess of HBGAs
120
(Dextra, UK). Prior to flash freezing, crystals were transferred to a cryoprotectant
121
containing mother liquor, 30 molar excess of HBGAs, and 30% ethylene glycol or
122
glycerol. Unfortunately, we were unable to produce complex crystals for all P
123
domains and HBGAs; and soaking experiments with HBGAs produced crystals with
124
high mosaicity and/ or the crystals dissolved.
5
125 126
Data collection, structure solution, and refinement
127
X-ray diffraction data were collected at the European Synchrotron Radiation Facility,
128
France at beamline BM30A and ID23-1 and processed with XDS (22). Structures
129
were solved using molecular replacement in PHASER (23). Saga-2006 P domain was
130
determined by molecular replacement using the previously solved GII.10 P domain as
131
a search model (21). Saga-2006 P domain was then used for determining the
132
structures of Farm-2004 and NSW-2012 P domains. Farm-2004 P domain formed
133
crystals in space group P212121, while Saga-2006 and NSW-2012 were both solved in
134
space group C2. Structures were refined in multiple rounds of manual model building
135
in COOT (24) with subsequent refinement with PHENIX (25). The HBGAs were
136
added to the models at the final stages of structural refinement in order to reduce bias
137
during refinement. Structures were validated with Molprobity (26) and Procheck
138
(27). HBGA interactions were analyzed using Accelrys Discovery Studio (Version
139
4.1), with hydrogen bonding interactions distances between 2.4-3.5Å and
140
hydrophobic interactions distances between 3.4-4.5Å. Figures and protein contact
141
potentials were generated using PyMOL (Version 1.12r3pre). Atomic coordinates and
142
structure factors are deposited in the Protein Databank.
143 144
Results:
145
Structures of unliganded GII.4 P domains
146
Three globally important epidemic GII.4 noroviruses in 2004, 2006, and 2012 [termed
147
Farm-2004, Saga-2006, and NSW-2012 (also known as Sydney 2012), respectively]
148
were selected for P domain and HBGA binding analysis using X-ray crystallography
149
(Fig. 1). Most of the amino acid variations were observed in the P2 subdomains. Data
6
150
statistics for GII.4 P domain apo structures are provided in Table 1. The P1
151
subdomains comprised of residues 224-274 and 418-530, whereas the P2 subdomains
152
were between residues 275-417 (Fig. 2). Similar to other human noroviruses, the P1
153
subdomain comprised of β-sheets and one α-helix, while the P2 subdomain contained
154
six antiparallel β-strands that formed a barrel-like structure. Overall, the GII.4 P dimer
155
structures were similar, with a maximum RMSD of 0.52 (Fig. 2D). This result
156
corresponded well with the high sequence identities (93-95%) and the amino acid
157
alignment (Fig. 1). In order to follow GII.4 P domain evolution, amino acid changes
158
from an earlier GII.4 P domain (in 1997) were projected onto Farm-2004, Saga-2006,
159
and NSW-2012 P dimer surfaces (Fig. 3). Most amino acid changes were surface
160
exposed and ~50% of these became fixed over the years. The region immediately
161
beneath the HBGA binding pocket showed little variation, whereas the surrounding
162
regions only contained a few amino changes. This result suggested that the HBGA
163
pocket was stable and likely contained important functions.
164 165
Structure of 2006 GII.4 P domain H2-trisaccharide (H2-tri) complex
166
The HBGAs chosen for this study were involved in a primary biosynthetic pathway
167
and were previously analyzed in ELISA-based studies (8, 10, 11, 28-31). Data
168
collection and refinement statistics for P domain HBGA complex structures are
169
provided in Tables 2, 3, and 4. The H2-trisaccharide (H2-tri) contains a single ABH-
170
fucose moiety. Two H2-trisaccharides (H2-tri) bound to the Saga-2006 P dimer. The
171
electron density was well defined for all three saccharide units, indicting the HBGA
172
was firmly held by the P domain (Fig. 4). The H2-tri was held in place by a network
173
of hydrophilic and hydrophobic interactions at the dimeric interface (Figs. 5A and
174
5B). The fucose was held by six direct hydrogen bonds, two from the side chain of
7
175
Asp374, two from the side chain of Arg345, one from the main chain of Thr344, and
176
one from the main chain of Gly443. A hydrophobic interaction was provided from
177
Tyr444. These five amino acids (Thr344, Arg345, Asp374, Gly443, and Tyr444) were
178
the common set of residues involved in other GII HBGA binding interactions at this
179
“regular pocket” (21). The central galactose of H2-tri was held by two hydrogen
180
bonds from the side chain of Ser442, while the terminal N-acetylglucosamine was
181
held by one hydrogen bond from the main chain of Gly392. A number of water-
182
mediated interactions were also observed in Saga-2006 H2-tri.
183 184
Comparisons with other GII.4 H2-tri complex structure were not possible, since the
185
Saga-2006 H2-tri represented the first known GII.4 H2-tri complex structure.
186
Superposition of a GII.4 2004 P dimer H1 pentasaccharide structure (3SLN) on the
187
Saga-2006 H2-tri revealed that the first three saccharides were similarly positioned as
188
H2-tri on the P dimers (Figs. 5C and 5D). However, the N-acetylglucosamine in
189
Saga-2006 H2-tri was flipped 180° compared to the H1 pentasaccharide. The two
190
remaining saccharides of H1 pentasaccharide were raised off the P domain, though
191
not held by any residues (19). This result showed that the H type orientation was
192
variable among GII.4 variants; or the longer pentasaccharide and H type influenced
193
the binding orientation.
194 195
Structure of 2006 GII.4 P domain Lewisy-tetrasaccharide (Ley-tetra) complex
196
Lewisy-tetrasaccharide (Ley-tetra) contains both an ABH and a Lewis fucose moiety.
197
Two Ley-tetra bound to the Saga-2006 P dimer. The electron density was well defined
198
for all four saccharide units (Fig. 4). The Lewis-fucose of Ley-tetra bound at the
199
regular pocket and was held by the common set of residues (Figs. 6A and 6B). The N-
8
200
acetylglucosamine was held by one hydrogen bond from the side chain of Ser442, the
201
galactose was held by one direct hydrogen bond from the hydroxyl group of Tyr444.
202
The ABH-fucose was not held with any direct hydrogen bonds. Several P domain
203
water-mediated interactions with fucose and galactose were also observed. In order
204
for Ley-tetra to bind to Saga-2006, a loop (residues 391-394) was shifted from the apo
205
position to an alternative conformation (Fig. 6C).
206 207
Other GII.4 P domain Ley-tetra complex structures were not available, however
208
comparison of the GII.10 P domain Ley-tetra structure (21) revealed different Ley-
209
tetra orientations on the P domains (Figs. 6D and 6E). The Lewis-fucose of Saga-
210
2006 Ley-tetra bound at the regular pocket, whereas the ABH-fucose of GII.10 Ley-
211
tetra bound at the regular pocket. Also, the terminal saccharides of GII.10 Ley-tetra
212
were directed towards the center of the P dimer, while the terminal saccharides of
213
Saga-2006 Ley-tetra were leaning towards the edge of the P dimer. Interestingly,
214
Saga-2006 had fewer direct hydrogen bonds with Ley-tetra than GII.10 (6 and 10,
215
respectively). Together, these findings suggested that there was a Ley-tetra placement
216
constraint among the different GII P domains.
217 218
Structure of 2004 and 2006 GII.4 P domain Lewisb-tetrasaccharide (Leb-tetra)
219
complexes
220
Lewisb-tetrasaccharide (Leb-tetra) contains both ABH and Lewis fucose moieties. One
221
Leb-tetra bound to the Farm-2004 P dimer, whereas two Leb-tetra bound to Saga-2006
222
P dimer. The electron density was well defined for all four saccharide units (Fig. 4)
223
The ABH-fucose of Farm-2004 Leb-tetra bound at the regular pocket and was held by
224
the common set of residues (Figs. 7A and 7B). The galactose of Farm-2004 Leb-tetra
9
225
was held by one hydrogen bond from the side chain of Ser442, while the N-
226
acetylglucosamine was not held with any hydrogen bonds. The Lewis-fucose of
227
Farm-2004 Leb-tetra was held by one hydrogen bond from the side chain of Asp391
228
and one hydrogen bond from the main chain of Gly392. A similar set of direct
229
hydrogen bonds was found in the Saga-2006 Leb-tetra structure (Figs. 7C and 7D).
230
Several P domain water-mediated interactions with ABH- and Lewis-fucose were also
231
observed.
232 233
In order to better understand Leb-tetra binding interactions, we superpositioned chains
234
A and B of Farm-2004 apo and Farm-2004 Leb-tetra (Fig. 7E). The loop in chain A
235
interacting with the Lewis-fucose of Farm-2004 Leb-tetra (residues 391-394) was in a
236
suitable position to allow direct hydrogen bonds with the Lewis-fucose. The
237
equivalent loop in chain A of Farm-2004 apo was in a slightly different conformation.
238
This result suggested that the loop was re-positioned to support the Lewis-fucose
239
binding. The loop at the unoccupied HBGA binding site of Farm-2004 Leb-tetra
240
(chain B) had a similar conformation to the equivalent loop (chain B) of Farm-2004
241
apo. The reason the second Leb-tetra did not bind to Farm-2004 was not determined,
242
although steric hindrance from the neighboring molecule could have played a role as
243
previously discussed (21). Nevertheless, these results highlighted the complexity and
244
importance of the flexible loop in Leb-tetra binding.
245 246
Structure of 2006 GII.4 P domain Lewisa-trisaccharide (Lea-tri) complex
247
Lewisa-trisaccharide (Lea-tri) contains a single Lewis fucose moiety. The electron
248
density was well defined for the Lewis-fucose and less defined for the two other
249
saccharides, which indicted that these saccharides were only loosely held on the P
10
250
domain (Fig. 4). Two Lea-tri bound to the Saga-2006 P dimer. The Lewis-fucose
251
bound at the regular pocket and was held by the common set of residues (Figs. 8A and
252
8B). The N-acetylglucosamine was held by one hydrogen bond from the side chain of
253
Ser442. Galactose was held by two hydrogen bonds from the hydroxyl group of
254
Tyr444. Several P domain water-mediated interactions were also observed with all
255
three saccharides.
256 257
Other GII P domain Lea-tri complex structures have yet to be determined.
258
Superposition of Farm-2004, Saga-2006, and NSW-2012 GII.4 apo structures showed
259
that the conformations of the side chains that interacted with N-acetylglucosamine
260
and galactose (i.e., Ser442 and Tyr444) were similarly orientated. This result
261
suggested that Farm-2004 and NSW-2012 were also capable of Lea-tri binding (Figs.
262
8C and 8D), although further studies are required.
263 264
Structure of 2012 GII.4 P domain Lewisx-trisaccharide (Lex-tri) complex
265
Lewisx-trisaccharide (Lex-tri) contains a single Lewis fucose moiety. Two Lex-tri
266
bound to the NSW-2012 P dimer. The electron density was well defined for all three
267
saccharide units, which indicated that the HBGA was firmly held by the P domain
268
(Fig. 4). The Lewis-fucose bound at the regular pocket and was held by the common
269
set of residues (Fig. 9). The N-acetylglucosamine was held by two hydrogen bonds
270
with the side chain of Ser442, while the galactose was held by one hydrogen bond
271
from the hydroxyl group of Tyr444. Interestingly, the terminal galactose of one Lex-
272
tri, was held in two conformations, ~1.5Å shift (data not shown; see PBD). However,
273
this shift did not result in any additional binding interactions.
274
11
275
Previous GII P domain and Lex-tri complex structures have not yet been determined.
276
Nevertheless, the side chains that interacted with N-acetylglucosamine and galactose
277
were similarly orientated in all three GII.4 P domains (see Figs. 8C and 8D), which
278
suggested Farm-2004 and Saga-2006 may also bind Lex-tri.
279 280
Structure of 2006 and 2012 GII.4 P domain A-trisaccharides (A-tri) complexes
281
Two A-trisaccharides (A-tri) bound to both Saga-2006 and NSW-2012 P dimers. The
282
electron density was well defined for all three saccharide units (Fig. 4). The
283
orientations of the A-tri in Saga-2006 and NSW-2012 P domains were similar. Fucose
284
was held by the common set of residues, while the galactose and N-
285
acetylgalactosamine were not supported with any direct hydrogen bonds (Fig. 10).
286
Compared to other GII structures, the A-tri saccharide units were similarly orientated
287
on the P dimers (21).
288 289
Structure of 2004, 2006, and 2012 GII.4 P domain B-trisaccharides (B-tri)
290
complexes
291
Two B-trisaccharides (B-tri) bound to Farm-2004 (Figs. 11A and 11B), Saga-2006
292
(Figs. 11C and 11D), and NSW-2012 P dimers (Figs. 11E and 11F). The electron
293
density was well defined for all three saccharide units (Fig. 4). The fucose was held
294
by the common set of residues in all complex structures, while the central and
295
terminal galactose were not held with any direct hydrogen bonds (Fig. 11). Similar to
296
NSW-2012 Lex-tri, the terminal galactose of B-tri bound to Saga-2006 was held in
297
two conformations, ~1.5Å shift, and this resulted in several new water-mediated
298
interactions (Figs. 11C and 11D). Interestingly, the loop described earlier (residues
299
391-394; see Figs. 8C and 8D) was found in two different positions in the Saga-2006
12
300
B-tri structure. In one conformation, the loop was orientated as in the Saga-2006 apo
301
structure, while the alternative conformation was similarly positioned as Saga-2006
302
Ley-tetra structure (see Figure 6C). The loop movement did not result in any
303
additional binding interactions, but merely indicated that the loop had a preference for
304
at least two conformations. Comparing to other GII structures, the B-tri saccharide
305
moieties were similarly positioned on the P domains (21).
306 307
Protein contact potential
308
The protein contact potential was calculated on a panel of GII.4 P dimers in order to
309
better understand the temporal variations in surface charge that might alter
310
antigenicity and HBGA binding (Fig. 12). The region ahead of the regular pocket and
311
towards the center of the P dimer (binding sites of A and B types) remained virtually
312
unchanged, mostly negatively charged. The regions that contributed to binding
313
terminal saccharide moieties of Lewis HBGAs underwent a modification, i.e., from
314
small areas of negative and positive charge (in 1998) to large areas of mostly negative
315
charge. In this view, it appeared that the more recent GII.4 HBGA binding pocket
316
became more negatively charged.
317 318
Discussion:
319
There is considerable debate on norovirus GII.4 evolution and their corresponding
320
interactions with different HBGA types (4-6, 8, 11, 28-31). In this study, we
321
determined the X-ray crystal structures of three P domains from epidemic GII.4
322
variants in 2004, 2006, and 2012 with a panel of HBGAs. The exceptionally high
323
resolutions of our structures allowed us to accurately define HBGA interactions,
324
several of which were not previously determined for GII.4 P domains (i.e., H2-tri,
13
325
Ley-tetra, Lea-tri, and Lex-tri). A common set of conserved residues (i.e., Asp374,
326
Arg345, Thr344, Tyr444, and Gly443) firmly held both the ABH- and Lewis-fucose.
327
The GII.4 variants were capable of binding numerous Lewis HBGA types and we
328
discovered that the Lewis HBGA, particularly Ley-tetra and LeB-tetra, binding
329
mechanisms involved more complex interactions than A-tri and B-tri binding
330
interactions. A flexible loop (residues 391-395) on the P dimer appeared versatile and
331
acted like a helping hand with Lewis HBGA tetrasaccharides. In one example, the
332
loop was cleverly repositioned to allow Ley-tetra binding (Fig. 6C). In another
333
example, the loop provided direct hydrogen and water-mediated bonds with Leb-tetra
334
after a slight repositioning (Fig. 7). This flexible loop likely modulates binding of
335
Lewis HBGAs, although in vivo interactions may involve additional mechanisms.
336 337
When comparing the sequences of the variant GII.4 P domains, we found that most
338
amino acid changes were surface exposed and ~50% became fixed (Fig. 3). The
339
region immediately beneath the HBGA binding pocket showed few amino acid
340
changes. On the other hand, the regions that contributed to binding terminal
341
saccharides of Lewis HBGAs underwent a more noticeable modification, i.e., from
342
small areas of negative and positive charge (in 1998) to larger areas of mostly
343
negative charge (Fig. 12). The amino acid variations likely corresponded with
344
temporal changes in antigenicity as previously described (32), but how these changes
345
related to apparent alterations in HBGA binding remains unclear.
346 347
Even though not all complex structures could be determined, we considered that these
348
three GII.4 P domains were capable of binding to all HBGA types examined, since
349
binding interactions were similar and only a few amino acid changes surrounding the
14
350
HBGA pocket were observed. We previously showed that the rarely detected GII.10
351
strain also firmly bound a number of HBGA types (H2-tri, A-tri, B-tri, and Ley-tetra),
352
but only weakly bound Leb-tetra and was unable to bind Lea-tri and Lex-tri (21).
353
Therefore, it is tempting to speculate that the GII.4 P domains were better adapted to
354
bind numerous HBGA types, whereas the rarely detected GII.10 virus was less
355
capable, which may also convey to the lower prevalence of the GII.10 strains in the
356
general population (21) and the world-wide distribution of the GII.4 viruses.
357 358
The affinity between norovirus and HBGAs is weak and in the high micromolar range
359
(33). We previously showed that the GII.10 P domain bound H2-tri with an affinity of
360
390 μM (33). Similarly, we found that Saga-2006 P domain had weak affinities to
361
HBGAs (~100 μM) using saturation transfer difference-NMR (unpublished data,
362
Mallagaray, Hansman, and Peters). In addition, a recent study found that a GII.4 P
363
domain (VA387 strain) had comparable affinities among different HBGA types in
364
vitro (34). Based on the number of direct hydrogen bonds and water-mediated
365
interactions, small changes in P domain affinities to HBGAs may exist and these
366
could be important in vivo.
367 368
The precise roles of HBGAs in a norovirus infection are still poorly understood,
369
although synthetic HBGAs or HBGA-expressing enteric bacteria were found to
370
enhance human norovirus infection in B cells (12). Interestingly, the synthetic HBGA
371
(H type disaccharide) in the infection experiment was conjugated to poly-acrylic acid
372
(PAA). Several studies have found that conjugated linkers may affect and/ or
373
influence HBGA binding interactions (35, 36). Further structural studies with
15
374
norovirus VLPs in complex with HBGAs could help explain the possible binding
375
mechanisms in vivo.
376 377
Many of our newly determined HBGA binding results challenged previous ELISA-
378
based findings (4-6). We found that Farm-2004 bound Leb-tetra and B-tri, whereas an
379
ELISA study showed that GII.4 VLPs with an identical P domain sequence (termed
380
2002) did not bind Leb-tetra and only weakly bound B-tri (6). We also found that
381
Saga-2006 bound H2-tri, A-tri and B-tri, Lea-tri, Leb-tetra, and Ley-tetra, while several
382
ELISA studies showed that GII.4 VLPs with an almost identical P domain sequence
383
(termed 2006) did not bind to Ley-tetra (5), A-tri (5), H2-tri (4), and Lea-tri (4).
384
Finally, we showed that NSW-2012 bound A-tri, B-tri, and Lex-tri, whereas a recent
385
ELISA study showed that GII.4 2012 VLPs with an identical P domain sequence
386
(termed GII.4-2012) did not to bind Lex-tri (4). Certainly, in vivo interactions may
387
also be different to the X-ray crystallography and ELISA-based studies. Nevertheless,
388
these new data provide a new focal point for improving HBGA binding assays in
389
order to increase our understanding of norovirus and HBGA interactions.
390 391
Acknowledgements:
392
The funding for this study was provided by the CHS foundation and the Helmholtz-
393
Chinese Academy of Sciences. G.S.H designed the research; M.L. and Anne-Kathrin
394
Herrmann performed initial Farm-2004 structural refinement; and G.S.H. and B.K.S.
395
finalized all structures. We acknowledge the European Synchrotron Radiation Facility
396
(ID23-1 and BM30A) for provision of synchrotron radiation facilities. We thank
397
Thomas Peters and Alvaro Mallagaray for performing the STD-NMR experiments
16
398
(unpublished). We also thank members of the Norovirus Study Group, Joel Sussman,
399
and Henri-Jacques Delecluse for critical comments of the manuscript.
400 401
References:
402
1.
Kapikian AZ, Wyatt RG, Dolin R, Thornhill TS, Kalica AR, Chanock
403
RM. 1972. Visualization by immune electron microscopy of a 27-nm particle
404
associated with acute infectious nonbacterial gastroenteritis. Journal of
405
virology 10:1075-1081.
406
2.
Hansman GS, Natori K, Shirato-Horikoshi H, Ogawa S, Oka T,
407
Katayama K, Tanaka T, Miyoshi T, Sakae K, Kobayashi S, Shinohara M,
408
Uchida K, Sakurai N, Shinozaki K, Okada M, Seto Y, Kamata K, Nagata
409
N, Tanaka K, Miyamura T, Takeda N. 2006. Genetic and antigenic
410
diversity among noroviruses. The Journal of general virology 87:909-919.
411
3.
Siebenga JJ, Vennema H, Renckens B, de Bruin E, van der Veer B, Siezen
412
RJ, Koopmans M. 2007. Epochal Evolution of GGII.4 Norovirus Capsid
413
Proteins from 1995 to 2006. J Virol.
414
4.
Debbink K, Lindesmith LC, Donaldson EF, Costantini V, Beltramello M,
415
Corti D, Swanstrom J, Lanzavecchia A, Vinje J, Baric RS. 2013.
416
Emergence of New Pandemic GII.4 Sydney Norovirus Strain Correlates With
417
Escape From Herd Immunity. The Journal of infectious diseases.
418
5.
Lindesmith LC, Debbink K, Swanstrom J, Vinje J, Costantini V, Baric
419
RS, Donaldson EF. 2012. Monoclonal antibody-based antigenic mapping of
420
norovirus GII.4-2002. Journal of virology 86:873-883.
17
421
6.
Lindesmith LC, Donaldson EF, Lobue AD, Cannon JL, Zheng DP, Vinje
422
J, Baric RS. 2008. Mechanisms of GII.4 norovirus persistence in human
423
populations. PLoS medicine 5:e31.
424
7.
immunity and the great escape. PLoS pathogens 8:e1002921.
425 426
Debbink K, Lindesmith LC, Donaldson EF, Baric RS. 2012. Norovirus
8.
Huang P, Farkas T, Marionneau S, Zhong W, Ruvoen-Clouet N, Morrow
427
AL, Altaye M, Pickering LK, Newburg DS, LePendu J, Jiang X. 2003.
428
Noroviruses bind to human ABO, Lewis, and secretor histo-blood group
429
antigens: identification of 4 distinct strain-specific patterns. The Journal of
430
infectious diseases 188:19-31.
431
9.
Huang P, Farkas T, Zhong W, Tan M, Thornton S, Morrow AL, Jiang X.
432
2005. Norovirus and histo-blood group antigens: demonstration of a wide
433
spectrum of strain specificities and classification of two major binding groups
434
among multiple binding patterns. Journal of virology 79:6714-6722.
435
10.
Harrington PR, Lindesmith L, Yount B, Moe CL, Baric RS. 2002. Binding
436
of Norwalk virus-like particles to ABH histo-blood group antigens is blocked
437
by antisera from infected human volunteers or experimentally vaccinated
438
mice. Journal of virology 76:12335-12343.
439
11.
Rockx BH, Vennema H, Hoebe CJ, Duizer E, Koopmans MP. 2005.
440
Association of histo-blood group antigens and susceptibility to norovirus
441
infections. The Journal of infectious diseases 191:749-754.
442
12.
Jones MK, Watanabe M, Zhu S, Graves CL, Keyes LR, Grau KR,
443
Gonzalez-Hernandez MB, Iovine NM, Wobus CE, Vinje J, Tibbetts SA,
444
Wallet SM, Karst SM. 2014. Enteric bacteria promote human and mouse
445
norovirus infection of B cells. Science 346:755-759.
18
446
13.
Ayukekbong JA, Fobisong C, Tah F, Lindh M, Nkuo-Akenji T,
447
Bergstrom T. 2014. Pattern of Circulation of Norovirus GII Strains during
448
Natural Infection. Journal of clinical microbiology 52:4253-4259.
449
14.
an answer to a historical puzzle. Trends in microbiology 13:285-293.
450 451
15.
Tan M, Jiang X. 2011. Norovirus-host interaction: multi-selections by human histo-blood group antigens. Trends in microbiology 19:382-388.
452 453
Tan M, Jiang X. 2005. Norovirus and its histo-blood group antigen receptors:
16.
Cao S, Lou Z, Tan M, Chen Y, Liu Y, Zhang Z, Zhang XC, Jiang X, Li X,
454
Rao Z. 2007. Structural basis for the recognition of blood group trisaccharides
455
by norovirus. Journal of virology 81:5949-5957.
456
17.
Bu W, Mamedova A, Tan M, Xia M, Jiang X, Hegde RS. 2008. Structural
457
basis for the receptor binding specificity of Norwalk virus. Journal of virology
458
82:5340-5347.
459
18.
Kubota T, Kumagai A, Ito H, Furukawa S, Someya Y, Takeda N, Ishii K,
460
Wakita T, Narimatsu H, Shirato H. 2012. Structural basis for the
461
recognition of Lewis antigens by genogroup I norovirus. Journal of virology
462
86:11138-11150.
463
19.
Shanker S, Choi JM, Sankaran B, Atmar RL, Estes MK, Prasad BV.
464
2011. Structural analysis of histo-blood group antigen binding specificity in a
465
norovirus GII.4 epidemic variant: implications for epochal evolution. Journal
466
of virology 85:8635-8645.
467
20.
Prasad BV, Hardy ME, Dokland T, Bella J, Rossmann MG, Estes MK.
468
1999. X-ray crystallographic structure of the Norwalk virus capsid. Science
469
286:287-290.
19
470
21.
Hansman GS, Biertumpfel C, Georgiev I, McLellan JS, Chen L, Zhou T,
471
Katayama K, Kwong PD. 2011. Crystal structures of GII.10 and GII.12
472
norovirus protruding domains in complex with histo-blood group antigens
473
reveal details for a potential site of vulnerability. Journal of virology 85:6687-
474
6701.
475
22.
Kabsch W. 1993. Automatic processing of rotation diffraction data from
476
crystals of initially unknown symmetry and cell constants. J Appl Cryst
477
26:795-800.
478
23.
McCoy AJ, Grosse-Kunstleve RW, Adams PD, Winn MD, Storoni LC,
479
Read RJ. 2007. Phaser crystallographic software. Journal of Applied
480
Crystallography 40:658-674.
481
24.
development of Coot. Acta Crystallogr D Biol Crystallogr 66:486-501.
482 483
Emsley P, Lohkamp B, Scott WG, Cowtan K. 2010. Features and
25.
Adams PD, Afonine PV, Bunkóczi G, Chen VB, Davis IW, Echols N,
484
Headd JJ, Hung L-W, Kapral GJ, Grosse-Kunstleve RW, McCoy AJ,
485
Moriarty NW, Oeffner R, Read RJ, Richardson DC, Richardson JS,
486
Terwilliger TC, Zwart PH. 2010. PHENIX: a comprehensive Python-based
487
system for macromolecular structure solution. Acta Crystallogr D Biol
488
Crystallogr 66:213-221.
489
26.
Chen VB, Arendall WB, 3rd, Headd JJ, Keedy DA, Immormino RM,
490
Kapral GJ, Murray LW, Richardson JS, Richardson DC. 2010.
491
MolProbity: all-atom structure validation for macromolecular crystallography.
492
Acta crystallographica. Section D, Biological crystallography 66:12-21.
493 494
27.
Morris AL, MacArthur MW, Hutchinson EG, Thornton JM. 1992. Stereochemical quality of protein structure coordinates. Proteins 12:345-364.
20
495
28.
Tan M, Hegde RS, Jiang X. 2004. The P domain of norovirus capsid protein
496
forms dimer and binds to histo-blood group antigen receptors. Journal of
497
virology 78:6233-6242.
498
29.
Harrington PR, Vinje J, Moe CL, Baric RS. 2004. Norovirus capture with
499
histo-blood group antigens reveals novel virus-ligand interactions. Journal of
500
virology 78:3035-3045.
501
30.
Tan M, Huang P, Meller J, Zhong W, Farkas T, Jiang X. 2003. Mutations
502
within the P2 domain of norovirus capsid affect binding to human histo-blood
503
group antigens: evidence for a binding pocket. Journal of virology 77:12562-
504
12571.
505
31.
Lindesmith L, Moe C, Marionneau S, Ruvoen N, Jiang X, Lindblad L,
506
Stewart P, LePendu J, Baric R. 2003. Human susceptibility and resistance to
507
Norwalk virus infection. Nature medicine 9:548-553.
508
32.
Allen DJ, Gray JJ, Gallimore CI, Xerry J, Iturriza-Gomara M. 2008.
509
Analysis of amino acid variation in the P2 domain of the GII-4 norovirus VP1
510
protein reveals putative variant-specific epitopes. PloS one 3:e1485.
511
33.
Hansman GS, Shahzad-Ul-Hussan S, McLellan JS, Chuang GY, Georgiev
512
I, Shimoike T, Katayama K, Bewley CA, Kwong PD. 2012. Structural basis
513
for norovirus inhibition and fucose mimicry by citrate. Journal of virology
514
86:284-292.
515
34.
Han L, Kitova EN, Tan M, Jiang X, Pluvinage B, Boraston AB, Klassen
516
JS. 2014. Affinities of human histo-blood group antigens for norovirus capsid
517
protein complexes. Glycobiology.
21
518
35.
Caddy S, Breiman A, le Pendu J, Goodfellow I. 2014. Genogroup IV and
519
VI canine noroviruses interact with histo-blood group antigens. Journal of
520
virology 88:10377-10391.
521
36.
Rademacher C, Krishna NR, Palcic M, Parra F, Peters T. 2008. NMR
522
experiments reveal the molecular basis of receptor recognition by a calicivirus.
523
Journal of the American Chemical Society 130:3669-3675.
524 525
Figure legends:
526
Figure 1. An amino acid alignment of norovirus GII.4 variants. The P domain
527
amino acid sequences of four GII.4 variants in 1998, 2004, 2006, and 2012 (termed
528
VA398, Farm-2004, Saga-2006, and NSW-2012) were aligned using Clustal X. The
529
capsid sequences shared between 93-95% amino acid identity. The S domain was
530
highly conserved with only seven amino acid differences (not shown), whereas the P1
531
(red) and P2 (yellow) subdomains were more variable. The common set of amino
532
acids interacting with HBGAs were shaded blue (chain A) and green (chain B).
533
Compared to the earlier discovered GII.4 P domain (VA387-1998), one amino acid
534
insertion was observed in 2004 and remained in 2006 and 2012.
535 536
Figure 2. X-ray crystal structures of unbound GII.4 P domains. (A) Farm-2004 P
537
domain apo structure contained one dimer per asymmetric unit. The P domain was
538
subdivided into P1 (chain A, pink and chain B, pale cyan) and P2 (chain A, green
539
cyan and chain B, light magenta). (B) Saga-2006 P domain apo structure contained
540
one monomer per asymmetric unit (dimer shown) and was subdivided into P1 (chain
541
A, brown and chain B, yellow orange) and P2 (chain A, deep teal and chain B, dirty
542
violet). (C) NSW-2012 P domain apo structure contained one monomer per
22
543
asymmetric unit (dimer shown) and was subdivided into P1 (chain A, lime and chain
544
B, marine) and P2 (chain A, blue white and chain B, teal). (D) Superposition of Farm-
545
2004, Saga-2006, and NSW-2012 P dimers revealed their overall structures were
546
similar.
547 548
Figure 3. Amino acid variations in GII.4 P dimers in 2004, 2006, and 2012.
549
Amino acid changes (red) were highlighted on GII.4 P dimers (side and top views).
550
The changes were numbered according to a change from 1998 to the respective year
551
(labeled once). A cumulative addition of amino acid changes was found. (A) Farm-
552
2004 contained a single amino acid insertion, Gly394, and this remained in 2012. A
553
small number of amino acid changes surrounding the HBGA pocket (black circle)
554
was observed, i.e., I389V, L375F, and Q376E. (B) Saga-2006 contained additional
555
changes, several of which became fixed, e.g., L375F and Q376E. (C) NSW-2012
556
showed the majority of changes, including several changes in the P1 subdomain.
557 558
Figure 4. Representative simulated annealing difference omit maps. The omit
559
map (blue) was contoured between 2.5-2.0 σ. The H2-tri is an α-L-fucose-(1-2)-β-D-
560
galactose-(1-4)-N-acetyl-β/α-D-glucosamine; A-tri is an α-L-fucose-(1-2)-α-D-
561
galactose-(3-1)-N-acetyl-α-D-galactosamine; B-tri is an α-L-fucose-(1-2)-β/α-D-
562
galactose-(3-1)-α-D-galactose; Ley-tetra is α-L-fucose-(1-2)-β-D-galactose-(1-4)-N-
563
acetyl-β/α-D-glucosamine-(3-1)-α-L-fucose; Leb-tetra is an α-L-fucose-(1-2)-β-D-
564
galactose-(1-3)-N-acetyl-β-D-glucosamine-(4-1)-α-L-fucose;
565
galactose-(1-3)-N-acetyl-β/α-D-glucosamine-(4-1)-α-L-fucose (the electron density
566
was well defined for fucose and partially defined for the N-acetylglucosamine and
567
galactose); and Lex-tri is a β-D-galactose-(1-4)-N-acetyl-β/α-D-glucosamine-(3-1)-α-
Lea-tri
is
a
β-D-
23
568
L-fucose. The underlined β represents the position of the reducing end hydroxyl
569
group, which was fixed in α position in the crystal structures.
570 571
Figure 5. Saga-2006 P dimer binding interactions with H2-tri. (A) Close-up
572
surface and ribbon representation of the Saga-2006 H2-tri complex structure, showing
573
the hydrogen bonds (black lines) with H2-tri (cyan sticks) and water-mediated
574
interactions (red sphere). (B) Saga-2006 P dimer and H2-tri binding interactions (α-
575
fucose, FUC; β-galactose, GAL; and α-N-acetylglucosamine, NDG). The black lines
576
represent the hydrogen bonds, the red line represents the hydrophobic interaction with
577
the hydroxyl group of Tyr444, and the sphere represents water. (C) The ABH-fucose
578
of Saga-2006 H2-tri bound at the regular pocket. The acetyl group of N-
579
acetylglucosamine was leaning towards the edge of the P dimer. (D) The ABH-fucose
580
of TCH05-2004 H1-pentasaccharide bound at the regular pocket. The acetyl group of
581
N-glucosamine was leaning towards the center of the P dimer.
582 583
Figure 6. Saga-2006 P dimer binding interactions with Ley-tetra. (A) A close-up
584
surface and ribbon representation of the Saga-2006 Ley-tetra complex structure,
585
showing the hydrogen bonds with Ley-tetra (green sticks) and water-mediated
586
interactions. (B) Saga-2006 P dimer and Ley-tetra binding interactions. (C) A loop in
587
the Saga-2006 P2 subdomain (residues 391-394) was repositioned from an apo
588
position (gray) to an alternative position (deep teal) in order for Ley-tetra binding. (D)
589
The Lewis-fucose of Saga-2004 Ley-tetra bound at the regular pocket on the P domain
590
and was leaning towards the edge of the P dimer. (E) The ABH-fucose of GII.10 Ley-
591
tetra bound at the regular pocket and was orientated towards the center of the P dimer.
592
24
593
Figure 7. Farm-2004 and Saga-2006 P dimer binding interactions with Leb-tetra.
594
(A) Close-up surface and ribbon representation of the Farm-2004 Leb-tetra complex
595
structure, showing the hydrogen bonds with Leb-tetra (marine sticks) and water-
596
mediated interactions. (B) Farm-2004 P dimer and Leb-tetra binding interactions (α-
597
fucose, FUC; β-galactose, GAL; and β-N-acetylglucosamine, NAG). The black lines
598
represent the hydrogen bonds, the red line represents the hydrophobic interaction with
599
the hydroxyl group of Tyr444, and the sphere represents water. (C) Close-up surface
600
and ribbon representation of the Saga-2006 Leb-tetra complex structure, showing the
601
hydrogen bonds with Leb-tetra and water-mediated interactions. (D) Saga-2006 P
602
dimer and Leb-tetra binding interactions. (E) Superposition of both A and B chains
603
from Farm-2004 apo (gray and black) and Farm-2004 Leb-tetra (cyan and pink)
604
structures.
605 606
Figure 8. Saga-2006 P dimer binding interactions with Lea-tri and superposition
607
of GII.4 P domains. (A) A close-up surface and ribbon representation of the Saga-
608
2006 Lea-tri complex structure, showing hydrogen bonds with Lea-tri (camel sticks)
609
and water-mediated interactions. (B) Saga-2006 P dimer and Lea-tri binding
610
interactions (α-fucose, FUC; α-N-acetylgalactosamine, NDG; and β-galactose, GAL).
611
(C) Superposition of apo and HBGA-bound Farm-2004, Saga-2006, and NSW-2012 P
612
dimer structures (HBGAs removed from the structures). The circle represents the
613
HBGA binding pocket. Farm-2004 P1 subdomains (chain A, pink and chain B, pale
614
cyan) and P2 subdomains (chain A, green cyan and chain B, light magenta); Saga-
615
2006 P1 subdomains (chain A, brown and chain B, yellow orange) and P2
616
subdomains (chain A, deep teal and chain B, dirty violet); NSW-2012 P1 subdomains
617
(chain A, lime and chain B, marine) and P2 subdomains (chain A, blue white and
25
618
chain B, teal) were colored accordingly. (D) Close-up of the P2 subdomain flexible
619
loop (residues 391-394). In the case of H2-tri, Ley-tetra, Lea-tri, and Lex-tri, the N-
620
acetylglucosamine was held with the side chain of Ser442, while the galactose was
621
held with a hydrogen bond from the hydroxyl group of Tyr444. The loop required for
622
the Lewis HBGA-tetrasaccharide interactions was found in multiple conformations on
623
both A and B chains.
624 625
Figure 9. NSW-2012 P dimer interaction with Lex-tri. (A) A close-up surface and
626
ribbon representation of the NSW-2012 Lex-tri complex structure, showing the
627
hydrogen bonds with Lex-tri (salmon sticks) and water-mediated interactions. (B)
628
NSW-2012 and Lex-tri binding interactions (α-fucose, FUC; α-N-acetylglucosamine,
629
NDG; and β-galactose, GAL). The black lines represent the hydrogen bonds, the red
630
line represents the hydrophobic interaction with the hydroxyl group of Tyr444, and
631
the sphere represents water.
632 633
Figure 10. Saga-2006 and NSW-2012 P dimer interactions with A-tri. (A) Close-
634
up surface and ribbon representation of the Saga-2006 A-tri complex structure,
635
showing the hydrogen bonds with A-tri (yellow sticks) and water-mediated
636
interactions. (B) Saga-2006 and A-tri binding interactions (α-fucose, FUC; α-
637
galactose, GLA; and α-N-acetylgalactosamine, A2G). The black lines represent the
638
hydrogen bonds, the red line represents the hydrophobic interaction with the hydroxyl
639
group of Tyr444, and the sphere represents water. (C) A close-up surface and ribbon
640
representation of the NSW-2012 A-tri complex structure, showing the hydrogen
641
bonds with A-tri and water-mediated interactions. (D) NSW-2012 and A-tri binding
642
interactions.
26
643 644
Figure 11. Farm-2004, Saga-2006, and NSW-2012 P dimer interactions with B-
645
tri. (A) Close-up surface and ribbon representation of the Farm-2004 B-tri complex
646
structure, showing hydrogen bonds with B-tri (pink sticks) and water-mediated
647
interactions. (B) Farm-2004 P dimer and B-tri binding interactions (α-fucose, FUC;
648
α-galactose, GLA). The black lines represent the hydrogen bonds, the red line
649
represents the hydrophobic interaction with the hydroxyl group of Tyr444, and the
650
sphere represents water. (C) Close-up surface and ribbon representation of the Saga-
651
2006 B-tri complex structure, showing hydrogen bonds with B-tri and water-mediated
652
interactions. The galactose was found in two different conformations (gray and pink
653
sticks) (D) Saga-2006 P dimer and B-tri binding interactions showing newly formed
654
hydrogen bonds (blue lines) with the alternative galactose position. (E) Close-up
655
surface and ribbon representation of the NSW-2012 B-tri complex structure, showing
656
hydrogen bonds with B-tri and water-mediated interactions. (F) NSW-2012 P dimer
657
and B-tri binding interactions.
658 659
Figure 12. Surface representation of protein contact potential of GII.4 P dimers.
660
The protein contact potential (where red represented negative charge, white
661
represented neutral charge, and blue represented positive charge; ~ -55 to +55 kT/e)
662
was calculated for VA387-1997 (2OBT), TCH-2004 (3SLD), Farm-2004, Saga-2006,
663
and NSW-2012 (top view and close-up of the HBGA pocket). Leb-tetra of Farm-2004
664
Leb-tetra structure (marine sticks) was modeled into VA387, TCH-05, Saga-2006, and
665
NSW-2012. B-tri (pink sticks) and A-tri (yellow sticks) were complex structures. The
666
regions surrounding the regular ABH-fucose binding pocket remained mostly
667
unchanged and negatively charged. The regions binding terminal saccharides of
27
668
Lewis HBGAs changed from small patches of negative/ positive charge to larger
669
areas of negative charge.
28
Figure 1
VA387-1998 Farm-2004 Saga-2006 NSW-2012
KPFTVPILTVEEMSNSRFPIPLEKLYTGPSSAFVVQPQNGRCTTDGVLLGTTQLSAVNICTFRGDVT .............T...........F.............................P........... .............T...........F....G........................P........... ...S..V......T...........F.............................P...........
291 291 291 291
VA387-1998 Farm-2004 Saga-2006 NSW-2012
HIAGSHDYIMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGMLTQTTREDGSTRAHKATVSTGSVHF ....T.N.T............................R..........G.....G........D... .....RN.T.....L..........................L.....KG.....G.....Y...AP. ..T..RN.T.........D......................V......T.....G.....Y...AD.
358 358 358 358
VA387-1998 Farm-2004 Saga-2006 NSW-2012
TPKLGSVQYTTDTNNDLQTGQNTKFTPVGVIQDGN-NHQNEPQQWVLPNYSGRTGHNVHLAPAVAPT ......I.FN......FE............V....GA...........S.................. ........FS...E..FE.H..............STT.R.........S....NV............ A....R..FE...DR.FEAN..............GTT.R.........S....NT............
423 424 424 424
VA387-1998 Farm-2004 Saga-2006 NSW-2012
FPGEQLLFFRSTMPGCSGYPNMNLDCLLPQEWVQHFYQEAAPAQSDVALLRFVNPDTGRVLFECKLH ................................................................... ......................D............................................ ......................D...........Y................................
490 491 491 491
VA387-1998 Farm-2004 Saga-2006 NSW-2012
KSGYVTVAHTGPHDLVIPPNGYFRFDSWVNQFYTLAPM ...........Q.......................... ...........Q.......................... ...........Q..........................
529 530 530 530
Figure 2 A B: P2
A: P2 90°
B: P1
A: P1 C
C N
N
B B: P2
A: P2 90°
B: P1
A: P1 C
C N N
C B: P2
A: P2 90°
B: P1
A: P1 C
C N
D
N
Figure 3 A S296T, D298N, I300T A’346G’ -394G, N395A
L’375F’, Q’376E’ S’296T’ H’297Q’, D’298N’ I’300T’, V’365I’, Y’367F’ -394G’, N’395A’
V365I, Y367F, T368N N407S
E’340G’
S296T, D298N, I300T L375F, Q376E A340G E346G
S355D
P504Q
90° Y250F
K329R I389V N395A P504Q Y250F
B L’375F’, Q’376E’ G’378H’ N’372E’ N393S, -394T, N395T E’340G’ R’339K’ T412N Q397R N448D P504Q
I’300T’ H’297R’, D’298N’ N’393S’, -394T’, N’395T’ V’356A’, H’357P’
V356A, H257P H297R N372E G378H T412N, G413V
Y367F, T368S S352Y N393S, -394T, N395T
T’412N’, G’413V’ Q’306L ’ S’255G 90° ’
S255G
L375F, Q376E R339K E340G A346G M333L
S255G P504Q
Y250F
Y250F
Q397R
C T’377A’, G’378N’ L’375F’, Q’376E’ N393G, -394T, N395T E’340T’ T412N Q397R N448D P504Q Y250F N’310D’ I231V T228S
N297R N’372D’, N’373R’ A’294T’ H’297R’, D’298N’ N’393G’, -394T’, N’395T’ I’300T’ V’356A’, H’357D’ T’412N’, G’413T’
S364R A294T N310D G378N V356A, H257D T412N, G413T Y367F, T368E S352Y
90°
N393G, -394T, N395T P504Q Y250F Q397R
L375F, Q376E, T377A N372E H297R E340T A346G M333L
H2-tri Saga-2006
Ley-tetra Saga-2006
Leb-tetra Farm-2004
Leb-tetra Saga-2006
Lex-tri NSW-2012
Lea-tri Saga-2006
A-tri Saga-2006
A-tri NSW-2012
B-tri Farm-2004
B-tri Saga-2006
Figure 4
B-tri NSW-2012
Figure 5
A
B
C
D N-acetylglucosamine
N-acetylglucosamine
ABH-fucose
ABH-fucose
Figure 6 A
B
C apo loop position Saga-2006-Ley-tetra loop position
D
E
Lewis-fucose
ABH-fucose
Lewis-fucose
ABH-fucose
Figure 7 B
A
C
E
D
Farm-2004 Leb-chain B (unbound) Farm-2004 Leb-chain A (bound) Farm-2004 apo-chain A
Farm-2004 apo-chain B
Lewis fucose
Figure 8
A
C
D
B
Chain A
Chain B
Loop 391-394 Chain A Tyr444
Loop 391-394 Chain B Tyr444
Ser442
Ser442
Figure 9
A
B
Figure 10
A
C
B
D
Figure 11 A
B
C
D
E
F
VA387-1997
TCH05-2004
Farm-2004
Saga-2006
NSW-2012
~ -55
55 kT/e
Figure 12
Table 1. Data collection and refinement statistics of apo GII.4 P domain structures
Data collection Space group Cell dimensions a, b, c (Å) α, β, γ (°) Resolution range (Å) Rmerge I/σI Completeness (%) Redundancy
Farm-2004 (4OOV)
Saga-2006 (4OOX)
NSW-2012 (4OOS)
P212121
C2
C2
62.95 90.12 109.21 90 90 90 46.70-1.50 (1.55-1.50)* 7.48 (54.74)* 14.18 (2.37)* 99.24 (97.64)* 4.9 (4.1)*
96.72 58.94 62.14 90 119.88 90 48.22-1.03 (1.07-1.03)* 3.40 (52.85)* 15.12 (1.93)* 96.03 (90.00)* 3.0 (2.7)*
98.48 55.07 63.46 90 120.10 90 46.25-1.60 (1.66-1.60)* 4.43 (24.53)* 20.63 (5.12)* 98.14 (94.97)* 3.5 (3.3)*
Refinement Resolution range (Å) 46.70-1.53 27.17-1.20 No. of reflections 93719 92504 Rwork/Rfree 14.08/16.41 11.96/14.32 No. of atoms 10090 4975 Protein 4755 2436 Ligand/ion 32 40 Water 778 295 2 Average B factors (Å ) Protein 13.80 13.90 Ligand/ion 22.30 22.90 Water 26.40 26.00 RMSD Bond length (Å) 0.009 0.010 Bond angle (°) 1.29 1.38 Each data set was collected from single crystals, respectively. *Values in parentheses are for highest-resolution shell.
42.60-1.64 35573 13.77/16.15 5021 2406 24 318 12.90 18.80 23.30 0.005 1.09
Table 2. Data collection and refinement statistics of Farm-2004 P domain and HBGA complex structures
Data collection Space group Cell dimensions a, b, c (Å) α, β, γ (°) Resolution range (Å) Rmerge I/σI Completeness (%) Redundancy Refinement Resolution range (Å) No. of reflections Rwork/Rfree No. of atoms Protein Ligand/ion Water Average B factors (Å2) Protein Ligand/ion Water RMSD Bond lengths (Å) Bond angles (°)
B-tri (4X05)
Leb-tetra (4OPS)
C2
P212121
175.11 89.54 106.73 90 127.55 90 19.89-1.96 (2.01-1.96)* 15.8 (113.90)* 8.89 (1.29)* 99.10 (99.30)* 5.5 (4.7)*
71.45 90.11 91.87 90 90 90 47.81-1.75 (1.79-1.75)* 11.20 (53.53)* 9.88 (2.41)* 97.30 (95.10)* 5.5 (5.7)*
19.89-1.98 90505 16.30/ 20.31 10774 9437 132 12045
47.81-1.76 57869 18.50/21.64 9541 4723 46 324
22.70 43.00 31.20
23.20 57.20 26.80
0.008 1.10
0.013 1.36
Each data set was collected from single crystals, respectively. *Values in parentheses are for highest-resolution shell.
Table 3. Data collection and refinement statistics of Saga-2006 P domain and HBGA complex structures Data collection Space group Cell dimensions a, b, c (Å) α, β, γ (°) Resolution range (Å) Rmerge I/σI Completeness (%) Redundancy
H2-tri (4WZK)
A-tri (4X07)
B-tri (4X06)
Lea-tri (4WZL)
Leb-tetra (4OPO)
Ley-tetra (4WZE)
C2
C2
C2
C2
C2
C2
114.99 58.81 98.02 90 108.11 90 46.58-1.47 (1.51-1.47)* 3.091 (48.00)* 17.31 (2.27)* 97.30 (95.10)* 2.9 (2.9)*
97.96 58.64 114.59 90 105.49 90 48.01-1.28 (1.32-1.28)* 6.353 (41.88) 7.40 (1.96)* 98.00 (92.70)* 4.5 (3.6)*
113.83 58.65 97.31 90 107.34 90 48.10-1.22 (1.25-1.22)* 3.519 (35.40)* 13.92 (2.54)* 96.70 (90.20)* 2.6 (2.5)*
96.87 58.83 124.44 90 119.8 90 49.59-1.57 (1.61-1.57)* 5.205 (48.73)* 8.11 (1.99)* 97.61 (88.60)* 3.0 (2.9)*
113.66 58.6 97.17 90 107.15 90 46.42-1.38 (1.42-1.38)* 7.537 (93.06)* 12.76 (1.71)* 99.20 (97.04)* 3.7 (3.6)
97.02 58.50 113.86 90 108.1 90 48.16-1.45 (1.49-1.45)* 2.8 (45.5)* 10.16 (1.84)* 95.17 (93.87)* 2.3 (2.2)*
32.32-1.22 176446 13.41/16.12 10037 4865 140 716
42.47-1.70 65702 17.03/19.55 9985 4812 124 569
46.42-1.40 119485 15.08/18.54 10227 4850 120 812
31.76-1.46 100015 17.74/21.17 9826 4801 100 542
15.80 27.90 25.60
17.10 37.40 28.20
12.90 30.40 25.30
22.80 41.70 32.30
0.010 1.36
0.006 1.08
0.007 1.20
0.033 1.73
Refinement Resolution range (Å) 29.66-1.49 43.18-1.46 No. of reflections 99198 106175 Rwork/Rfree 13.88/17.35 16.53/19.30 No. of atoms 9013 10143 Protein 4803 4842 Ligand/ion 80 80 Water 540 799 2 Average B factors (Å ) Protein 24.20 15.50 Ligand/ion 47.90 36.00 Water 33.10 27.40 RMSD Bond lengths (Å) 0.011 0.008 Bond angles (°) 1.29 1.20 Each data set was collected from single crystals, respectively. *Values in parentheses are for highest-resolution shell.
Table 4. Data collection and refinement statistics of NSW-2012 P domain and HBGA complex structures
Data collection Space group Cell dimensions a, b, c (Å) α, β, γ (°) Resolution range (Å) Rmerge I/σI Completeness (%) Redundancy
A-tri (4WZT)
B-tri (4OP7)
Lex-tri (4X0C)
P41212
P41212
P41212
104.74 104.74 190.85 90 90 90 48.26-1.85 (1.90-1.85)* 8.758 (98.50)* 16.73 (1.60)* 98.80 (98.30)* 6.5 (6.5)*
104.61 104.61 190.54 90 90 90 48.19-1.90 (1.97-1.90)* 11.31 (115.40)* 10.70 (1.09)* 93.80 (93.70)* 4.3 (4.2)*
104.83 104.83 191 90 90 90 45.95-1.70 (1.75-1.70)* 6.856 (128.60)* 15.62 (1.20)* 99.20 (97.60)* 3.6 (3.6)*
Refinement Resolution range (Å) 48.26-1.91 48.19-1.92 No. of reflections 81872 76345 Rwork/Rfree 15.37/17.91 18.47/20.81 No. of atoms 10006 9760 Protein 4803 4762 Ligand/ion 88 76 Water 718 492 2 Average B factors (Å ) Protein 26.30 26.90 Ligand/ion 42.80 37.90 Water 34.50 31.50 RMSD Bond lengths (Å) 0.008 0.007 Bond angles (°) 1.13 1.11 Each data set was collected from single crystals, respectively. *Values in parentheses are for highest-resolution shell.
40.14-1.72 112461 15.96/17.95 10085 4802 142 747 23.10 43.60 34.70 0.013 1.32