Extended secondary structures in proteins.

Biochimica et Biophysica Acta 1844 (2014) 384–388

Contents lists available at ScienceDirect

Biochimica et Biophysica Acta journal homepage: www.elsevier.com/locate/bbapap

Extended secondary structures in proteins Léo Degrève a,⁎, Carlos A. Fuzo a, Antonio Caliri b a b

Departamento de Química, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, 14040-901 Ribeirão Preto, SP, Brazil Departamento de Física e Química, Faculdade de Ciências Farmacêuticas de Ribeirão Preto, Universidade de São Paulo, 14040-903 Ribeirão Preto, SP, Brazil

a r t i c l e

i n f o

Article history: Received 24 July 2013 Received in revised form 2 October 2013 Accepted 5 October 2013 Available online 26 October 2013 Keywords: Protein's structure Secondary structure Extended secondary structure

a b s t r a c t Supersecondary structures of proteins have been systematically searched and classified, but not enough attention has been devoted to such large edifices beyond the basic identification of secondary structures. The objective of the present study is to show that the association of secondary structures that share some of their backbone residues is a commonplace in globular proteins, and that such deeper fusion of secondary structures, namely extended secondary structures (ESSs), helps stabilize the original secondary structures and the resulting tertiary structures. For statistical purposes, a set of 163 proteins from the protein databank was randomly selected and a few specific cases are structurally analyzed and characterized in more detail. The results point that about 30% of the residues from each protein, on average, participate in ESS. Alternatively, for the specific cases considered, our results were based on the secondary structures produced after extensive Molecular Dynamics simulation of a protein–aqueous solvent system. Based on the very small width of the time distribution of the root mean squared deviations, between the ESS taken along the simulation and the ESS from the mean structure of the protein, for each ESS, we conclude that the ESSs significantly increase the conformational stability by forming very stable aggregates. The ubiquity and specificity of the ESS suggest that the role they play in the structure of proteins, including the domains formation, deserves to be thoroughly investigated. © 2013 Elsevier B.V. All rights reserved.

1. Introduction The properties of proteins depend on their chemical composition and three-dimensional structure; the primary structure represents the former, and the relative positions of the atoms define the latter, which includes secondary and tertiary structures. The secondary structure has repetitive, ordered patterns, such as α-helices and β-sheets, commonly connected by regions, which do not have repetitive patterns, but they still can be ordered and often play important roles in defining the global protein structure [1–5]. Combinations of secondary structures, such beta–alpha–beta or helix–turn–helix, occur frequently, and are called supersecondary structures. Recurrent structures, as part of many distinct proteins, have been systematically searched and classified, summing about 1400 motifs (according the number of folds available in the PDB based on SCOP). All of them have specific names, including, for instance: beta-hairpin (two anti-parallel beta strands connected by a beta-turn); Greek keys (four adjacent anti-parallel beta strands folded on themselves); alpha–alpha hairpin (two helices connected by a short loop); coils (two helices wound around each other in a super coil); and EF-hand (two helices connected to a relatively large loop). In general, the main factors that define and maintain protein structures are: the chemical (covalent) bonds; the Coulomb and

⁎ Corresponding author. Tel.: +55 16 36023688; fax: +55 16 36024838. E-mail addresses: [email protected] (L. Degrève), [email protected] (C.A. Fuzo), [email protected] (A. Caliri). 1570-9639/$ – see front matter © 2013 Elsevier B.V. All rights reserved. http://dx.doi.org/10.1016/j.bbapap.2013.10.005

multipolar interactions; the so-called van der Waals interactions; and the interactions of the protein components with those of the medium, specifically, with molecules of the solvent and ions. It is not yet possible to describe accurately and fully these complex molecular systems, like proteins, by considering only the interactions of pairs of particles, atoms or groups of particles. Except for identifying classical secondary structures that “live together,” not enough attention has been devoted to extended edifices existing in the protein structures. Such extended secondary structures (ESSs) are easily identified as stabilizing factors in proteins, as have already been demonstrated for some cases, such as for amyloid fibril proteins [6] and for the special case of the polyproline II helix (PPII, a kind of extended secondary structure present in a number of proteins) [7], as well in RNA molecules [8–11]. The details and generalities of the energetic interactions that stabilize the ESS have not received much attention. Therefore, the main purpose of the present study is to show that the association of recognized secondary structures, sharing backbone atoms, is a commonplace in globular proteins; and that such composed larger structures, in turn, help to stabilize the original secondary structures and the resulting tertiary structures. Our study considers only those bonds that specifically define a secondary structure — the hydrogen bonds (HB) between main chain atoms. Thus, other stabilizing forces, such as van der Waals interactions or hydrophobic effects, are disregarded. For statistical purposes, our study considers a set of randomly chosen proteins from the protein databank, but we select a few specific cases and analyze their structural characteristics in greater detail for discussion and illustration.

L. Degrève et al. / Biochimica et Biophysica Acta 1844 (2014) 384–388

2. Material and method For a general analysis, we consider a set of 163 proteins randomly chosen from the protein databank (PDB IDs: 153L, 1A0B, 1A6F, 1A7D, 1ABO, 1ABZ, 1AD6, 1AIR, 1AMX, 1AU1, 1AVP, 1B04, 1B41, 1B8W, 1BAS, 1BB1, 1BCX, 1BD8, 1BDE, 1BFF, 1BFG, 1BK7, 1BRF, 1BXZ, 1BYY, 1C25, 1C44, 1CEM, 1CHD, 1CLK, 1COI, 1COL, 1CVS, 1CWY, 1CZT, 1D0B, 1D4A, 1D9G, 1DDE, 1DFN, 1DSL, 1DVK, 1DVN, 1E1X, 1E3Z, 1E65, 1EKG, 1ENH, 1ES3, 1ES6, 1F82, 1FGA, 1FIL, 1FSD, 1FSV, 1FYH, 1G2C, 1G6U, 1GO1, 1GZI, 1H5O, 1HEH, 1HKA, 1HOE, 1HST, 1HYP, 1IC9, 1ICL, 1IGD, 1JET, 1JS0, 1K3T, 1KFP, 1KUV, 1L35, 1LBA, 1LOP, 1MAZ, 1MIR, 1MJC, 1MOL, 1MWP, 1N6U, 1NAR, 1NFN, 1NOA, 1OKE, 1OMN, 1OVB, 1P9U, 1PAH, 1PAU, 1PBW, 1PHR, 1POA, 1PRC, 1PSZ, 1QGI, 1QJV, 1QLG, 1QMY, 1QNX, 1QUV, 1R69, 1RIE, 1S1Q, 1SEK, 1TAH, 1TES, 1TFG, 1TG8, 1THM, 1THV, 1TML, 1TRI, 1TSK, 1UBI, 1UCH, 1UDG, 1V37, 1VCC, 1VFY, 1VHH, 1W4F, 1WLJ, 1WU0, 1WU3, 1X22, 1XNB, 1YER, 1YTN, 1ZON, 2BLT, 2CBA, 2DLL, 2E5U, 2END, 2F66, 2G9Q, 2GAR, 2GMF, 2HJ8, 2HYM, 2KJD, 2LKF, 2LZM, 2MLT, 2OQ0, 2OVO, 2POL, 2PTH, 2QIQ, 2RN2, 2TRT, 2WJ8, 2WK3, 2XSP, 2XTC, 2ZC4, 3BA2, 4FXC, 4TF4, XTL2). Their size ranges from 17 to up to 7499 amino acids. More than 80%, however, have fewer than 300 residues. Most of them (114) are monomeric proteins, but the set also includes dimers (27), trimers (4), tetramers (12), and a few multi-domains, with up to 24 domains. We identify each secondary structure according to the Kabsch and Sander method [12], which recognizes a secondary structure as a repeat of the elementary hydrogen-bonding patterns that form “helices” through turns and “ladders” through parallel or anti-parallel bridges (“sheets” are connected ladders). In turn, we identify the ESSs by grouping the residues of the recognized secondary structures that share backbone atoms including as well the residues bound by one of them by hydrogen bonds between main chain atoms, identified by the Kabsch and Sander criteria [12]. In some cases these last residues can connect two secondary structures increasing the extension of the ESS. The construction of the ESS is illustrated in the Results section. For discussion and illustration, we consider, in detail, the structural characteristics of a monomer of the envelope protein (E protein) of the dengue virus serotype 2. The E protein has 394 residues organized in three domains: DI (residues 1–52; 133–193 and 281–296); DII (53–132 and 194–280); and DIII (296–394). Its structure in the dimeric conformation, determined by X-ray diffraction [13,14] is found as dengue 2 virus envelope protein PDB ID: 1OKE [15]. We obtained the complete set of information on the secondary structures from the all-atom trajectories produced by the Molecular Dynamics (MD) of a monomer–aqueous solvent system at physiologic ionic strength (150 mM), using the Gromacs 4.5.3 package [16–20]. The temperature and pressure of the systems were maintained, respectively, at 300 K and 1 atm, applying the Berendsen et al. thermostat and barostat [20,21]. The model SPC/e described the water molecule [22]. The GROMOS96 43a1 force field described the atom interactions [23]. The SETTLE algorithm [24] preserved the structure of the water molecules, and the LINCS algorithm helped maintain the bonds with the hydrogen atoms [19]. Electrostatic interactions were cut off at 1.2 nm, and the smooth particle mesh Ewald method [25] was applied to complete the energy calculations. We adjusted the ionic strength of the medium by adding Na+ and Cl−. The simulation time was 400 ns but only the interval from 250 to 400 ns was used in the study. We fixed the charges of the arginine and lysine residues at +e, and the residues of aspartic and glutamic acid at −e (e is the elementary charge) in order to emulate a pH of about 7 [29]. 3. Results Secondary structures may be defined by the regular pattern of backbone dihedral angles, or formally identified by the patterns of hydrogen bonds between backbone amino and carboxyl groups, as it is done in the Kabsch and Sander method [12], and adopted here. Therefore, the ESSs are settled down as follows: if two conventional secondary structures

385

share one or more common residue, they are identified as constituents of an ESS; this procedure can eventually add more than two secondary structures in a single ESS. Note that the method of Kabsch and Sander can identify pairs of residues, say R1 and R2, connected by a HB even if one of them, for example R1, does not belong to any conventional secondary structure. However, if R2 belongs to a specific ESS, then R1 is included as part of this ESS. On the other hand, since atoms of the main chain of helices are not exposed on their exterior side, helices themselves cannot pertain to an ESS (for instance, helix bundle fold), in which the stabilizing interactions between the helices do not occur through HB between main chain atoms. In our analysis, the residues totaled 32,120 and from these, 7007 residues formed 708 parallel or anti-parallel sheets; and 114 of the 163 proteins presented the ESS (about 70% of the set of the randomly selected proteins). We found 290 ESSs, involving 5639 residues. Fig. 1 gives the distribution of the number of residues NESS, namely d(NESS), in the ESS, for NESS N 4; the inset shows how the sizes of the selected proteins are distributed. Fig. 1B shows the distribution f(ϕ) of the relative number ϕ = NESS / NTOT of residues in the ESS: NESS and NTOT are, respectively, the number of residues belonging to the ESS of a protein and to the total number of residues of that same protein. The average of the number NESS of residues in the ESS is about bNESSN = 25 and, more representatively, the average of the relative number bϕN = 0.3 indicates that about 30% of the residues of each protein, on average, participate of ESS. In general, this data suggests that due to their extension, the ESSs are easily detected and can determine the structure and stability of the protein. To verify this conjecture and to validate the identification method of the ESS, by direct inspection, we analyzed a specific protein in detail, namely, the monomer of the envelope protein of the dengue virus, PDB ID: 1OKE [13–15]. Firstly, we identified the ESS, and then we investigated its temporal behavior in order to establish a correlation between the ESS and the stability of specific regions of the protein. We calculated the root mean square deviation (RMSD) for the Cα atoms between the experimental structure and the ensemble mean structure, obtained after a long MD run at physiological condition: pH about 6 and ionic strength of 150 mM (see “Material and Method” section); the resulting root mean square deviation is about 0.9 nm, which indicates that the two structures are similar overall with localized discrepancies [26]. The secondary structures were identified by the set of HB between the atoms of the backbone, as previously described; the 187 HB found in the EMS (the ensemble mean structure, EMS, is the mean structure of the protein calculated during the last 50 ns of the simulation) are practically the same ones found in the experimental structure. Table 1 shows the resulting β sheet structures in the EMS compared with the 1OKE data. Small differences are indicated in yellow for sheet labeled ns = 1; in red for the anti-parallel sheet ns = 3, 6, 9 and 21, which exist only in 1OKE; and, mainly, in the anti-parallel sheet (99–101, 106–108), ns = 11, found in the simulated structure but not found in 1OKE. A new structural pattern – the extended secondary structures, or ESSs – is always identified when certain residues participate in more than one secondary structure or are hydrogen bonded between main chain residues to a residue of a secondary structure. The secondary structures of the monomer of the E protein of the dengue virus are listed in Table 1. The β-sheet 2, ns = 2, shares the residues 285–288 with ns = 14 but ns = 14 shares in its turn the residues 182 and 183 with ns = 13 so that the group of residues 20–25, 171–175, 179–185 and 283–288 is bound by hydrogen bonds between main chain atoms. In the same way, the β-sheets 4, 5 and 12; 15 and 16; 18 and 19; and 20 and 22 form groups of residues bound by hydrogen bonds between main chain atoms. Moreover small numbers of hydrogen bonds do not define commonly a secondary structure. This is what is observed with some residues of just identified groups that are also hydrogen bound with other residues without defining secondary structures. This is the case of residues 291 and 292 that are hydrogen bound with residues

386


Fig. 1. (A): The distribution (NESS) of the number of residues NESS in the ESS suggests that in the present sample of proteins, in most of them, the ESSs involve a relatively small number of residues: The distribution's peak is in the interval 5 b NESS b 15, and the average about bNESSN = 25. The inset shows how the proteins' sizes are distributed; note that there are a few proteins with more than 800 residues and, therefore, larger ESS can occur, with the ESS containing more than 100 residues, as indicated in the figure. Also, the normalized amount ϕ = NESS / NTOT was calculated for each protein, where NTOT is the total number of residues in the pertinent protein; the distribution f(ϕ) is shown in part (B): It suggests that more than 40% of the proteins have about 20% of their residues contributing to the formation of the ESS. The average of the relative number is about bϕN = 0.3 indicating that about 30% of the residues of each protein, on average, participate in the ESS.

179 and 180 pertaining to ns = 13, so that the composition of the complete ESS is 20–25, 171–175,179–185, 283–288, 291 and 292; namely, the ESS #2 in Table 2. The other ESSs are constructed in the same way. The five identified ESSs are depicted in Fig. 2. Their constitution, size and residue identification, and their respective mean RMSD values are listed in Table 2, which shows that the ESSs are distributed among the three protein domains; the mean RMSD, represented here by bRMSDN, is calculated from a set of timely equidistant configurations taken along the simulation. The set of all the RMSD values was calculated taking into account, along the simulations, the deviations of the positions of these Cα atoms of the corresponding ESS relatively to their positions in the ensemble mean structure. Some ESSs are very large, involving up to 63 residues (ESS #3). These ESSs (identified from the EMS) allow the analysis of the internal movement of the protein through the evolution of their RMSD over time. The distributions of the RMSD are shown in Fig. 3. The distribution DESS(r) suggests that the ESS may significantly increase the protein stability, since most calculated average RMSDs, namely bRMSDN, are smaller than 0.1 nm while the average bRMSDN

Table 1 β-Sheet secondary structures, labeled ns, from the crystallographic structures of the E protein of the dengue virus (PDB ID: 1OKE) and from the corresponding simulated mean conformation (EMS). The sheets ns = 1 of 1OKE and EMS are parallel sheets while all others are anti-parallel sheets. The small differences between the set of β-sheets of EMS and 1OKE structures are the slightly modified parallel sheet, ns = 1, shown in yellow, and the four β-sheets, namely ns = 3, 6, 9 and 21, existing only in the 1OKE structure, shown in red; and the β-sheet ns = 11 existing only in the EMS is shown in green. Figures in black represent common strands in both cases. ns 1 2 3 4 5 6 7 8 9 10 11

1OKE 9–13 20–25 31–33 41–45 47–50 54–72 57–59 62–66 69–72 90–99 –

30–34 283–288 41–43 139–143 135–138 113–129 220–222 118–123 113–116 109–118 –

EMS 8–13 20–25 – 40–45 47–50 – 57–59 62–65 – 90–97 99–101

29–34 283–288 – 139–144 135–138 – 220–222 119–123 – 111–118 106–108

ns 12 13 14 15 16 17 18 19 20 21 22

1OKE 138–141 171–175 182–186 196–200 205–207 238–241 306–308 320–324 337–339 350–351 374–380

160–163 179–183 284–288 205–209 268–271 249–252 324–326 365–369 378–380 369–370 387–393

EMS 138–141 171–175 182–185 196–200 205–207 238–241 305–308 320–324 337–339 – 375–380

160–163 179–183 285–288 205–209 268–271 249–252 324–327 365–369 378–380 – 387–392

of the protein is 0.25 nm. The small values of the bRMSDN and the good definition of the profiles of the distributions of quadratic deviations demonstrate that the ESS can be thoroughly identified. Independent studies fully corroborate these results, specifically the ones obtained with two other proteins, PDB IDs: 1QLG and 2XSP (as shown in Appendix A): ESSs stabilize large regions of the protein or generate large patches of cooperative movements. Another characteristic of the ESS is that they are responsible for the domain definition by means of their aggregation features. As a last point, we comment that when the bRMSDNs refer to structures with significantly different numbers of residues (although the RMSD calculation is normalized with respect to the protein size), the comparison among them may be artful because larger chains may present, comparatively, an increased RMSD, due mainly to the possibility of molecular motions to great extent. However, in order the establish how much the distribution DESS(r) may change under different stability conditions, which is the point here, calculations were carried out again for the dengue 2 virus envelope protein (protein E; PDB ID: 1OKE [15]) under two distinct stability conditions: in one case, all the histidine residues of the protein were completely protonated and, in the second case all the histidine residues were protonated only in one of the δ or ε positions. The ESSs in both cases are formed by almost the same residues, but the distributions DESS(r) are very different, as

Table 2 Lists of the residues in the extended secondary structures (ESSs) found in the EMS of the E protein monomer. N is the number of residues pertaining to the ESS, and D indicates the respective domain. The last column (bRMSDN): average RMSD (nm, with reference to the EMS) and the corresponding standard deviations for each ESS. ESS N 1 2 3 4 5

D

37 I 26 I 63 II

8–13, 29–34, 40–45, 135–144, 160–163 20–25, 171–175, 179–185, 283–288, 291, 292 57–65, 90–108, 111–128, 196–200, 205–212, 220–221, 268–271 26 I&III 147, 148, 305–310, 313, 314, 320–327, 350, 351, 366–370 16 III 337–340, 375–380, 387–392

bRMSDN 0.06 ± 0.03 0.04 ± 0.02 0.08 ± 0.04 0.05 ± 0.03 0.04 ± 0.02


387

Fig. 2. The extended secondary structures of the E protein (monomer) are colored in yellow (ESS(1)), green (2), blue (3), dark yellow (4), and magenta (5).

shown in Fig. 4. Therefore, one can see that when the protein is more unstable (here, when all their histidine are completely protonated), large structural fluctuations around the native conformation is observed, as shown by a broad distribution of bRMSDN. On the contrary, the significant increase of the protein stability (under partial protonation of the histidine) is clearly indicated by the narrow distribution of bRMSDN.

4. Conclusion Extended secondary structures in proteins are formed by the aggregation of secondary structures, directly interacting through atoms of the main chain. The ensemble of residues constituted in such a way are held together by means of extensive set of hydrogen bonds, which can configure aggregates much larger than individual secondary structures with equivalent stability. The contact region between neighboring secondary structures has been the object of different studies in proteins and RNA molecules [7–11], and in particular as a important region for protein–protein recognition [27]. Some advantages of considering ESS are that they don't require further special criteria to be defined; they can be easily identified provided that the secondary protein structure is available; they form cohesive regions easily identified in the protein structure; and they indicate regions where structural fluctuations are more or are less likely. As defined here, an ESS never can be formed by a set of helices as, for example, alpha–alpha hairpin. On the other

Fig. 4. The same as in Fig. 3 where: (i) the histidine residues are protonated only in one of the δ or ε positions (in black); and (ii) the histidine residues are protonated in both δ and ε positions (in red).

hand, the inclusion in the ESS of residues that do not belong to any secondary structure extends the concept of stability of protein structures, showing that two or more regions, hitherto classified as distinct, can behave as a whole, belonging to the same motif. This is the case of residues 291 and 292 of ESS #2. Extended secondary structures play key roles in the final settlement and preservation of the protein conformation and, by extension, of the structural properties of the protein domains when they are present, or of the intrinsically disordered proteins when they are rare or absent [5]. Indeed, beyond the secondary structures, such as β strands and α helixes, the ESSs are found in many proteins, significantly increasing their conformational stability by forming very stable aggregates. The analyses of a random set of 163 globular proteins (of diverse sizes and totaling 32,120 residues) show that more than 20% of the residues, on average, are involved in forming the ESS. The normalization of the number NESS of residue belonging to the ESS in each protein, with respect to its total number NTOT of residues, namely, ϕ = NESS / NTOT, shows that the peak of the distribution f(ϕ) corresponds to the interval 5 b NESS b 15, and that the average bϕN is about 30%. This indicates that about 30% of the residues from each protein, on average, participate in one ESS. For the case considered here, the E protein of the dengue virus and the distribution of the RMSD values obtained from MD simulation show that almost all calculated RMSD for each ESS in the E protein – with reference to the ensemble mean structure – falls within the range 0 b RMSD b 0.1 nm. Furthermore, the ESSs are spread over all three domains of E protein. In particular, the ESS #4 in the interface of two domains (Fig. 2), DI and DIII, is a salient protein inter-atomic interaction factor keeping them steadily together. Finally, we argue that the ESS can be seen as one of the final stages of the hierarchical folding process, which begins locally by forming malleable building blocks and progresses through intra-building block interactions, until reaching the native structure, as described in Reference [28]. Therefore, our preliminary observation is that the ubiquity and specificity of the ESS suggest that the role they play in the structure and stability of proteins deserves to be thoroughly investigated.

Acknowledgements

Fig. 3. RMSD distribution DESS(r) of the Cα of the E protein (monomer) for each ESS (r = RMSD). The calculations took into account, along the simulations, the deviations of the Cα relatively to their respective positions in the ensemble mean structure. The curves give a visual impression of how RMSDs are, in general, slightly spread.

The authors are thankful to the Fundação de Amparo à Pesquisa do Estado de São Paulo (grant 2008/11706-3) and to the Conselho Nacional de Desenvolvimento Científico e Tecnológico (grant 300693-2009-5) for the financial support.

388


Appendix A Residues involved in the extended secondary structures (ESSs) for two more proteins, namely PDB IDs: 1QLG and 2XSP. N is the number of residues pertaining to the ESS. Last column (bRMSDN): RMSD averages between the crystallographic structures and the structures ESS obtained from 100 ns MD simulations and the corresponding standard deviations for each corresponding RMSD. ESS N

1QLG/residues

bRMSDN (nm)

1 2 3 4

26 23 16 54

0.12 0.15 0.05 0.09

5

16

63–67, 290, 291, 303–306, 311–316, 322–330 70, 71, 344–347, 350, 351, 356–364, 376–381 85–88, 98–103, 107–112 129–132, 135, 136, 141–149, 155–163, 169–173, 180, 181, 189, 190, 203–205, 211–218, 224–227 249–253, 2598–263, 273–277

ESS N 1

2XSP/residues

113 14–20, 26–33, 85–87, 91–98, 105–113, 122, 123, 128–135, 144–150, 215–221, 227–232, 260, 261, 264, 265, 287–295, 306–312, 333–343, 369–376, 424–428, 431–434

± ± ± ±

0.06 0.09 0.03 0.05

0.04 ± 0.02 bRMSDN (nm) 0.11 ± 0.07

References [1] L. Stryer, Biochemistry, fourth ed. W.H. Freeman, 1995. [2] T.E. Creighton, Proteins: Structures and Molecular Properties, second ed. W.H. Freeman, 1993. [3] A.G. Murzin, S.E. Brenner, T. Hubbard, C. Chothia, SCOP: a structural classification of proteins database for the investigation of sequences and structures, J. Mol. Biol. 247 (1995) 536–540. [4] D. Voet, J.G. Voet, Biochemistry, second ed. J. Wiley, 1995. [5] P. Tompa, Unstructural biology coming of age, Curr. Opin. Struct. Biol. 21 (2011) 419–425. [6] P. McPhie, CD studies on films of amyloid proteins and polypeptides: quantitative g-factor analysis indicates a common folding motif, Biopolymers 75 (2004) 140–147. [7] J. Kapitan, D. Gallo, N. Goasdoue, M. Nicaise, M. Desmadril, L.G. Leclercq, L.D. Barron, Y. Jacquot, Identification of a human estrogen receptor alpha-derived antiestrogenic peptide that adopts a polyproline II conformation, J. Pept. Sci. 15 (2009) 455–464. [8] D. Zimmern, An extended secondary structures model for the TMV assembly origin, and its correlation with protection studies and an assembly defective mutant, EMBO J. 2 (1983) 1901–1907.

[9] J. Mc Dougall, R.N. Nazar, Extended secondary structures in 5S ribosomal-RNAs from a sulfur metabolizing archaebacterium, Thermococcus celer, Biochem. Biophys. Res. Commun. 134 (1986) 1167–1174. [10] S.A. Elela, R.N. Nazar, Extended secondary structures as a basis of increased RNA stability in a thermophilic alga Cyanidium caldarium, Biochim. Biophys. Acta 1130 (1992) 339–342. [11] S. Chowdhury, C. Ragaz, E. Kreuger, F. Narberhaus, Temperature-controlled structural alterations of an RNA thermometer, J. Biol. Chem. 48 (2003) 47915–47921. [12] W. Kabsch, C. Sander, Dictionary of protein secondary structure — pattern-recognition of hydrogen-bonded and geometrical features, Biopolymers 22 (1983) 2577–2637. [13] Y. Modis, S. Ogata, D. Clements, S.C. Harrison, A ligand-binding pocket in the dengue virus envelope glycoprotein, Proc. Natl. Acad. Sci. U. S. A. 100 (2003) 6986–6991. [14] Y. Modis, S. Ogata, D. Clements, S.C. Harrison, Surface epitopes in the crystal structure of dengue virus type 3 envelope glycoprotein, J. Virol. 79 (2005) 1223–1231. [15] Research Collaboratory for Structural Bioinformatics, an information portal to biological macromolecular structures, http://www.pdb.org/pdb/home/home.do. [16] D. van der Spoel, E. Lindahl, B. Hess, A.R. van Buuren, E. Apol, P.J. Meulenhoff, D.P. Tieleman, A.L.T.M. Sijbers, K.A. Feenstra, R. van Drunen, H.J.C. Berendsen, Gromacs user manual version 4.5, http://www.gromacs.org 2010. [17] H.J.C. Berendsen, D. van der Spoel, R. van Drunen, GROMACS — a message-passing parallel molecular dynamics implementation, Comput. Phys. Commun. 91 (1995) 43. [18] E. Lindahl, B. Hess, D. van der Spoel, GROMACS 3.0: a package for molecular simulation and trajectory analysis, J. Mol. Model. 7 (2001) 306. [19] D. van der Spoel, E. Lindahl, B. Hess, G. Groenhof, A.E. Mark, H.J.C. Berendsen, GROMACS: fast, flexible, and free, J. Comput. Chem. 26 (2005) 1701. [20] B. Hess, H. Bekker, H.J.C. Berendsen, J.G.E.M. Fraaije, LINCS: a linear constraint solver for molecular simulations, J. Comput. Chem. 18 (1997) 1463–1472. [21] H.J.C. Berendsen, J.P.M. Postma, W.F. van Gunsteren, A. DiNola, J.R. Haak, Molecular dynamics with coupling to an external bath, J. Chem. Phys. 81 (1984) 3684–3690. [22] H.J.C. Berendsen, J.R. Grigera, T.P. Straatsma, The missing term in effective pair potentials, J. Phys. Chem. 91 (1987) 6269–6271. [23] W.F. van Gunsteren, S.R. Billeter, A.A. Eising, P.H. Hünenberger, P. Krüger, A.E. Mark, W.R.P. Scott, I.G. Tironi, Biomolecular Simulation: The GROMOS96 Manual and User Guide, Hochschulverlag, 1996. [24] S. Miyamoto, P.A. Kollman, SETTLE — an analytical version of the shake and rattle algorithm for rigid water models, J. Comput. Chem. 13 (1992) 952–962. [25] U. Essman, L. Perera, M.L. Berkowitz, T. Darden, H. Lee, L.G. Pedersen, A smooth particle mesh Ewald method, J. Chem. Phys. 103 (1995) 8577–8592. [26] L. Degrève, C.A. Fuzo, A. Caliri, Extensive structural change of the envelope protein of dengue virus induced by a tuned ionic strength: conformational and energetic analyses, J. Comput. Aided Mol. Des. 26 (2012) 1311–1325. [27] R. Preiβner, A. Goede, C. Frömmel, Dictionary of Interfaces in Proteins (DIP). Data bank of complementary molecular surface patches, J. Mol. Biol. 280 (1998) 535–550. [28] C.-J. Tsai, B. Ma, S. Kumar, H. Wolfson, R. Nussinov, Protein folding: binding of conformationally fluctuating building block via population selection, Crit. Rev. Biochem. Mol. Biol. 36 (5) (2001) 399–433. [29] C.A. Fuzo, L. Degrève, The pH dependence of flavivirus envelope protein structure: insights from molecular dynamics simulations J, Biomol. Struct. Dyn. (2013), http: //dx.doi.org/10.1080/07391102.2013.827132.

Identification of local variations within secondary structures of proteins.

Secondary structures in polyoma DNA.

SCOPe: Structural Classification of Proteins--extended, integrating SCOP and ASTRAL data and classification of new structures.

Role of Backbone Dipole Interactions in the Formation of Secondary and Supersecondary Structures of Proteins.

Prediction for secondary structures of ten proteins from the 50S subunit of the Escherichia coli ribosome.

Predicted secondary structures of amino-terminal extension sequences of secreted proteins.

Modeling RNA secondary structures. I. Mathematical structural model for predicting RNA secondary structures.

Role of dicarboxylate linkers in Mn(III)-salicylaldoximate based extended structures: synthesis, structures, and magnetic behavior.

Comparative visualization of protein secondary structures.

Secondary structures formed by random RNA sequences.

An algebraic representation of RNA secondary structures.

Evaluation of secondary structure predictions in proteins.

Secondary-structure dependent chemical shifts in proteins.

Predictions for secondary structures of six proteins from the 50 S subunit of the Escherichia coli ribosome.

Small-angle X-ray scattering: a bridge between RNA secondary structures and three-dimensional topological structures.

Combinatorics of RNA Secondary Structures with Base Triples.

Unified approach to partition functions of RNA secondary structures.

NMR structures of membrane proteins in phospholipid bilayers.

Mechanism of ribosome translation through mRNA secondary structures.

Secondary structures of rRNAs from all three domains of life.

Computer modeling and display of RNA secondary and tertiary structures.

Coding of odor stimulus features among secondary olfactory structures.

Base-pair probability profiles of RNA secondary structures.

Secondary structural complementarity between DNA and proteins.