Accepted Manuscript Title: Label free targeted detection and quantification of celiac disease immunogenic epitopes by mass spectrometry Author: Hetty C. van den Broeck Jan H.G. Cordewener Merel A. Nessen Antoine H.P. America Ingrid M. van der Meer PII: DOI: Reference:

S0021-9673(15)00343-X http://dx.doi.org/doi:10.1016/j.chroma.2015.02.070 CHROMA 356328

To appear in:

Journal of Chromatography A

Received date: Revised date: Accepted date:

27-11-2014 23-2-2015 26-2-2015

Please cite this article as: H.C. van den Broeck, J.H.G. Cordewener, M.A. Nessen, A.H.P. America, I.M. van der Meer, Label free targeted detection and quantification of celiac disease immunogenic epitopes by mass spectrometry, Journal of Chromatography A (2015), http://dx.doi.org/10.1016/j.chroma.2015.02.070 This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

1

Label free targeted detection and quantification of

2

celiac

3

spectrometry

disease

immunogenic

epitopes

by

mass

ip t

4 Hetty C. van den Broecka*, Jan H. G. Cordewenera, Merel A. Nessenb, Antoine H.

6

P. Americaa, Ingrid M. van der Meera

cr

5

8

a

9

b

us

7

Wageningen UR, Plant Research International, PO Box 16, 6700 AA Wageningen, The Netherlands RIKILT Wageningen UR, PO Box 230, 6700 AE Wageningen, The Netherlands

11

*

an

10

Corresponding author: [email protected]; Tel: +31 317 480974; Fax: +31 317 41809

13

M

12 Abstract

ed

14

Celiac disease (CD) is a food-related disease caused by certain gluten peptides

16

containing T-cell stimulating epitopes from wheat, rye, and barley. CD-patients

17

have to maintain a gluten-free diet and are therefore dependent on reliable

18

testing and labeling of gluten-free products. So far, the R5-ELISA is the approved

19

method to detect if food products can be labeled gluten-free. Because the R5-

20

ELISA detects gluten in general, there is a demand for an improved detection

21

method that quantifies specifically CD-epitopes. Therefore, we developed a new

22

method for detection and quantification of CD-epitopes, based on liquid

23

chromatography (LC) coupled to mass spectrometry (MS) in multiple reaction

24

monitoring (MRM) mode. This method enables targeted label free comparative

25

analysis of the gluten proteins present in different wheat varieties and species,

26

and in wheat-based food products. We have tested our method by analyzing

Ac ce

pt

15

1 Page 1 of 49

several wheat varieties that vary in CD-epitope content, as was shown before

28

using immunoblotting and specific monoclonal antibodies. The results showed

29

that a modern bread wheat variety Toronto contained the highest amounts of CD

30

immunogenic peptides compared with the older bread wheat variety Minaret and

31

the tetraploid wheat variety Dibillik Sinde. Our developed method can detect

32

quantitatively and simultaneously multiple specific CD-epitopes in a high

33

throughput manner.

cr

ip t

27

Keywords:

36

-Gliadin, celiac disease, LC-MRM/MS,

T-cell stimulatory epitopes, wheat

Ac ce

pt

ed

M

an

35

us

34

2 Page 2 of 49

37

38

1. Introduction Celiac disease (CD) is a food related disease that results in inflammation of

41

the small intestinal mucosa in genetically predisposed individuals caused by

42

intake of gluten proteins from wheat, rye, and barley [1]. The prevalence of

43

celiac disease is about 0.5 – 2% and is still increasing in Western parts of the

44

world as well as in developing countries [2,3]. CD-patients can present

45

symptoms such as diarrhea, abdominal pain, constipation, weight loss, and

46

dermatitis

47

asymptomatic presentation without gastrointestinal symptoms resulting in a high

48

number of undiagnosed cases [4][2,3]5-7]. For CD-patients still the only cure is

49

to adhere to a strict gluten-free diet. However, gluten proteins are increasingly

50

being applied in all kind of food products because of their interesting features

51

[8]. Therefore, it is very important that food labeling is accurate and reliable. The

52

reduced intake of gluten peptides and proteins containing CD-epitopes by all

53

consumers, including diagnosed and still undiagnosed CD-patients, will reduce

54

symptoms and may reduce the prevalence of CD.

however,

increasingly

more

patients

show

Ac ce

pt

ed

M

an

herpetiformis,

us

cr

ip t

39 40

55

Gluten proteins, i.e. gliadins and glutenins, represent the main part of

56

storage proteins in wheat, are insoluble in water and contain high percentages of

57

the amino acids proline (P) and glutamine (Q). Gliadins form a large protein

58

family in which /-, -, and -gliadins can be distinguished (~30 to ~80 kDa by

59

acidic-PAGE) [9,10]. Glutenins can be subdivided into low-molecular weight

60

glutenin subunits (LMW-GS; ~30 to ~70 kDa by SDS-PAGE) and high-molecular

61

weight

62

[11,12].[13][14]

glutenin

subunits

(HMW-GS;

~80

to

~130

kDa

by

SDS-PAGE)

3 Page 3 of 49

Bread wheat is hexaploid and contains three different genomes (A, B and D)

64

evolved from three different grass species. Therefore, many gluten protein

65

encoding genes are present and copy numbers for -gliadins can range between

66

100 and 150 [15,16]. [17,18]CD symptoms can be caused by many different CD-

67

epitopes present in wheat cultivars and wheat-derived food products. So far, 24

68

epitopes have been defined from wheat gluten proteins that give a T-cell

69

response in CD-patients [19]. These glutamine and proline rich epitopes are

70

derived from all different classes of gliadin and glutenin proteins. The

71

immunogenic epitopes present in -gliadins are Glia-2, Glia-9, and Glia-20 of

72

which the most immunogenic epitopes are Glia-2 and Glia-9 [20,21]. It has

73

been shown by Shan et al. [22] that a large 33-mer from wheat gluten is

74

resistant against human intestinal proteases and therefore can be present in the

75

small intestine. The 33-mer is composed of overlapping immunodominant Glia-2

76

and Glia-9 epitopes that can also be recognized separately and stimulate T-cells

77

in CD [20,21]. This 33-mer can be present in -gliadins that are encoded by the

78

Gli-2 locus (Gli-D2) on the short arms of chromosome 6 of the D-genome in

79

wheat (see Fig. 1) [23-25]. The same holds for the Glia-2 epitope that is

80

present in most -gliadins encoded by genes present at only the D-genome in

81

hexaploid wheat. The -gliadins encoded by the D-genome can also contain

82

derivatives of the 33-mer (a 19-mer and a 26-mer) that contain the Glia-2 and

83

Glia-9 epitopes. The -gliadins encoded by the A-genome in hexaploid and

84

tetraploid wheat contain only a 13-mer containing the Glia-9 epitope. Most -

85

gliadins encoded by both the A- and the D-genome contain the Glia-20 epitope.

86

The -gliadins encoded by the B-genome contain neither the Glia-2/9 epitopes

87

nor the Glia-20 epitope [24,26].

Ac ce

pt

ed

M

an

us

cr

ip t

63

4 Page 4 of 49

Food products can be labeled gluten-free if they contain less than 20 ppm

89

gluten. So far, the only approved test by the Codex Alimentarius to detect the

90

presence of gluten in food products and that can be used to label products

91

‘gluten-free’ is the R5-enzyme-linked immunosorbent assay (R5-ELISA) [27-29].

92

In general, the functionality of ELISA tests depends on the extraction protocol

93

[30-32], reference material for calibration [33], and the detecting antibody. [30-

94

32][33]The latter is one of the limitations of the R5-ELISA that makes use of a

95

monoclonal antibody that detects gluten in general and not specifically CD

96

stimulatory epitopes. The R5-antibody is a-specific and recognizes several, five

97

amino acids long, sequences present in gluten proteins from wheat, rye, and

98

barley, but not from soy, oats, corn, rice, millet, teff, buckwheat, quinoa, and

99

amaranth. This difference in specificity is essential for gluten-free testing.

100

Despite this ability, the short length of the recognition sequence might lead to

101

inaccurate results. Antibodies can recognize sequences as short as five amino

102

acids. This in contrast to the size of the recognition sequence of a CD-epitope

103

recognized by human T-cells in CD, which is at least nine amino acids in length

104

[34]. Therefore, R5-ELISA might result in an overestimation of the amount of

105

true T-cell epitopes present that stimulate CD.

106

[19][20,21][22][20,21][23-25][24,26] Simultaneous detection and quantification

107

of more than one CD-epitope is not possible by immunoblotting using a single

108

blot. By ELISA this is only possible when a multiplexing method with multiple

109

specific

110

chromatography-multiple reaction monitoring mass spectrometry (LC-MRM/MS)

111

method that can detect and quantify multiple CD-peptides in a single short run

112

will have a huge advantage over the existing methods.

Ac ce

pt

ed

M

an

us

cr

ip t

88

antibodies

is

used.

Therefore,

the

development

of

a

liquid

5 Page 5 of 49

LC-MRM/MS is a rapidly emerging method as an alternative to antibody based

114

protein quantification [35-37]. In this approach, the proteins to be quantified are

115

first digested with a specific protease, after which proteotypic peptides are

116

analyzed by MS in MRM mode. Using either a triple quadrupole or a quadrupole-

117

ion trap instrument, peptides are identified and quantified by monitoring several

118

transitions for each peptide. MRM allows sensitive, accurate and reproducible

119

quantification of the peptides and corresponding proteins.So far, several LC-MS

120

detection methods, aiming at the detection of different immunogenic peptides,

121

have been developed and described [38-43]. The developed methods differ in

122

the choice of the selected peptides for quantification and the protease treatments

123

to release the peptides from the gluten proteins.

an

us

cr

ip t

113

Here, we describe the development of an LC-MRM/MS method to quantify

125

individual immunogenic peptides in a gluten protein extract from wheat kernels,

126

using chymotrypsin digestion to release the peptides. We focused on detection of

127

those peptides that have been proven to be the most immunodominant in CD

128

[44-48]. This is in contrast to previously developed LC-MS methods, that focused

129

on the most intensely MS responding peptides. Besides immunodominant

130

peptides, we also incorporated some peptides with amino acid substitutions that

131

make the immunogenic epitopes inactive and are thereby considered safe for

132

CD-patients [24,26]. LC-MS allows detection of these amino acid substitutions,

133

which is not possible by antibody detection as used in ELISA. In addition, we

134

selected peptides that can be used for quantification of the total gliadin protein

135

amount. Furthermore, we show that with this method, wheat varieties,

136

containing different amounts of the epitopes, can be analyzed. This opens the

137

possibilities for a range of future applications such as support for the

138

development of reduced immunogenic wheat lines, the analysis of foods for the

Ac ce

pt

ed

M

124

6 Page 6 of 49

presence of immunogenic CD-epitopes, or the screening of body fluids of

140

allergenic patients for analysis of the presence of selected marker peptides.

Ac ce

pt

ed

M

an

us

cr

ip t

139

7 Page 7 of 49

141

142

2. Materials and methods

143

2.1. Wheat samples

145

ip t

144 Two hexaploid wheat varieties, Minaret (obtained from the Centre for Genetic Resources, CGN, the Netherlands) and Toronto (obtained from Limagrain,

147

Lelystad, The Netherlands), and one tetraploid wheat variety (Dibillik sinde,

148

obtained from CGN) were used for gluten protein extraction followed by LC-MS

149

analysis.

an

us

cr

146

150 2.2. Extraction of gluten proteins

M

151 152

Gluten proteins were extracted from wheat grains according to Van den

154

Broeck et al. [49]. Grains were ground and gluten proteins were extracted from

155

50 mg wheat flour by addition of 0.5 ml of 50% (v/v) aqueous iso-propanol with

156

continuous mixing (MS1 Minishaker, IKA Works, Inc.) at 1000 rpm for 30 min at

157

room temperature, followed by centrifugation at 10,000 rpm for 10 min at room

158

temperature. The residue was re-extracted twice with 50% (v/v) aqueous iso-

159

propanol, 50mM Tris-HCl, pH 7.5 containing 1% (w/v) DTT, for 30 min at 60°C

160

with mixing every 5 to 10 min followed by centrifugation at 10,000 rpm for 10

161

min at room temperature. After addition of each next extraction solution, the

162

residue was resuspended followed by sonication for 10 min in an ultrasonic bath

163

(Branson

164

supernatants were combined and considered the gluten protein extract. The

165

protein content was quantified using the Biorad Protein Assay (Bio-Rad

Ac ce

pt

ed

153

3510,

Branson

Ultrasonics

Corporation).

The

three

obtained

8 Page 8 of 49

166

Laboratories), based on the Bradford dye-binding procedure, according to

167

manufacturer's instruction with BSA as a standard.

168 169

2.3. Digestion conditions

ip t

170 Gluten proteins from the gluten extracts were precipitated with four volumes

172

of ice cold 100% acetone and kept overnight at -20C. After centrifugation for 10

173

min at 14,000 rpm and 4C, the resulting pellet was resuspended in 8 M urea to

174

keep the proteins denaturated. Proteins were reduced in 5 mM DTT for 45 min at

175

60C, followed by alkylation in 15 mM iodoacetamide for 60 min at room

176

temperature in the dark. Aliquots containing 10 g of protein were four fold

177

diluted with 100 mM Tris-HCl (pH 8.0) /10mM CaCl2, to a final concentration of 2

178

M urea. Different experiments were performed to obtain optimal conditions for

179

release of the selected peptides. Proteins were digested with chymotrypsin from

180

bovine pancreas (Roche) and shaken at 25C using three different ratios of

181

protease:protein (1:5, 1:10 and 1:20). The digestion time was varied from 10

182

min up to 22 hours. The proteolytic activity was quenched by addition of

183

trifluoroacetic acid (TFA; 0.5% final concentration and the acidified sample was

184

desalted using solid phase extraction (SPE). Supelco LC-18 1 ml SPE columns

185

were washed with 1 ml 95% acetonitrile (ACN) and equilibrated with 1 ml 2%

186

ACN, 0.1% TFA. Subsequently, the digests were added to the columns and after

187

washing with 1 ml 2% ACN, 0.1% TFA, peptides were eluted with 1 ml 50% ACN,

188

0.1% TFA. Solvents were evaporated using a SpeedVac vacuum concentrator and

189

peptides were dissolved in either 20 µl 0.1 M ammonium formate (AF) or 40 µl

190

5% ACN, 0.1% formic acid (FA).

Ac ce

pt

ed

M

an

us

cr

171

9 Page 9 of 49

191

The formation of peptides P1-P9 was finally studied after both 4 and 22 hours of

192

digestion with chymotrypsin in a 1:5 enzyme to protein ratio.

193

2.4. Marker peptides and preparation of peptide reference solution

194 Nine peptides from gluten proteins (P1-P9) were synthesized and purchased

196

from ProteoGenix (France). An overview of the sequences of the peptides is

197

given in Table 2. For each of the peptides, a stock solution of 10 mM (based on

198

the yield and corrected for purity percentage indicated by the supplier) was

199

prepared in 100% MeOH and stored in aliquots at -20C. The individual stock

200

solutions were mixed to obtain equimolar cocktails of the peptides that were

201

further diluted with MilliQ water to a final concentration of 20 µM in 10% MeOH.

202

Calibration standards were prepared by dilution of the 20 µM cocktail solution in

203

a matrix of tryptic digest of a gluten extract of hexaploid wheat variety Toronto

204

(25 ng/µl 5% ACN, 0.1% FA) or a tryptic digest of bovine serum albumin (Sigma

205

Aldrich) (25 ng/µl 5% ACN, 0.1% FA) .

ed

M

an

us

cr

ip t

195

Calibration standard curves were measured using 20, 50, 100, 200, 500,

207

1000, 2000, 5000, and 10000 fmol per peptide loaded on the reversed phase

208

(RP) TS3 1.0 mm × 150 mm analytical RP column (Waters, Milford, MA, USA).

210 211

Ac ce

209

pt

206

2.5. On-line 2D nano LC-MS/MS configuration for untargeted approach

212

Nanoscale LC separation of complex peptide mixtures was performed using

213

the 2-D nanoAcquity UPLC system on-line coupled to a Synapt HDMS Q-TOF MS

214

instrument (Waters, Milford, MA, USA) as described [50,51]. For each run, 4 μL

215

digest was injected (partial loop method) in 20 mM AF (pH 10) on the first RP

216

column (X-bridge BEH130 C18, 5 μm, 300 μm × 50 mm, Waters). Elution from

10 Page 10 of 49

this column was stepwise under high pH and ultrahigh pressure at 2 μL/min.

218

Samples were eluted in four fractions of 13, 18, 25, and 65% ACN in 20 mM AF

219

(pH 10). Peptide eluates were on-line diluted with an excess of 0.1% FA in water

220

at a flow rate of 20 μL/min to reach a ten-fold dilution before being trapped on a

221

C18 (2G-V/M Symmetry 5 μm) trap column (180 μm × 20 mm). Finally, peptides

222

were

223

75 μm × 150 mm analytical RP column (Waters) at 0.3 μL/min with a gradient

224

using 0.1% FA in water as eluent A and 0.1% FA in ACN as eluent B. The

225

separation was carried out using 5% B for 1 min, 10% B for 2 min, 10–40% B

226

over 62 min and 40–85% B over 9 min. After 6 min of rinsing with 85% B and a

227

linear gradient back to 5% B over 2 min, the column was re-equilibrated at initial

228

conditions. Both the Symmetry trap column temperature and the analytical

229

column temperature were maintained at 55 °C by a built-in column heater. Mass

230

spectrometric analyses were performed in positive mode using ESI with a

231

NanoLockSpray source. Eluates were immediately sprayed into a Q-TOF Synapt

232

G1 (Waters). As lock mass, [Glu1]-fibrinopeptide B (1 pmol/L) was delivered

233

from a syringe pump (Harvard Apparatus, USA) to the reference sprayer of the

234

NanoLockSpray source at a flow rate of 0.2 μL/min. The lock mass channel was

235

sampled every 30 sec. LC-MS data were collected from the Synapt G1 operating

236

in either MS/MS or MSE mode for data-dependent acquisition (DDA) or data-

237

independent acquisition (DIA). For MS/MS, the three most intensive single or

238

multiple charged ions eluting from the column were selected for fragmentation.

239

The eluting peptide ions were detected in the MS survey scan (0.6 sec) from a

240

m/z range of 300 to 1400 and MS/MS scan range from 50 to 2000 m/z. A

241

dynamic exclusion window was set at 60 sec.

from

this

column

and

separated

on

a

HSS

T3

1.7 μm,

Ac ce

pt

ed

M

an

us

cr

eluted

ip t

217

242 11 Page 11 of 49

243

2.6. Triple quadrupole settings for targeted analysis

244 245

Targeted LC-MRM/MS analyses were performed on an Acquity UPLC system

246

coupled to a Xevo TQ-S triple quad instrument (Waters, Milford, MA, USA). For

247

each run, 5 μL and 10 μL sample, corresponding to 250 and 500 ng protein

248

digest,

249

1.0 mm × 150 mm analytical RP column (Waters, Milford, MA, USA) at 60°C with

250

a flow rate of 0.15 mL/min. Peptides were eluted using an ten minute LC-method

251

(starting 1 min after injection): 1% B for 0.5 min, linear gradient of 1–42% B

252

over 5.5 min, linear gradient 42-90% B over 1 min, after 1 min of rinsing with

253

90% B, a linear gradient 90-20% for 1 min and back to 1% B over 1 min. The

254

column was re-equilibrated at initial conditions for 1.9 min, with 0.1% FA in

255

water (LC-MS grade) as eluent A and 0.1% FA in 100% ACN as eluent B.

(partial

loop

method)

and

separated

on

a

TS3

ip t

injected

M

an

us

cr

was

MRM mass spectrometric analyses were performed in positive ion mode. A

257

scheduled MRM method was used with MRM detection window set to 0.5-1.6 min

258

per peptide, the duty cycle was set to automatic and dwell times were minimal 5

259

msec. The transitions and detection windows of the marker peptides are listed in

260

Table 2. Cone voltage was set to 35 V.

262 263 264

pt

Ac ce

261

ed

256

2.7. Data analysis

2.7.1. Untargeted MS data analysis

265

DDA data obtained from the MS/MS analysis with the Synapt G1 were

266

processed by ProteinLynx Global Server software (PLGS version 2.5, Waters)

267

for peak detection. Produced peak lists were used for Mascot search (Matrix

268

Science) to identify peptides. Searches were performed using the Uniprot

12 Page 12 of 49

database (07/2013), green plants. Parameters were: Enzyme selection is

270

chymotrypsin, maximum of three missed cleavages; carbamidomethyl C as fixed

271

modification; deamidated NQ and oxidation M as variable modifications;

272

monoisotopic mass values; protein mass unrestricted; peptide mass tolerance

273

±50 ppm; fragment mass tolerance ± 30 mmu; peptide charges 2+, 3+, and 4+.

275

Raw MSE (DIA) data obtained were processed by Progenesis QI software to visualize chromatograms of peptide resolution per fraction.

cr

274

276 2.7.2. Targeted MS data analysis

us

277

ip t

269

To analyze the targeted MRM data from TQ-S, Skyline software (MacCoss Lab

279

at the University of Washington) was used [52]. The raw data were imported into

280

Skyline and peak detection and quantitation was performed on the given set of

281

transitions. Peak areas were exported as a transition result report and further

282

processed in Excel. Per injection the sum of the area of the multiple transitions

283

per peptide was used as quantitative value. In Skyline the ‘area’ value is the

284

peak

285

subtracted. Calculation of linear regression was performed on the summed area

286

for the dilution series of the reference peptides. We applied an intercept=0

287

setting, as it gave the highest R2 value (between 0.974 and 0.994). For the

288

gluten digests the peak areas were also summed per peptide. The peptide

289

amount was calculated using the slope coefficient obtained from the linear

290

regression of standard curves. Femtomole (fmol) levels were calculated to g

291

amount of injected gliadin protein, using the average Mw of gliadins (32285.5

292

Da).

above

the

background and

therefore

background is already

Ac ce

pt

area

ed

M

an

278

293

13 Page 13 of 49

294

2.8. SDS-PAGE and immunoblotting

295 Equal amounts of gluten proteins were separated on SDS-PAGE gels (10%)

297

as described [53] using a Hoefer SE 260 mighty small II system (GE Healthcare)

298

followed by staining with PageBlue (Fermentas). For immunoblotting, proteins

299

were blotted onto nitrocellulose (0.2 m, Bio-Rad Laboratories), omitting

300

methanol from the blotting buffer, using a Mini Trans-Blot Cell (Bio-Rad

301

Laboratories) at 100V for 1 h. B Blots were stained using a MemCode

302

Reversible Protein Stain Kit for Nitrocellulose Membranes (Fisher Scientific) prior

303

to incubation with monoclonal antibodies (mAbs). Blots were incubated as

304

described [54] using mAbs specific for T-cell stimulatory epitopes Glia-9 and

305

Glia-20 [55-57]. Antibody binding to the blots was visualized by staining for

306

alkaline

307

tetrazolium (NBT) and 5-bromo-4-chloro-3-indolyl phosphate (BCIP) (Sigma-

308

Aldrich). The gluten protein extract of ‘Toronto’ was used on each separate

309

immunoblot as an ‘inter-gel’ control.

an

us

cr

ip t

296

secondary

antibody,

using

Nitro

Blue

pt

ed

M

phosphatase-conjugated

Blots were scanned using a Bio-Rad GS-710 Calibrated Imaging Densitometer

311

(Bio-Rad Laboratories) and saved as TIFF images. Pixel intensities were

312

calculated per lane using Quantity One software (Bio-Rad Laboratories). Relative

313

intensities differed specifically per mAb used, but were normalized to values

314

obtained

Ac ce

310

for

the

‘inter-gel’

control.

14 Page 14 of 49

315

316

3. Results

317 3.1. Identification of gluten proteins in wheat grains by on-line 2D LC-MS/MS

ip t

318 319

In a previous genomics study, the genetic diversity present in -gliadin

321

transcripts and genes has been investigated showing that CD immunogenic

322

epitopes are encoded by the A- and D-genome and not by the B-genome. Fig. 1

323

shows the protein sequence variants of the most frequently found expressed

324

sequence tags (ESTs) originating from -gliadins in wheat, as present in the

325

NCBI-UniGene database. The amino acid sequences derived from the different

326

translated contig sequences have been aligned for the CD-epitope containing

327

region. Sequence variation outside this region is not displayed here. Sequence

328

variants are grouped per genome origin (Gli-A2, Gli-B2 and Gli-D2). The Glia-2

329

and Glia-9 epitopes are marked in yellow and the Glia-20 epitope is marked in

330

green. As mentioned before, these epitopes are encoded by both the A- and D-

331

genome, but not by the B-genome. The large protease resistant 33-mer,

332

containing multiple CD-epitopes, is marked in blue and the 19-mer and 26-mer

333

derivatives, encoded by only the D-genome, are marked in yellow. The -gliadins

334

encoded by the A-genome in hexaploid and tetraploid wheat contain only the 13-

335

mer peptide sequence containing the Glia-9 epitope.

Ac ce

pt

ed

M

an

us

cr

320

336

To study the -gliadins present in wheat varieties at protein level, a

337

previously developed two-step extraction protocol using isopropanol was applied

338

to efficiently extract the gluten proteins [49]. Total gluten concentrations in our

339

extracts were 1.02 g/l (30.45 g/mg flour), 0.72 g/l (21.45 g/mg flour), 15 Page 15 of 49

and 0.80 g/l (23.85 g/mg flour) for ‘Toronto’, Minaret’, and ‘Dibillik sinde’,

341

respectively. For LC-MS/MS analysis of the gluten fraction, proteins were first

342

digested with a specific protease in order to generate peptides with a length that

343

was amenable for proper LC separation and MS detection. Routinely, trypsin is

344

used for enzymatic digestion of complex protein mixtures, which cleaves proteins

345

C-terminal to the basic amino acids arginine (R) and lysine (K). However, due to

346

the low number of these amino acids in gluten proteins, the digestion of the

347

gluten proteins into smaller peptides by trypsin is not feasible. Therefore,

348

chymotrypsin from bovine pancreas was chosen for digestion of the gluten

349

proteins. Chymotrypsin cleaves C-terminal to phenylalanine (F), tyrosine (Y), and

350

tryptophan (W) unless the next residue is a proline (P), and at a lower rate C-

351

terminal to leucine (L), methionine (M), alanine (A), aspartic acid (D), and

352

glutamic acid (E). In Fig. 1, the chymotryptic cleavage sites present in the CD-

353

epitope containing domain of the different -gliadins are indicated in black (F, Y,

354

and W). From this result, we concluded that all -gliadin proteins encoded by the

355

A-genome will, theoretically, yield at least one peptide containing the Glia-9

356

epitope when digested with chymotrypsin (Fig. 1, indicated in yellow). This also

357

holds for -gliadin sequences encoded by the D-genome that will result in

358

different sized peptides (19-mer, 26-mer, and 33-mer) containing the Glia-2/9

359

epitopes (Fig. 1, indicated in yellow) or sequence variants thereof.

Ac ce

pt

ed

M

an

us

cr

ip t

340

360

The use of chymotrypsin as an optimal protease to generate CD-epitope

361

containing peptides was confirmed by digestion of the gluten protein extracts

362

from hexaploid wheat varieties Toronto and Minaret. The generated peptides

363

were separated into four fractions by 2D nano UPLC and on-line analyzed using

364

data-dependent MS/MS mode and data-independent MSE mode. Supplementary

365

Fig. S1 shows the resolution of the detected peptides present in the four different 16 Page 16 of 49

fractions. Supplementary Table S1 summarizes the different gliadin and glutenin

367

proteins identified in the gluten extract of wheat variety Toronto. Whereas the

368

majority of the identified peptides originated from -gliadins, also numerous

369

peptides from -gliadins and LMW-GS have been detected. The - and -gliadins

370

and the LMW-GS were classified into families according to the presence of at

371

least one unique identified (proteotypic) peptide in their sequence. In this way

372

we could discriminate between 24 different -gliadin families originating from

373

Triticum and Aegilops species (Table 1), 15 different -gliadin families, 7 different

374

LMW-GS families, and a single -gliadin and HMW-GS family.

us

cr

ip t

366

The peptide identifications covering the sequence area containing CD-

376

epitopes (displayed in bold) in -gliadins are displayed in Table 1. Different

377

sequence variants of the peptides have been identified and are listed as well.

378

From these identified peptides, we have selected the most frequently occurring

379

peptides as marker peptides for LC-MRM/MS detection, indicated P1 to P9.

ed

M

an

375

Peptides P1 to P4 occurred with peptide P6 in the -gliadin families, whereas

381

the -gliadin peptide containing the 33-mer (P5) was always present in

382

combination with the peptide containing the Glia-20 epitope having a P to S

383

substitution (P7) (Fig. 1; Table 1). Peptide P6 (in combination with P7)

384

containing the Glia-20 epitope occurred in nearly all -gliadin families encoded

385

by the A- and D-genome, but was absent in -gliadin families encoded by the B-

386

genome (Table 1, no. 8, 14, and 16). Peptides from the same region encoded by

387

the B-genome were more difficult to be identified because of the absence of

388

chymotrypsin digestion sites in this region. Instead, three different -gliadin

389

families from the B-genome have been identified (Table 1, no. 8, 14, and 16)

390

with proteotypic peptides outside this region of the gliadin sequence (see

391

Supplementary Table S1). Peptides P8 and P9 appeared in nearly all -gliadin

Ac ce

pt

380

17 Page 17 of 49

392

families, where P9 is the product of a missed cleavage and also contains peptide

393

P8. We reasoned that a targeted LC-MS detection method for the selected

395

peptides would be able to detect the presence of both the different CD-epitopes

396

individually (P1 to P7) and the sum of multiple gliadin proteins (P8 and P9).

ip t

394

397 3.2. Quantitative LC-MRM/MS assay development

cr

398 399

After identification of the different -gliadin peptides in wheat varieties

401

Toronto and Minaret by on-line 2D LC-MS/MS, a targeted MS-based assay was

402

developed for absolute quantification of the selected marker peptides P1-P9

403

(Table 1). In this study a triple quadrupole instrument (Xevo TQ-S) was used to

404

quantify CD-epitopes containing -gliadins in gluten extracts. Non-labelled

405

synthetic marker peptides P1-P9 were used both for the development of the MRM

406

assay and as standards for the generation of calibration curves for absolute

407

quantification of the corresponding -gliadins.Note that P9 has a missed cleavage

408

containing the P8 peptide sequence. Both peptides, P8 and P9, were present in

409

17 of the 24 -gliadin families (Table 1).

Ac ce

pt

ed

M

an

us

400

410

First, the optimal transitions for quantification and confirmation of each of

411

these peptides in a gluten digest were determined. Since the sensitivity of a

412

triple quadrupole MS is critically dependent on ionization conditions and tuning

413

parameters, a mix of all nine synthesized peptides was loaded on a RP column

414

and eluted with a short gradient to optimize cone voltage and collision energy of

415

the TQS. Table 2 summarizes the most optimal parameters achieved for each of

416

the marker peptides, and the transitions used for quantification and confirmation.

417

For most precursor peptides, one m/z value was chosen as the predominant 18 Page 18 of 49

charge state for selection in the first quadrupole. To obtain the most sensitive

419

assay, the collision energy was optimized per precursor to generate maximal

420

fragment ion intensities. In the third quadrupole, the most intense transitions

421

derived from one precursor ion were selected for quantification. In Table 2, the

422

peak retention time measured for each of the marker peptides is shown using the

423

LC conditions as described in section 2.6. (10 min gradient). For each peptide a

424

retention window of  0.5 to 2 min around its peak elution time was used for

425

acquisition in a time-scheduled MRM method. Scheduled MRM instructed the TQS

426

to measure a selected set of transitions during a particular retention time,

427

providing longer dwell times to monitor the target ions and thus improved signal

428

to noise ratios for most of the transitions. A prerequisite in a scheduled MRM

429

analysis is that the chromatography is stable and reproducible within the chosen

430

retention time window.

M

an

us

cr

ip t

418

Since most of the marker peptides contained overlapping sequences, several

432

fragment ions from different marker peptides had identical masses, as listed in

433

Table 2. The specificity for detection was achieved by using unique precursor to

434

product transitions and using scheduled MRM acquisitions around the elution time

435

of the precursor peptide. Supplementary Fig. S2 shows the chromatograms

436

obtained for a mixture of the nine marker peptides and for a ‘Toronto’ gluten

437

protein digestion.

439

pt

Ac ce

438

ed

431

3.3. Calibration curves of peptide standards

440 441

To determine the concentration range at which the nine marker peptides

442

showed a linear MRM response, calibration curves were generated by injection of

443

synthetic peptides in a range of 20-10000 fmol. Peak areas above background 19 Page 19 of 49

were determined by summing the multiple transitions per peptide using Skyline

445

software. Initially, a deviation from linearity was observed for most of the

446

peptides when the injected amount decreased below 100 fmol. Although different

447

glass vials and ‘low binding’ plastic tubes were tested to diminish the loss of

448

peptides when handling these low amounts of peptide, the calibration curves in

449

the lower concentration range remained quite variable. This problem could be

450

solved by preparing peptide dilutions in a matrix of a protein digest that did not

451

contain the marker peptides. When the dilutions of the peptides were prepared in

452

a matrix of tryptic digest from either bovine serum albumin (BSA) or gluten

453

protein, calibration curves showed excellent linearity in a range of four orders of

454

magnitude. The R square correlation values of the nine calibration curves varied

455

between 0.969 and 0.994, as listed in Table 2. In Supplementary Fig. S3, an

456

overview of the calibration curves for all the nine peptides is given.

M

an

us

cr

ip t

444

459

3.4. Digestion conditions

pt

458

ed

457

Prior to quantitative LC-MRM/MS analysis of the nine -gliadin peptides P1-P9

461

in protein digests from gluten extracts (see below), the reproducibility of the

462

digestion procedure with chymotrypsin was determined. Digestion was optimized

463

using gluten extracts from hexaploid wheat variety Toronto and tetraploid wheat

464

variety Dibillik sinde. In contrast to trypsin, which is mostly used in proteomics

465

analysis, chymotrypsin is not an endpoint protease. In addition to its primary

466

cleavage specificity it displays secondary, low affinity activity leading to further

467

breakdown of digested peptides. This means that an optimal digestion time and

468

enzyme to substrate ratio needs to be determined. Therefore, the progress of

469

chymotryptic digestion of gluten proteins dissolved in 2 M urea was followed by

Ac ce

460

20 Page 20 of 49

sampling at different time points of incubation at 25C. The ratio of chymotrypsin

471

to protein was also varied, i.e. 1:20, 1:10, and 1:5. Five peptides (P1-P5)

472

containing Glia-2/9 epitopes had reached a maximum MRM signal already at, or

473

even well-before, 4 h of incubation (Fig. 2A). Longer incubation did not result in

474

a significant decrease of the signals, and indicated that no or limited secondary

475

chymotrypsin activity to peptides P1-P5 took place (Fig. 2B). Peptides P6 and P7,

476

however, showed a much slower release and maximum MRM responses were

477

reached only after 22 h of incubation and an extra addition of chymotrypsin of an

478

enzyme to protein ratio of 1:5 (Fig. 2C and 2D). Finally, formation of all peptides

479

(P1-P9) was studied after 4 and 22 hours of incubation with chymotrypsin in a

480

1:5 enzyme to protein ratio. The results, shown in Supplementary Fig. S5,

481

confirmed the previous findings that P1-P5 reached a maximum response after

482

only 4 hours of digestion, while formation of P6 and P7 reached its maximum

483

after 22 hour of incubation. Also peptide P8, present in most -gliadin families,

484

showed a maximum response after 22 h. The slow formation of P8 is related to

485

the presence of a missed cleavage site, as is shown by the P9 peptide. The signal

486

of peptide P9 decreased over time, reaching a level just above background after

487

22 h when it was fully digested into P8. The reason why peptides P6, P7 (both

488

containing the Glia-20 epitope), and P8were released very slowly during

489

chymotrypsin digestion of -gliadins is not clear. In conclusion, these results

490

indicated that optimal digestion time to detect the CD-epitope containing

491

peptides is 22 hours and this was used for analysis of wheat varieties.

Ac ce

pt

ed

M

an

us

cr

ip t

470

492 493

3.5. Quantification of -gliadins in three wheat varieties by LC-MRM/MS

494

21 Page 21 of 49

After optimization of the digestion conditions, we analyzed protein extracts

496

from three wheat varieties with the developed MRM method: the modern wheat

497

variety Toronto (AABBDD), the landrace variety Minaret (AABBDD), and the

498

tetraploid variety Dibillik sinde (AABB). For LC-MRM/MS analysis two amounts

499

(250 ng and 500 ng) of digested gluten sample were injected in duplicate. Peak

500

areas (minus background) were determined by summing the selected transitions

501

per peptide using Skyline software. The total peak areas were converted to fmol

502

amounts of peptide, using the external calibration curves obtained from a dilution

503

series of the synthetic peptides. The fmol amount of peptide was converted to

504

the correlating amount of -gliadin per microgram digested gluten protein

505

extract using the average Mw of -gliadins (32285.5 Da). This is displayed in

506

Table 3 and Supplementary Fig. S4. Samples from two different digestion times

507

(4 h and 22 h) were analyzed and the 22 h sample was interpreted as completely

508

digested. In Table 3, an overview is given of the quantified peptides in the three

509

wheat varieties. In the tetraploid variety Dibillik sinde, the marker peptide P8

510

and the two peptides P1 and P6 containing the Glia-9 and Glia-20 CD-epitopes

511

were detected. These peptides, P1 and P6, are derived from -gliadins from the

512

A-genome. In the hexaploid varieties ‘Toronto’ and ‘Minaret’ all selected peptides

513

were detected (P9 was only detected in 4 h digests). The standard deviation of

514

the detected peptides was between 3% and 15% for most peptides, indicating a

515

good reproducibility of detection. The amounts of peptides P1 to P5 containing

516

the Glia-2/9 epitopes, could be related to the total ‘load’ of gliadin (P8+P9).

517

This allows comparison of the presence of these CD-epitopes in different wheat

518

varieties and gains insight into the immunogenicity (see Supplementary Fig. S5).

519

Even though peptides P8 and P9 did not cover the total -gliadin content, as will

520

be further discussed in the general discussion, they could be used to indicate the

Ac ce

pt

ed

M

an

us

cr

ip t

495

22 Page 22 of 49

amount of -gliadins present. Results showed that ‘Toronto’ contained the

522

highest amount of -gliadins per total gluten amount and ‘Dibillik sinde’ the

523

lowest amount (Table 3, Supplementary Fig. S5), whereas the total gluten

524

amount for ‘Dibillik sinde’ was higher compared with the total gluten amount for

525

‘Minaret’.

ip t

521

The total amount of Glia-2/9 and Glia-20 epitopes could be compared with

527

results obtained from previous studies where we performed large scale

528

immunoblotting experiments with antibodies against CD-epitopes Glia-9 and

529

Glia-20 (see Supplementary Fig. S6). While equal amounts of total gluten

530

protein extract were used for immunoblotting, the overall signal intensities

531

obtained with mAbs against the Glia-9 (Supplementary Fig. S6B) and Glia-20

532

(Supplementary Fig. S6C) epitopes were different for the three wheat varieties,

533

which indicated that these epitopes were present in different amounts. In Fig. 3

534

an overview is given of the LC-MRM/MS results and shows the sum of P1-P5

535

(Glia-2/9) and P6-P7 (Glia-20) using the average Mw of -gliadins (32285.5

536

Da). Peptide P1, containing the Glia-9 epitope and present in all three varieties,

537

was highly present in both the modern hexaploid variety Toronto and the

538

tetraploid variety Dibillik sinde (Table 3). The total amount of P1, the only

539

peptide present in the tetraploid wheat variety Dibillik sinde encoded by the A-

540

genome, was about half (56%) of the amount present in ‘Toronto’ (Fig. 3A). This

541

result was different from the immunoblotting experiments where the signal in

542

‘Dibillik sinde’ using the Glia-9 mAb (Supplementary Fig. S6B) was about three

543

quarters (75%) compared with the signal in ‘Toronto’, while equal amounts of

544

total gluten protein extract were used for immunoblotting. This was probably

545

caused by a-specific signals by the Glia-9 mAb. With LC-MRM/MS, it was

546

measured that in ‘Minaret’ the total amount of P1 to P5 was about half (52%) of

Ac ce

pt

ed

M

an

us

cr

526

23 Page 23 of 49

547

that in ‘Toronto’ (Fig. 3A). Results from immunoblotting using the Glia-9 mAb

548

(Supplementary Fig. S6B) showed in ‘Minaret’ more than half the signal

549

compared with ‘Toronto’, which was probably also caused by a-specificity of the

550

Glia-9 mAb. The amount of Glia-20 epitope (P6) in ‘Dibillik sinde’ was about one third

552

(32%) of the amount of Glia-20 epitopes (P6 and P7) in ‘Minaret’ and one tenth

553

(13%) of the amount (P6 and P7) in ‘Toronto’ (Fig. 3B). In general, peptides P1

554

to P7 were present in lower amounts in ‘Minaret’ compared with ‘Toronto’ (Table

555

3, Supplementary Fig. S5). The results obtained from immunoblotting using the

556

Glia-20 mAb (Supplementary Fig. S6C) were much more comparable with the

557

LC-MRM/MS results then the results obtained for the Glia-9 mAb, probably

558

because of the higher specificity of the Glia-20 mAb. The number of protein

559

bands reacting with both mAbs was less in ‘Minaret’ and ‘Dibillik sinde’ compared

560

with

‘Toronto’.

Ac ce

pt

ed

M

an

us

cr

ip t

551

24 Page 24 of 49

561

562

4. Discussion

563 The intention of the described study is to develop a peptide targeted

565

detection method that can accurately discriminate between different CD

566

stimulating epitope sequences originating from the multitude of -gliadin

567

proteins expressed in wheat varieties. [25,58-61][60,61][49][60][24]Improved

568

and new detection methods have been under development in the meantime [38-

569

43,62]. However, developing a new approved detection method for immunogenic

570

gluten is very difficult because of the complexity of detecting and quantifying

571

only a subset of ‘harmful’ gluten peptides instead of detecting and quantifying all

572

gluten proteins. We aimed to develop a method detecting and quantifying only

573

harmful epitopes in wheat varieties that stimulate the development of CD. The

574

accuracy of the method will depend on the sensitivity of the mass spectrometer

575

and the optimal digestion conditions of the gluten proteins. It has been shown

576

before by Šalplachta et al. [63] that using trypsin for the digestion of gluten

577

proteins did not result in sufficient amounts of peptides and results in peptides

578

that have too high molecular masses. LC-MS detection methods as developed by

579

Sealey-Voyskner et al. [42] and Prandi et al. [64] make use of the proteases

580

pepsin, trypsin, and chymotrypsin to simulate the human intestinal digestive

581

tract. However, results show that immunodominant peptides are the result of

582

chymotrypsin digestion only. Gastrointestinal digestion may also be different

583

among humans. By using chymotrypsin and optimal digestion conditions this

584

possible difference is excluded.

Ac ce

pt

ed

M

an

us

cr

ip t

564

25 Page 25 of 49

A second success factor for the accuracy of the method is the detection level.

586

The sensitivity and specificity of mass spectrometry are very high and allow

587

detection of peptides containing single amino acid substitutions that can

588

differentiate between immunogenic and non-immunogenic CD-epitope containing

589

peptides. As was shown by Mitea et al. [24], P to S substitutions can cause

590

inactivation of immunogenic CD-peptides. With this knowledge and using MRM

591

mass spectrometry (LC-MRM/MS) it is possible to identify these amino acid

592

substitutions within peptides and search for wheat varieties and species that are

593

low in immunogenic CD-epitopes.

us

cr

ip t

585

It is difficult to compare ELISA read outs with results from LC-MS measuring

595

CD-immunogenic gluten. Both methods are very different in the way of detection

596

and calibration. ELISA uses indirect detection using a mAb and calibrates against

597

a broad range of proteins or peptides depending on whether sandwich or

598

competitive ELISA is being used. LC-MS detects directly the presence of peptides

599

and calibration is against peptide standards. Because of the complexity of gluten

600

proteins, there is not a single peptide present in all gluten proteins and therefore

601

specific peptides of interest should be quantified. Cressey et al. [65] described

602

the use of the method developed by Sealey-Voyksner et al. [42] in which

603

quantification is based on one peptide present in LMW-GS. Because the

604

contribution of LMW-GS to the total gluten protein content varies among

605

hexaploid wheats [66], there should be a certain calculation factor taken into

606

account to obtain the true level of ppm in food products or even in bread wheat

607

flours. The method developed by Prandi et al. [64] described quantification by

608

using a labeled peptide of which the peptide is only encoded by the A-genome in

609

wheat. Because gluten proteins encoded by the A-, B-, and D-genome are not

610

identical, also a certain correcting calculation factor should be included to obtain

Ac ce

pt

ed

M

an

594

26 Page 26 of 49

the correct value for levels of ppm. The authors described the use of three

612

marker peptides for quantification of the gliadin amount. These marker peptides

613

were derived from the N-terminus of -gliadins and were released after pepsin

614

digestion. These three marker peptides, however, are present in only 60% of -

615

gliadins from T. aestivum (Uniprot, 2014). We used two different peptides for

616

total -gliadin quantification (P8 and P9) that are located at the start of the

617

unique domain 1 in -gliadins [67] and are released after chymotrypsin

618

digestion. These peptides are present in about 64% of -gliadins from T.

619

aestivum. This percentage could be increased to 93% when another isoform of

620

these peptides is included (Table 1). This will be necessary when we directly

621

want to quantify peptides present in food products.

an

us

cr

ip t

611

For the developed LC-MS/MRM method, we decided not to use labeled

623

peptides as was argued before by Sealey-Voyksner et al. [42]. The reason was

624

because of increasing costs when many CD-epitope containing peptides need to

625

be quantified. At this moment, more than 24 different CD-epitopes are known.

ed

M

622

Tetraploid wheat, known as pasta wheat, contains the A- and the B-genome

627

and many varieties exist, ranging from old landraces to cultivated lines.

628

Especially old landraces, but also nowadays cultivated pasta wheat, may be

629

mixtures of different genotypes including hexaploid bread wheat [61,68]. Prandi

630

et al. [68] described the detection of contamination of tetraploid wheat batches

631

with hexaploid wheat using the 33-mer as a biomarker. The 33-mer is encoded

632

by -gliadins from the D-genome only. However, not all hexaploid wheat

633

varieties may contain the 33-mer [58]. Here, we describe the use of three

634

additional peptides (P2, P3, and P4) specifically encoded by the D-genome that

635

can be used to detect this type of contamination. One of these peptides (P2,

636

LQLQPFPQPQLPYPQPQPF) was detected by Prandi et al. [40,41] in tetraploid

Ac ce

pt

626

27 Page 27 of 49

637

durum wheat showing that the used durum wheat variety was probably

638

contaminated with hexaploid wheat. It is shown by Mitea et al. (see Fig. 1) [24]

639

that only peptide sequence P1 (LQLQPFPQPQLPY) is encoded by the A-genome

640

and can be present in tetraploid wheat varieties. Immunodominancy of a CD-epitope is probably determined by the speed of

642

release of the gluten peptide containing this epitope and the amount of these

643

peptides present in the small intestine [64]. Gluten peptides shown to be

644

immunodominant (33-mer and derivatives) are indeed released very fast, within

645

one hour, from the gluten extracts in our in vitro digestion. In contrast, the

646

release of the peptide containing the Glia-20 epitope takes much more time, up

647

to 22 hours, and may therefore be less important in stimulating human T-cells.

an

us

cr

ip t

641

Recent research based on detection of CD-epitopes using specific T-cell-,

649

antibody-, and DNA analyses demonstrated that the CD-immunogenicity varies

650

across gluten proteins among and within different wheat varieties and species

651

[25,58-61]. In this study, we analyzed a modern hexaploid wheat variety

652

Toronto, an older hexaploid wheat variety Minaret, and a landrace tetraploid

653

wheat variety Dibillik sinde. Selected wheat varieties have been analyzed before

654

by immunoblotting using mAbs against CD-epitopes Glia-9 and Glia-20 for

655

relative quantification of the amount of these epitopes [60,61]. However, these

656

antibodies do not recognize the complete T-cell specific epitope. The mAb

657

recognition site for the Glia-9 epitope is found in sequences of -gliadins, -

658

gliadins, and -gliadins/D-type LMW-GS obtained from wheat species [49]. The

659

mAb recognition site for the Glia-20 epitope is only found in sequences of -

660

gliadins. In -gliadin sequences, the presence of the recognition site for both

661

mAbs correlates nicely with the presence of the recognition site of the human T-

662

cell [60]. However, both the Glia-9 and Glia-20 mAbs will also recognize

Ac ce

pt

ed

M

648

28 Page 28 of 49

naturally occurring non-epitope sequences that are inactive in T-cell response

664

e.g. because of a P to S substitution present outside the mAb recognition site

665

[24]. Using these mAbs in ELISA or immunoblotting for detecting the Glia-9 and

666

Glia-20 epitopes may therefore result in overestimation of the presence of

667

immuno-active peptides. The development of an LC-MRM/MS method for direct

668

quantitative detection enables us to compare the two different methods and we

669

show that LC-MRM/MS ensures more precise detection and quantification of CD-

670

epitopes. ELISA and LC-MRM/MS could be used in combination where ELISA

671

could be used to identify the presence of (total) gluten proteins and the

672

developed LC-MRM/MS method could be used to specifically identify and quantify

673

the particular immunogenic peptides containing CD-epitopes. The detection and

674

quantification of different and individual immunogenic peptides by LC-MRM/MS

675

will need to be coupled to a different threshold level of exposure levels for CD-

676

patients.

ed

M

an

us

cr

ip t

663

The developed LC-MRM/MS method could also be used in combination with

678

RNA sequencing as described by Salentijn et al. [69]. RNA sequencing is

679

performed on developing wheat seeds to analyze which -gliadin genes are being

680

expressed. This is in contrast to LC-MRM/MS that uses mature seeds for

681

extraction of gluten proteins. The developing stage of the seed is important in

682

determining the amount of mRNA present encoding the different gluten proteins

683

[70,71] and protein composition might change during seed development [72].

684

Furthermore, the relation between mRNA level and protein level is not fully

685

correlated and can differ among genes. In addition, even though Salentijn et al.

686

[69] found novel variants of the CD-epitopes via RNA sequencing, the true

687

presence of these peptide sequence variants in mature seeds can only be shown

688

by LC-MS gluten protein analysis. The results obtained by Salentijn et al. [69] for

Ac ce

pt

677

29 Page 29 of 49

689

the tetraploid ‘Dibillik sinde’ is in accordance with our LC-MRM/MS results. Low

690

expression levels are found for genes encoding the Glia-20 epitope by RNA

691

sequencing from developing seeds and results from LC-MRM/MS show very low

692

levels of corresponding peptides in mature seeds. Combining both techniques

693

may

694

immunogenic

development

of

genetic

markers

for

breeding

of

low

ip t

the

wheat.

Ac ce

pt

ed

M

an

us

cr

enable

30 Page 30 of 49

695

696

5. Conclusions

697 Analysis of hexaploid wheat for the presence of CD-epitopes from -gliadins

699

is more complex than analysis of those present in tetraploid wheat because of

700

the complexity of the D-genome. The targeted LC-MRM/MS method using triple

701

quadrupole mass spectrometry developed in this study, specifically quantifies the

702

amounts of individual immunogenic CD-epitopes from both A- and D-genome -

703

gliadins present in hexaploid wheat. It also quantifies the isoforms present of

704

peptides encoded by the D-genome. LC-MRM/MS technology is useful for

705

identification

706

immunogenic epitopes at low femtomolar levels of detection. It is a fast and

707

sensitive method and detection and quantification are reproducible. The method

708

can be extended to any other CD immunogenic epitope to obtain quantification

709

for all epitopes present in wheat varieties and species. It will enable the selection

710

of varieties or species with low amounts of T-cell stimulatory epitopes that can

711

be used in food products to lower the ‘gluten load’ and thereby prevent

712

development

quantification

of

biomarker

peptides

containing

CD

Ac ce

pt

ed

M

and

an

us

cr

ip t

698

and

prevalence

of

CD.

31 Page 31 of 49

713

714

Acknowledgements

715 This research was partially funded by the European regional development

717

fund and the province of Gelderland and Overijssel (GO EFRO 2007-2013) and

718

the Dutch Ministry of Economic Affairs through the DLO program ‘Plant and

719

Animal for Human Health’ (KB-05-001-019-PRI) and ‘Healthy and safe food in

720

chain perspective’ (KB-15-001-016). The authors thank Noor Bas (Centre for

721

Genetic Resources, CGN, The Netherlands) and Hein de Jong (Limagrain) for

722

providing the seeds, and Elma Salentijn for providing the original figure to adapt

723

Fig.

1.

Ac ce

pt

ed

M

an

us

cr

ip t

716

32 Page 32 of 49

724

725

References

726

728 729

coeliac disease, Nat Rev Immunol. 13 (2013) 294-302. [2]

730 731

L.M. Sollid, B. Jabri, Triggers and drivers of autoimmunity: lessons from N. Reilly, P. Green, Epidemiology and clinical presentations of celiac disease, Semin Immunopathol. 34 (2012) 473-478.

[3]

ip t

[1]

cr

727

K. Mustalahti, C. Catassi, A. Reunanen, E. Fabiani, M. Heier, S. McMillan, L. Murray, M.-H. Metzger, M. Gasparin, E. Bravi, M. Mäki, The prevalence of

733

celiac disease in Europe: Results of a centralized, international mass

734

screening project, Ann Med. 42 (2010) 587-595.

736 737

S.D. Rampertab, N. Pooran, P. Brar, P. Singh, P.H.R. Green, Trends in the

an

[4]

presentation of celiac disease, Am J Med. 119 (2006) 355.e359-355.e314. [5]

J.F. Ludvigsson, A. Rubio-Tapia, C.T. van Dyke, L.J. Melton, III, A.R.

M

735

us

732

738

Zinsmeister, B.D. Lahr, J.A. Murray, Increasing incidence of celiac disease

739

in a North American population, Am J Gastroenterol. 108 (2013) 818-824. [6]

A. Rubio-Tapia, J.F. Ludvigsson, T.L. Brantner, J.A. Murray, J.E. Everhart,

ed

740

The prevalence of celiac disease in the United States, Am J Gastroenterol.

742

107 (2012) 1538-1544.

743

[7]

J.Y. Kang, A.H.Y. Kang, A. Green, K.A. Gwee, K.Y. Ho, Systematic review: worldwide variation in the frequency of coeliac disease and changes over

749

Ac ce

744

pt

741

750

477-482.

745 746

time, Aliment Pharmacol Ther. 38 (2013) 226-245.

[8]

747 748

751 752 753 754

L. Day, in G.O. Phillips, P.A. Williams (Editors), Handbook of food proteins Woodhead Publishing Limited, 2011, pp. 267-288.

[9]

J.H. Woychik, J.A. Boundy, R.J. Dimler, Starch gel electrophoresis of wheat gluten proteins with concentrated urea, Arch Biochem Biophys. 94 (1961)

[10] F. MacRitchie, D. Lafiandra, in S. Damodaran, A. Paraf (Editors), Food proteins and their applications, Dekker, New York, 1997. [11] P.R. Shewry, A.S. Tatham, in P.R. Shewry, R. Casey (Editors), Seed Proteins, Kluwer Academic Publishers, Dordrecht, 1999, pp. 11-36.

33 Page 33 of 49

755

[12] M.C. Gianibelli, O.R. Larroque, F. MacRitchie, C.W. Wrigley, Biochemical,

756

genetic,

and

molecular

characterization

of

wheat

757

component subunits, Cereal Chem. 78 (2001) 635-646.

glutenin

and

its

[13] E.A. Jackson, L.M. Holt, P.I. Payne, Characterisation of high molecular

759

weight gliadin and low-molecular-weight glutenin subunits of wheat

760

endosperm by two-dimensional electrophoresis and the chromosomal

761

localisation of their controlling genes, Theor Appl Genet. 66 (1983) 29-37.

ip t

758

[14] P.I. Payne, L.M. Holt, C.N. Law, Structural and genetic-studies on the high-

763

molecular-weight subunits of wheat glutenin. 1. Allelic variation in subunits

764

amongst varieties of wheat (Triticum aestivum), Theor Appl Genet. 60

765

(1981) 229-236.

us

cr

762

[15] O.D. Anderson, J.C. Litts, F.C. Greene, I. Characterization of ten new

767

wheat alpha-gliadin genomic clones, evidence for limited sequence

768

conservation of flanking DNA, and southern analysis of the gene family,

769

Theor Appl Genet. 95 (1997) 50-58.

an

766

[16] T.W. Okita, V. Cheesbrough, C.D. Reeves, Evolution and heterogeneity of

771

the alpha-/beta-type and gamma-type gliadin DNA sequences, J Biol

772

Chem. 260 (1985) 8203-8213.

M

770

[17] G. Branlard, M. Dardevet, R. Saccomano, F. Lagoutte, J. Gourdon, Genetic

774

diversity of wheat storage proteins and bread wheat quality, Euphytica.

775

119 (2001) 59-67.

pt

777

[18] P.R. Shewry, A.S. Tatham, P.A. Lazzeri, Biotechnology of wheat quality, J Sci Food Agric. 73 (1997) 397-406.

Ac ce

776

ed

773

778

[19] L. Sollid, S.-W. Qiao, R. Anderson, C. Gianfrani, F. Koning, Nomenclature

779

and listing of celiac disease relevant gluten T-cell epitopes restricted by

780

HLA-DQ molecules, Immunogenetics. 64 (2012) 455-460.

781

[20] E.H. Arentz-Hansen, R. Körner, Ø. Molberg, H. Quarsten, W. Vader, Y.M.C.

782

Kooy, K.E.A. Lundin, F. Koning, P. Roepstorff, L.M. Sollid, S.N. McAdam,

783

The intestinal T-cell response to -gliadin in adult celiac disease is focused

784

on a single deamidated glutamine targeted by tissue transglutaminase, J

785

Exp Med. 191 (2000) 603-612.

786

[21] E.H. Arentz-Hansen, S.N. McAdam, Ø. Molberg, B. Fleckenstein, K.E.A.

787

Lundin, T.J.D. Jorgensen, G. Jung, P. Roepstorff, L.M. Sollid, Celiac lesion T

788

cells recognize epitopes that cluster in regions of gliadins rich in proline

789

residues, Gastroenterology. 123 (2002) 803-809. 34 Page 34 of 49

790

[22] L. Shan, Ø. Molberg, I. Parrot, F. Hausch, F. Filiz, G.M. Gray, L.M. Sollid, C.

791

Khosla, Structural basis for gluten intolerance in celiac sprue, Science. 297

792

(2002) 2275-2279. [23] C.L. Marino, N.A. Tuleen, G.E. Hart, J.C. Nelson, M.E. Sorrells, Y.H. Lu, P.

794

Leroy, C.R. Lopes, Molecular genetic maps of the group 6 chromosomes of

795

hexaploid wheat (Triticum aestivum L. em. Thell.), Genome. 39 (1996)

796

359-366.

ip t

793

[24] C. Mitea, E.M.J. Salentijn, P. van Veelen, S.V. Goryunova, I.M. van der

798

Meer, H.C. van den Broeck, J.R. Mujico, V. Monserrat, L.J.W.J. Gilissen,

799

J.W. Drijfhout, L. Dekking, F. Koning, M.J.M. Smulders, A universal

800

approach to eliminate antigenic properties of alpha-gliadin peptides in

801

celiac disease, PLoS ONE. 5 (2010) e15637.

us

cr

797

[25] Ø. Molberg, A.K. Uhlen, T. Jensen, N.S. Flæte, B. Fleckenstein, H. Arentz-

803

Hansen, M. Raki, K.E.A. Lundin, L.M. Sollid, Mapping of gluten T-cell

804

epitopes in the bread wheat ancestors: Implications for celiac disease,

805

Gastroenterology. 128 (2005) 393-401.

M

an

802

[26] T.W.J.M. van Herpen, S. Goryunova, J. van der Schoot, M. Mitreva, E.M.J.

807

Salentijn, O. Vorst, M. Schenk, P. van Veelen, F. Koning, L. van Soest, B.

808

Vosman, D. Bosch, R.J. Hamer, L.J.W.J. Gilissen, M.J.M. Smulders, Alpha-

809

gliadin genes from the A, B, and D genomes of wheat contain different sets

810

of celiac disease epitopes, BMC Genomics. 7 (2006) 1.

811

[27] E. García, M. Llorente, A. Hernando, R. Kieffer, H. Wieser, E. Mendez,

812

Development of a general procedure for complete extraction of gliadins for

813

Ac ce

pt

ed

806

814

heat processed and unheated foods, Eur J Gastroenterol Hepatol. 17 (2005) 529-539.

815

[28] E. Méndez, C. Vela, U. Immer, F.W. Janssen, Report of a collaborative trial

816

to investigate the performance of the R5 enzyme linked immunoassay to

817 818

determine gliadin in gluten-free food, Eur J Gastroenterol Hepatol. 17 (2005) 1053-1063.

819

[29] I. Valdés, E. García, M. Llorente, E. Méndez, Innovative approach to low-

820

level gluten determination in foods using a novel sandwich enzyme-linked

821

immunosorbent assay protocol, Eur J Gastroenterol Hepatol. 15 (2003)

822

465-474.

35 Page 35 of 49

823

[30] B. Geßendorfer, H. Wieser, P. Koehler, Optimisation of a solvent for the

824

complete extraction of prolamins from heated foods, J Cereal Sci. 52

825

(2010) 331-332.

826

[31] P. Kanerva, T. Sontag-Strohm, O. Brinck, H. Salovaara, Improved

827

extraction of prolamins for gluten detection in processed foods, Agr Food

828

Sci. 20 (2011) 206-216. [32] M.C.

Mena,

M.

Lombardía,

A.

Hernando,

E.

Méndez,

J.P.

Albar,

ip t

829

Comprehensive analysis of gluten in processed foods using a new

831

extraction method and a competitive ELISA based on the R5 antibody,

832

Talanta. 91 (2012) 33-40.

cr

830

[33] C. Diaz-Amigo, B. Popping, Accuracy of ELISA detection methods for

834

gluten and reference materials: A realistic assessment, J Agric Food Chem.

835

61 (2013) 5681-5688.

us

833

[34] D. Stepniak, L.W. Vader, Y. Kooy, P.A. Veelen, A. Moustakas, N.A.

837

Papandreou, E. Eliopoulos, J.W. Drijfhout, G.K. Papadopoulos, F. Koning,

838

T-cell recognition of HLA-DQ2-bound gluten peptides can be influenced by

839

an N-terminal proline at p-1, Immunogenetics. 57 (2005) 8-15.

M

an

836

[35] M. Koeberl, D. Clarke, A.L. Lopata, Next generation of food allergen

841

quantification using mass spectrometric systems, J Proteome Res. 13

842

(2014) 3499-3509. [36] C.

Diaz-Amigo,

B.

Popping,

Gluten

and

glutenfree:

Issues

and

pt

843

ed

840

considerations of labeling regulations, detection methods, and assay

845

validation, J AOAC Int. 95 (2012) 337-348.

Ac ce

844 846

[37] R. Haraszi, H. Chassaigne, A. Maquet, F. Ulberth, Analytical methods for

847

detection of gluten in food—Method developments in support of food

848

labeling legislation, J AOAC Int. 94 (2011) 1006-1025.

849

[38] S.B. Altenbach, W.H. Vensel, F.M. DuPont, Integration of transcriptomic

850

and proteomic data from a single wheat cultivar provides new tools for

851

understanding the roles of individual alpha gliadin proteins in flour quality

852

and celiac disease, J Cereal Sci. 52 (2010) 143-151.

853

[39] K.L. Fiedler, S.C. McGrath, J.H. Callahan, M.M. Ross, Characterization of

854

grain-specific peptide markers for the detection of gluten by mass

855

spectrometry, J Agric Food Chem. 62 (2014) 5835-5844.

856 857

[40] B. Prandi, A. Faccini, T. Tedeschi, A. Cammerata, D. Sgrulletta, M. D’Egidio,

G.

Galaverna,

S.

Sforza,

Qualitative

and

quantitative

36 Page 36 of 49

858

determination of peptides related to celiac disease in mixtures derived

859

from different methods of simulated gastrointestinal digestion of wheat

860

products, Anal Bioanal Chem. 406 (2014) 4765-4775. [41] B. Prandi, A. Faccini, T. Tedeschi, G. Galaverna, S. Sforza, LC/MS analysis

862

of proteolytic peptides in wheat extracts for determining the content of the

863

allergen amylase/trypsin inhibitor CM3: Influence of growing area and

864

variety, Food Chem. 140 (2013) 141-146.

ip t

861

[42] J.A. Sealey-Voyksner, C. Khosla, R.D. Voyksner, J.W. Jorgenson, Novel

866

aspects of quantitation of immunogenic wheat gluten peptides by liquid

867

chromatography-mass spectrometry/mass spectrometry, J Chromatogr A.

868

1217 (2010) 4167-4183.

us

cr

865

[43] L. Uvackova, L. Skultety, S. Bekesova, S. McClain, M. Hajduch, MSE based

870

multiplex protein analysis quantified important allergenic proteins and

871

detected relevant peptides carrying known epitopes in wheat grain

872

extracts, J Proteome Res. 12 (2013) 4862-4869.

an

869

[44] A. Camarca, R.P. Anderson, G. Mamone, O. Fierro, A. Facchiano, S.

874

Costantini, D. Zanzi, J. Sidney, S. Auricchio, A. Sette, R. Troncone, C.

875

Gianfrani, Intestinal T cell responses to gluten peptides are largely

876

heterogeneous: Implications for a peptide-based therapy in celiac disease,

877

J Immunol. 182 (2009) 4158-4166. [45] W. Vader, Y. Kooy, P. van Veelen, A. de Ru, D. Harris, W. Benckhuijsen, S.

879

Peña, L. Mearin, J.W. Drijfhout, F. Koning, The gluten response in children

880

with celiac disease is directed toward multiple gliadin and glutenin

881

pt

878

Ac ce

ed

M

873

peptides, Gastroenterology. 122 (2002) 1729-1737.

882

[46] A. Camarca, A. Del Mastro, C. Gianfrani, Repertoire of gluten peptides

883

active in celiac disease patients: Perspectives for translational therapeutic

884 885

applications, Endocr Metab Immune Disord Drug Targets. 12 (2012) 207219.

886

[47] J.A. Tye-Din, J.A. Stewart, J.A. Dromey, T. Beissbarth, D.A. van Heel, A.

887

Tatham, K. Henderson, S.I. Mannering, C. Gianfrani, D.P. Jewell, A.V.S.

888

Hill,

889

quantitative mapping of T cell epitopes in gluten in celiac disease, Sci

890

Transl Med. 2 (2010) 41ra51.

J.

McCluskey,

J.

Rossjohn,

R.P.

Anderson,

Comprehensive,

891

[48] R.P. Anderson, P. Degano, A.J. Godkin, D.P. Jewell, A.V.S. Hill, In vivo

892

antigen challenge in celiac disease identifies a single transglutaminase37 Page 37 of 49

893

modified peptide as the dominant -gliadin T-cell epitope, Nature Med. 6

894

(2000) 337-342.

895

[49] H.C. van den Broeck, A.H.P. America, M.J.M. Smulders, D. Bosch, R.J.

896

Hamer, L.J.W.J. Gilissen, I.M. van der Meer, A modified extraction protocol

897

enables detection and quantification of celiac disease-related gluten

898

proteins from wheat, J Chromatogr B. 877 (2009) 975-982. [50] M. Stapels, K. Fadgen, A reproducible online 2D reversed phase-reversed

900

phase high-low pH method for qualitative and quantitative proteomics,

901

Spectroscopy

902

http://www.spectroscopyonline.com/node/225338.

Special

issue

(2009)

cr

online.

ip t

899

[51] M. Gilar, P. Olivova, A.E. Daly, J.C. Gebler, Two-dimensional separation of

904

peptides using RP-RP-HPLC system with different pH in first and second

905

separation dimensions, J Sep Sci. 28 (2005) 1694-1703.

us

903

[52] B. MacLean, D.M. Tomazela, N. Shulman, M. Chambers, G.L. Finney, B.

907

Frewen, R. Kern, D.L. Tabb, D.C. Liebler, M.J. MacCoss, Skyline: an open

908

source document editor for creating and analyzing targeted proteomics

909

experiments, Bioinformatics. 26 (2010) 966-968.

M

head of bacteriophage T4, Nature. 227 (1970) 680-685. [54] J.H.G. Cordewener, G. Hause, E. Görgen, R. Busink, B. Hause, H.J.M.

913

Dons, A.A.M. Van Lammeren, M.M. Van Lookeren Campagne, P. Pechan,

914

Changes in synthesis and localization of members of the 70-kDa class of

915

heat-shock proteins accompany the induction of embryogenesis in Brassica

916

pt

912

Ac ce

911

[53] U.K. Laemmli, Cleavage of structural proteins during the assembly of the

ed

910

an

906

napus L. microspores, Planta. 196 (1995) 747-755.

917

[55] C. Mitea, Y.M.C. Kooy-Winkelaar, P.A. van Veelen, A. de Ru, J.W. Drijfhout,

918

F. Koning, L. Dekking, Fine specificity of monoclonal antibodies against

919 920

celiac disease-inducing peptides in the gluteome, Am J Clin Nutr. 88 (2008) 1057-1066.

921

[56] E.H.A. Spaenij-Dekking, E.M.C. Kooy-Winkelaar, W.F. Nieuwenhuizen, J.W.

922

Drijfhout, F. Koning, A novel and sensitive method for the detection of T

923

cell stimulatory epitopes of - and -gliadin, Gut. 53 (2004) 1267-1273.

924

[57] C. Mitea, R. Havenaar, J.W. Drijfhout, L. Edens, L. Dekking, F. Koning,

925

Efficient

degradation

of

gluten

by

a

prolyl

endoprotease

in

a

926

gastrointestinal model: implications for coeliac disease, Gut. 57 (2008) 25-

927

32. 38 Page 38 of 49

[58] E.M.J. Salentijn, S. Goryunova, N. Bas, I.M. van der Meer, H.C. van den

929

Broeck, T. Bastien, L.J.W.J. Gilissen, M.J.M. Smulders, Tetraploid and

930

hexaploid wheat varieties reveal large differences in expression of alpha-

931

gliadins from homoeologous Gli-2 loci, BMC Genomics. 10 (2009) 48.

932

[59] L. Spaenij-Dekking, Y. Kooy-Winkelaar, P. van Veelen, J.W. Drijfhout, H.

933

Jonker, L. van Soest, M.J.M. Smulders, D. Bosch, L.J.W.J. Gilissen, F.

934

Koning, Natural variation in toxicity of wheat: potential for selection of

935

nontoxic varieties for celiac disease patients, Gastroenterology. 129 (2005)

936

797-806.

cr

ip t

928

[60] H.C. van den Broeck, H.C. de Jong, E.M.J. Salentijn, L. Dekking, D. Bosch,

938

R.J. Hamer, L.J.W.J. Gilissen, I.M. van der Meer, M.J.M. Smulders,

939

Presence of celiac disease epitopes in modern and old hexaploid wheat

940

varieties: wheat breeding may have contributed to increased prevalence of

941

celiac disease, Theor Appl Genet. 121 (2010) 1527-1539.

an

us

937

[61] H.C. van den Broeck, C. Hongbing, X. Lacaze, J.-C. Dusautoir, L.J.W.J.

943

Gilissen, M.J.M. Smulders, I.M. van der Meer, In search of tetraploid wheat

944

accessions reduced in celiac disease-related gluten epitopes, Mol Biosyst. 6

945

(2010) 2206-2213.

M

942

[62] E. Halbmayr-Jech, E. Hammer, R. Fielder, J. Coutts, A. Rogers, M. Cornish,

947

Characterization of G12 sandwich ELISA, a next-generation immunoassay

948

for gluten toxicity, J AOAC Int. 95 (2012) 372-376.

949

[63] J. Šalplachta, M. Marchetti, J. Chmelík, G. Allmaier, A new approach in

950

proteomics of wheat gluten: combining chymotrypsin cleavage and matrix-

951

Ac ce

pt

ed

946

952

assisted laser desorption/ionization quadrupole ion trap reflectron tandem mass spectrometry, Rapid Commun Mass Sp. 19 (2005) 2725-2728.

953

[64] B. Prandi, M. Bencivenni, A. Faccini, T. Tedeschi, A. Dossena, R. Marchelli,

954

G. Galaverna, S. Sforza, Composition of peptide mixtures derived from

955 956 957

simulated gastrointestinal digestion of prolamins from different wheat varieties, J Cereal Sci. 56 (2012) 223-231.

[65] P. Cressey, P. Grounds, S. Jones, S. Ashmore, D. Saunders, Gluten

958

residues

in

gluten-free

foods

sold

in

Christchurch,

New

Zealand:

959

comparison of LC-MS and ELISA methods, Qual Assur Saf Crop. 5 (2013)

960

207-213.

39 Page 39 of 49

961

[66] H. Wieser, R. Kieffer, Correlations of the amount of gluten protein types to

962

the technological properties of wheat flours determined on a micro-scale, J

963

Cereal Sci. 34 (2001) 19-27.

964

[67] O.D. Anderson, F.C. Greene, The α-gliadin gene family. II. DNA and

965

protein

sequence

variation,

subfamily

structure,

966

pseudogenes, Theor Appl Genet. 95 (1997) 59-65.

and

origins

of

[68] B. Prandi, M. Bencivenni, T. Tedeschi, R. Marchelli, A. Dossena, G.

968

Galaverna, S. Sforza, Common wheat determination in durum wheat

969

samples through LC/MS analysis of gluten peptides, Anal Bioanal Chem.

970

403 (2012) 2909-2914.

cr

ip t

967

[69] E. Salentijn, D. Esselink, S. Goryunova, I. van der Meer, L. Gilissen, M.

972

Smulders, Quantitative and qualitative differences in celiac disease

973

epitopes among durum wheat varieties identified through deep RNA-

974

amplicon sequencing, BMC Genomics. 14 (2013) 905.

an

us

971

[70] K. Kawaura, K. Mochida, Y. Ogihara, Expression profile of two storage-

976

protein gene families in hexaploid wheat revealed by large-scale analysis

977

of expressed sequence tags, Plant Physiol. 139 (2005) 1870-1880.

M

975

[71] P.R. Shewry, R.A.C. Mitchell, P. Tosi, Y. Wan, C. Underwood, A. Lovegrove,

979

J. Freeman, G.A. Toole, E.N.C. Mills, J.L. Ward, An integrated study of

980

grain development of wheat (cv. Hereward), J Cereal Sci. 56 (2012) 21-

981

30. Katagiri,

pt

[72] M.

T.

Masuda,

F.

Tani,

N.

Kitabatake,

Expression

and

983

development of wheat proteins during maturation of wheat kernel and the

984

Ac ce

982

ed

978

985 986

rheological properties of dough prepared from the flour of mature and immature Wheat, Food Sci Technol Res. 17 (2011) 111-120.

40 Page 40 of 49

986

Figure legends

988

Fig. 1.

989

Sequence alignment of the CD-epitope domain of multiple -gliadin proteins from

990

wheat. The deduced amino acid sequences of multiple EST contigs (present in

991

the NCBI-UniGene database) were searched for sequence variants in the CD-

992

epitope containing domain. Homologous ESTs for this sequence domain were

993

grouped and only variants found in more than 5 ESTs (n) are displayed. The

994

variants are grouped per genome origin from which the ESTs are expressed: Gli-

995

A2, Gli-B2, or Gli-D2. The CD-epitopes, known to trigger the adaptive HLA-DQ2+

996

T-cell response (Glia-2/Glia-9, and Glia-20) in CD-patients, are color

997

indicated. In yellow: Glia-2/Glia-9 epitopes. In green: Glia-20 epitope, in

998

blue: 33-mer containing repetitive epitopes. In black: chymotrypsin digestion

999

sites. In red: amino acid (AA) variation in the sequence. In bold underlined:

1000

sequences selected for peptide synthesis. The peptide number code is indicated

1001

in the right-most column. Figure adapted from Mitea et al. [24].

1002

Ac ce

pt

ed

M

an

us

cr

ip t

987

1003

Fig. 2.

1004

Optimization of chymotrypsin digestion conditions for hexaploid wheat variety

1005

Toronto and tetraploid wheat variety Dibillik sinde. A) Release of P1 after

1006

incubation for 10’, 30’, 1 h, 2h, and 4 h at 25C with enzyme:protein ratio of

1007

1:20. The same trend was observed for peptides P2-P5. B) Release of P1 after

1008

incubation for 2 h, 4 h, 8 h, and 18 h at 25C with enzyme:protein ratios of 1:5

1009

and 1:10. The same trend was observed for P2-P5. C) Release of P6 after

1010

incubation for 2 h, 4h, and 8 h at 25C with enzyme:protein ratios of 1:5, 1:10,

41 Page 41 of 49

1011

and 1:20. D) Release of P6 after incubation for 2 h, 4 h, 8 h, 18 h, and 22 h

1012

(extra enzyme addition after 18 h) at 25C with enzyme:protein ratio of 1:5.

1013 Fig. 3.

1015

Total amounts of peptides containing CD-epitopes in mg -gliadin/g gluten (103

1016

ppm) present in three wheat varieties, Toronto (AABBDD), Minaret (AABBDD),

1017

and Dibillik sinde (AABB). Containing A) Glia-2/9 epitopes and B) Glia-20

1018

epitopes.

cr

ip t

1014

us

1019 Supplementary Fig. S1.

1021

Chromatograms, ion intensity maps, of the low pH peptide resolution in the four

1022

subsequent 2D LC-MS fractions of a ‘Minaret’ gluten protein digest by using raw

1023

MSE (DIA) data processed by Progenesis QI software. X-axis represents m/z

1024

value. Y-axis represents retention time in minutes.

1025

ed

M

an

1020

Supplementary Fig. S2.

1027

Chromatograms obtained from LC-MRM/MS showing the resolution of the nine

1028

selected marker peptides. A) Mixture of the nine marker peptides used for

1029

calibration and B) ‘Toronto’ gluten protein digest (22 h). The inlaid images show

1030

an example of the multiple transition signals per peptide as analyzed by Skyline

1031

software. X-axis represents retention time in minutes. Y-axis represents signal

1032

intensity (106).

Ac ce

pt

1026

1033 1034

Supplementary Fig. S3.

42 Page 42 of 49

1056 1057 1058

Highlights

1059

1061

 Targeted detection and quantification of multiple celiac disease stimulating

ip t

1060

epitopes.

 High-throughput screening for quantification of CD-epitopes in wheat.

1063

 Method enables selection of wheat varieties with reduced levels of CD-

us

1065

epitopes.

 LC-MRM/MS technology as a standard method for quantifying CD-epitopes.

an

1064

cr

1062

Ac ce

pt

ed

M

1066

44 Page 43 of 49

Ac

ce

pt

ed

M

an

us

cr

i

Figure 1

Page 44 of 49

Ac

ce

pt

ed

M

an

us

cr

i

Figure 2

Page 45 of 49

Ac ce p

te

d

M

an

us

cr

ip t

Figure 3

Page 46 of 49

ip t

Table 1

7

8a

9

10

11*

12

13

14b

15

16

17

18

19c

20

21d

22

23e

24

Q3S4V8_WHEAT

A5JSB3_WHEAT

I3RXV6_AEGTA

Q9M4M4_WHEAT

A5JSA6_WHEAT

A5JSA9_WHEAT

K7X0Q3_WHEAT

I0IT56_WHEAT

Q9M4L8_WHEAT

K7WV47_WHEAT

G9I1T5_AEGUN

Q2QL43_AEGTA

F6M8F7_9POAL

A5JSB7_WHEAT

M4WFD9_AEGSP

K7X1J5_WHEAT

Q2QL52_AEGTA

us

K7XEB5_WHEAT

6

F4YT74_WHEAT

Q0GK30_TRITI

Accession first hit

ce pt

Ac

*



 







 

ed

Peptide no. Peptide sequences LQLQPFPQPQPF LQLQPFPQPQLPY P1 LQLQPFPQPQLSY LQLQPFPQPQLPYPQPQPF P2 MQLQPFPQPQLPYPQPQLPYPQPQPF LQLQPFPQPQLPYPQPHLPYPQPQPF P3 LQLQPFPQPQLPYPHPQLPYPQPQPF LQLQPFPQPQLPYPQPQLPYPQPQPF P4 LQLQPFPQPQLPYPQPQLPYPQPQLPYPQPQPF P5 LPQLPYPQPQPFPPQQSYPQPQPQYPQPQQPISQQQAQL RPQQLYPQPQPQY RPQQPYPQPQPQY P6 RPQQPYPQSQPQY P7 RPQQSYPQPQPQY P8 QQQLIPCRDVVL P9 QQILQQQLIPCRDVVL QQQLIPCMDVVL QQILQQQLIPCMDVVL a, b Sequences from Triticum aestivum, probable B-genome c Sequence from Aegilops uniaristata, N-genome d Sequences from Triticum compactum, not full length e Sequences from Aegilops speltoides

-gliadin families

5

J7I026_WHEAT

4

K7WV12_AEGTA

3

I3RXV8_AEGTA

2

M an

1

Q9M4L6_WHEAT

cr

Table 1. Selection of peptide sequences identified from -gliadin families in Triticum and Aegilops species in Mascot search (Uniprot database). Selection of the sequences is based on immunogenic epitopes sequences. Peptide sequences shown include the nine selected peptide sequences used for quantification (P1-P9). Amino acid substitutions in peptide sequences are shown in italics. CD-epitope sequences within the peptides are shown in bold.

    





 

    

 

   









 

      

    



             

    

      



Score 46 54 54 47 41 73 60 39 53 33 46 67 59 27 52 51 40

Sequence from Aegilops tauschii, contains Glia-9 epitope on a different peptide

Page 47 of 49

QQILQQQLIPCRDVVL

cr us

14 15 20 21 19 19 29 18 26 16 35 22

6.8 7.0 7.2 6.9 7.3 4.6 4.4 4.4 5.8 5.8 6.3 6.3

6-7.5 6.6-7.5 6.7-8 6.6-7.5 6.7-8 3-5 4-5 4-5 5.5-6 5.5-6 6-6.6 6-6.6

R2 0.987 0.981 0.974 0.969 0.970 0.993 0.991 0.994 0.991 -

Ac

ce pt

MRM schedule (min)

P9

Retention time (min)

QQQLIPCRDVVL

M an

P8

784.9 (2+) 755.1 (3+) 1029.5 (3+) 1032.5 (3+) 978.3 (4+) 813.9 (2+) 808.9 (2+) 539.6 (3+) 734.9 (2+) 490.3 (3+) 976.0 (2+) 651.0 (3+)

Fragment ions m/z 483.3 (b4); 617.3 (y5); 727.4 (b6); 842.4 (y7); 952.5 (b8); 1290.7 (b11) 483.3 (b4); 488.3 (y4); 713.4 (y6); 727.4 (b6); 952.5 (b8); 973.5 (y8) 483.3 (b4); 713.4 (y6); 727.4 (b6); 824.5 (b7); 952.5 (b8); 973.5 (y8) 488.3 (y4); 645.9 (b11); 713.4 (y6); 727.4 (b6); 903.5 (y15); 952.5 (b8) 483.3 (b4); 488.3 (y4); 713.4 (y6); 727.4 (b6); 824.5 (b7); 952.5 (b8); 973.5 (y8) 407.2 (y3); 632.3 (y5); 770.4 (b6); 857.4 (y7); 995.5 (b8); 1220.6 (b10) 510.3 (b4); 770.4 (b6) 407.2 (y3); 510.3 (b4); 770.4 (b6); 847.4 (y7) 858.5 (y7); 971.5 (y8); 1212.7 (y10) 498.3 (b4); 611.4 (b5); 858.5 (y7) 858.5 (y7) 858.5 (y7); 867.5 (b7); 971.5 (y8); 980.6 (b8)

ed

P1 P2 P3 P4 P5 P6 P7

Sequence LQLQPFPQPQLPY LQLQPFPQPQLPYPQPQPF LQLQPFPQPQLPYPQPQLPYPQPQPF LQLQPFPQPQLPYPQPHLPYPQPQPF LQLQPFPQPQLPYPQPQLPYPQPQLPYPQPQPF RPQQPYPQPQPQY RPQQPYPQSQPQY

Precursor m/z (charge state)

Peptide

Table 2. MS conditions of the selected marker peptides.

Collision energy (V)

ip t

Table 2

Page 48 of 49

ip t

Table 3

ed

M an

us

cr

Table 3. Amounts of CD-epitopes present in three wheat varieties: Toronto (AABBDD), Minaret (AABBDD), and Dibillik sinde (AABB). Amounts in 3 mg -gliadin containing CD-epitope/g gluten (10 ppm) at t=22h of peptides from -gliadins containing CD-epitopes, (±SD). peptide sequence Toronto Minaret Dibillik sinde LQLQPFPQPQLPY P1 123.20 (±18.27) 56.14 (±7.07) 122.77 (±19.85) LQLQPFPQPQLPYPQPQPF P2 22.65 (±3.60) 14.93 (±1.23) nd LQLQPFPQPQLPYPQPHLPYPQPQPF P3 12.47 (±0.83) 8.82 (±0.10) nd LQLQPFPQPQLPYPQPQLPYPQPQPF P4 3.34 (±0.17) 2.18 (±0.11) nd LQLQPFPQPQLPYPQPQLPYPQPQLPYPQPQPF P5 56.26 (±4.51) 31.71 (±1.09) nd RPQQPYPQPQPQY P6 133.09 (±8.93) 49.82 (±3.28) 18.69 (±1.38) RPQQPYPQSQPQY P7 13.41 (±1.95) 9.26 (±1.08) nd P8 QQQLIPCRDVVL 162.10 (±21.61) 137.42 (±17.97) 98.75 (±13.06) P9 QQILQQQLIPCRDVVL nd nd nd 217.92 (±27.38) 113.79 (±9.60) 122.77 (±19.85) sum[P1-P5] Glia-2/9 P6 + P7 146.49 (±10.88) 59.08 (±4.36) 18.69 (±1.38) Glia-20 P8 + P9 162.10 (±21.61) 137.42 (±17.97) 98.75 (±13.06) Total -gliadin

Ac

ce pt

nd, not detected

Page 49 of 49

Label free targeted detection and quantification of celiac disease immunogenic epitopes by mass spectrometry.

Celiac disease (CD) is a food-related disease caused by certain gluten peptides containing T-cell stimulating epitopes from wheat, rye, and barley. CD...
488KB Sizes 0 Downloads 11 Views