Gut, 1990,31,497-499

497

Leading article Understanding the molecular basis of coeliac disease Coeliac disease in children and adults is characterised by small intestinal mucosal injury and malabsorption. For almost two decades, it has been recognised that coeliac disease occurs much more frequently in individuals having certain human leucocyte antigens. Human leucocyte antigens in man are encoded by genes in the major histocompatibility complex on the short arm of chromosome 6.' The major histocompatibility complex contains genes that encode a broad array of proteins. These genes can be divided into three major groups; the class I, II, and III genes (Figure). Class I and class II genes code for specialised and highly polymorphic cell surface glycoproteins which play a key role in antigen recognition by T cells. Class III genes code for components of the complement system - for example, C4A, C4B, Bf, and C2. The genes encoding steroid 21 hydroxylase also map to that region. The Class I genes code for molecules which are present on the surface of all nucleated cells. Human leucocyte antigen (HLA) class I molecules present antigen to T cells which express the CD8 protein on their surface. Thus, CD8 T cells are said to be class I restricted. Class II genes are encoded in the HLA-D region at the centromeric end of the major histocompatibility complex. The HLA-D region can be divided into three major subregions termed -DP, -DQ, and -DR (Figure). Each class II molecule on the cell surface occurs as a heterodimer containing an alpha chain encoded by an A gene and a beta chain encoded by a B gene. HLA class II molecules are present on the surface of B cells, macrophages, dendritic reticular cells, and activated T cells. They can also be expressed, under certain circumstances, however, on other cell types including intestinal epithelial cells.2 Moreover, their expression on cells can be up-regulated by T cell lymphokines such as gamma interferon. HLA class II molecules present antigen to T cells which have the CD4 molecule. Thus, CD4 T cells are said to be class II restricted.The HLA-D region contains multiple genes in addition to those described above (Figure). These include pseudogenes within DP (DPB2, DPA2) and DR (DRB2), DQB2 and DQA2 genes (also known as DX) which are not transcribed, and a DN A and DO B gene.3 Because protein products of these genes are not expressed on the cell surface, r

Class 11

Class 1111 m-- -i

i-1

|

-SP ITNF

DN DO

DP

nI n

B2 A2 Bl Al

A

B

DO

B C:1

-i

B2A2 B1 Al

0

.

,a

Molecules

L -J DQ

Molecules

E

A

Short arm chromosome 6

DR:~

r B11

II

BIa DP

Class

l~

I

B2 B3B4 A

9~4i~lS~ D Region genes D,3a

DR Molecules

a I

D Region encoded cell surface molecules

Figure: Major histocompatibility complex on chromosome 6.

they do not appear to have a role in antigen presentation to T cells. In addition to HLA class I and II genes, and complement genes, the major histocompatibility complex contains genes that encode tumour necrosis factor a and 13, and heat shock proteins - 70.45 Recent studies have also described five genes (termed BAT genes), whose function thus far is unknown.6 Those genes map between C2 and HLA B - that is, in the region that also contains the tumour necrosis factor and heat shock proteins - 70 genes. Initially, coeliac disease was associated with the HLA class I B locus marker -B8, as defined by serology.7 Later, it was recognised, also by serology, that coeliac disease had a still stronger association with the class II D region marker, HLADR3.5 The association with HLA-B8 simply appeared to reflect strong linkage disequilibrium between the allele that encodes HLA-B8 and the allele that encodes HLA-DR3. Polymorphism is an important characteristic of HLA class I and II major histocompatibility complex genes and their protein products. For example, there are 50 or more different alleles encoded by the class I HLA B locus and 18 or more different alleles at a single class II DR locus - that is, DRB1. Despite this extensive polymorphism and the theoretical possibility of enormous numbers of different HLA haplotypes within a given race, a relatively limited number of different haplotypes are present in most members of the population. Further, coeliac disease patients, like those with several other HLA-associated diseases - for example, insulin dependent diabetes mellitus and rheumatoid arthritis, carry a specific constellation of major histocompatibility antigen alleles, many of which are inherited together on the disease associated haplotype. The HLA B8, DR3 haplotype which is associated with coeliac disease is common among northern European caucasians.9 With the advent of convenient methods for cloning and sequencing of the polymorphic regions of the HLA class II genes, it became clear that their protein products were substantially more polymorphic than had been recognised by serology. Thus, there are at least eight different alleles of the DQA1 gene, and 19 different alleles of the DPB1 gene.3I1t2 In addition, there are multiple DRB gene loci - that is, DRB1, DRB3, DRB4 that code for expressed DR 13 chains, although some haplotypes lack the DRB3 and/or DRB4 locus.3 The DRB1 locus, which encodes the -DR3 serologic marker, is particularly polymorphic. In contrast, the DPA1 gene is relatively non-polymorphic (two alleles)'2 and the DR A gene is non-polymorphic. HLA-DR3 (DRw17) and -DQw2 are present in 90% or more of northern European caucasians with coeliac disease. In caucasians HLA-DQw2 is strongly linked to -DR3 (DRw17)'3 The same markers, however, can be found in as many as 25-30% of individuals, within the same population, who do not have coeliac disease. Further, less than 0 1% of individuals with the serologic markers HLA-DR3 and -DQw2 develop coeliac disease. This limits the value of the DR3 and DQw2 antigens as serologic markers of disease, and led us to examine the HLA class II genes associated with coeliac disease at the molecular level. To define the genetic markers for coeliac disease on the DR3, DQw2 haplotype more precisely, we used restriction

498

fragment length polymorphism analysis.'4 Our studies revealed that approximately 95% of -DR3, DQw2 coeliac patients had a polymorphic 4-0 kb Rsa I DNA fragment, derived from a HLA class II B gene, that was present in less than 30% of healthy -DR3, DQw2 matched controls. I'. This restriction fragment length polymorphism helped to distinguish the -DR3, -DQw2 haplotype associated with coeliac disease from the DR3, DQw2 haplotype present in most controls. In further studies, the HLA class II B gene associated with this restriction fragment length polymorphism was isolated from a genomic library and shown to encode a HLA -DP beta chain. 16 These findings indicate that susceptibility genes for coeliac disease may be encoded as far centromeric in the class II D region as the HLA-DP subregion. Additional evidence for the importance of genes in or linked to HLA-DP in determining coeliac disease susceptibility was provided by the finding that the 4 0 Kb DP B gene restriction fragment length polymorphism was as prevalent in individuals who lacked -DR3, DQw2 as in healthy -DR3, DQw2 subjects.'6 Thus, in non-coeliacs, the restriction fragment length polymorphism did not preferentially associate with -DR3,

DQw2. Our studies suggest that the HLA-associated susceptibility to coeliac disease is multigenic, with genes in the DQ/DR subregion and genes linked to, or in the DP subregion determining disease susceptibility. Generally there is a high degree of recombination, estimated to be 2-5%, between the

HLA DP and the HLA DR/DQ subregions and therefore only weak linkage disequilibrium between genes in those subregions. 1' Coeliac disease may select for an extended HLA haplotype that includes the genes coding for -DR3, -DQw2 and a particular DP allele marked by the 4 0 Kb DP B gene restriction fragment length polymorphism. Alternatively, it could be that the putative DP-linked susceptibility gene is present on the other haplotype. This work poses two further questions. First, is there a unique DR, DQ, or DP structural variant encoded within the HLA class II region in patients with coeliac disease? Second, which specific HLA-DP B genes are associated with coeliac disease? The greatest polymorphism in HLA class II genes occurs within the second exon sequences that encode the

portion of the HLA class II molecule that is involved in binding antigen and in interacting with the T cell receptor for antigen.'8 To address the first question, we sequenced the second exons of DR, DQ and DP genes (DRBL, B3, B4; DQA 1, B 1; DPB 1) from heterozygous coeliac disease patients having the serologic markers DR3, DQw2, and the 4 0 Kb Rsa 1 DPB gene restriction fragment length polymorphism. The relevant gene segments were amplified from genomic DNA using the polymerase chain reaction (PCR),'9 and then cloned and sequenced. In each case, the sequences from coeliac disease patients were identical to sequences that are also present, although at a significantly lower frequency, in unaffected individuals.20 This finding is compatible with a model in which specific HLA class II genes are necessary, but not sufficient, for the phenotypic expression of coeliac disease. In support of that model, only 25-30% of siblings who appear to share one or both HLA haplotypes (as determined by DR serology and, by inference DQ), are concordant for coeliac disease.2' Moreover, our findings in coeliac disease parallel those in other HLA class II associated diseases. Thus, specific alleles of HLA-DR and -DQ on various HLA DR4 haplotypes are over represented in patients with insulin dependent diabetes mellitus, rheumatoid arthritis and pemphigus vulgaris, compared with controls. However, these patients do not have class II gene sequences that are unique to the disease.'022 24 To analyse the HLA class II DPB genes associated with

Kagnoff

coeliac disease, sequence specific oligonucleotides were used to probe PCR amplified second exons of DP B genes from coeliac disease patients. Those studies revealed a significant over-representation in coeliac disease patients of the relatively rare DPB alleles, DPB1 and DPB3.20 Furthermore, the increased frequency of those alleles in coeliac disease accounted for the increased frequency of the 4-0 Kb Rsa I DPB gene restriction fragment length polymorphism in that disease.20 Our patients were mostly of northern European caucasian ancestry. Specific DP alleles, however, also appear to determine coeliac disease susceptibility among Italians. In that population, DPB4-2 and DPB3, but not DPB1, were most important.25 Italian coeliac disease patients also differ from northern European Caucasian coeliac disease patients in the distribution of DR markers as determined by serology26 that is, -DR3 (DRw") less common; DR5/7 heterozygotes more common.

Several conclusions can be drawn from the available data regarding which HLA class II genes are most important in determining coeliac disease susceptibility. Within the DR/ DQ subregions, the DQB2 allele is important for coeliac disease susceptibility. The DQB2 allele encodes the DQw2 serologic specificity and is present in virtually all caucasian coeliacs with -DR3 (DRw17). In addition, the DQB2 allele is present in most individuals with the -DR7 serologic specificity27 which is often associated with coeliac disease in southern Europeans (Spain, Italy). Coeliac disease patients with -DR7, however, usually are heterozygous for -DR5 or -DR3 on the second chromosome. In this regard, a specific DQ A allele (DQA4-1)"' that is common to DR5 and DR3 haplotypes also has been associated with coeliac disease.2829 The finding of increased susceptibility to coeliac disease in individuals with -DR7 - that is, having the DQB2 allele but lacking the DQA4- 1 allele, associated with heterozygosity for -DR5 - that is, lacking the DQB2 allele but having the DQA4- 1 allele, suggests that a DQ molecule encoded by both DQB2 and DQA4 1 alleles contributes to coeliac disease susceptibility. This DQ molecule, which is recognised as DQw2 by serology, would be encoded in cis - that is on the same chromosome, on -DR3 (DRw17) haplotypes and in trans on DR5/DR7 haplotypes that is by the DQA4- 1 allele from the chromosome having DR5 and the DQB2 allele from the chromosome having DR7. In further support of the importance of DQw2, coeliac disease is unusual among black populations in the United States. In this regard, the -DR3 haplotype present in many blacks has undergone recombination between DQ and DR such that those -DR3 (DRw18) individuals often lack DQw2.3° A final conclusion relates to the role of DP genes in determining coeliac disease susceptibility. There is a striking increase in coeliac disease among individuals with DQw2 who also have specific DP B alleles - that is, the DPB1 or DPB3 allele in Northern European caucasians; DP4-2 or DP3 in Italians. The association of those DP B alleles with coeliac disease may reflect a direct effect of those genes, or an effect of genes linked to the DPB locus. In either case, more than one gene within the major histocompatibility complex appears to determine coeliac disease susceptibility. What unique feature of those HLA class II alleles results in increased susceptibility to coeliac disease? To address this question, one can compare the DQ and DP alleles associated with coeliac disease susceptibility to those not associated with disease. The goal of such an approach is directed towards defining putative epitopes on the susceptibility alleles that differ from these on non-susceptibility alleles. Such an analysis suggests distinct amino acid residues on the class II molecules associated with coeliac disease that may be important in determining disease susceptibility. Moreover, those residues lie within the region of the class II molecules thought to be important in peptide binding.'8

499

Understanding the molecular basis ofcoeliac disease

Several issues have not been answered. For example, the mechanism by which HLA class II D region alleles result in coeliac disease susceptibility remains enigmatic. Further, which other genes, in or outside the major histocompatibility complex, interact with the class II D region genes to increase disease susceptibility is not known, although concordance data suggest additional susceptibility genes exist. In addition, it is not known whether coeliac disease is associated with specific HLA class II alleles because of events that take place in the thymus during development of the T cell repertoire, or because of events that take place in the periphery after T cell differentiation. Finally, the specific antigens and the determinants on those antigens that interact with HLA class II molecules and T cells as a key element in the pathogenesis of coeliac disease are poorly understood. However, our studies suggest that environmental factors other than dietary grains for example, viral proteins, also play a role in determining coeliac disease susceptibility.332 M F KAGNOFF

University ofCalifornia, San Diego La Jolla, CA 92037, USA Supported by NIH Grant DK35108. I would like to thank Ms V Shu for preparation of the manuscript. 1 Korman AJ, Boss JM, Spies T, Sorrentino R, Okada K, Strominger JL. Genetic complexity and expression of human Class II histocompatibility antigens. Immunol Rev 1985; 85: 45-86. 2 Volk BA, Brenner DA, Kagnoff MF. Analysis of RNA transcripts for HLA class II genes in human small intestinal biopsies. Gut 1989; 30: 1220-7. 3 Nomenclature for factors of the HLA system. Immunogenetics 1988; 28: 391-8. 4 Spies T, Morton CC, Nedospasov SA, Fiers W, Pious D, Genes for the tumor necrosis factors a and f3 are linked to the human major histocompatibility complex. Proc Natl Acad Sci 1986; 83: 8699-702. 5 Sargent CA, Dunham I Trowsdale J, Campbell RD. Human major histocompatibility complex contains genes for major heat shock protein HSP70. Proc Natl Acad Sci 1989; 86: 1968-72. 6 Spies T, Blanck G, Bresnahan M, Sands J. Strominger JL. A new cluster of genes within the human major histocompatibility complex. Science 1989; 243: 214-7. 7 Falchuk ZM, Rogentine GN, Strober W. Predominance of histocompatibility antigen HL-A8 in patients with gluten-sensitive enteropathy. J Clin Invest 1972; 51: 1602-5. 8 Ek J, Albrechtsen D, Solheim BG, Thorsby E. Strong association between the HLA-Dw3-related B cell alloantigen -DRw3 and coeliac disease. Scand J Gastroenterol 1978; 13: 229-33. 9 Alper CA, Fleischnick E, Awdeh Z, Katz AJ, Yunis EJ. Extended major histocompatibility complex haplotypes in patients with gluten-sensitive enteropathy. J Clin Invest 1987; 79: 251-6.

10 Horn GT, Bugawan TL, Long CM, Erlich HA. Allelic sequence variation of the HLA-DQ loci: Relationship to serology and to insulin-dependent diabetes susceptibility. Proc Natl Acad Sci 1988; 85: 6012-6. 11 Todd JA, Bell JI, McDevitt HO. HLA-DQ,B gene contributes to susceptibility and resistance to insulin-dependent diabetes mellitus. Nature 1987; 329: 599-04. 12 Bugawan TL, Horn GT, Long CM, et al. Analysis of HLA-DP allelic sequence polymorphism using the in vitro enzymatic DNA amplification of DP-a and DP-fl loci. J Immunol 1988; 141: 4024-30. 13 Duquesnoy RJ. Immunogenetic aspects of the relationship between HLA-DR and the MB and MT systems. Transplant Proc 1981; 13: 1804-7. 14 Cohen D, Paul P, Le Gall I, etal. DNA polymorphism of HLA Class I and Class II regions. Immunol Rev 1985; 85: 87-105. 15 Howell MD, Austin RK, Kelleher D, Nepom GT, KagnoffMF. An HLA-D region restriction fragment length polymorphism associated with celiac disease. J Exp Med 1986; 164: 333-8. 16 Howell MD, Smith JR, Austin RK, et al. An extended HLA-D region haplotype associated with celiac disease. Proc Natl Acad Sci 1988; 85: 222-6. 17 Termijtelen A, Kahn PM, Shaw S, van Rood JJ. Mapping SB in relation to HLA and GLOI using cells from first-cousin marriage offspring. Immunogenetics 1983; 18: 503-12. 18 Brown HJ, Jardetzky T, Saper MA, Samraoui B, Bjorkman PJ, Wiley DC. A hypothetical model of the foreign antigen binding site of class II histocompatibility molecules. Nature 1988; 332: 845-50. 19 Saiki RK, Gelfand DH, Stoffel S, et al. Primer-directed enzymatic amplification of DNA with a thermostable DNA polymerase. Science 1988; 239: 487-1. 20 KagnoffMF, Harwood JI, Bugawan TL, Erlich HA. Structural analysis of the HLA-DR, -DQ, and -DP alleles on the celiac disease-associated -DR3 (DRw17) haplotype. Proc NatlAcadSci 1989; 86: 6274-8. 21 Mearin ML, Biemond I, Pefna AS, et al. HLA-DR phenotypes in Spanish coeliac children: their contribution to the understanding of the genetics of the disease. Gut 1983; 24: 532-7. 22 Todd JA, Acha-Orbea H, Bell JI, et al. A molecular basis for MHC Class II associated autoinmmunity. Science 1988; 240: 1003-9. 23 Scharf SJ, Friedmann A, Brautbar C, et al. HLA class II allelic variation and susceptibility to pemphigus vulgaris. Proc Natl Acad Sci 1988; 85: 3504-8. 24 Sinha AA, Brautbar C, Szafer F, et al. A newly characterized HLA BQfi allele associated with pemphigus vulgaris. Science 1988; 239: 1026-9. 25 Bugawan TL, Angelini G, Larrick J, Auricchio S, Ferrara GB, Erlich HA. A combination of a particular HLA-DPfI allele and an HLA-DQ heterodimer confers susceptibility to coeliac disease. Nature 1989; 339: 470-3. 26 Morellini M, Trabace S, Mazzilli MC, et al. A study ofHLA Class II antigens in an Italian paediatric population with coeliac disease. Disease Markers 1988; 6: 23-8. 27 Karr RW, Gregersen PK, Obata F, et al. Analysis of DR,I and DQ,1 chain cDNA clones from a DR7 haplotype. J Immunol 1986; 137: 2886-90. 28 Roep BO, Bontrop RE, Penla AS, van Eggermond MCJA, van Rood JJ, Giphart MJ. An HLA-DQ alpha allele identified at DNA and protein level is strongly associated with celiac disease. Human Immunol 1988; 23: 271-9. 29 Sollid LM, Markussen G, Ek J, Gjerde H, Vartdal F, Thorsby E. Evidence for a primary association of celiac disease to a particular HLA-DQ a/,B heterodimer. J Exp Med 1989; 169: 345-50. 30 Hurley CK, Gregersen P, Steiner N, et al. Polymorphism of the HLA-D region in American blacks: A DR3 haplotype generated by recombination. JIImmunol 1988; 140: 885-92. 31 Kagnoff MF, Austin RK, Hubert JJ, Bernardin JE, Kasarda DD. Possible role for a human adenovirus in the pathogenesis ofceliac disease. J7 Exp Med 1984; 160: 1544-57. 32 Kagnoff MF, Paterson YJ, Kumar PJ, et al. Evidence for the role of a human intestinal adenovirus in the pathogenesis of celiac disease. Gut 1987; 28: 995-1001.

Understanding the molecular basis of coeliac disease.

Gut, 1990,31,497-499 497 Leading article Understanding the molecular basis of coeliac disease Coeliac disease in children and adults is characterise...
698KB Sizes 0 Downloads 0 Views