IOVS Papers in Press. Published on June 6, 2014 as Manuscript iovs.13-13684

1

Rare Complement Factor H Variant Associated with Age-related Macular Degeneration in the

2

Amish.

3

Joshua D. Hoffman1, Jessica N. Cooke Bailey4, Laura D’Aoust1, William Cade3, Juan Ayala-

4

Haedo3, Denise Fuzzell4, Renee Laux4, Larry D. Adams3, Lori Reinhart-Mercer3, Laura

5

Caywood3, Patrice Whitehead-Gay3, Anita Agarwal2, Gaofeng Wang3, William K. Scott3,

6

Margaret A. Pericak-Vance3, Jonathan L. Haines1,4*

7

1

8

USA

9

2

10

3

11

Miami, Miami, FL, USA

12

4

13

OH, USA

14

*

15

Jonathan Haines

16

Chair, Department of Epidemiology and Biostatistics

17

Director, Institute of Computational Biology

18

2-529 Wolstein Research Building, 2103 Cornell Road, Case Western Reserve University,

19

Cleveland, OH 44106, USA

20

Tel: 1-216-368-2478

21

E-mail: [email protected]

22

NIH Grants: A6019085 (J.L.H. and M.A.P.-V), AG019726 (W.K.S.), AG044089 (J.D.H.),

23

EY21453-2 (J.N.C.B)

Center for Human Genetics Research, Vanderbilt University Medical Center, Nashville, TN,

Department of Ophthalmology and Visual Sciences, Vanderbilt University, Nashville, TN, USA John P. Hussman Institute for Human Genomics, Miller School of Medicine, University of

Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland,

Corresponding Author:

24

1

Copyright 2014 by The Association for Research in Vision and Ophthalmology, Inc.

Abstract

25 26

Purpose: Age-related macular degeneration (AMD) is the leading cause of blindness among

27

the adult population in the developed world. To further the understanding of this disease, we

28

have studied the genetically isolated Amish population of Ohio and Indiana.

29

Methods: Cumulative genetic risk scores were calculated using the 19 known allelic

30

associations. Exome sequencing was performed in three members of a small Amish family with

31

AMD who lack the common risk alleles in CFH and ARMS2/HTRA1. Follow-up genotyping and

32

association analysis was performed in a cohort of 973 Amish individuals including 95 with self-

33

reported AMD.

34

Results: The cumulative genetic risk score analysis generated a mean genetic risk score of

35

1.12 (95% CI [1.10, 1.13]) in the Amish controls and 1.18 (95% CI [1.13, 1.22]) in the Amish

36

cases. This mean difference in genetic risk scores is statistically significant (p=0.0042). Exome

37

sequencing identified a rare variant (P503A) in CFH. Association analysis in the remainder of

38

the Amish sample revealed that the P503A variant is significantly associated with AMD (p=9.27

39

x10-13). P503A was absent when evaluated in a cohort of 791 elderly non-Amish controls, and

40

1,456 non-Amish cases.

41

Conclusions: Data from the cumulative genetic risk score analysis suggests that the variants

42

reported by the AMDGene consortium account for a smaller genetic burden of disease in the

43

Amish as compared to the non-Amish Caucasian population. Using exome sequencing data, we

44

identified a novel missense mutation that is shared among a densely affected nuclear Amish

45

family and located in a gene that has been previously implicated in AMD risk.

46 47

2

48

Introduction

49

Age-related macular degeneration (AMD) is the leading cause of blindness in individuals over

50

the age of 65 in the developed world1, 2 . AMD is a progressive neurodegenerative disease that

51

results in central vision loss. Vision loss caused by AMD is generally divided into two categories.

52

Non-neovascular or “dry” AMD is characterized by the presence of a broad range of

53

abnormalities in the retinal pigment epithelial layer (RPE). Clinical characteristics include the

54

presence of drusen, hyperpigmentation or hypopigmentation of the RPE, and, in later stages,

55

geographic atrophy, which results from retinal pigment epithelial (RPE) layer cell death3 .

56

Neovascular or “wet” AMD occurs when new blood vessels form behind the macula, leaking

57

fluid into the space separating the Bruch’s membrane and the RPE layer3 .

58

AMD risk has been attributed to many lifestyle influences such as age, race, smoking, and

59

cardiovascular risk factors including obesity and hypertension4-8 . Genetic studies have

60

identified common variants with strong associations in CFH and ARMS2/HTRA1, as well as

61

multiple loci of smaller effect, in populations of European descent9-14 . A recent meta-analysis

62

completed by the AMDGene Consortium showed that the most strongly associated variants in

63

CFH and ARMS2/HTRA1, in combination with 17 other loci that reached genome-wide

64

significance, account for less than 65% of the total genetic contribution to AMD15 . While this

65

shows success in common variant analysis, there is still a gap in the total genetic variation yet

66

to be explained by this complex disease. Some of this missing heritability is thought to lie within

67

rare variants of large effect16-19 . To address the deficit in knowledge of rare variants influencing

68

AMD, we exome sequenced individual members of a nuclear family who represent a subset of

69

the Ohio-Indiana Amish population.

70

The Amish are a genetically and culturally isolated founder population descended from Swiss

71

and German Anabaptists who emigrated from Western Europe to North America in the 1700s

3

72

and 1800s20 . Individuals of the Amish community typically marry within the faith and observe a

73

strict lifestyle, resulting in a community that is more genetically and environmentally

74

homogeneous than the surrounding population. Due to the intermarriage within the community

75

and relatively recent founder event, this population may also be enriched for some rare

76

variations. These factors, in addition to an extensive family record available through the

77

Anabaptist Genealogy Database (AGDB), make the Amish a unique and valued population for

78

genetic studies21-23 .

79

Materials and Methods

80

Subjects: Amish subjects were selected from the Collaborative Aging and Memory Project

81

(CAMP), an ongoing sample collection of Amish individuals living in the United States in Ohio

82

and Indiana, described elsewhere24, 25 . Construction and maintenance of the Anabaptist

83

Genealogy Database (AGDB) is covered under an IRB-approved protocol at the National

84

Institutes of Health (Dr. Leslie Biesecker, Principal Investigator). AMD affection status was

85

assigned based on self-report questionnaire response where subjects were asked if they had

86

ever been diagnosed with AMD by a physician. A subset of 73 participants (42 cases, 31

87

controls) received a follow up clinical exam by a retinal specialist. The self-report dataset

88

included 128 individuals with AMD, 728 individuals without AMD, and 294 with no self-report

89

information, and all individuals were connected into a single 13-generation pedigree based on

90

an “all common paths” query of the AGDB using PedHunter 2.0 software26 .

91

Non-Amish subjects were ascertained from the Duke University Eye Center (DUEC), the

92

Vanderbilt Eye Institute (VEI), and the Bascom Palmer Eye Institute (BPEI) at the University of

93

Miami Miller School Of Medicine. Participants were examined by a retinal specialist and graded

94

on a severity scale derived from the Age-Related Eye Disease Study (AREDS), described

95

elsewhere27, 28 . Grades were given on a scale of 1-5 where grades 1 and 2 were assigned to

4

96

controls, grade 3 represented early AMD, and grades 4 and 5 represented late AMD

97

(geographic atrophy and choroidal neovascularization, respectively). Amish participants who

98

received a clinical diagnosis were graded using the same criteria. The final non-Amish dataset

99

contained 1,732 cases, 943 controls, and 310 unknown samples.

100

As part of ongoing AMD family studies, a single nuclear Amish family (family 1) with multiple

101

individuals affected with AMD (three of eight siblings affected) was noted. Affected members

102

lacked risk alleles at the Y402H locus in CFH and at the A69S locus in ARMS2. The three

103

affected siblings (a-78, a-79, a-80) (Figure 1) were clinically evaluated to confirm disease status.

104

The other siblings were not available for clinical examination at the time of exome sequencing,

105

although a self-report diagnosis was completed. Subject a-78 was diagnosed with bilateral

106

choroidal neovascularization (Grade 5); subjects a-79 and a-80 were diagnosed with large

107

drusen in both eyes (Grade 3). Details of clinical grades and age at time of diagnosis for the

108

ascertained nuclear family are included in figure 1. All procedures followed the tenets of the

109

Declaration of Helsinki and were approved by the institutional review boards of the University of

110

Miami Miller School of Medicine and Vanderbilt University. Informed consent was obtained from

111

all research subjects involved in this study.

112

Cumulative Genetic Risk Score Analysis: Genetic risk scores were calculated using the 19

113

variants and their effect sizes reported by the AMDGene consortium’s meta-analysis15.

114

Genotyping was performed using the Sequenom MassARRAY genotyping platform (Sequenom,

115

San Diego, USA). Subjects that presented with missing genotypes at any of the 19 loci were

116

excluded from the analysis. Each variant was weighted and multiplied by the number of risk

117

alleles present at each locus, where the SNP weight is equal to the individual SNP beta-

118

estimate divided by the sum of the beta-estimates across all 19 loci (formula 1). Risk scores

119

from each of the 19 loci were summed to give the cumulative genetic risk score per person.

5

120

Two-sided t-tests assuming unequal variance were calculated in the Amish case-control group

121

and Non-Amish versus Amish case group.

122

(1) Cumulative genetic risk score = w1 Genotype1 + w2 Genotype2 + … + w19 Genotype19

123

Exome Sequencing: DNA for all samples was extracted from whole blood by the Vanderbilt

124

University DNA Resources core and the John P. Hussman Institute of Human Genomics (HIHG)

125

at the University of Miami Miller School of Medicine using PureGene DNA extraction methods

126

(Gentra Systems, Minneapolis, MN). The three clinically examined members of family 1 were

127

selected for exome sequencing. Exome capture was performed using the Agilent SureSelect All

128

Exon kit V. Exon enriched libraries were sequenced on the Illumina HiSeq 2000. Sequence

129

capture and high-throughput sequencing were completed at the HIHG. Paired end reads were

130

generated and mapped to the human reference genome (version hg19 from UCSC) using

131

Burrows-Wheeler Aligner29 . Duplicate reads were marked using Picard tools and local

132

realignment around insertions and deletions was performed using the Genome Analysis Toolkit

133

(GATK)30 . Realignment was performed using the base quality score recalibration walker

134

followed by variant calling. Variant quality score recalibration was completed using an additional

135

172 sets of exomes from Amish individuals who were not members of nuclear family 1.

136

Annotation of nucleotide variants was performed using the SeattleSeq Annotation server4 .

137

Variant filtration was completed as follows: variants found in the 1000 Genomes project

138

database, NHLBI exome sequencing project exome variation server (EVS) database, or

139

dbSNP137 database were excluded. In addition, single nucleotide variants that were not

140

missense, nonsense, splice junction, or frame shift causing mutations found within exon

141

boundaries were excluded. Genes not known to be associated with AMD were also excluded.

142

Variants found to be either homozygous or heterozygous and shared by all three affected

143

siblings were retained. Variant confirmation was performed using standard forward and reverse

144

Sanger sequencing practices on the ABI 3730xl. 6

145

Targeted Genotyping: Genotyping was performed using the Sequenom MassARRAY

146

genotyping platform (Sequenom, San Diego, USA). Independent Sequenom pools were

147

genotyped and evaluated for the rare-variant data, and the 19 loci used to calculate the

148

cumulative genetic risk score. A genotyping efficiency threshold of 95% was used in both

149

datasets to determine a valid Sequenom assay. See table 1 for final sample sizes used in each

150

step of the analysis.

151

Association and Linkage Analysis: Variants identified in the nuclear family were evaluated for

152

association with AMD in the full Amish dataset using the Modified Quasi Likelihood Score

153

statistic (MQLS)31 . MQLS is comparable to the chi-square test with the exception that it

154

estimates the variance on point estimates of the allele frequencies in cases and controls while

155

taking into account the correlation between related individuals in a pedigree. This effectively

156

allows all degrees of relationships to be included in the association analysis. Kinship and

157

inbreeding coefficients for all possible relationships in the 13-generation pedigree were

158

calculated using the MQLS recommended program KinInbcoef. MQLS analysis was performed

159

using option 1, which allows for individuals with genotype, but no phenotype data to contribute

160

to the analysis. An assumed disease prevalence of 10% was selected from published estimates

161

of the prevalence of AMD in individuals over age 60 in population samples1, 2, 32 . The nuclear

162

family was genotyped as part of the full Amish dataset and was not excluded from the analysis.

163

A sub-analysis was performed in which only those cases with a clinical exam completed by a

164

retinal specialist were analyzed.

165

Parametric linkage analysis was performed using the entire data set genotyped on the

166

Affymetrix Human SNP Array 6.0. The pedigree was divided into smaller, more computationally

167

feasible pedigrees using PedCut with a bit size threshold of 2433 . Parametric heterogeneity

168

LOD (HLOD) scores were calculated under affecteds-only dominant and recessive models

169

assuming incomplete penetrance in MERLIN34 . Under the dominant model, we specified 7

170

penetrance values of 0 for no copies of the disease allele and 0.0001 for one or two copies of

171

the disease allele. Under the recessive model, we specified penetrances of 0 for zero or one

172

copy of the disease allele and 0.0001 for 2 copies of the disease allele. The disease allele

173

frequency was set to 10% and marker allele frequencies corrected for relatedness were

174

estimated from all genotyped Amish individuals. Multipoint linkage analysis was performed on a

175

7 megabase region surrounding CFH. LD pruning for multipoint was performed using the

176

HAPMAP CEPH samples with a pairwise r2 cutoff of < 0.16. In addition to standard multipoint

177

linkage analysis, sub-analyses specifying liability classes modeled on the effect sizes of the rare

178

and common CFH alleles were examined. These included using the common SNP data derived

179

from the Affymetrix 6.0 GWAS chip; assuming only CFH Y402H as a risk allele; assuming only

180

CFH P503A as a risk allele, and assuming both CFH Y402H and P503A as risk alleles.

181

Detailed methods for quality control and linkage analysis procedures can be found elsewhere35 .

182

To examine linkage disequilibrium (LD) between the novel variant and the common Y402H

183

variant, we extracted the most distantly related Amish subjects genotyped in the pedigree. For

184

this analysis, we used a maximum pair-wise relationship cut-off between 2nd and 3rd cousins,

185

resulting 168 individuals available for an LD calculation. LD was measured using the software

186

package PLINK36 .

187

RESULTS

188

Clinical vs. Self-Report of AMD: Comparing clinical diagnoses to self-report of AMD status in

189

individuals for whom both were available, we observed positive and negative predictive values

190

of 89% and 90%, respectively (Table 3), indicating that self-report of AMD status is a good

191

proxy for AMD diagnosis in this population sample.

192

Cumulative Risk Score Analysis: In our cumulative genetic risk score analysis we observe a

193

mean risk score of 1.12 (95% CI [1.10, 1.13]) in the Amish controls and 1.18 (95% CI [1.13, 8

194

1.22]) in the Amish cases (P=0.0042). We also observe a mean risk score of 1.14 (95% CI

195

[1.13, 1.16]) in 841 non-Amish Caucasian controls and 1.31(95% CI [1.30, 1.32] in 1,573 non-

196

Amish Caucasian cases. When comparing the Amish cases with the non-Amish cases we see a

197

significant decrease in genetic risk score (p

Rare complement factor H variant associated with age-related macular degeneration in the Amish.

Age-related macular degeneration is the leading cause of blindness among the adult population in the developed world. To further the understanding of ...
215KB Sizes 3 Downloads 3 Views