Accepted Article

Received Date : 07-Dec-2012 Revised Date

: 09-May-2014

Accepted Date : 09-May-2014 Article type

: Resource Article

Discovery and validation of genic single nucleotide polymorphisms in the Pacific oyster Crassostrea gigas

Jiafeng Wang1, 2 #, Haigang Qi1 #, Li Li1*, Huayong Que1, Di Wang1, Guofan Zhang1* 1

National & Local Joint Engineering Laboratory of Ecological Mariculture,Institute of Oceanology, Chinese Academy of Sciences, Qingdao 266071, China. 2

Affiliated Hospital of Guangdong Medical College, Guangdong 524001, China.

Keywords: Crassostrea gigas, transcriptome, Illumina sequencing, genic SNPs, high-resolution melting

# Equal contribution * Correspondence: Guofan Zhang & Li Li Fax: +86 0532 8289 8701 E-mail: [email protected]; [email protected]

Running title: SNP development of Crassostrea gigas

This article has been accepted for publication and undergone full peer review but has not been through the copyediting, typesetting, pagination and proofreading process, which may lead to differences between this version and the Version of Record. Please cite this article as doi: 10.1111/1755-0998.12278 This article is protected by copyright. All rights reserved.

Accepted Article

Discovery and validation of genic single nucleotide polymorphisms in the Pacific oyster Crassostrea gigas Jiafeng Wang1, 2 #, Haigang Qi1 #, Li Li1*, Huayong Que1, Di Wang1, Guofan Zhang1* 1

National & Local Joint Engineering Laboratory of Ecological Mariculture,Institute of Oceanology, Chinese Academy of Sciences, Qingdao 266071, China. 2

Affiliated Hospital of Guangdong Medical College, Guangdong 524001, China.

Abstract The economic and ecological importance of the oyster necessitates further research on the molecular mechanisms that both regulate the commercially important traits of the oyster and help it to survive in the variable marine environment. Single nucleotide polymorphisms (SNPs) have been widely used to assess genetic variation and identify genes underlying target traits. In addition, high-resolution melting (HRM) analysis is a potentially powerful method for validating candidate SNPs. In this study, we adopted a rapid and efficient pipeline for the screening and validation of SNPs in the genic region of Crassostrea gigas based on transcriptome sequencing and HRM analysis. Transcriptomes of three wild oyster populations were sequenced using Illumina sequencing technology. In total, 50–60 million short reads, corresponding to 4.5–5.4 Gbp, from each population were aligned to the oyster genome, and 5.8 ×105 SNPs were putatively identified, resulting in a predicted SNP every 47 nucleotides on average. The putative SNPs were unevenly distributed in the genome and high-density (≥ 2%), non-synonymous coding SNPs were enriched in genes related to apoptosis and responses to biotic stimuli. Subsequently, 1,671 loci were detected by HRM analysis, accounting for 64.7% of the total selected candidate primers, and finally, 1,301 polymorphic SNP markers were developed based on HRM analysis. All of the validated SNPs were distributed into 897 genes and located in 672 scaffolds, and 275 of these genes were stress inducible under unfavorable salinity, temperature, and exposure to air and heavy metals. The validated SNPs in this study provide valuable molecular markers for genetic mapping and characterization of important traits in oysters. Keywords: Crassostrea gigas, transcriptome, Illumina sequencing, genic SNPs, high-resolution melting

Introduction Oysters are marine bivalves (Bivalvia, Mollusca) that are distributed worldwide and play important roles in coastal and estuarine ecological systems. They are also important for fisheries and aquaculture. The worldwide production of oysters is approximately 4.7 million tons/year, which is the highest production of all of the aquaculture marine animals (FAO, 2011). The Pacific oyster (Crassostrea gigas), which primarily originates from northeastern Asia, is a commercially important species and has been introduced to many countries all over the world because of its acclimation ability, rapid growth and high production. The Pacific oyster is now one of the most important economic bivalves produced throughout the world.

This article is protected by copyright. All rights reserved.

Accepted Article

Despite considerable progress in the oyster industry over the past few decades, oysters are still at an early stage of domestication (Hubert et al. 2004). The sustainable development of the oyster industry calls for the production of high-quality oysters with rapid growth and high disease- and stress-resistance. Genetic and genomic studies in the oyster industry have mainly focused on growth rate and yield (Rico-Villa et al. 2009; Meyer et al. 2010; Alunno et al. 2011; Guo et al. 2012; Kong et al. 2013; Olivier et al. 2013), summer mortality (Cotter et al. 2010; Dégremont et al. 2010; Fleury et al. 2010; Sauvage et al. 2010; Fleury et al. 2012), germplasm diversity (Plough et al. 2011; Miller et al. 2012), and viral infection (Pepin et al. 2010; Sauvage et al. 2010; William et al. 2011; Roque et al. 2012; Dégremont et al. 2013). Genetic mapping of important traits, gene characterization and molecular breeding require genetic markers (Aavey, et al, 2011). In the past few years, several studies have discovered traditional DNA markers for C. gigas (Sekino et al. 2003; Li et al. 2003; Yu et al. 2007; Yu et al. 2008; Bai et al. 2009; Qi et al. 2009; Sauvage et al. 2009; Yu et al. 2010; Li et al. 2011; Zhong et al. 2013), and linkage maps with different purposes have been constructed (Hubert et al. 2004; Li et al. 2004; Sauvage et al. 2010; Plough et al. 2011; Guo et al. 2012). Additionally, the whole genome sequencing of C. gigas was completed, deepening our understanding of the mechanisms of stress adaptation and shell formation of oysters (Zhang et al. 2012). The fine mapping of important traits is rare, mainly because of the lack of high-resolution genetic maps and association studies of gene functions. Therefore, more novel and useful DNA markers are needed to provide useful tools for estimating gene diversity, mapping quantitative traits and facilitating oyster molecular breeding (Beuzen et al. 2000; Liu et al. 2004). Single nucleotide polymorphisms (SNPs) are widespread nucleotide variations among individuals of a population and constitute the most abundant type of molecular marker in the genomes of plants and animals. Their high abundance, co-dominant mode of inheritance and ease of high-throughput detection have resulted in widespread use of SNPs in biological research (Brookes et al. 1999; Park et al. 2009; Level et al. 2011; Brandon et al. 2012). As next-generation high-throughput sequencing technologies have become more accessible (Feldmeyer et al. 2011; Davey et al. 2011; Gavery et al. 2012), it is now feasible to obtain genome and transcriptome sequences faster and at a lower cost (Pareek et al. 2011). In addition to assessing mRNA expression levels, transcriptome sequencing, including RNA-seq data, can also be used to discover SNPs (Novaes et al. 2008; Trick et al. 2009; Nielsen et al. 2010; Roberts et al. 2012; Helyar et al. 2012). The primary challenge in mining SNPs from transcript sequences is the assembly of millions of short reads (Feldmeyer et al. 2011; Yang et al. 2011); this difficulty can be addressed by reference-assisted transcriptome assembly and the improvement of de novo transcriptome assemblers or analysis workflows (Feldmeyer et al. 2011; Helyar et al. 2012; Roberts et al. 2012). The complete sequencing of the Pacific oyster genome (Zhang et al. 2012) has introduced a new research paradigm for C. gigas and its closest congeners, and will provide the basis for marker development based on transcriptome sequencing or resequencing. A high density of SNPs has been found in expressed sequence tags of C. gigas (Curole et al. 2005; Sauvage et al. 2007a, b), whereas no large-scale exploitation or evaluation of gene-targeted SNPs has been reported. Various methods are used for SNP genotyping. High-resolution melting (HRM) is a closed-tube method for mutation screening and genotyping that does not require fluorescently labeled probes. In contrast to conventional PCR product melting analysis, HRM characterizes saturating-dye-stained DNA samples based on their melting behavior, allowing even single-base changes to be readily

This article is protected by copyright. All rights reserved.

Accepted Article

identified (Ye et al. 2010). HRM-based SNP genotyping methods, including small-amplicon HRM (Smith et al. 2010; Ujino-Ihara et al. 2010) and unlabeled-probe HRM (Zhou et al. 2004; Erali et al. 2008; Jiang et al. 2011; Li et al. 2013; Zhong et al. 2013), have been widely used for SNP genotyping. In this study, transcriptome sequencing was conducted in three wild Pacific oyster populations of China, and the short read data were mapped to the reference oyster genome (Zhang et al. 2012). Genic SNPs were predicted, and more than 1,000 robust SNP markers were obtained using an improved small-amplicon HRM platform, which could be potentially useful for genetic, genomic and ecologic studies of the oyster. Materials and Methods Animal materials collection Wild oysters were collected from natural populations on the coasts of Qingdao (QD), Qinhuangdao (QH) and Dalian (DL), China. Twenty oysters were randomly selected from each site (G1 in Table 1), and the organs, including the gill, mantle and adductor muscle, of each oyster were excised, mixed and stored in liquid nitrogen for future RNA extraction. Another eight oysters, including three from QD, two from QH and three from DL (G2 in Table 1), were used to validate all of the selected SNPs. In total, 48 additional wild oysters from QD (G3 in Table 1) were used to characterize 40 SNPs that were initially validated with the eight oysters. The adductor muscle of each oyster was excised and stored in 75% ethanol for genomic DNA extraction. RNA extraction and transcriptome sequencing The frozen tissues (G1 in Table 1) were triturated, and total RNA was isolated using TRIzol reagent (Invitrogen, Carlsbad, CA, USA) according to the manufacturer’s instructions. DNA contaminants were removed by incubation with RQ1 RNase-Free DNase (Promega, Madison, WI, USA). Equal amounts of total RNA from 20 oysters within each population (G1 in Table 1) were mixed, and poly-A RNA was purified using an OligoTex mRNA mini kit (Qiagen, Hilden, Germany). After fragmentation into small pieces by ultrasound waves, the purified mRNA was used for first-strand cDNA synthesis using random hexamers and Superscript II reverse transcriptase (Invitrogen). The second strand was synthesized using Escherichia coli DNA PolI (Invitrogen), and the double-stranded cDNA was purified using a Qiaquick PCR purification kit (Qiagen). After end repair and addition of a 3’ dA overhang, the cDNA was ligated to the Illumina PE adapter oligo mix, and cDNA bands in the 200 ± 20 bp range were selected by agarose gel electrophoresis. After purification of the cDNA templates, 15 PCR cycles were performed to enrich the cDNA using two primers that annealed to the ends of the adapters. The libraries were then sequenced by the paired-end sequencing method with read length of 90 bp using Illumina Genome Analyzer (HiSeqTM2000 Sequencing System). Read alignment and SNP calling The Pacific oyster genome sequences were first “indexed” using Bowtie software (Langmead et al. 2009) so as to accelerate subsequent reads mapping. After filtering low-quality reads with more than one 'N' or mean base-calling quality 60% of the predicted gene set. In these regions, about 4.1, 4.2 and 4.3 × 105 putative SNPs were identified in the QD, QH and DL populations, respectively, corresponding to a SNP every 56 nucleotides on average (Table 3). In the combined population, the number of putative SNPs increased to 5.8 × 105 (File S2), and the SNP density reached 2.1%. Approximately 89% of the putative SNPs were heterozygous, implying high polymorphism of the expressed genes. The 5.8 × 105 putative SNPs included 4.1 × 105 synonymous SNPs and 1.7 × 105 non-synonymous SNPs. Transition SNPs comprised most of the SNPs, including 2.0 × 105 A/G and 2.0 × 105 C/T, and 1.9 × 105 transversion SNPs involved 4.9 × 104 A/C, 4.9 × 104 G/T, 5.9 × 104 A/T and 3.1 × 104 C/G (Fig. 1). Investigation of the putative SNP density in genes derived from the combined population revealed a right-skewed curve (Fig. 2) with a large proportion of genes containing extremely high densities of SNPs, indicating a non-uniform distribution of SNPs in different genes. There were 668 genes that contained SNPs in over 5% of their coding sequence, and GO annotation demonstrated that these genes were enriched for the biological processes of translation, protein dephosphorylation, protein polymerization, cellular protein metabolism, protein metabolism, translational elongation and superoxide metabolism (Table 4). No putative SNPs were detected in 561 genes, and GO analysis indicated that genes with functions in nucleic acid binding and acid-amino acid ligase activity were significantly overrepresented. There were 1395 genes where the density of non-synonymous coding SNPs exceeded 2% of the coding sequence, which represented a >3-fold increase relative to the average SNP density of 0.6%. GO annotation analysis indicated a significant enrichment of genes involved in protein dephosphorylation, apoptosis and the response to biotic stimuli (Table 5). SNP validation by HRM analysis PCR-amplicon melting curves distinguish between heterozygotes and homozygotes according to different melting temperatures, and specific genotyping can be obtained by melting the amplicon duplexes. Homozygotes present a single peak, whereas heterozygotes present double peaks, which are usually lower than the homozygote peaks (Fig. 3). Different alleles result in different product-melting transitions based on the stability of the nucleotide base pair mismatches, allowing even single nucleotide variants to be easily recognized and genotyped. In total, 2,962 pairs of primers were designed and screened in our study, among which 2,582 pairs passed the PAGE screening process. After HRM analysis, 1,671 loci were successfully genotyped in eight samples of C. gigas, accounting for 64.7% of the total selected candidate SNPs screened by HRM. Among the 1,671 candidate loci, 1,301 (77.9%) loci were polymorphic (Fig. 4, File S3). Approximately 76.0% of the genotyped SNPs were transitions and comprised 41.0% C/T and 35.0% A/G. Transversions were much less frequent than transitions, including 11.5% A/C, 8.3% G/T, 2.9% A/T and 1.6% C/G. The 1,301 polymorphic SNPs covered 672 scaffolds over the whole genome, and these SNPs were distributed in 897 genes of the 206 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. The pathways involved metabolism, stress, immunity, resistance adjustment and body composition of C. gigas. Among the 897 genes, 275 were stress-responsive genes (Zhang et al. 2012) (File S4) under the challenges of high/low salinity, high/low temperature and exposure to air and heavy metals. Notably, 70 polymorphic SNPs were located in 50 of the 88 expanded heat shock

This article is protected by copyright. All rights reserved.

Accepted Article

protein (HSP) 70 genes (Zhang et al. 2012) (File S3). PCR amplicons of ten randomly selected candidate SNPs were sequenced using the Sanger method. The genotypes of the selected SNPs obtained by HRM analysis were confirmed by Sanger sequencing, which validated the accuracy of the HRM analysis (File S5). For five of the six SNPs that could not be genotyped by HRM, unpredicted variation (including SNPs and In/Del) was found in the same amplicon. Sanger sequencing failed for the sixth locus because of poor amplification. Characterization of single nucleotide markers Among the 20 monomorphic loci, only one proved variable. Among the 20 polymorphic SNPs, two showed different melting curves in two and five of the 48 individuals because of the extra unpredicted SNPs in the same amplicon (Sanger sequencing certified, File S6). The rest of the SNPs (18) presented the same regular melting curves observed in the eight original samples (File S7). All 18 of the polymorphic loci had two alleles, and the minor allele frequency ranged from 0.022 to 0.490. The observed heterozygosities ranged from 0.044 to 0.521 and expected heterozygosities ranged from 0.043 to 0.505. No LD between loci was observed, but deviation from HWE was observed at three loci in the population after Bonferroni correction (P < 0.05) (Table 6). Discussion Using the Pacific oyster genome as a reference, we comprehensively investigated the genic SNPs in oysters by transcriptome analysis, summarized the putative SNP density and distribution of the coding region and further developed 1,301 polymorphic SNP markers based on an improved small-amplicon HRM assay. Because all of the SNPs were genic, and many of the related genes have been identified as involved in various commercially and ecologically important traits, these coding SNPs may be of great interest for their possible effects on phenotype. This study represents the first large-scale primer-based genic SNP marker development in C. gigas, which will provide valuable molecular resources for high-resolution linkage mapping, association analysis, gene verification and other genomic, genetic and ecological studies of the oyster. SNPs in the coding region of the oyster Oysters possess high sequence variation (Hedgecock et al. 2005). According to our previous estimate, 1.7% of the SNPs in the whole genome and 1.5% of the SNPs in exons were assessed in two oysters, suggesting a high density of SNPs in the oyster genome (Zhang et al. 2012). In our study, SNPs in coding regions occur on average every 47 bp in the oyster transcriptome. The estimated cSNP density was higher than the one SNP every 60 bp in coding regions reported by Sauvage (Sauvage et al. 2007a) and was also higher than the one SNP every 133 bp in ESTs reported by Zhong (Zhong et al, 2013), which may have resulted from different sample sizes of the sequence resources where SNPs were mined. The putative SNPs were distributed unevenly among different genes, with the SNP density ranging from 0 to >10%, suggesting that different constraints are placed on these genes by natural selection (Hu et al. 2002). SNP validation by improved HRM assay

This article is protected by copyright. All rights reserved.

Accepted Article

Among the numerous SNP genotyping methods, methods based on gel separation are easily affected by subtle differences in electrophoretic conditions and instrument output results. These differences make it difficult to integrate the data from different laboratories (Seeb et al. 2007). Additionally, PCR amplicon sequencing for SNP genotyping in a large-scale sample remains costly, and microarrays for SNP genotyping are not available for oysters (Fan et al. 2003; Gunderson et al. 2005). Fluorescent probe methods (Yan et al. 2010; Helyar et al. 2012) are accurate, but the labeled probes used in these methods are expensive. Methods based on PCR product melting with DNA dyes such as HRM avoid these drawbacks. DNA dyes are non-specific and can be used for any target, and PCR-product melting analysis can identify the genotypes of the amplified products if the resolution of the melting instrument is sufficiently accurate. Sanger sequencing confirmed that the failure of genotyping by HRM for some primers in some individuals mostly likely resulted from the existence of unanticipated variation not predicted from the transcriptome sequences (File S6). The amplicons in small-amplicon HRM are much shorter than those in unlabeled-probe HRM, and we further shortened the amplicon size to 40-100 bp, which effectively minimized the presence of extra SNP loci in the same amplicon. HRM analysis of small amplicons is thus more efficient than unlabeled-probe HRM analysis because of the high density of SNPs in C. gigas, although unlabeled-probe HRM has been proven to be more sensitive in other species (Liew et al. 2007). Additionally, we adopted a novel two-step HRM rather than the original close-tube assay. First, PCR amplification was employed without addition of internal controls and LC-green, and then the unblocked internal controls and LC-green were added to the PCR products, and HRM was performed. During the two-step HRM, cost-effective, unblocked controls were used after PCR rather than adding the blocked controls before PCR (Gundry et al. 2008) (File S8). Furthermore, polyacrylamide gel electrophoresis was adopted for the primer screen to reject primers without reliable amplification before HRM analysis. All of the above modifications make HRM an economic, relatively moderate-throughput SNP genotyping method. Ratio of SNP validation In our study, the overall ratio of SNP validation reached 43.9% (1,301/2,962), which was higher than the 24.0% (56/233) found in the Pacific oyster (Zhong et al. 2013) and the 11.6% (44/378) (Jiang et al. 2011) and 33.7% (101/300) (Li et al. 2013) found in the Zhikong scallop, where unlabeled-probe HRM was used. Of note, the failure rate of primer selection (12.8%, 380/2,962) decreased dramatically compared to the values of 33.9% (79/233) previously observed in the Pacific oyster (Zhong et al. 2013) and 50.5% (191/378) (Jiang et al. 2011) and 41.3% (124/300) (Li et al. 2013) previously observed in the Zhikong scallop, indicating that the higher validation rate in our study partially resulted from the great enhancement of the success rate of primer screening. One important reason for the amplification failure was the possible presence of intron-exon junctions in the expressed sequences used for primer design (Qi et al. 2010; Li et al. 2013). In addition to their use as a reference during transcriptome mapping, the oyster genome and predicted gene sets (Zhang et al. 2012) make the intron-exon junctions visible in coding sequences and help to avoid spanning or overlapping splice junctions in primer design. Additionally, the genotyping ratio of the novel two-step HRM was 50.4% (1,301/2,582), which was higher than the 36.4% (56/154) found in the Pacific oyster by unlabeled-probe HRM (Zhong et al. 2013). This ratio was similar to that in the Zhikong scallop, which ranged between 23.5% (44/187) (Jiang et al. 2011) and 57.4% (101/176) (Li et

This article is protected by copyright. All rights reserved.

Accepted Article

al. 2013) when unlabeled-probe HRM was used. These findings indicate that the two-step HRM assay used in our study was efficient for SNP validation of C. gigas. Annotation of SNP markers We performed HRM analysis for 1,671 putative genic SNPs of C. gigas, and all sequence and annotation information of these loci are summarized in Supplementary File S3. The 1,301 polymorphic SNPs covered 672 scaffolds over the whole genome and were located in 897 genes of 206 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways, and 616 had GO categories assigned. According to the 61 stress responsive transcriptome provided by Zhang (Zhang et al. 2012), 275 of the 897 genes were stress-responsive genes. These genes would be useful in candidate gene association studies of stress adaptation, including adaptation to salinity, temperature and exposure to air and heavy metal. Among the 275 stress-responsive genes, 189 had GO categories assigned, including oxidoreductase activity (GO: 0016491), superoxide metabolism (GO: 0006801), transporter activity (GO: 0005215), monooxygenase activity (GO: 0004497) and protein modification (GO: 0006464). HSP 70 may play an important role in the oyster’s adaptation to the stressful intertidal environment, especially in the adaptation to heat (Zhang et al. 2012). In our study, 70 polymorphic SNPs obtained from 50 HSP 70 genes were validated, and their functional diversity was determined. In addition, the SNPs of more than 60 genes that belong to the solute carrier (SLC) families were also validated. SLC superfamily genes encode membrane-bound transporters and play important roles in the balance of ions, amino acids and organic anions under stress conditions (Schlessinger et al. 2010; Marra et al. 2012; Kenkel et al. 2013). For example, sodium- and chloride-dependent transporter genes may be important for oyster osmotic adaptation. SNPs in the oyster osmotic stress-responsive genes, pyrroline-5-carboxylate reductase (P5CR) and aquaporin (Meng et al. 2013), were also found in our results. SNPs were also validated in some other stress responsive genes, which include oxidation-reduction genes, immune response genes and chemical defense genes, such as Cu-Zn SOD, CYP450, TLR and GST. Acknowledgements We thank Dr. Ximing Guo from Rutgers University for his discussions about the work. We are also grateful to Dr. Jie Meng who provided help in stress-responsive gene analysis. This research was supported by the National Basic Research Program of China (973 Program, No. 2010CB126402), the National Natural Science Foundation of China (No. 41206149), the National High Technology Research and Development Program (863 program, No. 2012AA10A405), the earmarked fund for Modern Agro-industry Technology Research System (CARS-48), and the Taishan Scholars Climb Program of Shandong.

References Albrechtsen A, Nielsen F, Nielsen R (2010) Ascertainment biases in SNP chips affect measures of population divergence. Molecular Biology and Evolution, 27(11), 2534-2547. Alunno-Bruscia M, Bourlès Y, Maurer D et al. (2011) A single bio-energetics growth and reproduction model for the oyster Crassostrea gigas in six Atlantic ecosystems. Journal of Sea Research, 66(4), 340-348.

This article is protected by copyright. All rights reserved.

Accepted Article

Bai J, Li Q, Kong LF, Li RH (2009) Characterization of 20 single nucleotide polymorphism markers in the Pacific oyster (Crassostrea gigas). Animal Genetics, 40(6), 1004-1004. Beuzen ND, Stear MJ, Chang KC (2000) Molecular markers and their use in animal breeding. Vet J, 160(1), 42-52. Brandon Gaut (2012) Arabidopsis thaliana as a model for the genetics of local adaptation. Nature genetics, 44(2), 115-116. Brookes AJ (1999) The essence of SNPs. Gene, 234(2), 177-186. Chen S, Yang P, Jiang F et al. (2010) De Novo Analysis of Transcriptome Dynamics in the Migratory Locust during the Development of Phase Traits. PLoS one, 5(12), e15633. Clark AG, Hubisz MJ, Bustamante CD et al. (2005) Ascertainment bias in studies of human genome-wide polymorphism.Genome research, 15(11), 1496-1502. Cotter E, Malham SK, O'Keeffe S et al. (2010) Summer mortality of the Pacific oyster, Crassostrea gigas, in the Irish Sea: The influence of growth, biochemistry and gametogenesis. Aquaculture, 303(1), 8-21. Curole JP, Hedgecock D (2005) High frequency of SNPs in the Pacific oyster genome. Plant & Animal Genomes XIII Conference, W026: http://www.intl-pag.org/13/abstracts/PAG13_W026.html. Davey JW, Hohenlohe PA, Etter PD et al. (2011) Genome-wide genetic marker discovery and genotyping using next-generation sequencing. Nature Reviews Genetics, 12(7), 499-510. Dégremont L, Bédier E, Boudry P (2010) Summer mortality of hatchery-produced Pacific oyster spat (Crassostrea gigas). II. Response to selection for survival and its influence on growth and yield. Aquaculture, 299(1), 21-29. Dégremont L, Guyader T, Tourbiez D et al. (2013) Is horizontal transmission of the Ostreidherpervirus OsHV-1 in Crassostrea gigas affected by unselected or selected survival status in adults to juveniles? Aquaculture, 408-409: 51-57. Dundon WG, Arzul I, Omnes E et al. (2011) Detection of Type 1 Ostreid Herpes variant (OsHV-1 μvar) with no associated mortality in French-origin Pacific cupped oyster Crassostrea gigas farmed in Italy. Aquaculture, 314(1), 49-52. Erali M, Palais R, Wittwer C (2008) SNP genotyping by unlabeled probe melting analysis. Methods MolBiol, 429, 199-206. Fan JB, Oliphant A, Shen R et al. (2003) Highly parallel SNP genotyping. Cold Spring HarbSymp Quant Biol, 68, 69-78. Feldmeyer B, Wheat CW, Krezdorn N et al. (2011) Short read Illumina data for the de novo assembly of a non-model snail species transcriptome (Radix balthica, Basommatophora, Pulmonata), and a comparison of assembler performance. BMC Genomics, 12(1), 317.

This article is protected by copyright. All rights reserved.

Accepted Article

Fleury E, Huvet A (2012) Microarray analysis highlights immune response of Pacific oysters as a determinant of resistance to summer mortality. Marine Biotechnology, 14(2), 203-217. Fleury E, Moal J, Boulo V et al. (2010) Microarray-based identification of gonad transcripts differentially expressed between lines of Pacific oyster selected to be resistant or susceptible to summer mortality. Marine Biotechnology, 12(3), 326-339. Gavery MR, Roberts SB (2012) Characterizing short read sequencing for gene discovery and RNA-Seq analysis in Crassostrea gigas. Comparative Biochemistry and Physiology Part D: Genomics and Proteomics, 7(2), 94-99. Gunderson KL, Steemers FJ, Lee G et al. (2005) A genome-wide scalable SNP genotyping assay using microarray technology. Nat Genet, 37(5), 549-554. Gundry CN, Dobrowolski SF, Martin YR et al. (2008) Base-pair neutral homozygotes can be discriminated by calibrated high-resolution melting of small amplicons. Nucleic Acids Res, 36(10), 3401-3408. Guo X, Li Q, Wang QZ et al. (2012) Genetic mapping and QTL analysis of growth-related traits in the Pacific oyster. Marine Biotechnology, 14(2), 218-226. Hedgecock D, Gaffney PM, Goulletquer P et al. (2005) The case for sequencing the Pacific oyster genome. J Shellfish Res, 24, 429-441. Helyar SJ, Limborg MT, Bekkevold D et al. (2012) SNP discovery using next generation transcriptomic sequencing in Atlantic herring (Clupeaharengus). PloS one, 7(8), e42089. Hillman, RE (1965) The American oyster, Crassostrea virginica gmelin. Chesapeake Science, 6(3), 199-199. Hu G, Modreck B, Stensland HR et al. (2002) Efficient discovery of single-nucleotide polymorphisms in coding regions of human genes. The Pharmacogenomics Journal, 2(4), 236-242. Hubert S, Hedgecock D (2004) Linkage maps of microsatellite DNA markers for the Pacific oyster Crassostrea gigas. Genetics, 168(1), 351-362. Imai T, Mori K, Sugawara Y et al. (1969) Studies on the mass mortality of oysters in Matsushima Bay VII. Pathogenetic investigation. Jiang G, Li J, Li L et al. (2011) Development of 44 gene-based SNP markers in Zhikong scallop, Chlamys farreri. Conservation Genetics Resources, 3(4), 659-663. Kenkel CD, Meyer E, Matz MV (2013) Gene expression under chronic heat stress in populations of the mustard hill coral (Porites astreoides) from different thermal environments. Molecular ecology, 22(16), 4322-4334. Kong N, Li Q, Yu H et al. (2013) Heritability estimates for growth-related traits in the Pacific oyster (Crassostrea gigas) using a molecular pedigree. Aquaculture Research. doi: 10.1111/are.12205

This article is protected by copyright. All rights reserved.

Accepted Article

Kim WJ, Jung H, Gaffney PM (2011) Development of type I genetic markers from expressed sequence tags in highly polymorphic species. Marine Biotechnology, 13(2), 127-132. Langmead B, Trapnell C, Pop M et al. (2009) Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol, 10(3), R25. Level AF, Korte A, Cooper MD et al. (2011) A map of local adaptation in Arabidopsis thaliana. Science, 334(6052), 86-89. Liew M, Seipp M, Durtschi J (2007) Closed-Tube SNP Genotyping Without Labeled Probes - A Comparison Between Unlabeled Probe and Amplicon Melting. Am J ClinPathol, 127, 341-348. Li F, Niu B, Huang Y et al. (2012) Application of High-Resolution DNA Melting for Genotyping in Lepidopteran Non-Model Species: Ostrinia furnacalis (Crambidae). PLoS one, 7(1), e29664. Li G, Hubert S, Bucklin K et al. (2003) Characterization of 79 microsatellite DNA markers in the Pacific oyster Crassostrea gigas. Mol Ecol Notes, 3(2), 228-232. Li H, Handsaker B, Wysoker A et al. (2009) The Sequence Alignment/Map format and SAMtools. Bioinformatics, 25(16), 2078-2079. Li J, Bao Z, Li L et al. (2012) Development of 101 novel EST-derived single nucleotide polymorphism markers for Zhikong scallop (Chlamys farreri). Journal of Ocean University of China, 12(3), 403-412. Li L, Guo X (2004) AFLP-based genetic linkage maps of the pacific oyster Crassostrea gigas Thunberg. Mar Biotechnol (NY), 6(1), 26-36. Liu ZJ, Cordes JF (2004) DNA marker technologies and their applications in aquaculture genetics. Aquaculture, 238(1), 1-37. Li Q, Wang Q, Qi M et al. (2011) Development, characterization, and inheritance of 113 novel EST-SSR markers in the Pacific oyster (Crassostrea gigas). Genes & Genomics, 33(3), 313-316. Marra NJ, Eo SH, Hale MC et al. (2012) A priori and a posteriori approaches for finding genes of evolutionary interest in non-model species: Osmoregulatory genes in the kidney transcriptome of the desert rodent Dipodomys spectabilis (banner-tailed kangaroo rat). Comparative Biochemistry and Physiology Part D, 7(4), 328-339. Maurer D, Comps M, His E (1986) Caractéristiques des mortalités estivales de l'huître Crassostrea gigas dans le bassin d'Arcachon. Haliotis, (15), 309-317. Meng J, Zhu Q, Zhang L et al. (2013) Genome and Transcriptome Analyses Provide Insight into the Euryhaline Adaptation Mechanism of Crassostrea gigas. PloS one, 8(3), e58563. Meyer E, Manahan DT (2010) Gene expression profiling of genetically determined growth variation in bivalve larvae (Crassostrea gigas). The Journal of Experimental Biology, 213(5), 749-758. Meyers TR, Short S, Eaton W (1990) Summer mortalities and incidental parasitisms of cultured Pacific oysters in Alaska. Journal of Aquatic Animal Health, 2(3), 172-176.

This article is protected by copyright. All rights reserved.

Accepted Article

Miller PA, Elliott NG, Koutoulis A et al. (2012) Genetic diversity of cultured, naturalized, and native Pacific oysters, Crassostrea gigas, determined from multiplexed microsatellite markers. Journal of Shellfish Research, 31(3), 611-617. Nielsen R, Paul JS, Albrechtsen A et al. (2011) Genotype and SNP calling from next-generation sequencing data. Nature Reviews Genetics, 12(6), 443-451. Novaes E, Drost DR, Farmerie WG et al. (2008) High-throughput gene and SNP discovery in Eucalyptus grandis, an uncharacterized genome. BMC Genomics, 9, 312. Olivier D, Heinecken L, Jackson S (2013) Mussel and oyster culture in Saldanha Bay, South Africa: potential for sustainable growth, development and employment creation. Food Security, 1-17. Park YJ, Lee JK, Kim NS (2009) Simple sequence repeat polymorphisms (SSRPs) for evaluation of molecular diversity and germplasm classification of minor crops. Molecules, 14, 4546-4569. Pepin JF, Soletchnik P, Lupo C et al. (2011, September) Situation of wild beds of Crassostrea gigas regarding OsHV-1 µvar: spatiotemporal description of contamination in Pacific oyster wild beds in Marennes Oleron Bay. In 15th International European Association of Fish Pathologists conference, Split, Croatia, 12-16th September 2011. Plough LV, Hedgecock D (2011) Quantitative trait locus analysis of stage-specific inbreeding depression in the Pacific oyster Crassostrea gigas. Genetics, 189(4), 1473-1486. Qi H, Liu X, Wu F et al. (2010) Development of gene-targeted SNP markers for genomic mapping in Pacific abalone Haliotis discus hannai Ino. Molecular biology reports, 37(8), 3779-3784. Qi H, Wu Q, Li L et al. (2009) Development and characterization of microsatellite markers for the Pacific Oyster Crassostrea gigas Conservation Genet Resour, 1, 451-453. Rico-Villa B, Pouvreau S, Robert R (2009) Influence of food density and temperature on ingestion, growth and settlement of Pacific oyster larvae, Crassostrea gigas. Aquaculture, 287(3), 395-401. Roberts SB, Hauser L, Seeb LW et al. (2012) Development of genomic resources for Pacific Herring through targeted transcriptome pyrosequencing. PloS one, 7(2), e30908. Roque A, Carrasco N, Andree KB et al. (2012) First report of OsHV-1 microvar in Pacific oyster (Crassostrea gigas) cultured in Spain. Aquaculture, 324, 303-306. Sauvage C, Bierne N, Lapegue S et al. (2007a) Single Nucleotide polymorphisms and their relationship to codon usage bias in the Pacific oyster Crassostrea gigas. Gene, 406(1), 13-22. Sauvage C, Boudry P, Lapegue S (2009) Identification and characterization of 18 novel polymorphic microsatellite makers derived from expressed sequence tags in the Pacific oyster Crassostrea gigas. Molecular Ecology Resources, 9(3), 853-855. Sauvage C, Boudry P, De Koning DJ et al. (2010) QTL for resistance to summer mortality and OsHV-1 load in the Pacific oyster (Crassostrea gigas). Animal genetics, 41(4), 390-399.

This article is protected by copyright. All rights reserved.

Accepted Article

Sauvage C, Lapègue S, Boudry P (2007b) Identification of SNPs and their mapping for the identification of QTLs of resistance to summer mortality in the Pacific oyster Crassostrea gigas. Abstracts / Aquaculture, 272(S1): S238-S321 Schlessinger A, Matsson P, Shima JE et al. (2010) Comparison of human solute carriers. Protein Science, 19(3), 412-428. Seeb LW, Antonovich A, Banks MA et al. (2007) Development of a standardized DNA database for Chinook salmon. Fisheries, 32(11), 540-552. Seipp MT, Durtschi JD, Liew MA et al. (2007) Unlabeled Oligonucleotides as Internal Temperature Controls for Genotyping by Amplicon Melting. J MolDiagn, 9(3), 284-289. Sekino M, Hamaguchi M, Aranishi F et al. (2003) Development of novel microsatellite DNA markers from the Pacific oyster Crassostrea gigas. Mar Biotechnol (NY), 5(3), 227-233. Smith BL, Lu CP, Bremer JRA (2010) High-resolution melting analysis (HRMA) a highly sensitive inexpensive genotyping alternative for population studies. Molecular Ecology Resources, 10(1), 193-196. Trapnell C, Pachter L, Salzberg SL (2009) TopHat: discovering splice junctions with RNA-Seq. Bioinformatics, 25(9), 1105-1111. Trick M, Long Y, Meng JL et al. (2009) Single nucleotide polymorphism (SNP) discovery in the polyploid Brassica napus using Solexa transcriptome sequencing. Plant Biotechnol J, 7(4), 334-346. Ujino-Ihara T, Taguchi Y, Moriguchi Y et al. (2010) An efficient method for developing SNP markers based on EST data combined with high resolution melting (HRM) analysis. BMC research notes, 3(1), 51. Weir BS (1979) Inferences about linkage disequilibrium. Biometrics 35, 235-254. Yan J, Yang X, Shah T et al. (2010) High-throughput SNP genotyping with the Golden Gate assay in maize. Mol Breeding, 25(3), 441-451. Yang SS, Tu ZJ, Cheung F et al. (2011) Using RNA-Seq for gene identification, polymorphism detection and transcript profiling in two alfalfa genotypes with divergent cell wall composition in stems. BMC Genomics, 12, 199. Ye MH, Chen JL, Zhao GP et al. (2010) Sensitivity and specificity of high-resolution melting analysis in screening unknown SNPs and genotyping a known mutation. AnimSci Pap Rep, 28(2), 161-170. Yeh FC, Yang R, Boyle TJ et al. (2000) POPGENE 32, Microsoft Windows-Based Freeware for Population Genetic Analysis. Molecular Biology and Biotechnology Centre, University of Alberta, Edmonton, Canada. Yu H, Li Q (2007) EST-SSR markers from the Pacific oyster, Crassostrea gigas. Molecular Ecology Notes, 7(5), 860-862.

This article is protected by copyright. All rights reserved.

Accepted Article

Yu H, Li Q (2008) Exploiting EST databases for the development and characterization of EST–SSRs in the Pacific oyster (Crassostrea gigas). Journal of Heredity, 99(2), 208-214. Yu Z, Wang Y, Fu D (2010) Development of Fifty-one novel EST-SSR loci in the Pacific oyster, Crassostrea gigas by data mining from the public EST database. Conservation Genetics Resources, 2(1), 13-18. Zhang G, Fang X, Guo X et al. (2012) Oyster genome reveals stress adaptation and complexity of shell formation. Nature, 490, 49-54. Zhong X, Li Q, Yu H et al. (2013) Development and Validation of Single‐nucleotide Polymorphism Markers in the Pacific Oyster, Crassostrea gigas, Using High‐resolution Melting Analysis. Journal of the World Aquaculture Society, 44(3), 455-465. Zhou LM, Myers AN, Vandersteen JG et al. (2004) Closed-tube genotyping with unlabeled oligonucleotide probes and a saturating DNA dye. Clinical Chemistry, 50(8), 1328-1335. Authors’ contributions Jiafeng Wang exploited the genotyping platform based on the HRM technique, conducted the SNPs validation and wrote the paper. Haigang Qi performed the in silico putative SNP discovery and relevant data analysis and wrote the paper. Li Li performed the sampling of animal materials and transcriptome analysis and revised the manuscript. Huayong Que performed data analysis for the additional supporting files and revised the manuscript. Di Wang performed validation of some of the SNPs. Guofan Zhang planned the work and thoroughly revised the paper. Competing interests The authors declare that they have no competing interests. Data accessibility Transcriptome sequences: NCBI Sequence Read Archive (http://www.ncbi.nlm.nih.gov/Traces/sra/) accession number: QD: SRA062052, QH: SRA098807, DL: SRA098809. All images of the HRM curves of the 1,671 SNPs genotyped can be freely found at http://oysterdb.cn/art/supp/snp/snp.html Detailed information, including the 1,671 screened loci, the 0.58 million putative SNPs and the expression values of the 275 stress responsive genes have been provided as online supplemental data.

This article is protected by copyright. All rights reserved.

Accepted Article

Legends Figures Fig. 1 Classification of putative SNPs identified from the combination of Qingdao, Qinhuangdao and Dalian populations. The putative polymorphic SNPs are marked with grey bars, and putative monomorphic SNPs are marked with black bars. The letter before “>” indicates the base type in the oyster genome, i.e., the reference base, and the letter after “>” indicates the base type in the combined population. Y indicates C/T, R indicates A/G, M indicates A/C, K indicates G/T, W indicates A/T, and S indicates C/G. Fig. 2 Distribution of the SNP density in genes from the combined population of Qingdao, Qinhuangdao and Dalian. The bar (left y-axis) indicates the number of genes in the given SNP density range. The dotted curve (right y-axis) indicates the cumulative percentage of genes in the given SNP density range. Fig. 3 High-resolution melting curves of Cg_cSNP_1545 confirmed the presence of a polymorphic SNP. A) Original melting curves. B) Melting peaks after logarithm calculation. C) Normalized melting curves of the amplicons. The homozygote produced only one peak (a, b), whereas the heterozygote showed two peaks (c, d). One peak represented the Tm of mismatched hybrid amplicons (c), and the other represented the Tm of perfectly matched amplicons (d). The perfectly matched amplicon showed a higher characteristic melting temperature (Tm) than the hybrid mismatched amplicon. Fig. 4 Workflow of SNP development in the oyster. 1 Proportions in all of the primers screened by HRM. 2 Proportions in all of the primers successfully screened by HRM. Supplementary Data Additional supplementary information can be found in the online version of this article. File S1. The breadth of coverage of all genes in the predicted gene set of the oyster genome mapped by transcriptome sequencing reads. The file contains the breadth of coverage from Qingdao, Qinhuangdao, Dalian and the combined population, in addition to the coordinates of the coding regions with a minimum sequencing depth of 4 in the combined population. File S2. List of the 0.58 million putative SNPs found in the study. The file contains the putative SNP information including the scaffold ID, coordinate, the reference base at the corresponding position, SNP type and gene ID. File S3. Summary of the 1,671 putative SNPs genotyped by HRM analysis. The table contains putative SNP type, primer sequences, melting temperature of PCR, expected product sequence and gene annotation. File S4. Expression values of the 275 stress-responsive genes under varying degrees of different stimuli, including salinity, temperature and exposure to air and heavy metals. The expression levels were estimated by the reads per kilobase per million reads (RPKM) measure. The annotation of GO, KEGG and Swiss-Prot was provided for each gene.

This article is protected by copyright. All rights reserved.

Accepted Article

File S5. SNP confirmation by Sanger sequencing. A) Melting curves with three types of genotypes marked with c, d and g. B) PCR sequence alignment. Three types of amplicons corresponding to different melting curves were sequenced, respectively, and four clones were listed in each type of selected amplicon. Ref: reference sequence; a and c: sequence of the two homozygotes; b: sequence of the heterozygote. File S6. Existence of unpredicted SNPs in addition to the target SNP in the same amplicon.

A) High-resolution melting peaks of the amplicons that contained an unpredicted SNP in addition to the target SNP. a: Melting curves of one homozygote; b, c and d: melting curves of different heterozygotes. B) Sequence alignment of the amplicons indicates the presence of the extra SNP in the same amplicon. Light gray is used to highlight the target SNP, whereas dark gray shows the unpredicted SNP. Ref: the reference sequence. File S7. The genotyping result of Cg_cSNP_635 with different numbers of samples. A) Melting curves of Cg_cSNP_635 for eight samples. B) Melting curves of Cg_cSNP_635 for 48 samples. The same regular melting curves were acquired with different numbers of samples. File S8. High-resolution melting peaks with/without temperature calibration of Cg_cSNP_1965 (A1 and A2) and Cg_cSNP_2012 (B1 and B2). A1, B1) High-resolution melting peaks without temperature calibration. A2, B2) High-resolution melting peaks with temperature calibration. After temperature calibration, the tightly integrated melting curves made it easier to distinguish one peak from the other.

This article is protected by copyright. All rights reserved.

ccepted Article

Table 1. Statistics of samples used in the experiment

sample groups G1

G2

source

sample size

Qingdao

20

Qinhuangdao

20

Dalian

20

Qingdao

3

storage

sample use RNA,

liquid nitrogen

transcriptome sequencing

DNA, Qinhuangdao

2

Dalian

3

ethanol

HRM validation DNA,

G3

Qingdao

48

This article is protected by copyright. All rights reserved.

ethanol

Selected SNPs Characterization

ccepted Article

Table 2. Summary of transcriptome sequencing and mapping to the oyster genome

Mapped bases (Gbp)

Map rate (%)

Coverage

5.35

3.94

73.6

107

77.7

4.85

3.52

72.6

96

38.5

76.4

4.53

3.23

71.2

88

127.4

77.8

14.74

10.68

72.5

291

No. of reads (M)

Mapped reads (M)

Map rate (%)

No. of bases

QD

59.5

47.0

79.1

QH

53.9

41.9

DL

50.3

Combined

163.7

Population *

(Gbp)

* QD: Qingdao; QH: Qinhuangdao DL: Dalian. The combination of the three populations is denoted as “combined”.

This article is protected by copyright. All rights reserved.

(×)

ccepted Article

Table 3. Summary of putative SNPs identified from populations QD, QH and DL and the combined population QD 1

QH

DL

Combined

≥ 50

= 100

≥ 50

= 100

≥ 50

= 100

≥ 50

= 100

Gene_no

17,096

2,434

17,126

2,847

17,575

2,939

20,140

4,905

Gene rate (%)

61.0

8.7

61.1

10.2

62.7

10.5

71.9

17.5

Gene size (Mbp)

26.3

2.7

26.3

3.2

26.9

3.3

29.8

6.1

Covered size (Mbp)

23.0

2.7

23.2

3.2

23.8

3.3

27.3

6.1

Covered rate (%)

87.4

100

88.3

100

88.5

100

91.5

100

SNP no. (K)

413.2

51.3

415.3

60.6

432.4

64.7

582.0

135.2

SNP density (%)

1.80

1.91

1.79

1.89

1.82

1.95

2.13

2.20

2

111.1

11.6

111.4

13.8

117.3

14.8

168.1

32.9

ncSNP density (%)

0.48

0.43

0.48

0.43

0.49

0.45

0.62

0.54

3

302.1

39.8

303.9

46.8

315.1

49.9

413.9

102.3

scSNP density (%)

1.31

1.48

1.31

1.46

1.33

1.51

1.52

1.66

4

Mono SNP no. (K)

66.5

3.7

68.3

4.6

69.5

5.0

63.6

7.2

5

Poly SNP no. (K)

346.7

47.6

347.1

56.0

362.9

59.7

518.4

128.0

ncSNP no. (K)

scSNP no. (K)

1 The breadth of coverage of genes used for SNP calling was equal to or greater than 50%. 2 Putative non-synonymous cSNP. 3 Putative synonymous cSNP.

This article is protected by copyright. All rights reserved.

ccepted Article

4 Putative monomorphic SNP.

5 Putative polymorphic SNP.

Table 4. GO enrichment of genes with a SNP density of ≥ 5% GO_ID

GO_Term

GO_Class*

P value

No. of genes

GO:0006412

translation

BP

0.000000

30

GO:0006470

protein dephosphorylation

BP

0.000000

20

GO:0051258

protein polymerization

BP

0.000002

8

GO:0044267

cellular protein metabolic process

BP

0.000002

58

GO:0019538

protein metabolic process

BP

0.000022

67

GO:0006414

translational elongation

BP

0.001135

4

GO:0006801

superoxide metabolic process

BP

0.001589

4

GO:0032991

macromolecular complex

CC

0.000000

43

GO:0043232

intracellular non-membrane-bounded organelle

CC

0.000000

39

GO:0030529

ribonucleoprotein complex

CC

0.000000

28

GO:0005840

ribosome

CC

0.000000

27

GO:0044444

cytoplasmic part

CC

0.000000

31

GO:0005874

microtubule

CC

0.000002

8

This article is protected by copyright. All rights reserved.

ccepted Article

GO:0005737

cytoplasm

CC

0.000057

33

GO:0044430

cytoskeletal part

CC

0.000062

12

GO:0005198

structural molecule activity

MF

0.000000

35

GO:0003735

structural constituent of ribosome

MF

0.000000

27

GO:0005102

receptor binding

MF

0.000000

26

GO:0004725

protein tyrosine phosphatase activity

MF

0.000000

20

GO:0005515

protein binding

MF

0.000496

150

GO:0005044

scavenger receptor activity

MF

0.004141

8

*BP: biological process; CC: cellular component; MF: molecular function

Table 5. GO enrichment of genes with a non-synonymous cSNP density of ≥ 2% GO_ID

GO_Term

GO_Class

P value

No. of genes

GO:0006470

protein dephosphorylation

BP

0.000000

25

GO:0006915

apoptosis

BP

0.004130

12

GO:0009607

response to biotic stimulus

BP

0.008942

6

GO:0005102

receptor binding

MF

0.000000

40

GO:0005515

protein binding

MF

0.000000

299

This article is protected by copyright. All rights reserved.

ccepted Article

GO:0004721

phosphoprotein phosphatase activity

MF

0.000000

25

GO:0004725

protein tyrosine phosphatase activity

MF

0.000000

24

GO:0005488

binding

MF

0.000000

455

GO:0030414

peptidase inhibitor activity

MF

0.000008

18

GO:0004197

cysteine-type endopeptidase activity

MF

0.000040

12

GO:0008234

cysteine-type peptidase activity

MF

0.000133

14

GO:0005044

scavenger receptor activity

MF

0.000305

12

GO:0004866

endopeptidase inhibitor activity

MF

0.000327

13

GO:0005509

calcium ion binding

MF

0.004130

37

GO:0004402

histone acetyltransferase activity

MF

0.005400

3

GO:0043169

cation binding

MF

0.006425

144

This article is protected by copyright. All rights reserved.

ccepted Article

Table 6. Characterization of 18 SNP loci in 48 oysters of the Qingdao (G3) population

Locus

Obs_Het

Exp_Het1

Nei2

HWE

MAF

Cg_cSNP_485

0.200

0.182

0.180

0.484

0.100

Cg_cSNP_496

0.286

0.281

0.278

0.913

0.167

Cg_cSNP_501

0.478

0.410

0.406

0.251

0.283

Cg_cSNP_502

0.083

0.081

0.080

0.795

0.042

Cg_cSNP_635

0.479

0.503

0.498

0.737

0.469

Cg_cSNP_636

0.125

0.221

0.219

0.002#

0.125

Cg_cSNP_637

0.521

0.505

0.500

0.827

0.490

Cg_cSNP_638

0.370

0.305

0.301

0.138

0.185

Cg_cSNP_648

0.404

0.447

0.442

0.508

0.330

Cg_cSNP_649

0.070

0.190

0.187

0.000#

0.105

Cg_cSNP_652

0.044

0.043

0.043

0.916

0.022

Cg_cSNP_655

0.512

0.484

0.478

0.702

0.395

Cg_cSNP_659

0.458

0.483

0.478

0.717

0.396

Cg_cSNP_660

0.417

0.449

0.444

0.612

0.333

Cg_cSNP_665

0.195

0.438

0.433

0.000#

0.317

This article is protected by copyright. All rights reserved.

ccepted Article

Cg_cSNP_732

0.467

0.505

0.499

0.610

0.478

Cg_cSNP_753

0.396

0.468

0.463

0.278

0.365

Cg_cSNP_761

0.255

0.225

0.223

0.339

0.128

Mean

0.317

0.342

0.339

0.502

0.261

1 Expected heterozygosity was computed using Levene (1949). 2 Nei's (1973) expected heterozygosity. Exp_Het, expected heterozygosity; Obs_Het, observed heterozygosity; MAF, minor allele frequency. # Indicates a significant deviation (P < 0.05) from Hardy-Weinberg equilibrium after Bonferroni correction (k = 18).

This article is protected by copyright. All rights reserved.

ccepted Article

This article is protected by copyright. All rights reserved.

ccepted Article

This article is protected by copyright. All rights reserved.

ccepted Article

This article is protected by copyright. All rights reserved.

ccepted Article

This article is protected by copyright. All rights reserved.

Discovery and validation of genic single nucleotide polymorphisms in the Pacific oyster Crassostrea gigas.

The economic and ecological importance of the oyster necessitates further research on the molecular mechanisms, which both regulate the commercially i...
512KB Sizes 0 Downloads 0 Views