http://informahealthcare.com/aan ISSN: 1939-6368 (print), 1939-6376 (electronic) Syst Biol Reprod Med, 2014; 60(6): 329–337 ! 2014 Informa Healthcare USA, Inc. DOI: 10.3109/19396368.2014.955896

RESEARCH ARTICLE

Functional features and protein network of human sperm-egg interaction Soudabeh Sabetian*, Mohd Shahir Shamsir, and Mohammed Abu Naser

Syst Biol Reprod Med Downloaded from informahealthcare.com by Universitaet Zuerich on 01/03/15 For personal use only.

Department of Biological and Health Sciences, Faculty of Bioscience and Medical Engineering, Universiti Teknologi Malaysia, Johor, Malaysia

Abstract

Keywords

Elucidation of the sperm-egg interaction at the molecular level is one of the unresolved problems in sexual reproduction, and understanding the molecular mechanism is crucial in solving problems in infertility and failed in vitro fertilization (IVF). Many molecular interactions in the form of protein-protein interactions (PPIs) mediate the sperm-egg membrane interaction. Due to the complexity of the problem such as difficulties in analyzing in vivo membrane PPIs, many efforts have failed to comprehensively elucidate the fusion mechanism and the molecular interactions that mediate sperm-egg membrane fusion. The main purpose of this study was to reveal possible protein interactions and associated molecular function during sperm-egg interaction using a protein interaction network approach. Different databases have been used to construct the human sperm-egg interaction network. The constructed network revealed new interactions. These included CD151 and CD9 in human oocyte that interact with CD49 in sperm, and CD49 and ITGA4 in sperm that interact with CD63 and CD81, respectively, in the oocyte. These results showed that the different integrins in sperm may be involved in human spermegg interaction. It was also suggested that sperm ADAM2 plays a role as a protein candidate involved in sperm-egg membrane interaction by interacting with CD9 in the oocyte. Interleukin-4 receptor activity, receptor signaling protein tyrosine kinase activity, and manganese ion transmembrane transport activity are the major molecular functions in sperm-egg interaction protein network. The disease association analysis indicated that spermegg interaction defects are also reflected in other disease networks such as cardiovascular, hematological, and breast cancer diseases. By analyzing the network, we identified the major molecular functions and disease association genes in sperm-egg interaction protein. Further experimental studies will be required to confirm the significance of these new computationally resolved interactions and the genetic links between sperm-egg interaction abnormalities and the associated disease.

Infertility, protein interaction network, reproduction, sperm-egg interaction History Received 22 April 2014 Revised 15 June 2014 Accepted 24 June 2014 Published online 15 September 2014

Abbreviations: PPIs: protein-protein interactions; UMI: unexplained male infertility; CLU: clusterin; PINs: protein interaction networks; GAD: Genetic Association Database; DAVID: Database for Annotation Visualization and Integrated Discovery; IVF: in vitro fertilization; IUI: intra uterine insemination; MS: mass spectrometry; PPPs: phosphoprotein phosphatase; DIP: Database of Interacting Proteins; BIND: Bimolecular Interaction Network Database; BioGRID: Biological General Repository for Interaction Datasets; Pfam: protein families; InterPro: integrative protein; BioCys: Biological Databases; STRING: Search Tool for the Retrieval of Interacting Genes; GEO: Gene Expression Omnibus; DB: Data Base; ZP: Zona Pellucida; ICSI: Intra Cytoplasmic Sperm Injection

Introduction Infertility is a common clinical problem affecting approximately 15% of couples around the globe [Un-Nahar et al. 2013]. The occurrence differs throughout underdeveloped and developed countries, being greater in the former where resources for treatment and diagnosis are inadequate [Winters and Walsh 2014]. Males are partially or solely connected to

*Address correspondence to Soudabeh Sabetian, Faculty of Bioscience and Medical Engineering, Universiti Teknologi Malaysia, 81310 Johor, Malaysia. Tel: +60177615766. E-mail: [email protected]

20–50% of the cases of infertility [Jarow 2007]. Nevertheless, in spite of developments in diagnostic methods and technologies in the field of andrology, there remains a substantial subsection of these subfertile men who are categorized as holding the key to unexplained male infertility (UMI) [Baker et al. 2012; Hamada et al. 2011]. The type UMI is relevant to men with infertility of unidentified origin, having normal semen parameters and where female infertility elements have been excluded [Sigman et al. 2009]. Therefore normospermic infertile men might have defective sperm that are unable to fertilize. This supposition is established in the reflection of low success

Syst Biol Reprod Med Downloaded from informahealthcare.com by Universitaet Zuerich on 01/03/15 For personal use only.

330

S. Sabetian et al.

rates of intra uterine insemination (IUI) and in vitro fertilization (IVF) in specific cases of UMI. In conventional IVF, the chief reason of fertilization failure is owing to abnormalities of sperm-egg membrane interaction [Evans 2012]. Many molecular interactions in the form of protein-protein interactions mediate the sperm-egg membrane interaction [Clark 2010; Krawetz 2005; Rubinstein et al. 2006b]. Janice P. Evans collected the list of candidate sperm proteins for participation in sperm-egg membrane interactions [Evans 2012]. A number of previous studies attempted to find the molecules that were involved in the interaction process. Due to the complexity of the problem such as difficulties in observing membrane protein-protein interactions (PPIs) in vivo, many efforts have failed to comprehensively elucidate the fusion mechanism, leaving the molecular interactions that mediate sperm-egg membrane fusion still poorly understood [Brewis et al. 2005]. Thus the recognition of the candidate proteins involved in the crucial step of fertilization can help to better understand the research into new treatment methods in order to successfully assist reproductive technologies. In this study, all available potential sperm-egg membrane PPIs were retrieved from the selected databases. The retrieved information was then further investigated by constructing and analyzing a PPI network of all the membrane and surface proteins of the sperm and the oocyte proteins, identifying the essential PPI and the corresponding biological roles in the sperm-egg interaction process through various computational tools.

Results and Discussion Construction of the PPI network All collected proteins were loaded into cytoscape 2.8 [Smoot et al. 2011] to construct a sperm and oocyte protein interaction network. The two networks for sperm and egg/ oocyte associated proteins contain 409 protein nodes and 2746 interactions in the oocyte protein network, and 2076 protein nodes and 8565 interactions in the sperm protein network (Figure 1).

Syst Biol Reprod Med, 2014; 60(6): 329–337

Extraction sperm-egg interaction network The networks were merged and the proteins that contain a signal sequence and/or transmembrane domain were extracted. The extracted protein network consists of 86 nodes and 1,149 interactions (Figure 2). The sperm and oocyte proteins identified by mass spectrometry (MS) (see the first step of Methods) were categorized into two groups: sperm protein group and oocyte protein group (Figure 2). The proteins and their interactions in our network are represented in Supplementary File 1 and Supplementary File 2, respectively. Some of these interactions between sperm and oocyte membranes (see Supplementary File 2) have already been identified and their role in human sperm-egg interaction for fertility has been confirmed. Monoclonal antibodies identified prominent sperm proteins containing ADAM1 and ADAM2 [Primakoff et al. 1987] and IZUMO1 [Okabe et al. 1988]. The ADAM family, builds on the identification of sperm ADAM2 (fertilinb) from studies with a fertilization-blocking antibody and is one of the founding members of the ADAM family [Singson et al. 2008]. Sperm ADAMs are binding partners for numerous members of the integrin family [Edwards et al. 2008]. Many of the integrins are expressed in eggs and can contribute in sperm-egg interaction. ADAM3 has been suggested to be a sperm protein that facilitates sperm-ZP (Zona Pellucida) interaction on the basis that Adam3-null sperm bind poorly to the ZP [Evans 2012]. Even though these data might specify that ADAM3 binds a ZP component(s) directly, it is similarly probable that Adam3-null sperm lack crucial proteins for ZP interaction as Adam3-null sperm have an altered surface proteome, with reduced amounts of several ADAMs. Adam3-null sperm possess an altered surface proteome and, therefore, might lack other proteins [Han et al. 2009; Kim et al. 2006b]. IZUMO1 residing on the sperm surface is an immunoglobulin superfamily member (with an immunoglobulin-like (Ig) domain) that is vital for spermoocyte fusion [Inoue et al. 2005]. The role of IZUMO1 is not fully understood. IZUMO1 might act via IZUMO1-associated proteins (in cis) and/or might function in trans by interacting with a molecule on the egg. Tetraspanin CD9 is the key actor

Figure 1. Sperm and oocyte protein network. A: The sperm associated proteins network; B: The oocyte associated proteins network, using cytoscape 2.8.3. These protein interaction networks consist of A: 2076 nodes and 8565 interaction and B: 409 nodes and 2746 interaction among those protein.

Syst Biol Reprod Med Downloaded from informahealthcare.com by Universitaet Zuerich on 01/03/15 For personal use only.

DOI: 10.3109/19396368.2014.955896

Protein network of human sperm-egg interaction

331

Figure 2. The sperm-egg interaction network. The identified sperm and oocyte proteins by MS proteomics technology are represented in red and yellow respectively. The green nodes are the expressed proteins in both organelles.

recognized thus far in mouse eggs and is expected to function in combination with an additional tetraspanin, CD81, as Cd9//Cd81/ female mice are infertile [Rubinstein et al. 2006a]. The CD9 might work by interacting with a sperm protein in trans. Although there are data from antibody inhibition, the contribution of tetraspanin in human fertilization is not fully understood. Human sperm-egg fusion is partly repressed by a tetraspanin CD151 antibody [Ziyyat et al. 2006]. These data suggest that sperm-egg fusion in diverse mammalian species might depend on different members of the tetraspanin family. The protein interaction network was derived using different databases that consist of known and predicted protein interactions. New protein interaction in human sperm-egg interaction New predicted interactions between the groups have been identified based on a ‘combined score’ calculated between any pair of proteins which were predicted using the STRING

database. As a result, with high confidence (40.700) only new sperm-egg interactions that have not been previously reported in human PPIs were identified. A total of 13 proteins participating in 12 high confidence interactions (Table 1) were selected for further analysis. For example, the results indicated that SERPINE1 (plasminogen activator inhibitor) in sperm play a role in sperm-egg interaction by interacting with SERPING1 (plasma protease C1 inhibitor) in the oocyte. The plasminogen activation system is related to a variety of pathological and physiological processes, e.g. regeneration, thrombolysis, wound healing and morphogenesis, in addition to tumor invasion [Deryugina and Quigley 2012]. Plasmin, a protease capable of digesting a number of extracellular matrix proteins and of activating other proteases, is produced from the zymogen plasminogen by proteolytic cleavage [Syrovets et al. 2012]. SERPING1 is present on the surface, in the tail, and in the acrosome of mature spermatozoa [Ferrer et al. 2012; Manske et al. 1994]. The sperm-egg interaction has also been described as sensitive to serine proteinase inhibitors

332

S. Sabetian et al.

Syst Biol Reprod Med, 2014; 60(6): 329–337

Table 1. The new interactions between sperm and oocyte proteins that are extracted from the merged network.

Syst Biol Reprod Med Downloaded from informahealthcare.com by Universitaet Zuerich on 01/03/15 For personal use only.

Sperm proteins

Oocyte protein

Interaction detection method

Combined-score

ITGA3 (CD49C)

CD151

ITGA3 (CD49C) ITGA2 (CD49B)

CD63 CD9

SERPINE1 (plasminogen activator inhibitor) SERPINE1 (plasminogen activator inhibitor) HYOU1 (Hypoxia up-regulated protein 1) ITGA4 (CD49D) CD9

SERPING1 (Plasma protease C1 inhibitor) CLU/Clus (Clusterin)

Co-expression Physical interaction Co-expression Physical interaction Genetic interaction Co-Localization Shared protein domains Co-expression Co-expression

CLU/Clus

Co-expression

0.829

CD81 CD63

0.972 0.949

CD9

CD81

CLU/Clus ADAM2 IZUMO1

SERPING1 CD9 CD9

Co-expression Physical interaction Shared protein domains Co-expression Shared protein domains Co-expression Co-expression Physical interaction Text mining

[Simmons et al. 2013]. Collectively, these outcomes suggest a role of the plasminogen activation in sperm-egg interaction [Shen et al. 2013]. Hypoxia plays a critical part in various pathophysiological situations, comprising cancer biology, whereas hypoxia-inducible factor (HIF) controls transcriptional reactions underneath hypoxia [Shoji et al. 2013]. It has been reported that hypobaric hypoxia is responsible for altered male reproductive function. The mechanism of action concerning fertility has not been clearly established [Vargas et al. 2011]. A number of phosphoprotein phosphatase (PPPs) family members have been revealed to be present in sperm, signifying a noteworthy function in this cell [Fardilha et al. 2011]. One type of protein phosphatase 1 (PP1g2), is localized to the posterior region of the sperm head, the equatorial region, and is implicated in sperm-egg binding [Huang and Vijayaraghavan 2004]. Our protein network indicated that tetraspanins: CD151 and CD9 in human oocyte interacts with CD49 in sperm, and CD49 interacts with CD63 and CD81 in the oocyte. Numerous tetraspanins, comprising CD81 and CD9, have seemingly indirect (and still poorly described) roles in membrane fusion progression [Fanaei et al. 2011]. Cd81-null females have modest damage to reproductive function [Rubinstein et al. 2006a]. CD81 is an associated tetraspanin that is 45% similar to CD9. The Cd81/ mouse also exhibited flaws in sperm-egg interaction with in vitro–fertilized and in vivo–fertilized eggs [Rubinstein et al. 2006a]. Ziyyat and colleagues [2006] found that human sperm-egg fusion is moderately repressed by treating eggs with an antibody to a different tetraspanin, CD151. CD9 is a member of the tetraspanin family (so named as members have four transmembrane domains) [Hemler 2005; Rubinstein 2011]. Cd9/ females, are severely subfertile [Miyado et al. 2000]. It has been demonstrated that the different sperm tetraspanins may be involved in human sperm-egg interaction. This approach showed that the oocyte CD9 plays some role in sperm-egg interaction by interacting with sperm

0.912 0.735 0.748 0.916 0.908

0.996 0.900 0.799 0.766

CD49, IZUMO1, and ADAM2. IZUMO1 is vital for the sperm oocyte binding and CD9 is also essential. It is appealing to suggest that they interact with each other to form a fusogenic compound. If these proteins are actually interacting, it is possible that they are required on both the surfaces of the sperm and the oocyte [Ikawa et al. 2010]. Several sperm ADAMs (where ADAM signifies a ‘metalloprotease’ and a ‘disintegrin’) have been associated with the sperm-egg interaction. Even though no single ADAM is critical, there seems to be a relationship between the capability of sperm to interact with the oocyte membrane and the presence/levels of certain ADAM proteins. In accord with these results, sperm ADAM2 is a protein candidate involved in sperm-egg membrane interaction through the oocyte CD9 and ZP3. In several Adam knockouts, the sperm displays reduced passage into the oviduct via the uterotubal junction, decreased binding to the ZP, and/or reduced fusion and binding to the egg plasma membrane, as well as abnormalities in the sperm surface proteome with the loss of multiple ADAMs [Kim et al. 2006a; Vjugina and Evans 2007]. The Adam2/ knockout has the utmost severe flaws in gamete membrane interactions and has the lowest overall degree of ADAM proteins on the sperm surface [Cho et al. 1998; Han et al. 2009; Nishimura et al. 2001]. Other Adam knockouts have little or no obvious impact on male reproduction [Horiuchi et al. 2003; Kim et al. 2006b]. Clusterin (CLU) have been identified in the sperm proteome, but their sperm function has yet to be elucidated [Dorus et al. 2012]. Our findings show that CLU has an interaction with SERPING1 in oocyte, identified as a co-expression feature. According to these results, CLU may play a significant role in sperm-egg interaction. CD9 also has been identified in sperm [Baker et al. 2007] co-expression detection method indicating that CD9 interacts with some oocyte proteins, e.g. CD63, CD81. Therefore, it is tempting to speculate that human sperm CD9 enables the sperm and egg interaction.

Syst Biol Reprod Med Downloaded from informahealthcare.com by Universitaet Zuerich on 01/03/15 For personal use only.

DOI: 10.3109/19396368.2014.955896

Protein network of human sperm-egg interaction

333

Figure 3. ClueGO analysis of sperm-egg interaction protein network. The sperm-egg interaction network analyzed with the ClueGO program to identify the major molecular function involved in the protein network. The result showed that the majority of the proteins are involved in growth factor binding, transmembrane receptor protein kinase activity, glycosaminoglycan binding, interleukin-4 receptor activity, receptor signaling protein tyrosine kinase activity, manganese ion transmembrane transport activity, and receptor signaling protein activity.

Molecular function analysis The sperm-egg interaction network was analyzed with the ClueGO program to identify the major molecular function involved in the protein network. The result suggested that the majority of the proteins are involved in growth factor binding, transmembrane receptor protein kinase activity, glycosaminoglycan binding, interleukin-4 receptor activity, receptor signaling protein tyrosine kinase activity, manganese ion transmembrane transport activity, and receptor signaling protein activity (Figure 3). Related molecular functions include interleukin-4 receptor activity, receptor signaling protein tyrosine kinase activity, and manganese ion transmembrane transport activity. The remainder are common signal transduction and receptor activities. Manganese is an anti-oxidant that prevents the damaging effects of oxidative stress (OS) and has been used to improve the in vitro environment to aid successful IVF or ICSI (Intra Cytoplasmic Sperm Injection) [Cheema et al. 2009]. Our results show that the manganese ion transport activity is an important molecular function for sperm-egg interaction that if supressed finally leads to infertility. The other significant molecular function is receptor signaling protein tyrosine kinase activity. Confocal immunofluorescence analysis with antibodies to phosphotyrosine and phosphorylated protein tyrosine kinases allowed detection of minute signaling events localized to the site of sperm-oocyte interaction that were not amenable to biochemical analysis. The results provide evidence for localized accumulation of phosphotyrosine at the site of sperm contact, binding, or fusion, which suggests active protein tyrosine kinase signaling prior to and during sperm incorporation [McGinnis et al. 2013]. Interleukins typically function as intracellular messengers, mediating their effect via specific receptors on target cells. The principal functions of these cytokines include many complementary and many conflicting roles central to the

induction, regulation, and functioning of the immune system in mammals [O’Connell and McInerney 2005]. Their function is governed by binding to a receptor bound to a T cell membrane. The receptor is responsible for transmitting a signal into the cell upon binding the appropriate ligand [Akdis et al. 2011]. In our result IL2RG, IL4R (see Supplementary File 1) are the interleukin receptors in the sperm-egg interaction protein network. Sperm protein Il4i1 (interleukin 4 induced gene) is present in the acrosome and membrane fraction of sperm, however its function remains to be investigated [Stein et al. 2006]. Database for Annotation Visualization and Integrated Discovery (David) analysis The disease genes associated with sperm-egg interaction disorder were identified using the Genetic Association Database (GAD) with the DAVID bioinformatics tools. We restricted disease associated terms using 0.01 as a cutoff for the Benjamini–Hochberg multiple testing correction p value (Table 2). The results indicate that 23 genes of our protein network are involved in cardiovascular, three genes in hematological, and 12 genes are involved in breast cancer disease (the associated genes are provided in Supplementary File 3). It is interesting that others have found a link between parental fertility status and cardiovascular risk but concluded that it is not conclusive if the infertility is associated with an increased risk of cardiovascular disease [Eisenberg et al. 2011; Verit et al. 2014]. Three genes of our network are related to blood transfusion complications. Parental ABO incompatibility has been implicated as a possible contributor to infertility but was not a significant contributor to infertility [Ganitha et al. 2012]. Our results showed that 12 genes from the sperm-egg interaction network were linked to breast cancer. Meggiorini et al. [2011],reported that female

334

S. Sabetian et al.

Syst Biol Reprod Med, 2014; 60(6): 329–337

Table 2. Disease association of candidate genes in sperm-egg interaction network. Category GENETIC_ASSOCIATION_DB_DISEASE GENETIC_ASSOCIATION_DB_DISEASE GENETIC_ASSOCIATION_DB_DISEASE GENETIC_ASSOCIATION_DB_DISEASE GENETIC_ASSOCIATION_DB_DISEASE GENETIC_ASSOCIATION_DB_DISEASE GENETIC_ASSOCIATION_DB_DISEASE GENETIC_ASSOCIATION_DB_DISEASE

Syst Biol Reprod Med Downloaded from informahealthcare.com by Universitaet Zuerich on 01/03/15 For personal use only.

GENETIC_ASSOCIATION_DB_DISEASE GENETIC_ASSOCIATION_DB_DISEASE GENETIC_ASSOCIATION_DB_DISEASE GENETIC_ASSOCIATION_DB_DISEASE GENETIC_ASSOCIATION_DB_DISEASE GENETIC_ASSOCIATION_DB_DISEASE GENETIC_ASSOCIATION_DB_DISEASE GENETIC_ASSOCIATION_DB_DISEASE GENETIC_ASSOCIATION_DB_DISEASE

p Value

Term

Count

%

myocardial infarct; atherosclerosis, coronary myocardial infarction; stroke brain hemorrhage cerebrovascular disease; sickle cell anemia thromboembolism, venous myocardial infarct retinal vascular occlusion myocardial infarct; lymphoproliferative disorders; restenosis blood transfusion complications metabolism disorders; myocardial infarction; stroke, ischemic myocardial infarction; stroke, ischemic coronary artery stent thrombosis stroke, ischemic myocardial infarction breast cancer stroke atherosclerosis, coronary

7 4 5 5 6 9 4 3

0.738397 0.421941 0.527426 0.527426 0.632911 0.949367 0.421941 0.316456

3.66E-06 1.18E-04 3.04E-04 3.04E-04 3.58E-04 4.63E-04 5.30E-04 7.01E-04

3 3

0.316456 0.316456

7.01E-04 7.01E-04

3 3 6 6 12 5 8

0.316456 0.316456 0.632911 0.632911 1.265823 0.527426 0.843882

0.001387359 0.001387359 0.001531833 0.001939036 0.003455614 0.006961194 0.008969006

patients with primary infertility might represent a group at high risk for breast cancer. Our findings suggest that defective sperm-egg interaction may be a surrogate for other genes that are involved in different diseases. However further experimental studies are required to clarify the significance of these genetic links between sperm-egg interaction abnormalities and the associated disease (refer to table 2). This protein interaction network of human membrane and surface sperm-egg interaction proteins has been created using several computational approaches. This protein network identified a set of candidate sperm-egg protein interactions. New protein-protein interactions were resolved, e.g. CD49CD151, CD49-CD63, CD49-CD9, ITGA4-CD81, and IZUMO1-CD9. In addition, some proteins of the plasminogen activation system such as SERPINE1 and SERPING1 may play important roles in sperm-egg interaction. These interactions suggested that the different sperm integrins may also be involved in human sperm-egg interaction. The results also revealed that some proteins necessary for fertility like CD151 in humans might play an important role in promoting sperm-– egg interaction. Interleukin-4 receptor activity, receptor signaling protein tyrosine kinase activity, and manganese ion transmembrane transport activity are the major molecular functions in the sperm-egg interaction protein network. The disease association analysis indicated that successful spermegg interaction may be a surrogate of other diseases such as cardiovascular, hematological, and breast cancer disease. Further experimental studies will be required to confirm the importance of these new predicted interactions and genetic links between human sperm-egg interaction and other associated diseases.

Methods Initial collection of associated proteins with human sperm and egg We only considered spermatozoa and oocyte proteins identified by MS proteomics technology. As a methodological approach, all the articles that were retrieved from PubMed

were considered for inclusion, with the keywords ‘‘human’’, ‘‘sperm’’, ‘‘spermatozoa’’, ‘‘spermatozoon’’, ‘‘human’’, and ‘‘oocyte’’, ‘‘egg’’, ‘‘human’’ combined with the keyword ‘‘proteome", ‘‘proteomics’’, or ‘‘mass spectrometry’’. The only far-reaching human sperm proteome and oocyte proteome analysis available to date are by Baker et al. [2007] on human sperm proteome and Assou et al. [2006] on human egg proteome. We also used one MS-based proteomic database known as the Human Sperm Proteome Database (HSPD at http://reprod.njmu.edu.cn/hspd/). This website was designed to be user-friendly, where the proteome could be easily downloaded. The proteins involved in these studies were collected with our own UniProt ID using UniProt ID mapping (http://www.uniprot.org/?tab¼mapping). Construction of the protein-protein interaction (PPI) network As the first step to construct the PPI network, the proteinprotein interactions of the collected proteins were identified. To achieve this goal we used a variety of databases that catalog PPIs based on different features: human physical PPIs cataloged in Database of Interacting Proteins (DIP) [Salwinski et al. 2004], Bimolecular Interaction Network Database (BIND) [Alfarano et al. 2005], and Biological General Repository for Interaction Datasets (BioGRID) [Chatr-aryamontri et al. 2013]; human gene co-expression derived from DNA microarray studies in Gene Expression Omnibus (GEO) [Barrett et al. 2013]; human genetic interactions from Biological General Repository for Interaction Datasets (BioGRID) [Chatr-aryamontri et al. 2013]; shared protein-protein domain from integrative protein (InterPro) [Mulder et al. 2007], SMART [Letunic et al. 2012], and protein families (Pfam) [Finn et al. 2010]; biological process annotations from Ontology (GO) [Carbon et al. 2009]; copathway interactions from Reactome [Croft et al. 2011], BioCyc [Karp et al. 2005], PathwayCommons [Cerami et al. 2011]; and finally PPIs extracted from biomedical literature by the text-mining engine GeneWays [Rzhetsky et al. 2004]. We also used Search Tool for the Retrieval of Interacting Genes (STRING) database [Franceschini et al. 2013] to

Protein network of human sperm-egg interaction

Syst Biol Reprod Med Downloaded from informahealthcare.com by Universitaet Zuerich on 01/03/15 For personal use only.

DOI: 10.3109/19396368.2014.955896

predict PPI for our candidate proteins. To query our proteins in different databases, different identifiers with various accession numbers were used. All the extracted data are available in the TEXT format and PSI-MI format (Supplementary File 4). All retrieved data from these data sources were combined and loaded into Cytoscape 2.8 [Smoot et al. 2011] to construct the protein interaction network (PIN). These methods were applied to sperm proteins and oocyte proteins separately to map sperm and oocyte PINs. Then, we compared two networks and computed the intersection (overlapping) network using network modification plugin (compare two networks) in Cytoscape. Because our primary aim was to identify protein candidates involved in sperm-egg interactions, we extracted only the proteins that contain a signal sequence and/or transmembrane domain [Stein et al. 2006] by selecting signal peptide and transmembrane features from sequence annotation (features) in extensively curated UniProt database (www.uniprot.org). Possible sperm-egg protein interactions We categorized spermatozoa and oocyte proteins that were identified by MS (see Initial collection of associated proteins with human sperm and egg, the first section of Methods) as two groups (sperm protein nodes and oocyte protein nodes) with the aim to study the interactions and the interconnections between the two groups. PPIs were built using six separate prediction parameters: Neighborhoods, Co-occurrence (phylogenetic profiles), Fusion, Co-expression, Experimental Interactions, and Text-mining. Each of these parameters has its own score (raw) of measurements such as intergenic distances, Euclidean distances, fusion z-score, Pearsoncorrelation coefficient, various experimental score (e.g. qualitative binary score), and log-odds score, respectively. Each raw score was benchmarked using the KEGG database. PPIs that occured on the same metabolic KEGG map were considered to be true positive and those that occurred on a different map were not. Due to the sigmoidal correlation between raw score and fraction of PPIs on the same KEGG map, STRING fits those correlations to the hillequation to derive the confidence score. STRING derived scores correspond to the probability of finding the PPI within the same KEGG pathway or map [Franceschini et al. 2013]. Different scores on the same bench mark provide a platform of comparisons among the scores and equivalent scores can be calculated. This equivalency mapping helps to combine the scores into a single score, which express higher confidence and gives higher coverage (number of predicted PPI) at a specific accuracy. STRING uses a score combiner based on the product of probabilities using the following formula: S¼1

N Y

ð1  Si Þ

i

where Si is the probability score for database I, S the combined score, and N the total number of databases to be combined. Si, i, S, N The combined scores were further rescaled into the confidence range from 0.0 to 0.1 combining all the scores. Those indicate: 50.400 (low confidence), 0.400-0.700 (medium confidence), and 40.700 (high

335

confidence). As a result, we selected the high confidence PPIs (40.700). Biological function analysis The major molecular functions of the sperm-egg interaction protein network were identified using the ClueGo plugin of Cytoscape [Bindea et al. 2009]. ClueGo allows us to integrate several ontological sources because in each source, for each gene, there is a large amount of information. ClueGo can extract the non-redundant biological information for a large cluster of genes using, GO, KEGG, BioCarta, REACTOME, and WikiPathways [Bindea et al. 2009]. Disease association GAD [Becker et al. 2004] is an archive of human genetic association studies of complex diseases and disorders. To evaluate candidate with disease association, we mapped our candidate genes to the GAD database using DAVID 6.7 (updated in 2014) (http://david.abcc.ncifcrf.gov/home.jsp), that provides a comprehensive set of functional annotation tools for a set of genes of interest [Huang da et al. 2009]. We selected those terms enriched in our candidate gene list by requiring Benjamini–Hochberg multiple testing corrected p values less than 0.01.

Acknowledgments The authors would like to acknowledge the Universiti Teknologi Malaysia Institutional Research Grant for the funding and Chew Teong Han for technical assistance.

Declaration of interest The authors report no conflicts of interest. The authors alone are responsible for the content and writing of the paper.

Author contributions The work was carried out in collaboration between all the authors. Defined the research theme: MSS; Designed methods and experiments, carried out the computational analysis, and analyzed the data: SS, MAN; Interpreted the results and co-wrote the paper: MSS, SS, MAN.

References Akdis, M., Burgler, S., Crameri, R., Eiwegger, T., Fujita, H., Gomez, E., et al. (2011) Interleukins, from 1 to 37, and interferon-gamma: receptors, functions, and roles in diseases. J Allergy Clin Immunol 127:701–721.e701–70. Alfarano, C., Andrade, C., Anthony, K., Bahroos, N., Bajec, M., Bantoft, K., et al. (2005) The biomolecular interaction network database and related tools 2005 update. Nucleic Acids Res 33: D418–D24. Assou, S., Anahory, T., Pantesco, V., Le Carrour, T., Pellestor, F., Klein, B., et al. (2006) The human cumulus–oocyte complex geneexpression profile. Hum Reprod 21:1705–19. Baker, M.A., Nixon, B., Naumovski, N. and Aitken, R.J. (2012) Proteomic insights into the maturation and capacitation of mammalian spermatozoa. Syst Biol Reprod Med 58:211–17. Baker, M.A., Reeves, G., Hetherington, L., Mu¨ller, J., Baur, I. and Aitken, R.J. (2007) Identification of gene products present in Triton X-100 soluble and insoluble fractions of human spermatozoa lysates using LC-MS/MS analysis. Proteomics Clin Appl 1:524–32.

Syst Biol Reprod Med Downloaded from informahealthcare.com by Universitaet Zuerich on 01/03/15 For personal use only.

336

S. Sabetian et al.

Barrett, T., Wilhite, S.E., Ledoux, P., Evangelista, C., Kim, I.F., Tomashevsky, M., et al. (2013) NCBI GEO: archive for functional genomics data sets—update. Nucleic Acids Res 41:D991–D5. Becker, K.G., Barnes, K.C., Bright, T.J. and Wang, S.A. (2004) The genetic association database. Nat Genet 36:431–2. Bindea, G., Mlecnik, B., Hackl, H., Charoentong, P., Tosolini, M., Kirilovsky, A., et al. (2009) ClueGO: a Cytoscape plug–in to decipher functionally grouped gene ontology and pathway annotation networks. Bioinformatics 25:1091–3. Brewis, I.A., Van Gestel, R.A., Gadella, B.M., Jones, R., Publicover, S.J., Roldan, E.R., et al. (2005) The spermatozoon at fertilisation: current understanding and future research directions. Hum Fertil (Camb) 8: 241–51. Carbon, S., Ireland, A., Mungall, C.J., Shu, S., Marshall, B. and Lewis, S. (2009) AmiGO: online access to ontology and annotation data. Bioinformatics 25:288–9. ¨ ., Cerami, E.G., Gross, B.E., Demir, E., Rodchenkov, I., Babur, O Anwar, N., et al. (2011) Pathway Commons, a web resource for biological pathway data. Nucleic Acids Res 39:D685–D90. Chatr-aryamontri, A., Breitkreutz, B.-J., Heinicke, S., Boucher, L., Winter, A., Stark, C., et al. (2013) The BioGRID interaction database: 2013 update. Nucleic Acids Res 41:D816–D23. Cheema, R.S., Bansal, A.K. and Bilaspuri, G.S. (2009) Manganese provides antioxidant protection for sperm cryopreservation that may offer new consideration for clinical fertility. Oxid Med Cell Longev 2: 152–9. Cho, C., Bunch, D.O.D., Faure, J.-E., Goulding, E.H., Eddy, E.M., Primakoff, P., et al. (1998) Fertilization defects in sperm from mice lacking fertilin b. Science 281:1857–9. Clark, G.F. (2010) The mammalian zona pellucida: a matrix that mediates both gamete binding and immune recognition? Syst Biol Reprod Med 56:349–64. Croft, D., O’Kelly, G., Wu, G., Haw, R., Gillespie, M., Matthews, L., et al. (2011) Reactome: a database of reactions, pathways and biological processes. Nucleic Acids Res 39:D691–D7. Deryugina, E.I. and Quigley, J.P. (2012) Cell surface remodeling by plasmin: a new function for an old enzyme. J BioMed Biotechnol 2012:1–21. Dorus, S., Skerget, S. and Karr, T.L. (2012) Proteomic discovery of diverse immunity molecules in mammalian spermatozoa. Syst Biol Reprod Med 58:218–28. Edwards, D.R., Handsley, M.M. and Pennington, C.J. (2008) The ADAM metalloproteinases. Mol Aspects Med 29:258–89. Eisenberg, M.L., Park, Y., Hollenbeck, A.R., Lipshultz, L.I., Schatzkin, A. and Pletcher, M.J. (2011) Fatherhood and the risk of cardiovascular mortality in the NIH-AARP Diet and Health Study. Hum Reprod 26:3479–85. Evans, J.P. (2012) Sperm-egg interaction. Annu Rev Physiol 74: 477–502. Fanaei, M., Monk, P. and Partridge, L. (2011) The role of tetraspanins in fusion. Biochem Soc Trans 39:524–8. Fardilha, M., Esteves, S.L., Korrodi-Grego´rio, L., Pelech, S., e Silva, O.A.d.C. and e Silva, E.d.C. (2011) Protein phosphatase 1 complexes modulate sperm motility and present novel targets for male infertility. Mol Hum Reprod 17:466–77. Ferrer, M., Xu, W. and Oko, R. (2012) The composition, protein genesis and significance of the inner acrosomal membrane of eutherian sperm. Cell Tissue Res 349:733–48. Finn, R.D., Mistry, J., Tate, J., Coggill, P., Heger, A., Pollington, J.E., et al. (2010) The Pfam protein families database. Nucleic Acids Res 38:D211–D22. Franceschini, A., Szklarczyk, D., Frankild, S., Kuhn, M., Simonovic, M., Roth, A., et al. (2013) STRING v9. 1: protein-protein interaction networks, with increased coverage and integration. Nucleic Acids Res 41:D808–D15. Ganitha, G., Bhumkar, S. and Bhuvaneswari, J. (2012) Association of ABO Blood Groups and Infertility. Int J Health Science Res 2:72–7. Hamada, A., Esteves, S.C. and Agarwal, A. (2011) Unexplained male infertility: potential causes and management. Hum Androl 1:2–16. Han, C., Choi, E., Park, I., Lee, B., Jin, S., Nishimura, H., et al. (2009) Comprehensive analysis of reproductive ADAMs: relationship of ADAM4 and ADAM6 with an ADAM complex required for fertilization in mice. Biol Reprod 80:1001–8. Hemler, M.E. (2005) Tetraspanin functions and associated microdomains. Nat Rev Mol Cell Biol 6:801–11.

Syst Biol Reprod Med, 2014; 60(6): 329–337

Horiuchi, K., Weskamp, G., Lum, L., Hammes, H.-P., Cai, H., Brodie, T.A., et al. (2003) Potential role for ADAM15 in pathological neovascularization in mice. Mol Cell Biol 23:5614–24. Huang, Z. and Vijayaraghavan, S. (2004) Increased phosphorylation of a distinct subcellular pool of protein phosphatase, PP1g2, during epididymal sperm maturation. Biol Reprod 70:439–47. Huang da, W., Sherman, B.T. and Lempicki, R.A. (2009) Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 4:44–57. Ikawa, M., Inoue, N., Benham, A.M. and Okabe, M. (2010) Fertilization: a sperm’s journey to and interaction with the oocyte. J Clin Invest 120: 984–94. Inoue, N., Ikawa, M., Isotani, A. and Okabe, M. (2005) The immunoglobulin superfamily protein Izumo is required for sperm to fuse with eggs. Nature 434:234–8. Jarow, J.P. (2007) Diagnostic approach to the infertile male patient. Endocrinol Metab Clin North Am 36:297–311. Karp, P.D., Ouzounis, C.A., Moore-Kochlacs, C., Goldovsky, L., Kaipa, P., Ahre´n, D., et al. (2005) Expansion of the BioCyc collection of pathway/genome databases to 160 genomes. Nucleic Acids Res 33: 6083–9. Kim, E., Yamashita, M., Nakanishi, T., Park, K.-E., Kimura, M., Kashiwabara, S.-i., et al. (2006a) Mouse sperm lacking ADAM1b/ ADAM2 fertilin can fuse with the egg plasma membrane and effect fertilization. J Biol Chem 281:5634–9. Kim, T., Oh, J., Woo, J.-M., Choi, E., Im, S.H., Yoo, Y.J., et al. (2006b) Expression and relationship of male reproductive ADAMs in mouse. Biol Reprod 74:744–50. Krawetz, S.A. (2005) Paternal contribution: new insights and future challenges. Nat Rev Genet 6:633–42. Letunic, I., Doerks, T. and Bork, P. (2012) SMART 7: recent updates to the protein domain annotation resource. Nucleic Acids Res 40: D302–D5. Manske, M., Welte, B., Hitschke, K., Wack, A. and Bade, E.G. (1994) Plasminogen activatior inhibitor type 1 in rat spermatozoa: localisation in the tail, in the acrosome and on the surface of the head. Zygote 2:243–52. McGinnis, L.K., Luo, J. and Kinsey, W.H. (2013) Protein tyrosine kinase signaling in the mouse oocyte cortex during sperm–egg interactions and anaphase resumption. Mol Reprod Dev 80:260–72. Meggiorini, M., Cipolla, V., Rech, F., Labi, L., Vestri, A. and de Felice, C. (2011) Mammographic features in infertile women as a potential risk for breast cancer: a preliminary study. Eur J Gynaecol Oncol 33:51–5. Miyado, K., Yamada, G., Yamada, S., Hasuwa, H., Nakamura, Y., Ryu, F., et al. (2000) Requirement of CD9 on the egg plasma membrane for fertilization. Science 287:321–4. Mulder, N.J., Apweiler, R., Attwood, T.K., Bairoch, A., Bateman, A., Binns, D., et al. (2007) New developments in the InterPro database. Nucleic Acids Res 35:D224–D8. Nishimura, H., Cho, C., Branciforte, D.R., Myles, D.G. and Primakoff, P. (2001) Analysis of loss of adhesive function in sperm lacking cyritestin or fertilin b. Dev Biol 233:204–13. O’Connell, M.J. and McInerney, J.O. (2005) Gamma chain receptor interleukins: evidence for positive selection driving the evolution of cell-to-cell communicators in the mammalian immune system. J Mol Evol 61:608–19. Okabe, M., Yagasaki, M., Oda, H., Matzno, S., Kohama, Y. and Mimura, T. (1988) Effect of a monoclonal anti-mouse sperm antibody (OBF13) on the interaction of mouse sperm with zona-free mouse and hamster eggs. J Reprod Immunol 13:211–19. Primakoff, P., Hyatt, H. and Tredick-Kline, J. (1987) Identification and purification of a sperm surface protein with a potential role in spermegg membrane fusion. J Cell Biol 104:141–9. Rubinstein, E. (2011) The complexity of tetraspanins. Biochemical Society Transactions 39:501–5. Rubinstein, E., Ziyyat, A., Prenant, M., Wrobel, E., Wolf, J.-P., Levy, S., et al. (2006a) Reduced fertility of female mice lacking CD81. Dev Biol 290:351–8. Rubinstein, E., Ziyyat, A., Wolf, J.-P., Le Naour, F. and Boucheix, C. (2006b) The molecular players of sperm–egg fusion in mammals. Semin Cell Dev Biol 17:254–63. Rzhetsky, A., Iossifov, I., Koike, T., Krauthammer, M., Kra, P., Morris, M., et al. (2004) GeneWays: a system for extracting,

Protein network of human sperm-egg interaction

Syst Biol Reprod Med Downloaded from informahealthcare.com by Universitaet Zuerich on 01/03/15 For personal use only.

DOI: 10.3109/19396368.2014.955896

analyzing, visualizing, and integrating molecular pathway data. J Biomed Inf 37:43–53. Salwinski, L., Miller, C.S., Smith, A.J., Pettit, F.K., Bowie, J.U. and Eisenberg, D. (2004) The database of interacting proteins: 2004 update. Nucleic Acids Res 32:D449–D51. Shannon, P., Markiel, A., Ozier, O., Baliga, N.S., Wang, J.T., Ramage, D., et al. (2003) Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 13: 2498–504. Shen, C., Kuang, Y., Liu, J., Feng, J., Chen, X., Wu, W., et al. (2013) Prss37 is required for male fertility in the mouse. Biol Reprod 88: 1–11. Shoji, K., Murayama, T., Mimura, I., Wada, T., Kume, H., Goto, A., et al. (2013) Sperm-associated antigen 4, a novel hypoxia-inducible factor 1 target, regulates cytokinesis, and its expression correlates with the prognosis of renal cell carcinoma. AM J Pathol 182:2191–203. Sigman, M., Lipshultz, L. and Howards, S. (2009) Office evaluation of the subfertile male. In Infertility in the male. 4th ed. Larry I. Lipshultz, Stuart S. Howards and Craig S. Niederberger eds. Cambridge University Press, Cambridge, UK. pp.153–176. Simmons, L., Tan, Y.F. and Millar, A. (2013) Sperm and seminal fluid proteomes of the field cricket Teleogryllus oceanicus: identification of novel proteins transferred to females at mating. Insect Mol Biol 22: 115–30. Singson, A., Hang, J.S. and Parry, J.M. (2008) Genes required for the common miracle of fertilization in Caenorhabditis elegans. Int J Dev Biol 52:647–56.

337

Smoot, M.E., Ono, K., Ruscheinski, J., Wang, P.-L. and Ideker, T. (2011) Cytoscape 2.8: new features for data integration and network visualization. Bioinformatics 27:431–2. Stein, K.K., Go, J.C., Lane, W.S., Primakoff, P. and Myles, D.G. (2006) Proteomic analysis of sperm regions that mediate sperm-egg interactions. Proteomics 6:3533–43. Syrovets, T., Lunov, O. and Simmet, T. (2012) Plasmin as a proinflammatory cell activator. J Leukoc Biol 92:509–19. Un-Nahar, Z., Ali, M., Biswas, S., Kamrun, N., Bashar, T. and Arslan, M. (2013) Study of Seminal MDA Level as a Oxidative Stress Marker in Infertile Male. J Sci Found 9:85–93. ´ ., Bustos-Obrego´n, E. and Hartley, R. (2011) Effects of hypoxia Vargas, A on epididymal sperm parameters and protective role of ibuprofen and melatonin. Biol Res 44:161–7. Verit, F.F., Keskin, S., Omer, B., Yalcinkaya, S. and Sakar, N. (2014) Is there any relationship between cardiovascular risk markers and young women with diminished ovarian reserve? Gynecol Endocrino 2014:1–4. Vjugina, U. and Evans, J.P. (2007) New insights into the molecular basis of mammalian sperm-egg membrane interactions. Front Biosci 13: 462–76. Winters, B.R. and Walsh, T.J. (2014) The Epidemiology of Male Infertility. Urol Clin North Am 41:195–204. Ziyyat, A., Rubinstein, E., Monier-Gavelle, F., Barraud, V., Kulski, O., Prenant, M., et al. (2006) CD9 controls the formation of clusters that contain tetraspanins and the integrin a6b1, which are involved in human and mouse gamete fusion. J Cell Sci 119: 416–24.

Supplementary material available online Supplementary files 1–4.

Functional features and protein network of human sperm-egg interaction.

Elucidation of the sperm-egg interaction at the molecular level is one of the unresolved problems in sexual reproduction, and understanding the molecu...
1MB Sizes 2 Downloads 5 Views