Hindawi Publishing Corporation BioMed Research International Volume 2015, Article ID 780357, 15 pages http://dx.doi.org/10.1155/2015/780357

Research Article Cofunctional Subpathways Were Regulated by Transcription Factor with Common Motif, Common Family, or Common Tissue Fei Su,1 Desi Shang,1 Yanjun Xu,1 Li Feng,1 Haixiu Yang,1 Baoquan Liu,2 Shengyang Su,3 Lina Chen,1 and Xia Li1 1

College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China Department of Anatomy, Harbin Medical University, Harbin 150081, China 3 Department of Computer Science, University of Texas at Dallas, 800 W Campbell Road, Richardson, TX 75080, USA 2

Correspondence should be addressed to Lina Chen; [email protected] and Xia Li; [email protected] Received 26 July 2015; Revised 2 November 2015; Accepted 4 November 2015 Academic Editor: Stelvio M. Bandiera Copyright © 2015 Fei Su et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Dissecting the characteristics of the transcription factor (TF) regulatory subpathway is helpful for understanding the TF underlying regulatory function in complex biological systems. To gain insight into the influence of TFs on their regulatory subpathways, we constructed a global TF-subpathways network (TSN) to analyze systematically the regulatory effect of common-motif, commonfamily, or common-tissue TFs on subpathways. We performed cluster analysis to show that the common-motif, common-family, or common-tissue TFs that regulated the same pathway classes tended to cluster together and contribute to the same biological function that led to disease initiation and progression. We analyzed the Jaccard coefficient to show that the functional consistency of subpathways regulated by the TF pairs with common motif, common family, or common tissue was significantly greater than the random TF pairs at the subpathway level, pathway level, and pathway class level. For example, HNF4A (hepatocyte nuclear factor 4, alpha) and NR1I3 (nuclear receptor subfamily 1, group I, member 3) were a pair of TFs with common motif, common family, and common tissue. They were involved in drug metabolism pathways and were liver-specific factors required for physiological transcription. In short, we inferred that the cofunctional subpathways were regulated by common-motif, common-family, or common-tissue TFs.

1. Introduction The mechanism of transcriptional regulation of coding genes is one of the basic contents in systems biology. Transcriptional factors (TFs) are proteins that regulate several target genes by binding DNA motifs at the transcriptional level [1–5]. Some investigators have reported that TFs take part in many important biological functions and human diseases, such as cell differentiation, proliferation, immune response, apoptosis, cardiac diseases, and tumor development [6–9]. Dissecting the characteristics of TFs is helpful for understanding their regulatory function in complex biological systems. An increasing number of studies have demonstrated that TFs with similar motifs often (which were defined as common-motif TFs in the paper) recognize target genes with similar expression

patterns [10], and in a similar way, TFs with different motifs may directly show different functions [11]. Several researchers have revealed that TFs in the common family (were defined as common-family TFs in the paper) have strong homology and the TFs are functionally related [12]; TFs in the transcription factor family are functional redundancies [13]; and TF family members share a relatively high degree of similarity in genomic structure and gene arrangement [12–14]. Several studies have demonstrated that TFs in the common tissue or cell line (were defined as common-tissue TFs in the paper) play critical roles in disease progression [15]. Currently, the regulatory functional interpretation of TFs mainly relies on activating or inhibiting their target genes [16, 17]. Several studies have revealed the functions of TFs based on the enrichment analysis of their target genes from

2

BioMed Research International

a number of pathways [18–22]. Subpathways (local regions within an entire biological pathway) exert certain functions on practical biological processes, disease development, and drug treatment [23–27]. Subpathway recognition is crucial because it is capable of focusing on local variation, increasing our power to identify the underlying causal genes, and may give us more detailed explanations of the biological process and pathogenesis of human disease. In this study, we dissected the regulatory influence of common-motif, common-family, or common-tissue TFs on cofunctional subpathways. Cofunctional subpathways are defined as subpathways that belong to the same pathway or pathway class. To gain insight into the relationships between TFs and their regulatory subpathways, we constructed a bipartite network TF-subpathways network (TSN) to explore the regulatory influence of TFs on subpathways. Through one-way cluster analysis, the common-motif, common-family, or common-tissue TFs that regulated the same pathway classes tended to cluster together, and through Jaccard coefficient analysis, the subpathways regulated by the TFs with common motif, common family, or common tissue were functionally consistent at the subpathway, pathway, and pathway class levels. These findings help us to understand the mechanisms of the regulatory effects of TFs on subpathways and provide a more detailed picture of biological processes and complex human diseases.

2. Materials and Methods 2.1. Preparation of the Data 2.1.1. TF Information. Transfac Professional (licensed on December 15, 2012) [28] provided details of the relations between TFs and their target genes, verified by biologists. We acquired 4598 transcription relations between 492 human TFs and 1557 target genes. We extracted 1061 human TF binding motifs. We extracted 80 unique TFs belonging to 29 families. We obtained 343 unique TFs expressed on 122 unique human tissues or cell lines. 2.1.2. Subpathway Data. We used R package of iSubpathwayMiner [24] to find the subpathways in public Kyoto Encyclopedia of Genes and Genomes (KEGG) (http://www.kegg .jp/) pathways database [29]. According to the pathway classification in KEGG, all subpathways were grouped into cellular processes (CP), environmental information processing (EIP), human diseases (HD), metabolism (Met), and organismal systems (OS). 2.2. Identifying the Subpathways Regulated by TFs. Cumulative hypergeometric enrichment analysis was used for identification of subpathways, in which TF target genes were significantly enriched. The 𝑝 value can be calculated to evaluate the enrichment significance for that subpathway as follows: 𝑟−1

( 𝑥𝑡 ) ( 𝑚−𝑡 𝑛−𝑥 ) . 𝑚 (𝑛) 𝑥=0

𝑝=1− ∑

(1)

We supposed that the whole human genome had 𝑚 genes, and among those, 𝑡 genes were included in the subpathway. The number of target genes of one TF is 𝑛, of which 𝑟 are involved in the same subpathways. 2.3. Evaluating the Consistent Function of Subpathways. We proposed a hypothesis that one TF pair regulated the consistent functional subpathways if they had common motif, common family, or common tissue. We applied the Jaccard coefficient to evaluate the consistent function of subpathways for TF pairs with a common motif, common family, or common tissue. The Jaccard coefficient is 𝐽 (𝑋, 𝑌) =

|𝑋 ∩ 𝑌| . |𝑋 ∪ 𝑌|

(2)

Assume TF1 and TF2 are a pair of common-motif, common-family, or common-tissue TFs, where 𝑋 represents the subpathways regulated by TF1 and 𝑌 represents the subpathways regulated by TF2. 𝐽 is the number of intersections of 𝑋 and 𝑌 divided by the number of unions of 𝑋 and 𝑌. The range of Jaccard coefficient, 𝐽, is from 0 to 1. If TF1 and TF2 annotated the same subpathway set, then 𝐽 = 1, and if TF1 and TF2 annotated entirely different subpathways, then 𝐽 = 0. The higher Jaccard value means stronger similarity, while lower values mean weaker similarity, so we can use the Jaccard coefficient to evaluate the consistent functionality of subpathways regulated by a pair of TFs with common motif, common family, or common tissue. At the pathways or pathway classes level, Jaccard coefficient is also used to evaluate the consistent functionality of subpathways, where 𝑋 represents the pathways or pathway classes that include the subpathways regulated by TF1 and 𝑌 represents the pathways or pathway classes that include the subpathways regulated by TF2.

3. Results 3.1. TF-Subpathway Network (TSN). We constructed a global TSN based on TFs and subpathway data (Figure 1) and systematically analyzed the characteristics of TFs that affected subpathways. We obtained all terms of TF binding sites (𝑛 = 21 038) information from the authoritative Transfac database. After filtering, we obtained 4598 specific human transcriptional regulatory relations between 492 TFs and 1557 target genes, verified by biologists. For each TF, we used the “𝑘-cliques” subpathway identification method provided by the iSubpathwayMiner software package to identify significantly enriched subpathways based on the TF-regulated target gene set, when 𝑘 = 3 and 𝑝 < 0.01. 𝑘 = 3 meant that the distance among the genes in one subpathway was not greater than 3 to ensure that genes in the subpathways have highly similar functions. We obtained 1490 significant TFsubpathway associations between 89 TFs and 468 subpathways (Table S1 in Supplementary Material available online at http://dx.doi.org/10.1155/2015/780357). We constructed a bipartite network consisting of TFs and subpathways in which a TF and a subpathway were connected if the TF target genes were significantly enriched in the corresponding subpathways (Figure 2).

BioMed Research International

3

Identify TF target genes TFi

TF1

TFi

TFm Filter redundant terms and species

TF1 TFi

···

Gene

Gene

···

Experiments

Integrate human TFs and human target genes

Gene

···

Gene

Keep 4598 pairs of human TF target genes associations TFi target gene set

TF1 target gene set TFi Subpathway 3

Subpathway 1 Gene

Subpathway 3 TFi

TF1

(a)

Subpathway p

Subpathway 2 Identify subpathways

Subpathway 16 Subpathway 1 TFm Subpathway 9

Subpathway 18

Combine all TFssubpathway associations:

TFi target gene set

Subpathway 1 Subpathway n

TFm target gene set

Subpathway j

Hypergeometric Subpathway 3 test

Subpathway p

“k-clique” subpathway identification algorithm:

(1) Convert pathways to the undirected graths with genes as nodes (2) Mine all 3-cliques as subpathways (c) Transfac professional data build 21038 TF binding sites

(b)

Pathway class

Pathway class CREM JUND ETV4 NRL ATF1 HMGA1 GATA2 GATA1 GATA3 STAT5A STAT4 NR2F1 THRB TCF7 JUNB MITF TFAP2C KLF5 NFATC1 CEBPA FOXA2 REL SREBF1 IRF5 SRF NFE2L2 GLI3 GLI1 STAT3 STAT6 NR1I3 NR1I2 GATA6 EGR1 ELF1 SMAD2 SMAD3 SPI1 FOXA3 HNF1A LEF1 NR5A1 NR4A1 NR5A2 GATA4 HNF4A POU2F1 IRF7 IRF3 IRF1 CEBPB FOS STAT1 RELA JUN ESR1 TP73 SMAD4 NFYA TP63 RFX1 XBP1 RFX2 RFX3 CREB1 PAX6 AR STAT5B NFIC VDR TFAP2A YBX1 E2F1 KLF4 MYC YY1 ATF2 SP3 TP53 SP1

Common-family TFs

Common-motif TFs

GATA2 GATA1 ATF1 HMGA1 STAT5A STAT4 THRB ETV4 CREM FOSL1 JUND RUNX3 GATA3 NR2F1 TCF7 JUNB MITF GLI1 TFAP2C NFATC1 CEBPA FOXA2 REL SREBF1 IRF5 SRF GLI3 NFE2L2 STAT3 STAT6 ESR1 TP73 SMAD4 ELF1 SPI1 SMAD2 SMAD3 ZBTB7B EGR1 LEF1 TCF7L2 AR STAT5B TP63 NFYA NR5A1 NR4A1 NR5A2 GATA4 HNF4A POU2F1 NR1I3 NR1I2 GATA6 FOXA3 HNF1A RFX2 RFX1 RFX3 XBP1 CREB1 IRF7 IRF3 IRF1 STAT1 FOS CEBPB RELA JUN PAX6 VDR NFIC RBPJ TP53 TFAP2A E2F1 ZBTB7A KLF4 MYC YY1 ATF2 SP3 SP1

CREB AP 1 Ets type AP 1 CREB Fungal regulators GATA factors GATA factors GATA factors STAT STAT Steroid hormone receptors (NR3) Steroid hormone receptors (NR3) TCF 1 AP 1 Ubiquitous bHLH ZIP factors AP 2 Developmental/cell cycle regulators NF AT C/EBP like factors Tissue specific regulators Rel/ankyrin Ubiquitous bHLH ZIP factors Interferon regulating factors Responders to external signals AP 1 Developmental/cell cycle regulators Developmental/cell cycle regulators STAT STAT Steroid hormone receptors (NR3) Steroid hormone receptors (NR3) GATA factors Developmental/cell cycle regulators Ets type SMAD SMAD Ets type Tissue specific regulators Homeo domain only TCF 1 Steroid hormone receptors (NR3) Steroid hormone receptors (NR3) Steroid hormone receptors (NR3) GATA factors Steroid hormone receptors (NR3) POU domain factors Interferon regulating factors Interferon regulating factors Interferon regulating factors C/EBP like factors AP 1 STAT Rel/ankyrin AP 1 Steroid hormone receptors (NR3) p53 like SMAD Heteromeric CCAAT factors p53 like Regulators with a DNA recognition wing AP 1 Regulators with a DNA recognition wing Regulators with a DNA recognition wing CREB Paired plus homeo domain Steroid hormone receptors (NR3) STAT NF 1 Steroid hormone receptors (NR3) AP 2 csd Cell cycle controlling factors Developmental/cell cycle regulators Cell cycle controlling factors Ubiquitous factors AP 1 Ubiquitous factors p53 like Ubiquitous factors

Common-tissue TFs

Pathway class T cells Breast T cells Breast HeLa HepG2 Hep3B T cells HepG2 Liver MCF7 T cells Testis Heart Brain Kidney Liver HUVEC HeLa HeLa K562 HepG2 MCF7 MCF7 K562 MCF7 Lymph node Kidney Heart Liver Liver Caco HUVEC Hep3B K562 T cells Thymus Spleen Liver Bone marrow Spleen T cells Testis Thymus Intestine Bone marrow Liver Lymph node Lymphocytes Spleen T cells Testis Thymus Thymus T cells Heart Breast Thymus Lymphocytes HeLa Caco Breast Intestine Kidney Liver Spleen Liver Liver Brain Caco Intestine Hep3B Kidney Liver Testis Heart Brain Kidney Liver Lymphocytes Spleen Testis Thymus Spleen Lymph node Thymus Lymph node Bone marrow Spleen Thymus K562 HUVEC MCF7 HepG2 MCF7 HeLa MCF7 HeLa Brain Bone marrow Intestine Lymphocytes MCF7 Testis HeLa HUVEC HeLa MCF7

TCF7 TCF7 CREM ETV4 NR2F1 GATA2 GATA2 GATA3 GATA3 FOXA2 TFAP2C NFATC1 KLF5 SREBF1 SREBF1 SREBF1 SREBF1 REL REL SRF NFE2L2 NFE2L2 STAT3 FOS FOS ESR1 TP73 NR1I3 NR1I3 NR1I3 NR1I2 GATA6 EGR1 EGR1 EGR1 EGR1 ELF1 ELF1 SPI1 SPI1 SPI1 SPI1 SPI1 SPI1 AKNA AKNA AKNA AKNA AKNA AKNA AKNA AKNA AKNA LEF1 LEF1 TP63 TP63 TP63 NFYA NFYA HNF1A HNF1A HNF1A HNF1A HNF1A HNF1A FOXA3 NR4A1 NR4A1 GATA4 HNF4A HNF4A HNF4A HNF4A HNF4A CREB1 CREB1 CREB1 CREB1 CREB1 CREB1 CREB1 CREB1 IRF7 IRF7 IRF7 IRF5 IRF5 IRF5 IRF5 RELA RELA E2F1 STAT5B VDR VDR NFIC NFIC TP53 TP53 TP53 TP53 TP53 TP53 TFAP2A SP1 SP1 SP1

(d) Clustering

Figure 1: Continued.

4

BioMed Research International 100

40

60 40 Observed

20

80 60 40 Observed

20 0

0 0.035

0.045

20 Observed 10

0.04 0.05 0.06 0.07 0.08 0.09 0.10

Subpathway

0.30

0.35 0.40 Pathway class

Pathway

Probability density

800 600 400 200

30

0

0.055

Observed

250

100

200

80

150 100 Observed

50

Probability density

0.025

Probability density

Probability density

80

Probability density

Probability density

100

0

0 0.00 0.02 0.04 0.06 0.08 0.10 0.12

0.45

60 40 Observed

20 0

0.00

0.05 0.10 Pathway

Subpathway

0.15

0.20

0.25

0.30

0.35

0.40

Pathway class

Probability density

Probability density

800 600 400 200

Observed

200 150 100 Observed

50 0

0 0.000

0.010

0.020

0.030

Probability density

150 1000

100

50 Observed 0

0.00

0.02

0.04

Subpathway

0.06

0.08

Pathway

0.0

0.1

0.2 0.3 Pathway class

(e) Jaccard coefficient

Subpathway level Pathway level

Pathway class level

Path: 00120_15 Path: 00120_19 Path: 03320_2 Path: 00120_6 Path: 04610_2 Path: 00120_7 Path: 00120_9 Path: 00590_1

Path: 04060 Path: 00590

Path: 04610_3 Path: 04610_4 Path: 04610_5

Path: 00590_2

Path: 00830_2 Path: 00982_4 Path: 00982_5

Path: 04610_6 Path: 00982_9 Path: 00590_3 Path: 04610_8 Path: 00590_4 Path: 04950_4 Path: 00591_1 Path: 03320_1 Path: 00982_10

Path: 00980_1

Path: 00591

Path: 00830

Path: 03320

Path: 00980

Path: 04610

Path: 00982

Path: 04950

Path: 00980_2

(f) Jaccard coefficient of HNF4A and NR1I3

Figure 1: Continued.

Organismal systems Human diseases

Metabolism

0.4

0.5

BioMed Research International

5 4-Glutathionyl-CPA (inactive) 4-Keto-CPA

Dechloroethyl-CPA Chloroacetaldehyde

2.5.1.18

CYP3A4

CYP3A4

CYP3A4 CYP2C CYP2B6

Ifosfamide (prodrug)

Nornitrogen mustard (active) Carboxyphosphamide

4-OH-CPA 4-Hydroxyifosfamide

Aldophosphamide

Phosphamide mustard (active)

Acrolein Aldoifosfamide

CYP3A4 CYP3A4

Alcophosphamide 1.1.1.1 1.2.1.5

CYP2C CYP2B6

Cyclophosphamide (prodrug)

Excretion

Isophosphamide mustard (active) Carboxyphosphamide

CYP3A5

CYP2B6 4-Ketoifosfamide

Alcophosphamide

Chloroacetaldehyde Dechloroethylifosfamide (g) Drug metabolism-cytochrome P450

Figure 1: Schematic of the construction of the global TSN based on TF target genes and pathway structure data from KEGG. (a) We extracted 21038 TFs that were verified through biology experiment from Transfac Professional database and then filtered redundant terms and species and obtained 4598 pairs of human TF target gene associations. (b) We used the 𝑘-clique subpathway method to identify significantly enriched subpathways to generate TF-subpathway associations. (c) We combined these TF-subpathway associations to form the TSN. (d) We used oneway clustering between TFs and subpathways to analyze the regulatory effects of common-motif, common-family, and common-tissue TFs on cofunctional subpathways. (e) Jaccard coefficient was applied to evaluate the functional consistency of the common-motif, common-family, and common-tissue TFs at the subpathway, pathway, and pathway class levels. (f) HNF4A and NR1I3 were a pair of TFs with common motif, common family, and common tissue. Jaccard coefficient at subpathways level, pathways level, and pathway classes level. (g) HNF4A regulated subpathway 00982-10, and NR1I3 regulated subpathway 00982-4, and they have shared genes (CYP3A4, CYP3A5, and CYP2C).

The TSN was composed of 557 nodes (468 subpathways and 89 TFs) and 1490 edges (Figure 2(a)). We calculated the size of the giant component of TSN (Figure 2(b)). The giant component was the largest connected component of a network and measured local functional clustering when compared to random networks. The TSN network connected most TFs and subpathways into a highly interlinked giant component, with strong local clustering of TFs and subpathways. We constructed 1000 random networks by randomly shuffling the relations between the 1490 pairs of TFs and subpathways, while keeping the number of each TF and subpathway unchanged. We compared the giant component of the real TSN and 1000 random networks, which showed that the actual size of the giant component of the TSN (521) was significantly smaller (experiential 𝑝 = 0) than the average giant component of randomized networks (554.963). We paid close attention to the TF degree distribution. As shown in Figure 2(c), a small number of TFs had higher connectivity, which meant that these TFs regulated many subpathways. For example, SP1 had the highest degree (186) (Table S2). This might be because it could activate or inhibit the expression of a number of essential oncogenes and tumor suppressors, and SP1 took part in the main biological processes such as cell growth, apoptosis, differentiation, immune response, response to DNA damage, and chromatin remodeling [30, 31]. Tumor suppressor gene TP53 had higher

degree (72) than other TFs, and it is always present in many types of human cancer. For example, TP53 gene mutations are more frequent in cervical adenocarcinoma [32]. In addition, we also calculated the degree distribution of the subpathways (Figure 2(d)). We evaluated which kind of subpathway was preferentially regulated by TFs. We grouped all the subpathways into the categories of Met, OS, CP, HD, and EIP, according to the pathway classification of KEGG. The average degrees of the five subpathway groups were 4.0000, 2.6026, 5.2982, 3.0372, and 1.9250 (𝑝 = 2.77 × 10−11 by one-way ANOVA test), respectively. Therefore, the cellular processes subpathways had the greatest opportunity of being regulated by TFs, such as the cell cycle subpathway (path: 04110 18) and TP53 signaling pathway (path: 04115 1). This finding was consistent with a previous study that showed that TF preferred to regulate cellular process genes [33]. 3.2. Common-Motif TFs Tend to Regulate Cofunctional Subpathways. The common conserved sequence of the TF binding sites is called a motif. Some studies have indicated that motif sequence is associated with TF function. TFs with similar motifs often recognize target genes with similar expression patterns [10], and TFs with different motifs may directly influence different TF function [11], and TFs influence pathways initiated by specific DNA-binding motifs

6

BioMed Research International Path: 00140_23 Path: 00140_13 Path: 00140_22 Path: 00140_9 Path: 00590_4 Path: 00140_10

Path: 00140_19 Path: 00140_12

Path: 00140_6

Path: 00140_4

Path: 00140_7 Path: 00140_5

Path: 00140_20

Path: 00140_18

Path: 04350_13 Path: 04630_4

Path: 04650_12

HNF1A

Path: 04670_14

Path: 05200_43 Path: 04115_1

Path: 04670_15

Path: 04145_6 Path: 04145_2

Path: 04115_7

Path: 05332_1 RFX2

Path: 05219_4

CREB1

Path: 05330_2

Path: 05320_4

Path: 05218_8 RFX3

Path: 05220_12

XBP1

Path: 05330_1

Path: 05200_47 Path: 04650_7 Path: 05220_11 Path: 04620_9 Path: 05214_14 Path: 04145_5 Path: 05218_5 Path: 05210_10 Path: 05218_7 Path: 04620_17 Path: 05218_6 Path: 04620_22 Path: 05214_15 Path: 05220_9

Path: 05210_9

RFX1

ETV4 NR5A1

Path: 04620_16 Path: 04620_18

Path: 05150_17

Path: 05310_3 Path: 05150_14

Path: 05200_29

Path: 04060_21

Path: 04670_10

Path: 05214_13

STAT4

CREM

SMAD3 SPI1

Path: 05310_1 Path: 05330_4

Path: 05200_52

NR5A2

GATA4

Path: 05330_7

Path: 05320_3

Path: 05219_3

JUND FOSL1 NR4A1

Path: 04060_3 Path: 04630_3

Path: 04650_10 Path: 05140_4

Path: 05200_51

STAT3

ENO1

Path: 04060_41

Path: 05322_6

STAT5A

STAT5B

Path: 05320_1 Path: 05320_6

Path: 05220_8

STAT6 STAT1 YY1

Path: 04514_55 Path: 04650_13 Path: 04514_62

Path: 00140_14

Path: 00140_3 Path: 00140_8 Path: 00140_24

Path: 00140_16

Path: 00120_5

Path: 05416_4

Path: 04514_3

Path: 05322_5

Path: 04514_1 Path: 04940_1 Path: 04612_3 Path: 04672_5 Path: 04672_4

Path: 04664_3 Path: 04650_1 Path: 04650_15

Path: 04664_9 Path: 04722_19 SMAD4 Path: 04060_47 Path: 04060_32 Path: 04512_24 Path: 04512_10 Path: 04512_11 Path: 04630_2 Path: 04010_21 Path: 04512_4 Path: 04010_5 Path: 04514_57 Path: 04512_18 Path: 04512_23 Path: 04514_50 Path: 04012_5 Path: 04144_2 Path: 04512_13 Path: 04514_49 Path: 04512_9 Path: 04010_26 Path: 04060_56 Path: 04060_62 Path: 04010_15 Path: 04912_8 Path: 04060_22 Path: 04512_3 Path: 04060_43 Path: 04110_7 Path: 04020_2 Path: 04010_16

Path: 04115_2 Path: 04115_3

Path: 04060_51 Path: 04060_38

Path: 04210_20

Path: 04620_11

Path: 04520_3

Path: 04620_12 IRF1

Path: 04510_8

IRF3

Path: 04510_6 IRF7

Path: 04650_6

IRF5

Path: 04620_1 Path: 04110_15

Path: 04620_2

Path: 04110_17

Path: 04010_31

Path: 04110_18

Path: 04610_8 NFATC1

PAX6

SP1

E2F1

TP53

TP63

TFAP2C

CEBPB Path: 04110_20

ZBTB7A

Path: 04110_27

Path: 05211_2 SRF Path: 05160_12 Path: 05020_9 Path: 05223_7 Path: 05220_5 Path: 05142_10 Path: 05410_2 Path: 05131_5 Path: 05142_11 Path: 05212_1 Path: 05214_4 Path: 05140_6 Path: 05414_3 Path: 05140_10 Path: 05146_5

Path: 04110_3

Path: 04110_26

Path: 04110_25 Path: 04110_23

Path: 05200_42 Path: 05222_2 Path: 05222_7

TCF7L2 Path: 05221_1 LEF1

Path: 05200_31

YBX1

Path: 05210_12

Path: 05215_4

Path: 04916_8 Path: 05210_7 Path: 04916_11

Path: 04514_10 Path: 04514_78 Path: 04514_9

Path: 04672_2

Path: 04510_11

Path: 04340_2

Path: 00100_9 GLI3

NR1I2

FOXA3 Path: 05140_1

Path: 04920_11

Path: 00980_1 Path: 00590_1

Path: 05142_18 Path: 05215_1 Path: 05145_14 Path: 05014_4 Path: 05215_3

ZBTB7B

Path: 05212_11 Path: 05212_14 Path: 05146_1 Path: 05214_11

Path: 04340_1 Path: 00980_2

Path: 05142_17 Path: 05200_53 Path: 05223_3 Path: 05100_10 Path: 05214_21 Path: 05200_12 Path: 05332_2 Path: 05214_10 Path: 05200_9 Path: 05144_3 Path: 05200_11 Path: 05200_19 Path: 05200_20 Path: 05215_11 Path: 05214_3 Path: 05200_4 Path: 05414_2 Path: 05100_3 Path: 05218_2 Path: 05214_8 Path: 05200_38 Path: 05145_21 Path: 05200_24 Path: 05223_5 Path: 05214_17 Path: 05211_8 Path: 05100_2 Path: 05332_5 Path: 05200_48 Path: 05320_9 Path: 05200_15 Path: 05214_6 Path: 05200_3 Path: 05145_19 Path: 05200_6 Path: 05142_20 Path: 05145_8 Path: 05212_12 Path: 05213_6 Path: 05213_2 Path: 05330_5 Path: 05214_5 Path: 05200_8 Path: 05200_16 Path: 05214_1 Path: 05100_15 Path: 05214_12 Path: 05215_7 Path: 05200_10 Path: 05211_7 Path: 05016_8 Path: 05142_4 Path: 05020_8 Path: 05218_3 Path: 05222_4 Path: 05200_21 Path: 05416_1 Path: 05120_6 Path: 05131_6 ··· Path: 05211_1 Path: 05219_2 Path: 05214_2 Path: 05200_54

Path: 05216_4

Path: 05213_1

NFIC

Path: 05131_12 Path: 05215_12 Path: 05142_8

Path: 04940_4 Path: 05218_4 Path: 05212_13 Path: 05146_2 Path: 05200_5

SMAD2 NFYA

Path: 04623_5

Path: 05131_9

Path: 04622_3 Path: 04664_5 Path: 04062_19 Path: 04920_1 Path: 04664_2

POU2F1 GATA6

Path: 05200_56

Path: 05222_1

Path: 05200_50

GATA1

Path: 05160_3

Path: 05142_21

Path: 04650_22

Path: 00240_2

Path: 00591_1

Path: 00330_1 Path: 00140_11 Path: 00120_1 Path: 00350_3 Path: 00350_1 Path: 00240_5 Path: 00350_2 Path: 00140_1 Path: 00120_2 Path: 00590_3

GATA2 Path: 05200_17 Path: 05014_3 Path: 05215_10 Path: 05146_9 Path: 05160_15

Path: 04110_4

ATF1

Path: 04660_1

Path: 00240_17

Path: 00330_9 MYC

FOS

Path: 05140_9

Path: 04612_7

Path: 04650_23

Path: 00120_3

ATF2

KLF5

Path: 04110_16

Path: 04110_19

Path: 04350_11

Path: 04210_3

Path: 04670_9

AR

Path: 04722_24

REL

Path: 05220_6

Path: 00240_16

Path: 04610_4

MITF RELA

Path: 05140_13

Path: 04062_1

EGR1

SP3

Path: 04620_10

RBPJ

HMGA1

TXK

ESR1

Path: 04210_23 Path: 04810_32 Path: 04510_23 Path: 04110_12 Path: 04510_17 Path: 04110_1

Path: 04350_6

Path: 05222_9

Path: 04670_20

Path: 04610_3 Path: 04660_8

Path: 04110_8 Path: 04510_24 Path: 04110_22 Path: 04110_11

Path: 04062_2 Path: 04670_2

Path: 05160_8

ELF1 Path: 04722_13

Path: 04060_8

Path: 04620_13

JUN

Path: 04620_25

Path: 05160_17

Path: 04650_20 Path: 04910_2

Path: 04310_7

Path: 04520_9 KLF4

Path: 04622_5

Path: 05222_10 Path: 04620_15

Path: 04060_33

Path: 04010_10 Path: 04512_12 Path: 04512_27 Path: 04512_7 Path: 04512_5

Path: 04520_2 Path: 04520_7

Path: 04520_5

Path: 04810_29

Path: 05215_2 Path: 05120_3

Path: 04512_19 Path: 04514_56 Path: 04662_6 Path: 04512_22 Path: 04080_7 Path: 04010_30 Path: 04916_3 Path: 04630_1 Path: 04512_2 Path: 04512_1 Path: 04512_20 Path: 04512_8 Path: 04621_7 Path: 04512_6 Path: 04610_7

Path: 04510_13 Path: 04540_13

Path: 04621_4

Path: 05145_5

Path: 04512_14

Path: 04210_24 Path: 04510_7

Path: 05310_4 Path: 05310_5 Path: 05200_18 Path: 05131_10

Path: 04662_1

Path: 04650_14

Path: 04310_6

Path: 05160_6

Path: 04620_14

Path: 04664_1

Path: 04664_10

TP73 Path: 04115_4

GLI1

Path: 05217_2

Path: 00590_2 Path: 05200_57 Path: 00830_2

Path: 05217_1 Path: 05200_34

Path: 04510_20 Path: 04510_19

Path: 04510_25

Path: 04510_18 Path: 04510_1 Path: 04510_16 TFAP2A Path: 04510_4

Path: 04510_15

Path: 05144_9

Path: 04514_12

Path: 04514_11

Path: 00982_7 Path: 00982_5 Path: 00982_10 Path: 04950_4

Path: 05320_2

Path: 00982_4

AKNA Path: 04060_31

Path: 04510_5

Path: 05330_12

Path: 00480_1

Path: 05223_6

Path: 04612_4

Path: 04744_2 RUNX3

Path: 00480_4

Path: 03320_2

GATA3 NRL

Path: 05330_3

Path: 05310_2

Path: 05215_8 Path: 04510_10 Path: 05214_16

Path: 00480_6

Path: 04610_6

Path: 00982_9 Path: 00982_11

Path: 05214_7

Path: 04910_5

Path: 00982_3 Path: 00982_8

Path: 05322_7

Path: 05145_20 Path: 05416_3 Path: 05320_5

Path: 04660_20

NFE2L2

Path: 04910_6

TCF7

Path: 00480_2

JUNB THRB Path: 00982_2

HNF4A

Path: 00982_6

VDR

Path: 04610_2

NR1I3

Path: 03320_1 CEBPA

Path: 04080_2

Path: 00480_3 Path: 04350_12 Path: 00480_5 Path: 04350_10

NR2F1

Path: 00140_26

Path: 04610_5 SREBF1

FOXA2

Path: 00140_25

Path: 04060_54

Path: 00120_19

Path: 04012_14

Path: 00120_15

Path: 04012_11

Path: 00120_6

Path: 04012_1 Path: 04012_12

Path: 00120_9

Path: 00120_7 Path: 04012_7 Path: 04012_4

Pathway class Cellular processes Human diseases Metabolism Organismal systems Environmental information processing

Node side TF 6–50

1–5

51–100 101–150 151–186

Subpathway 1

2–5

6–10 11–15 16–17 Edge color

0.01–0.001 0.001–0.0001

0.0001–0

Observed

The number of subpathways

0.30 0.25 0.20 0.15 0.10 0.05 0.00

The number of TFs

Probability density

(a)

15 10 5 0

500 520 540 560 580 600 Size of giant component (b)

5

10 15 20 25 30 34 · · · 200 TF (deg.) (c)

1 2 3 4 5 6 7 8 9 10 11 12 13 15 17 Subpathway (deg.) (d)

Figure 2: The TSN is a bipartite network. (a) The circles and rectangles in the network represent TFs and subpathways, respectively. A pair of TFs and a subpathway were connected by an edge if the set of target genes of TF was significantly enriched in the corresponding subpathway. Node size was proportional to the degree of the node. Edges were colored according to the enrichment significance (𝑝 < 0.01) of associations between TF target genes and subpathways. (b) Component size distributions of TFs and subpathways in the TSN and random network. The actual size of the real network giant component was smaller than that of the 1000 randomly shuffled TFs (521 versus 554.9630, experiential 𝑝 = 0). (c) Bar plot of degree of TF distribution in the TSN. (d) Bar plot of degree of distribution of subpathways in TSN.

BioMed Research International

7 Pathway class

Common-motif TFs

GATA2 GATA1 ATF1 HMGA STAT5 STAT4 THRB ETV4 CREM FOSL1 JUND RUNX3 GATA3 NR2F1 TCF7 JUNB MITF GLI1 TFAP2 NFATC CEBPA FOXA2 REL SREBF RF5 SRF GLI3 NFE2L STAT3 STAT6 ESR1 TP73 SMAD4 ELF1 SPI1 SMAD2 SMAD3 ZBTB7 EGR1 LEF1 TCF7L AR STAT5 TP63 NFYA NR5A1 NR4A1 NR5A2 GATA4 HNF4A POU2F NR1I3 NR1I2 GATA6 FOXA3 HNF1A RFX2 RFX1 RFX3 XBP1 CREB1 RF7 RF3 RF1 STAT1 FOS CEBPB RELA JUN PAX6 VDR NFIC RBPJ TP53 TFAP2 E2F1 ZBTB7 KLF4 MYC YY1 ATF2 SP3 SP1

(b)

Path: 05160_6

Path: 05320_4

Path: 05322_6

Path: 05322_5

Path: 05330_2

Path: 05320_1

Path: 05310_3

Path: 05150_14

Path: 05330_1

Path: 05310_1

Path: 05416_4

Path: 05332_1

Path: 05330_4

Path: 04940_1

Path: 05320_3

Path: 05320_6

Path: 05330_7 Path: 05320_6 Path: 05320_3 Path: 05330_4 Path: 04940_1 Path: 05416_4 Path: 05332_1 Path: 05330_1 Path: 05310_1 Path: 05150_14 Path: 05310_3 Path: 05330_2 Path: 05320_1 Path: 05322_5 Path: 05320_4 Path: 05322_6 Path: 05160_6

RFX2 RFX1 RFX3 XBP1 CREB1 IRF7

Path: 05330_7

(a)

(c)

Figure 3: One-way clustering to TFs with common motif of the TSN. (a) One-way clustering between 83 TFs and 436 subpathways. The corresponding cell was colored orange if there was an edge between the TF and subpathway in the TSN. Subpathway labels were colored according to the pathway class colors used in Figure 2. (b) Zoom-in plot of part of the orange circled region in (a), showing the TFs with similar motif and pathway class. (c) Zoom-in plot of part of the subpathway labels in the lower-right region of (a).

[34]. To explore whether the common-motif TFs tended to regulate cofunctional subpathways, we extracted 1061 human DNA motifs from the Transfac database. We obtained 15 150 pairs of TFs with similar DNA motifs (using STAMP software (http://www.benoslab.pitt.edu/stamp) [35] with 𝑒 value < 0.01), of which 610 pairs (83 unique TFs in the TSN) had similar DNA motifs. These TFs annotated 436 subpathways, forming 1402 relations (Table S3). To examine whether common-motif TFs regulated the cofunctional subpathways, we applied one-way clustering to the 83 unique TFs and 436 unique subpathways (Figure 3(a)). The common-motif TFs that regulated the same pathway classes tended to group together. For example, IRF7 (the interferon regulatory factor 7) driven regulatory cascade in which genetic variation on chromosome 15q25 leads to

type 1 diabetes (pathway: 04940) [36] and XBP1 (X-box binding protein 1) induce pancreatic beta cell dysfunction and apoptosis of type 1 diabetes [37] (Figure 3(b)). That is, common-motif TFs tend to regulate the same pathway classes and lead to the same disease. To further investigate the effect of the common motif TFs on subpathway, we randomly shuffled the 610 pairs of common-motif TFs 1000 times, while keeping the number of each TF unchanged. The average Jaccard coefficient of the real common-motif TF pairs was significantly higher than the average value for the 1000 times randomly shuffled TF pairs at the subpathway level. The mean Jaccard coefficient of the real common-motif TF pairs was 0.0568 compared with 0.0399 for the randomly shuffled TF pairs (experiential 𝑝 = 0.007, Figure 4(a)). The average Jaccard coefficient of the

8

BioMed Research International

40

100

60 40 Observed

20

80 60 40 Observed

20

20 Observed 10

0.30

0.04 0.05 0.06 0.07 0.08 0.09 0.10

0.025 0.035 0.045 0.055 Jaccard at subpathway level

Jaccard at pathway level

(a)

400

Observed

0

250

100

200

80

150 100 Observed

50

0.45

60 40 Observed

20 0

0 0.00

0.00 0.02 0.04 0.06 0.08 0.10 0.12

0.40

(c)

Probability density

Probability density

600

0.35

Jaccard at pathway class level

(b)

800

200

30

0

0

0

Probability density

Probability density

80

Probability density

Probability density

100

Jaccard at subpathway level

0.05

0.10

0.15

0.20

Jaccard at pathway level

(d)

0.25

0.30

0.35

0.40

Jaccard at pathway class level

(e)

(f)

150 800 600 400 200

Observed

0

200 150 100 Observed

50 0

0.000 0.005 0.010 0.015 0.020 0.025 0.030 Jaccard at subpathway level (g)

Probability density

Probability density

Probability density

1000 100

50 Observed 0

0.00

0.02 0.04 0.06 0.08 Jaccard at pathway level (h)

0.0

0.1 0.2 0.3 0.4 Jaccard at pathway class level

0.5

(i)

Figure 4: Jaccard coefficient. (a) The average Jaccard coefficient of common-motif TF pairs in TSN (0.0568) was significantly greater than that of 1000 times randomly shuffled TFs (0.0399) at the subpathway level (experiential 𝑝 = 0.007). (b) The average Jaccard coefficient of real common-motif TF pairs was significantly higher than that of random TF pairs (0.1045 versus 0.0723, experiential 𝑝 = 0) at the pathway level. (c) The average Jaccard coefficient of real common-motif TF pairs was significantly higher than that of random TF pairs (0.3867 versus 0.3649, experiential, 𝑝 = 0.005) at the pathway class level. The result implied that TFs with similar DNA motifs regulated the cofunctional subpathway. (d) The average Jaccard coefficient of real common-family TF pairs in TSN (0.1239) was significantly higher than that of random TF pairs (0.005) at the subpathway level (experiential 𝑝 = 0). (e) The average Jaccard coefficient of real common-family TF pairs in TSN (0.1478) was significantly higher than that of random TF pairs (0.0402) at the pathway level (experiential 𝑝 = 0). (f) The average Jaccard coefficient of real common-family TF pairs in TSN (0.4088) was significantly higher than that of random TF pairs (0.2420) at the pathway class level (experiential 𝑝 = 0). The result implied that common-family TFs regulated the cofunctional subpathway. (g) The average Jaccard coefficient of real common-family TF pairs in TSN (0.0289) was significantly higher than that of random TF pairs (0.0048) at the subpathway level (experiential 𝑝 = 0). (h) The average Jaccard coefficient of real common-family TF pairs in TSN (0.0935) was significantly higher than that of random TF pairs (0.0386) at the pathway level (experiential 𝑝 = 0). (i) The average Jaccard coefficient of real common-family TF pairs in TSN (0.4779) was significantly higher than that of random TF pairs (0.2413) at the pathway class level (experiential 𝑝 = 0).

BioMed Research International real common-motif TF pairs was significantly higher than for the randomly shuffled TF pairs (0.1045 versus 0.0723, experiential 𝑝 = 0, Figure 4(b)) at the pathway level. The average Jaccard coefficient of the real common-motif TF pairs was significantly higher than that of the randomly shuffled TF pairs (0.3867 versus 0.3649, experiential 𝑝 = 0.005, Figure 4(c)) at the pathway class level. These results revealed a high degree of consistent functionality of subpathways regulated by the common-motif TFs. Thus, the TSN can help us explain the consistent function of the subpathways regulated by the common-motif TFs. For example, HNF4A (hepatocyte nuclear factor 4, alpha) and NR1I3 (nuclear receptor subfamily 1, group I, member 3) have similar DNAbinding motifs (using STAMP software [35] with 𝑒 value < 0.01) (Figure 5), and the Jaccard coefficient at the subpathway level was 0.1538 (Figure 5(a)); the Jaccard coefficient at the pathway level was 0.3333 (Figure 5(b)); the Jaccard coefficient at the pathway class level was 0.3333 (Figure 5(c)). They, respectively, regulated subpathways 00982 4 and s 0098210, and these subpathways belonged to the same drug metabolism-cytochrome P450 pathway (pathway: 00982, Figure 5(d)) [38]. HNF4A and NR1I3 regulate the drugmetabolizing enzyme cytochrome P450 3A4 (CYP3A4) expression. HNF4A is critically involved in the NR1I3mediated transcriptional activation of CYP3A4. CYP3A4 is thought to be involved in the metabolism of nearly 50% of all the drugs currently prescribed, and alteration in the activity or expression of this enzyme seems to be a key predictor of drug responsiveness and toxicity [38]. Recent studies have identified HNF4A sites in the CYP2C8 and CYP2C9 promoter. Silencing HNF4A reduces the constitutive expression of CYP2C8 and CYP2C9. CYP2C enzymes are expressed constitutively and comprise about 20% of the total cytochrome P450 in human liver [39]. Constitutive androstane receptor NR1I3 activates CYP2B6, CYP2C9, and CYP3A4 expression to increase metabolic capability [40]. 3.3. Common Family TFs Tend to Regulate Cofunctional Subpathways. TFs in a common family had similar structure. Some researchers have revealed functional redundancies of the TF family, and common-family TFs show functional similarity [12–14]. Whether common-family TFs tend to regulate the cofunctional subpathways was still unknown. We extracted 80 unique TFs belonging to 28 families from the Transfac database, and these TFs regulated 419 unique subpathways from five pathway classes in KEGG. Eighty unique TFs and 419 unique subpathways formed 1353 relations (Table S4). To test the functional consistency of subpathways regulated by the common-family TFs, we performed one-way clustering to the 80 TFs and 419 subpathways (Figure 6(a)). The TFs that regulated the same pathway classes tended to gather in the same families at the local level. For example, the steroid hormone receptors family of TFs (NR5A1, NR4A1, and NR5A2) regulated the steroid hormone biosynthesis pathway (pathway: 00140, Figure 6(b)). That is, commonfamily TFs tended to regulate the cofunctional subpathways.

9 This local clustering phenomenon may be interpreted by the small average number of TFs (2.86) in each family. To further analyze the effects of the common-family TFs on subpathways, we applied the Jaccard coefficient to evaluate the consistent functionality of subpathways regulated by pairs of TFs with common family. We randomly shuffled the family information of TFs 1000 times and kept the number of each TF (80 unique TFs) and family (28 unique families) unchanged. The average Jaccard coefficient of real common-family TF pairs was greater than that of the randomly shuffled TF pairs (0.1239 versus 0.0050, experiential 𝑝 = 0, Figure 4(d)) at the subpathway level. The average Jaccard coefficient of real common-family TF pairs was significantly greater than that of the randomly shuffled TF pairs (0.1477 versus 0.0402, experiential 𝑝 = 0, Figure 4(e)) at the pathway level. The average Jaccard coefficient of real common-family TF pairs was significantly greater than that of randomly shuffled TF pairs (0.4088 versus 0.2420, experiential 𝑝 = 0, Figure 4(f)) at the pathway class level. These results indicated that common-family TFs regulate cofunctional subpathways. For example, HNF4A and NR1I3 appeared in the common steroid hormone receptors TF family, and they, respectively, regulated subpathways 0098-24 and 00982-10, and these subpathways belonged to the same drug metabolism-cytochrome P450 pathway (Figure 5). 3.4. Common-Tissue TFs Tend to Regulate Cofunctional Subpathways. TFs are always expressed in specific tissues or cell lines. Common-tissue TFs have functional consistency [15]. We wanted to explore the consistent function of subpathways regulated by the common-tissue TFs. We obtained 1464 relations between 122 unique tissues or cell lines and 343 unique TFs, of which 47 unique TFs were from 45 unique tissues and enriched in 385 unique subpathways in the TSN. So, there are 3494 relations between 47 TFs and 385 subpathways (Table S5). We retained tissues or cell types that included at least three unique TFs and obtained 44 unique TFs belonging to 21 tissue types, and these TF target genes were located in 378 unique subpathways, forming 2339 relations between TFs and subpathways. To examine whether common-tissue TFs regulated the cofunctional subpathways, we applied one-way clustering to the 44 unique TFs and 378 unique subpathways (Figure 7(a)). The TFs that regulated the same pathway classes tended to gather in the same tissues at the local level. For example, SP1 and TP53 were expressed in MCF7 cells (Figure 7(b)), increasing binding of SP1 complex to the TP53 promoter region, which enhanced expression of tumor suppressor factor TP53 and led to breast cancer cell apoptosis (pathway: 04210) [41]. This means that cofunctional subpathways regulated by common-tissue TFs tended to form many small clusters. This local clustering phenomenon might be explained by the following reasons: (i) the average amount of TF (5.35) related to tissues or cell lines was small; (ii) many TFs belonged to different tissue or cell types; and (iii) some special tissue TFs had a fundamental biological function and took part in many basic functions [42].

10

BioMed Research International HNF4A and NR1R3 common motif 1 0

HNF4A and NR1R3 common family steroid hormone receptors family

2 (bits)

(bits)

2

1

2

3

4

5

6

HNF4A and NR1R3 common tissue

1 0

7

1

2

3

4

5

6

Path: 00120_15 Path: 00120_19 Path: 00120_6 Path: 00120_7 Path: 00120_9

Path: 03320_2

Path: 04610_3

Path: 04610_5

Path: 00590_2 Path: 00590_3

Path: 00982_5

Path: 04610_6 Path: 04610_8

Path: 00590_4

Path: 00590 Path: 00830_2 Path: 00982_4

Path: 04610_4

Path: 00590_1

Path: 04060

Path: 04610_2

Path: 00980_1

Path: 00982_9

Path: 00830

Path: 03320

Path: 00980

Organismal systems Human diseases

Metabolism

Path: 00982

Path: 04610

Path: 04950_4

Path: 00591_1

Path: 00591

Path: 04950

Path: 03320_1

Path: 00982_10

Path: 00980_2

(a) Jaccard coefficient at subpathway level is 0.1538

(b) Jaccard coefficient at pathway level is 0.3333 4-Glutathionyl-CPA (inactive) 4-Keto-CPA

Dechloroethyl-CPA Chloroacetaldehyde

1.2.1.5 Carboxyphosphamide

CYP3A4 CYP2C

4-OH-CPA 4-Hydroxyifosfamide

Nornitrogen mustard (active) Phosphamide mustard (active)

CYP2B6

Ifosfamide (prodrug)

Aldophosphamide Acrolein Aldoifosfamide

CYP2B6 CYP3A4

CYP3A4

Alcophosphamide 1.1.1.1

CYP2C Cyclophosphamide (prodrug)

Excretion

2.5.1.18 CYP3A4

CYP3A4

(c) Jaccard coefficient at pathway class level is 0.3333

Carboxyphosphamide

Isophosphamide mustard (active)

CYP3A5

CYP2B6 4-Ketoifosfamide

Alcophosphamide

Chloroacetaldehyde Dechloroethylifosfamide (d) Drug metabolism-cytochrome P450

Figure 5: Common-motif, common-family, and common-tissue TF pair. HNF4A and NR1I3 are a pair of TFs with a common motif, common family (steroid hormone receptors), and common tissue (liver). (a) Jaccard coefficient at the subpathway level of HNF4A and NR1I3 was 0.1538. (b) Jaccard coefficient at the pathway level was 0.3333. (c) Jaccard coefficient at the pathway class level was 0.3333. (d) HNF4A regulated subpathway 00982-10, and NR1I3 regulated subpathway 00982-4. Red genes represent the shared genes between subpathways 00982-10 and 00982-4. Pink genes represent the genes only in subpathway 00982-4. Purple nodes represent the genes in the entire pathway 00982.

To test the effects of common-tissue TFs on subpathways in a general sense, we calculated the Jaccard coefficient for pairs of common-tissue TFs. We randomly shuffled the tissue information of TFs (47 unique TFs and 45 unique tissues) 1000 times and kept the number of

each TF unchanged. The average Jaccard coefficient of real common-tissue TF pairs was significantly greater than that of randomly shuffled TF pairs at the subpathway level (0.0289 versus 0.0048, experiential 𝑝 = 0, Figure 4(g)). The average Jaccard coefficient of real common-tissue TF pairs

BioMed Research International

11

Common-family TFs

Pathway class CREB AP 1 Ets type AP 1 CREB Fungal regulators GATA factors GATA factors GATA factors STAT STAT Steroid hormone receptors (NR3) Steroid hormone receptors (NR3) TCF 1 AP 1 Ubiquitous bHLH ZIP factors AP 2 Developmental/cell cycle regulators NF AT C/EBP like factors Tissue specific regulators Rel/ankyrin Ubiquitous bHLH ZIP factors Interferon regulating factors Responders to external signals AP 1 Developmental/cell cycle regulators Developmental/cell cycle regulators STAT STAT Steroid hormone receptors (NR3) Steroid hormone receptors (NR3) GATA factors Developmental/cell cycle regulators Ets type SMAD SMAD Ets type Tissue specific regulators Homeo domain only TCF 1 Steroid hormone receptors (NR3) Steroid hormone receptors (NR3) Steroid hormone receptors (NR3) GATA factors Steroid hormone receptors (NR3) POU domain factors Interferon regulating factors Interferon regulating factors Interferon regulating factors C/EBP like factors AP 1 STAT Rel/ankyrin AP 1 Steroid hormone receptors (NR3) p53 like SMAD Heteromeric CCAAT factors p53 like Regulators with a DNA recognition wing AP 1 Regulators with a DNA recognition wing Regulators with a DNA recognition wing CREB Paired plus homeo domain Steroid hormone receptors (NR3) STAT NF 1 Steroid hormone receptors (NR3) AP 2 csd Cell cycle controlling factors Developmental/cell cycle regulators Cell cycle controlling factors Ubiquitous factors AP 1 Ubiquitous factors p53 like Ubiquitous factors

CREM JUND ETV4 NRL ATF1 HMGA1 GATA2 GATA1 GATA3 STAT5A STAT4 NR2F1 THRB TCF7 JUNB MITF TFAP2C KLF5 NFATC1 CEBPA FOXA2 REL SREBF1 IRF5 SRF NFE2L2 GLI3 GLI1 STAT3 STAT6 NR1I3 NR1I2 GATA6 EGR1 ELF1 SMAD2 SMAD3 SPI1 FOXA3 HNF1A LEF1 NR5A1 NR4A1 NR5A2 GATA4 HNF4A POU2F1 IRF7 IRF3 IRF1 CEBPB FOS STAT1 RELA JUN ESR1 TP73 SMAD4 NFYA TP63 RFX1 XBP1 RFX2 RFX3 CREB1 PAX6 AR STAT5B NFIC VDR TFAP2A YBX1 E2F1 KLF4 MYC YY1 ATF2 SP3 TP53 SP1

(a)

Steroid hormone receptors (NR3)

NR5A1 NR4A1

Steroid hormone receptors (NR3)

NR5A2

Steroid hormone receptors (NR3)

GATA4

GATA factors

HNF4A

Steroid hormone receptors (NR3)

POU2F1 Path: 00140_11 Path: 00140_12 Path: 00140_14 Path: 00140_20 Path: 00140_19 Path: 00140_13 Path: 00140_16 Path: 00140_10 Path: 00140_6 Path: 00140_5 Path: 00140_3 Path: 00140_4 Path: 00140_9 Path: 00140_8 Path: 00140_18 Path: 00140_7 Path: 00980_2 Path: 00982_10 Path: 00590_4 Path: 00590_3 Path: 00590_1 Path: 00590_2 Path: 00591_1 Path: 00982_9 Path: 00982_4 Path: 00982_5

POU domain factors

(b)

Figure 6: One-way clustering to TFs with common family of the TSN. (a) One-way clustering between 80 TFs and 419 subpathways. The corresponding cell was colored orange if there was an edge between the TF and subpathway in the TSN. Subpathway labels were colored according to the pathway class colors used in Figure 2. (b) Zoom-in plot of part of the orange circled region in (a), showing the common family TFs and subpathways.

was significantly greater than that of randomly shuffled TF pairs at the pathway level (0.0935 versus 0.0386, experiential 𝑝 = 0, Figure 4(h)). The average Jaccard coefficient of real common-tissue TF pairs was significantly greater than that of randomly shuffled TF pairs at the pathway class level (0.4779 versus 0.2413, experiential 𝑝 = 0, Figure 4(i)). We conclude that common-tissue TFs tend to regulate the consistent functional subpathway at different functional levels. For example, HNF4A and NR1I3 were

expressed in the liver (Figure 5), where they, respectively, regulated subpathways 00982-4 and subpathways 00982-10, which belonged to the same drug metabolism-cytochrome P450 pathway. HNF4A and NR1I3 regulated CYP3A4, and CYP3A4 promoter activity was most pronounced in liver cells, indicating that CYP3A4 is a liver-specific factor that is required for physiological transcriptional response [38]. Constitutive androstane receptor NR1I3 is predominantly expressed in the liver and it activates CYP2B6, CYP2C9,

12

BioMed Research International

Common-tissue TFs

Pathway class

T cells Breast T cells Breast HeLa HepG2 Hep3B T cells HepG2 Liver MCF7 T cells Testis Heart Brain Kidney Liver HUVEC HeLa HeLa K562 HepG2 MCF7 MCF7 K562 MCF7 Lymph node Kidney Heart Liver Liver Caco HUVEC Hep3B K562 T cells Thymus Spleen Liver Bone marrow Spleen T cells Testis Thymus Intestine Bone marrow Liver Lymph node Lymphocytes Spleen T cells Testis Thymus Thymus T cells Heart Breast Thymus Lymphocytes HeLa Caco Breast Intestine Kidney Liver Spleen Liver Liver Brain Caco Intestine Hep3B Kidney Liver Testis Heart Brain Kidney Liver Lymphocytes Spleen Testis Thymus Spleen Lymph node Thymus Lymph node Bone marrow Spleen Thymus K562 HUVEC MCF7 HepG2 MCF7 HeLa MCF7 HeLa Brain Bone marrow Intestine Lymphocytes MCF7 Testis HeLa HUVEC HeLa MCF7

TCF7 TCF7 CREM ETV4 NR2F1 GATA2 GATA2 GATA3 GATA3 FOXA2 TFAP2C NFATC1 KLF5 SREBF1 SREBF1 SREBF1 SREBF1 REL REL SRF NFE2L2 NFE2L2 STAT3 FOS FOS ESR1 TP73 NR1I3 NR1I3 NR1I3 NR1I2 GATA6 EGR1 EGR1 EGR1 EGR1 ELF1 ELF1 SPI1 SPI1 SPI1 SPI1 SPI1 SPI1 AKNA AKNA AKNA AKNA AKNA AKNA AKNA AKNA AKNA LEF1 LEF1 TP63 TP63 TP63 NFYA NFYA HNF1A HNF1A HNF1A HNF1A HNF1A HNF1A FOXA3 NR4A1 NR4A1 GATA4 HNF4A HNF4A HNF4A HNF4A HNF4A CREB1 CREB1 CREB1 CREB1 CREB1 CREB1 CREB1 CREB1 IRF7 IRF7 IRF7 IRF5 IRF5 IRF5 IRF5 RELA RELA E2F1 STAT5B VDR VDR NFIC NFIC TP53 TP53 TP53 TP53 TP53 TP53 TFAP2A SP1 SP1 SP1

(a) MCF7 HeLa Brain Bone marrow Intestine Lymphocytes MCF7 Testis HeLa HUVEC HeLa MCF7 Path: 04540_13 Path: 04110_26 Path: 04520_7 Path: 04520_5 Path: 04144_2 Path: 04520_3 Path: 04810_29 Path: 04510_4 Path: 04510_24 Path: 04510_23 Path: 04510_20 Path: 04510_19 Path: 04510_13 Path: 04510_1 Path: 04110_18 Path: 04110_25 Path: 04510_8 Path: 04510_6 Path: 04510_10 Path: 04510_18 Path: 04210_23 Path: 04110_8 Path: 04110_20 Path: 04110_27 Path: 04210_20 Path: 04210_24 Path: 04510_7 Path: 04510_5 Path: 04510_25 Path: 04510_17 Path: 04510_15 Path: 04510_16 Path: 04115_1 Path: 04115_7 Path: 04115_4 Path: 04115_2 Path: 04115_3 Path: 04110_1 Path: 04110_7 Path: 04110_17 Path: 04110_19 Path: 04110_4 Path: 04110_3 Path: 04110_22 Path: 04110_23

NFIC NFIC TP53 TP53 TP53 TP53 TP53 TP53 TFAP2A SP1 SP1 SP1

(b)

Figure 7: One-way clustering to TFs with common tissue of the TSN. (a) One-way clustering between 44 unique TFs and 378 unique subpathways. The corresponding cell was colored orange if there was an edge between the TF and subpathway in the TSN. Subpathway labels were colored according to the pathway class colors used in Figure 2. (b) Zoom-in plot of part of the orange circled region in (a), showing the common tissue TFs and subpathways.

and CYP3A4 expression to increase metabolic capability [40].

4. Discussion and Conclusion To gain insight into the influence of TF characteristics on their regulatory subpathways, we constructed a TSN using the TF target gene associations and 𝑘-clique subpathway identification method. We used the 𝑘-clique (𝑘 = 3) method to divide large pathways into multiple subpathways. This

ensured that the identified TF-regulated subpathways were located in the TF-related local regions of the pathways and also increased the tendency for genes in a subpathway to share similar biological functions and be involved in similar biological processes. To date, most pathway identification methods have only identified entire pathways without considering the scale of these pathways. However, each pathway has obviously different scales. These differences in scale hinder the evaluation of TF-related pathways from global networks and the identification of TF-related subpathways that are

BioMed Research International usually located in the local regions of the pathways. By using the subpathway identification method, we ensured that each subpathway was represented on a small scale. Through our analyses of the TSN, we found that the cofunctional subpathways were regulated by the commonmotif, common-family, or common-tissue TFs. We performed one-way clustering on the TSN based on commonmotif, common-family, and common-tissue TFs. The results showed that all of the TFs with three types of characters that regulated the same pathway classes tended to cluster together. Furthermore, we applied the Jaccard coefficient to establish whether common-motif, common-family, and common-tissue TFs regulate cofunctional subpathways. Our results provided strong support for cofunctional subpathways that were regulated by TFs with three types of characters. For example, HNF4A and NR1I3 were a pair of TFs with a common motif, common family (steroid hormone receptors), and common tissue (liver). HNF4A and NR1I3 were expressed in the liver and, respectively, regulated subpathways 00982-4 and 00982-10, which belonged to the same drug metabolismcytochrome P450 pathway. HNF4A and NR1I3 regulated expression of the drug-metabolizing enzyme cytochrome P450 3A4 (CYP3A4) expression [28]. HNF4A is critically involved in the NR1I3-mediated transcriptional activation of CYP3A4, which is thought to be involved in the metabolism of nearly 50% of all the drugs currently prescribed. Alteration in the activity or expression of CYP3A4 seems to be a key predictor of drug responsiveness and toxicity. CYP3A4 promoter activity was most pronounced in liver cells, indicating that CYP3A4 is a liver-specific factor that is required for physiological transcriptional response [38]. Recent studies have identified HNF4A sites in the CYP2C8 and CYP2C9 promoter. Silencing HNF4A reduces constitutive expression of CYP2C8 and CYP2C9, as shown by quantitative polymerase chain reaction analysis [39]. Constitutive androstane receptor NR1I3 activated CYP2B6, CYP2C9, and CYP3A4 expression to increase metabolic capability [40]. We further discussed the clinical or cell development study of HNF4A and NR1I3. We increased the following four aspects of analysis. First, we downloaded the large-scale liver cancer RNAseq data from TCGA public database (http://cancergenome.nih.gov/); we used the MARS method of R package “DEGseq” to analyze the 423 samples (373 liver cancer samples and 50 normal samples) and found HNF5A (𝑞 value < 1.0𝑒 − 6) and NR1I3 (𝑞 value < 1.0𝑒 − 6) were significantly differentially expressed in the liver cancer dataset; thus, they may be used for the study of liver cancer. Second, we searched PubMed database and confirmed that HNF4 and NR1I3 were associated with liver cancer. Previous studies have demonstrated that downregulation of HNF4A was associated with hepatocellular carcinoma (HCC) progression in rodents and humans. In addition, the study of Huang et al. suggested that NR1I3 was a primary regulator of drug metabolism and detoxification and NR1I3 activation or transient strictly limited liver growth [43–45]. Third, we extracted the direct neighbours of this pair of TFs in the protein-protein interaction network and performed the pathway enrichment analysis by DAVID (https://david.ncifcrf.gov/). Results suggested that these genes were significantly enriched in some well-known

13 clinical or cell development related pathways of KEGG such as “pathways in cancer” (hsa05200, 𝑝 = 1.33𝑒 − 04) and “cell cycle pathway” (hsa04110, 𝑝 = 2.88𝑒 − 04). Finally, we tested the Pearson correlation coefficient of this pair of TFs, and the result showed that they were significant positive correlation (𝑟 = 0.4257, 𝑝 < 2.2𝑒−16), which indicated that these two TFs may collectively regulate their downstream target genes; thus, they played important role in the initiation and progression of liver cancer. In summary, the above analysis result indicated that this pair of TFs (HNF4A and NR1I3) may be potential clinical biomarkers of liver cancer. To confirm the validity of our results, we also constructed the TSN with 𝑘 = 4 (𝑝 < 0.01) (Table S6–S9). First, we compared the two networks and found that the two networks were robust. There were 1470 edges between 99 TFs and 343 subpathways in the TSN with 𝑘 = 4, while there were 1490 edges between 89 TFs and 468 subpathways in the TSN with 𝑘 = 3. The overlap of TFs (86 TFs) in the two networks was statistically significant (hypergeometric test 𝑝 < 1.0𝑒 − 6). In the TSN with 𝑘 = 4, there were 343 subpathways corresponding to 95 pathways, and the overlap of pathways (87 pathways) in the two networks was statistically significant (hypergeometric test 𝑝 < 1.0𝑒 − 6). Thus, these two networks were robust. We then repeated some of the analyses and found that the results of the two networks were similar. We compared one-way clustering results of the two networks and found that the results of two networks were similar. The clustering results of TSN with 𝑘 = 4 (Figures S1, S2, and S3) showed that the subpathways regulated by TFs with common motif, common family, and common tissue tended to cluster in the same pathway class. In particular, we verified the consistent function of subpathways regulated by the TFs with common motif, common family, and common tissue by calculating Jaccard coefficient and found the results of two networks were similar. In the TSN with 𝑘 = 4, the average Jaccard coefficient of the real common-motif, commonfamily, and common-tissue TF pairs was significantly higher than the random TF pairs by shuffling TSN 1000 times at the subpathway level, pathway level, and pathway class level (Table S10). As a control, TSN with 𝑘 = 4 presented the similarity of results when compared with that of 𝑘 = 3. These results indicated that our conclusions were robust in different threshold of subpathway networks. In summary, our study is significant for understanding the TFs underlying the regulatory function in complex biological systems.

Conflict of Interests The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments This work was supported by the National High Technology Research and Development Program of China (863 Program, Grant no. 2014AA02110), the National Program on Key Basic Research Project (973 Program, Grant no. 2014CB910504), the National Natural Science Foundation of China (Grants

14 nos. 91439117 and 61473106), and the Research Project of Health Department of Heilongjiang Province (2014-418).

References [1] B. Xu, D. E. Schones, Y. Wang, H. Liang, and G. Li, “A structuralbased strategy for recognition of transcription factor binding sites,” PLoS ONE, vol. 8, no. 1, Article ID e52460, 2013. [2] M. Xu and Z. Su, “A novel alignment-free method for comparing transcription factor binding site motifs,” PLoS ONE, vol. 56, no. 1, article e8797, 2010. [3] G.-H. Wei, G. Badis, M. F. Berger et al., “Genome-wide analysis of ETS-family DNA-binding in vitro and in vivo,” The EMBO Journal, vol. 29, no. 13, pp. 2147–2160, 2010. [4] S. Zhang, X. Zhou, C. Du, and Z. Su, “SPIC: a novel similarity metric for comparing transcription factor binding site motifs based on information contents,” BMC Systems Biology, vol. 7, supplement 2, article S14, 2013. [5] A. S. Bais, N. Kaminski, and P. V. Benos, “Finding subtypes of transcription factor motif pairs with distinct regulatory roles,” Nucleic Acids Research, vol. 39, no. 11, article e76, 2011. [6] P. Li, G. Xu, G. Li, and M. Wu, “Function and mechanism of tumor suppressor gene LRRC4/NGL-2,” Molecular Cancer, vol. 13, article 266, 2014. [7] D. M. Budden, D. G. Hurley, J. Cursons, J. F. Markham, M. J. Davis, and E. J. Crampin, “Predicting expression: the complementary power of histone modification and transcription factor binding data,” Epigenetics & Chromatin, vol. 7, article 36, 2014. [8] R. F. Lowdon, B. Zhang, M. Bilenky et al., “Regulatory network decoded from epigenomes of surface ectoderm-derived cell types,” Nature Communications, vol. 5, article 5442, 2014. [9] W. Czaja, K. Y. Miller, M. K. Skinner, and B. L. Miller, “Structural and functional conservation of fungal MatA and human SRY sex-determining proteins,” Nature Communications, vol. 5, article 5434, 2014. [10] A. Vandenbon, Y. Kumagai, S. Akira, and D. M. Standley, “A novel unbiased measure for motif co-occurrence predicts combinatorial regulation of transcription,” BMC Genomics, vol. 13, supplement 7, article S11, 2012. [11] R. Torella, J. Li, E. Kinrade et al., “A combination of computational and experimental approaches identifies DNA sequence constraints associated with target site binding specificity of the transcription factor CSL,” Nucleic Acids Research, vol. 42, no. 16, pp. 10550–10563, 2014. [12] L. H. Wang, N. H. Ing, S. Y. Tsai, B. W. O’Malley, and M. J. Tsai, “The COUP-TFs compose a family of functionally related transcription factors,” Gene Expression, vol. 1, no. 3, pp. 207–216, 1991. [13] S. Danisman, A. D. J. van Dijk, A. Bimbo et al., “Analysis of functional redundancies within the Arabidopsis TCP transcription factor family,” Journal of Experimental Botany, vol. 64, no. 18, pp. 5673–5685, 2013. [14] B. Huang, Z. T. Qi, Z. Xu, and P. Nie, “Global characterization of interferon regulatory factor (IRF) genes in vertebrates: glimpse of the diversification in evolution,” BMC Immunology, vol. 11, article 22, 2010. [15] D.-X. Ding, F.-F. Tian, J.-L. Guo et al., “Dynamic expression patterns of ATF3 and p53 in the hippocampus of a pentylenetetrazole-induced kindling model,” Molecular Medicine Reports, vol. 10, no. 2, pp. 645–651, 2014.

BioMed Research International [16] M. Kann, E. Bae, M. O. Lenz et al., “WT1 targets Gas1 to maintain nephron progenitor cells by modulating FGF signals,” Development, vol. 142, no. 7, pp. 1254–1266, 2015. [17] M. Yoshida, K. Hata, R. Takashima et al., “The transcription factor Foxc1 is necessary for Ihh–Gli2-regulated endochondral ossification,” Nature Communications, vol. 6, article 6653, 2015. [18] Y. Zhi, Y. Duan, X. Zhou et al., “NF-kappaB signaling pathway confers neuroblastoma cells migration and invasion ability via the regulation of CXCR4,” Medical Science Monitor, vol. 20, pp. 2746–2752, 2014. [19] Q. Li, G. Liu, D. Shao et al., “Mucin1 mediates autocrine transforming growth factor beta signaling through activating the cJun N-terminal kinase/activator protein 1 pathway in human hepatocellular carcinoma cells,” The International Journal of Biochemistry and Cell Biology, vol. 59, pp. 116–125, 2015. [20] J. Shan, W. Donelan, J. N. Hayner, F. Zhang, E. E. Dudenhausen, and M. S. Kilberg, “MAPK signaling triggers transcriptional induction of cFOS during amino acid limitation of HepG2 cells,” Biochimica et Biophysica Acta, vol. 1853, no. 3, pp. 539–548, 2015. [21] J. V. Almaden, R. Tsui, Y. C. Liu et al., “A pathway switch directs BAFF signaling to distinct NF𝜅B transcription factors in maturing and proliferating B cells,” Cell Reports, vol. 9, no. 6, pp. 2098–2111, 2014. [22] A. Limonciel, K. Moenks, S. Stanzel et al., “Transcriptomics hit the target: monitoring of ligand-activated and stress response pathways for chemical testing,” Toxicology in Vitro, 2015. [23] T. Judeh, C. Johnson, A. Kumar, and D. Zhu, “TEAK: topology enrichment analysis framework for detecting activated biological subpathways,” Nucleic Acids Research, vol. 41, no. 3, pp. 1425– 1437, 2013. [24] C. Li, X. Li, Y. Miao et al., “SubpathwayMiner: a software package for flexible identification of pathways,” Nucleic Acids Research, vol. 37, no. 19, article e131, 2009. [25] X. Li, C. Li, D. Shang et al., “The implications of relationships between human diseases and metabolic subpathways,” PLoS ONE, vol. 6, no. 6, Article ID e21131, 2011. [26] C. Li, D. Shang, Y. Wang et al., “Characterizing the network of drugs and their affected metabolic subpathways,” PLoS ONE, vol. 7, no. 10, Article ID e47326, 2012. [27] P. Khatri, M. Sirota, and A. J. Butte, “Ten years of pathway analysis: current approaches and outstanding challenges,” PLoS Computational Biology, vol. 8, no. 2, Article ID e1002375, 2012. [28] V. Matys, E. Fricke, R. Geffers et al., “TRANSFAC: transcriptional regulation, from patterns to profiles,” Nucleic Acids Research, vol. 31, no. 1, pp. 374–378, 2003. [29] H. Ogata, S. Goto, K. Sato, W. Fujibuchi, H. Bono, and M. Kanehisa, “KEGG: kyoto encyclopedia of genes and genomes,” Nucleic Acids Research, vol. 27, no. 1, pp. 29–34, 1999. [30] K. Beishline and J. Azizkhan-Clifford, “Sp1 and the ‘hallmarks of cancer’,” The FEBS Journal, vol. 282, pp. 224–258, 2015. [31] J.-P. Zhang, H. Zhang, H.-B. Wang et al., “Down-regulation of Sp1 suppresses cell proliferation, clonogenicity and the expressions of stem cell markers in nasopharyngeal carcinoma,” Journal of Translational Medicine, vol. 12, article 222, 2014. [32] M. L. Tornesello, C. Annunziata, L. Buonaguro, S. Losito, S. Greggi, and F. M. Buonaguro, “TP53 and PIK3CA gene mutations in adenocarcinoma, squamous cell carcinoma and high-grade intraepithelial neoplasia of the cervix,” Journal of Translational Medicine, vol. 12, article 255, 2014. [33] N. Yamamoto, Y. Takemori, M. Sakurai, K. Sugiyama, and H. Sakurai, “Differential recognition of heat shock elements by

BioMed Research International

[34]

[35]

[36]

[37]

[38]

[39]

[40]

[41]

[42]

[43]

[44]

[45]

members of the heat shock transcription factor family,” The FEBS Journal, vol. 276, no. 7, pp. 1962–1974, 2009. V. P. Ramirez, W. Krueger, and B. J. Aneskievich, “TNIP1 reduction of HSPA6 gene expression occurs in promoter regions lacking binding sites for known TNIP1-repressed transcription factors,” Gene, vol. 555, no. 2, pp. 430–437, 2015. S. Mahony and P. V. Benos, “STAMP: a web tool for exploring DNA-binding motif similarities,” Nucleic Acids Research, vol. 35, no. 2, pp. W253–W258, 2007. M. Heinig, E. Petretto, C. Wallace et al., “A trans-acting locus regulates an anti-viral expression network and type 1 diabetes risk,” Nature, vol. 467, pp. 460–464, 2010. F. Allagnat, F. Christulia, F. Ortis et al., “Sustained production of spliced X-box binding protein 1 (XBP1) induces pancreatic beta cell dysfunction and apoptosis,” Diabetologia, vol. 53, no. 6, pp. 1120–1130, 2010. R. G. Tirona, W. Lee, B. F. Leake et al., “The orphan nuclear receptor HNF4𝛼 determines PXR- and CAR-mediated xenobiotic induction of CYP3A4,” Nature Medicine, vol. 9, no. 2, pp. 220–224, 2003. R. Rana, Y. Chen, S. S. Ferguson, G. E. Kissling, S. Surapureddi, and J. A. Goldstein, “Hepatocyte nuclear factor 4𝛼 regulates rifampicin-mediated induction of CYP2C genes in primary cultures of human hepatocytes,” Drug Metabolism & Disposition, vol. 38, no. 4, pp. 591–599, 2010. J. M. Pascussi, S. Gerbal-Chaloin, L. Drocourt, P. Maurel, and M. J. Vilarem, “The expression of CYP2B6, CYP2C9 and CYP3A4 genes: a tangle of networks of nuclear and steroid receptors,” Biochimica et Biophysica Acta, vol. 1619, no. 3, pp. 243–253, 2003. G. Gu, I. Barone, L. Gelsomino et al., “Oldenlandia diffusa extracts exert antiproliferative and apoptotic effects on human breast cancer cells through ER𝛼/Sp1-mediated p53 activation,” Journal of Cellular Physiology, vol. 227, no. 10, pp. 3363–3372, 2012. B. Roson-Burgo, F. Sanchez-Guijo, C. Del Ca˜nizo, and J. De Las Rivas, “Transcriptomic portrait of human Mesenchymal Stromal/Stem cells isolated from bone marrow and placenta,” BMC Genomics, vol. 15, article 910, 2014. N. L. Lazarevich, O. A. Cheremnova, E. V. Varga et al., “Progression of HCC in mice is associated with a downregulation in the expression of hepatocyte nuclear factors,” Hepatology, vol. 39, no. 4, pp. 1038–1047, 2004. B.-F. Ning, J. Ding, C. Yin et al., “Hepatocyte nuclear factor 4𝛼 suppresses the development of hepatocellular carcinoma,” Cancer Research, vol. 70, no. 19, pp. 7640–7651, 2010. W. Huang, J. Zhang, M. Washington et al., “Xenobiotic stress induces hepatomegaly and liver tumors via the nuclear receptor constitutive androstane receptor,” Molecular Endocrinology, vol. 19, no. 6, pp. 1646–1653, 2005.

15

Cofunctional Subpathways Were Regulated by Transcription Factor with Common Motif, Common Family, or Common Tissue.

Dissecting the characteristics of the transcription factor (TF) regulatory subpathway is helpful for understanding the TF underlying regulatory functi...
NAN Sizes 0 Downloads 10 Views