Gene. 121 (1992) 305-311 0 1992 Elsevier Science Publishers

GENE

B.V. All rights reserved.

305

0378-l 119/92/$05.00

06727

Cloning and characterization of the ADH.5 gene encoding dehydrogenase 5, formaldehyde dehydrogenase (Recombinant

DNA;

Man-Wook

Hur and Howard

ofBiochemistryand

Departments 5122,

pseudogene;

promoter;

CpG island;

housekeeping

gene; transcriptional

human alcohol

factors;

alcoholism)

J. Edenberg

Molecular Biology, and qf Medical and Molecular

Genetics, Indiana

University School

ofMedicine,Indianapolis,

IN 46202-

USA

Received

by J. Piatigorsky:

21 April 1992; Revised/Accepted:

12 June/l5

June 1992; Received

at publishers:

9 July 1992

SUMMARY

Human X-alcohol dehydrogenase (x-ADH) is a zinc-containing dimeric enzyme responsible for the oxidation of longchain alcohols and o-hydroxyfatty acids. Class-III ADHs, of which X-ADH is the prototype, are widely produced and well conserved during evolution. This suggests that they fulfill important housekeeping roles in cellular metabolism. Recent evidence suggests that class-III ADH and formaldehyde dehydrogenase (FDH) are the same enzyme. We have isolated and characterized two overlapping genomic clones that cover the entire ADH5 (FDH) gene. ADHS is composed of nine exons and eight introns. Two major transcription start points were identified by primer extension. The 5’ nontranslated region is unusual in that it contains two additional upstream ATG codons, which would encode peptides of 20 and 10 amino acids. Neither of the upstream ATGs is in a good context for translation initiation, whereas the ATG initiating &khgr;-ADH is in a favorable context. The 5’ region of ADHS is a CpG island; it is extremely G+C rich and has many CpG doublets. It does not contain either a TATA box or a CAAT box. This is consistent with ubiquitous expression, and contrasts with the promoters of all previously cloned ADH genes, which are expressed in a tissue-specific manner. The 5’ region of ADH.5 contains consensus binding sites for the transcriptional regulatory proteins, Spl, AP2, LF-Al, NF-1, NF-A2, and NF-El. A 1.5-kb upstream fragment from ADHS was able to drive the transcription of a cat reporter gene at high levels in monkey kidney cells (CV-1). Several processed pseudogenes were also isolated.

Mammalian alcohol dehydrogenases (ADH; EC 1.1.1.1) are zinc-containing dimeric enzymes that catalyze the reversible oxidation of a wide variety of alcohols, using

NAD + as the preferred coenzyme (Ehrig et al., 1990). They form a gene family divided into at least five distinct classes with about 60% aa identity in interclass comparisons (Strydom and Vallee, 1982; Pares et al., 1990; Edenberg, 1991; Yasunami et al., 1991). Human x alcohol dehydro-

Correspondence

monkey

INTRODUCTION

Molecular

to: Dr. H. J. Edenberg,

Biology, Indiana

Drive, Indianapolis,

University

IN 46202-5122,

Tel. (317) 274-2353;

Department

of Biochemistry

School of Medicine,

and

635 Barnhill

USA.

kidney

Abbreviations: acetyltransferase;

aa, amino acid(s);

ADH,

alcohol

dehydrogenase;

ADH; bp, base pair(s); cat, gene encoding Cm, chloramphenicol;

Denhardt

solution,

0.02’y0

Ficoll/

0.02%

lacZ, gene encoding

Fax (317) 274-4686.

ribonucleotide; gene encoding

cell line;

polyvinylpyrrolidone/ 0.02% bovine serum albumin; FDH, formaldehyde dehydrogenase; FDH , gene encoding FDH; kb, kilobase or 1000 bp;

cpm, count(s)

ADH,

CAT; CAT, Cm per minute; CV-1,

b-galactosidase; nt, nucleotide(s); oligo, oligodeoxyPipes, 1,4-piperazine-diethanesulfonic acid; SDS, sodium

dodecyl sulfate; SSC, 0.15 M NaCl/ 0.015 M Na,.citrate pH 7.6; SV40, simian virus 40; tsp,transcription start point(s); UTR, untranslated region.

306 genase (x-ADH) is the prototype of class-III ADHs (Strydom and Vallee, 1982); it is responsible for the oxidation of long-chain alcohols and o-hydroxyfatty acids (Pares

wise comparisons among human, rat, horse and mouse; Kaiser et al., 1988; 1989; Giri et al., 1989a: Sharma et al., 1989; Edenberg et al., 1990) suggests that all of the class-

and Vallee, 1981; Wagner et al., 1984; Giri et al., 1989b). X-ADH is expressed in all tissues tested, unlike other ADHs which are expressed in a tissue specific manner (Adinolfi et al., 1984; Duley et al., 1985; Edenberg, 1991). It is the only ADH present in brain, placenta, and testis (Pares

III ADHs and FDH conclusion: and FDH both been

et al., 1984; Beisswenger et al., 1985; Dafeldeker Vallee, 1986). The human gene encoding X-ADH

Molecular cloning of the ADH.5 (FDH) gene should provide information on the structure of this gene and on its evolutionary origin: did it diverge from the other ADHs or

named ADHS (Smith, 1986). Formaldehyde dehydrogenase

(FDH;

and was

EC 1.2.1.1) is im-

converge from another (FDH) gene? an important first step in studying expression, which differs from the ADH genes. In this communication

portant in the detoxification of formaldehyde (Pommotabbed et al., 1989). Formaldehyde can be formed in animal cells from several exogenous and endogenous precursors including methionine, choline, homoserine, methanol, and xenobiotics (Uotila and Koivusalo, 1987). FDH can also metabolize methylglyoxal and other a-ketoaldehydes. All mammalian cells studied so far contain FDH (Uotila and

It should also provide the regulation of its other, tissue-specific we report the cloning,

sequencing, and characterization of the human ADH.5 (FDH) gene, the first class-III ADH gene to be cloned, and demonstrate that its 5’ flanking sequence can drive the transcription of a reporter gene.

Koivusalo, 1974; Kahn et al., 1984). Recently, it was shown that the aa sequence of five tryptic peptides of rat FDH were identical to the sequences of the rat class-III ADH, ADH-2 (Koivusalo et al., 1989). The two activities copurify through numerous steps and have identical physiochemical properties and the same wide tissue distribution. Purified rat FDH has ADH activity

RESULTS

AND DISCUSSION

(a) Isolation and characterization of MM.5 genomic clones Initial screening was carried out with mouse Adh-B2 cDNA as the probe to avoid bias toward a single gene in case several genes were present. Clone 27-1, which contains the sequence from intron 4 to beyond the 3’ nontranslated region, was isolated along with several pseudogenes. No full-length clone was obtained. The second screening yielded clone AC-4, which overlaps with 17-l

with the characteristics of a class-III ADH (Koivusalo et al., 1989). Thus, rat class-III ADH and rat FDH appear to be the same enzyme. The high degree of aa conservation among the mammalian class-III ADHs (greater than 91% aa identity in pair0.95 n n

are also FDHs. The mapping of human ADHS to the same chromosomal region reinforces this ADHS (Carlock et al., 1985; Giri et al., 1989a) (Kahn et al., 1984; Hiroshige et al., 1985) have locahzed to chromosome 4q21-q25.

0.62 nn

3.0

B h7-1

C

K-4

D;

1 9

I

;

,

(f

5

E

I

I I

SP I

I"

8 I

I1

E

SH

K

S

H

H

E

I

II

I

I

I

I

I

10

I

12

I

I

I4

I

I

Fig. 1. Structure and restriction map ofADH5 (FDH). (A ) Gene structure. Exons arc represented signal is marked as (A)n. (B ) Sequencing strategy. (C ) I clones C-4 and 7-l. (D ) Restriction Methods: a human genomic

cDNA

fragment

tion (Denhardt, was isolated. Hybridization

library was prepared in the vector lGeml1 (Promega, Madison, (1.9-kb PstI-EcoRI fragment; Hur et al., 1992) as a probe. Hybridization

1966)/5%

SDS/O.02

M Na.phosphate/S%

dextran

16

I

I

18

I

I

E

20 kb

I

I

I

by open boxes, intron sizes are in kb. The polyadenylation map: E, EcoRI; H, HindIII; K, KpnI; Sp, SphI; S, SstI.

WI) and screened without amplification using a mouse Adh-B, was at 42°C in 40”/, formamide/S x SSCjl x Denhardt solu-

sulfate, and final washing

was at 5O”C, 90 min in 1 x SSC/O.l%

SDS. Then 17-l

A second library was constructed and screened with a human X-ADH cDNA fragment (220-bp EcoRI-DdeI fragment; Giri et al., 1989a). was at 50°C in 6 x SSCj5 x Denhardt solution/O.l% SDS/O.O2% Na.phosphate; the final wash was at 55°C 20 min in 1 x SSCjO. 1%

SDS. After stripping

off the probe, the filters were hybridized

with a 190-bp fragment

from intron 4 (excised from 17-I);

the final wash was at 55”C,

20 min in 0.1 x SSC / 0.1% SDS. Probes were labeled with [c(-32P]dATP and [a-32P]dCTP either by nick translation (Tabor and Struhl, 1987) or by using random hexamers (Feinberg and Vogelstein, 1984). DNAs from the positive phage clones were characterized by restriction endonuclease mapping and hybridization analysis (Southern, 1975).

307 and extends from the 5’ upstream region to intron 6. In total, ten positive bacteriophages were isolated from ap-

polypeptide

prox. lo6 plaques. The ;i DNAs were subjected to restriction mapping and sequencing. Most clones were processed pseudogenes related to ADH.5. These explain some of the multiple bands that were reported to hybridize with the X-ADH cDNA (Sharma et al., 1989). Another ADHS

as a strong translational initiation codon (Kozak, 1987). It is therefore likely that the two upstream ATGs are bypassed by the 40s ribosomal subunit most of the time; their presence might, however, modulate the translation efficiency of the X-ADH mRNA.

-related pseudogene has been reported (Matsuo and Yokoyama, 1990). ADHS is composed of nine exons and eight introns, spanning 15 kb (Fig. 1). The intron-exon boundaries were

Giri et al. (1989a) reported an apparent in-frame ATG 51 nt upstream from the main ATG in their cDNA se-

established by comparison to the nt sequence of human X-ADH cDNA (Giri et al., 1989a; Sharma et al., 1989). All intron-exon junctions follow the GT/AG rule (Breathnach and Chambon, 1981). The introns of ADH5 interrupt the coding sequence at the same positions as found for the class-I and class-II ADH genes (Edenberg, 1991). The identical intron locations, as well as the overall homology ofADH5 to the class-I and class-II genes, demonstrate that the ADHS (FDH) gene is clearly a member of the ADH multigene family. The sizes of the introns differ from those of the class I and class II ADHs , and there was no apparent sequence homology of the introns or of the 5’ and 3’ untranslated regions with genes of these other classes. The sequence of ADHS is shown in Fig. 2. The largest exon, exon 5, is 261 bp; the introns ranged from 13 1 bp (intron 8) to 4.3 kb (intron 4). There are several discrepancies among the protein, cDNA, and genomic sequences. ADHS encodes Asp167 and Phe246, as in the reported protein sequence (Kaiser et al., 1988) and in the cDNA described by Sharma et al. (1989); the cDNA of Giri et al. (1989a) differed in 2 nt that would result in Tyr’67 and Leu246. There are also some minor differences in the 3’ nontranslated region, as noted in Fig. 2. (b) Identification of the fsp and upstream ATGs The 5’ ends of the mRNA were determined by primer extension (Fig. 3). The major tsp were a doublet at 78 bp and 80 bp upstream from the ATG codon, and a weaker one was noted 162 bp upstream (Figs. 2 and 3). Both the longer and shorter 5’ untranslated regions have two ATG codons upstream from the ATG codon that initiates translation of the X-ADH polypeptide (Figs. 2 and 4). This is rare in eukaryotic genes (Kozak, 1987). The upstream ATGs are out of frame with the main polypeptide, but in frame with each other (Fig. 4). If they were to be translated, they would make a 20-aa and a lo-aa polypeptide, respectively; both would terminate just before the ATG that initiates the X-ADH polypeptide. The sequences around both upstream ATGs have Cs at the -3 position and pyrimidines at the + 4 position (Fig. 4); thus, both are expected to be weak for translational initiation (Kozak, 1987). The ATG codon that initiates the X-ADH

position

fits the Kozak consensus

sequence,

with A at

-3 and G at + 4 (Fig. 4) and should, therefore,

act

quence, and therefore suggested that there may be a leader polypeptide (Fig. 4). No leader polypeptide was found in the reported protein sequence (Kaiser et al., 1988). Our ADHS genomic sequence does not have the particular ATG believed by Giri et al. (1989a) to be responsible for the leader polypeptide; that ATG is part of an 8-bp region that diverges from the gene sequence (Fig. 4). The 5’ region of the cDNA is very G+C-rich; our sequencing, using 7-deaza-dGTP at elevated temperatures, showed an additional C in the cDNA inserted between the putative leader ATG and the ATG that initiates synthesis of the X-ADH polypeptide, altering the reading frame. This C is also found in the ADHS gene (Fig. 4). Thus, there is no evidence in the ADHS gene for a leader peptide. (c) Structure of the ADHS promoter The proximal 5’ region of ADHS is extremely rich in G+C: 73% in the first 200 bp upstream from the translation start codon and 61% in the 358-bp upstream region (Fig. 2). This region has the characteristics of a CpG island. The CpG doublet is rare in mammalian DNA, when compared either with the number expected based upon the C and G content or with the number of GpC doublets (Gardner-Garden and Frommer, 1987; Bird, 1986). CpG is not, however, underrepresented in CpG islands. For ADHS , CpGs are about equal in frequency to GpC: the ratio is approx. 0.95 in the 358 bp upstream from the translation start codon. CpGs are found approximately as often as expected from the base composition: there are 35 CpGs in the upstream 358 bp, where 31 are expected, and 24 within the upstream 200 bp where 24 are expected. These data indicate that the upstream region of ADHS is a CpG island. CpG islands with similar characteristics are found in the 5’ regions of virtually all housekeeping genes (Gardiner-Garden and Frommer, 1987). Indeed, the wide tissue distribution of X-ADH (FDH) enzyme activity and its evolutionary conservation suggests that it plays an important housekeeping role. The presence of processed pseudogenes (this report and Matsuo and Yokoyama, 1990) implies that the gene is expressed even in germ line cells. The proximal 5’ region of the human ADHS (FDH) gene contains no TATA box and no CCAAT box. The G+Crich promoter does contain consensus sequences for the

308 NF-PC6 NF-A2 TTTCALU$BBIIYIZITTCCCTCACCGTCCACATCTGT&_I NF-~6

NF-El L(SBBGT~CGATAACTGCTGTMAGTTACAC~

NF-1 TTCCCGACMUAAAKGAGTTCTGCAACMGTCCGJCGGATTTTAGCA4TGAUC

-162 .

LF-Al

GCACMCTTAGCGGCACGCACCACAGCTCGA&&&UX

CGGCGCCGTGCAGCCTCGCCGGCGAGTG

SPI

-316

-211

TTCCGGTTGGAGCCATTGCAAGCCCC

-106

-80 -45 l * G3T JAGGCGCTCGCCACGCCCA~GCCTCCGTCGCTGCGCGGCCCACCCCGGATGTCAGCCCCCCGCGCCGACCAGAATCCGTGMC

SPI CCCCACGCCCCGCCCCCCTCGC __Ap_2___

___Ap_2___

NF-I _-_Ap_2___

1 Met Ala Asn Glu ATG CCC MC GAG UAGGGCCCGTTG

-1

___AP_2___ 10 Vol Ile Lys Cys Lys Ala Ala Val Ala Trp Glu Ala Gly GTT ATC A4G TGC AAG GCT GCA GTT GCT TGG GAG GCT CGA

0.35 kb ivs-l TTTCCCCACTTC&

30 20 Lys Pro Leu Ser Ile Glu Glu Ile Glu Vol Aia Pro Pro Lys Ala HIS Glu Val Arg Ile Lys 3.0 kb AAC CCT CTC TCC ATA GAG GAG ATA GAG GTG GCA CCC CCA AAG GCT CAT GAA GTT CGA ATC AAG ZLy\ATGATACATT ivs-2

TAATGTC‘CTGAT&

50 40 Ile Ile Ala Thr Ala Val Cys His Thr Asp Ala Tyr Thr Leu Ser Gly Ala Asp Pro Glu ATC ATT CCC ACT GCG GTT TGC CAC ACC CAT CCC TAT ACC CTG AGT CGA GCT CAT CCT GAG

70 80 60 Gly Cys Phe Pro Val Ile Leu Gly His Glu Gly Ala Gly Ile Val Glu Ser Val Gly Glu Gly VIII Thr Lys Leu Lys GGT TGT TTT CCA GTG ATC TTG CGA CAT GM GGT GCT CGA ATT GTG GM ACT CTT GGT GAG CGA GTT ACT MC CTG AAG Alo G GCG G WGGAGMTA

620 bp 1vs-3

100 90 ly Asp Thr Vol Ile Pro Leu Tyr Ile Pro Gin Cys Gly Glu Cys Lys ATTTGTTTCCffi CT GAC ACT GTC ATC CCA CTT TAC ATC CCA CAG TGT CGA GM TGC AAA

110 Phe Cys Leu As" Pro Lys Thr Asn Leu Cys Gin Lys Ile Ar TTT TGT CTA MT CCT MA ACT MC CTT TGC CAG AAG ATA AG LITAGTATCTT

4.3 kb ivs-4 TTTTTCTTT&

g Vol Thr Gin Gly A GTC ACT CM CCC

130 140 120 Lys Gly Leu Met Pro Asp Gly Thr Ser Arg Phe Thr Cys Lys Gly Lys Thr Ile Leu His Tyr Met Gly Thr Ser Thr AAA CGA TTA ATG CCA CAT GGT ACC AGC AGA TTT ACT TGC AM GGA AAG ACA ATT TTG CAT TAC ATG CGA ACC AGC ACA 160 167 170 150 Phe Ser Glu Tyr Thr Val Vol Alo Asp Ile Ser Vol Ala Lys Ile Asp Pro Leu Ala Pro Leu Asp Lys 'Jo1 Cys Leu TTT TCT GAA TAC ACA GTT GTG GCT CAT ATC TCT GTT GCT AA4 ATA CAT CCT TTA GCA CCT TTG CAT AbA GTC TGC CTT 180 Leu Gly Cys Gly Ile Ser Thr Gly Tyr Gly Ala Ala Val Asn Thr Ala Lys CTA GGT TGT GGC ATT TCA ACC GGT TAT GGT GCT GCT GTG AK ACT CCC AAG UAAGAGACTGAC

151 bp IVS-5 TCTTTACTCCTAG

200 210 190 Leu Clu Pro Gly Ser Vol Cys Ala Val Phe Gly Leu Gly Gly Val Gly Leu Ala Val Ile Met Gly Cys Lys Vol Ala TTG GAG CCT CCC TCT GTT TGT KC GTC TTT GGT CTG CGA CGA GTC CGA TTG GCA GTT ATC ATG CCC TGT AM GTG GCT 240 220 230 Cly Ala Ser A,-g Ile Ile Gly Val Asp Ile Asn Lys Asp Lys Phe Ala Arg Ala Lys Glu Phe Gly Ala Thr Glu Cys GGT GCT TCC CCC ATC ATT GGT GTG GAC ATC MT AM CAT AM TTT GCA AGG CCC AAA GAG TTT CGA CCC ACT GAA TGT 250 260 246 Ile Asn Pro Gin Asp Phe Ser Lys Pro Ile Gin Glu Val Leu Ile Glu Met Thr Asp Gly Gly Vol Asp Tyr Ser Phe ATT AAC CCT CAG CAT TTT ACT AM CCC ATC CAG GAA GTG CTC ATT GAG ATG ACC CAT CGA CGA GTG GAC TAT TCC TTT 270 Glu Cys Ile Gly Asn Val Lys Vol Met GAA TGT ATT GGT AAT GTG MC GTC ATG LIGAGTATGGGCT

1.2 kb lvs-6 CTCTGTGCCTGW

280 Arg Ala Ala Leu Glu Ala Cys HLS AGA GCA GCA CTT GAG GCA TGT CAC

290 300 Lys Gly Trp Gly Val Ser Val Val Val Gly Vol Ala Ala Ser Gly Glu Glu Ile Alo Thr Arg Pro Phe Gin Leu Val AAG CCC TGG CCC GTC AGC GTC GTG GTT CGA GTA CCT GCT TCA GGT GM GAA ATT CCC ACT CGT CCA TTC CAG CTG GTA 320 310 Thr Gly Arg Thr Trp Lys Gly Thr Ala Phe Gly G ACA GGT CCC ACA TGG A4A CCC ACT CCC TTT CGA G WTTCCATGG

2.2 kb 1vs-7

GATTTCTTTT&

ly Trp Lys Ser Val Glu GA TGG AAG ACT GTA GAA

340 350 330 Ser Vol Pro Lys Leu Val Ser Glu Tyr Met Ser Lys Lys Ile Lys Val Asp Glu Phe Vol Thr HIS Asn Leu Ser Phe ACT GTC CCA AAG TTG GTG TCT GM TAT ATG TCC AA4 AAG ATA &!A GTT CAT GM TTT GTG ACT CAC MT CTG TCT TTT 360 Asp Glu Ile Asn Lys Ala Phe Glu Leu Met His SW Gly Lys Se CAT GAA ATC MC AAA CCC TTT GM CTG ATG CAT TCT CGA AAG AG UAGGCTTTCT

370 131 bp P Ile Arg Thr ~VS-8 TTCCTTTTACffi C ATT CGA ACT

374 Vol Vol Lys 1le l ** 1 GTT GTA MG ATT TM TTCAAnnGAGAAAAAT~TGTCCATCCTGTCGTGATGTGATAGGAGCAGCTTMCAGGCAGGGAG~GCGCCTCCMCCTCACA

83

GCCTCGTAGAGCTTCACAGCTACTCCAG~TAGGGTTATGTGTGTCATTCATG~TCTCTATMTCMGGACMGGATPTTCAGTCATGMCCTGTTTTCT

187

GGATGCTCCTCCACATAAATMTTGCTAGTTI/\T~GG~TATTTIMCATBBZBBBI\GTPTT~CTACATT~GTGTG~~TTGTCTTGTTT~TGCT~TC

290

ATCATTGTCACGGTTTGTCTGCCCATTATCTTCATTCTGCMGGG~GGG~GGMGCAGGGCAGTGGTGGGTGTCTGAAACCTCAGAAACAT~CGTTGM

334

CTTTTMGGGTCTCAGTCCCCGTTGATTAAAGMCAGATCCTCCATCAGTGACAnnGTTMTCAGGACCCMGTCTGCTTCTGTGATATTATCTT~MGGGA

438

GGTACTGTGCCTTGTTCATACCTGTACCCC~TTCCTAGGATG~CATCTGCCTTCAGGGGGCACT~TGTATTATTGAAACAGCATTCTGGGCTTAAATAG

602

GTGTATGTATGTGTTGGTTGTGACTGTACTATTICTAGTATAGTGMCTACATACTGAATATCCMGTTCTCAGCACCTACTTTTGTCAAATCTTMCATTTTG

706

CCACTTCGAGATCACATTGCCATTCCTCCCCTCC~GAGGTMC~TTATCCACMTTTGATGTTTATCATTCCTGTGTTGTTGTACTTTCACTGTGTATMCC

810

TAAACCATCTACTCTTTAGTACTGTTTTATATATTTTTAAGCCTCATACTTGCTCATTCTACAGCTTTTTTCACTCATTATTGTATAATTATATCTGAAGCTCT

914

CGTTCATTMTTTTAGTCCTGTGTAGCAGMTTCMTTACGGGMCTACCATMTTTATCTGTTCTCCAGTTGMGGCATGMGTTGTTGCCAGTTTCTGTATT

1018

AT~CACTGTAGTGG~CATTCTTCTGCATTGGGCTLACTGCGTGTTACCTMGACGTATCACAGBACATTTAGCCTTATAGACATTGCCAAATTGC

1121

TCTTCAMCTAAATGTGACTTTTTGTGMTTACATGAGTATGGMT *

Fig. 2. Nucleotide

sequence

of the ADH5 (FDH) gene: exons, the boundaries

of introns,

and the 5’ and 3’ flanking

regions

are shown.

The 5’ flanking

region is numbered at the right, counting from the ATG start codon. Potential binding sites (based upon consensus sequences) for the transcriptional activators AP-2, LF-Al, NF-1, NF-El, NF-AZ, NF-KB, and Spl are indicated, as is an inverted G, T-related site (Carr and Edenberg, 1990). The tsp arc shown by single asterisks and the stop codon by three asterisks. The deduced aa sequence is numbered above. Asp’“’ and Phe246, in bold, match the cDNA of Sharma et al. (1989) but differ from that of Giri et al. (1989a). Two potential polyadenylation signals, AATAAA, are bold and underlined. The nt where polyadenylate was added in the cDNA of Sharma et al. (1989) is indicated by A Those nt underlined in the 3’ UTR were not found in

309 transcriptional regulatory protein Spl (Kadonaga et al., 1986) as might be expected from its base composition. It

TGCA

also has potential binding sites for several transcriptional regulators: AP-2, liver-specific transcription factor LF-Al, and nuclear factors NF-1, NF-El, NF-A2, and NF-KB (Faisst

and Meyer,

1992).

(d) Function of the ADHS

promoter

To test the ability of the 5’ flanking region of ADH.5 to drive transcription, a 1.5-kb fragment containing the region upstream from the coding sequences was subcloned into pCAT-Basic (Promega) to create pCAT-ADHS. This construct was transfected into CV- 1 cells for transient expression assays. Plasmid pCAT-ADHS showed strong promoter activity in CV-1 cells (Fig. 5), nearly comparable to that of the SV40 promoter + enhancer contained in the pCAT-Control plasmid (Promega). (e) Structure of the 3’ untranslated region of ADH.5 ADHS has two potential polyadenylation signals (AATAAA), 243 bp and 1092 bp downstream from the stop codon TAA. Giri et al. (1989a) found that the major X-ADH mRNA in human liver, spleen, and in several human cell lines was about 1.7 kb, the size expected for use of the first polyadenylation signal; there was also a weak

Fig. 3. Primer extension Methods: A human

of human liver RNA. Arrows

X-ADH cDNA fragment

) was isolated, dephosphorylated, nucleotide

with HaeIII. The 73-bp antisense

from a 6% polyactylamide/‘l

for primer extension. lo6 cpm)

were

formaldehyde/40

Total human

dissolved

fragment

kinased with [ Y-~*P]ATP and T4 poly-

kinase, and digested

was isolated

three tsp.

indicate

(98-bp EcoRI-PstI

liver RNA (50 ng) and primer (0.1 x

in 30 pl of hybridization

mM Pipes/O.4

strand

M urea gel, purified, and used

M NaCl/l

solution

mM EDTA).

(75%

After denatur-

ation at 90°C for 5 min, the mixture was slowly cooled to 36°C (2) or 42°C

(1) and hybridized

precipitation,

at this temperature

the primer extension

reaction

overnight.

band of about 2.6 kb. Sharma et al. (1989) isolated a cDNA of about 2.3 kb from a placental cDNA library, consistent with use of the second polyadenylation signal. There is 67% sequence identity between the proximal portion of the 3’ untranslated region of ADHS and the entire 3’ UTR region of mouse Adh-B, cDNA (Hur et al., 1992). This region extends from 67 bp to 250 bp downstream from the stop codon, ending just beyond the first AATAAA of the human gene. The sequence from 197 bp to 247 bp downstream from the stop codon, surrounding the first polyadenylation signal, is 87% identical. Thus, there is evolutionary conservation of the region containing

After ethanol

mixture was added (1 unit/PI ng per ml of actinomycin

D/4

same primer was used in a sequencing

units per ~1 of Moloney

murine leukemia virus reverse transcriptase),

and

Perron

the extension

were done at 44°C

placental

RNase inhibitor/l reactions

with phenol/chloroform was analyzed

mM dNTPj50

and ethanol

by electrophoresis

either of the reported

cDNAs

for 70 min. After extraction

precipitation,

the extended

in 6% polyacrylamide/‘l

product

M urea gels. The

(Giri et al., 1989a; Sharma

et al., 1985) subclone

region of ADHS as template quencing

reaction

by comparison

et al., 1989). An A 11 bp downstream

reaction

that contains to provide

with the pUC18 (Yanisch-

exon

1 and the 5’ upstream

size markers.

Although

shown was not clear, we were able to determine

with cleaner

sequencing

the sethe tsp

ladders.

from the stop codon in both of the cDNAs

was not found

in ADH5. The G at 491 bp, A at 956 bp, and C at 1060 bp downstream from the stop codon in the cDNA of Sharma et al. (1989) were not found in ADHS. The TA 436 bp downstream from the stop codon in ADHS was CC in Giri et al. (1989a). Methods: the Sst I and Eco RI fragments of the i clones were subcloned

into pUC18

and sequenced

with universal

and reverse primers (BRL). The inserts were excised from the plasmid,

digested

with Sau3A1,

and subcloned into M13mp18 or mp19 (Yanisch-Perron et al., 1985) for sequencing by the dideoxynucleotide chain termination method (Sanger et al., 1977). Sequencing was in both directions, except for the 262 bp at the end of the 3’ untranslated region; these 262 bp match a published cDNA (Sharma et al., 1989). Sequences across the Sau 3AI restriction sites were confirmed either by sequencing the sites. The sizes of the introns were determined from their nt sequence, by restriction mapping, intron

1. These sequences

have been submitted

to GenBank

under accession

with oligo primers or by subcloning or by the polymerase chain reaction

Nos. M8 1112-M8 1118.

fragments that span using oligos flanking

310

Fig. 4. Sequence comparison of this cDNA,

of these are underlined. cDNAs

ofthe 5’ untranslated

using 7-deaza-dGTP.

The critical residues

is doubly underlined;

region ofthe ADHS (FDH) gene and cDNA: cDNA’

In the ADH5 sequence, (at -3,

it is embedded

the two upstream

+ 4) of the Kozak

in a short sequence

(1987) consensus

sequence

= our resequencing

polypeptides

initiated

for each ATG are bold. The additional

at either

ATG in the

that diverges from the gene (underlined).

the first human polyadenylation signal and the only polyadenylation site detected in the mouse Adh-B2 mRNA (Hur et al., 1992). (f) conclusions (I) This report describes

= Giri et al. (1989a); cDNA2

ATGs and the TGA that would terminate

ACKNOWLEDGEMENTS

We thank

Steve Fox for the preparation

of human

liver

RNA, Dr. Celeste J. Brown for the preparation and first screening of one human genomic library, and Dr. David the isolation

and characteriza-

tion of the human ADH5 (FL)H ) gene, the first ADH gene of its class to be isolated. The organization of this gene into nine exons interrupted with introns at the same positions as in the other members of the ADH multigene family firmly places it in the ADH family and suggests its FDH activity evolved from this basis. (2) The 5’ nontranslated region of the cDNA contains two ATG codons preceding the ATG that initiates X-ADH synthesis. Both upstream ATGs are in a poor translation initiation context, whereas the ATG that initiates &khgr;ADH is in a good context. (3) The promoter region is a CpG island with no TATA or CCAAT boxes. We have demonstrated that it is a strong promoter in CV-1 cells.

Goldman for the human X-ADII cDNA. This research was supported in part by grant RO 1 AA06460 from the National Institute on Alcohol Abuse and Alcoholism.

REFERENCES Adinolfi,

A., Adinolfi,

biochemical

X-ADH isozyme. Beisswenger,

M. and Hopkinson,

characterization

Immunological

alcohol

and

dehydrogenase

Ann. Hum. Genet. 48 (1984) I-10.

T.B., Holmquist,

alcohol dehydrogenase and inferences.

D.A.:

of the human

B. and Vallee, B.L.: X-ADH is the sole

isozymes

of rn~lnl~ian

Proc. Nati. Acad.

Bird, A.P.: CpG islands

brains:

implications

Sci. USA 82 (1985) 8369-8373.

and the function

of DNA mcthylation.

Nature

321 (1986) 209-213 Breathnach,

R. and Chambon,

caryotic

P.: Organization

split gene coding for proteins.

and expression

Annu. Rev. Biochem.

of eu-

50 (198 1)

349-383. Carr, L.G. and Edenberg,

Cm

binding

ac-Cm % ac-Cm

gene ADH2*3. Carlock,

H.J.: c&-Acting

and in vitro transcription

sequence

of human

involved

alcohol

J. Biol. Chem. 265 (1990) 1658-1664.

L., Hiroshige,

S., Wasmuth,

J. and Smith, M.: Assignment

0.1

the ADH5 gene coding for class 111 ADH to human

pCAT Basic

0.2

4q21-4q25.

pCAT Control

47.6

Mock

Dafeldecker, Biochem.

pCAT ADH5

40.9

Denhardt,

Fig. 5. ADH.5 promoter fragment

(F&I,

activity in transient blunted)

expression

containing

assays.

Methods:

the 5’ upstream

region of

ADH5 was subcloned into pCAT-Basic (Promega, Madison, WI) at the blunted XbuI site, creating pCAT-ADHS. PIasmid pCAT-Basic does not contain

either a promoter

promoter

and enhancer.

or enhancer; Then

pCAT-Control

contains

15pg of pCAT-ADH5

the SV40

(or the molar equiv-

alent of pCAT-Basic or pCAT-Control) plus 5 pg of pUCl8 and 5 ug of pCH I 10 (lucZ expression vector, Pharmacia) were mixed and transfected into CV-I ceils using the CaCl, transfection procedure. Gelis were harvested after 48 h, and extracts were made by three cycics of freezing and thawing.

An ahquot

tivity (Rosenthal,

of each extract was assayed

1987) and aliquots

representing

for &galactosidase

ac-

equal fi-galactosidase

and ~h~ac~~rization Res. Commun.

D.T.: A membrane-filter DNA.

technique

AMBlS

p-scanner

and given in percentage

was performed

acetylation

of the Cm.

on an

from testis.

for the detection

of com-

23 (1966) 641-

646. a~dehyde-metabolizing cusing. Edenberg,

Alcoholism

isozymes

by electrophores~s

alcohol- and

and isoelectric fo-

(NY) 9 (1985) 263-271.

H.J.: Molecular

biological

approaches

to studies

of alcohol-

metabolizing enzymes. In: Crabbe, J.C. and Harris, R.A. (Eds.), The Genetic Basis of Alcohol and Drug Actions. Plenum, New York, 1991, pp. 165-223. Edenberg,

H.J., Brown,

C.J., Carr, L.G.,

Ho, W.-H.

and Hur, M.-W.:

Alcohol dehydrogenase gene expression, and cloning of the mouse X-like ADH. Adv. Exp. Med. Biol. 284 (1990) 253-262. Ehrig, T., Bosron, W. and Li, T.K.: Alcohol and aldehyde dehydrogenase.

factors. Nucleic Acids Res. 20 (1992) 3-26. Feinberg, A.P. and Vogelstein, B.: A technique

Quantitation

4:

alcohol dehy-

Duley, J.A., Harris, 0. and Holmes, R.S.: Analysis ofhuman

extract were assayed as described was al 37°C for 1.5 h. Acetylated on a silica gel plate.

human of isozymes

Biochem.Biophys.Res.Commun.

Faisst, S. and Meyer, S.: Compilation

matography

of

134 (1986) 1056-1063.

activities were used for the CAT assays. The reaction mixtures containing 0.58 mM acetyl-CoA/75 nCi [ “C]Cm/0.25 M Tris.CI pH 7.6 and cell by Gorman et al. (1985); incubation Cm was analyzed by thin layer chro-

chromosome

Cell Genet. 40 (1985) 598.

isolation Biophys.

plementary a 1.5kb

Cytogenet.

W.P. and Vahee, B.L.: Orgy-speci~c

drogenase:

in protein

dehydrogenase

Alcohol

Alcohol.

25 (1990) 105- 116.

restriction endonuclease fragments them. 137 (1984) 266-267

of vertebrate-encoded

transcription

for radiolabeling

DNA

to high specific activity. Anal. Bio-

311 Gardiner-Garden,

M. and Frommer,

M.: CpG islands

in vertebrate

ge-

Giri, P.R., Krug, J.F., H.N.

Kozak,

and Goldman,

human

C., Moretti,

D.: Cloning

class III (x) alcohol

phys. Res. Commun. Giri, P.R., Linnoila, possible

S.J., Seuanez,

stomach

mapping

of a

characterized.

Biochem.

Bio-

T., O’Brien,

and comparative

dehydrogenase

cDNA.

M., O’Neil, J.B. and Goldman, role of class

III alcohol

in undifferentiated

embryonic

ment of human Cytogenet.

L., Wasmuth,

formaldehyde

Molecular

cloning

J. and Smith, M.: Regional

dehydrogenase

Brown, C.J., Goldman,

of mouse

cleotide sequences

N.: Poly(A)

(FDH)

assign-

DNA

J.T., Jones,

RNA polymerase

alcohol

D. and Edenberg,

H.J.:

cDNA:

nu-

formaldehyde

K.A. and Tjian R.: Promoter-specific II transcription

D.J.: Bovine liver formal-

signals. Cell 64 (1991) 671-674. of regulatory

Methods

elements of cloned genes with

Enzymol.

S. and Coulson,

inhibitors.

by Spl. Trends

Hagemeijer,

dehydrogenase

human

equidistantly

(FDH)

Cell Genet.

R., Holmquist,

Class-III

action

Biochem.

A. and Pearson,

of

Sci. 11

P.L.: Human

and its assignment

to chromo-

152 (1987) 704-720.

A.R.: DNA sequencing

Proc. Natl. Acad.

B., Hempel,

J., Vallee B.L. and Jomvall,

liver alcohol dehydrogenase:

H.:

a novel structural

type

Biochemis-

try 27 (1988) 1132-1140. R., Holmquist,

class III alcohol dehydrogenases,

M., Baumann,

with chain-

Sci. USA. 74 (1977) 5463-

I. Biochemistry

dehydrogenase

28

and class III al-

Lett. 257 (1989) 105-109.

of 5’-noncoding

sequences

S.: Cloning

from a human

gene. Am. J. Hum. Genet.

of a processed

class III alcohol

dehydrogenase

46 (1990) 85-91.

with unique kinetic characteristics.

Biochem. Biophys.

Strydom,

forms

Res. Commun.

Pares, X., Farres, J. and Vallee, B.L.: Organ specific alcohol metabolism: Biochem.

Biophys.

and aldehyde

of specific sequences

by gel electrophoresis. isozymes

Res. Commun.

119 (1984)

dehydrogenases.

among DNA fragments

Anal. Biochem.

alcohol

de-

liquid chromatographic

123 (1982) 422-429.

labeling of DNA by nick translation.

F.M., Brent, R., Kingston,

R.E., Moore,

J.G. and Struhl, K.(Eds),

lecular Biology. Wiley Interscience Uotila L. and Koivusalo,

of human

by high performance

S. and Struhl, K.: Uniform

In: Ausubel,

Biochem.

J. Mol. Biol. 98 (1975) 503-517.

D.J. and Vallee, B.L.: Characterization

hydrogenase Tabor,

alcohol

Current

D.D.,

Protocols

Smith, in Mo-

(1987) 3.5.3-3.5.6.

M.: Formaldehyde

dehydrogenase

from human

liver. J. Biol. Chem. 249 (1974) 7653-7663. M.: Formaldehyde

purification,

dehydrogenase

some properties

and evidence

from human for multiple

forms. Prog. Clin. Biol. Res. 232 (1987) 165-177. von Bahr Lindstrom, Wagner,

H. and Hobg, J.O.: Cloning and char-

F.W., Pares, X., Holmquist,

enzymatic Wickens,

H., Jdrnvall,

of the human ADH4 gene. Gene 103 (1991) 269-274. properties

B. and Vallee, B.L.: Physical

of a class III isozyme

X-ADH. Biochemistry

M.: In the beginning during

of human

and

liver alcohol

23 (1984) 2193-2199.

is the end: regulation

early development.

of poly(A) addition

Trends

Biochem.

Sci. 15

(1990) 320-324. Yanisch-Perron,

C., Vieira, J. and Messing,

J.: Improved

cloning vectors and host strains: nucleotide

98 (1981) 122-130. rc-ADH.

H. and Vallee, B.L.:

15 (1986) 249-290.

E.M.: Detection

and removal

and sequencing

Pares, X. and Vallee, B.L.: New human liver alcohol dehydrogenase

1047-1055.

Southern,

dehydrogenase:

from 699 vertebrate

Nucleic Acids Res. 15 (1987) 8125-8148.

Y. and Yokoyama, derived

an enzyme less var-

of class

B., Jdrnvall,

164 (1989) 631-637.

of human

Adv. Hum. Genet.

acterization

FEBS

M.: An analysis RNAs.

liver enzyme

formaldehyde

cohol dehydrogenase.

placental

Smith, M.: Genetics

erythrocytes:

H.: Characteristics

M. and Uotila, L.: Evidence for the identity of

glutathione-dependent

pseudogene

Res. Commun.

Uotila, L. and Koivusalo,

B., Vallee B.L. and Jornvall,

iable than the traditional (1989) 8432-8438.

messenger

C.P., Fox, E.A., Holmquist,

J.A., Seidman,

38 (1984) 112-115.

related to the class I and class II enzymes.

of mammalian

Sharma,

peptide mapping.

some 4. Cytogenet.

Matsuo,

terminating

separated

(1992) in press.

(1986) 20-23.

Kozak,

assays.

Sanger, F., Nicklen,

Biophys.

dehydrogenase-B,

Sequence

Kahn, P.M., Wijnen, L.M.M.,

Koivusalo,

from those already

cDNA sequence of human class III alcohol dehydrogenase.

to 4q21-4q25.

of the class III ADH genes evolve slowly even for

silent substitutions.

Kaiser,

J.: Class of the rat

J. Biol. Chem. 264 (1989) 17384-17388.

N.: Identification

Cell Genet. 40 (1985) 651-652.

Hur, M.-W., Ho, W.-H.,

Kaiser,

T., Shih, M.J. and Creighton,

functional

data

5467.

S., Carlock,

Kadonaga,

Pourmotabbed,

Rosenthal,

stem cells. Cell 42 (1985)

Structural

FEBS Lett. 277 (1990) 115-l 18.

Proudfoot,

of viral

E., Hoog, J.O. and Jornvall,

dehydrogenase.

enzyme reveal a new class well separated

and

5 19-526. Hiroshige,

alcohol

in the

D.: Distribution dehydrogenase

human brain. Brain Res. 481 (1989b) 131-141. Gorman, C.M., Rigby, P.W. and Lane, D.P.: Negative regulation enhancers

A., Cederlund,

dehyde dehydrogenase.

164 (1989a) 453-460.

metabolic

Pares, X., Moreno, IV mammalian

nomes. J. Mol. Biol. 196 (1987) 261-282.

and pUC19 Yasunami,

vectors.

Genes

Ml3

phage

sequence ofthe M13mp18

33 (1985) 103-119.

M., Chen, C.S. and Yoshida,

A.: A human

genase gene (ADH6) encoding an additional Natl. Acad. Sci. USA 88 (1991) 7610-7614.

alcohol dehydro-

class of isozymk.

Proc.

Cloning and characterization of the ADH5 gene encoding human alcohol dehydrogenase 5, formaldehyde dehydrogenase.

Human chi-alcohol dehydrogenase (chi-ADH) is a zinc-containing dimeric enzyme responsible for the oxidation of long-chain alcohols and omega-hydroxyfa...
957KB Sizes 0 Downloads 0 Views