Gerze. 98 (1991) 141-145

141

Elsevier

GENE

03887

Nucleotide sequence of the LYS2 gene of Saccharomyces tyrocidine synthetase 1 (Recombinant

DNA;

Mary E. Morris* Department

lysine;

yeast;

a-aminoadipate

reductase;

tyrocidine;

cerevisiae:

antibiotic

homology

to Bacillus

brevis

synthesis)

and Sue Jinks-Robertson

of‘ Biology, Emory University. Atlanta, GA 30322 (U.S.A.)

Received by J.A. Gorman: 10 July 1990 Revised: 5 October 1990 Accepted: 10 October 1990

SUMMARY

The Saccharomwes cerevisiue L YS2 gene, which encodes a-aminoadipate reductase, an essential enzyme in the yeast lysine biosynthetic pathway, has been sequenced. A large open reading frame (ORF) has been identified which can specify a 1392-amino acid protein with a deduced M, of 155 344. A DNA database search using the translated L YS2 ORF as a probe has revealed significant aa sequence homology to the Bacillus brevis enzyme tyrocidine synthetase 1.

INTRODUCTION

Two different pathways for lysine biosynthesis have been described : the diaminopimelic acid pathway in bacteria and some lower eukaryotes and the a-aminoadipate pathway exclusively in lower eukaryotes (Umbarger, 1978). In the yeast Swchurotn~ces cerevisiue, lysine is synthesized via the z-aminoadipate pathway from a-ketoglutarate and acetyl CoA in eight enzyme-catalyzed steps; the first four steps occur in the mitochondrion and the remainder in the cytoplasm (reviewed by Bhattacharjee, 1985). To date, twelve complementation groups defining loci involved in lysine

Correspondence to: Dr. S. Jinks-Robertson, University,

1510 Clifton Rd., Atlanta,

Dept.

of Biology,

Emory

GA 30322 (U.S.A.)

Tel. (404) 727-63 12; Fax (404) 727-2880. * Present

address:

200 Longwood

Program

in Neuroscience,

Ave., Boston,

Harvard

Medical

School,

MA 02115 (U.S.A.)

Abbreviations: aa. amino acid(s); zAR, I-aminoadipate reductase; bp, base pair(s); kb, kilobase or 1000 bp; LYS2, gene encoding zAR; nt, nucleotide(s);

oligo,

oligodeoxyribonucleotide;

frame;

S., Saccharomyces; TY 1, tyrocidine

coding

TYl;

037X-I

ORF, synthetase

open 1;

reading

tyd, gene en-

wt, wild type.

I IY Yl:SO3

50 0

1991

Elsevier

Sc~rnce

Publishers

B.V.

(Biomedical

Diwkn)

biosynthesis have been identified genetically in yeast: LYSl, LYS2, LYSI-LYSlO and LYSlI-LYS16. The yeast LYS2 gene encodes the enzyme a-aminoadipate reductase (EAR) which, together with the product of the LYSS gene, converts the intermediate a-aminoadipate to x-aminoadipic-6-semialdehyde in a three-step reaction. In the first step, r-aminoadipate is activated by ATPdependent adenylation, a unique step among aa biosynthetic pathways. In the second step, the activated substrate is reduced by NADPH. The final step is the cleavage of the adenyl group from the reduced substrate to yield the product cc-aminoadipate-&semialdehyde. Available evidence indicates that the first two steps are catalyzed by the L YS2 gene product, while the LYS5 gene product catalyzes the final step (Sinha and Bhattacharjee, 1971). It has been suggested that L YS2 and L YS5 encode different subunits of the active 180-kDa xAR, which has been partially purified (Storts and Bhattacharjee, 1989). Not only is the multifunctional L YS2 gene product interesting from a biochemical perspective, but the LYS2 gene itself has also been exploited for genetic studies. Since the LYS2 gene encodes an essential enzyme in the lysine biosynthetic pathway, yeast strains with mutations in this gene require lysine for growth on synthetic media. Rever-

142 tants or strains which have acquired a wt gene by transformation can be selected by simply plating cells on synthetic medium lacking lysine. It is possible to select not only for the acquisition of a LYS2 gene by transformation or by

portion of the LYSZ gene using the shotgun procedure of Bankier and Barrel1 (1983). The complete nt sequence of the LYS'2region is presented in Fig. 2. The GenBank ac-

reversion of mutant lys2 strains, but it is also possible to directly select for lys2 auxotrophs arising as a result of spontaneous loss of LYS2-encoding plasmids or as a result of forward mutation of the wt gene. The ability to directly select &s2 mutants results from the capacity of S. cerevisiue strains lacking zAR activity to utilize z-aminoadipate as a sole nitrogen source (Chattoo et al., 1979). The only other yeast gene for which there is a comparable way to directly select for and against wt gene function is the URA3 gene

M36287.

(Boeke et al., 1984). The L YS2 gene has been cloned by complementation in a !r:s2 mutant yeast strain and has been used as a selectable marker in the construction of both replicating and integrative yeast transformation vectors (Eibel and Philippsen, 1983; Barnes and Thorner, 1986). The locus produces a 4.2-kb transcript, and the 5’ and 3’ ends ofthe mRNA have been mapped by primer extension and S 1 nuclease analysis, respectively (Eibel and Philippsen, 1983 ; Fleig et al., 1986). In addition, Fleig et al. (1986) sequenced approx. 1 kb of the 5 ’ end and 1.5 kb of the 3’ end of the gene and identified the putative N and C ends of an encoded ORF. Their analysis, however, left an approx. 3-kb gap in the LYS2 sequence. In this paper we report the complete sequence of the yeast LYS2 gene and the results of a DNA database search for proteins homologous to TAR.

EXPERIMENTAL

AND

DISCUSSION

(a) Sequencing of the LYS2 gene A map of the L YS2 region is presented in Fig. 1, with the 5’ and 3’ regions previously sequenced by Fleig et al. (1986) indicated. We sequenced both strands of the internal

t

cession

number

for the sequence

reported

in this paper is

(b) Predicted protein product Computer analyses of the L YS2 region were done using the program Genepro (Riverside Scientific Enterprises, Seattle, WA). The TACTAAC sequence characteristic of yeast introns (Langford et al., 1984) is not present in this region and thus, like most yeast genes, the LY.SZ gene appears to contain no introns. Translation of the nt sequence in all possible reading frames yielded a large ORF in only one of the frames (data not shown); the 5’ and 3’ ends of this ORF are identical to those identified previously by Fleig et al. (1986). The ORF specifies a 1392-aa protein with a predicted M, of 155 344; the aa composition of the predicted protein is similar to that of other eukaryotic proteins (data not shown). The codon usage of the large ORF was analyzed by comparing it to that compiled for 59 highly or 99 lowly expressed yeast genes (Pesole et al., 1988). The presumptive L YSZ-encoded ORF follows the codon usage pattern for lowly expressed yeast genes, which is in agreement with experimentally determined mRNA levels (Barnes and Thorner, 1986). The translated protein was examined for evidence of internal duplications using dot matrix analysis and none were evident (data not shown). The LYS2encoded protein would be expected to have an nt-binding domain since it activates its substrate by adenylation. A search of the sequence revealed no striking homology to the consensus sequence for kinase-type nt-binding sites (G/AXXXXGKS/T; Doolittle, 1986) or for the ATPbinding site of various aminoacyl tRNA synthetases (e.g., HIGH ; Schimmel, 1987). A region containing four leucine residues, each separated by 6 aa (residues 143, 150,157 and 164) may be important in heterodimer formation with the

i 1 kb

I

I

Fig. 1. Restriction map of the L YS2 region. The hatched box corresponds to the L YS2 ORF and the single line to surrounding chromosomal sequences. The blackened and open boxes below the map correspond to the regions sequenced by Fleig et al. (1986) and in this work, respectively. Only restriction sites for selected

6-bp cutters

are shown;

the positions

of the ones outside

of the region sequenced

in this laboratory

are from Fleig et al. (1986).

sequence

fragments

sheared

approx.

Microchemical nt sequence.

the corresponding

Facility)

were filled in by sequencing

of Staden

using the

The aa sequence

of the large ORF is below

oligos

(1986). Gaps in pDP4 (Fleig et al., 1986) using synthetic as primers.

plasmid

pDP6 (Fleig

analysis

X&z1 site

gel and inserting

from plasmid

clones were sequenced program

Random and aligned using the computer (Emory University

fragment

clones for sequence

from the upstream

1 kb in length on an agarose

the 4-kb BglII-PvuIl fragments

shearing

into the SmaI site of M13mplO. of Sanger et al. (t977) either strand

method

end-repaired

of the yeast LYS2 region. The sequence Hind111 site is given (see Fig. 1). Random by ultrasonically

downstream et al., 1986), size-selecting

were generated

to the second

Fig. 2. Nucleotide

z icr

144

* :=, . :=>

match across conservative

GWCDFVGCIHDIFQDNAEAFPERTCWETPTLNSDKSRSFTY~INRTSNIVAHYLIKTG VPYAQGKSIHQLFEEQAEAFPDRVAIVFE-------NRRLSYQELNRKLLEKG **..*...*****.* _* * . . *..** *.*.*

231-337 21-120

LYS2 tycA

all seqs. substitutions

IKRGDVVMIYSSRGVDLMVCVMGVLKAGATFSVIDPAYPPARQTIYL VQTDSIVGVMMEKSIENVlAILAVLKAGGAYVPIDIEYPRDRIQYIL , . . _. .. ***** . ** ** . . ** .

TGDLGRYLPNGDCECCGRADDQVKIRGFRIELGEIDTHISQHPLVRENIT TGDLAKWLTDGTIEFLGRIDHQVKIRGHRIELGEIESVLLAHEHITEAW **** _ * ** * ** * ****** *******.. . *

LYS2 tycA

*

GEIYVRAGGLILEGYRGLPELNKEKFVNNWFVEKDHWNYLDKDNGEPWRQFWLGP~RLYR GELCIGGVGLARGYWNRPDLTAEKFVDNPFV-----------------------PGEKMYR **. _ *** **. *.* ****.A xx

620-729 361-448

LYS2 tycA

*

*

* . .**

. * .

RTFLKKRLASYAMPSLIVVMDKLPLNPNGKVDKPKLQFPTPKQLNLVAENTVSETDDSQF RDYAAQKLPAYMLPSYFVKLDKMPLTPNDKIDRKAL--PEP---DLT~QSQ~YHPPR.* * .** * .**.** ** *.*. * * * .** *.

786-910 476-591

TNVEREVRDLWLSILPTKPASVSPDDSFFDLGGHSILATKMIFTLKKKLQVDLPLG'~IFK TETESILVSIWQNVLGIEKIGIR--DNFYSLGGDSIQAIQWARLHS-YQLKLETKDLLN * *. *** ** * ._ * *. * * * . .* .*

..

YPTIK YPTIE **** Fig. 3. Amino acid sequence and Sharp, positive

integers

in the original identity;

alignment

1989). Conservative between

0 and 25. Ser/Thr

DNA database

dots, conservative

of portions

aa replacements search

of the LYS2 and Iq’cA-encoded proteins. are those with a score greater

and Glu;Asp

(LOD > 104) correspond

was done using the program

to be conservative

to aa 303-332,618-647.687-716

matrix of Dayhoff

substitutions.

Four stretches

and 792-821

ofthe

CLUSTAL

(Higgins

(1978) resealed of aa similarity

LYSZ-encoded

protein.

to give detected

Asterisks,

substitution

gene product if xAR has the subunit gested by Storts and Bhattacharjee (1989).

f.YS.5

are also considered

Alignment

than 10 using the aa similarity

structure

sug-

(c) Similarity of the LYS2 ORF to tyrocidine synthetase 1 The GenBank DNA database (release 62) was searched using the translated LYS2 sequence as a probe to identify proteins with aa similarities (Henikoff and Wallace, 1988). In this search the probe was compared to each possible reading frame of every sequence in the database by aligning stretches of 30 aa. Ofthe similarities detected, the only ones with log-odds (LOD) scores greater than 104 were obtained in comparisons with the tyrocidine synthetase l-encoding gene (t~a.4) from Bacillus hrevis (Weckerman et al., 1988). The similarity at the aa level was clearly evident as a diagonal extending over approx. one-half of each protein in a standard dot matrix analysis (data not shown). A partial sequence alignmfient of the two proteins beginning at aa 23 1 of rAR and aa 21 of TYl is presented in Fig. 3. The two proteins are 28 y, identical over approx. 100 aa and then the sequences diverge over the next 300 aa. The similarity is clearly evident again beginning at aa 620 and 361 of the L Y$2- and tJ]cA-encoded proteins, respectively. There is 40”,, identity over a stretch of approx. 100 aa followed by short region of no similarity (57 aa in the LYS2-encoded protein and 28 aa in the t_vcA-encoded protein) and then another 100-aa stretch of 3 17, identity.

The obvious implication of the striking aa similarities between the yeast enzyme rAR and the bacterial enzyme TY 1 is that there is functional homology between the two seemingly unrelated proteins. TYl is one of three multifunctional enzymes involved in the synthesis of the cyclic decapeptide antibiotic tyrocidine (reviewed by Katz and Demain, 1977). It corresponds to the so-called light enzyme which initiates the nonribosomal synthesis of tyrocidine by activating, racemizing and becoming covalently linked to phenylalanine, the first aa in the peptide. This initiation step is somewhat analogous to the charging of tRNAs with their respective aa. As mentioned. srAR activates its substrate x-aminoadipate by adenylation and then reduces it; there is no evidence that the substrate becomes covalently attached to the enzyme. Given what is known about the two proteins, a conservative guess concerning possible functional homology is that the regions with aa similarity are important for substrate adenylation, although we note that the substrates are chemically quite different. Sitedirected mutagenesis of the homologous regions would be the first step in assessing the possible functional significance of these regions. In the case of the LYS2-encoded protein, the ability to directly select against wt gene function in vivo can be used in conjunction with the yeast transformation system to substitute defined mutations constructed in vitro for the wt gene. In addition, the negative selection

145 system for L YS2 allows one to readily isolate spontaneous loss-of-function mutations, thus making this gene ideal for structure-function studies.

Eibel, H. and Philippsen,

P.: Identification

LYS2 gene by an integrative Genet.

of the cloned

transformation

S. cerevisine

approach.

Mol. Gen.

191 (1983) 66-73.

Fleig, U.N., Pridmore, cartridges

R.D. and Philippsen,

for use in genetic

P.: Construction

of LYS2

of Saccharom.vces

manipulations

cerevi-

siae. Gene 46 (1986) 237-245.

Henikoff,

ACKNOWLEDGEMENTS

S. and Wallace,

nucleotide

We thank U. Fleig for generously providing the cloned LYS2 gene, Guy Benian for invaluable help with the Staden program and D.F. Steele for helpful comments on the manuscript. This work was supported by National Institutes of Health grant GM38464 and Emory University Research Committee funds to S.J.-R.

sequence

D.G. and Sharp,

alignments

on

A.T. and Barrel],

B.C.: Shotgun

Shannon, Barnes,

DNA sequencing.

in the Life Sciences,

In: Flavell,

Vol. B508. Elsevier,

1983, pp. l-34.

D.A. and Thorner,

cerevisiae

J.: Genetic

of Saccharomyces

manipulation gene.

Mol.

Cell.

Biol.

Bhattacharjee,

J.K.: cc-Aminoadipate

lysine

eukaryotes.

pathway

CRC

Crit.

for the biosynthesis

Rev. Microbial.

of

12 (1985)

131-151.

biogenesis

F. and Fink, G.R.: A positive

tants lacking orotidine-5’-phosphate

decarboxylase

selection

for mu-

activity in yeast:

5-fluoro-erotic acid resistance. Mol. Gen. Genet. 197 (1984) 345-346. Chattoo, F.F., Sherman, F., Azubalis, D.A., Fjellstedt,T.A., Mehnert, D. of lys2 mutants

cerevisiae by the utilization

Protein

distant Sequence

cal Research Doolittle, Derived

change in proteins.

93 (1979) Matrices

relationships.

In: Dayhoff,

Vol. 5, Suppl. 3. National

R.F.: Of URFS Acid

Valley. CA, 1986.

Genetics

and Structure,

Foundation,

Amino

of the yeast Sacchammyces

of r-aminoadipate.

5 l-65. Dayhoff, M.O.: A model of evolutionary detecting

and sensitive CABIOS

Langford,

and possible functions.

C.J., Klinz, F.-J., Donath,

identify the conserved,

multiple

sequence

Commun.

5 (1989)

Sanger,

G., Attimonelli,

F., Nicklen,

terminating

Bacterial.

ofBacillus:

C. and Gallwitz,

intron-contained

chemistry,

Rev. 41 (1977) 449-474. D.: Point mutations

TACTAAC

box as an essen-

signal in yeast. Cell 36 (1984) 645-653. M. and Liuni, usage

strategy.

S. and Coulson,

inhibitors.

Proc.

S.: A backtranslation Nucleic

Acids

method

Res.

16 (1988)

A.R.: DNA sequencing Natl.

Acad.

Sci.

with chain-

USA

74 (1977)

P.: Aminoacyl

Washington, and ORFS:

Sequences.

M.O. (Ed.),

Atlas

for of

Biomedi-

DC, 1978, pp. 345-360.

A Primer University

on How to Analyze Science

Books,

Mid

tRNA

synthetases:

general

scheme

of struc-

ture-function relationships in the polypeptides and recognition tRNAs. Annu. Rev. Biochem. 56 (1987) 125-158. Sinha.

A.K. and Bhattacharjee,

m>‘ces: conversion

hyde. Biochem.

and Ogur, M.: Selection

using

16 (1988)

151-153.

Schimmel,

6 (1986)

2828-2838.

Boeke, J.D., LaCroute,

Res.

5463-5467.

by use of the LYSZ

in lower

P.M.: Fast

a microcomputer.

based on codon 1715-1728.

R.A. (Ed.), Techniques

similarities

Acids

Katz, E. and Demain, A.L.: The peptide antibiotics

Pesole,

Bankier,

of protein

Nucleic

6191-6204. Higgins,

tial splicing REFERENCES

J.C.: Detection databases.

in Saccharo-

J.K.: Lysine biosynthesis

of r-aminoadipate

into z-aminoadipic

&semialde-

J. 125 (1971) 743-749.

Staden, R.: The current status and portability of our sequence software. Nucleic Acids Res. 14 (1986) 217-231. Storts,

D.R. and Bhattacharjee,

/j,s5 mutants

J.K.: Properties

cerevisiue. Biochem.

Umbarger, H.E.: Amino acid biosynthesis Biochem. 47 (1978) 533-606. sequence

ofrevertants

as well as z-aminoadipate-semialdehyde

from Srrccharomyces (1989) 182-186.

Weckerman,

of

R., Furbab,

R. and Marahiel,

handling of!,:72 and

dehydrogenase

Biophys. Res. Commun. and its regulation. M.A.: Complete

16 I

Annu. Rev. nucleotide

of the fycA gene coding the tyrocidine synthetase Bocillus brevis. Nucleic Acids Res. 16 (1988) 1 1841.

1 from

Nucleotide sequence of the LYS2 gene of Saccharomyces cerevisiae: homology to Bacillus brevis tyrocidine synthetase 1.

The Saccharomyces cerevisiae LYS2 gene, which encodes alpha-aminoadipate reductase, an essential enzyme in the yeast lysine biosynthetic pathway, has ...
584KB Sizes 0 Downloads 0 Views