_ TA~ ~ CGAT OXCAGT

Nucleic Acids Research, Vol. 18, No. 10 3073

.) 1990 Oxford University Press

Nucleotide sequence of a 3.46 kb region of maize chloroplast DNA containing the gene cluster rpoC2-rps2-atpl-atpH

Dietmar Stahl, Steven R.Rodermell, Alap R.Subramanian* and Lawrence Bogorad1 Max-Planck-Institut fOr Molekulare Genetik, Abteilung Wittmann, IhnestraBe 73, D-1000 Berlin 33, FRG and 1Department of Cellular and Developmental Biology, Harvard University, Cambridge, MA 02138, USA EMBL accession no. X52270

Submitted April 18, 1990

We report the nucleotide sequence of 3463 base pairs in the large single copy region of Zea mays plastid DNA at map position 74.6-78.0 (1). It encodes the C-terminal part of the ,B" subunit of plastid RNA polymerase (see ref. 2), the ribosomal protein S2, the subunit IV and N-terminal part of subunit mII of chloroplast ATP synthase. The continuation of the last sequence is given in ref. 3. This whole region lies in the 22 kb inversion in monocots (4). The protein sequence of S2 is 97 % identical to that of three other cereal S2's (wheat, rice and rye), 79% to tobacco, 66% to that of a liverwort and 37-39% to that of three eubacteria including Spirulina platensis (a cyanobacterium). For ATP synthase subunit IV the corresponding values are 99 (rice), 93 (tobacco), 82 (liverwort) and 25 (E. coli) % respectively. Figure 1 includes the recently reported sequence of rps2 (5) with two z

I L G V P W G F L I G A E L AAATATMATAGAGA R H I E I I I R Q V T S K V GGCGGGCI=AGATGAAllCl R A L D E S I Y Y R A I L L c _ t TICC~~~rAAAMTGGE K A A L R G R I D W L K G L

nucleotide differences (positions 657 and 1743) and contains 10 A (instead of 9 A as in ref. 5) at position 908-917. The last difference alters the reading frame and introduces three termination codons, thereby eliminating the suggested presequence of S2 (5). The transcription pattern and transcription initiation of this gene cluster have been determined (Stahl, D. et al., ms in preparation).

REFERENCES 1. 2. 3. 4. 5.

Larrinua et al. (1983) Plant Mol. Biol. 2, 129-140. Hu,J. and Bogorad,L. (1990) Proc. Natl. Acad. Sci. USA 87, 1531-1535. Rodermel,S.R. and Bogorad,L. (1987) Genetics 116, 127-139. Hiratsuka,J. et al. (1989) Mol. Gen. Genet. 217, 185-194. Igloi et al. (1990) Nucl. Acids Res. 18, 663.

I A Q S R I S L V N K I Q K V Y R S Q G V Q I H N ATAGGACiA R V S EDG M S N V F L P G E L I G L L R A E R A G A G C M A~~~~~~~~~~~~~7MGTTTA G I T R A S L N T Q S F I S E A S F Q E T A R V L A TTCGlCCCACAAG K E N V V L G G I I P V G T G F Q K F V H R S P Q D ~~~~~~~~~~~~~~~~~TCCAACAGAGTTAGTTT K N L Y L E I Q K K N L F A S E M R D I L F L H T E L V S S D S D V T N N F Y E ATT _ATTTCG AAACATCAGAGACCCCGT7ACC T S E T P F T P I Y T I* T

iA -

CAAGCrATTTCGCTTTCTTAATCTTCAAAAATGAAA7ATTC

rs2 _AA_AGATr=AcA_AT T R R Y W N I N L

S A K R K G T H I T N L A R T GGTTGACTAICMA V G T K K R A A D L V A S A A I R

K E M I E A G V H F G H G I K K W N P K M A P Y I

CAGC7TCTTTTTATCGAl&TATA A R F L S E A C D L V F D A A S Q G K S F L I TCATTATGTTAATAAAAAGTGGTT CGTCTCGGTT S R C H Y V N K K W F S G M L T N W S

I

T K T R L S Q F R D L R A E E K M G K F

TCCACCATCTCCCAAAAAGAGAT:CGGCAATCTTAAAGAGAAAATTATCTACCTTI=CAAAGATATCTTAT L L S T L R Y L G G I K Y M T R L P D I V I V L D Q

K R K Q I1 H L P K R D A A I GAATGTGCCAITrlGGAAATATCGATCCTGCCAACGATGACA SalAAAAGTATATA=UlClFCAl Q K E Y I A L Q E C A I L G I P T I S L V D T N C D P D L A N I S I P A N D D T

M T S T

I

R L

I

L N K L V F A

AA&TMTCAIIm

I

S E G R S

ATAGAAA

ATAGATAATCTTAA7NCSAGTGAAAl::GAl:ClCMCAAAAG *

To whom correspondence should be addressed

L Y I

R N R

IGTATATTAAAT PCST MNIT MT-T A A A T CAAT Md N I T P C S I K

120 240

360 480 600 720 840 960 1080

1200 1320 1440

1560 1680

1800 1920

3074 Nucleic Acids Research, Vol. 18, No. 10

GAs

ACACTCTATT'rrATAIrATATAI '.',TMrrACrArr

T L K G L Y D I

S G V E V G Q H F Y W Q I G G F Q

I H A Q V L I T S W V V

I

T

I

2160

L L G S V I I A V R N P Q T I P T D G Q N F F E Y V L E F I R D L S K T Q I G E AMATTATAGASTTACCCCAIWCGMlTACCA=S G A 1 W E Y G P W v P F I G T M F L F I F V S N W S G A L L P W K I I E L P H G E L A A

P T N D I N T T V A L A L L T S A A Y F Y A G L S K K G L S Y F E K Y I K P T P I L L P I N I L E D F T K P L S L S F R L F G N I L A D E L V V V V L V S L V P L V V P I P V M F L G L F T S G I Q A L I F A T L A A A Y I G E S M E G H H

TTCATTATCrATCC TCAAT CCrATTGGAT7=ATTAllTC A A A lU l AaGGGCOGAAGAGTCA KI A _CTIWTTGATSTATCCTTMCCA TGATmTATl TTsrACAccATTGMrTCl-TACACGCGAcACAT rACCA M N P

2040

2280 2400 2520

2640

2760 2880 3000 3120 3240 3360

3463

Figure 1. Nucleotide sequence of a 3.46 kb region of maize chloroplast DNA encoding completely ribosomal protein S2 and a subunit of ATP synthase (see Text). Potential ribosome binding sites are underlined. The data for homology comparison were taken from EMBL data base; rye S2 data is unpublished by A.Prombona and A.R.Subramanian.

Nucleotide sequence of a 3.46 kb region of maize chloroplast DNA containing the gene cluster rpoC2-rps2-atpI-atpH.

_ TA~ ~ CGAT OXCAGT Nucleic Acids Research, Vol. 18, No. 10 3073 .) 1990 Oxford University Press Nucleotide sequence of a 3.46 kb region of maize c...
268KB Sizes 0 Downloads 0 Views