655

Nucleic Acids Research, Vol. 18, No. 3

Nucleotide sequence of an A-type legumin gene from pea William G.Rerie1l 2, Malcolm l.Whitecross2 and T.J.V.Higgins' 1CSIRO Division of Plant Industry, PO Box 1600, Canberra, ACT 2601 and 2The Australian National University, PO Box 4, Canberra, ACT 2601, Australia Submitted January 9, 1990

EMBL accession no. X17193

underlined and include the TATA (-26 to -32) and the 'legumin' (-100 to - 130) boxes in the 5' region and overlapping polyadenylation signals (+ 1969, + 1973) in the 3' region. The proposed signal peptide is delineated by bold-face type preceding the processing site (I I). The post-translational processing site between Asn and Gly residues is also indicated.

The A-subfamily of legumin genes encodes the predominant I ISclass of seed storage proteins in pea (Pisum sativwn L.). We have determined the complete nucleotide sequence of one such gene (LegA2) located on a 4.2 kb EcoR1 fragment and encoding a mature subunit with a deduced molecular weight of 56,929 Daltons. The coding region spans 1,825 bp (including three short introns) and exhibits a high degree of identity to LegA (1), but diverges from this gene in both the immediate 5'- and 3'-flanking regions. Nucleotide positions are numbered relative to the proposed cap-site (+ 1). Potential regulatory sequences are

-SOO -400 -300

-200 -100 1

REFERENCE 1. Lycett,G. et al. (1984) Nucl. Acids Res. 11, 4493-4506.

ATATGACTATATTGTAAATTGTAATTCAAGCACTTTAATTTGAAGTTGTTATCGTCCACCTATATTCAACTACACAAAATTGTGCTTTACCATTAAACTT TAAAAATTTGTACAGACCATGAACTAAATCCTATCCAACATACAATATACAGACATTAACTAGCTTGAAAGTGAATCAGGTTCATATATCTAGATATTAC AAGACAGTAATGATCAAACTCACGTACATATGTAAAAAGAGAAATCAATTATATACTATACATGGTCCCCAACACCACCGATTTCAGCTAGCTATCTAAT TACTCAACTCTCACTTGAAGCCACCTCTGCTGTAATGAGACATTTGATTTTATGAGGTGTAACACACAAGGCTTCCATAACCATGCAAGATGAAGAATGT CGAATGGTCAGCAACTCATGTATCTTCTTGGAGCCGATGTGTCCCTCCTATA~CTTTCCTATGTTTCACTATAAATCCCTATGCCAGATTAAGGTTCTTCG N A T X L L A L 8 L S F C F L L L G G c F A1 R E

CGTCACAAACATATATTCTATCCAACTATGGCTACTAAGCTTCTTGCACTTTCTCTTTCATTCTGTTTTCTACTTTTGGGGGGCTGTTTTG'CTTTGAGAG

101

QPE Q N E C Q L E R L N A L E P D N R I E S E G G L I E T W N P AACAGPCCAGAGCAAAATGAGTGCCAGCTAGAACGCCTCAATGCCCTCGAGCCTGATAACCGTATAGAATCGGAAGGTGGGCTCATTGAGACCTGGAATCC

201

N N K O F R C A G V A L S R A T L O H N A L R R P Y Y S N A P Q E CAACAACAAGCAATTCCGATGTGCTGGTGTGGCCCTCTCTCGTGCTACCCTTCAACATAACGCCCTTCGCAGACCTTACTACTCCAATGCTCCCCAAGAA

301

ATTTTCATCCAACAAGGTTACTTATTTTGATCTTATACCAACTTCTTTACTTACATTACATGCATATTAGCATACTAATTAGTGTTCTACTATACCAATT

401

ACAGGTAATGGATATTTTGGGATGGTATTCCCCGGTTGTCCTGAGACCTTTGAAGAGCCACAAGAATCTGAACAAGGAGAGGGACGCAGGTACAGAGACA

I

Q

Q

G

N

G

Y

F

F

I

G

F

V

M

P

G

C

E

P

T

F

E

E

P

E

Q

E

S

E

G

Q

G

R

R

R

Y

D

R

601

H Q K V N R F R E G D I I A V P T G I V F W M Y N D Q D T P V I A GACATCAAAAGGTTAACCGATTCAGAGAGGGTGATATCATTGCAGTTCCTACTGGTATTGTATTTTGGATGTACAACGACCAAGACACTCCAGTTATTGC V S L T D I R S S N N Q L D Q M P R CGTCTCTCTTACTGACATTAGAAGCTCCAATAACCAGCTTGATCAGATGCCTAGGGTGAGCTACTGAGCATAATTAAACTTCCCATATAAGATAATATGT

701

TGTCCAAAACAGTAACATAGATTCTATCTATCTATGTTTGACAGAGATTCTATCTTGCTGGGAACCACGAGCAAGAGTTTCTACGATACCAGCAT8AA8A

801

AGGAGGAAAGCAAGAACAAGAAAATGAAGGCAACAACATTTTCAGTGGCTTCAAGAGGGATTTCTTGGAAGATGCTTTCAACGTGAACAGGCATATAGTA

901

AG AA GACAGACTTCAGGCAGGAATGAAGACGAAGAGAAGGGAGCCATTGTCAAAGTGAAAGGTGGACTCAGCATCATAAGCCCACCCGAGAAGCAGCGCGCC

1001

ACCAGAGAGGCAGCAGACAAGAGGAAGATGAAGATGAAGATGAAGAGAGGCAGCCGCGCCACCAGAGAGGCAGCAGACAAGAGGAAGAGGAAGATGAAGA

501

1101

G

D

K

G

R

L

OR

E

Q

R

Q

R

S

G

E

Q

N

E

G

N

E

D

E

E

Q

E

N

D

N

K

E

E

D

F

I

A

G

E

S

D

I

E

F

G

K

V

E

K

R

R

V

R

P

Q

G

K

L

F

D

G

H

E

L

S

R

Q

A

D

I

G

S

I

V

P

N

P

E

Q

R

S

N

F

R

E

K

E

E

I

H

V

R

Q

D

E

E

H

D

TGAAGAGAGGCAGCCGCGTCATCAAAGGAGAAGAGGAGAGGAGGAAGAAGAAGACAAGAAAGAGCGCCGCGGCAGCCAAAAAGGCAAAAGCAGAAGGCAA IGL

D

G

T

E

E

V

A

T

C

L

K

N

L

R

I

G

P

S

S

P

S

D

I

Y

N

P

E

G

A

R

I

1201

GGAGACAATGGGCTTGAGGAAACAGTTTGCACTGCTAAACTTCGATTGAACATTGGCCCGTCTTCATCACCAGACATCTACAACCCTGAAGCTGGTAGAA

1301

K T V T S L D L P V L R W L K L S A E H G S L H K TCAAAACTGTTACCAGCCTGGACCTCCCAGTTCTCAGGTGGCTCAAACTAAGTGCTGAGCATGGATCTCTCCACAAAGTATGTTTTTTCATATTTTAATT

1401

TGTTTTTCCATGAATCAATTTCATGTCGAACTATGTGCTAACTCATTACAATCTTCATACAGAATGCTATGTTTGTGCCTCACTACAACCTGAATGCAAA

1501

S I I Y A L K G R A R L O V V N C N G N T V F D G E L E A G R A L CAGTATAATATACGCATTGAAGGGACGTGCAAGGCTA AAGTAGTGAACTGCAATGGCAACACCGTGTTTGATGGAGAGCTAGAAGCCGGACGTGCATTG

1601

ACAGTGCCACAAAACTATGCTGTGGCTGCAAAGTCACTAAGCGACAGGTTCTCATATGTAGCATTCAAGACCAATGATAGAGCTGGTATTGCAAGACTTG

1701

CAGGGACATCATCAGTTATAAATAATCTGCCGTTGGATGTTGTGGCAGCTACATTCAACCTGCAGAGGAATGAGGCAAGGCAGCTCAAGTCCAACAATCC

T

V

G

P

T

S

S

A

Y

N

Q

V

I

A

V

N

N

L

P

L

S

K

A

L

D

D

S

V

V

R

A

F

A

Y

S

T

F

V

N

A

L

F

Q

M

A

N

K

R

N

T

N

E

D

A

H

P

V

F

R

R

N

Y

A

Q

I

G

L

L

K

N

R

A

S

N

N

A

L

N

A

P

F KF LV GCGGACRQSENAGAGCTCGGCTTAGATTTCGCACCAAATCAATGAAAGTAATAATGAAAAGTCTGAATAAGAA CTCATTTGTCGTG

1901 2001

TACTTAGGCTTAGATGCCTTTGTTACTTGTGTAAAATAACTTGAGTCATGTACCTTTGGCGGAAACAGAATAAATAAAAGGTGAAATTCCAATGCTCTAT GTATAAGTTAGTAATACTTAATGTGTTCTACGGTTGTTTCAATATCATCAAACTCTAATTGAAACTTTAGAACCACAAATCTCAATCTTTTCTTAATGAA

Nucleotide sequence of an A-type legumin gene from pea.

655 Nucleic Acids Research, Vol. 18, No. 3 Nucleotide sequence of an A-type legumin gene from pea William G.Rerie1l 2, Malcolm l.Whitecross2 and T.J...
194KB Sizes 0 Downloads 0 Views