© 1990 Oxford University Press

Nucleic Acids Research, Vol. 18, No. 13 4013

Nucleotide sequence of human lactoferrin cDNA

M.J.Powell and J.E.Ogden

Delta Biotechnology Ltd, Castle Court, Castle Boulevard, Nottingham NG7 1FD, UK

Submitted May 9, 1990

Human lactoferrin is an 80 kD iron-binding glycoprotein which is a major component of colostrum. It is also released by circulating polymorphonuclear neutrophilic leukocytes and is found in a number of exocrine secretions (1). The amino acid sequence of lactoferrin has been determined (2); however, only a partial cDNA sequence has been published to date (3).

Two overlapping λgt11 clones were isolated from a human mammary gland cDNA library (Clontech). The library was screened with oligonucleotides derived from the published DNA sequence and with a PCR-derived probe, produced using a degenerate oligonucleotide based on the published amino acid sequence.
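To illustrate what "degenerate" means for an oligonucleotide designed from protein sequence, the sketch below counts how many distinct DNA sequences could encode a short peptide under the standard genetic code. It is not the authors' design procedure, which the note does not describe. The function names and the example peptides are hypothetical and chosen only for illustration.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def codons_for(residue):
    """Return every codon that encodes the given one-letter amino acid."""
    codons = sorted(c for c, aa in CODON_TABLE.items() if aa == residue)
    if not codons:
        raise ValueError(f"unknown residue: {residue!r}")
    return codons


def degeneracy(peptide):
    """Number of distinct DNA sequences that encode the peptide."""
    total = 1
    for residue in peptide:
        total *= len(codons_for(residue))
    return total


def least_degenerate_window(protein, width):
    """Return (start index, peptide, degeneracy) of the window needing the fewest oligos."""
    if width > len(protein):
        raise ValueError("window is longer than the protein")
    best = min(
        (degeneracy(protein[i:i + width]), i)
        for i in range(len(protein) - width + 1)
    )
    count, start = best
    return start, protein[start:start + width], count


if __name__ == "__main__":
    # Hypothetical peptide, not taken from lactoferrin.
    print(degeneracy("MWKQC"))
    print(least_degenerate_window("LSRMWKQ", 3))
```

Stretches rich in Met and Trp, which each have a single codon, give the smallest oligonucleotide mixtures, whereas Leu, Ser and Arg, with six codons each, inflate the mixture quickly.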

EMBL accession no. X52941

The DNA sequence encodes the entire mature lactoferrin protein and also 17 amino acids of a putative signal peptide (Figure 1). The deduced amino acid sequence contains ten single residue differences and also lacks 15 successive residues when compared with the published protein sequence (2) (Figure 1).
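As a minimal sketch of how a deduced amino acid sequence is obtained from cDNA and compared with a published protein sequence, the code below translates codons with the standard genetic code and lists single-residue differences between two sequences. The eight-codon fragment is read from the start of the mature-protein region in Figure 1. The "published" string used in the comparison is hypothetical, and the function names are illustrative. The comparison assumes the two sequences are already aligned and of equal length, so a 15-residue gap of the kind reported here would first need a proper alignment.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate an in-frame DNA string (spaces allowed) into one-letter amino acids."""
    dna = dna.upper().replace(" ", "")
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def residue_differences(deduced, published):
    """List (position, published residue, deduced residue) for each mismatch.

    Positions are 1-based. Both sequences must already be aligned.
    """
    if len(deduced) != len(published):
        raise ValueError("sequences must be aligned to equal length")
    return [
        (i + 1, p, d)
        for i, (d, p) in enumerate(zip(deduced, published))
        if d != p
    ]


if __name__ == "__main__":
    fragment = "GGC CGT AGG AGA AGG AGT GTT CAG"  # codons as printed in Figure 1
    deduced = translate(fragment)
    print(deduced)
    # Hypothetical "published" peptide with one substitution, for illustration only.
    print(residue_differences(deduced, "GRRKRSVQ"))
```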

REFERENCES

1. Aisen,P. and Listowsky,I. (1980) Ann. Rev. Biochem. 49, 357-393.
2. Metz-Boutigue,M.H. et al. (1984) Eur. J. Biochem. 145, 659-676.
3. Rado,T.A. et al. (1987) Blood 70, 989-993.

[Figure 1: sequence listing not legible in this copy; the sequence is filed under EMBL accession no. X52941.]

Figure 1. DNA sequence of lactoferrin cDNA with deduced amino acid sequence. The putative signal peptide is shown in lower case, mature lactoferrin in upper case. Single residue differences between the deduced and published amino acid sequences are underlined; the boxed residues DA replace 15 residues in the published sequence.
