ATI_~ C ASTGOC S_
©) 1990 Oxford University Press
7184 Nucleic Acids Research, Vol. 18, No. 23
Nucleotide sequence of cathepsin D
cDNA encoding
a
mouse
J.F.Diedrich, K.A.Staskus, E.F.Retzel and A.T.Haase* Department of Microbiology, University of Minnesota, Minneapolis, MN 55455, USA Submitted June 13, 1990
We report the nucleotide sequence of mouse cathepsin D mRNA derived from clones in a XgtlO cDNA library constructed from C57BL/6J mouse brain. Cathepsin D is a lysosomal aspartyl protease, and this cDNA was isolated while screening for upregulated genes in scrapie (manuscript in preparation). The sequence is 1893 bp and contains an open reading frame of 1230 bp with an initiation codon at nucleotide position 7 and a termination codon at 1237. The predicted amino acid sequence (410 aa) shows 81 % similarity with the human kidney cathepsin D (1) and 82% with the porcine spleen cathepsin D (2, 3). ActiveME T
P
C
L
V
L
L
I
S
LL A
L
site aspartyl residues and the presumed polyadenylation signal within the 3' untranslated region are underlined.
REFERENCES
1. Faust,P.L., Kornfeld,S. and Chirgwin,J.M. (1985) Proc. Natl Acad. Sci. USA 82, 4910-4914. 2. Shewale,J.G. and Tang,J. (1984) Proc. Natl. Acad. Sci. USA 81, 3703 -3707. 3. Erickson,A.H., Conner,G.E. and Blobel,G. (1981) J. Biol. Chem. 256, 11224-11231.
F
A
I
I
R
P
I
L
R E
F
T
S
I
R
R
T
M
GG
V
S V E D
L
I
L
K
C
P
I
T
S M Q
K Y
Y
Y G D
V
H
H
K
K
C
F T V V r
I
G I G T
P
P
Q C
Y
R
K
8
8
T
Y
C
I
X V
9
D
V
K
N
F
K
S
S
P
K TT
Z
P
S E
V
L
L
K
N
Y
L
D
120 Q
G
L W
V
P
S I
R
C
K
I
L
A
C
W
H Y
G
8
C
S
L
C
Y
L
8
Q D T V
S
V
T
Q P
G
I
V
D
T G S S B
T
S
r
D
I
I
r
G
Z
A
D
I
240
360 8
8
D
Q
8
K A
R
Q
K
r
V
A a
K
r
Y
P
H
I
8
V
N V
N1
L
P
V
r
D
N
L
M
Q Q K L V D K N I r
8
r
Y
L
N
R
C
Z
N
F
L
T
I
Q
C
Z
K
T
G C
L
L
M L
I
Y
G ZE L S
Y
L
N V
T
R
K A
S
L
L
V
C
P
V
Z
G
T
D
S K
Y
A I
V 2
T
C
T
Q
L
Z
V
G
P
L
C
N
Q
V
B
Z
V K
Z
L
Q
K A
I
C
A V
C
Z
M I
P
C
Z
K V
8
8
L
P
T
V
Y
L
K
L
C
K N
Y
Z
L B
P
D
K
Y
I
L
K
V S
M
M
D
I
P
P
P
8
C
P
L W I
L G
D
V
r
I
C
S Y
Y
T
V
r
D
L S
CGr
O F
A
M A V V
L
OOCOCTCT ACCT
720
840
C
C
480
600
L
G
SCCCO COC -
R V
N
Q
Z
Y
M D
G
P
D
K
C
Y
CG
G
I
T T C CT & GTACGT _CC _CTA P G
L
G
D
K==TCD CCA SDQTAAETSCTCGVCV_ C
A
CTXCT^OGGACGOCCACTTTOTT
CTCQCAC
N
K
CATGCGTCGTACCACCCGGAGCLC CAGTTACTCAAAAACTACCTGGATGCCAG
GTGGGCGGCTCTGCACCTGCTCCCCCCTCACTC
M
T
CTCCaTSaCTCTTCCTCGGCTCCTCCTTCe;~CTTGCATTATCACAATCCCTCTCGCATTCATCTATCCCCGTCliCTATGALCCCA
GCCC
P
S
EMBL accession no. X53337
960 Q
1080 R
D
OOCAG'-CAITTTTMTAC&TTTOACGAGC 1200
*
_ GCACCASCccAcCTCCICCTCCCZCCA
1320
CTCACJCASACTCAQCSC'CGCTACTGTCSOSGCCTT__TCTOTCCTCACCCSTGCCTCkLTCTOCCCCTCl!COT 1440 TC 1560 COCCTCCOCTCCTC&;=X GT=CTAAGGGCCAROC^CC = c
A
_
AoloAA
g
c
=-s~~~CCCCCCArAACCTGoC
S
CTCACTOTTGCT=ICTOCT
C
_Ak
TACCCCTTGOCGCOCOSCT CSAC_
GSCCCCTGTA=ArAA
STTGCOCTTfCr __==
1680 1800 1893
Note added in proof. Since the acceptance of this manuscript we have become aware of results by Grusby et al. (Nucl. Acids Res., vol. 18, p. 4008, 1990) concerning the cloning of a mouse cathepsin D cDNA. Their sequence shows one nucleotide difference (position 1681, A to T) in the 3' untranslated region when compared to the sequence given here. This sequence was accepted July 16, 1990. * To whom correspondence should be addressed