Nucleic Acids Research, Vol. 18, No. 1
The nucleotide sequence of the cDNA encoding a chicken Deformed family homeobox gene, Chox-Z Hiroshi Sasaki and Atsushi Kuroiwa Department of Cell Biology, Research Institute for Tuberculosis and Cancer, Tohoku University, 4-1 Seiryomachi Aobaku Sendai, 980 Japan Submitted December 6, 1989
EMBL accession no. X17612
cDNA clones encoding a chicken Deformed (Dfd) family
REFERENCES
homeobox gene, Chox-Z, were isolated from chicken 4 day embryo cDNA library using a chicken genomic DNA fragment containing Chox-Z homeobox as a probe (Kuroiwa in preparation). The cDNA encodes a protein of 245 amino acids. The homeodomain of this protein (underlined) reveals strong Thehomologywiththo fanlyhome oboxgenestrong homology with those of of vertebratesDfd vertebrates Dfd famnily homeobox genes, Hox-2.6 (98%) (1), Hox-5.J (93%) (2), Hox-1.4 (93%) (3), Xox-JA (95%) (4) and Dfd (88%) (5). The entire Chox-Z protein shows homology with Hox-2.6 (66%) and Xhox-JA (63%) proteins. First 22 nucleotides just upstream from the transcriptional initiation codon have strong homology with corresponding region of Hox-2. 6 (86%).
Graham,A.,
1.
Papalopulu,N., McVey,J.H., Tuddenham,E.G.D. and
Krumlauf,R. (1988) Genes and Dev. 2, 1424-1438.
2. Featherstone,M.S., Baron,A., Gaunt,S.J., Mattei,M.-G. and Duboule,D.
(1988) Proc. Natl. Acad. Sci. USA 85, 4760-4764.
Vigneron,M., Featherstone,M.S., Baron,A. and GaGliot,B._ Dolle,P., Duboule,D. (1989) Development 107, 343-359. 4. Harvey,R.P., Tabin,C.J. and Melton,D.A. (1986) EMBO J. 5, 1237-1244. 3.
5. Regulski,M., McGinnis,N., Chadwick,R. and McGinnis,W. (1987) EMBO
J. 6, 767 -777.
GCGGTGCCCGGGGAACGGGGCCGGAGGGGAGCGGGCGCCCCCGGCCGCCTCCCCGGGGATGCTCCGGG CCCCGGGAGAGCCCGTGCCGGGCAGATTTCCTTATCCGGGGATCGCAGGCCACCTCGCCATTGGCCCGCGCT GTCACATGGACTCCAACTTTGTTCACTTGACAGTAAGTAGGAGGGTTTCACGAAACAGGAAAACGAGTAAAG GGGGGGACAGGAATAAATTTTAGGAAATATATATATATATATATTTTTCGTGTGTGCAATTCTAAGAAATTA ATGGCCATGAGCTCGTTTTTGATCAACTCCAACTATGTGGACCCCAAGTTCCCACCCTGTGAAGAGTATTCC M A M S S F L I N S N Y V D P K F P P C E E Y S CACAGCGATTACCTTCCCAATCACTCGCCGGAATATTACAGCAGCCAGAGGCGAGAGAGCACTTTCCAACAT H S D Y L P N H S P E Y Y S S Q R R E S T F Q H GAAGCGATGTACCAGCCGCGGTCAGCATGCAGCGAGCAGCTCTACCCGTCCTGTCAGAGCTCCGGGCACCAA E A M Y Q P R S A C S E Q L Y P S C Q S S G H Q GCAGCGGTGTTATCCCCCCGGGGTCATGTCCATCCTCCGGCCGGACTGCAGAGCCATCTCTCTGAGCCAAAC A A V L S P R G H V H P P A G L Q S H L S E P N CATCCCTGCGAGCCGGGCACCCCCAGCCCTCCACCCTCCTGCAGCCAAAACTCTCTGAACCAAAGCCCTTCC H
P
C
E
P
G
T
P
S
P
P
P
S
C
S
Q
N
S
L
N
Q
S
P
S
68 140 212 284 356 24 428
48
500 72
572
96 644
120
AATTCCTCTTGCAAAGAGCCGGTAGTTTACCCCTGGATGAAAAAAGTCCATGTAAGCACGGTAAACCCCAAT N
S
S
C
K
E
P
V
V
Y
P
W
M
K
K
V
H
V
S
T
V
N
P
N
716 144
TATTCAGGAGGGGAACCGAAACGCTCGCGCACAGCCTACACCAGGCAGCAGGTCCTGGAGCTGGAGAAGGAA Y
S
G
G
E
P
K
R
S
R
T
A
Y
T
R
Q
Q
V
L
E
L
E
K
E
788 168
TTCCACTATAACCGCTATCTCACCCGGAGGCGGAGGGTCGAAATCGCCCATTCTCTGTGCCTCTCCGAGCGC F
H
Y
N
R
Y
L
T
R
R
R
R
V
E
I
A
H
S
L
C
L
S
E
R
860 192
CAGATCAAAATCTGGTTCCAGAACAGGAGGATGAAATGGAAGAAAGACCACAAGTTACCCAACACCAAGATC Q
I
K
I
W
F
Q
N
R
R
M
K
W
K
K
D
H
K
L
P
N
T
K
I
932 216
AGGTCCAACCCTTCCAGCTCCTCCGCCAGCCTGCAGATCCCACCGGCAGCTTCTCAAAGCCGATCCAGCGGA R
S
N
P
S
S
S
S
A
S
L
Q
I
P
P
A
A
S
Q
S
R
S
S
G
1004 240
CCAGCCAGCAGCCTATAACTATTCCCTGGAGGATTTCAGGGCCCGTTGTCGTATGGCAGTGCCGGAGGTGGG P
A
S
S
L
*
GGTGAATTTTTGCTGCTGCAAATCTCCGCAGACCGTCTGTTGGGGAAAAAGCACCCGGAGTTCGACCAGAAT CTTCAGGGGAAATAATAATACTAATAAT
184
1076 245
1148 1176