GENOMICS
10,940-948
Isolation
(1991)
and Characterization of Two Human HI Histone within Clusters of Core Histone Genes
WERNER ALBIG,*
EFTERPI KARDALINOU,
Genes
* BIRGIT DRABENT, * ANDREAS ZwdMm, 7 AND DETLEF DOENECKE*
‘Institut fiir Biochemie, Universitat Gottingen, Humboidtallee 23, O-3400 GcSttingen; and t Max-Pianck institut filr Experimenteiie Medizin, Abt. Moiekuiare Neuroendokrinoiogie, Hermann-Rein-Strasse, O-3400 Gbttingen, Federal Republic of Germany Received
December
27, 1990;
Press, Inc.
INTRODUCTION
The Hl class of histones contributes to the structure of chromatin at several levels of organization. It is a prerequisite for sealing two rounds of DNA around the nucleosomal histone octamer (Simpson, 1978; Allan et al., 1986) and it is needed for the formation of higher order chromatin structures above the level of nucleosomes (Thoma et aZ., 1979). Each histone class consists of several subtypes, with the probable exception of H4. The Hl group represents the most complex pattern of histone isoforms. Based on two-dimensional gel electrophoresis data, five main types of Hl histones were defined and termed Hla-Hle (Lennox and Cohen, 1983). Another subtype, Hl”, is restricted mainly to terminally differentiated tissues (Panyim and Chalkley, 1969), whereas a testicular Hl histone (Hit) is found only at specific stagesof sperm development (Seyedin et al., 1981; Cole et al., 1984, 1986). Varied patterns of Hl histones in different tissues suggest a functional role for individual Hl subtypes, which may influence higher order chromatin strucSequence data EMBL/GenBank M60752.
0888-7543/91
Copyright All rights
0 of
$3.00 1991
from this article have been deposited Data Libraries under Accession Nos.
by Academic Press, Inc. reproduction in any form reserved.
April
1, 1991
tures by modulating the cooperative interactions between Hl histones (Weintraub, 1984). Histone genes are organized as clusters (Hentschel and Birnstiel, 1981; Maxson et al, 1983). Such groups of histone genes can be highly regular tandem repeats of the five histone genes, or the individual histone classes can be randomly organized within clusters. Varied numbers of histone genes in the mammalian genome have been described (Stein et al., 1984). Mapping of histone gene clusters, hybridization analysis, and sequencing demonstrated the heterogeneity of histone gene clusters and the differences between nucleotide sequences of individual histone genes of the human and murine genomes. A lack of any defined histone gene repeat pattern has been similarly found in avian histone gene clusters. D’Andrea et al. (1985) demonstrated that the chicken histone gene complement is essentially distributed in three different clusters containing 40 histone genes including six Hl genes (coding for the subtypes Hla-d and two additional variants Hla’ and Hlc’; Shannon and Wells, 1987). A gene arrangement similar to those of the chicken histone gene clusters was found in the duck genome, where again all five histone genes were represented in this cluster of genes (Tiinjes and Doenecke, 1987). The number of human histone gene clusters is unknown. Chandler et al. (1979) hybridized cloned sea urchin histone gene probes to human metaphase chromosomes and found hybridizations only on the long arm of chromosome 7. Later, Triputti et al. (1986) screened a panel of mouse-human hybrid cells and found positive signals on chromosomes 1, 6, and 12. This chromosome assignment was confirmed by in situ hybridization of labeled histone gene probes to metaphase chromosomes. This localization on individual chromosomes describes the histone gene distribution in general, but individual dissimilar clusters may be found within these large regions. The information on human histone gene clusters is still in-
Two human II1 histone genes, termed H1.3 and H1.4, were isolated from two cosmid clones. The H1.4 gene is associated with an HZB gene, whereas genes coding for all four core histones are located in the vicinity of the H1.3 gene. This cluster arrangement was found both in the two cosmid clones and on overlapping bacteriophage clones isolated from an EMBLB library. In continuation of our previous analysis of two human Hl genes, this analysis raises the number of completely sequenced Hl histone genes within clusters of core histone genes to four. o 1~1 Academic
revised
with the M60746-
940
HISTONE
GENE
CLUSTERS
complete. Most portions of histone gene clusters previously described showed several core histone genes, but Hl genes were missing in most cases. Carozzi et al. (1984) and Zwollo et al. (1984) were the only authors to detect all five histone genes on continuous or overlapping genomic DNA clones. The Hl genes participating in these clusters were identified in one case by sequencing an N-terminal segment of the Hl gene (Carozzi et al, 1984), whereas Zwollo et al. (1984) showed only Southern blot hybridization data. In our attempt to characterize the human Hl gene complement, we first described the Hl” gene (Doenecke and Tiinjes, 1986) and, recently, we published the gene structures of two main-type Hl histones (Eick et al., 1989). Previously, the only known human Hl sequence was based on protein sequencing (Hlb; Ohe et al,, 1986), except for the partial sequence derived from a human Hl gene by Carozzi et al. (1984). Both newly described gene sequences differed from these data and from Hla, Hlc, and Hld, which have been published in the meantime (Ohe et al,, 1989). The nomenclature used by Ohe et al. was based on criteria other than the designation of subtypes Hla-e by Lennox and Cohen (1983). To avoid any confusion, we decided to term the newly discovered H1 genes with consecutive numbers as H1.l and H1.2 genes (Eick et al., 1989). Here, we describe the gene and the surrounding structures of two additional Hl gene sequences, termed H1.3 and H1.4, together with neighboring core histone gene sequences. MATERIALS
Screening of a Human Cosmid Library
AND
METHODS
Genomic Phage Library and a
A human genomic library in EMBL3 bacteriophage was obtained from Clontech (Palo Alto, CA). The library had been constructed from human granulocyte DNA and was screened using the in situ plaque hybridization technique described by Benton and Davis (1977). After three rounds of purification, five recombinant bacteriophage were isolated and the phage DNA was prepared (Sambrook et al., 1989). A cosmid library, which had been prepared in the laboratory of A. M. Frischauf (London) with DNA from human blood cells using pcos2EMBL as a vector (Ehrich et al., 1987), was screened as described by Herrmann et al. (1987). Two cosmid clones were isolated and analyzed as described (Herrmann et al.,
1987). In both cases, a DNA fragment prepared from the coding portion of the human H1.2 gene (Eick et al., 1989) was taken as a hybridization probe and labeled with [a-32P]dATP according to Feinberg and Vogel-
INCLUDING
Hl
941
GENES
stein (1983) using the Amersham random prime labeling kit.
Mapping and Subcloning Insert DNA
of the Phuge and Cosmid
DNA was prepared from the individual cosmid and phage clones, cut with several restriction enzymes and combinations thereof, separated electrophoretitally and analyzed by Southern blot hybridization (Southern, 1975). Hybridization probes were prepared from the coding portions of the human H1.2 gene (Eick et al, 1989) and individual core histone genes (Eick and Doenecke, unpublished). These probes were labeled with [u-~~P]~ATP by random priming (Feinberg and Vogelstein, 1983). Hybridizations were done at 64°C in 4X SSC (1X SSC: standard saline citrate, 0.15 M NaCl, 0.015 M Na citrate) in 5~ Denhardt’s solution (Denhardt, 1966). DNA fragments were isolated and subcloned in pUC19 plasmid, and the Escherichia coli K12 derivative DH5a was used as a host strain. Plasmid DNA was prepared as described (Birnboim and Doly, 1979; Sambrook et aZ., 1989).
DNA Sequence Analysis and Sl Nuclease Assay The subcloned fragments derived from the phage and cosmid clones were sequenced according to Sanger et al. (1977) using the T7-sequencing kit provided by Pharmacia (Uppsala, Sweden) under the conditions described by the manufacturer for sequencing short and long distances relative to the primer sites. The 35S-labeled sequencing reaction products were separated on 6% polyacrylamide gels (Maxam and Gilbert, 1977) and exposed to Kodak X-ray film. Each of the sequences presented in this paper was obtained by analyzing both DNA strands. The mRNA start sites of both Hl genes were determined by Sl nuclease mapping using the protocol of Weaver and Weissmann (1979). RNA was prepared from the human monocytic U937 cell line (Sundstriim and Nilsson, 1976) as described by Chirgwin et al. (1979) and hybridized to appropriate end-labeled, single-stranded DNA fragments, and the resulting hybrids were exposed to Sl nuclease. Protected DNA was analyzed by electrophoretic comparison with a G/A sequence ladder (Maxam and Gilbert, 1977).
Materials
and Enzymes
Restriction enzymes were from Boehringer (Mannheim, FRG), Bethesda Research Laboratories (BRL, Gaithersburg, MD), and New England Bio-
942
ALBIG EH
EEH
E
H
H EEB
ET
AL.
EE
HZB H2A
E
H3 cormi
‘
Ii
HH
E
Ill.3
E
H4
CS
J phage
L phage
,
BHHE
Hl2
012 lkb -
e
HSHII
BE
BE
H2B cormid
I
C3 phage
phage
I
F12
1
El4 phage
0
B
HI.4
Jll
FIG. 1. Chromosomal arrangement of histone genes within two gene clusters. Overlapping inserts of cosmid indicated below the restriction site maps. Coding sections and transcription directions are indicated by arrows. sites are abbreviated: E, EcoRI; H, HindHI; B, BarnHI; S, SaK
labs (Beverly, MA). DNA-modifying enzymes were from Boehringer and BRL and the T7-sequencing kit was from Pharmacia (Uppsala, Sweden). Radioactively labeled nucleotides were from AmershamBuchler (Braunschweig, FRG). Filter materials were from Schleicher & Schuell (Dassel, FRG) and Amersham-Buchler. Chemicals for electrophoresis were from Serva (Heidelberg, FRG) and Sigma (Munich, FRG). All other chemicals were from Merck (Darmstadt, FRG). RESULTS
AND
DISCUSSION
Isolation of Two Human Histone Containing HI Genes
Gene Clusters
We have isolated two groups of histone genes upon screening a human cosmid library and an EMBLS phage library with a human Hl histone gene probe (termed H1.2 in Eick et al., 1989). Five recombinant phage and two cosmids were isolated and the insert DNA segments were mapped using Hl and core histone probes (composite drawing in Fig. 1). The phage insert sizes ranged between 12 and 19 kb, whereas one of-the cosmid inserts was 30 kb in length. Surprisingly, the second cosmid insert measured just 6 kb. Since this insert size is too short to allow packaging of insert and cosmid vector in the bacteriophage particle, this short insert may have bypassed the packaging selection due to vector concatemerization. The large cosmid insert (designated C5) harbored one gene for each of the five histone classes, except an H4 gene. An H4 gene, however, was covered by an overlapping bacteriophage insert (phage H12), which had been isolated using the same Hl probe as that
and bacteriophage clones are Restriction enzyme cleavage
used for cosmid library screening. Comparison of the composite restriction map of these cosmid and phage inserts with the data of Zwollo et al. (1984) suggests that these clones may share common sites of the genome. Lack of sequence data and lack of fine mapping in the previous data, however, prohibit a formal proof. The small cosmid insert (C3) organization was identically found on three overlapping phage clones (B14, F12, and Jll; seeFig. 1). Thus, this surprisingly short cosmid insert covered an authentic genomic DNA fragment showing an Hl gene and, with the same transcriptional orientation, an H2B gene.
Sequences of Core and HI Histone
Genes
Each of the Hl and core histone genes indicated in Fig. 1 was sequenced and compared with published sequence data. Figures 2 to 8 show the sequences of the coding and flanking sequences of these seven genes. According to Franklin and Zweidler (1973), individual core histone subtypes may be discriminated on the basis of a few amino acid exchanges. According to this nomenclature, the H2A, H2B, and H3 genes described here may be defined as H2A.1, H2B.1, and H3.1 genes. The two H2B.l genes, however, which are located near the H1.3 and H1.4 genes, respectively, differ at one amino acid residue (Ala/Thr exchange at position 4). The amino acid sequence encoded by the H4 gene is identical to known mammalian H4 sequences (Wells and McBride, 1989). Each of the seven genes features the 3’ consensus sequence which is the site of histone mRNA precursor processing and formation of the mature histone mRNA 3’ end (Hentschel and Birnstiel, 1981; Hoffmann and Birnstiel, 1990). Several specific sequence motifs upstream of spe-
HISTONE
GENE
CLUSTERS
GAAAGAGAGT
TGGGGGAATA AGMAAGTTT
W~TGCCG
GTTTTAAAAC
TATGCATAGA
MCAMGGCA
GGAATGMGC
CCGMCTCTC
TCG~T~GT
TTCCCCCMG
TCTCTAATTA
TTTCATGCCC
AGATTTCTTA
T A T T T T T A C T CTTTTTTMG
GGGCMCAAA
CACAGCAGCG
CGGTAGATAC
GAGGAGTCCT
TTTCCAGCAG
1 GGAGCAAGGA
ACCMTCATC
ACTCAGCGTC
TCTCJTATSCCCTCAGC
GAGGA&TGC
TGTTCTGACA
GTTTGAGATT ACTTATTGlt
CGCCGCGCAT
GGG AAA CCC MA Gly Lys Arg Lys MG
AAA
Lys
GCA TCC GGA CCC CCA GTA TCT GAG CTT ATC ACC Ala Ser Gly Pro Pro Val Ser Glu Leu Ile Thr 45
GCA GTG GCA GCT TCT MC Val Ala Ala Ser Lys
GCG CTT MG MA Leu Lys Lys
Ala
MC
Asn
Comparison 15
AAG CCC AAG AAG GCA GGC GCA ACT GCT Lys Ala Lys Lys Ah Gly Ala Thr Ala 30
LyS Ala
GAG CGC AGC GGC G T T TCT CTG GCC GlU Arg Ser Gly VSl Ser Leu Ala 6o
GCG CTT GCG GCT GCT GGC TAC GAT GTA GM AAA Ala Leu Ala Ala Ala Fly Tyr Asp V&l Glu Lys '5
MC AGC CGT ATC AAG CTT GGC CTC MC Asn Ser Arg Ile Lys Leu Gly LB" Lys
AGC TTG GTG AGC AM Ser Leu Vdl Ser Lys
9o
GGT ACT CTG GTG CAG ACC AAA GGT ACC GGT GCT TCT GGC TCC TTC Gly Thr Leu Val Gln Thr Lys Gly Thr Gly Ala Ser Gly Ser PhP"' AM Lys
CTC MC Leu
Asn
GCT GGC GCA GCC MG CCT AGG MG Ala Gly Ala Ala Lys Pro Arg Lys
AAG MG Lys Lys
CCC AAG MG Pro Lys Lys
ATC MA Ile Lys
MG
ACT CCT MG Lys Thr Pro Lys
CAG CCA AAA AM MG
Lys
Lys
AAG GTA AAG AAG CCA GCA ACC GCT GCT
Lys
Lys
Pro
Ala
Ala
MG
GCA AAG MG Ala Lys Lys
Ala
Lys
MCTGGCGGG
ACGTTCCXT
TCTCAGTCAA AAGAGCTGAA
Val
Lys
Lys
Pro
Lys
Ala
Thr Ala
Ala"'
AGT GCG AAA AAG GTG AAA ACA CCT Ser Ala Lys Lys Val LyS Thr P=dao
GCT GCC AAG AGT CCA GCT MG Ala Ala Lys Ser Pro Ala Lys
CCC AAG GCG GCC MG
Lys
CCT GCT GGG GCA GCC Pro Ala Gly Ala Ala1'5
GTG GCT GGC GCC GCT ACC CCG MG AAA AGC Val Ala Gly Ala Ala Thr PP.3 LyS Lys SerI"
GGG ACC MG AAA GTG GCC MG Gly Thr Lys lys Val Ala Lys Pro
and the require subtypes subtypes 1984).
of Human
HI Histone
Genes
Comparison of the Hl amino acid sequences derived from the H1.3 and H1.4 genes shown in Figs. 2 and 3 with protein sequencing data published by Ohe et al. (1986,1989) indicates that these genes code for the subtypes Hlc (H1.3) and Hlb (H1.4) as defined by Ohe et al. (1989) on the basis of sequencing spleen Hl histones. By Sl nuclease analysis with RNA from the human mono&c cell line U937, we observed a full protection of the coding and 5’ flanking DNA up
AAG AAA GCG GCT TCC GGG GM GGC AAA CCC MG GCC Lys Lys Ala Ala Ser Gly Glu Gly LyS Pro LyS Ala"o
AM MG Lys Lys
Gln
943
GENES
CTCCCTCGTA
TTTTCTGGGA AGA-C
ATG TCG GAG ACT GCT CCA CTT GCT CCT ACC A T T CCT GCA CCC GCA GM Ser Glu Thr Ala Pm Leu Ala Pro Thr Ile Pro Ala Pro Ala Glu AAA ACA CCT GTG MG Lys Thr Pro Val Lys
Hl
ing and flanking sequences. The Hl genes derived protein primary structures, however, a detailed discussion, since individual Hl differ substantially and the pattern of Hl may be functionally important (Weintraub,
H1.3 CTTCGATTTG ACCGAAGTTT
INCLUDING
CCT MC Pro Lys
TCG GGG MG Ser Gly Lys
GCC AAA GCC CCT Ala Lys Ala Prd9' CCG MC
Pro
LyS
GTT ACA Val Thszo
TT;TCAGAGC
TGCTTCCCCT
TATMTTTGA
GAAAAACCGG
GCAGTTCGGT GTAGACAATT TTTATATTTT TGGCTTTTTT TGAGGTGTM
CAAACACMC 7 CGCACCAATC
ACGGCGCACG
TCCGCCCTAT ATAAACGGGC
2 GGGCGCAGCG CCGCGGCTCG
k&C
AGTGCCTCTG
CTTCCGGCTC
CGCTCACGCT
CACCCACAGG
GCTTTTTGGA GGGGGGAGTG GGGTGGAGAG GGGTGCTGCG
GTGTTGTGCG GCCACGGTCT ATCCTAGTCG TGCTGGTTGG GGGTCAGTTA TTAACAGCTC
FIG. 2. Nucleotide and.amino acid sequences of human histone H1.3 gene and protein, respectively. The mRNA-like strand is presented in 5’to I’direction (the same in Figs. 3-7). Consensus regulatory elements are marked: 1, Hl box; 4, CCAAT element; 5, TATA box; 6, 3’ stem-loop structure, followed by’ACCCA (mRNA terminus).. The cap site as determined by Sl nuclease mapping with RNA from the human cell line U937 is indicated by an open arrowhead. The conserved octapeptide TPKKVKKP starts at amino acid 154.
GTTCGGGGAT CATTATCT~
&ATTGCTCT
TGCCTTmc
GAG ACT GCG CCT CCC GCG CCC GCT GCT CCG GCC CCT GCC GAG Se= GlU Thr Ala Pro Ala Ala Pro Ala Ala Pro Ala Pro Ala GJ" 11
ATG WC MG
ACT CCC GTG MC
Lys
AAG MG
Lys
Lys
GCC CCC MG Ala Arg Lys
TCT GCA GGT GCG Gee Ser Ala Gly Ala Ala JO
MG CGC AAA GCG TCT GGG CCC CCG GTG TCC GAG CTC A T T ACT AM L!jS A=9 LY= Ala Ser Gly Pro Pro Val Ser Glu Leu Ile Thr LYS (5 GCT GTT GCC GCC TCC MG GAG CGC AGC GGC GTA TCT T T G CCC GCT Ala V*l Ala Ala Ser LyS Glu Arg Ser Gly Val Ser Leu Ala Ala 60 CTC MG AAA GCG CTG GCA GCC GCT GGC TAT GAC GTG GAG AAA MC L*U LY= LY= Ala Leu Ala Ala Ala Gly Tyr ASP Val Glu Lys Asn '5 MC AGC CGC ATC MG CTG GGT CTC AAG AGC CTG GTG AGC MC GW Asn Se= Arg II* Lys Leu Gly Leu Lys Ser Leu Val Ser Lys Gly 90 ACC CTG GTG CAG ACC MG
Thr LeU
Val
Gin
Thr Lys
GGC ACC GGC GCG TCG GGT TCC TTC AAA Gly Thr Gly Ala Ser Gly Ser Phe Lyso'
CTC MC MG AAG GCG GCC TCT GGG GM GCC MG CCT MG GCT AAA Asn LYS Lys Ala Ala Ser Gly Glu Ala Lys Pro Lys Ala Lyslo
Leu MG
Lys
GCA GGC GCG GCC AAG GCC AAG AAG CCA GCA GGA GCG GCG MG Ala Gly Ala Ala Lys Ala tys Lys Pro Ala Gly Ala Ala Lysll'
AAG CCC MG AAG GCG ACG GGG GCG GCC ACC CCC MG MO AGC GCC LyS P=O LyS Lys Ala Thr Gly Ala Ala Thr Pro Ltjs Lys Se= Ald"O MG
Lys
cific histone genes. have been described as binding sites for tram-acting factors involved in histone gene expression. Specifically, regulatory elements have been described upstream of Hl (Coles and Wells, 1985), HZB (Sive and Roeder, 1986), and H4 (Capasso and Heintz, 1985; Clerc et al., 1983) genes. Each of these elements was found in the respective genes and is underlined in the sequences. Coles and Wells (1985) have shown that a GC-rich element (Spl site) is frequently located upstream of vertebrate Hl genes. It is found in the promoter region of H1.4 (Fig. 3) and H1.l (Eick et al., 1989), but not in the H1.3 and H1.2 gene promoters. Thus, the seven genes described here essentially correspond to consensus histone gene cod-
GATCTGCGTG AAGCCTGAGG
TCGGGATCCG AGAGGACACT CTGCGGCTGC CAGCGAGGCG GGCTGaaG
LYS Th= P=O V*l
GCA GCT CCG MG AAA MG TGA Al& Pro LyP LyS Lys *I* TTGAAAATTT TAAACGGCTC
En.4
AAG ACC CCA MG AAG GCG MG LYS Thr Pro Lys LyS Ala Lys
MG
Lys
CCG GCT GCA GCT GCT GGA Pro Al& Ala Ala Ala Gly'65
GCC AAA AAA GCG AAA AGC CCG AAA AAG GCG AAA GCA GCC MG CCA LYS LYS Ala Lys Ser Pro Lys Lys Ala Lys Ala Ala LyS PrdsG
Ala
AAA AAG GCG CCC AAG AGC CCA GCG MG Lys Lys Ala PTO Lys Ser Pro Ala Lys MG
GCC AAA GCA GTT CCC Lys Ala Vdl Lys Prde5
Ala
GCG GCT AAA CCA AAG ACC GCC AAG CCC AAG GCA GCC MG Ala Lys Pro Lys Thr Ala Lys Pro Lys Ala Ala LyS
Lys Ala
AAG AAG GCG GCA GCC MG AAA AAG Lys Lys Ala Ala Ala Lys Lys Lys AAAGTTCCTT TGGCCMCTG ACCCACCGCT
CTCAGTAAAA
TGCTMGCAG
GG
CCA
Prd'O
TAG l
**
CTTAGAAGCC CAKACMCC
CAAAGGCTCT TTTCAGAGCC sGAGCTGTTGC ACTATTAGGG GGCGTGGCTC GGGAMACGC
FIG. 3. Nucleotide and amino acid sequences of human histone H1.4 gene and protein, respectively. Consensus regulatory elements are: 1, Hl box; 2, Spl site; 4, CCAAT element; 5. TATA box; 6, 3’ stem-loop structure, followed by ACCCA (mRNA terminus). Cap site: open arrowhead (determined as in Fig. 2 by Sl nuclease assay). The conserved octapeptide TPKKAKKP hrts at position 152.
944
ALBIG
H2A.I
TCG ACC MG TCC GTAS' TGA CTT AGC TGG TTC AGG CAT
)‘ACT
GM
GCTGTCAGAAA
ACAATAACAG
CAGTGAGMT
GAACGCACTT
MATAMAGC
TCGTGTCTAG AGTCTCTCCT
TTTATAGGCC
TTTCATGCM
ATAAAGMTT
CAMATATCC
AGCTCTGATT
GGGCMTGTG
TTAGTGACGC
ATACATGTM
AATAGCCTTC
ACCTTATTTC
CTTTCTMTT
GGTTGGCTCG TCMAGAACA
ATCAAATTGC GAGCCCATTC
GCCTTTCACA
ATTCTACCGA
TGACTATAAC
T T T T T C T T T A TTCAGTGGAT T G T T Z T
ATTTTMCCA a TAGCTTCTTA TTCCTCCATC TCTGCTGTTA GGAAGCCACT
ATG TCT GGA CGT GGA AAG CAA GGC GGC AAA GCT CGG GCA AAA GCT AAA Ser Gly Arg Gly Lys Gin Gly Gly Lys Ala Arg Ala Lys Ala Lys 15 ACG
Thr
CGT TCT TCC Arg Ser Ser
AGG Arg
GCC Ala
His
CCC Arg
CTC CTC CGC MA Leu Leu Arg Lys
GGC Gly
GCT Ala
CCA GTG TAC Pro Val Tyr
CAC
GGT CTT CAG T T T CCA G T T GGC CGT GTG Gly Leu Gin Phe Pro Val Gly Arg Val 30 GGC MC Gly Asn
TAC
Tyr
TCC GM Ser Glu
CTG GCA GCG GTG CTG GM Leu Ala Ala Val Leu Glu
GAG ATC T T A GAG CTA GCT GGC AAC GCG GCT Glu Ile Leu Glu Leo Ala Gly Asn Ala Ala
CGA GTC GGG GCC Arg val Gly Ala '5 T A T CTG Tyr Leo
ACG GCC Thr Ala (Q
CGC GAC MT AAG AAG Arg Asp Asn Lys LyS 6o
ACC CGC ATC ATC CCG CGC CAC CTG CAG CTA GCC ATC CGC MC Thr Arg Ile Ile Pro Arg His Leo Gln Leu Ala Ile Arg Am GAG GAG CTA GlU Glu Le"
MT AAG CTT CTA GGT Asn Lys Le" Le" Gly
GGT GTC CTG CCC MC Gly Val Leu Pro Asn
CGC GTG ACC Arg Val Thr
GAC Asp'=
ATC GCG CAG GGC Ile Ala Gin Gly 9o
ATC CAG GCC GTA T T G CTG CCT AAG AAG ACG Ile Gln Ala Val Leu Leu Pro Lys LyS Thrla5
GAG AGC CAC CAT AAG GCC AAG GGC AAG TGA Glu Ser His His Lys Ala Lys Gly Lys *** AATGATTACT
AGTCAAATCC
GAGCCACCCA
CCTTTTCTGT AAAGTGCTGG AATACACATA
GTCAGTGATC
CCGAGTCCCA
GAAACCAAAG
CGATGCCTGA
GCTCTTTTCA 6-AATCTCAATG
FIG. 4. Sequence of H2A.l gene. The mRNA-like strand of the H2A.l gene is shown in 5’ to 3’ direction, whereas the top line describes the complementary strand, beginning (5’) with the ATG initiation codon of the H2B.l gene described in Fig. 5. Numbering of consensus regulatory elements as in Figs. 2 and 3.
ET
AL.
sis does not depend on cell division. Its gene is not part of a histone gene cluster and the flanking parts of that gene differ from the respective portions of cellcycle-regulated histone genes. Comparison of the sequences encoded by H1.l to H1.4 reveals that the N-terminal and C-terminal domains of these four Hl subtypes differ in several segments. First, the second (basic) half of the N-terminal domain, which is involved in aligning the central domain at the nucleosomal DNA (Allan et al., 1986), varies in all four subtypes. Specifically, the H1.l sequence varies widely from the other three subtypes H1.2 to H1.4. This difference between H1.l and the other Hl histones is similarly evident in the C-terminal domain. Here, the alignment of H1.l to H1.4 needs several insertions (and some deletions) in the H1.l sequence compared with the other Hl sequences, even with the Hl” sequence. On the basis of a previous comparison of several Hl sequences with chicken Hl and H5 data (RuizCarrillo et al., 1983), we have shown that most Hl histones show the octapeptide T( S)PKKAKKP within their C-terminal domains. This sequence
HZB.l(a) X'AAC GM AGG TGC AGG TCT GTA5' T T G CTT TCC ACG TCC AGA CAT
to the mRNA start sites of both the H1.3 and H1.4 genes, as indicated in Figs. 2 and 3. This proves that both genes are expressed genes coding for the subtypes described by Ohe et al. (1989) as Hlc and Hlb. This nomenclature used by Ohe et al. differs from the assignment given by Lennox and Cohen (1983) to individual spots in two-dimensional electrophoresis. In that case, however, sequence data on human Hl subtypes are missing. Previously, we have published the sequences of two human Hl genes (Eick et al., 1989), which differ from the data presented here as well as from the protein sequence data on Hla-d published by Ohe et al. (1986, 1989) and from the partial Hl sequence presented by Carozzi et al. (1984). Because of the discrepancies described above, we decided to introduce a consecutive gene nomenclature (Hl.l-H1.4). In addition to the four main type Hl sequences, which are all associated with core histone genes, we have included the Hl” protein structure as derived from the respective gene (Doenecke and Tonjes, 1986) in the comparison of individual Hl subtypes (Fig. 9). The alignment of amino acid sequences shows that the subtype Hl” varies drastically from the other four subtypes. This may reflect the different role of Hl”, which is confined to terminally differentiated cells, and its synthe-
AGTGGCTTCCT
MCAGCAGAA
GAACTAACM
ATGGAGGAAT
AAGMGCTAG
TTATAGTCAT
CAATTTGATT G G T T - T T
GTTCTTTGAC
GAGCCMCCA
AAGGCTATTT TACATGTATG
CGTCACTGAC
TCCACTGAAT
AAAGARAAAG
MTGGGCTCG
CGGTAGMTT
GTGAAAGGCG
A~TAGAMGG
AAATMGGTG
@ZAGAGCTG
GATATTTTGA A T T C T T T A T T FGAMG
TAGACACGAG
C T T T T A T T T A AGTGCGTTCA TTCTCACTGC
ATG
CCT GM Pro Glo
CCA Pro
GCT
Ala
AAG Lys
TCA
Ser
a ACATTGCCCA
GCCTATAAAA GGAGAGAZ S T G T T A T T G T T TTCTGACAGC
GCT CCT GCT CCG MG Ala Pro Ala Pro Lys
AAG GGT TCC AAG Lys Gly Ser LyS I5
AAG GCT GTG ACC MG Lys Ala Val Thr Lys
GCG CAG Ala Gln
CGC AGT CGT MG
AGC TAC TCC GTG T A T GTG TAC AAG GTG CTA Ser Tyr Ser Val Tyr Val Tyr Lys Val Le"'s
Arg
Ser
At-g
Lys
AAA CAG G T T CAC Lys Gln Val His ATC ATG Ile "et
MT AS"
GAG GCT TCC Glu Ala Ser
GAG
Glu
CAC
His
GAC Asp
ATC TTC GM Ile Phe Glu
CGC ATC GCA Arg Ile Ala
GGC Gly '5
CTG GCC CAC TAC AAC AAG CGC TCG ACC A T T ACC Leu Ala His i"yr Asn Lys Arg Ser Thr Ile ThrgO
TCC AGG GAG ATC CAG Ser Arg Glu Ile Gln CTG GCC MG Leu Ala Lys
AAG CGC AAG Lys Arg Lys I0
CCC GAT ACT GGC ATC TCA TCC AAG GCC ATG GGC Pro Asp Thr Gly Ile Ser Ser Lys Ala Met Gly 6o
TCC TTC G T T MC Ser Phe Val Asn CGT Arg
AAG AAG GAT GGC MG Lys Lys Asp Gly Lys
ACC
Thr
GCA GTG Ala val
T A T ACA AGC TCC AAG Tyr Thr Ser Ser Ly.s
GCC GTG Ala
Val
TCC GM Ser Glu
CGT Arg
CTG CTG CTT CCC GGA GAG Leu Let, Leu Pro Gly Glu'Os
GGT
ACC
Gly
Thr
AAG GCT GTC ACC AAG Lys Ala Val Thr LYS'~~
TM
f**
ATGTGTGCTT AGGTGCTTTA MACTCAAAG
AGTCTCACAT
MAGAGCTTT
AATATTGMT
GCTCTTTTCA GAGCCACTCA -6TTCACCGTTT TCTAGGGMT AAGGGMTTT
TTCGATTTTG
TAATCCCAGC
ACTTT
FIG. 5. Sequence of H2B.l gene (a), adjacent to H2A.l gene described in Fig. 4. The mRNA-like strand is shown in 5’ to 3’ direction. The top line shows the complementary sequence, beginning with the ATG initiation codon of the H2A.l gene described in Fig. 4. Underlined elements 4, 5, and 6 as in Figs. 2-4; 3, Ott-1 binding site (Ref. (34)).
HISTONE
GENE
CLUSTERS
H2B.l(b) AGCGAAGTGC
ACAGCTGTCT
TGCCTAAAGA
GGGTAGTCTG GTCTGGGCTT TCTCATTGGG TACAAGTAM
AGCCMTGM
GGAACGAAAT
GTTTAcATTG GCATTTGTGA CGACACTCTA
MGCATAAAC
TAACCTCATT
GAAATCTGAT
FMTACCGC
ATCET
-TTMTE
Pro
Glu
CCT ACC MG TCT GCT CCT GCC CCA AAG AAG GGC TCC MG PrO Thr LyS Ser Ala Pro Ala Pro Lys Lys Gly Ser Lys
MG LyS
GCG GTG ACT MG GCT CAG MG Ala Val Thr LyS Ala GlR Lys
AAG Lys
GAC Asp
GTG
T A T GTG TAC Tyr Val Tyr
CGC AGC CGC AAG GAG AGC TAT Se= A=g LYS Glu Ser Tys
MG Lys
GTC CAT CCC GAC Val HIS Pro Asp
CAG
Gin
ATC ATG MT Zle Met Asn
TCA
Ser
ACC GGC ATC Thr Gly Ile
TCC TTC GTC MC Ser Phe Val Asn
GAC
Asp
GCT TCC CGC CTG GCG CAT TAC Ma Ser Arg Leu Ala His Tyr
GAG
Glu TCC
Val
GGG MG Gly tys
AAG CGC MG Lys Arg Lys
MC MG Asn Lys
Ser Arg
CTG GCC MG Leu Ala Lys
GCC GTG TCG GAG Ala Val Ser Glu
CAC
His
GGC
Gly
MG Thr Lys
CTTTGCCMG
TAAGCATCTT TACACCTAAT
TTTCMTXM
TGAGTTGTM
AGTACCMTC
TTTMTGTTA
FIG. 6. Regulatory
CTTTTMGAG
TCATTTCATT
CGTTAGTACT
GCAGAGAAAT
AACTTGGMG
Sequence of H2B.l sites are underlined
GGC Gly'5
TTACAGGGM
AAAATAAAYGTGMT
GATCACTMC
AGGGATCGTC
CACAAT~CAGAGTGAT
TTCCTAACTC
ATTTACTTTG CAGATGMCT
AAG CAG ACG GCT CGT Lys
Gin
Thr
Ala
Arg
AAATCCACA Lys
Ser
Thr
CTCTACGGCC
-
GGC GGT Gly Gly Lys
GTG GCT CTG CGC GAG ATC CGT CGC TAC CAG AAG TCT ACC GAG CTT Val Ala Leu Arg Glu Ile Arg Arg Tyr Gin Lys Se2 Thr Glu Leu 60 CGG AAG CTG CCG T T T CAG CGC CTG GTG CGA GAA ATA GCT Arg Lys Leu Pro Phe Gin Arg Leu Val Arg GIu Ile Ala 75
CAG GAC TTC AAG ACC GAC CTG CGC TTC CAG
AGT
Gln
Sef
Phe
Lys
GCG CTG CAG Ala Leu Gin
GAG
Asp
ACC
Thr
CCT MA Pro Lys
MC Asn
Glu
Thr
Asp
Arg
Phe
Gln
TCC GCG GTG ATG Se? Ala
Val
Met
9o
GCC TGC GAG GCC TAC T T G GTG GGG CTT TTC GAG Ala Cys Glu Ala Tyr Leu Val Gly Leu Phe G/do5
CTG TGC GCT Leu Cys Ala
GAC ATC Asp Ile
Leu
CAG
Gin
ATT
Ile
CAT GCC AAA CGC GTG ACC His Ala Lys Arg Val Thr
CTT GCC CGC CGC A T T CGT GGG GAG Leu Ala Arg Arg Ile Arg Gly Glu
ATC ATG Ile Hebzo AGG Arg
GCG Ala”5
TGA *** A T T G T T T T G A GTACAAACCT
TAAATCCAM
TGAAAGGGCT GTAATCCTTT
GAGACGCATT
CATTTGAAAT
FIG. ulatory
AGTGGCATTC AGTTCCCTCG
Gly
Gly
Lys
Gly
Leu
CGT GAC MC Arg Asp Asn
Gly
tys
GGA GGC GCC Fly Gly Ala I5
ATA CAG GGC ATC ACG Ile Gin Gly Ile Thrao
A T T TCG GGC CTC A T T T A T GAG GAG ACC CGC GGT O T T CTT MG GTG Ile Ser Gly Leu Ile Tyr Glu Glu Thr Arg Gly Val Leu LyS Val 6o TTC CTG GAG MT Phe Leu Glu Am
MG Lys
GTG ATA CGG GAC GCC GTA ACC TAC ACG GAG CAC Val 11e Arg Asp Ala Val Thr Tyr Thr Glu His"
CGT AAG ACA GTC ACT Arg Lys Thr Val Thr
CGC CAG GGA CGC ACT Arg Gln Gly Arg Thr
GCA Ala
ATG Met
GAT G T T GTC TAC GCG CTC Asp Val Val Tyr Ala LeU go
CTG TAC GGC T T T GGT GGC TGA Le" Tyr Gly Phe Gly Gly ***
GCCTCACCCC
G G C T T T T T A T TTMCAGCTC
CCTTCGTCAC
ACGAAGGGCT
GTAACTGATG
RCCCAT-
GGCCCTTTTC AGGGCCACCT -sACGACTTGGG T T T C G T T T T G TAAATTTGGG
ATTCTAACTG
AGTTAAACCG
AGCCGTTTTT AGCGATCTTC
CTAAGATGGC
GGATGTGCTA
of H4 gene (upstream of H1.3). The CCAAT, elements are labeled 4-6 as in Figs. 2-7. In H4-regulatory element (Ref. (8)) is indicated
--GGCTCTTCTC AGAGCCMCC ACTGCACTGA
ACTTTGTCCG
AGACCACTL
TCC-TAGA
ATGCTTCATT TAAATTTCCA
Mm"rMGCG
7. Sequence of H3.1 gene (between H2A.l and H1.3). sites are underlined and labeled as in Figs. 2-6.
this scheme (except for an Ala/Val amino acid exchange in H1.3). SPKK, i.e., the beginning of the conserved octapeptide, has been defined as a DNA-binding motif by Suzuki (1989). Langan et al. (1981) have described a growth-associated phosphorylation site at threonine 153 (followed by Pro, Lys) of an Ehrlich ascites tumor cell Hl histone. Since the threonine at the beginning of the conserved octapeptide is at position 154 of, e.g., H1.3, we may conclude that the conserved octapeptide may be a site of phosphorylation of the Hl protein and one of the Hl DNA binding sites.
1s
GCC ACG GGC GGC GTG AAG AAG CCC CAT CGC TAC CGC CCT GGC ACC Ala Thr Gly Gly Val Lys Lys Pro His Arg Tyr Arg Pro Gly Thr's
Asp
Lys
GCA
Ala
CCG CGC AAA CAG CTG GCC ACT AAG GCA GCT CGC AA0 AGC GCT CCG Pro Arg Lys Gln Leu Ala Thr Lys Ala Ala Arg LyS Sef Ala Pro 30
CTA ATC Leu Ile
Gly
gene (b) in the vicinity of Hl.4. and labeled as in Fig. 5.
TTGCACTGM
Thr
Arg
FIG. 8. Sequence TATA, and stem-loop addition, a tripartite by dashed lines.
GTTATTGGAC TTAGCACCAC
AATCAGAATC
Arg
GTTGTTGTTT GTCTTCGATC
GCC MG Ala Lys
CCACGCATGT
ACTTCCGGAA TTTAGCAACC
iii%
MAGTTAAGA
AAAGGCGGAG
CCC GCC ATC CGT CGC T T G GCC CGA CGC GGC GGC GTG AAA CGC Pro Ala Ile Arg Arg Leu Ala Arg Arg Gly Gly Val LyS Arg"
I-n.1
TCTGTTCCTA TATAGAGGGG CAMCCAATC
AGGTCCGCAT MTTGATATA
MG Lys
starts in all mammalian Hl species with a threonine and in all other sequences examined with a serine (Tsnjes and Doenecke, 1987; Eick et al., 1989). The comparison shows that all human Hl histones follow
CAAATTTACC
AGTCTCCAM
30
GCC GTC ACC AAG Ala val Thr Lysl2a
CCCAAAGGCT -sCAAAAGGGGG
CCTCCAATTC
CGC CAT CGC AAA GTG CTG Arg His Arg Lys Val Leu
ACC AGT TCC AAG TM Thr Ser Ser Lys ***
TAC Tyr
ACAGAGGTTT CGTTCCCGCC
MG Lys
CGC TCG ACC ATC ACC Arg Ser Thr Ile Thr 90
ACC
CCTCACCGCC
MGTTACGGA
15
GAG ATC CAG ACG GCC GTG CGC CTG CTG CTT CCG GGG GAG Glu Ile Gin Thr Ala "al Arg Le" Leu te" Pro Gly GlU105
AGG
TCAACTTGCA TTTCCCCTAC
TAGTGTAGGG GACGCCCAGT
Gly
AAG GTG CTG Lys Val Leu 45
CGC ATC GCA Arg Ile Ala
AGGAGTCAAA
Ser
TCT TCC AAG GCA ATG GGG Ser Ser Lys Ala Met Gly 60
ATC TTC GAG Ile Phe Glu
CAGGAAACGG
CCCTTTCCTT CCCAAGGCM
ATG TCT GGT AGA GGC AAA GGT GGT AAA GGT T T A GGA MG
TMCGCTACG
ATG CCT GM
GAC
945
GENES
TAAGGGGCTT CAGTGTGTAG CAA,&CA -
GMCAGGGCC
TCGGCGGGAG T G A T T A T T T T CTCAGGTGTT TGCAACAGTG TTCTAkTAT
Arg
CAGACCCMA
GGCCTATTTA TAGTAGCCTC
AAGGTAGACT TTTMGTGTC
Ala
Hl
H4
MCTGCTCCA
ATG GCG
INCLUDING
Reg-
CONCLUSIONS The data presented here show that the genes coding for the human Hl histone subtypes Hlc and Hld are associated with core histone genes coding for the subtypes H2A.1, H2B.1, H3.1, and H4. One amino acid exchange between the two H2B.l gene products suggests that even the core histone subtype classification will need a revision upon further sequencing of human histone genes. Hl histones are known as the most complex group of histones. In mammals, five subtypes have been defined first by two-dimensional electrophoresis (Lennox and Cohen, 1983) and subsequently by HPLC. In addition, Hlt and Hl” represent Hl histones that are confined to just one cell type (spermatocytes, early spermatids, Hlt) or to several terminally differentiated tissues (Hl”).
946 H1.l H1.2 H1.3 H1.4 H10
ALBIG
ET
AL.
-A;KKKPA.GPSV;
sET"PPAPA:sAAPEKPLA~KKA-KK-P*i***---SETAPAAPAAAPPAEKAPVKKKAAKK--AGGTP---RKAS-----GPPVS SETAPLAPTIPAPAEKTPVKKKA-KK--A-GATAGKRKAS-----GPPVS SETAPAAPAAPAPAEKTPVKKKA-RKS TENSTSAPAAKPK------RAKASKKS-T-DHPK--------------YS
_ _ - -
.~
A-GA--AKRKAS-----GPPVS
~~~~
ELITKAVAASKERSGVSLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKG ELITKAVAASKERSGVSLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKG DMIVAAIQAEKNRAGSSRQSIQKYIKSHYKVGENADSQIKLSIKRLVTTG v TLVQTKGTGiSGSFKLNK.KASSVETKPGAV-----TLVQTKGTGASGSFKLNKKAASGEAKPKV----------KKAGGTKPKKP TLVQTKGTGASGSFKLNKKAASGEGKPKA----------KKAGAAKPRKP TLVQTKGTGASGSFKLNKKAASGEAKPKA----------KKAGAAKAKKP VLKQTKGVGASGSFRLAKSDEPKKSVAFKKTKKEIKKVATPKKASKPKKA
--S&'ATKTKA--V
T-GASKKLK~AT-GAS-KK~V-KTPKKAK~P--AAT--RP--KSSKNPK~ V-GAAKKPKKAAGGATPKKSAKKTPKKAKKPA-AATVTKKVAKS---PKK A-GAAKKPKKVAGAATPKKSIKKTPKKVKKPATAA-GTKKVAKSA---KK A-GAAKKPKKATGAATPKKSAKKTPKKAKKPAAAA-GAKK-AKS---PKK ASKAPTKKPKATPVKKAKKKLAATPKKAKKP------------------PKTVKPKKViKSPAKAKAVitPKAAKARVTiiP---KTAKPiKAAPKKK AKVAKPKKAAKS-A-AKAVKPKAAKP--------KVVKPKKAAPKKK VKTPQPKKAAKSPAKAKAPKPKAAKPKSGKPKV--T-KAKKAAPKKK AKAAKPKKAPKSPAKAKAVKPKAAKPKTAKPKAA---KPKKAAAKKK -KTVKAK-PVK--A-SKPKKAKPVKP-----KA-KSS-AKRAG-KKK
FIG. 9. Comparison alignment of sequences. highly conserved central lishing the three domain
(214 (212 (220 (218 (193
of human Hl histone sequences, presented in one-letter amino acid code. Dashes indicate gaps introduced for Every 10th position is indicated by an open arrowhead, and the highly conserved octapeptide is underlined. The domain starts around position 45 and ends around position 120 (borders indicated by solid triangles), thus estabstructure (Ref. (1)).
In 1986, Ohe et al. sequenced the Hlb protein. We have recently described two human Hl genes (H1.l and H1.2; Eick et al., 1989). Shortly thereafter, Ohe et al. (1989) published the sequences of human spleen Hlproteins Hla, Hlc, and Hld. This allows a comparison of the gene sequencesdescribed here with known protein and gene sequence data. Two of the four completely sequenced Hl genes (H1.3 and H1.4) code for proteins, which had been described at the protein sequence level as Hlc and Hld (Ohe et aZ., 1989). In addition to these four sequences, Carozzi et al. (1984) deduced a partial sequence from a human Hl gene, which was part of a histone gene cluster. This sequence fragment again differs from the four known protein (Hla-d; Ohe et al.) and gene sequences (this paper; Eick et al., 1989). This allows the conclusion that the number of Hl histone genes in the human genome exceeds the number of the five main types Hla-e by at least one or two additional Hl species. In addition to the analysis of two novel human Hl gene sequences, this paper describes the structure and arrangement of core histone sequences in the vicinity of these Hl genes. Thus, all human Hl histone genes (except Hl”; Doenecke and Tonjes, 1936) are associated with core histone genes. The arrangement of core histone genes varies in all these clusters, which is
in contrast to the genomes of lower eukaryotes showing tandem repeats of identical clusters. The number of human histone gene clusters is still unknown. According to Triputti et al. (1986), histone genes map to chromosomes 1,6, and 12. None of the human histone gene clusters published until now overlapped with each other. Only the two overlapping clones HHGl and HHG4 (Zwollo et al., 1984) appear to represent the same site as the segment covered by cosmid C5. Given the number of at least 6 Hl histone genes within histone gene clusters, we should conclude that more than one Hl gene may on the average be contained within each of the three major histone gene loci. ACKNOWLEDGMENTS This work was supported by the Deutsche schaft. We thank A. M. Frischauf for providing The technical assistance of Silvia Biittinger, Susanne Hellmold is gratefully appreciated.
Forschungsgemeinthe cosmid library. Christina Kunz, and
REFERENCES 1.
ALLAN, J., MITCHELL, T., HARBORNE, N., B~HM, L., AND CRANE-ROBINSON, C. (1986). Roles of Hl domains in determining higher order chromatin structure and Hl location. J. Mol. Bid. 187: 591-601.
HISTONE
GENE
CLUSTERS
2.
BENTON, D. W., AND DAVIS, R. W. (1977). Screening lambda recombinant clones by hybridization to single plaques in situ. Science 196: 180-182.
3.
BIRNBOIM, H. C., AND DOLY, J. (1979). A rapid alkaline extraction procedure for screening recombinant plasmid DNA. Nucleic Acids Res. 6: 1513-1526.
4.
CAPASSO, O., AND HEINTZ, N. (1985). Regulated expression of mammalian histone H4 genes in vivo requires a trans-acting transcription factor. Proc. Natl. Acad. Sci. USA 82: 56225626.
5.
CAROZZI, N., MARASHI, F., PLUMB, M., ZIMMERMAN, S., ZIMMERMAN, A., COLES, L. S., WELLS, J. R. E., STEIN, G., AND STEIN, J. (1984). Clustering of human Hl and core histone genes. Science 224: 1115-1117.
6.
CHANDLER, M. E., KEDES, L. H., COHN, R. H., AND YUNIS, J. J. (1979). Genes coding for histone proteins in man are located on the distal end of the long arm of chromosome 7. Science 206: 908-910.
7.
CHIRGWIN, J., PRZYBYLA, A., MACDONALD, R., AND Ru~R, W. J. (1979). Isolation of biologically active ribonucleic acid from sources enriched in ribonuclease. Biochemistry 18: 5294-5299.
INCLUDING
19.
Hl
947
GENES
zation and expression of histone 313. HERRMANN, B. G., BARLOWE, (1987). A large inverted duplication bination between chromosomes mal t complex inversion. CeU 48:
gene families.
Cell 25: 301-
D. P., AND LEHRACH, H. allows homologous recomheterozygous for the proxi813-825.
20.
HOFFMANN, I., AND BIF~NSTJEL, M. L. (1990). pendent regulation of histone precursor mRNA modulation of U7 snRNA accessibility. Nature
21.
LANGAN, T. A., ?iEILIG, C., AND LEICHTLING, B. (1981). Characterization of multiple-site phosphorylation of Hl histone in proliferating cells. In “Protein Phosphorylation” (0. M. Rosen and E. G. Krebs, Eds.), Vol. 8, pp. 1039-1052, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY.
22.
LENNOX, R. W., AND COHEN, L. H. (1983). The histone Hl complements of dividing and non-dividing cells of the mouse. J. Bial. Chem. 258: 262-268.
23.
Mm, A. M., AND GILBERT, W. (1977). A new method sequencing DNA. Proc. Natl. Acad. Sci. USA 74: 560-564.
24.
MAXSON, R., COHN, R., KEDES, L., AND MOHUN, T. (1983). Expression and organization of histone genes. Annu. Rev. Genet. 17: 239-277.
25.
OHE, Y., HAYASHI, H., AND IWAI, K. (1986). histone Hl: Isolation and amino acid sequence ant, Hlb. J. Biochem. 100: 359-368.
26.
COLE, K. D., YORK, R. G., AND KISTLER, W. S. (1984). The amino acid sequence of boar Hlt, a testis-specific Hl histone variant. J. Biol. Chem. 269: 13695-13702.
OHE, Y., HAYASHI, H., AND IWAI, K. (1989). Human spleen histone Hl. Isolation and amino acid sequences of three minor variants, Hla, Hlc and Hld. J. Biochem. 106: 844857.
27.
10.
COLE, K. D., KANDALA, J. C., AND KISTLER, W. S. (1986). Isolation of the gene for the testis-specific Hl histone variant Hlt. J. Biol. Chem. 261: 7178-7183.
PANYIM, S., AND C HALKLEY, R. (1969). A new histone found only in mammalian tissue with little cell division. Biochem. Biophys. Res. Commun. 37: 1042-1049.
28.
11.
D’ANDREA, R., COLES, L. S., LESNIKOWSKI, C., TABE, L., AND WELLS, J. R. E. (1985). Chromosomal organization of chicken histone genes: Preferred associations and inverted duplications. Mol. Cell. Biol. 5: 3108-3115.
RUIZ-CARIULLO, A., AFFOLTER, M., ANII RENAUD, J. (1983). Genomic organization of the genes coding for the six main histones of the chicken: Complete sequence of the H5 gene. J. Mol. Biol. 170: 843-859.
29.
COLES, L. S., AND WELLS, J. R. E. (1985). An Hl histone gene-specific 5’ element and evolution of Hl and H5 genes. Nucleic Acids Res. 13: 585-594.
SAIVIBROOK, J., FRITSCH, E. F., AND MANIATIS, T. (1989). “Molecular Cloning: A Laboratory Manual,” 2nd ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY.
30.
DENHARDT, D. T. (1966). A membrane the detection of complementary DNA. Res. Commun. 23: 641-646.
SANGER, F., NICKLEN, S., AND COULSON, A. R. (1977). sequencing with chain-terminating inhibitors. Proc. Acad. Sci. USA 74: 5463-5467.
31.
SEYEDIN, S. M., COLE, R. D., AND KISTLER, W. S. (1981). Histones from mammalian testes: The widespread occurrence of Hlt. Exp. Cell Res. 136: 399-405.
32.
SHANNON, M. F., AND WELLS, J. R. E. (1987). Characterization of the six chicken Hl proteins and alignment with their respective genes. J. Bial. Chem. 262: 9664-9668.
33.
SIMPSON, R. T. (1978). Structure of the chromatosome, a chromatin particle containing 160 base pairs of DNA and all the histones. Biochemistry 17: 5524-5531.
34.
Srv~, H. L., AND ROEDER, R. G. (1986). Multiple sequence elements are required for maximal in vitro transcription of a human histone HZB gene. Mol. Cell. Biol. 6: 3329-3340.
35.
SOUTHERN, E. (1975). Detection of specific sequences among DNA fragments separated by gel electrophoresis. J. Mol. Bial. 98: 503-517.
36.
STEIN, G. S., SIERRA, F., STEIN, J. L., PLUMB, M., -HI, F., CAROZZI, N., PROKOPP, K., AND BAWACH, L. (1984). Organization and expression of human histone genes. In “Histone Genes” (G. S. Stein, J. L. Stain, and W. F. Marzluff, Eds.), pp. 397-455, Wiley-Interscience, New York.
8.
9.
lla.
12.
13.
CLERC, R. G., BUCHER, P., STRUB, K., AND (1983). Transcription of a cloned Xerwpus gene in the homologous frog oocyte system lutionary conserved sequence motif in the cleic Acids Res. 11: 86418657.
BIRNSTIEL, M. L. Levis H4 histone depends on a evo-50 region. Nu-
filter technique for Biochem. Biophys.
DOENECKE, D., AND T~NJES, R. (1986). Differential distribution of lysine and arginine residues in the closely related histones Hl” and H5. Analysis of a human Hl” gene. J. Mol. Bid. 18’7: 461-464.
14.
EHRICH, L. J., CRAIG, A., FRISCHAUF, A. M., POUSTKA, AND LEHRACH, H. (1987). A family of cosmid vectors with multicopy R6K replication origin. Gene 5’7: 229-237.
15.
EICK, S., NICOLAI, M., MUMBERG, D., AND DOENECKE, D. (1989). Human Hl histones: Conserved and varied sequence elements in two Hl subtype genes. Eur. J. Cell Biol. 49: llO115.
16.
FEINnERG, A. P., &ND VOGELSTEIN, B. (1983). Atechnique radiolabeling DNA restriction endonuclease fragments high specific activity. Anal. Biachem. 132: 6-13.
17.
FRANKLIN, S. G., AND ZWEIDLER, A. (1977). Non-allelic ants of histones Za, 2b and 3 in mammals. Nature 266: 275.
18.
HENTSCHEL,
C. C., AND BIRNSTIEL,
M. L. (1981).
A., the
for to vari273-
The organi-
Cell-cycle deprocessing by 346: 665-668.
for
Human spleen of a main vari-
DNA Natl.
948 37.
38. 39.
40.
41.
42.
ALBIG SUNDSTR~~M, C., AND NILSSON, K. (1976). Establishment and characterization of a human histiocyte lymphoma cell line. Int. J. Cancer 17: 565-577. SUZUKI, M. (1989). SPKK, a new nucleic acid-binding unit of protein found in histone. EMBO J. 8: 797-804. THOMA, F., KOLLER, T., AND KLUG, A. (1979). Involvement of histone Hl in the organization of the nucleosome and of the salt-dependent superstructures of chromatin. J. Cell Biol. 83: 403-427. T~NJES, R., AND DOENECKE, D. (1987). A highly conserved sequence in Hl histone as oligonucleotide hybridization probe: Isolation and sequence of a duck Hl gene. J. Mol. Euol. 25: 361-370. TRIPPUTI, P., EMANUEL, B. S., CROCE, M., GREEN, L. G., STEIN, G. S., AND STEIN, J. L. (1986). Human histone genes map to multiple chromosomes. Proc. Natl. Acad. Sci. USA 83: 3185-3188. WEAVER, R. F., AND WEISSMANN, C. (1979). Mapping of
ET
AL. RNA by a modification of the Berk-Sharp procedure: The 5 termini of 15S-globin mRNA precursor and mature lOS-globin mRNA have identical map coordinates. Nucleic Acids Res. 7: 1175-1193. 43.
WEINTRAUB, superstructures 17-27.
43a.
WELLS, D. E., AND MCBRIDE, C. (1989). A comprehensive compilation and alignment of histones and histone genes. Nucleic Acids Res. 17: r311-r346. ZWEIDLER, A. (1984). Core histone variants of the mouse: Primary structure and differential expression. In “Histone Genes” (G. S. Stein, J. L. Stein, and W. F. Marzluff, Eds.), pp. 339-371, Wiley-Interscience, New York.
44.
45.
H.
(1984). Histone-Hl-dependent chromatin and the suppression of gene activity. Cell 38:
ZWOLLO, P., STEIN, in the organization taining Hl histone 124: 988-993.
G. S., AND STEIN, J. L. (1984). Variation of human genomic DNA segments congenes. Biochem. Biophys. Res. Commun.