4940 Nucleic Acids Research, Vol. 18, No. 16
Sequence of a genome with a
©-) 1990 Oxford University Press
replication competent Hepatitis B virus preX open reading frame
Ivan Francisco Loncarevic, Hanswalter Zentgraf* and Claus Hobe Schroder Institut fOr Virusforschung, Deutsches Krebsforschungszentrum, Im Neuenheimer Feld 280, D-6900 Heidelberg, FRG Submitted May 9, 1990
EMBL accession no. X52939
The complete sequence of a replication competent Hepatitis B Virus (HPV) full length genome (p4al ; 1) cloned via the unique XhoI restriction site from a hepatocellular carcinoma was 3215 base pairs long. As other sequenced HBV genomes it contains open reading frames (ORF) for the genes X (1247-1708), C (1774-2322), PreS/S (2586-705) and Pol (2180-1493). The genome can be attributed to the adr serotype and possesses an interrupted preC and an intact preX ORF, properties shared by 4 other HBV DNA clones from the same carcinoma (1). The presence of the preX ORF appears to be adr specific but not generally present on DNA of this serotype based on a comparison with published sequences (2, 3, 4, 5, 6).
GAACCGACAACTCTCTTGTC CCGCGGACGACCCGTCTCGG TGCCGGACCGTGTGCACTTC GACCGACCTTGAGGCATACT AGCACCATGCAACTTTTTCA GGAGCTTCTGTGGAGTTACT CCTCACCATACAGCACTCAG AATGTTAATATGGGCCTAAA CTCGCTTACAGACCACCAAA TCGCCGCGTCGCAGAAGATC GAAAACTCCCTCTTTTCCTA CTTCTATCCTAACCTTACCA TGGCATTCTATATAAGAGAG CGAATCTTTCTGTTCCCAAT AGGCAAAGCAGGTAGGAGCG
CAGCGCATGCGTGGAACCTT CTCTCTCGGAAATACACCTC GGCCGTTTGGGCCTCTATCG GCTTCACCTCTGCACGTCGC TCAAAGACTGTTTGTTTAAA CCTCTGCCTAATCATCTCAT CTCTTTTTTGCCTTCTGACT GCAAGCTATTCTGTGTTGGG
1. Loncarevic,I.F., Schranz,P., Zentgraf,H., Liang,X.-H., Herrmann,G., Tang,Z.-Y. and Schr6der,C.H. (1990) Virology 174, 158-168. 2. Fujiyama,A., Miyanohara,A., Nozaki,Z., Yoneyama,T., Ohtoma,N. and
Matsubara,K. (1983) Nucd. Acids Res. 11, 4601-4610. 3. Gan,R., Meijin,C., Luping,S., Suwen,Q. and Zaiping,L. (1987) Scientia Sinica B 30, 507-521. 4. Kobayashi,M. and Koike,K. (1984) Gene 30, 227-232. 5. Ono,Y., Onda,H., Sasada,R., Igarashi,K., Sugino,Y. and Nishioka,K. (1983)
Nucd. Acids Res. 11, 1747-1757. 6. Rho,H.M., Kim,K., Hyun,S.W. and Yong,S.K. (1989) Nucd.
CTGCTCGTGTTACAGGCGGG GTTTTTCTTGTTGACAAGAA TCCTCACAATACCACAGAGT AATTCGCAGTCCCCAACCTC CAATCACTCACCAACCTCTT GTCCTCCAACTTGTCCTGGC ATCTTCTTGTTGGTTCTTCT GGACTACAAAGGTATGTTGC CCGTTTGTCCTCTACTTCCA
120 240
TCTATGTTTCCCTCTTGTTG CTGTACAAMACCTTCGCACG CAAACTGCACTTGTATTCCC
480
GGCTTTCCCCCACTGTTTGG ACATTTGAACCCTAATAAAA
600
TCGAAAACTGCCTGTCAATA
CTGTATACAATCTAAGCAGG
140 960
CCGTTGCCCGGCAACGGTCA GGCCTCTGCCAACGTGTTTCC TGATGCAACCCCCACT8G1§
1060
TGTGGCTCCTCTGCCGATCC ATACTGCGGAACTCCTAGCA GCTTGTTTTGCTCGCAGCCG GTCTGGAGCGACACTTATCG start ~~~~~~~~~~~X
1200 1320 1440 1560
TCGAGGACTGGGGACCCTGC ACCGAACATGGAGAGCACAA CATCAGGATTCCTAGGACCC CTAGACTCGTGGTTGACTTC TCTCAATTTTCTAGGGGGAA CACCCAAGTGTCCTGGCCAA TATCGCTGOATGTGTCTGCG GCGTTTTATCATATTCCTCT TCATCCTGCTGCTATGCCTC GGAACATCA&CTACCAGCAC GGGACCATGCAAGACCTGCA CGATTCCTGCTCAAAACACC ATCCCATCATCCTGGGCTTT CGCAAGATTCCTATGGGAGT GGCCTCAGTCCGTTTCTCC CTTTCAGTTATATGGACGAT GTGGTATTGGGGGCCAAGTC TGTACAACATCTTGAGTCCC CCAAGCGTTGGGGCTACTCC CTTAACTTCATGGGATATGT AATTCGAAGTTGGCGTACTT GACCTATTGATTGGAAAGTA TGTCAGAGAATTGTGGGTCT TTTAGGCTTTGCITGCCCCTT CTTTCACTTTCTCGCCAACT TACAAGGCCTTTCTGTGTAA ACAATATCTGAACCTTTACC
&GGrGTTGGCCATTGGCCAT -
REFERENCES
TGGCTCAGTTTACTAGTGCC ATTTGTTCAGTGGTCTGCAG TTTTTACCTCTATTACCAAT TTTATGTTGTCTTTGGGCAT TACCACAAGAACATATTGTA CAAAAACTCAAGCAATGTTT TTACACAATGTGGCTATCCT CCCTTGATGCCTTTATATCC
CTTTCCATGGCTGCTACGGT GTGCTGCCAACTGGATCCTG CGCGGGACGTCCTTTGTCTA CGTCCCGTCGGCGCTGAATC TCCCCTTCTTCGTCTGTCGT TCCGGCCGACCACGGGGCGC ACCTCTCTTTACGCGGTCTC CCCGTCTGTGCCTTCTCATC ATCGAGACCACCGTGAACGC CCACCAGCTCTTGCCCAAGG TCTTATATAAGAGGACTCTT GGACTCTCAGCAATGTCAAC CACTGGGAGGAGTTGGGGGA GGAGATTACGTTAATGATCT TTGTACTAGGAGGCTGTAGG CATAAATTGGTCTGTTCACC GTTCATGTCCTACTGTTCAA GCCTCCAAGCTGTGCCTTGG GTGGCTTTAGGGCATGGACA TTGACCCGTATAAAGAATTT TCTTTCCGTCTGTTCGAGAT CTCCTCGACACCGCCTCTGC TCTCTATCGGGAGGCCTTAG AGTCTCCGGAACATTGTTCA TTGAGTTGATCAATCTGGCC ACCTGGGTGGGAAGTAATTT GGAAGACCCAGCATCCAGGG AATTAGTAGTCAGCTATGTC AATCAGACAACTATTGTGGT TTCACATTTCCTGTCTTACT TTTGGAAGAGAACTGTTTT AGAGTATTTGGTGTCTTTTG GAGTGTGGATTCGCACTCCT TGCCCCTATCTTATCAACAC TTCCGGAAACTACTGTTGTT AGACAACGAGGCAGGTCCCC TAGAAGAAGAACTCCCTCGC CTCGCAGACGAAGATCTCAA TAATCTCGGGMTCTCMT GTTAGTATCCCTTGGACTCA TAAGGTGGGAAACTTTACTG GACTTTATTCTTCTACTGTA CCTGTCTTTAATCCTGAGTG ACATTCATTTACAGGAGGAC ATTATTGATAGATGTCAACA ATATCTGGGCCCTCTTACAG TGAATGAAAAAAGGAGATTA AAATTAATTATGCCTGCTAG AGTATTTACCCTTCGATAAA GCCATTAAACCTTATTATCC TGAACATGCAGTTAATCATT ACTTCCAAACTAGGCATTAT TTACATACTCTGTGGAAGGC AAACTACACGCAGTGCCTCA TTTTGTGGGTCACCATATTC TTGGGAACAAGAGCTACAGC ATGGGAGGTTGGTCTTCCAA ACCTCGACAAGGCATGGGGA
CCTCTGGGATTCTTTCCCGA TCACCAGTTGGACCCTGCGT TCGGAGCCAACTCAAACAAT CCAGATTGGGACTTCAACCC CAACAAGGATCACTGGCCAG GGAGCATTCGGGCCAGGGTT CACCCCACCACACGGCGGCC TTTTGGGGTGGAGCCCTCAG GCTCAGGGCACATTGACAAC AGTGCCAGCAGCGCCTCCTC CTGCTTCCACCAATCGGCAG TCAGGAAGACAGCCTACTCC CATCTCTCCACCTCTAAGAG ACAGTCATCCTCAGGCCATG CAGTGGAACTCCACAACATT CCACCAAGCTCTGCTAGATC CAAGAGTGAGGGGCCTCTAT TTCCCTGCTGGTGGCTCCAG TTCCGGAACAGTAAACCCTG TTCCGACTACTGCCTCACCC ATATCGTCAATCTTC
*
To whom correspondence should be addressed
360
720
1680 1300 1920 2040 2160 2280 2400 2520 2640 2760
2960 3000
3120
3215