Gene, 97 (1991) 113-117

113

Elsevier

GENE

03839

A vector for directional cloning and expression of polymerase chain reaction products in Esciherichia coli (Expression

vector;

ferredoxin;

recombinant

DNA)

Mark E. Brandt, Ann H. Gabrik and Larry E. Vickery Department of Physiology and Biophysics, Universityqf Cal$ornia, Irvine, CA 92717 (U.S.A.) Received by G. Wilcox: 17 July 1990 Rev%sed: 27 August 1990 Accepted: 29 August 1990

SUMMARY

This paper describes the construction of a modified vector for the cloning and expression of protein-encoding genes in Escherichia coli. The vector, pfXblue, is derived from the system originally developed by Nagai and Thogerson [Nature 309 (1984) SIO-812], but contains a modified multiple cloning site (MCS) from M13mp18 to allow directional insertion of foreign coding sequences. The MCS is located within the Ml3mpI8 ZacZ’ gene and thus allows blue/w~te screening of colonies for inserts. The inserted gene is expressed as a fusion protein, which, when cleaved by the coagulation factor Xa protease, yields the mature product. This vector was successfully used for the production of a mitochondrial[2Fe-2Slferredoxin using polymerase chain reaction products generated from a chick kidney cDNA library.

INTRODUCTION

The synthesis of eukaryotic proteins in a prokaryotic system such as E. co& is commonly used to produce large quantities of protein for enzymatic and/or st~ctur~ studies. The accumulation of a foreign gene product, however, may have deleterious effects on the growth of the bacterial host. In addition, the eukaryotic protein may be prone to degradation by endogenous proteases. The approach of Nagai and Thargerson (1984) avoids some of these problems and has been used successfully by a number of laboratories for the production of foreign proteins (cf.,

Correspondenceto: Dr. L.E. Vickery, Department of Physiology Biophysics, University of California, Irvine, CA 92717 (U.S.A.) Tel. (714)856-6580; Fax (714)856-8540.

and

aa, amino acid(s); Ap, ampicillin; bp, base pair(s); cFd, chick mt ferredoxin; CIAP, calf intestinal alkaline phosIPTG, isopropyl-/?-D-thiogalactopyranoside; Km, kanamycin;

Abbreviations:

gene encoding phatase;

MBN, mung-bean nuclease; MCS, multiple cloning chondrial; nt, nucleotide(s); oligo, o~g~eox~bonucieotide; DNA replication; PCR, poiymerase chloro-3-indolyl-@-D-galactoside.

0378-l 119/91/$03.50

0

1991 Elsevier

chain reaction;

Science Publishers

site;

XGal,

mt, mitoori, origin of 5-bromo4

B.V. (Biomedical

Division)

Nagai and Thogerson, 1987). In this system, the expression vector is transformed into a defective bacteriophage A lysogenie E. coli strain. The synthesis of the exogenous protein, under the control of the /1pL promoter and the temperaturesensitive cl857-encoded repressor, occurs only after inactivation of the repressor by temperature shift; use of the repressor prevents synthesis of the potentially toxic exogenous product during the growth phase of the cells. The exogenous protein is then produced as a fusion with the first 31 aa of the I CII protein followed by a heptapeptide that includes a recognition sequence for the coagulation factor Xa protease. The fusion protein produced by this method tends to be more stable in the E. coli than the foreign protein alone (Hellebust et al, 1989); the protein of interest can be cleaved from the leader by the factor Xa protease. In addition, the vector supplies the AUG translation start site, allowing the expression in mature form of proteins in which N-terminal sequences would normally be removed during post-translational processing. The method of Nagai and Thogerson (1984) requires a number of steps for insertion of coding sequences into the expression vector (cf., Coghlan and Vickery, 1989). We have constructed a modified vector that allows insertion of

114 EXPERIMENTAL

a variety of sequences in a directional manner in frame with the a cII and factor Xa-coding region in a single step and which allows blue/white screening of the colonies for the presence of the insertion. Our aim was to construct a vector

(a) Construction of the expression vector Construction ofthe expression vector pfXblue is outlined in Fig. 1. It was first necessary to remove the BamHI site from the M13mp18 multiple cloning site to allow correct processing of a later intermediate with BamHI. M13mp18

that has general usefulness; however, it is designed as a system for expressing genes amplified from cDNA libraries by PCR. Hi&III

HindI

AND DISCUSSION

Be111

HindI ;

StiI

Ml3mp18

multiple

Synthetic

linker

Factor

El

Ezl

cloning site

Xa sequence

xcn sequence Modified 2ac.Z’

Purify 700 bp fragment

B&II ECORI .&OR1

ECORI Synthetic

’ Hi&III

B&II

linker 1. Ligate

5’-CCTCCGCGGCCATGG 3’-GGAGGCGCCGGTACCTTAA

2.HindIII Sk41

EcoRI

I

Hin,dII I Hi&III BamHI

Ligate HmdIII HtidIII

HindI

HindI

BUIAI ShII -L$ small fragment

Bar&II

9

Ligate h HindI

PCR

ZucZ’ 5’

ModifiedlaG

PCR Primers:

Primer

5’-CCTACCATGGTTACGAATTCGAGC

S&l Ligate

__)

3’ Primer

BamHI

5’-TCATATGTACCGCGGTTGAT

Fig. 1. Construction

of the expression

vector ptXblue.

Restriction

endonucleases

and other enzymes

used for DNA manipulation

were obtained

from

Boehringer-Mannheim, New England Biolabs, Stratagene, or United States Biochemicals. The plasmids pfX8 and pMb3 were gifts from Charles Glabe (University of California, Irvine, CA) (Coghlan and Vickery, 1989); pMb3-EcoRI was produced by digesting with EcoRI followed by MBN, and then ligating the resulting blunt ends. Oligos for construction or for use as PCR primers were purchased from either the Biotechnology Instrumentation Facility (University of California, Riverside, CA), or Operon Technologies, Inc. (Alameda, CA). The E. coli strain DHSaF’IQ (Bethesda Research Laboratories, Gaithersburg, MD) was used for propagation of the intermediate products techniques (Sambrook et al., 1989). PCR (Saiki et al., 1988) was carried

during construction. out using methods

DNA manipulations adapted from those

were performed using standard included with the United States

Biochemicals GeneAmp Kit (United States Biochemicals, Cleveland, OH). Intermediate products were partially characterized by restriction mapping. The final construct was verified by double-stranded dideoxy sequencing (Sambrook et al., 1989) using an oligo primer corresponding to a segment of the I CII protein.

115 was

cleaved

with

BamHI

and

the

overhanging

ends

digested with MBN. The resulting blunt ends were ligated to yield an M 13mp 18 lacking the Barn HI site. The ligation mixture from this and all subsequent steps generating circular plasmids was used to transform competent DHSaF’IQ cells, followed by purification of the plasmids from the cells. The MCS of the modified M13mp18 was excised and purified as a 700-bp EcoRI-BgZII fragment. Oligos comprising a linker were synthesized, added in excess to the EcoRIBgZII fragment, and ligated. A 62-bp StuI-Hind111 fragment was released with HindIII, purified, and ligated to the large fragment of S&I + HindIII-digested pfX8. The resulting plasmid (pfX8mcs) was digested with BamHI + Hind111 to yield the 79-bp fragment required for the next step. The vector pMb3 contains an EcoRI site (Remaut et al., 1983; Varadarajan et al., 1985). This site was removed using the method described above for the removal of the BamHI site from M13mpl8 to allow future use of the unique EcoRI site from the MCS. EcoRI-deleted pMb3 was digested with BamHI + HindIII, and then used to accept the 79-bp BamHI-Hind111 fragment to yield pc2fX.

BamHI stu1 --NcoI GGATCCATCGAGGGTA-ACCATGGTTACGAATCC

Fig. 2. The structure the factor

EcoRI

of pfXblue. The underscored

Xa recognition

site. The pMb3-derived

. . . MCS

aa residues sequences

comprise (i.e., the

entire vector excluding the lacZ’ and the factor Xa region) consist of (clockwise from the IacZ’): the pUC19 lad gene, the ori and ApR gene from pBR322, an incomplete segment of a KmR gene, and the stated sequences from 1. This structure (determined by sequencing some of the intermediate

product

plasmids)

does not entirely

match

the structure

expected from the description ofthe construction ofpMb3 (Remaut et al., 1981; 1983; Varadarajan et al., 1985). The nt sequence data for pfXblue have been submitted to GenBank and assigned the accession number M36415.

The expression

vector pc2fX was improved

by the incor-

poration of a modified ZacZ’ gene from M 13mp18 to allow blue/white screening of colonies for insertions while maintaining the unique restriction sites from pc2fX. To produce the modified IacZ’, two PCR primers were synthesized. The 5’ primer contained two modifications (shown in italics) to the M13mp18 sequence. The first 3 nt (CCT), introduce one half of a St~1 site and do not anneal to the M 13mp 18. In addition, the M 13mp 18 sequence CCATGA was changed to CCATGG to introduce an NcoI site. These changes encode a LacZ’ protein that contains two modifications other than the A CII/factor Xa fusion: an extra Pro following the factor Xa sequence and the substitution of Val for the Ile at position 3 of the LacZ’ peptide (Fig. 2). The 3’ PCR primer contains a single nt mismatch to create a Sac11 site; this primer is complementary to a region downstream from the ZacZ’ stop codon. The modified ZucZ’ was amplified from an M 13mp18 template by PCR, phosphorylated with T4 polynucleotide kinase, and ligated to pc2fX that had been sequentially digested with HindIII, treated with MBN to remove the Hind111 overhang, digested with ,%I, and dephosphorylated with CIAP. Competent DHSaF’IQ cells transformed with the ligation mixture were plated on XGal and IPTG. One of the resulting blue colonies was sequenced to confirm that it contained the correct insert and was designated pfXblue. Fig. 2 depicts the structure of the completed vector and lists the 13 unique restriction sites available for cloning. Additional sites exist in the M13mp18 MCS (e.g., BamHI, PstI, and SacI) but these also exist in the region of the vector originating from pMb3 (the pL promoter and 2 cl1 sequences, and the pBR322-derived ori and the ApR gene). (b) Expression of chick ferredoxin We tested the expression vector by utilizing it to express the gene (cFd) coding for chick mt [2Fe_2S]ferredoxin. The general method is presented diagrammatically in Fig. 3. To amplify the cFd gene from the cDNA library, two PCR primers were designed based on the published sequence of the cFd gene (Kagimoto et al., 1988). The mt leader sequence was omitted from the construct by choosing the 5’ primer such that it began with the first nt of the mature protein coding sequence. The 3’ primer was chosen to be complementary to a sequence in the 3’-noncoding region containing a unique Hind111 site. A 400-bp band was amplifled from phage particles of a chick kidney cDNA library by PCR. This product, the putative cFd cDNA, was digested with Hind111 and phosphorylated with T4 polynucleotide kinase to produce double-stranded DNA suitable for ligation. This was then ligated into StuI + HindIII-cleaved pfXblue, and transformed into DHSaF’IQ cells.

116 Non-coding region 1. PCR 2. Kinase

Hind111

Hind111

\

f3’ PCR Primer Mature protein coding sequence

Leader sequence

Hind111

Hind111 .

Hin stu1

Ligate

Fig. 3. Insertion

of foreign DNA into pfXblue.

This figure illustrates

the general

method

for inserting

PCR products

into pfXblue.

The restriction

site

used in this example is the Hind111 site used for cFd. The cFd cDNA was amplified from a chick kidney library using two 23-mer PCR primers (5’ primer: 5’-TGCAGCTCAGAAGATAAAATAAC and 3’ primer: 5’-TAATTAAGCTTCTTTAAAACTCC). The chick kidney cDNA library in 1Zap (prepared by Stratagene)

was a gift of Dr. H. Henry

(University

of California,

Riverside,

Approximately 30 colonies were observed, of which six were blue. Plasmid DNA from one of the white colonies was partially sequenced to confirm that the cFd gene had inserted in-frame with the a ~11 and factor Xa region. This DNA was then used to transform competent MZ-1 cells. Production of the chick ferredoxin fusion protein was induced by incubation at 42” C for 3 h followed by incubation overnight at 37’ C. Chick ferredoxin was purified by methods developed for the human protein (Coghlan et al., 1988). As was found for the recombinant human ferredoxin produced in E. coli (Coghlan and Vickery, 1989) the

1.2 -

0.3 -

O250

300

400

350

Wavelength

450

500

(nm)

Fig. 4. Optical absorption spectrum for recombinant chick ferredoxin. The vector pfXblue/cFd was transformed into the defective 3, lysogen MZ-1 (Nagai and Thogerson, 1984); the cells were grown and induced to synthesize the chick ferredoxin protein. The protein was purified according homologue

to the methods (Coghlan

developed

et al., 1988).

for the purification

of the human

CA).

[2Fe-2S] center of the chick protein is assembled in vivo under these conditions of expression. The yield was approx. 6 mg of crude chick ferredoxin/liter of culture. Then absorption spectrum of the purified protein (Fig. 4) is similar to that of the human and bovine ferredoxins. (c) General applications of the vector for PCR products While this vector can be used for the expression of any DNA, it is of special utility for the expression of PCR products. One method, as is illustrated by the expression of the cFd, requires knowledge of nt sequences at each end of the coding region for the protein of interest. The 5’ PCR primer is designed to begin with the first nt of the coding region, and the 3’ primer is chosen complementary to a region following the stop codon. If a unique restriction site does not exist in the downstream noncoding region, the 3’ primer can be used to create one during the PCR process, either by single nt mismatches, or by additional noncomplementary sequences added to the 5’-end of this primer. In some cases this vector can also be used to express proteins for which only N-terminal sequence information is available. For this procedure, it is necessary to use a form of ‘single-sided’ PCR (Berchtold, 1989), in which a PCR primer is designed to match the known 5’ sequence (either an exact match or a degenerate primer based on the aa sequence). The 3’ primer must then be designed to complement known sequences from the library vector (e.g., sequences surrounding the insertion site in 2). In the case of J libraries, the cDNA inserts are cloned into an EcoRI site; since there is a unique EcoRI site in the cloning region of pfXblue, this enzyme can be used to guide the directional

117 insertion of the amplified cDNA into the vector, assuming that no EcoRI sites exist in the gene. Because the 3’ primer anneals to all of the DNA in the template mixture, only the 5’ primer confers specificity to the product and must be carefully designed to avoid the amplification of incorrect sequences from the template mixture.

REFERENCES Berchtold,

M.W.: A simple method

for direct

cloning

and sequencing

cDNA by the use of a single specific oligonucleotide and oligo(dT) in a polymerase chain reaction (PCR). Nucleic Acids Res. 17 (1989) 453. Coghlan, V.M. and Vickery, L.E.: Expression of human ferredoxin and assembly of the [2Fe-2S] center in Eschenkhia coli. Proc. Natl. Acad. Sci. USA 86 (1989) 835-839. Coghlan, V.M., Cupp, J.R. and Vickery, L.E.: Purification

(d) Conclusions We have modified the E. coli expression vector developed by Nagai and Thsgerson (1984) to incorporate several new features. As with the original vector, the expressed material is synthesized as a cleavable fusion protein using the Met codon supplied by the 1 CII protein as a translation start signal. The insertion site for the foreign gene, however, now begins with the StuI recognition sequence which allows for direct ligation of blunt cDNA fragments in frame with the factor Xa peptide recognition sequence - this is especially useful for cloning of DNA products derived by PCR. In addition, twelve other unique restriction sites were introduced to guide the directional insertion of the DNA of interest. Finally, the vector contains the ZucZ’ gene and therefore allows visual selection of colonies with insertcontaining recombinant plasmids.

zation

of human

placental

ferredoxin.

and characteri-

Arch. Biochem.

Biophys.

264

(1988) 376-382. Hellebust,

H., Murby,

Different

M., Abrahmsen,

approaches

Bio/Technology Kagimoto,

J.L., Waterman,

amino acid sequence

Biochem.

Biophys.

of hybrid

Enzymol. Remaut,

lambda. Remaut,

M.:

testis ferredoxin.

protein

of p-globin

produced

by sequence

in Escherichiu coli.

and sequence-specific

pro-

in Escherichia coli. Methods

produced

153 (1987) 461-481. P. and Fiers,

expression Gene

controlled

W.: Plasmid

vectors

by the pL promoter

for high-

of coliphage

15 (1981) 81-93.

E., Stanssens,

mature

S.O.:

protein.

155 (1988) 379-383.

H.C.: Synthesis

proteins

E., Stanssens,

efficiency

fusion

and Kagimoto,

chicken

H.C.: Generation

K. and Thogerson,

teolysis

M.R.

of mature

Res. Commun.

K. and Thegerson,

specific proteolysis of a hybrid Nature 309 (1984) 810-812. Nagai,

a recombinant

7 (1989) 165-168.

K., McCarthy,

Deduced Nagai,

L., Uhlen, M. and Enfors,

to stabilize

P. and Fiers, W. Inducible

human libroblast

interferon

high level synthesis

of

in Escherichia coli. Nucleic Acids

Res. 14 (1983) 4677-4689. Saiki, R.K., Gelfand, ACKNOWLEDGEMENTS

D.H., Stoffel, S., Scharf,

G.T., Mullis, K.B. and Erlich, H.A.: amplification of DNA with a thermostable

S.J., Higuichi,

R., Horn,

Primer-directed enzymatic DNA polymerase. Science

239 (1988) 487-491.

We thank Dr. Helen Henry for providing the chick kidney cDNA library. This work was supported by National Institutes of Health research grants DK30109 and GM43548 (to LEV) and training grant T32 CA09054 (MEB).

Sambrook,

J., Fritsch,

E.F. and Maniatis,

T.: Molecular

Laboratory Manual, 2nd ed. Cold Spring Harbor Spring Harbor, NY, 1989. Varadarajan,

R., Szabo,

A. and Boxer,

Escherichia coli, and reconstitution Acad.

S.G.:

of human

Sci. USA 82 (1985) 5681-5684.

Cloning.

Laboratory,

Cloning,

expression

myoglobin.

A

Cold in

Proc. Natl.

A vector for directional cloning and expression of polymerase chain reaction products in Escherichia coli.

This paper describes the construction of a modified vector for the cloning and expression of protein-encoding genes in Escherichia coli. The vector, p...
555KB Sizes 0 Downloads 0 Views