Structure of the gene for human beta 2-adrenergic receptor ...

5
Proc. Natl. Acad. Sci. USA Vol. 84, pp. 6995-6999, October 1987 Biochemistry Structure of the gene for human f32-adrenergic receptor: Expression and promoter characterization (glucocorticoid induction/transcription factor SpI binding sites) L. J. EMORINE*, S. MARULLO, C. DELAVIER-KLUTCHKO, S. V. KAVERI, 0. DURIEU-TRAUTMANN, AND A. D. STROSBERG Department of Biotechnology, Pasteur Institute and University Paris VII, 25, rue du Docteur Roux, 75724 Paris Cedex 15, France Communicated by Harden M. McConnell, June 5, 1987 ABSTRACT The genomic gene coding for the human (32-adrenergic receptor (fi2AR) from A431 epidermoid cells has been isolated. Transfection of the gene into eukaryotic cells restores a fully active receptor/GTP-binding protein/adenylate cyclase complex with ,32AR properties. Southern blot analyses with (32AR-specific probes show that a single PAR gene is common to various human tissues and that its flanking se- quences are highly conserved among humans and between man and rabbit, mouse, and hamster. Functional significance of these regions is supported by the presence of a promoter region (including mRNA cap sites, two "TATA boxes," a "CAAT box," and three G+C-rich regions that resemble binding sites for transcription factor Spl) 200-300 base pairs 5' to the translation initiation codon. In the 3' flanking region, se- quences homologous to glucocorticoid-response elements might be responsible for the increased expression of the (32AR gene observed after treatment of the transfected cells with hydrocortisone. In addition, 5' to the promoter region, an open reading frame encodes a 251-residue polypeptide that displays striking homologies with protein kinases and other nucleotide- binding proteins. Peripheral catecholamine-responsive systems are mainly characterized by four types of cell surface receptors: the a,-, a2-,831-, and p2-adrenergic receptors (a,-, a2-, /3l-, and /32AR), which are coupled to different effector systems via specific GTP-binding proteins (G proteins). The a1AR modulates levels of inositol trisphosphate through phospholipase C activation, whereas a2AR and 8AR act on cAMP levels by modulating adenylate cyclase activity (1). Cellular responses to catecholamines depend on the type and ratio of each receptor. In particular tissues, this ratio may be controlled by external factors, such as temperature, and by the action of thyroid, glucocorticoid, or estrogen hormones, which exert their action by modulating the number of receptors (2, 3). In view of the complexity of the adrenergic systems, it is necessary to define the various components involved in the tuning of cellular responses to catecholamines. This requires characterization of the receptor, transducer, and effector components; identification of the interacting regions of each of these components; and localization of DNA elements responsible for the regulated expression of receptor genes. Molecular cloning and eukaryotic cell expression of specific genes constitute a powerful approach to these problems. Several cDNAs coding for the three subunits of various G proteins have been characterized (6). Turkey erythrocyte ,I1AR cDNA and hamster lung and human placenta and brain /32AR cDNAs have also been studied recently (7-10). Here we describe the human gene coding for the FEAR of the A431 epidermoid cell line.t Expression of this gene in rabbit TP3 cells, which are devoid of a 32-agonist-responsive system, restores a functional, agonist-dependent 82AR/G- protein/adenylate cyclase complex. The p2AR proximal promoter has been characterized, and potential DNA regu- latory elements such as binding sites for glucocorticoid receptor have been defined. MATERIALS AND METHODS Cell Lines. A431 human epidermoid carcinoma cells ex- press high levels of the P32AR (11), whereas CCL137 cells (HEL299 from the American Type Culture Collection) from human embryonic lung are devoid of FAR (C.D.-K., unpub- lished data). C6 murine glioma cells contain mainly P1AR (12). TP3 is a rabbit splenocyte line (13) selected for hypo- xanthine (guanine) phosphoribosyltransferase deficiency. Recombinant DNA Techniques. A library of size-selected EcoRI fragments of A431 genomic DNA was constructed in EMBL4 phages (14) and probed (15) with oligonucleotides derived from a hamster lung P32AR cDNA sequence: base pairs (bp) 869-906 and 926-963 on the coding strand and bp 898-935 on the opposite strand (8). For Southern (16) blots, hybridization and washing were as described (17). The DNA fragment to be sequenced was subcloned in both orientations in bacteriophage vector M13mpl8. Continuous overlapping subclones were obtained as described (18) and sequenced (19). To avoid electrophoretic band compression due to secondary structure in G+C-rich regions, dITP or 7-deaza-dGTP was used in place of dGTP (20). For nuclease S1 mapping of the mRNA cap site, M13 deletion subclones were used as templates to generate high- specific-activity single-stranded probes (21) H/3R-9, Hf3R-10, and H/3R-11 (Fig. lb), which begin, respectively, at positions 1303, 1243, and 1026 and extend to restriction sites at positions 962 (Nae I), 887 (Sma I), 245 (Pvu II). Probe (5 x 105 cpm) was mixed with 50 jig of RNA and processed as described (14), except that formamide was omitted and hybridization was at 650C for 14 hr. Transfection of Eukaryotic Cells and Assays of Receptor Binding and cAMP. DNA containing the gene to be expressed was cotransfected into TP3 cells with the selection plasmid pSV2gpt by the calcium phosphate method (23). After 2 weeks in selective medium, resistant cells were screened for gene expression by mRNA dot blot assays. Cells from Abbreviations: aAR (aIAR or a2AR), a-adrenergic receptor(s) (of the a, or a2 subtype); 3AR (PIAR orI32AR), ,B-adrenergic receptor(s) (of the PI or /2 subtype); CGP-12,177 (CIBA-Geigy product 12,177), 4-(3-t-butylamino-2-hydroxypropoxy)benzimidazol-2-one; ICYP, 3- iodocyanopindolol; ORF, open reading frame; G protein, GTP- binding protein. *To whom reprint requests should be addressed. tThe sequence reported in this paper is being deposited in the EMBL/GenBank data base (Bolt, Beranek, and Newman Labora- tories, Cambridge, MA, and Eur. Mol. Biol. Lab., Heidelberg) (accession no. J02960). 6995 The publication costs of this article were defrayed in part by page charge payment. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. §1734 solely to indicate this fact.

Transcript of Structure of the gene for human beta 2-adrenergic receptor ...

Page 1: Structure of the gene for human beta 2-adrenergic receptor ...

Proc. Natl. Acad. Sci. USAVol. 84, pp. 6995-6999, October 1987Biochemistry

Structure of the gene for human f32-adrenergic receptor:Expression and promoter characterization

(glucocorticoid induction/transcription factor SpI binding sites)

L. J. EMORINE*, S. MARULLO, C. DELAVIER-KLUTCHKO, S. V. KAVERI, 0. DURIEU-TRAUTMANN,AND A. D. STROSBERGDepartment of Biotechnology, Pasteur Institute and University Paris VII, 25, rue du Docteur Roux, 75724 Paris Cedex 15, France

Communicated by Harden M. McConnell, June 5, 1987

ABSTRACT The genomic gene coding for the human(32-adrenergic receptor (fi2AR) from A431 epidermoid cells hasbeen isolated. Transfection of the gene into eukaryotic cellsrestores a fully active receptor/GTP-binding protein/adenylatecyclase complex with ,32AR properties. Southern blot analyseswith (32AR-specific probes show that a single PAR gene iscommon to various human tissues and that its flanking se-quences are highly conserved among humans and between manand rabbit, mouse, and hamster. Functional significance ofthese regions is supported by the presence of a promoter region(including mRNA cap sites, two "TATA boxes," a "CAATbox," and three G+C-rich regions that resemble binding sitesfor transcription factor Spl) 200-300 base pairs 5' to thetranslation initiation codon. In the 3' flanking region, se-quences homologous to glucocorticoid-response elements mightbe responsible for the increased expression of the (32AR geneobserved after treatment of the transfected cells withhydrocortisone. In addition, 5' to the promoter region, an openreading frame encodes a 251-residue polypeptide that displaysstriking homologies with protein kinases and other nucleotide-binding proteins.

Peripheral catecholamine-responsive systems are mainlycharacterized by four types of cell surface receptors: the a,-,a2-,831-, and p2-adrenergic receptors (a,-, a2-, /3l-, and /32AR),which are coupled to different effector systems via specificGTP-binding proteins (G proteins). The a1AR modulateslevels of inositol trisphosphate through phospholipase Cactivation, whereas a2AR and 8AR act on cAMP levels bymodulating adenylate cyclase activity (1). Cellular responsesto catecholamines depend on the type and ratio of eachreceptor. In particular tissues, this ratio may be controlled byexternal factors, such as temperature, and by the action ofthyroid, glucocorticoid, or estrogen hormones, which exerttheir action by modulating the number of receptors (2, 3).

In view of the complexity of the adrenergic systems, it isnecessary to define the various components involved in thetuning of cellular responses to catecholamines. This requirescharacterization of the receptor, transducer, and effectorcomponents; identification of the interacting regions of eachof these components; and localization of DNA elementsresponsible for the regulated expression of receptor genes.Molecular cloning and eukaryotic cell expression of specificgenes constitute a powerful approach to these problems.Several cDNAs coding for the three subunits of various Gproteins have been characterized (6). Turkey erythrocyte,I1AR cDNA and hamster lung and human placenta and brain/32AR cDNAs have also been studied recently (7-10).Here we describe the human gene coding for the FEAR of

the A431 epidermoid cell line.t Expression of this gene in

rabbit TP3 cells, which are devoid of a 32-agonist-responsivesystem, restores a functional, agonist-dependent 82AR/G-protein/adenylate cyclase complex. The p2AR proximalpromoter has been characterized, and potential DNA regu-latory elements such as binding sites for glucocorticoidreceptor have been defined.

MATERIALS AND METHODS

Cell Lines. A431 human epidermoid carcinoma cells ex-press high levels of the P32AR (11), whereas CCL137 cells(HEL299 from the American Type Culture Collection) fromhuman embryonic lung are devoid of FAR (C.D.-K., unpub-lished data). C6 murine glioma cells contain mainly P1AR(12). TP3 is a rabbit splenocyte line (13) selected for hypo-xanthine (guanine) phosphoribosyltransferase deficiency.Recombinant DNA Techniques. A library of size-selected

EcoRI fragments of A431 genomic DNA was constructed inEMBL4 phages (14) and probed (15) with oligonucleotidesderived from a hamster lung P32AR cDNA sequence: basepairs (bp) 869-906 and 926-963 on the coding strand and bp898-935 on the opposite strand (8). For Southern (16) blots,hybridization and washing were as described (17).The DNA fragment to be sequenced was subcloned in both

orientations in bacteriophage vector M13mpl8. Continuousoverlapping subclones were obtained as described (18) andsequenced (19). To avoid electrophoretic band compressiondue to secondary structure in G+C-rich regions, dITP or7-deaza-dGTP was used in place of dGTP (20).For nuclease S1 mapping of the mRNA cap site, M13

deletion subclones were used as templates to generate high-specific-activity single-stranded probes (21) H/3R-9, Hf3R-10,and H/3R-11 (Fig. lb), which begin, respectively, at positions1303, 1243, and 1026 and extend to restriction sites atpositions 962 (Nae I), 887 (Sma I), 245 (Pvu II). Probe (5 x105 cpm) was mixed with 50 jig of RNA and processed asdescribed (14), except that formamide was omitted andhybridization was at 650C for 14 hr.

Transfection of Eukaryotic Cells and Assays of ReceptorBinding and cAMP. DNA containing the gene to be expressedwas cotransfected into TP3 cells with the selection plasmidpSV2gpt by the calcium phosphate method (23). After 2weeks in selective medium, resistant cells were screened forgene expression by mRNA dot blot assays. Cells from

Abbreviations: aAR (aIAR or a2AR), a-adrenergic receptor(s) (ofthe a, or a2 subtype); 3AR (PIAR orI32AR), ,B-adrenergic receptor(s)(of the PI or /2 subtype); CGP-12,177 (CIBA-Geigy product 12,177),4-(3-t-butylamino-2-hydroxypropoxy)benzimidazol-2-one; ICYP, 3-iodocyanopindolol; ORF, open reading frame; G protein, GTP-binding protein.*To whom reprint requests should be addressed.tThe sequence reported in this paper is being deposited in theEMBL/GenBank data base (Bolt, Beranek, and Newman Labora-tories, Cambridge, MA, and Eur. Mol. Biol. Lab., Heidelberg)(accession no. J02960).

6995

The publication costs of this article were defrayed in part by page chargepayment. This article must therefore be hereby marked "advertisement"in accordance with 18 U.S.C. §1734 solely to indicate this fact.

Page 2: Structure of the gene for human beta 2-adrenergic receptor ...

6996 Biochemistry: Emorine et al. Proc. Natl. Acad. Sci. USA 84 (1987)

00 ~~-0 E 0 -E=1> c -o a. =oM ,a.n E o a

Ia] I1lull TA ____,X I I

U

II 3...J.LL6I1 1 I L ~ ll IKb1

sr =w, --_-o c Z - C c

HQ ~> KJ b

-- - - -

- -

C

GAArTCTCATTGCATCTCCAGTTCAACAGATAATGAGTGAGTGATGCCACAC ICtCAAGAGT TAAAAACAAAACAACAAAAAAATTAAAACAAAAGCACACAACTTTIC TCTCTCT6TCCC 120

AAAATACATACTrrCCTACCCCGCCCAGATAAAATCCAAAGGGTAAAACTGTCTTCATGCCTGCAAATTCCTAAGAGGGCCCTAAAGTACTrGACAGCGAr GTGTGGAAATC 240

GCAG^C Ct GTTGAAGCACC*C C TGTCTCCT*T*CCAc^<TTTGAAAIGGccAATACAc CCG6T TCCGGGTGTATGT TGGGAGGGGAGCATTA^TCAGTGCT CGGGTGAGGCAAGTT cCGGAG 360

ACCCAAG~rGAGACATCCG rTGCTGTGTCGc CTTGGAT6CC TCCAAGCCAGCCGTGTGTCT**CT TrCTrGTGTGTrGTcCACCTGTC TTtCTGCCTTC TCGCTGC **C TGTGT TrGTrTT*ctrG~C 460

CGGTTCTr GrGrTGGACAcsGrGGGTGrC!TGTGCCGGATGGcrC TcTCTGTGGACGCGCGssCGGTGTGCATGTCGGTGAGCTrGGGAGGGTGTGTCTrCATGcTCTrAGGCTGTGGTTCGG 600

TATAAGTCT*GAGCATGTC * GCCAGGGTrGTAtTGTGCC TGTATGT6C6GCCTCGGT6GsCATC TCCTTr*CC*TtCC^TGT~iGGCcGTGCsCGTCTGCT*CCCTCTGCCTTGAACC 720

TCAAGCC~csc^GCGC GCCoCGGcGCAGGCAGTAGCGGCCACAG^AAGCoCAGC TCCCCCCTTGGCTrGGTAAGCACACCACCTsCCGCT rTAG£C-C-rCTTGGGGCCAGCCAG6GTAGCC 840

GGGAAGCAGTGGTGGC CCGCCCTCCA6GGAGCAGTT666CCCCGCCCGG6CCACTGC A66GAAGGA66GCGAGG666AGGAG6AAAGGGAGGAGTGCCTCGCCCCTTCGCGGCTG 960

CCGGCGTGC CATT66C CGAGTCCCGT TCACGGCGAGGGC TCCCCt A4%TCJGTGCACATAAC6GACGCACTGCGAA6CcXTTCTTCC^A6CCCTGGAAC 1060

TGGCAGGCACCGCGAGCCCC TACACCCGACAAGCTGAGtGTGCAGGCGAGtCCCCACCACACCCACACCACAGCCGCTGAAtGAGGCr CCAGGCGTCCGCTCGGCCC6CAGAGCC 1200

CCGCCGTGGGTCCGCCTGCTGAG6CGCCCCCAGCCA6TGCGCTTACCTGCCAGACTGCGCGCC AT6 666 CAA CCC GGG AAC GCC AGC CCC TTC TTS CTG GCA CCC 1305NET GLY GLN PRO GLY ASN GLY SER ALA PHE LEU LEU ALA PRO 14

AAAT GGA AGC CAT GCG CCG GAC CAC 6AC GIC ACG CAG CAA 6GG SAC GAG GrG tGGGTGG GGC ATG GGC ATC GtC ATG TCT CTC ATC STC 1395ASN GLY SER HIS ALA PRO ASP HIS ASP VAL rHR GLN GLN AUG ASP GLU VAL TRP VAL VAL SLY MET SLY IE VAL NET SER LEU LIE VAL 44

ARGACTG 6CC ATC G1G TTT GGC AAT GTG C I GIC ATC ACA GCC ArT 6CC AAG ITCGAG CGI CTG CAG ACc GTC ACC AAC TAC TIC AIC ACT ICA 1465LEU ALA ILE VAL PHE SLY ASN VAL LEU VAL ILE IHR ALA ILE ALA LYS PHE GLU ARC LEO GLN IHR VAL IHR ASN TYR PHE ILE THR SER 74

0 6CTG GCC 1G1 SCT SAT CTG GTC ATG GGC CTA GCA GTC GIG CCC ITT G66 6CC 6CC CAT ATT CTT ATG AAA ATC T16 ACT ITT GCC AAC ITC 1575LEU ALA CYS ALA ASP LEU VAL nET GLY LEU ALA VAL VAL PRO PHE SLY ALA ALA HIS ILE LEO nET LYS nET TRP IHR PHE SLY ASN PHE 104

1GG TGC GAG III TGG ACT TIC ATT SAT G6G CIG TGC STC AC6 GCC ASC AlT SAG ACC CTCG6C 6G6 ATC 6CA GTG GAT CG TAC ITT GCC 1665tRP CYS GLu PHE TRP THR SER ILE ASP VAL LEO CYS VAL THR ALA SER ILE CLU THR LEU CYS VAL ILE ALA VAL ASP ARN TYR PHE ALA 134

*CAATT ACT TCA CCT TTC AAG TAC CAG AGC CTG CtG ACC AAG AAT AA6 6CC CGG G6G ATC ATT CtG 6TG G6G tG6 AlT GIG ICA G6C CIT ACC 1755lLE IHR SER PRO PHE LYS TYR CLN SER LEU LEU THR LYS ASN LYS ALA ARG VAL ILE ILE LEU NET VAL TRP ILE VAL SER 6LY LEO IHR 164

SER COTCC TtC TTS CCC ATt CAG ATG CAC T1G TAC AGG GCC ACC CAC CAG GAA GCC ATC AAC TGC TAT GCC AAT GAG ACC TGC 15T G4C TIC *tC 1645SER PHE LEO PRO ILE 6LN nET HIS TRP TYR ARG ALA IHR HIS GLN GL4 ALA ILE ASN CYS TYR ALA ASN 6LU IHR CYS CYS ASP PHE PHE 194

ACG AAC CAA 6CC TAT GCC ATT GCC *CT tCC AIC GTG tCC TIC TAC SIT CCC CtG GTATC ATG GTC TIC StC TAC tCC AGS GIC IIT CA 1935THR ASN GLN ALA TIN ALA ILE ALA SER SER ILE VAL SER PHE TYR VAL PRO LEO VAL ILE NET VAL PHE VAL TYR SER AR6 VAL PHE GLN 224

GAG GCC 666 ASS CAG CTC CAG AAG ATT GAC AAA TCT GAG GGC CGC TTC CAT GTC CAG AAC CTT ACC CAG GTG GAG CAS SAT 666 CGG ACG 2025GLU ALA LYS ARG GLN LEO GLN LYS ILE ASP LYS SER GLU GLY ARG PHE HIS VAL GLN ASN LEO SER GLN VAL 6LU GLN ASP GLY ARG THR 254

666 CAT GSA CTC CGC AGA TCT TCC AAG TIC TGC TTS A6G GAG CAC AAA GCC CTC AAG ACG TtA GGC ATC ATC ATG GGC ACT TIC ACC CIC 2115SLY HIS SLY LEO ARG 6RG SER SER LYS PHE CYS LEO LYS CLU HIS LYS ALA LEO LYS THR LEO SLY ILE ILE NET GLY THR PHE IHR LEU 284

TGC TG16 CT CCC TTC TTC ATC GTT AAC ATT GTG CAT G5G ATC CAG GAT AAC CTC ATC CGT 66G 666 GTT TAC ATC CTC CIA AAT 166 ATA 2205CYS TRP LEO PRO PHE PHE ILE VAL ASN ILE VAL HIS VAL ILE GLN ASP ASN LEO ILE ARG LYS 6LU VAL TYR ILE LEO LEO ASN TRP ILE 314

GGC TAT GTC 66T TCt 6GT TIC AAT CCC CTT ATC TAC TGC CGG AGC CCA GAT TTC AGG 6TT GCC TTC CAG GAG CTT CTC TGC CTG CGC A6G 2295GLY TYR VAL ASN SER GLY PHE ASN PRO LEO ILE tYR CYS ARG SER PRO ASP PHE ARG ILE ALA PHE GLN GLU LEO LEO CIS LEO ARG ARG 344

0 GTCT TCT TtG A6G GCC TAT GGC AAT GGC TAC TCC A6C AAF GGC AAC ACA 666 GAG CAG A66 GGA TAT CAC G1G GAA CAC G66 666 G66 661 2385SER SER LEO LYS ALA TYR GLY ASN GLY TYR SER SER ASN GLY ASN THR 6LY GLU GLN SER GLY TYR HIS VAL GLU GLN 6LU LYS GLU ASN 374

AAA CTG CTG TGt GAA GAC CTC CCA GGC ACG GAA GAC ITT GIG GGC CAT CAA GGT ACT GIG CCT GC SAT AAC 61 SAT TCA CAA 666 AGG 2475LYS LEO LEO CYS GLU ASP LEU PRO SLY THR GLU ASP PHE VAL GLY HIS GLN GLY THR VAL PRO SER ASP ASN ILE ASP SER GLN GLY ARG 404

AAT TGT AGT ACA AlT GAC ICA CtG CtA TAAAGCAGSTTTTCTACTTtTAAAGACCCCCCCCCGCCCAACAGAACACTAAACAGACTATTTAACTTGAGSTIAATAAAC 2563ASN CYS SER THR ASN ASP SER LEU LEU

TTAGAATAAAATTGTAAAATTGTATAGAGATATGCAGAAGGAG6GGCATCC TTCTGCCTTT TT TATT TTTTTAAGC TGTAAAA^GAGAGAAAACTTATTTGA6GAT6TA*TTTGT TATTfTG 2703

TACAGT TCAGT TCC TC T TTGCATGGAAtTTTGTAAGTT TATGTC TAAA6AGgT T TAGCC IAGAGGACC TGAGTC TGC TATAT TT TCATGAC TT TTCCATGTATC TACCT*CAC TATTCAAG 2823

TATTA6G6GTAATATATTGC TGC TGGTAATT*TGTATC TGAAGGAGAT T TTCCT TCC TACACCCTrTGGACT*GAG^GAiTT*T*GAGTATC TCGGACCT*T*TCAGCTGTGAAC ATGGACTICT*TCC 2943kGGTCTARD

CCCACT*CCTCT TATT TGCT*CACAC6GGGTAT TTTAGGCAGGGATT T6^AGA6iCAGCTTCAGTTG6TTTTCCCGAGCAAATC TAAAGTTTACAGTAAATAAA^TTGTTTGACCATG'CC TTCA 3063

TTGCACCTGTTTCTCC^AAACCCC TTGAC TGGAGTGCTGTTGCCTCCCCCACTGGAAACCGCA66TAAC TAC TCTGTATTACTGCCCATGACTTAATGTAGAAT6ATACAAGAAT GACAT 3163

GCACAGATTGiCTTAACCCTTTCAT TTCCCTT TGAGTCT*GCT*GCTGCAAAGC TGCATCTC TCCTGACACT*TGT6CCCCAAATCAGTT*CTGCCTGC TCTT*AGTATAGCT*CAACTCT*CCC TAT 3303

GGTTATTGTTC TTTGTT^iCCTCAGAAACAC TGAC TCACAAAGCGGAGTTAAGGGGATTT6T T T T TtC TCTCCACGTGCACCCACCACCCACC T TCCAGTTC TAC TTGrTTTCA 3423

FIG. 1. Structural characterization of the f32AR locus. (a) Map of the p2AR locus showing conserved restriction sites (cf. Fig. 2). (b)Sequencing strategy: each arrow indicates the origin, direction, and extent of sequence determination for individual M13 subclones. HPR-9,HJR-10, and H/3R-11, used for mRNA cap site localization, are indicated. Double-headed arrows F and C represent the flanking- andcoding-region probes used. (c) Nucleotide sequence of the 3.5-kb EcoRI-Sst I fragment. In the 5' region, the first methionine codon of ORFXis boxed. Its TAA stop codon is contained in the TATA box at position 1030. Black triangles represent ,32AR mRNA initiation sites. In the 3'

Page 3: Structure of the gene for human beta 2-adrenergic receptor ...

Proc. Natl. Acad. Sci. USA 84 (1987) 6997

positive wells were subcloned in selective medium on feedercells at a density of 0.5-1 cell per well in 96-well plates.Receptor binding assays were performed as described (11),

with 3-[1251]iodocyanopindolol ([1251]ICYP, 15-500 pM) orwith 4-(3-t-butylamino-2-hydroxypropoxy)-[5,7-3H]benzim-idazol-2-one ([3H]CGP-12,177, 0.03-2 nM). In competitionexperiments, cells were incubated with 60 pM [125IJICYP andvarious concentrations of antagonists or agonists. For cAMPassays (11), aliquots of 2 x 104 preconfluent cells wereincubated for 2 min at 370C with various concentrations of(0-isoproterenol. Inhibition of adenylate cyclase stimulationwas assayed in the presence of different antagonists at a finalconcentration of 100 AM. The level ofcAMP was determinedusing the Amersham cAMP kit.

RESULTSStructural Characteristics of the Cloned DNA Fragment.

Extensive similarities exist between mammalian ,AR (1, 24).To search for the human /32AR gene we derived threeoligonucleotide probes from a hamster lung 182AR cDNAsequence (8). The probes, hybridized to blots of A431 DNA,revealed a single 9.8-kilobase (kb) EcoRI fragment. Restric-tion mapping analysis of this fragment, cloned in EMBL4phages, showed that the sequences homologous to the probeslie in a 3.5-kb EcoRI-Sst I fragment whose sequence ispresented in Fig. ic.

Translation in the six phases revealed two long openreading frames (ORFs). The longest ORF (Fig. ic) and its 5'and 3' untranslated sequences are highly homologous tocorresponding cDNA sequences for turkey P1AR (56% iden-tity; ref. 7) and for hamster (8) and human (9, 10) /32AR (76%and 99%, respectively). In the coding region, few differencesexist between the three human sequences (Fig. ic). Mostprominent is the presence, in the 3' untranslated region of theA431 and placental (9) genes, of a 10-bp sequence that isabsent at homologous positions in the brain cDNA sequence(10). Almost the same sequence is inserted in the comple-mentary strand of the brain cDNA at a position correspond-ing to our bp 3021. Surprisingly, the insertion/deletion eventpreserves in both sequences a tandem repeat of this 10-bpsequence (Fig. ic). This decamer contains a consensussequence for the glucocorticoid receptor binding siteWGTYCT (W = A or T; Y = C or T) (25). Such glucocor-ticoid-response elements and close approximations of thedecamer are found at several other positions in the coding andflanking regions of the gene (Fig. ic). Homology between theA431 and the brain 32AR genes falls from 99% to randomhomology after the mRNA polyadenylylation site. The se-quences responsible for this lack of homology seem to beunique to the published sequence of the brain gene, since its3' end contains EcoRI and BamHI restriction sites that do notexist at corresponding places in the .2AR gene from A431 orother human cells, including neuroblastoma cells (Fig. 2).The second ORF (ORFX) translates into a 251-residue

polypeptide of 26,681 Da (Fig. 3) and possesses four methio-nine codons, none of which fit Kozak's criteria for effectivetranslation initiation (27). A close approximation of a con-sensus polyadenylylation signal is contained in the ACATAAsequence constituting the final two codons (positions1027-1032 in Fig. ic). Analyses of total RNA from A431(/32AR-positive) and CCL137 (p8AR-negative) cells with the F

Bgll BamH I EcoRII I I I I

2 E _ E, o E.C)<XZ c C z cl a.CLZ cnJ

4 -W 4p @ _j

_6.6

-4.3

_2.3_1.9

FIG. 2. Conservation of the 132AR gene locus. Southern blots ofhuman DNA from various tissues [CCL137 and A431 cells, periph-eral blood lymphocytes (PBL), IMR32 neuroblastoma cells (26),semen, and liver] from unrelated individuals were hybridized to theC probe (Fig. lb). Results are shown only for Bgl II, BamHl, andEcoRl restriction enzymes, but the other conserved sites are mappedin Fig. la. Molecular size markers (in kb) are indicated at right.

probe (Fig. lb) did not detect any hybridization signal (datanot shown).mRNA Cap Site and RNA Polymerase II Promoter Site. The

three probes used in these experiments were annealed eitherto themselves or to RNA from A431 or from 32AR-negativecells (Fig. 4). None of the probes was self-protected. HpR-11was not protected with any RNA, whereas HjBR-9 andH3R-10 were protected with A431 RNA, yielding threefragments, whose lengths correspond to three mRNA initi-ation regions (Fig. ic). As in several eukaryotic RNApolymerase II promoters, a reversed "CAAT box" and a"TATA box"-like sequence are found about 80 and 30 bpupstream, respectively, from the mRNA start regions. Theabsence ofa true TATA box, and the presence instead oftwoweakly homologous sequences separated by roughly 10 bp(as are the two main mRNA cap sites), might be responsiblefor the observed low specificity ofRNA initiation. Variationsfrom the consensus sequence (KGGGCGGRRY, where K =G or T, R = A or G, and Y = C or T) for the binding site oftranscription factor Spl (28) occur in three such sequencesfound in reverse orientation in the A431 8AR gene (Fig. ic).

Transfection and Expression in Eukaryotic Cells. To verifythat the 3.5-kb EcoRI-Sst I fragment indeed contains thefunctional elements for 8AR expression, the fragment wastransfected into eukaryotic cells in plasmid constructs lack-ing any foreign promoters. One of the transfectant clones,TP3-15, binds both ICYP and CGP-12,177 with apparent Kdvalues of 35 pM and 170 pM, respectively (Fig. 5a). Atsaturation, the maximum number of sites (Bmax) per 106 cellsis 39 fmol for ICYP and 31 fmol for CGP-12,177. In contrast,the original TP3 cells do not bind CGP-12,177, and low levelsof ICYP binding are observed. In competition experiments(Fig. Sc), various agonists inhibit ICYP binding with thefollowing order of potency: procaterol > (l)-epinephrine >(l)-norepinephrine (respective IC50 values are 6.9 ± 1.5, 31 ±7, and 540 ± 110 ,uM). For antagonist competitors (Fig. Sd),the order is (l)-propranolol > (d)-propranolol > butoxamine

region, three polyadenylylation signals are identified by overbars, and the mRNA polyadenylylation site is indicated by an arrowhead. The boxeddecanucleotide beginning at bp 2754 is deleted at the corresponding position but inserted on the complementary strand of the human brain cDNAsequence at a position corresponding to bp 3021. Close approximations of this decamer (in either orientation) are highlighted by brokenunderlining. Sequences that resemble glucocorticoid-response elements (in either orientation) are indicated by " x " symbols. In the P2AR codingregion, discrepancies between the A431 and published cDNA sequences (9, 10) are shown with open circles for silent substitutions and withblack circles for substitutions leading to replacement by the residue indicated below the translated sequence.

Biochemistry: Emorine et al.

-9.6

Page 4: Structure of the gene for human beta 2-adrenergic receptor ...

6998 Biochemistry: Emorine et al.

T vMFEREYTGLPGV.QWEGS IISARVRQVRSTQMETSVSVSLWMPPSQRVFTF 50

aVaHHVFVLLGASVFVSGRVSVLDRGDFVPDGFaVRARASVHVGELGGCV 100VSVSA WVRYKSEHVaQGVFVPVaAaLGGHSRFLPNVGQaRaAALaLETSS 150

RAGAQGRQVAATEEPKAPGLAGKHTTSSFSPLGPARVAGKQWWPALQGAV 200

GPRPGQPQEK EGRGGEECLAPSRLPACHWPKVPVRHGEGSSPKVLCT 251

FIG. 3. ORFX amino acid sequence (standard one-letter aminoacid symbols). Three methionines (potential initiation codons) aremarked with triangles. Cysteine residues are underlined. A phos-phoryl-binding-site consensus sequence is boxed; a star marks alysine residue, essential for phosphorylation activity of severalprotein kinases and found at the equivalent position in ORFX.

> practolol (respective IC50 values are 23.4 ± 6.2 nM and 1.32± 0.18, 4.5 ± 0.9, and 46 ± 2 ,uM). The forskolin-sensitiveadenylate cyclase of TP3 cells is not inducible by isopro-terenol. However, in the transfected TP3-15 cells, isopro-terenol is able to stimulate the adenylate cyclase activity bya factor of 5 (Fig. Se). Again, inhibition ofthe process followsthe same antagonist potency order.To investigate glucocorticoid effects on P32AR synthesis in

TP3-15 cells, dialyzed fetal bovine serum was used in theculture medium, resulting in slightly modified Bmax (18 fmolper 106 cells) and Kd (9 pM). Addition of5 ,M hydrocortisonenearly doubled receptor number (Fig. Sb).

DISCUSSIONDespite extensive studies of 31AR and P2AR (1, 24), it is notclear whether these two classes of receptors are encoded bya single gene specifically processed toward the (31 or (2subtype or by distinct genes. Using probes derived fromhamster p2AR cDNA, we cloned a human gene encoding a46,404-Da peptide that is highly homologous to mammalian132AR and to a lesser extent to avian ,81AR. Upon transfection

HBR-9

rfn Z

~t CO~O *

258

248

235-b

-

it

H BR-1 0I

c')Z<:4.Io*I

+4 +

0

S198-U w

1 88-o_,m

175-

FIG. 4. Nuclease S1 mapping ofmRNA cap site. Single-strandedprobes (H/3R-9 and HPR-10) were annealed to themselves (lanes 0);to tRNA from yeast; to total RNA from A431 cells, C6 glioma cells,peripheral blood lymphocytes (PBL), liver, TP3 cells, or TP3-15transfectants; or to poly(A)+ RNA from CCL137 or A431 cells(CCLA+ or A431A+). Products of nuclease S1 digestion wereanalyzed by electrophoresis and autoradiography. Stars indicateundigested probes. Molecular size markers (in bp) are shown at left.

of this gene in eukaryotic cells, a protein was produced thatdisplayed P2AR rather than 81AR properties. Using probesderived from the A431 gene, we studied RNA from severalsources by cap site analysis (Fig. 4) and blot hybridization(data not shown). Positive signals were detected in A431cells, which express f32AR. No signals were obtained with C6cells, which express mostly j31AR. These results indicate thatwe have cloned a gene coding for f32AR and that distinctgenes code for p1AR and P32AR.To investigate the existence of tissue-related or genetic

variations in the j32AR gene locus, DNA from various tissuesderived from unrelated individuals was analyzed by Southernblot hybridization (Fig. 2). No tissue or individual restrictionpolymorphism was detected over a 20-kb chromosomalregion. However, our DNA sequencing results showed thatminor individual variations exist among human P2AR se-quences. The drastic decrease in the homology between oursequence and that of the brain cDNA (10) after the mRNApolyadenylylation site might correspond to a rare individualexample of genetic exchange. The Southern blot analysesshow that both the 5' and 3' flanking regions of the ,2AR geneare highly conserved. In line with this observation, the5'-flanking-region probe F (Fig. lb) detects the correspond-ing rabbit, mouse, and hamster sequences under stringentconditions (data not shown). Indeed, these flanking se-quences contain the promoter region used in various humantissues as well as in the rabbit TP3 spleen cells (Fig. 4).Because of its high G+C content this promoter region is ableto form strong secondary structures (as evidenced by thedifficulties encountered during DNA sequencing of thisparticular region). The methylation pattern of these regionsand/or their secondary or chromatin structure might affectthe level of gene expression (29). Also, we have shown thatfunctional glucocorticoid-response elements are certainlypresent in the cloned fragment, since hydrocortisone treat-ment of the TP3-15 transfected cells nearly doubled /32ARnumber. In particular, a glucocorticoid-response-elementsequence is contained in the decamer whose position variesfrom the brain f32AR cDNA to our clone but still occurs as atandem repeat in both sequences (Fig. lc). Since temperatureor stress, and any of several steroid hormones, is able tomodulate 182AR expression, other distal regulatory elementsmight be discovered in the vicinity of the p2AR gene. Thus,conservation of the sequences flanking the f32AR gene couldreflect the presence of regulatory DNA elements modulatingthe expression of the /32AR.The potential product of ORFX (Fig. 3) contains 14

cysteine residues, 8 ofwhich are concentrated within a regionof about 60 amino acids, and it harbors the Gly-Xaa-Gly-Xaa-Xaa-Gly-Lys phosphoryl-binding-site consensus se-quence found in nucleotide-binding proteins (22). Moreover,residues 98-212 of ORFX constitute a region of significanthomology (22% or 40% if one considers conservative substi-tutions) with one of these proteins: yeast elongation factorEF-la (5). In various growth factor receptors, a lysineresidue, essential for protein kinase activity, is found towardthe COOH terminus, 11-20 residues from this consensussequence (4). ORFX encodes a lysine at a correspondingposition (Fig. 3). Several genes for such receptors map in thesame region of chromosome 5 as the 132AR gene (9).

We wish to thank Dr. T. J. Kindt for oligonucleotide synthesis andMs. C. Peyrat for editorial assistance. This work was supported bygrants from the Centre National de la Recherche Scientifique and theInstitut National de la Sante et de la Recherche Medicale.

1. Lefkowitz, R. J., Caron, M. G. & Stiles, G. L. (1984) N. Engl. J. Med.310, 1570-1578.

2. Williams, L. T., Lefkowitz, R. J., Watanabe, A. M., Hathaway, D. R.& Besch, H. R., Jr. (1977) J. Biol. Chem. 252, 2787-2789.

Proc. Natl. Acad. Sci. USA 84 (1987)

Page 5: Structure of the gene for human beta 2-adrenergic receptor ...

Biochemistry: Emorine et al.

I .I I, - I I I

10 20 30 40 50 60Bound ligand, fmol per 106 cells

200Ub

100 X

0 5 2~~~~~

0 10 20 30 4(Bound [1251]ICYP, fmol per 106 cells

-2log[agonist] log[antagonist]

E00.

D

0

0)

0a)

E

0:~

C.)

-9 -8 -7 -6log[isoproterenol]

-I FIG. 5. Expression of the human f32AR gene in TP3 cells. (Top)Scatchard analyses of ICYP binding to TP3-15 (X) or TP3 (o) cells and ofCGP-12,177 binding to TP3-15 cells (.) (a) and ICYP binding to TP3-15 cellsgrown in the presence (X) or absence (*) of 5 u.M hydrocortisone (b). Forthese analyses, the correlation coefficient ranged from 0.82 to 0.99. (Middle)Inhibition of ICYP binding to TP3-15 cells by the 3-agonists (I)-norepineph-rine (W), (l)-epinephrine (*), and procaterol (a) (c) and the (3-antagonistspractolol (X), butoxamine (.), (d)-propranolol (a), and (l)-propranolol (c)

==tF (d). (Bottom, e) cAMP accumulation in TP3-15 cells treated with isopro-terenol alone (v) or isoproterenol plus antagonists [butoxamine (v) or

practolol (o)]. No isoproterenol stimulation was observed for TP3 cells (v).* 4 Basal cAMP levels are shown for TP3-15 (v) and TP3 (e) on the left axis.

Vertical bars indicate standard deviations calculated from two or threeexperiments done in duplicate.

3. Fraser, C. M. & Venter, J. C. (1980) Biochem. Biophys. Res. Commun.94, 390-397.

4. Hannink, M. & Donoghue, D. J. (1985) Proc. Natl. Acad. Sci. USA 82,7894-7898.

5. Nagata, S., Nagashima, K., Tsunetsugu-Yokota, Y., Fujimura, K.,Miyazaki, M. & Kaziro, Y. (1984) EMBO J. 3, 1825-1830.

6. Spiegel, A. M. (1987) Mol. Cell. Endocrinol. 49, 1-16.7. Yarden, Y., Rodriguez, H., Wong, S. K. F., Brandt, D. R., May, D. C.,

Burnier, J., Harkins, R. N., Chen, E. Y., Ramachandran, J., Ullrich, A.& Ross, E. M. (1986) Proc. Natl. Acad. Sci. USA 83, 6795-6799.

8. Dixon, R. A. F., Kobilka, B. K., Strader, D. J., Benovic, J. L., Dohl-man, H. G., Frielle, T., Bolanowski, M. A., Bennett, C. D., Rands, E.,Diehl, R. E., Mumford, R. A., Slater, E. E., Sigal, I. S., Caron, M. G.,Lefkowitz, R. J. & Strader, C. D. (1986) Nature (London) 321, 75-79.

9. Kobilka, B. K., Dixon, R. A. F., Frielle, T., Dohlman, H. G.,Bolanowski, M. A., Sigal, I. S., Yang-Feng, T. L., Francke, U., Caron,M. G. & Lefkowitz, R. J. (1987) Proc. Natl. Acad. Sci. USA 84, 46-50.

10. Chung, F.-Z., Lentes, K.-U., Gocayne, J., Fitzgerald, M., Robinson,D., Kervalage, A. R., Fraser, C. M. & Venter, J. C. (1987) FEBS Lett.211, 200-206.

11. Delavier-Klutchko, C., Hoebeke, J. & Strosberg, A. D. (1984) FEBSLett. 169, 151-155.

12. Lucas, M. & Bockaert, J. (1977) Mol. Pharmacol. 13, 314-329.13. Strosberg, A. D., Collins, J. J., Black, P. H., Malamud, D., Wilbert, S.,

Bloch, K. H. & Haber, E. (1974) Proc. Natl. Acad. Sci. USA 71,263-264.

14. Maniatis, T., Fritsch, E. F. & Sambrook, J. (1982) Molecular Cloning: A

Laboratory Manual (Cold Spring Harbor Laboratory, Cold SpringHarbor, NY).

15. Ullrich, A., Berman, C. H., Dull, T. J., Gray, A. & Lee, J. M. (1984)EMBO J. 3, 361-364.

16. Southern, E. M. (1975) J. Mol. Biol. 98, 503-517.17. Wahl, G. M., Stern, M. & Stark, G. R. (1979) Proc. Natl. Acad. Sci.

USA 76, 3683-3687.18. Henikoff, S. (1984) Gene 28, 351-359.19. Sanger, F., Nicklen, S. & Coulson, A. R. (1977) Proc. Natl. Acad. Sci.

USA 74, 5463-5467.20. Mizusawa, S., Nishimura, S. & Seela, F. (1986) Nucleic Acids Res. 14,

1319-1324.21. Hu, N.-T. & Messing, J. (1982) Gene 17, 271-277.22. Dever, T. E., Glynias, M. J. & Merrick, W. C. (1987) Proc. Natl. Acad.

Sci. USA 84, 1814-1818.23. Gorman, C. (1986) in DNA Cloning: A Practical Approach, ed. Glover,

D. M. (IRL, Oxford), Vol. 2, pp. 143-190.24. Kaveri, S. V., Cervantes-Olivier, P., Delavier-Klutchko, C. &

Strosberg, A. D. (1987) Eur. J. Biochem. 167, 449-456.25. Cato, A. C. B., Geisse, S., Wenz, M., Westphal, H. M. & Beato, M.

(1984) EMBO J. 3, 2771-2778.26. Tumilowicz, J. J., Nichols, W. W., Cholson, J. J. & Greene, A. E.

(1970) Cancer Res. 30, 2110-2118.27. Kozak, M. (1984) Nucleic Acids Res. 12, 857-872.28. Jones, K. A., Kadonaga, J. T., Luciw, P. A. & Tjian, R. (1986) Science

232, 755-759.29. Razin, A., Szyf, M. & Reeves, R. (1984) Biochim. Biophys. Acta 782,

331-393.

Proc. Natl. Acad. Sci. USA 84 (1987) 6999

00

x40,0 3

-6-

'a

m1

C 1(

0

0,1-0

-60LOa.J