DNA Stretching and Compression: Large-scale Simulations...

26
DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures Konstantin M. Kosikov 1 , Andrey A. Gorin 2 , Victor B. Zhurkin 3 * and Wilma K. Olson 1 * 1 Department of Chemistry Rutgers, the State University of New Jersey, Wright-Rieman Laboratories, 610 Taylor Road Piscataway NJ 08854-8087, USA 2 Memorial Sloan-Kettering Cancer Center, 1275 York Avenue, New York NY 10021, USA 3 Laboratory of Experimental and Computational Biology National Cancer Institute, National Institutes of Health Bethesda, MD 20892, USA Computer-simulated elongation and compression of A- and B-DNA struc- tures beyond the range of thermal fluctuations provide new insights into high energy ‘‘activated’’ forms of DNA implicated in biochemical processes, such as recombination and transcription. All-atom potential energy studies of regular poly(dG) poly(dC) and poly(dA) poly(dT) double helices, stretched from compressed states of 2.0 A ˚ per base-pair step to highly extended forms of 7.0 A ˚ per residue, uncover four different hyperfamilies of right-handed structures that differ in mutual base-pair orientation and sugar-phosphate backbone conformation. The optimized structures embrace all currently known right-handed forms of double- helical DNA identified in single crystals as well as non-canonical forms, such as the original ‘‘Watson-Crick’’ duplex with trans conformations about the P—O5 0 and C5 0 —C4 0 backbone bonds. The lowest energy minima correspond to canonical A and B-form duplexes. The calculations further reveal a number of unusual helical confor- mations that are energetically disfavored under equilibrium conditions but become favored when DNA is highly stretched or compressed. The variation of potential energy versus stretching provides a detailed picture of dramatic conformational changes that accompany the transitions between various families of double-helical forms. In particular, the inter- changes between extended canonical and non-canonical states are remi- niscent of the cooperative transitions identified by direct stretching experiments. The large-scale, concerted changes in base-pair inclination, brought about by changes in backbone and glycosyl torsion angles, could easily give rise to the observed sharp increase in force required to stretch single DNA molecules more than 1.6-1.65 times their canonical extension. Our extended duplexes also help to tie together a number of pre- viously known structural features of the RecA-DNA complex and offer a self-consistent stereochemical model for the single-stranded/duplex DNA recognition brought in register by recombination proteins. The com- pression of model duplexes, by contrast, yields non-canonical structures resembling the deformed steps in crystal complexes of DNA with the TATA-box binding protein (TBP). The crystalline TBP-bound DNA steps follow the calculated compression-elongation pattern of an unusual ‘‘vertical’’ duplex with base planes highly inclined with respect to the helical axis, exposed into the minor groove, and accordingly accessible for recognition. Significantly, the double helix can be stretched by a factor of two and compressed roughly in half before its computed internal energy rises sharply. The energy profiles show that DNA extension-compression is related not only to the variation of base-pair Rise but also to concerted changes of Twist, Roll, and Slide. We suggest that the high energy ‘‘acti- vated’’ forms calculated here are critical for DNA processing, e.g. nucleo- protein recognition, DNA/RNA synthesis, and strand exchange. E-mail addresses of the corresponding authors: [email protected]; [email protected]. Abbreviations used: TBP, TATA box binding protein; MD, molecular dynamics; EM, electron microscopic; CRP, cyclic AMP receptor protein. Article No. jmbi.1999.2798 available online at http://www.idealibrary.com on J. Mol. Biol. (1999) 289, 1301–1326 0022-2836/99/251301–26 $30.00/0 # 1999 Academic Press

Transcript of DNA Stretching and Compression: Large-scale Simulations...

Page 1: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

Article No. jmbi.1999.2798 available online at http://www.idealibrary.com on J. Mol. Biol. (1999) 289, 1301±1326

DNA Stretching and Compression: Large-scaleSimulations of Double Helical Structures

Konstantin M. Kosikov1, Andrey A. Gorin2, Victor B. Zhurkin3* andWilma K. Olson1*

1Department of ChemistryRutgers, the State University ofNew Jersey, Wright-RiemanLaboratories, 610 Taylor RoadPiscatawayNJ 08854-8087, USA2Memorial Sloan-KetteringCancer Center, 1275 YorkAvenue, New YorkNY 10021, USA3Laboratory of Experimentaland Computational BiologyNational Cancer Institute,National Institutes of HealthBethesda, MD 20892, USA

E-mail addresses of the correspon

Abbreviations used: TBP, TATAcyclic AMP receptor protein.

0022-2836/99/251301±26 $30.00/0

Computer-simulated elongation and compression of A- and B-DNA struc-tures beyond the range of thermal ¯uctuations provide new insights intohigh energy ``activated'' forms of DNA implicated in biochemicalprocesses, such as recombination and transcription. All-atom potentialenergy studies of regular poly(dG) �poly(dC) and poly(dA) �poly(dT)double helices, stretched from compressed states of 2.0 AÊ per base-pairstep to highly extended forms of 7.0 AÊ per residue, uncover four differenthyperfamilies of right-handed structures that differ in mutual base-pairorientation and sugar-phosphate backbone conformation. The optimizedstructures embrace all currently known right-handed forms of double-helical DNA identi®ed in single crystals as well as non-canonical forms,such as the original ``Watson-Crick'' duplex with trans conformationsabout the PÐO50 and C50ÐC40 backbone bonds. The lowest energyminima correspond to canonical A and B-form duplexes.

The calculations further reveal a number of unusual helical confor-mations that are energetically disfavored under equilibrium conditionsbut become favored when DNA is highly stretched or compressed. Thevariation of potential energy versus stretching provides a detailed pictureof dramatic conformational changes that accompany the transitionsbetween various families of double-helical forms. In particular, the inter-changes between extended canonical and non-canonical states are remi-niscent of the cooperative transitions identi®ed by direct stretchingexperiments. The large-scale, concerted changes in base-pair inclination,brought about by changes in backbone and glycosyl torsion angles, couldeasily give rise to the observed sharp increase in force required to stretchsingle DNA molecules more than 1.6-1.65 times their canonical extension.

Our extended duplexes also help to tie together a number of pre-viously known structural features of the RecA-DNA complex and offer aself-consistent stereochemical model for the single-stranded/duplex DNArecognition brought in register by recombination proteins. The com-pression of model duplexes, by contrast, yields non-canonical structuresresembling the deformed steps in crystal complexes of DNA with theTATA-box binding protein (TBP). The crystalline TBP-bound DNA stepsfollow the calculated compression-elongation pattern of an unusual``vertical'' duplex with base planes highly inclined with respect to thehelical axis, exposed into the minor groove, and accordingly accessiblefor recognition.

Signi®cantly, the double helix can be stretched by a factor of two andcompressed roughly in half before its computed internal energy risessharply. The energy pro®les show that DNA extension-compression isrelated not only to the variation of base-pair Rise but also to concertedchanges of Twist, Roll, and Slide. We suggest that the high energy ``acti-vated'' forms calculated here are critical for DNA processing, e.g. nucleo-protein recognition, DNA/RNA synthesis, and strand exchange.

ding authors: [email protected]; [email protected].

box binding protein; MD, molecular dynamics; EM, electron microscopic; CRP,

# 1999 Academic Press

Page 2: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

1302 DNA Stretching and Compression

# 1999 Academic Press

Keywords: DNA stretching; conformational transitions; molecularsimulation; RecA-DNA assembly; TBP-DNA complex

*Corresponding author

Introduction

A remarkable series of new physical manipula-tions of individual DNA molecules (Bensimon,1996) has renewed interest in the large-scale con-formational ¯exibility of the double helix. Inaddition to the long known solvent-induced tran-sition of the slim, elongated B-type helix (Watson& Crick, 1953) into the wide, stubby A-form(Franklin & Gosling, 1953), B-DNA can now beconverted to a novel over-stretched S conformationupon direct pulling (Bensimon, 1996; Cluzel et al.,1996; Smith et al., 1996; Baumann et al., 1997). Thisextended state, while anticipated nearly 50 yearsago from the optical properties of ®bers under ten-sion (Wilkins et al., 1951), is only beginning to beunderstood. According to the best available mol-ecular models (Cluzel et al., 1996; Konrad &Bolonick, 1996; Lebrun & Lavery, 1996), comp-lementary base-pairing remains intact in theextended duplex (which is 1.6-1.65 times thenormal 3.4 AÊ /bp contour length), the duplex isunwound, and base-pairs are inclined signi®cantlywith respect to the helical axis.

These arti®cially induced changes in confor-mation provide new clues to the response of DNAto proteins and other ligands. While many DNA-binding proteins take advantage of the molecularmechanisms involved in the B$ A helical trans-formation: bending, unwinding, and displacingneighboring base-pairs (Calladine & Drew, 1984)through concerted changes in backbone and glyco-syl torsion angles (Olson, 1976; Yathindra &Sundaralingam, 1976; Zhurkin et al., 1978), agrowing number of proteins distort the doublehelix to conformational states outside the boundsof the natural B$ A transition (see reviews byGuzikevich-Guerstein & Shakked, 1995; Suzuki &Yagi, 1995; Werner et al., 1996; Dickerson, 1998;Olson et al., 1998). The stretching of DNA by pro-tein arises in many instances from the intercalation,or partial insertion, of amino acid side groupsbetween adjacent base-pairs, although there is atleast one example of non-intercalative DNA baseseparation (�5 AÊ ) in a single-stranded DNA frag-ment bound to the RecA protein (Nishinaka et al.,1997, 1998). The latter structure, which is stabilizedby novel sugar-base stacking, is thought to be rel-evant to the regularly elongated form of DNA inthe RecA ®lament (Howard-Flanders et al., 1987) aswell as to physically stretched forms of DNA. Mol-ecular models of related structures suggest thatsugar-sugar interactions may also be involved(Zhurkin et al., 1994a,b).

The precise mechanisms of DNA stretching arestill unknown at the atomic level. Recent energy

minimization and molecular dynamics (MD) simu-lations of extended duplex structures (Konrad &Bolonick, 1996; Lebrun & Lavery, 1996) havefocused more on the global conformational featuresof over-stretched duplexes than the concertedmovements of bases and sugar-phosphate back-bone involved in the deformation processes. More-over, there are discrepancies in the predictedconformational changes in the published studies.For example, the unwound ribbon-like structuresfound by pulling on the 30-termini of complemen-tary strands of oligonucleotide models are some-times (Konrad & Bolonick, 1996), but not always(Lebrun & Lavery, 1996), accompanied by highbase inclination. These differences presumablyre¯ect a variety of built-in methodological distinc-tions, including (1) the nature of the assumed force®elds, AMBER (Pearlman et al., 1995) versus JUMNA(Lavery et al., 1995), (2) the choice of independentstructural variables (Cartesian versus internal coor-dinates), and (3) the route taken to change confor-mation (MD versus minimization). In addition, thestretched forms revealed in energy minimizationdepend on the method used to bring about chainextension, i.e. pulling the molecule on the 30 and 50termini gives different results (Lebrun & Lavery,1996). On the other hand, the corresponding simu-lations of A and B-DNA compression have not yetbeen performed.

Here we report a systematic all-atom potentialenergy analysis of the compression and elongationof regular DNA double helices. Rather than push-ing or pulling the chain on the 50 or 30 ends withexplicit energy terms, we solve a constrained mini-mization problem that identi®es the conformationsof a DNA duplex with ®xed global helical exten-sion. This method allows us to follow the longi-tudinal deformations of the double helix in termsof its ``macroscopic'' stretching coordinate, the heli-cal rise. We optimize the con®gurations of bothpoly(dA) �poly(dT) and poly(dG) �poly(dC) homo-polymers (of given helical extension) under highand low salt conditions. In order to sample confor-mation space as widely as possible, we adopt animplicit representation of the chemical environ-ment with a distant-dependent dielectric constantand atomic charges modi®ed to mimic counterioncondensation (Young et al., 1997). Optimization ofA and B-DNA helices reveals several structural``hyperfamilies'' that exhibit signi®cantly differentenergy pro®les, backbone conformations, and base-pair step geometries. The combined data providenew insight into the deformations of duplex DNAand the nature of both highly stretched and pro-tein-deformed double helices.

Page 3: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

DNA Stretching and Compression 1303

Importantly, the simulated duplex variabilitygoes beyond the limits of the thermal ¯uctuationsthat occur under ``standard solution conditions''.Here we are interested in the large-scale defor-mations of DNA caused by bound proteins, suchas TBP or RecA, or by direct physical manipula-tions. The biological role of these high energy``activated'' states (Rich, 1959) is related to theimproved accessibility of Watson-Crick base-pairsduring the course of DNA ``processing'' (methyl-ation, recombination, transcription, etc.). Theseforms, which are unfavorable in free DNA, are pre-sumably stabilized by the binding of proteins.While the speci®c ways to facilitate DNA distor-tions are different for different proteins, one featurealways remains the same: the conformational pre-ferences of the duplex itself, which are the subjectof this study.

Results and Discussion

Overview of hyperfamilies

The minimization of conformational energyyields four hyperfamilies of right-handed helicalstructures with characteristic sugar-phosphatebackbones (Table 1). Each hyperfamily containsboth A and B-like duplexes with sugars puckeredrespectively in C30-endo and C20-endo domains andphosphodiester and glycosyl torsions concomi-tantly perturbed in well established directions, i.e.the O30ÐP phosphodiester z angle is 30-40 � higherand the glycosyl torsion w is �30-90 � lower in Aversus B-duplexes. These conformational distinc-tions are consistent with all available crystallo-graphic data (Schneider et al., 1997) and persistover the complete range of simulated stretchingand compression (2.0-7.0 AÊ /bp helical rise).

Table 1. Torsion angles, in degrees, for A and B families of f

Pa O30ÐPÐ

O50ÐC50b PÐO50Ð

C50ÐC40g

A1 20 to 30 270 to 290 170 to 190B1 120 to 150 280 to 300 180 to 190At ÿ10 to 20 150 to 170 170 to 190Bt 130 to 180 160 to 180 150 to 200A2 20 to 30 290 to 310 110 to 130Ak 15 to 25 280 to 300 110 to 130B2 160 to 180 240 to 280 150 to 160A3 ÿ25 to 15 250 to 280 190 to 210B3 140 to 170 270 to 280 190 to 200

260 to 320NDB, A-DNAa ÿ20 to 60 140 to 220

180 to 190160 to 200

NDB, B-DNAa 100 to 230 270 to 330130 to 160

Torsion angle ranges in the AtBt, A2B2, and A3B3 hyperfamilies dforms (A1B1 hyperfamily) are highlighted in boldface. Angles are datom sequences.

a Angular ranges in double helical A and B-DNA crystal structutaken from Schneider et al. (1997).

Despite the differences in sugar puckering, the Aand B-like structures in each hyperfamily exhibitthe same qualitative behavior during the completecourse of stretching and compression (Figures 1-5).The similar responses of backbone and bases tolongitudinal deformations support early ideas thatA and B-like forms occupy parallel ravines in(torsion angle/helical parameter) conformationspace and that these forms are simply shifted bydifferences in P � � �P and C10 � � �C10 distances(Zhurkin et al., 1978). Here we ®nd additional setsof related duplexes sharing other common struc-tural relationships as well as similar longitudinaldeformabilities.

The energetic and structural changes in the``high salt'' regime, which is thought to mimicthe natural cellular environment of the doublehelix, persist as well under simulated ``low salt''conditions (see Figure 1 and discussion below).Here we highlight the important conformationaldistinctions among the hyperfamilies at ``highsalt'' in terms of the preferred ranges of dihedralangles: trans (t), gauche� (g�) for backbone tor-sions of 180(�60)� and �60(�60)�, and anti, highanti, and syn for O40ÐC10ÐN9(N1)ÐC8(C6)glycosyl torsions of 30(�45)�, 120(�45)�, and210(�45)�, respectively. Terms such as stretch,stretch per step, global stretching, and extensionare used synonymously with ``helical rise'' in thetext. At the same time, dimer ``Rise'' and otherlocal parameters (designated by capital letters)are used to characterize the base-pair step geo-metry. The local parameters (Figure 2) describethe positioning of coordinate frames embeddedin adjacent base-pairs, whereas the helical par-ameters (Figure 3) are de®ned with respect to aglobal duplex axis (see Methods). While wefocus attention on the computed behavior ofpoly(dG) �poly(dC) polymers, the variation of

orms

O50ÐC50ÐC40ÐC30

e C40ÐC30ÐO30ÐP

z C30ÐO30ÐPÐO50

w O40ÐC10ÐN9(N1)Ð

C8(C6)

40 to 80 180 to 210 260 to 300 10 to 3540 to 60 180 to 190 220 to 270 35 to 70

160 to 180 200 to 220 280 to 300 ÿ5 to 10160 to 190 180 to 200 220 to 270 20 to 45

60 to 70 250 to 270 200 to 220 40 to 5555 to 65 265 to 280 205 to 235 30 to 5540 to 70 240 to 270 160 to 190 130 to 14550 to 70 180 to 200 250 to 280 100 to 10540 to 60 190 to 210 220 to 260 130 to 13530 to 80

160 to 250 250 to 320 ÿ20 to 70140 to 160

160 to 200 230 to 300 20 to 7020 to 80

230 to 270 150 to 210 80 to 120

ifferent from the corresponding values for canonical A and B-e®ned as 0 � for planar cis arrangements of the designated four

res (without mismatches and free of bound drugs or proteins)

Page 4: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

Figure 1. Stretching/compressionpro®les of energy versus helicalrise of optimized A and B-form poly(dG) �poly(dC) duplexes simulatedunder ((a) and (b)) ``high'' and((c) and (d)) ``low salt'' con-ditions. The interchanges betweenconformational forms occur atpoints where the pro®les intersect.Plot styles distinguish confor-mational families: ÐÐÐ , A1B1;- - - - - -, AtBt; ± � ± � ±, A2B2; ÐÐÐ,A3B3. The thin dotted curves ( � � � � )correspond to the kinked Ak helicesfound within the A2 family of struc-tures.

1304 DNA Stretching and Compression

poly(dA) �poly(dT) is roughly identical becauseof the simpli®ed nature of our homopolymermodel (see Methods).

With few exceptions, the torsion angles withineach conformational family are fairly constant over

the course of longitudinal deformations, with indi-vidual angles showing only modest variations (ca.20-30 �) from their characteristic values uponstretching. As a result, the P � � �P virtual bonddistances v (between successive phosphorus atoms

Figure 2. Local (symmetrical)base-pair step parameters: (a) and(b) Twist; (c) and (d) Roll; (e) and(f) Slide; associated with thestretching/compression pro®les ofA and B conformational familiesunder the ``high salt'' conditions ofFigure 1.

Page 5: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

Figure 3. Global helical par-ameters; (a) and (b) twist; (c) and(d) inclination; (e) and (f) shift(x-displacement), characterizing thestretching/compression pro®les ofA and B conformational familiesunder the ``high salt'' conditions ofFigure 1.

DNA Stretching and Compression 1305

on the same strand) are roughly constant (Figure 4),and the helical radii r (de®ned by the Patoms of the homopolymer model) follow thetheoretically expected dependence on global exten-sion (h), helical twist (h), and P � � �P distance (v):r2 � (v2 ÿ h2)/2(1 ÿ cos h).

A1B1 hyperfamily

The lowest energy forms overall, found in whatwe call the A1B1 hyperfamily, correspond to thecanonical A and B-DNA double helices that domi-nate the X-ray literature (heavy solid curves inFigure 1). Both forms respond to stress in muchthe same way. Speci®cally, Twist increases slightlyupon stretching from 2.0 to 3.0-3.5 AÊ /bp, and thendecreases substantially (heavy solid curves inFigure 2(a) and (b)). Roll, by contrast, decreaseswith stretching of the duplex up to helical risevalues of 4.5 AÊ /bp, and subsequently increaseswith further extension (Figure 2(c) and (d)). Thesecorrelated conformational changes mimic thebehavior of DNA in solution, stretched ®bers, and

single crystals. For example, the computed changesin base-pair inclination in the vicinity of the globalminimum are the same order of magnitude as the10-20 � deviations from perpendicularity deducedfrom the linear and circular dichroism spectraof representative DNAs in aqueous salt solution(NordeÂn et al., 1978; Edmondson & Johnson, 1986).Furthermore, our over-twisted B1 duplex is charac-terized by the same negative Roll (Figure 2(d)),positive base-pair inclination (Figure 3(d)), radialnarrowing (Figure 4(f)), and minor groove com-pression (Figure 5(b)) observed in the course of theB! C! D transition in ®bers (Arnott et al., 1974;Selsing et al., 1975).

The variations in local base-pair step and globalhelical parameters with longitudinal deformationsalso follow the conformational patterns exhibitedby dimer steps in high resolution A and B-DNAcrystal structures (Olson et al., 1998, 1999). Speci®-cally, the local Twist and Rise are positively corre-lated in nine out of ten dimeric steps, a ®ndingwhich is in agreement with the present simulationsand which serves as further indication of the

Page 6: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

Figure 4. Molecular distances: (a)and (b) base-pair Rise; (c) and (d)P � � �P virtual bond distance (v);and (e) and (f) phosphorus helicalradius (r); characterizing thestretching/compression pro®les ofA and B conformational familiesunder the ``high salt'' conditions ofFigure 1.

1306 DNA Stretching and Compression

sequence independence of this important corre-lation. In addition, the local Rise of A1B1 helices(Figure 4(a) and (b)) increases with duplex stretch-ing along intuitively expected lines, i.e. imposedglobal extension increases the spacing betweenbase-pair planes. The added base-pair separationpresumably contributes to the ``melting'' of thedouble helix at extreme extension (note the nearlyperpendicular orientations, i.e. large propellertwisting, of complementary base-pairs in the over-stretched A1 and B1 duplexes in Figures 6 and 7).The combined distortions of both base-pair andbackbone geometry destabilize the A1B1 formscompared to other states at high global extension(see discussion, below). In particular, the differencein groove sizes among helical families is tied totheir electrostatic stabilities, and is a key to under-standing the nature of the sharp cooperative tran-sition observed in DNA under extreme stretching(see the following section).

Of particular interest is the correspondencebetween our simulations of B1-DNA stretching andthe conformational patterns associated with loca-

lized, protein-induced elongation of the doublehelix in the high resolution nucleosome core par-ticle structure (Luger et al., 1997). The shorter ofthe two halves of the nucleosome-bound DNA, theso-called 72 bp fragment, spans the same contourlength as its longer (73 bp) counterpart. The addedextension of the shorter DNA is concentratedwithin a 12 bp section, covering the same region ofthe H3-H4 tetramer as a 13 bp region in the longerhalf; see so-called sites 2 and ÿ2 in Figure 4(d) ofLuger et al. (1997). The modest increase in helicalrise, which allows the 12 bp fragment to occupythe same space as the 13 bp piece, leads toincreases of Twist and base-pair inclination, as wellas to narrowing of the minor groove entirely con-sistent with our simulations (Figures 2(b), 3(d), and5(d)).

AtBt hyperfamily

Structures in the AtBt hyperfamily, distinguishedby trans (t) arrangements of their PÐO50 andC50ÐC40 (a and g) backbone torsions, respond

Page 7: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

Figure 5. Minor groove widths,measured as the minimal distancebetween phosphate groups fromcomplementary strands, that charac-terize the stretching/compressionpro®les of the A and B confor-mational families at (a) and (b)``high'' and (c) and (d) ``low salt'' inFigure 1.

DNA Stretching and Compression 1307

much like canonical A1 and B1-DNA to stretchingand compression (compare heavy broken AtBt

curves with continuous A1B1 curves in Figures 2-5).This conformational variant, which was featured inthe original Watson-Crick ®ber diffraction model(Crick & Watson, 1954) and is seen in �6 % of alldimer steps in currently determined A-DNA crys-tal structures (Schneider et al., 1997), increasesP � � �P backbone distances and reduces glycosyltorsion angles w compared to their values in theA1B1 hyperfamily (Table 1 and Figure 4(c) and (d)).Similar structures were predicted in early confor-mational energy studies (Olson, 1976; Yathindra &Sundaralingam, 1976; Zhurkin et al., 1978), andalso suggested on the basis of ®ber diffraction data(Chandrasekaran et al., 1980).

The infrequent occurrence of At conformations inthe X-ray literature is consistent with the energydifference computed here between unstretched At

versus the more favorably base-stacked A1 duplex(Figure 1(a), ``high salt''). By contrast, the absencein the X-ray database of Bt conformers, which arefairly close in energy to unstretched B1 formsunder both ``high'' and ``low salt'' conditions(Figure 1(b) and (d)), suggests possible limitationsof the present energy analysis. The inclusion ofexplicit water molecules and counterions in thecomputations might raise the energy of theunstretched Bt form. We ®nd no signi®cant changein the relative energies of B1 and Bt conformers,however, in our ``continuous medium'' simu-lations, when phosphate charges are reduced tolevels predicted by counterion condensation theory(Manning, 1978).

A2B2 hyperfamily

Helices in the A2B2 hyperfamily adopt the con-formational switch ®rst characterized in dimersteps at the ends of B-DNA dodecamer duplexes(Grzeskowiak et al., 1991) and anticipated in earlyDNA computations (Zhurkin et al., 1978; Olson,1982b). We use the subscript 2 to emphasize theconformational similarity of our computed struc-tures with these so-called B(II) states where the ezrotations about the C30ÐO30 and O30ÐP bondsadopt gÿt combinations rather than the tgÿ

arrangements typical of canonical A1B1 duplexes,the b angle about the O50ÐC50 bond decreases by30-70 �, and the w angle orienting the sugar andbase rises by 40-60 � to the high anti range (seeTable 1). The combined effect of all four torsionangle changes is a large-scale variation in local andglobal base-pair geometry. For example, thestretching-induced responses of Roll and helicalinclination in the B2 helices are completely oppositefrom those in the B1 family; compare heavy con-tinuous versus thin broken curves in Figures 2(d)and 3(d). In further contrast to the canonical B1

structure, the minor grooves of the B2 duplexesshow little, if any, variation upon extensionbeyond �4 AÊ /bp and a signi®cant decrease uponextreme compression (Figure 5(b)). Slide, Rise, andtotal energy are similarly unaffected by duplexstretching (Figures 1, 2(f), 4(b)). Importantly, the B2

conformation becomes energetically more favor-able than all other ``high salt'' B-forms whenstretched to �5.4 AÊ /bp.

The A2 conformation, by contrast, is the leastenergetically favorable of all A-DNA structuresunder both ``high'' and ``low salt'' conditons(Figure 1(a) and (c)). There are, however, some

Page 8: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

Figure 6. Ribbon representations, generated with MidasPlus (Ferrin et al., 1988), of the simulated stretching of A-DNA structures. Side views of the lowest energy structures at the speci®ed values of helical rise are shown. Thearrows designate the putative conformational transitions suggested by the energy pro®les in Figure 1. Chains areoriented so that the minor groove is always shown in the upper part of each duplex. Sugar-phosphate backbones ofduplex structures are represented by gray ribbons, and base-pairs are color-coded according to conformational family:cyan, canonical A1 and B1 structures; magenta, A3 and B2 forms; ochre, Ak duplex; green, Bt conformations.

1308 DNA Stretching and Compression

relatively stable, but highly kinked A-like formswith ez angles con®ned to the gÿt domain whichbear close resemblance to the dimer steps of DNAcomplexed to the TATA-box binding protein (seediscussion of these Ak states in Table 2 in a latersection).

A3B3 hyperfamily

The A3B3 structures, though similar at thebackbone level to the A1B1 hyperfamily, differdramatically from all other duplexes in terms ofglobal base-pair orientation and the associatedresponses to stretching and compression. Large-

scale changes in the glycosyl torsions w to thehigh anti range completely untwist neighboringresidues, signi®cantly incline and displace base-pairs with respect to the global helical axis, andappreciably widen the minor groove (thin con-tinuous curves in Figures 2(a), (b), 3(c)-(f) and5(a), (b)). These unusual ``vertical helices'', antici-pated in early theoretical work (Olson & Dasika,1976; Olson, 1977), preserve both the hydrogenbonding between complementary residues andthe overlap of successive base-pairs during theentire range of global extension (see Slide andRise values in Figures 2(e), (f) and 4(a), (b)). TheA3B3 forms resemble the A2B2 structures by

Page 9: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

Figure 7. Ribbon representationsof simulated B-DNA stretching. Seethe legend to Figure 6.

DNA Stretching and Compression 1309

stretching via Twist and Roll with limitedvariations in Slide, Rise, and minor groovewidth (Figures 2(a)-(f), 4(a) and (b), and 5(a) and(b)) and in becoming energetically advantageousonly at high extension (Figure 1(a) and (d)).

Over-stretched A2B2 and A3B3 duplexes are quitedifferent from extended A1B1 and AtBt forms withundistorted and negatively inclined (positivelyrolled) base-pairs in the former cases and partially``melted'' and positively inclined (negativelyrolled) base-pairs in the latter (Figures 2(c), (d),3(c), (d), 6, and 7). All four hyperfamilies convergein the limit of maximal extension to a common lad-der-like backbone with comparable values of heli-cal twist, x-displacement, P � � �P virtual bonddistance, global radius, and minor groove widthbut with altered base-pair geometry.

Stretching-induced cooperative transitions

The treatment of idealized DNA homopolymersundertaken here precludes detailed analysis of thecooperative transitions between different familiesof helical forms, such as the B$ A transition(Ivanov et al., 1974), characterized by irregularhigh-energy ``boundaries'' between interchangingconformational fragments. On the other hand, thevariation of energy with helical rise within isolatedfamilies (as analyzed here) has the followingadvantage. As seen in Figure 1, the conformationalstates of lowest energy depend upon the degree ofstretching. Thus, one can imagine how the DNAbackbone interconverts from one conformationalfamily to another as the energy pro®les intersect.These large-scale transitions, including inter-changes between extended helices (Figures 6 and

Page 10: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

Table 2. Range of conformational parameters computed for duplex stretching and observed in TBP-bound DNA crys-tal complexes

FamilyP

(deg.)w

(deg.)Twist(deg.)

Helicaltwist(deg.)

Roll(deg.)

Helicalinclination

(deg.)Slide(AÊ )

Helicalx-displacement

(AÊ )Rise(AÊ )

Helicalrise (AÊ )

A3a ÿ10 to 15 99 to 105 ÿ8 to ÿ2 27 to 33 26 to 34 ÿ107 to ÿ95 3.0 to 4.3 7.7 to 9.5 3.4 to 3.8 2.0 to 4.0

B3a 140 to 160 133 to 135 ÿ5 to 7 29 to 32 30 to 33 ÿ77 to ÿ99 2.6 to 3.2 5.2 to 7.9 3.7 to 3.8 2.0 to 4.0

TATA 3 to 109c 76 to 100c

Unkinked 96 to 214d 103 to 143d 1 to 27 20 to 41 7 to 31 ÿ87 to ÿ16 ÿ0.6 to 2.4 ÿ3.0 to 9.2 2.9 to 3.9 1.1 to 4.4Ak

b 24 to 25 32 to 43 20 to 25 34 to 42 28 to 38 ÿ60 to ÿ53 1.0 to 2.0 3.3 to 4.1 3.9 to 4.1 3.0 to 4.0TATA 6 to 179c 17 to 113c

Kinks 137 to 176d 79 to 130d 14 to 31 40 to 51 38 to 53 ÿ70 to ÿ49 ÿ1.5 to 0.7 4.7 to 6.3 4.3 to 5.4 1.1 to 3.3

Parameters derived from crystal structures taken from the Nucleic Acid Database (Berman et al., 1992) and described (Y. Kim etal., 1993; Kim & Burley, 1994; Juo et al., 1996; Nikolov et al., 1996; Tan et al., 1996).

a A3 and B3 structures in the Table are restricted to stretching in the range 2.0-4.0 AÊ /bp helical rise, rather than their complete sta-bility ranges (see the text).

b ``Kinked'' structures (Ak) are limited to helical rise values between 3.0 and 4.0 AÊ /bp , whereas the overall range of stable Ak

structures corresponds to 3.0-5.5 AÊ /bp helical rise.c Limits of variation for the yeast (Y. Kim et al., 1993; Tan et al., 1996), Arabidopsis (Kim & Burley, 1994), and 1.9 AÊ resolution

human (Nikolov et al., 1996) TBP-DNA complexes.d Limits of variation for the 2.9 AÊ resolution human TBP-DNA complex (Juo et al., 1996), except for one thymine deoxyribose

where P � 266 �.

1310 DNA Stretching and Compression

7), are reminiscent of those implicated by directstretching experiments (Bensimon, 1996; Cluzel etal., 1996; Smith et al., 1996; Baumann et al., 1997).

Conformational interchanges

At the starting point of the calculations wherethe double helix is ®xed in the canonical A and B-DNA forms (with respective helical rise of 3.0 and3.3 AÊ /bp in poly(dG) �poly(dC) and of 2.9 and3.2 AÊ /bp in poly(dA) �poly(dT)), the A1B1 energy isthe global minimum. Upon stretching, the energiesof the canonical helices increase while the energiesof other conformational forms, such as the Bt

duplex, decrease (see Figure 1). The B-DNA back-bone apparently converts to the Watson-Crick-likeBt form at 3.8-4.0 AÊ /bp helical rise, when theenergy of the Bt structure becomes lower than thatof the B1 form. As noted above, the computed pre-ference for Bt forms could re¯ect our simpli®edcomputational approach which excludes explicittreatment of water molecules and counterions. Thisunusual over-twisted (Bt) helix with low helicalradius and narrow minor groove is stabilized onlyunder high ionic strength conditions (seeFigures 1(b), 5(b), and 7). The minor groove isappropriately increased when the low salt model isconsidered (Figure 5(d)).

As we continue to stretch the DNA, the energyof the B2 form approaches that of the B1 and Bt

families, and at �5.5 AÊ /bp the B2 form becomesenergetically favored over the Bt helix (Figures 1(b),(d) and 7). The energy pro®les thus predict atwo-step B1! Bt! B2 transition, where each stepentails certain rearrangements of conformationalparameters. The B1! Bt transition involves thecorrelated switching of a and g torsions (Table 1)and little, if any, reorientation of neighboring base-pairs (Figure 2(b), (d), and (f)). The Bt! B2 step,by contrast, entails relatively minor changes in

backbone torsions (e, z, b), but large (anti to highanti) variations in the glycosyl rotation w andaccompanying sharp jumps in base-pair Roll andSlide (�40 � and �4-5 AÊ , respectively), which areexpected to introduce an extremely high energybarrier at the borders between the two equi-poten-tial forms (Olson et al., 1999; see Figure 7). Wetherefore hypothesize that the ``energeticallyexpensive'' switch into the B2 conformation mayaccount for the sharp phase transition observed at60-70 % DNA extension (Cluzel et al., 1996; Smithet al., 1996). In other words, the concerted �40 �rotations of base-pairs about their long axes in theB1/Bt! B2 transition offer an appropriate mechan-ism to account for the ``domino effect'' implied bythe cooperative force-dependent stretching ofDNA. Note that our data further predict that theB2 helix becomes more favorable than all other B-type structures at 5.35 AÊ /bp helical rise under``high salt'' conditions, and at 5.7 AÊ /bp at ``lowsalt'' (Figure 1(b) and (d)), degrees of extensionsclosely corresponding to the degree of DNAstretching at the observed transition.

On the other hand, if we consider the lack ofcrystallographic examples of Bt duplexes notedabove as an argument against the calculated ener-getic preferences of Bt states in the range 4.0-5.5 AÊ /bp, we conclude that the stretching of thecanonical B1 duplex can also occur directly via aB1! B2 transition at �5.0 AÊ /bp. In this case thecalculated helical rise at the transition pointis �10 % less than the observed extension, 5.4-5.6 AÊ /bp (Cluzel et al., 1996) or �5.8 AÊ /bp (Smithet al., 1996). The conformational mechanism, how-ever, remains the same: large rearrangements ofsugars and bases about the glycosyl linkages.

One of the factors leading to the steep increasein energy of B1 and Bt forms upon stretching is theelectrostatic repulsion between duplex strands(note the minima in minor groove widths at 4.0-

Page 11: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

DNA Stretching and Compression 1311

5.0 AÊ /bp under high salt conditions in Figures 5(b)and 7). As mentioned above, the decrease ingroove size is linked to the negative Roll (ÿ25 � toÿ30 �) in B1 and Bt forms (Figure 2(d)). The reorien-tation of base-pairs in the extended B2 duplexresults in a positive Roll (Figure 2(d)), whichwidens the minor groove (Figure 5(b) and 7) andaccordingly diminishes electrostatic interactions.Thus, the transition is expected to depend onenvironmental conditions which modulate thepolyelectrolyte character of DNA (see below).

The calculations predict a somewhat differentscenario for the stretching-induced transitions of A-type DNA. The At form is never optimal under``high salt'' conditions. Thus, the A1 form domi-nates over the stretching interval between 2.0 and5.1-5.3 AÊ /bp (Figure 1(a)) until the A3 formbecomes lower in energy. Like the B-type DNAstretching-induced transition, the A1! A3 conver-sion also entails signi®cant rearrangements of theglycosyl torsion w from the anti to high anti range(Table 1). Furthermore, the extended A3 duplex hasthe same strong positive Roll, and negative helicalinclination as the over-stretched B2 duplex(Figures 2(c), (d) and 3(c), (d)), meaning that thecollective rotation of base-pairs also encounters ahigh energy barrier during the A1! A3 transition(Figure 6). Finally, the larger minor groove of theextended A3 phosphodiester backbone relieveselectrostatic repulsions brought about by the slight,stretching-induced narrowing of the A1 minorgroove (Figure 5(a) and (c)).

Thus, the stretching of both A and B-type helicessimulated here is in principle consistent with theextreme cooperativity observed in single moleculemanipulation experiments (Cluzel et al., 1996;Smith et al., 1996). At present, it is impossible tosay whether the ®nal stretched DNA states belongto A or B-type structures. The observed decrease incooperativity of the transition in the presence ofalcohol (Smith et al., 1996) may indicate the invol-vement of A-helices in the process, but this kind ofreasoning is inconclusive and further experimentsare needed to clarify the issue (see the section onelectrostatic and solvation effects, below).

Comparison with published studies

As pointed out above, highly stretched A2B2 andA3B3 structures are similar to one another, but sig-ni®cantly different from extended A1B1 and AtBt

helices. The switch from A1B1 and AtBt structuresto the extended A2B2 and A3B3 states introduces aconformational barrier, much like a temperature-induced melting transition or a solvent-inducedB! A or B! Z transformation, that must becrossed before the DNA can be extended further.Importantly, the stretched DNA double helicesreported in previous computer simulations-the ``S-ribbon'' from the Lavery group (Cluzel et al., 1996;Lebrun & Lavery, 1996) with bases roughly per-pendicular to the helical axis, the ``S-ladder'' byKonrad & Bolonick (1996) with negatively inclined

bases, and Lavery's narrow ®ber also with nega-tively inclined bases (Lebrun & Lavery, 1996),share common features with our extended forms.For example, base-phosphate contact distances(between N1 of purine or N3 of pyrimidine andthe 50-P of the same nucleotide) in our stretched B2

and A3 structures correspond closely with numeri-cal values reported by Konrad & Bolonick (1996)for the ``S-ladder''. These extended duplexes, illus-trated in Figures 6 and 7, adopt novel, energystabilized cross-strand stacking interactions with agiven base on one strand associated with thecomplementary base of the next residue, i.e. base(i) on one strand interacts with the complement ofbase (i ÿ 1). These unusual contacts dominate theenergy and are responsible, in part, for the shallowminima in the A2B2 and A3B3 energy pro®les at 5.5-6.0 AÊ /bp (Figure 1).

At the same time, there is an important differ-ence between our results and those obtained byothers. The smooth pro®les of energy versus helicalrise presented here reveal potential pathways ofcooperative conformational interconversion unde-tected in previous work. In distinction to Lebrun &Lavery (1996) and Konrad & Bolonick (1996), wemake no assumptions as to how the level of chainextension is achieved so that all states with thesame helical rise are directly comparable. In otherwords, based on our energies we can conclude thatBt is the best among all possible structures of5.2 AÊ /bp extension, whereas B2 is the best for5.4 AÊ /bp (Figure 1(b)). This comparison is madepossible primarily because of our selection of thehelical rise, a natural stretching coordinate withclear ``macroscopic'' meaning, as an independentconformational variable.

By contrast, it is not clear whether duplexesstretched by different means (e.g. by pulling on 50-50 versus 30-30 ends) can be directly compared andrelated to experiment. For example, consider thetwo forms of poly(dGC) �poly(dGC) chains pulledvia 30 or 50 ends in Figure 2(a) of Lebrun & Lavery(1996). These two families of structures, one withpositive and the other with negative base-pairinclination, have equivalent deformation energiesat relative DNA extensions of 1.7, 1.9, and 2.1. Theroughly parallel increase in energy of these twoforms with stretching argues against any phasetransitions.

Our identi®cation of multiple conformationalforms at a given degree of duplex extension stemsfrom the ®ne (0.05 AÊ or less) incremental variationof helical rise used in our calculations. The larger(0.5 AÊ ) translational steps taken by Lebrun &Lavery (1996) in energy minimization yield jaggedplots of energy versus chain extension that miss thedistinctions that we ®nd among conformationalfamilies (compare their Figure 2 with our Figure 1).

The stretching-induced transition with an abruptincrease in internal energy calculated by Konrad &Bolonick (1996) also bears some resemblance to thecooperative transition found here. Dimer steps intheir over-stretched model have large positive

Page 12: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

1312 DNA Stretching and Compression

Slide values much like the steps in our extendedA3 and B2 states. Their transition state further cor-responds to approximately the same degree ofextension computed here (�1.75 times the contourlength of their unstressed duplex). (Local and heli-cal parameters were computed by us with theCompDNA software (Gorin et al., 1995), i.e. with thesame de®nitions used in our energy simulations,from the coordinates available on the GeneVue,Inc. web page.)

On the other hand, there is a crucial differencebetween our model and that by Konrad &Bolonick (1996). We interpret the observed phasetransition in over-stretched DNA as an abruptlarge-scale change in the orientations of base-pairs(similar to a ``domino effect''). This kind of reorien-tation, associated with the conversion of the doublehelix from the B1/Bt/A1 conformation to the B2/A3

form, implies that dozens, if not hundreds, of base-pairs must rotate simultaneously to avoid for-mation of the high-energy ``boundaries'' betweendifferent helical states. The ``mechanistic'' reasonbehind the cooperative transition predicted byKonrad & Bolonick, however, is unclear. The DNAin their simulations is pulled at the 30-ends duringthe complete course of stretching, and intermediateforms appear to have the same sign of base-pairinclination (see Figure 2 of Konrad & Bolonick,1996). The duplex seemingly remains within thesame family of conformations during the computersimulations.

Further analysis of the over-stretched modelavailable on the GeneVue web site reveals signi®-cant deviations in torsional parameters comparedto values observed in crystal forms. In particular,the sugar puckering lies in a prohibited range(P � 290-330 �) not even observed in the simplestmononucleosides (Gelbin et al., 1996). Furthermore,most of the dimer steps in the model assume back-bone linkages like the Bt form (i.e. trans ag values),even though the inclination of base-pairs resemblesthat in the B2 family. This mixed conformationalpicture is suggestive of an extremely high energystate located far from the structural transition``path'' induced by DNA stretching. It is also un-likely that the correlated changes of a and g anglescomputed in the Konrad & Bolonick study canaccount for the large cooperative effect observedexperimentally (Bensimon, 1996; Cluzel et al., 1996;Smith et al., 1996; Baumann et al., 1997), especiallysince such rotational combinations can occur inde-pendently in different nucleotides (Olson, 1981).

Electrostatic and solvation effects

The predicted stretching-induced transitions ofB-DNA duplexes follow the same qualitative pat-terns under simulated low and high salt conditions(Figure 1(b) and (d)). That is, the B-form duplexespreferentially adopt canonical (B1) conformationsover the range of normal chain extension and con-vert upon added stretching to the Bt and B2 forms.Indeed, the B1! Bt and Bt! B2 transitions occur

at roughly the same degree of extension under thetwo ionic extremes. The decrease in ionic strength,however, changes the relative energies of thedifferent hyperfamilies, enhancing their conver-gence to a common conformational form atextreme stretching. Compression of B-form DNAunder ``low salt'' conditions also favors the Bt overthe B1 form. This preference is apparently in¯u-enced by the greater local spacing between succes-sive phosphate groups in the Bt familycharacterized by extended trans, trans confor-mations of the ag phosphodiester linkage (Zhurkinet al., 1978).

The change in ionic conditions has a more pro-nounced effect on the stretching and compressionof A-DNA. The variation in salt concentrationshifts the relative energies of the A1 and At forms,interchanging their conformational preferences atlow and high salt (Figure 1(a) and (c)). Whereasthe A-type helices preferentially adopt the canoni-cal A1 form at high salt, the At state with morewidely separated phosphate groups dominatesover nearly the complete range of stretching atlow salt. The A1 state is favored at low ionicstrength over only a narrow range of helical rise(4.0-4.5 AÊ /bp). Both below and above this range,the At family is energetically preferable. In thisrespect, the A and B-forms behave similarly at``low salt'' (Figure 1(c) and (d)).

Finally, our data suggest a putative interpret-ation of the observed dependence of the stretching-induced transition on hydration. The reducedwater activity in 60 % ethanol almost eliminates thecooperativity of the transition (Baumann et al.,1997). Based on our data, the effect of ethanol canbe interpreted in two ways, neither of whichexcludes the other. First, the decrease in wateractivity facilitates the B! A transition (Ivanov etal., 1974). Therefore, reduction in the cooperativityof DNA stretching in the presence of ethanol maybe tied to the relative advantage of extended A-likeforms with C30-endo puckered sugars under theseconditions (Figure 1(a)). Second, in the presence ofalcohols the monovalent cations may stabilize B1-duplexes with narrower minor grooves than thecorresponding forms in aqueous solution (Ivanovet al., 1973). In such cases, the reduced cooperativ-ity can be linked to an ethanol-induced decrease inthe barriers introduced by unfavorable inter-phos-phate interactions in the B1 and Bt-forms at 4.0-5.0 AÊ /bp stretching (Figures 1(b) and 7). Evidently,further experiments and theoretical studies areneeded to clarify this issue.

TBP-bound DNA

The deformations of DNA brought about bystretching or compression can be compared withthe extreme distortions of the double helix inducedby the binding of proteins such as the TATA-boxbinding protein (TBP). Recent interpretations (Leb-run et al., 1997) of the crystal complex suggest thatthe TATA box is ``locally stretched and unwound''

Page 13: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

Figure 8. Variation of (a) local and (b) helical rise in®ve TBP-DNA complexes (Y. Kim et al., 1993; Kim &Burley, 1994; Juo et al., 1996; Nikolov et al., 1996; Tan etal., 1996). In those cases where two different structuresof a given complex are given (Y. Kim et al., 1993; Kim &Burley, 1994), the average values are presented. Nota-tions: ®lled circles, yTBP (Y. Kim et al., 1993); opencircles, cyTBP (Tan et al., 1996); ®lled triangles, aTBP(Kim & Burley, 1994); ®lled squares, hTBP (Juo et al.,1996); open squares, hTBPc (Nikolov et al., 1996). Thesequences in positions 5, 6 and 8 are as follows: X � Afor cyTBP, aTBP, and hTBPc, X � T for yTBP and hTBP;Y � T for hTBP and Y � A for others; Z � A for yTBPand hTBP; Z � C for cyTBP, Z � G for aTBP and hTBPc.The rectangles illustrate the sequence-dependence of thehelical rise for steps 2-5: AT < AA < TA.

DNA Stretching and Compression 1313

upon association with TBP. Our analysis, however,shows that the TATA-box conformation is not ade-quately described in such simple and intuitivelyde®ned terms. In fact, depending upon the choiceof structural parameters, the DNA appears to beeither stretched or compressed.

Local conformational distortions

The severe distortion of the TATA-box regionhas been described in terms of the distancesbetween P atoms in different strands (Guzikevich-Guerstein & Shakked, 1995; Lebrun et al., 1997).Speci®cally, the distances that are used to measurethe major and minor groove widths of B-DNA, the20.3 AÊ separation of 50-P atoms on either side of afour base-pair double helical stretch and the corre-sponding 13.8 AÊ distance between 30-P atoms, arefound to change asymmetrically upon TBP bind-ing. The distance between the 30 termini of DNA inthe TBP-complex increases dramatically to 28.8 AÊ ,whereas the 50-P � � �50-P separation retains the B-form value of 20.3 AÊ , suggesting a ``local'' orasymmetric stretching of DNA in the TATA-boxregion (Guzikevich-Guerstein & Shakked, 1995;Lebrun et al., 1997).

Interestingly, increased P-P distances between 30-P atoms are characteristic features of the A3B3

hyperfamily (see Figure 5 and the above discussionof the unusually wide minor grooves in thesehelices). These distances, which remain nearly con-stant (30-32 AÊ ) over the complete range of globalstretching (2.0-7.0 AÊ /bp), along with the corre-sponding 50-P � � �50-P distances (19-22 AÊ ), reveal theclose resemblance of the A3B3 hyperfamily to TBP-bound DNA. Apparently, the duplex does notnecessarily have to be stretched to reproduce thecrystallographically observed increase in 30-P � � �30-Pdistances.

Helical deformations

Further examination of DNA dimer steps in thepublished TBP complexes (J. L. Kim et al., 1993; Y.Kim et al., 1993; Kim & Burley, 1994; Nikolov et al.,1995, 1996; Juo et al., 1996; Tan et al., 1996) in termsof the helical rise uncovers a compression, ratherthan a stretching of base-pair steps. This parametercorresponds to the distance between consecutivebase-pairs along the helical axis of the regularpolymeric structure that is generated by successiverepetition of a given dimer step (Miyazawa, 1961;Rosenberg et al., 1976; see Methods). The helicalrise computed with the CompDNA software (Gorinet al., 1995) varies from fractions of an AÊ ngstromunit to slightly over 4.0 AÊ /bp in the seven dimersteps of the TATA box contacted by protein (seeFigure 8). Indeed, some steps are even more glob-ally compressed than those in the canonical A-DNA duplex.

Among the seven DNA steps involved in directcontact with TBP, steps 1 and 7 are strongly kinkedinto the major groove, as opposed to the ®ve less

severely deformed intermediate steps 2-6. We thusconsider these two groups of steps separately.Table 2 compares selected conformational par-ameters of TBP-bound DNA dimer steps againstcorresponding values found in our simulated A3B3

and Ak structures (where the Ak stands, as notedabove, for the kinked structures in the A2 family).The data for the computed structures are reportedin terms of the ranges of conformational par-ameters accompanying an increase of helical risebetween 2.0 and 4.0 AÊ /bp, an interval correspond-ing approximately to the variation in helical riseobserved in the TBP-DNA co-crystals. The calcu-

Page 14: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

1314 DNA Stretching and Compression

lated A3B3 and Ak homopolymers span a slightlynarrower range of helical rise compared to individ-ual TBP-bound DNA steps. Steric repulsions pre-vent compression of the idealized A3B3 structuresbelow 2 AÊ /bp helical rise, and the Ak homopoly-mers are only stable in the 3.0-5.5 AÊ /bp range (seethin dotted curves in Figure 1(a)).

Importantly, the DNA steps between TBP-induced kinks follow conformational trends foundupon stretching the A3B3 hyperfamily. Thesecharacteristics include: (1) roughly constant localRise (Figure 4(a) and (b)); (2) a decrease in Rolllinked to an increase in base-pair inclination withrespect to the helical axis (Figures 2(c), (d) and 3(c),(d)); (3) an increase in Slide tied to a decrease inhelical x-displacement (see Figures 2(e), (f) and3(e), (f), and discussion below); and (4) an increasein local Twist but almost no change in helical twistas the chain is extended (Figures 2(a), (b) and 3(a),(b)).

In most TBP-DNA complexes resolved to date,the intervening TATA steps are predominantlyA-like in that they tend to adopt C30-endo sugarpuckering, although the puckering at some steps isC40-exo (an intermediate pseudorotational statebetween the C30-endo and C20-endo puckers). Thus,the A3 duplex with its high anti w rotation and localuntwisting appears to be an extreme version ofTATA-box DNA, for which w is anti at most steps(see Table 2).

The only exception to this assessment is the2.9 AÊ resolution human TBP-DNA complex (Juo etal., 1996) where the steps between the two kinksexhibit the C20-endo sugar pucker characteristic ofB-like DNA. The glycosyl angle w in this structureis relatively high compared to that of ``standard''B-DNA, placing this form close to the B3 family.The variation of base-pair step parameters in thehuman TATA-box DNA also resembles that of thecalculated B3 structures (see Table 2). Therefore, weconclude that both subsets of the calculated A3B3

hyperfamily, A3 with C30-endo sugars and B3 withC20-endo sugars, are close relatives of the exper-imentally observed TATA-box structures.

Sequence effects

Figure 8 further illustrates the effects of sequenceon the structure of DNA in the TATA box. Thehelical rise values increase in the order: AT <AA < TA (see the boxes in Figure 8). Interestingly,

a similar sequence dependence has been observedfor local Twist (Juo et al., 1996). In particular, thecentral fourth step is most compressed andunwound when it is AT, with a helical rise of 1.0-1.3 AÊ /bp (Y. Kim et al., 1993; Juo et al., 1996). Thecorresponding helical twist is close to 20 � (localTwist � 0 �) and the base-pair (helical) inclinationis ÿ80 � to ÿ90 � (data not shown), values consist-ent with the behavior of the A3B3 structures(Table 2). On the other hand, when the fourth stepis AA, the dimer looks more like a canonical A-likeform with helical rise 2.5-3.0 AÊ /bp in the yeast

TBP-DNA structures (Y. Kim et al., 1993; Tan et al.,1996).

The unusually low values of helical rise for theAT steps correlate with the extremely high (7.7-9.5 AÊ ) values of helical x-displacement (seeTable 2). This parameter is identical to the displace-ment D used in earlier notations (Arnott, 1970),where a positive D value indicates the shifting ofbase-pairs toward the minor groove. Because ofthe tendency of the sugar-phosphate backbone toretain its optimal geometry (Gorin et al., 1995),an increase in D is linked with the decreasesin helical rise and helical twist that accompany theB-to-A transition (Olson, 1976; Yathindra &Sundaralingam, 1976; Zhurkin et al., 1978). Notice,however, that the displacement D is about 4.5-5.0 AÊ /bp in the canonical A-form, whereas here Dis as high as �9.5 AÊ at the central AT step in thehuman TBP-DNA complex (Juo et al., 1996). Inother words, the AT steps appear to be highlyexposed in the minor groove. This pronouncedsequence dependence in the TATA-box parametersis probably functionally signi®cant for mutualprotein-DNA ®t, i.e. optimization of TBP-DNAinteractions at the interface surface.

As for the kinks at either end of the protein-bound DNA fragment, the local Rise is appreciablyhigh compared to the intermediate dimer steps(Table 2 and Figure 8). However, the correspond-ing helical rise is rather low, especially at the ®rststep, which thus bears some similarity to extremelycompressed conformations. In fact, local and heli-cal base-pair parameters observed for the extremeterminal kinks closely match their calculated rangeand variation in Ak structures. For example, thehigh (over 40 �) helical twist at these steps appearsto reduce the duplex radius r and to induce largenegative helical inclination.

Mechanics of DNA distortion

Finally, we conclude that despite the simplicityof our homopolymer model, which (i) ignores themixed sugar puckering found at kinked TBP-DNAsteps and accordingly biases the choice of back-bone and glycosyl torsion angles; (ii) sets theBuckle angle to zero, although it varies noticeablyin the TBP-DNA structures, especially at kinkeddimer steps; and (iii) uses a different sequence, weclosely mimic the structure of DNA in contact withTBP at both non-kinked (A3B3 forms) and kinked(Ak subfamily) steps. The close resemblance of theco-crystal DNA structures to our calculated forms(both stretched and compressed) further corrobo-rates our conclusion that a simple ``stretching andunwinding'' mechanism of TATA-box recognition(Lebrun et al., 1997) does not re¯ect the compli-cated mechanics of DNA distortions observed inthe complex with TBP. In fact, an intricate inter-play between various base parameters leads to theunusual conformation of TATA-box DNA, whichis neither A nor B. Furthermore, the nature of theconformational change depends on the structural

Page 15: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

DNA Stretching and Compression 1315

parameters used to describe the observed defor-mations: while P � � �P distances are suggestive oflocal chain extension (Lebrun et al., 1997), the tworise values reported here indicate that either thechain length remains practically constant, or is atmost moderately stretched (local Rise), or that thestructure is noticeably compressed (helical rise).

Our interpretation of the TATA-box structurealso re¯ects differences in our de®nition of helicalrise compared to values reported by others (Lebrunet al., 1997) using the Lavery-Sklenar Curves

algorithm (Lavery & Sklenar, 1989). Our helicalrise corresponds to the distance between con-secutive base-pair centers along the axis of themini-helix generated by these two residues (seeMethods). By contrast, the inter-base-pair distancein Curves (Lavery & Sklenar, 1989) is measuredalong the normals to the base-pairs (with the risean average of the distances measured along thetwo normal vectors). To compare the two sets ofparameters, consider an ``ideal B-DNA'' with itshelical axis passing through the base-paircenters, and local Roll � Tilt � 0 �, Twist � 36 �,Rise � 3.4 AÊ /bp. Now, if each pair is rotated aboutits long axis by the same angle (e.g. 20 �), our heli-cal rise does not change, since the base-pair centersand helical axis remain in exactly the same places.The rise computed with Curves, however,increases noticeably since the normal vectors areno longer parallel to the helical axis (see Lu &Olson (1999) for further details). This simple con-sideration accounts for much of the increase in risefound with Curves at highly kinked DNA stepscompared to other analysis schemes (Werner et al.,1996; Fernandez et al., 1997; Lu et al., 1997; Lu &Olson, 1999), including the CompDNA routines(Gorin et al., 1995) used here.

RecA-stretched DNA

Our results can also be tied to the RecA-drivenprocess of homologous recombination, where bothsingle-stranded (ssDNA) and double-stranded(dsDNA) molecules are severely extended andunderwound (Stasiak et al., 1981; Stasiak & DiCapua, 1982). During the exchange of genetic infor-mation the dsDNA from one chromosome recog-nizes and swaps strands with the homologousssDNA fragment from a sister chromosome (forreviews, see West, 1992; Kowalczykowski &Eggleston, 1994). RecA, one of many knownrecombination proteins, binds ssDNA and forms ahelical ®lament that facilitates ssDNA-dsDNAinteractions and the search for homology (Egelman& Stasiak, 1986). One of the most intriguingcharacteristics of the RecA-DNA complex is itsextreme stretching (helical rise � 5.1 AÊ /bp) andunwinding (helical twist � 20 �) (Stasiak et al., 1981;Stasiak & Di Capua, 1982). Remarkably, the DNArise and twist in the recombination ®laments arenearly species-independent (Ogawa et al., 1993),suggesting that these distortions are functionallymeaningful. Molecular models of ssDNA-dsDNA

interactions (Zhurkin et al., 1994a,b) indicate thatthe stretching and unwinding may be advan-tageous for DNA-DNA homology recognition. Inparticular, the increased separation of bases, i.e.rise, reduces the chances of mismatch formationbetween single strand and duplex.

Stretched duplexes

In view of these data, it is interesting to considerwhether our calculated stretched forms offer anyclues to the recognition mechanisms involved inhomologous recombination. The energetically opti-mal dsDNA forms corresponding to the RecA-induced helical rise of 5.1 AÊ /bp include: the A1

and Bt duplexes at high salt and the At and Bt

forms at low salt (Figure 1). We assume that our``high salt'' models more closely mimic theenvironmental conditions in the center of the RecA®lament, where there are numerous positivelycharged amino acids, especially in the vicinity ofthe L2 loop (Story et al., 1992). We further choosethe A-like (A1) over the B-like (Bt) form in therecombination model detailed below, given that aC20-endo to C30-endo conformational transition isdetected in FTIR measurements of the RecA-three-stranded complex (Dagneaux et al., 1995). Thereasoning behind our protein-DNA model, how-ever, is applicable to any of the aforementionedconformations: A1, At, B1 and Bt, all of which sharesimilar global features.

The energetically optimal A1 form extended to5.1 AÊ /bp, is characterized by positive helical incli-nation and negative Roll (Figures 2, 3, and 6). Thatis, the bases are oriented in the same way as in theclassic C- and D-DNA ®ber forms (Arnott et al.,1974; Selsing et al., 1975). The extremely widemajor grooves of these helices (Figure 6) areexpected to facilitate favorable interactions withssDNA. An additional functional advantage of ourextended A1 model is related to the orientation ofbase-pairs in the duplex and the inclination of theduplex itself as it slides along the RecA ®lament insearch of the homologous single strand (Figure 9;Howard-Flanders et al., 1987). The inclination ofbase-pairs in the extended A1 duplex effectivelycancels the inclination of the A1 duplex itself withrespect to the RecA ®lament, so that the base-pairsof dsDNA are nearly perpendicular to the overallRecA axis and nearly parallel to the bases ofssDNA (Figure 9). Such an orientation is expectedto be functionally advantageous, facilitating base-base interactions, speci®c recognition, and strandexchange between double and single-strandedDNAs (Zhurkin et al., 1994a,b).

Recombination model

Consider in more detail two critical assumptionsin the model presented in Figure 9: (i) the bases inssDNA are nearly perpendicular to the RecA ®la-ment; (ii) the interaction between double andsingle-stranded helices occurs in the major groove

Page 16: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

Figure 9. Model, in stereo, of stretched dsDNA interacting with the extended ssDNA localized in the center of theRecA ®lament (Egelman & Stasiak, 1986; Howard-Flanders et al., 1987). The RecA ®lament is shown schematically bygray ribbons. Notice that the bases in the duplex are inclined so as to facilitate interactions with the ssDNA basesknown to be nearly perpendicular to the ®lament axis (NordeÂn et al., 1992, 1998). The large separation between thebases helps to avoid mispairing. The duplex is a member of the A1 family extended to 5.1 AÊ /bp (Figure 7). The threebase-pairs shown in magenta are in register with the three nucleotides of the single strand (green), facilitatingssDNA-dsDNA recognition in the major groove (Howard-Flanders et al., 1984) and subsequent strand exchange(Zhurkin et al., 1994a,b).

1316 DNA Stretching and Compression

of the duplex. The nearly perpendicular orientationof ssDNA bases is consistent with the lineardichroism of single-stranded DNA bound to RecA(NordeÂn et al., 1992, 1998). The most unusual fea-ture of the ssDNA is the large separation betweenthe bases, on one hand, large enough to weakenthe stacking interactions and allow for the swap-ping of bases between ssDNA and dsDNA and onthe other, too small to allow water molecules topenetrate between the bases (Zhurkin et al., 1994b).

Importantly, this base positioning has beencorroborated by recent NMR structures of severalshort ssDNA oligomers bound to RecA (Nishinakaet al., 1997, 1998), where the partially unstackedbases are separated by distances of �5 AÊ along thebase normals. The loss in stacking interactions inthe NMR structure is partially neutralized byfavorable sugar-base interactions (Figure 10), but isalso accompanied by an unusually high helicaltwist �50 � (a value calculated by us with theCompDNA software (Gorin et al., 1995) from thecoordinates deposited in the Protein Data Bank(Bernstein et al., 1977), PDB entry 3REC). Whilesuch over-twisting is apparently favorable inselected single-stranded DNA ®bers (Leslie &Arnott, 1978), it contradicts the well knownunwinding of long ssDNA in the RecA ®lament(Stasiak & Di Capua, 1982). The extreme overtwist-ing of bases in the NMR structure may stem fromthe very short fragments of ssDNA bound to RecAin that study (Nishinaka et al., 1997, 1998). Indeed,only one or two RecA monomers can form a com-

plex with the three to six nucleotide ssDNAcharacterized by NMR. The association of multipleRecA proteins is apparently necessary for stabiliz-ation of the underwound DNA structure operativein recombination. Therefore, we do not incorporatethe NMR-derived information directly in our mod-eling, but rather use a ssDNA computer model(Zhurkin et al., 1994a) consistent with the 5.1 AÊ /bpstretching and 20 � twisting of RecA-ssDNA com-plexes (Stasiak et al., 1981).

This single-stranded structure, illustrated inFigure 10, matches the duplex DNAs found in thiswork and is moderately stabilized by sugar-sugarinteractions. The sugars associate via an unusuallyshort (3.7 AÊ ) C20 � � �O40 contact, where the V-shaped CH2 group at the 20-carbon tightly ``envel-opes'' the oxygen atom of the adjacent deoxyri-bose. Such advantageous interactions are wellknown in the DNA crystallographic literature(Wahl et al., 1996; Mandel-Gutfreund et al., 1998).Geometrical analyses further show that this sugar-sugar contact is more favorable when the sugarpucker is C30-endo (in accordance with FTIRmeasurements (Dagneaux et al., 1995)), and thesugars are deoxyriboses (in accordance with thelow af®nity of RecA for RNA (McEntee et al.,1981)).

The second assumption of the model in Figure 9,the recognition of ssDNA via the major groove ofthe duplex (Lacks, 1966; Howard-Flanders et al.,1984), is consistent with recent data obtained fromthe cleavage of RecA-bound DNA by 125I-emitted

Page 17: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

Figure 10. Comparison, in stereo,between (a) our energy optimizedssDNA dimer (Zhurkin et al.,1994a) and (b) the NMR-basedmodel by Nishinaka et al. (1997,1998). The yellow broken lines des-ignate favorable contacts betweenthe CH2 group at the 20-carbon andthe O40 atom in (a) and the (20-car-bon) CH2 � � �base interaction in (b).Notice that the helical twist is 20 �in (a) and 50 � in (b). The helicalrise is �5 AÊ /bp in both cases (seethe text for details).

DNA Stretching and Compression 1317

Auger electrons (Malkov et al., 1999). This schemealso agrees with our ®ndings for extendedduplexes. As evident from the view into the minorgroove in Figure 9, the right-handedness of theRecA-DNA ®lament (Ogawa et al., 1993) impliesthat the base-pairs must be rotated counter-clock-wise around the dyad axis to form planar associ-ations with the bases of ssDNA approximatelyperpendicular to the protein ®lament axis (NordeÂnet al., 1992, 1998). That is, to facilitate ssDNA-dsDNA recognition in the major groove, the base-pair inclination has to be positive. Importantly,such base-pair inclination is energetically favorablein the extended A1B1 and AtBt duplexes (Figure 3).By contrast, if the recognition were to occurthrough the minor groove, as suggested by others(Baliga et al., 1995; Podyminogin et al., 1995; Zhou& Adzuma, 1997), the base-pairs would have to beinclined negatively, which is energetically unfavor-able for stretched canonical duplexes, but feasiblefor higher energy A2B2 and A3B3 helices (Figures 1and 3).

Thus, our energy-optimized duplex model helpsto tie together a number of previously knownstructural features of the RecA-DNA complex:(i) the RecA ®lament is right-handed (EM obser-vation); (ii) the ssDNA and dsDNA in the RecAcomplex are stretched and unwound by 50 % (EMdata); (iii) the bases in ssDNA are nearly perpen-dicular to the RecA helical axis (linear dichroismmeasurements); (iv) the base-pairs in the energeti-cally favorable extended duplex are inclinedpositively (Figures 1 and 3); (v) the recognitionoccurs in the major groove of the duplex (Howard-Flanders et al., 1984; Malkov et al., 1999). Takentogether, these features yield a self-consistent

stereochemical model for the ssDNA-dsDNArecognition brought in register by RecA proteins.

Conclusions

This study has attempted to ®nd the intrinsicstructural characteristics of double-helical DNAunder extreme stretching and compression. These``activated'' conformations, energetically unfavor-able under ``normal'' conditions, are observed incomplexes with proteins and other ligands and canbe induced by novel micromanipulation techniques(Bensimon, 1996; Cluzel et al., 1996; Smith et al.,1996; Baumann et al., 1997). The critical assumptionmade here is that the intrinsic energy preferencesof the double helix per se are used by the cellularmachinery involved in DNA metabolism, andtherefore, these natural preferences deserve exten-sive study. More speci®cally, we presume thatDNA binding proteins take advantage of the natu-ral DNA ``energy landscape'', and that when suchproteins deform the duplex, they do so in an``energy economical'' way. This assumption hasalready been vindicated by numerous crystallo-graphic examples of regulatory proteins bendingand twisting the DNA to which they bind (forreviews, see Travers, 1989; Steitz, 1990; Suzuki &Yagi, 1995; Dickerson, 1998; Olson et al., 1998).Here we have extended this assumption to the``longitudinal'' deformations of DNA.

Although usually ignored, the longitudinal elas-ticity of double-helical DNA is an important struc-tural factor facilitating its mutual ®t with proteins.One of the best known examples is the variable(142-149 bp) length of DNA on the nucleosomecore particle (Satchwell et al., 1986), an observation

Page 18: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

1318 DNA Stretching and Compression

possibly linked to local stretching and compressionof the double helix in sequence-dependent wrap-ping around the histone octamer (see the detailedcomparison in Results and Discussion of our calcu-lations with the localized stretching of DNAobserved in the crystallized nucleosome (Luger etal., 1997)). Here we wish to emphasize that theinterdependence between DNA stretching andtwisting revealed in our simulations may beimportant for protein-DNA recognition in general.Proteins usually recognize DNA sequences con-taining several ``consensus boxes'' separated byvariable spacers. Therefore, assuming that bendingdeformations are limited, the duplex has to deformboth in terms of twisting and stretching to facilitateeffective protein binding to different DNA sitesbelonging to the same class of response elements.In this regard, it is important that our simulationspredict positive correlations between B-DNA twist-ing and stretching, at least within the limit of10-15 % of the equilibrium values (Figure 2(b)).This correlation implies that the stretching of aduplex automatically leads to its over-twisting,making it relatively easy to ®t such a fragment intothe same space normally occupied by a longersequence, e.g. the terminal base-pairs of a stretched10-mer will easily superimpose on the end residuesof an 11 bp chain.

Another manifestation of the longitudinal``breathing'' of DNA is the variability in length ofEscherichia coli promoters. The DNA spacerbetween the `` ÿ 35`` and the `` ÿ 10 boxes'' variesfrom 15 to 19 bp (Auble & deHaseth, 1988), a ®nd-ing which suggests that the helical rise of DNAmay vary inversely with the length of the spacer(to secure the best ®t between promoter and RNApolymerase). A third example is the cyclic AMPreceptor protein (CRP), which binds to consensusCACA and TGTG tetramers separated by distancesvarying from eight to ten base-pairs (Barber et al.,1993; Ivanov et al., 1995). At least part of this varia-bility may stem from the longitudinal ¯exibility ofthe duplex. Notably, when CRP binds to asequence with a longer spacer, the DNA undergoesa B-to-A-like transition (Ivanov et al., 1995). Inother words, the double helix is compressed andunwound to improve the mutual ®t between pro-tein and DNA.

In principle, these ``activated'' forms of DNAcannot be analyzed by traditional all-atom compu-tational approaches, i.e. Monte Carlo or moleculardynamics simulations of a macromolecule and thesurrounding solvent environment, since the gener-ation of these states requires high temperaturesthat would break base-pairs and rearrange thebackbone. As a consequence, an alternativeapproach is needed to simulate the imposed highenergy states.

To study such a complex system, the number ofdegrees of freedom must be reduced to a minimumand the independent parameters must be chosenso that they re¯ect the property of interest. For thispurpose, we have used the generalized coordinates

of bases as independent parameters, both in localand global (helical) coordinate frames, an approachwhich has proven to be effective in previous com-puter simulations of DNA bending and twisting(Zhurkin et al., 1991). The interchange betweenglobal and local representations is straightforwardand offers a distinct advantage for computer simu-lations, as well as for visualization and interpret-ation of the obtained structures. Here we useglobal helical rise, the natural measure of thelength of a stretched DNA molecule, as an inde-pendent variable, and study the conformationalchanges that accompany imposed chain extensionand compression. This contrasts with previouswork where chain extension is brought about byarbitrarily imposed forces on the sugar-phosphateends of the molecule (Konrad & Bolonick, 1996;Lebrun & Lavery, 1996). Our choice of a natural``reaction coordinate'' avoids problems associatedwith the correct parameterization of applied forcesand the apparent dependence of chain confor-mation on the sites of pulling (Cluzel et al., 1996;Lebrun & Lavery, 1996). The simpli®cations intro-duced in our model, in particular the imposedsymmetry constraints on homopolymers, clearlyminimize sequence-dependent structural featuresof the duplex. The energetic and structuralresponses to stretching and compression are there-fore very similar for poly(dG) �poly(dC) and poly(dA) �poly(dT) deformed under ``low'' and ``highsalt'' conditions. Remarkably, the energy optimiz-ation yields most of the right-handed forms ofDNA found in crystal and ®ber structures ofdouble helices and their complexes with proteinsand drugs. Thus, a comprehensive classi®cation ofknown double-helical forms becomes possible.

Stretching transition models

Our simulations reveal a conformational ``domi-no effect'' that accounts for the stretching-inducedphase transition found at �1.6 times the normalextension of B-DNA (Bensimon, 1996; Cluzel et al.,1996; Smith et al., 1996; Baumann et al., 1997). Themodel draws upon three principal ®ndings. First,there are several hyperfamilies of duplex formscharacterized by qualitatively different responsesto imposed stretching at the base-pair level. In par-ticular, the base-pair inclination decreases uponDNA stretching in the A1B1 and AtBt duplexes, butincreases in the A2B2 and A3B3 forms (Figure 3(c)and (d)). Second, the ®rst set of hyperfamilies(A1B1 and AtBt) dominates at ``normal'' degrees ofDNA extension, whereas the latter set (A2B2 andA3B3) is preferable at ``extreme'' levels of defor-mation. Third, the two classes of structures areenergetically equivalent at ``intermediate'' stretch-ing (Figure 1). The substantial differences in base-pair inclination between these equipotential formsintroduce a steric barrier at the boundariesbetween the different conformations (Olson et al.,1999). The considerable magnitude of this barriercompared to the energy of base stacking is consist-

Page 19: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

DNA Stretching and Compression 1319

ent with the sharp cooperative transition observedat this level of DNA extension.

Thus, the over-stretching of single DNA mol-ecules can be described (to a ®rst approximation)by the one-dimensional Ising model that has alsobeen used to account for the B! A and B! Ztransitions of the double helix (Ivanov et al., 1974,1983). According to our calculations, the rearrange-ment of base-pairs caused by DNA stretching isgreater than that in the B! A transition, butless than those needed to effect a B! Z transition(Ivanov et al., 1983; Olson et al., 1983).

The present calculations, however, incorporatean implicit model of the solvent environment anda two-well potential for sugar puckering, whichpreclude treatment of a possible stretching-inducedswitch of sugar conformation. Molecular dynamicsor Monte Carlo simulations treating counterionsand water molecules explicitly (Cheatham &Kollman, 1996; Yang & Pettitt, 1996; Sprous et al.,1998), may be able to determine whether the un-usual AtBt structures found here over the rangeof 4.0-5.0 AÊ /bp are indeed favored (providedthat the helical rise is appropriately restrained).Open questions include whether a B-to-A-like tran-sition is facilitated upon stretching, and what rolenon-canonical forms may play in reducing thecooperativity of DNA stretching in the presence ofdehydrating agents (see Results and Discussion).Clearly, further experiments and theoretical ana-lyses are needed.

Signi®cantly, comparable conformational order-ing is found in simulations to be reported else-where of the longitudinal deformations of thealternating poly(dTA) �poly(dTA) and poly(dCG) �poly(dCG) copolymers. While the pyrimi-dine-purine dimer steps are slightly more respon-sive than the purine-pyrimidine steps to imposedglobal stretching and compression, the over-stretching transition occurs at an equivalent levelof extension and entails the identical conformation-al interconversions, i.e. A1! A3 and B1! Bt! B2,reported here. Base sequence thus plays the samesort of subtle role in the DNA over-stretching tran-sition as it does in guiding the natural folding anddeformability of the unstressed double helix (Olsonet al., 1998). Moreover, the over-stretching tran-sition occurs at nearly the same degree of extensionin different chains, e.g. EMBL3 l DNA (Cluzel etal., 1996) and l bacteriophage DNA (Smith et al.,1996), but the amount of force needed to effect thetransformation depends on sequence, e.g. �65 pNfor l DNA and poly(dCG) �poly(dCG) versus �35pN for poly(dTA) �poly(dTA) (Rief et al., 1999). Ourenergy calculations reveal a common ``dominoeffect'' in the over-stretching of different sequences,but do not directly probe the conformationalbarriers separating stretched helices of equivalentinternal energy.

Extended states in the complex withRecA protein

The extended DNA observed in recombination®laments (Stasiak & Egelman, 1988) is arguably thebest known example of DNA stretching in biologi-cal systems. Remarkably, the degree of DNA exten-sion in the complex, 5.1 AÊ /bp, is close to thehelical extension (5.3-5.6 AÊ /bp) corresponding tothe phase transition observed by DNA micromani-pulations (Bensimon, 1996; Cluzel et al., 1996;Smith et al., 1996; Baumann et al., 1997), and to theinterval (4.0-5.0 AÊ /bp) of inter-family B1! Bt andB1! A1/At transitions predicted here. Apparently,DNA extended to �5 AÊ /bp is characterized by anincreased conformational lability: various helicalforms coexist in the same region of DNA confor-mation space, thereby enabling interconversionsbetween equi-potential forms (Figure 1(a) and (b)).The conformational variability appears to be criti-cal for the mechanistically complicated process ofhomology recognition and strand exchange. Interms of detailed functional implications, theunwinding and displacement of base-pairs, linkedin this study to DNA stretching, enhance contactswith the DNA interior and reduces the probabilityof mispairing between double and single strands(Figure 9).

Earlier, one of us has predicted a B-to-A-liketransition in DNA stretched under the restrictionsimposed by the geometry of the RecA-DNA ®la-ment in the so-called ``extended R-form'' (Zhurkinet al., 1994a,b). This transition, subsequently con-®rmed by the partial C20-endo to C30-endo switchingof sugar puckering in FTIR and NMR studies ofDNA-RecA complexes (Dagneaux et al., 1995;Nishinaka et al., 1997, 1998), also agrees with thepresent study: A1-forms are favored but B1-formsare disfavored at �5 AÊ /bp stretching (Figure 1(a)and (b)). Like the earlier extended R-form model(Zhurkin et al., 1994b), the DNA stretched here hasan increased hydrophobic surface compared tocanonical A and B-form DNA. In principle, thisfeature could be an additional factor stabilizing theB-to-A switch in RecA-bound DNA, provided thatwater activity is decreased inside the RecA ®la-ment cavity. On the other hand, the decreasedcooperativity of ``physically manipulated'' DNAstretching detected in the presence of ethanol(Baumann et al., 1997), can be related to the sameB-to-A switch (see above). Thus, the RecA-DNAcomplex is perhaps another fascinating example ofthe synergism between the energy preferencesobserved for an isolated DNA molecule, and the``design'' of the protein machinery that takesadvantage of these preferences.

Compressed states and TATA-boxdeformations

Systematic compression of DNA homopolymersyields structures similar in terms of both dimericand helical parameters to some of the deformed

Page 20: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

1320 DNA Stretching and Compression

steps observed in crystal complexes of DNA withTATA-box binding proteins (J. L. Kim et al., 1993;Y. Kim et al., 1993; Kim & Burley, 1994; Nikolov etal., 1995, 1996; Juo et al., 1996; Tan et al., 1996).Paradoxically, the seven DNA steps contacted byprotein appear to be globally but not locally com-pressed, in addition to being strongly bent andunwound, as described earlier (J. L. Kim et al.,1993). The conformations of the two highly kinkedDNA steps at the ends of the sequence resemblethe Ak structures generated here as well as a com-pressed poly(dTA) �poly(dTA) copolymer to bedescribed elsewhere. The intermediate non-kinkedsteps closely match compressed A3 and B3 struc-tures which feature unusually wide minor grooves.

Despite the overall difference between RecA andTBP-induced deformations of DNA, the two sys-tems have features in common that may be func-tionally important: base-pairs are accessible forrecognition and single bases are easily ¯ipped outof the duplex core much like the single bases inDNA-methyltransferase complexes (Roberts &Cheng, 1998). In the case of RecA, the ¯ipping ofbases is an essential part of the strand exchangeprocess (Howard-Flanders et al., 1984; Zhurkin etal., 1994a,b; Nishinaka et al., 1997, 1998). In thecase of TBP, the situation is somewhat more com-plicated. Here we must assume, in the absence ofdetailed structural information, that the distortionsof DNA in the TBP complex are relevant to theinteractions of DNA with RNA polymerase (seealso Kim et al., 1997; Robert et al., 1998). Accordingto conventional thinking (von Hippel, 1998), DNAbase-pairs are somehow ``melted'' during RNAsynthesis so that the RNA can form Watson-Crickpairs with the DNA bases. The disruption of DNAbase-pairing may not necessarily follow the samestretching-induced pathway as in the RecA-DNAcomplex where the duplex is untwisted, bothgrooves are widened, and the bases are easily¯ipped out. DNA in the cell, especially in eukar-yotes, is tightly packed, and longitudinal stretchingis extremely expensive in terms of space: astretched ``bubble'' containing �20 base-pairswould be �100 AÊ long if the base-pair rise wereincreased to that in the RecA ®lament.

Evolution has apparently chosen an alternateway to open the double helix: DNA is sharplybent, perhaps as in the TBP complex, and suchdeformations have two important consequences:(1) individual bases are readily ¯ipped out ofthe base-pair stack (Ramstein & Lavery, 1988);(2) the large kink increases the distance betweenadjacent bases. RNA-DNA mismatches can thusbe avoided along the same lines as in the RecA-stretched DNA model (Zhurkin et al., 1994a).Therefore, TBP-like manipulations of DNA, inprinciple, can achieve the same two goals whichare secured by DNA extension, but without thewaste of space. The paradoxical deformations,i.e. local extension versus global compression, ofTBP-bound DNA described here perhaps re¯ecta novel mechanism of information readout from

the double helix. Such a process is spatiallyeconomical, kinetically ef®cient, and error-resist-ant. Notably, while direct structural data onDNA in the active center of RNA polymeraseare still missing, the deformations in DNAbound to the reverse transcriptase (Ding et al.,1998; Huang et al., 1998) are consistent with theabove scenario. In particular, the DNA duplexundergoes a local B! A transition, bendingsharply toward the major groove and therebyexposing base-pairs in the minor groove andfacilitating their opening.

Finally, the ``activated'' forms of DNA analyzedin this study mimic the protein-induced defor-mations of the double helix in known complexesand are presumably tied to other biochemical pro-cesses. The distortions of DNA in such cases maylead to either complete or partial disruption ofWatson-Crick base-pairs. In the absence of pro-teins, physical manipulations may yield these highenergy ``activated'' states and thereby suggestas new models for the design of DNA-bindingpharmaceuticals.

Methods

Conformational variables

Nucleic acid polymers are described in terms of gener-alized coordinates that de®ne the positions andorientations of bases and sugar-phosphate backbones.The con®gurations of successive base-pairs are speci®edby six independent ``pair'' parameters: Propeller Twist,Buckle, Opening rotations and Shear, Stretch, Staggerdisplacements; and six ``step'' variables, Twist, Tilt, Rollangles and Shift, Slide, Rise translations (see Dickerson etal., 1989) for qualitative de®nitions). Coordinates of back-bone atoms are found using a standard chain-closure algorithm (Zhurkin et al., 1978), which requirestwo additional variables per nucleoside unit (thepseudorotational phase angle P and the glycosyltorsion angle w) and an initial estimate of backbone geo-metry (e.g. trans versus gauche� forms of the exocyclicO50ÐC50ÐC40ÐC30 backbone torsion g). In addition toenergy minimization, this parameter set has proved tobe ef®cient in Monte Carlo simulations of DNA helicesbased on the DNAminiCarlo software package(Zhurkin et al., 1991).

Regular homopolymers composed of identical base-pair units are used to reduce the number of independentvariables. Residues in complementary strands are furtherassumed to adopt the same sugar puckering and glyco-syl angles. These simpli®cations make it possible to carryout a detailed search of conformation space, and alsoprovide an easy way to control the length of themolecule. Indeed, for a regular DNA molecule it is poss-ible to calculate a global axis and to determine all helicalparameters with respect to this axis (see below). Themagnitude of the helical rise serves as the naturalmeasure of chain stretching or compression.

Helical parameters

The helical parameters used as independent degreesof freedom in the calculations are expressed in termsof the global helical axis relating neighboring base-

Page 21: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

Figure 11. Schematic of coordinate frames andparameters used to relate local and helical parameters ofdimer steps. Frames Ai and Ai � 1 lie along the helicalaxis S, while frames Bi and Bi � 1 are embedded in theplanes of neighboring base-pairs.

DNA Stretching and Compression 1321

pairs. While ®nding such an axis is non-trivial in thecase of a deformed (asymmetric) structure (Lavery &Sklenar, 1990), identi®cation is straightforward for aregularly repeating polymer (Miyazawa, 1961; Rosen-berg et al., 1976). Here we present formulas relatingthe ``helical parameters`` of a DNA homopolymer,helical twist (h), helical shift or x-displacement (Dx),helical slide or y-displacement (Dy), and step height(h), to the ``local parameters`` describing the geometryof successive base-pairs: Twist (), Tilt (t), Roll (r),Shift (�x), Slide (�y), and Rise (�z).

We start by embedding coordinate frames Bi and Bi�1

in the planes of neighboring base-pairs and de®ning S asthe global helical axis (Figure 11). We introduce twoadditional coordinate frames, Ai and Ai�1, positioned attheir origins along S such that transformations betweenpairs of coordinate axes are de®ned by:

rAi�MrBi

�Dx

Dy

0

264375;

rAi�1�MrBi�1

�Dx

Dy

0

264375;

rAi� Z�h�rAi�1

�0

0

h

264375;

rBi� T�; t;r�rBi�1

��x

�y

�z

264375;

�1�

Here the rX are vectors in coordinate system X, M is thetransformation matrix that orients base-pairs in the glo-bal coordinate frame, Z(h) is the matrix de®ningrotation about the helical axis by angle h, T(,t,r) isthe matrix that relates neighboring base-pairs, (Dx, Dy, h)are the components of helical displacement, and (�x, �y,�z) are the components of local dimer displacement.

Rearrangement of the preceding expressions leads tothe following relationships between local and helicalparameters:

T�; t;r� �Mÿ1Z�h�M; �2�

M�x�y�z

24 35 � Z�h�Dx

Dy

0

24 35ÿ Dx

Dy

0

24 35� 00h

24 35 �3�

Local parameters are obtained from global parametersin two steps. First, the transformation matrix T relat-ing neighboring base-pairs is constructed from M andZ using the given helical inclination Z, tip y, andtwist h in equation (2). Here we take M � X(y)Y(Z),where X and Y designate matrices for rotations by thedesignated angles about the local x and y-coordinateaxes. Thus, matrix T is calculated asT(,t,r) � Y(ÿZ)X(ÿy) Z(h) X(y)Y(Z). Symmetricbase-pair step parameters can then be extractedfrom selected elements of T. For example, withT expressed as the symmetrized matrix productZ(/2 � a)Y(b)Z(/2 ÿ a) (Zhurkin et al., 1979), Twist(), Tilt (t), and Roll (r) are given by:

tan � T21 ÿ T12

T11 � T22�4a�

t � b sin a �4b�

r � b cos a �4c�Here b is the net bending angle, and a the phaseangle of bending, de®ned by:

cos b � T33 �5a�

tan a � T32 ÿ T23

T31 ÿ T13�5b�

Other de®nitions of T, e.g. (Babcock et al., 1994; ElHassan & Calladine, 1995), yield related expressionsfor symmetric base-pair step parameters. Localparameters used here follow the de®ntions of theCompDNA software package (Gorin et al., 1995).

Rearrangement of equation (3) similarly yields localtranslational parameters in terms of helical rise h, andthe Dx,Dy components of helical shift (x and y displace-ments):

�x

�y

�z

264375 �

Mÿ1

�cos h ÿ 1� ÿ sin h 0

sin h �cos h ÿ 1� 0

0 0 1

264375 Dx

Dy

h

264375:

�6�

Alternatively, helical parameters, h, y, and Z, can bedetermined from local parameters. The angular com-

Page 22: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

1322 DNA Stretching and Compression

ponents are extracted from the elements of T constructedfrom a given set of local dimer rotations (, t, r):

cos h � 1

2�T11 � T22 � T33 ÿ 1� �7a�

tan y � T23 ÿ T32

T12 ÿ T21�7b�

2 sinZ sin h � T13 ÿ T31 �7c�Substitution of these values and the local dimer displace-ment (�x, �y, �z) in equation (5) then yields the com-ponents of helical displacement:

Dx �Px�cos h ÿ 1� � Py sin h

2�1ÿ cos h� �8a�

Dy �Py�cos h ÿ 1� ÿ Px sin h

2�1ÿ cos h� �8b�

h � Pz �8c�where:

Px

Py

Pz

24 35 �M�x�y�z

24 35 �9�

Energies

Potential energies are calculated using standardall-atom potential functions (Zhurkin et al., 1981; Poltev& Shulyupina, 1986). Notably, these potentials reproducethe observed enthalpies of base stacking (Zhurkin et al.,1981) and planar base-base interactions (Poltev &Shulyupina, 1986). A sigmoidal, distance-dependentdielectric constant of the form (Lavery, 1988; Ramstein &Lavery, 1988),

e�rij� � e1 ÿ �e1 ÿ 1� 1

2�srij�2 � srij � 1

� �exp�ÿsrij� �10�

is chosen to account for solvent effects. We use twoextreme versions of this function: a steeper version(de®ned by the controlling parameter s � 0.356) com-bined with neutralized phosphate groups, which weterm ``high salt'' conditions, because of the masking ofmost electrostatic interactions, and a version with lowerslope (s � 0.16) combined with full (ÿ1) charges on phos-phate groups, which we call ``low salt''. We set e1 to78.3, the dielectric constant of water, so that e rangesfrom 8.2 to 21.4 at high salt and 2.0 to 4.7 at low salt asrij, the non-bonded distance between atoms i and j,increases from 3 to 5 AÊ . We thus can model a broadrange of ionic conditions as well as see the in¯uence ofshort and long-range electrostatic interactions on DNAconformation.

Rather than simulating the interconversion between Aand B helical forms, we restrict the furanose rings topuckered states in either the C20-endo or C30-endo confor-mational range. This is accomplished by replacing thestandard (i.e. ®xed amplitude, tm � 38 �) pseudorota-tional energy pro®le (Olson, 1982a; Zhurkin et al., 1982)

by a harmonic expression ®t to the computed energyminimaat 298 K. Speci®cally, the energy is increased by RT/2when the phase angle P deviates from the minimumenergy rest state, Po � 171 � for C20-endo and 9 � forC30-endo forms, by the root-mean-square ¯uctuation,h�P2i1/2. We consider conformational asymmetry byintroducing force constants that re¯ect positive versusnegative pseudorotational changes, i.e. h�P�

2 i1/2 � 12 �versus h�Pÿ

2 i1/2 � 14 � for C20-endo and h�P�2 i1/2 � 14 �

versus h�Pÿ2 i1/2 � 15 � for C30-endo states. We thus force

all sugars to be either B or A-like and to mimic the pre-ferred ¯uctuations in P.

Minimization

The minimization procedure uses the method ofRosenbrock (1960) with a linear search, modi®ed accord-ing to the ``numerical recipes'' provided by Fletcher &Reeves (1964). Recent analyses of the mutual correlationsof base-pair step parameters in DNA crystal structures(Gorin et al., 1995; Olson et al., 1998) suggest ways to per-form the conformational search more ef®ciently. Thus,for a given polymer stretch (i.e. ®xed helical rise), mini-mization in the (inclination, helical twist) plane, two ofthe most deformable helical variables, usually leads toan energetically tolerable structure. Stable minima arealso obtained with the corresponding local parameters,Roll and Twist, used as independent variables. Indeed,we have found that we can carry out minimizationsinterchangeably in local and helical parameter space andidentify the same low energy states. This furtherincreases the effectiveness of the energy optimizationprocedure. Here, in addition, independent parametersknown to span a limited range in DNA crystal struc-tures, Tilt (or helical tip), local Shift (or helical y-displace-ment), Buckle, and Opening, are set to zero. Each of theoptimized structures is represented by a point on thepro®les of energy versus helical rise in Figure 1 and theplots of local and helical parameters in Figures 2-5.These plots are obtained from calculations at incrementsin helical rise of 0.05 AÊ or less.

Acknowledgements

We are grateful to Drs Andrzej Stasiak, Mikhail A.Livshits, and Suse Broyde for useful discussions and toDrs Takehiko Shibata and Taro Nishinaka for providingatomic coordinates of their model of single-strandedDNA bound to RecA prior to public release. Support ofthis work through USPHS grant GM20861 and theNational Cancer Institute, under contract N10-CO-74102,is gratefully acknowledged. K.M.K. also acknowledgesfellowship support from the Program in Mathematicsand Molecular Biology based at Florida State University.Computations were carried out at the Rutgers UniversityCenter for Computational Chemistry and through thefacilities of the Nucleic Acid Database project (NSF grantDBI 9510703). This work was taken in part from thedissertation of K.M.K. written in partial ful®llment of therequirements for the degree of Doctor of Philosophy,Rutgers University, 1998.

Page 23: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

DNA Stretching and Compression 1323

References

Arnott, S. (1970). The geometry of nucleic acids. Prog.Biophys. Mol. Biol. 21, 265-319.

Arnott, S., Chandrasekaran, R., Hukins, D. W. L., Smith,P. J. C. & Watts, L. (1974). Structural detailsobserved for DNAs containing alternating purineand pyrimidine sequences. J. Mol. Biol. 88, 523-533.

Auble, D. T. & deHaseth, P. L. (1988). Promoter recog-nition by Escherichia coli RNA polymerase. In¯uenceof DNA structure in the spacer separating the ÿ10and ÿ35 regions. J. Mol. Biol. 202, 471-482.

Babcock, M. S., Pednault, E. P. D. & Olson, W. K.(1994). Nucleic acid structure analysis. Mathematicsfor local Cartesian and helical structure parametersthat are truly comparable between structures. J. Mol.Biol. 237, 125-156.

Baliga, R., Singleton, J. W. & Dervan, P. B. (1995).RecA �oligonucleotide ®laments bind in the minorgroove of double-stranded DNA. Proc. Natl Acad.Sci. USA, 92, 10393-10397.

Barber, A. M., Zhurkin, V. B. & Adhya, S. (1993). CRP-binding sites: evidence for two structural classeswith 6 bp and 8 bp spacers. Gene, 130, 1-8.

Baumann, C. G., Smith, S. B., Bloom®eld, V. A. &Bustamante, C. (1997). Ionic effects on the elasticityof single DNA molecules. Proc. Natl Acad. Sci. USA,94, 6185-6190.

Bensimon, D. (1996). Force: a new structural control par-ameter? Structure, 4, 885-889.

Berman, H. M., Olson, W. K., Beveridge, D. L.,Westbrook, J., Gelbin, A., Demeny, T., Hsieh, S.-H.,Srinivasan, A. R. & Schneider, B. (1992). The nucleicacid database: a comprehensive relational databaseof three-dimensional structures of nucleic acids.Biophys. J. 63, 751-759.

Bernstein, F. C., Koetzle, T. F., Williams, G. J., Meyer,E. E., Jr, Brice, M. D., Rodgers, J. R., Kennard, O.,Shimanouchi, T. & Tasumi, M. (1977). The ProteinData Bank: a computer-based archival ®le formacromolecular structures. J. Mol. Biol. 112, 535-542.

Calladine, C. R. & Drew, H. R. (1984). A base-centeredexplanation of the B-to-A transition in DNA. J. Mol.Biol. 178, 773-782.

Chandrasekaran, R., Arnott, S., Banerjee, A., Campbell-Smith, S., Leslie, A. G. W. & Puigjaner, L. (1980).Some new polynucleotide structures and some newthoughts about old structures. In Fiber DiffractionMethods. ACS Symposium Series 141 (French, A. D. &Gardner, K. H., eds), pp. 483-502, AmericanChemical Society, Washington, DC.

Cheatham, T. E., III & Kollman, P. A. (1996). Obser-vation of the A-DNA to B-DNA transition duringunrestrained molecular dynamics in aqueoussolution. J. Mol. Biol. 259, 434-444.

Cluzel, P., Lebrun, A., Heller, C., Lavery, R., Viovy,J.-L., Chatenay, D. & Caron, F. (1996). DNA: anextensible molecule. Science, 271, 792-794.

Crick, F. H. C. & Watson, J. D. (1954). The complemen-tary structure of deoxyribonucleic acid. Proc. Roy.Soc. ser. A, 223, 80-96.

Dagneaux, C., Porumb, H., Liquier, J., Takahashi, M. &Taillandier, E. (1995). Conformations of three-stranded DNA structures formed in presence and inabsence of the RecA protein. J. Biomol. Struct.Dynam. 13, 465-470.

Dickerson, R. E. (1998). DNA bending: the prevalence ofkinkiness and the virtures of normality. Nucl. AcidsRes. 26, 1906-1926.

Dickerson, R. E., Bansal, M., Calladine, C. R.,Diekmann, S., Hunter, W. N., Kennard, O., vonKitzing, E., Lavery, R., Nelson, H. C. M., Olson,W. K., Saenger, W., Shakked, Z., Sklenar, H.,Soumpasis, D. M. & Tung, C.-S., et al. (1989). De®-nitions and nomenclature of nucleic acid structureparameters. J. Mol. Biol. 208, 787-791.

Ding, J., Das, K., Hsiou, Y., Sara®anos, S. G., Clark,A. D., Jr, Jacobo-Molina, A., Tantillo, C., Hughes,S. H. & Arnold, E. (1998). Structure and functionalimplications of the polymerase active site region ina complex of HIV-1 RT with a double-strandedDNA template-primer and an antibody Fab frag-ment at 2.8 AÊ resolution. J. Mol. Biol. 284, 1095-1111.

Edmondson, S. P. & Johnson, W. C., Jr (1986). Base tiltof B-form poly[d(G)]-poly[d(C)] and the B- and Z-conformations of poly[d(GC)]-poly[d(GC)] in sol-ution. Biopolymers, 25, 2335-2348.

Egelman, E. & Stasiak, A. (1986). Structure of helicalRecA-DNA complexes. Complexes formed in thepresence of ATP-g-S or ATP. J. Mol. Biol. 191, 677-697.

El Hassan, M. A. & Calladine, C. R. (1995). The assess-ment of the geometry of dinucleotide steps indouble-helical DNA: a new local calculation schemewith an appendix. J. Mol. Biol. 251, 648-664.

Fernandez, L. G., Subirana, J. A., Verdaguer, N.,Pyshnyi, D., Campos, L. & Malinina, L. (1997).Structural variability of A-DNA in crystals ofd(pCpCpCpGpCpGpGpG). J. Biomol. Struct. Dynam.15, 151-163.

Ferrin, T. E., Huang, C. C., Jarvis, L. E. & Langridge, R.(1988). The MIDAS display system. J. Mol. Graphics,6, 13-27.

Fletcher, R. & Reeves, C. M. (1964). Function minimiz-ation by conjugate gradients. Comput. J. 7, 149-154.

Franklin, R. E. & Gosling, R. G. (1953). Molecular con-®guration in sodium thymonucleate. Nature, 171,740-741.

Gelbin, A., Schneider, B., Clowney, L., Olson, W. K. &Berman, H. M. (1996). Geometric parameters innucleic acids: sugar and phosphate constituents.J. Am. Chem. Soc. 118, 530-540.

Gorin, A. A., Zhurkin, V. B. & Olson, W. K. (1995). B-DNA twisting correlates with base-pair mor-phology. J. Mol. Biol. 247, 34-48.

Grzeskowiak, K., Yanagi, K., PriveÂ, G. G. & Dickerson,R. E. (1991). The structure of B-helical C-G-A-T-C-G-A-T-C-G and comparison with C-C-A-A-C-G-T-T-G-G. J. Biol. Chem. 266, 8861-8883.

Guzikevich-Guerstein, G. & Shakked, Z. (1995). A novelform of the DNA double helix imposed on theTATA-box by the TATA-binding protein. NatureStruct. Biol. 3, 32-37.

Howard-Flanders, P., West, S. C. & Stasiak, A. (1984).Role of RecA protein spiral ®laments in geneticrecombination. Nature, 309, 215-219.

Howard-Flanders, P., West, S. C., Cassuto, E., Hahn,T.-R., Egelman, E. & Stasiak, A. (1987). Structure ofRecA spiral ®laments and their role in homologouspairing and strand exchange in genetic recombina-tion. In DNA Replication and Recombination (Kelly, T.& McMacken, R., eds), pp. 609-617, Alan R. Liss,Inc., New York.

Huang, H., Chopra, R., Verdine, G. L. & Harrison, S. C.(1998). Structure of a covalently trapped catalyticcomplex of HIV-1 reverse transcriptase: implicationsfor drug resistance. Science, 282, 1669-1675.

Page 24: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

1324 DNA Stretching and Compression

Ivanov, V. I., Minchenkova, L. E., Schyolkina, A. K. &Poletayev, A. I. (1973). Different conformations ofdouble-stranded nucleic acid in solution as revealedby circular dichroism. Biopolymers, 12, 89-110.

Ivanov, V. I., Minchenkova, L. E., Minyat, E. E., Frank-Kamenetskii, M. D. & Schyolkina, A. K. (1974). TheB to A transition of DNA in solution. J. Mol. Biol.87, 817-833.

Ivanov, V. I., Minchenkova, L. E., Minyat, E. E. &Schyolkina, A. K. (1983). Cooperative transitions inDNA with no separation of strands. Cold SpringHarbor Symp. Quant. Biol. 47, 243-250.

Ivanov, V. I., Minchenkova, L. E., Chernov, B. K.,McPhie, P., Ryu, S., Garges, S., Barber, A. M.,Zhurkin, V. B. & Adhya, S. (1995). CRP-DNA com-plexes: inducing the A-like form in the binding siteswith an extended central spacer. J. Mol. Biol. 245,228-240.

Juo, Z. S., Chiu, T. K., Leiberman, P. M., Baikalov, I.,Berk, A. J. & Dickerson, R. E. (1996). How proteinsrecognize the TATA-box. J. Mol. Biol. 261, 239-254.

Kim, J. L. & Burley, S. K. (1994). 1.9 AÊ resolution re®nedstructure of TBP recognizing the minor groove ofTATAAAAG. Nature Struct. Biol. 1, 638-653.

Kim, J. L., Nikolov, D. B. & Burley, S. K. (1993). Co-crys-tal structure of TBP recognizing the minor grooveof a TATA element. Nature, 365, 520-527.

Kim, T.-K., LaGrange, T., Wang, Y.-H., Grif®th, J. D.,Reinberg, D. & Ebright, R. H. (1997). Trajectory ofDNA in the RNA polymerase II transcription prei-nitiation complex. Proc. Natl Acad. Sci. USA, 94,12268-12273.

Kim, Y., Geiger, J. H., Hahn, S. & Sigler, P. B. (1993).Crystal structure of a yeast TBP/TATA-box com-plex. Nature, 365, 512-520.

Konrad, M. W. & Bolonick, J. I. (1996). Moleculardynamics simulation of DNA stretching is consist-ent with the tension observed for extension andstrand separation and predicts a novel ladder struc-ture. J. Am. Chem. Soc. 118, 10989-10994.

Kowalczykowski, S. C. & Eggleston, A. K. (1994). Hom-ologous pairing and DNA strand-exchange proteins.Annu. Rev. Biochem. 63, 991-1043.

Lacks, S. (1966). Integration ef®ciency and geneticrecombination in pneumococcal transformation.Genetics, 53, 207-235.

Lavery, R. (1988). Junctions and bends in nucleicacids: a new theoretical modeling approach. InStructure & Expression: DNA Bending and Curvature(Olson, W. K., Sarma, M. H., Sarma, R. H. &Sundaralingam, M., eds), vol. 3, pp. 191-211,Adenine Press, Guilderland, NY.

Lavery, R. & Sklenar, H. (1989). De®ning the structureof irregular nucleic acids: conventions andprinciples. J. Biomol. Struct. Dynam. 6, 655-667.

Lavery, R. & Sklenar, H. (1990). A quantitative descrip-tion of the conformation of biological molecules. InStructure and Methods, DNA Protein Complexes andProteins (Sarma, R. H. & Sarma, M. H., eds), vol. 2,pp. 215-235, Adenine Press, Schenectady, NY.

Lavery, R., Zakrzewska, K. & Sklenar, H. (1995).JUMNA (junction minimisation of nucleic acids).Comput. Phys. Commun. 91, 135-158.

Lebrun, A. & Lavery, R. (1996). Modelling extremestretching of DNA. Nucl. Acids Res. 24, 2260-2267.

Lebrun, A., Shakked, Z. & Lavery, R. (1997). Local DNAstretching mimics the distortion caused by theTATA box-binding protein. Proc. Natl Acad. Sci.USA, 94, 2993-2998.

Leslie, A. G. W. & Arnott, S. (1978). Structure ofthe single-stranded polyribonucleotide poly(20-O-methylcytidylic acid). J. Mol. Biol. 119, 399-414.

Lu, X. & Olson, W. K. (1999). Resolving the discrepan-cies among nucleic acid conformational analyses.J. Mol. Biol. 285, 1563-1575.

Lu, X. J., El Hassan, M. A. & Hunter, C. A. (1997). Struc-ture and conformation of helical nucleic acids: anal-ysis program (SCHNAaP). J. Mol. Biol. 273, 668-680.

Luger, K., Mader, A. W., Richmond, R. K., Sargent, D. F.& Richmond, T. J. (1997). Crystal structure of thenucleosome core particle at 2.8 AÊ resolution. Nature,389, 251-260.

Malkov, V. A., Panyutin, I. G., Neumann, R. D.,Zhurkin, V. B. & Camerini-Otero, R. D. (1999). Thearrangement of the three DNA strands within theRecA protein mediated synaptic complexes. Biophys.J. 76, A130.

Mandel-Gutfreund, Y., Margalit, H., Jernigan, R. L. &Zhurkin, V. B. (1998). A role for CH � � �O inter-actions in protein-DNA recognition. J. Mol. Biol.277, 1129-1140.

Manning, G. (1978). The molecular theory of polyelectro-lyte solutions with applications to the electrostaticproperties of polynucleotides. Quart. Rev. Biophys.11, 179-246.

McEntee, K., Weinstock, G. M. & Lehman, I. R. (1981).Binding of the RecA protein of Escherichia coli tosingle- and double-stranded DNA. J. Biol. Chem.256, 8835-8844.

Miyazawa, T. (1961). Molecular vibrations and structureof high polymers. II. Helical parameters of in®nitepolymer chains as functions of bond lengths, bondangles, and internal rotation angles. J. Polymer Sci.55, 215-231.

Nikolov, D. B., Chen, H., Halay, E. D., Usheva, A. A.,Disatake, K., Lee, D. K., Roeder, R. G. & Burley,S. K. (1995). Crystal structure of a TFIIB-TBP-TATA-element ternary complex. Nature, 377, 119-128.

Nikolov, D. B., Chen, H., Halay, E. D., Hoffmann, A.,Roeder, R. G. & Burley, S. K. (1996). Crystal struc-ture of a human TATA box-binding protein/TATAelement complex. Proc. Natl Acad. Sci. USA, 93,4862-4867.

Nishinaka, T., Ito, Y., Yokoyama, S. & Shibata, T. (1997).An extended DNA structure through deoxyribose-base stacking induced by RecA protein. Proc. NatlAcad. Sci. USA, 94, 6623-6628.

Nishinaka, T., Shinohara, A., Ito, Y., Yokoyama, S. &Shibata, T. (1998). Base-pair switching by intercon-version of sugar puckers in DNA extended by pro-teins of RecA-family: a model for homology searchin homologous genetic recombination. Proc. NatlAcad. Sci. USA, 95, 11071-11076.

NordeÂn, B., Seth, S. & Tjerneld, F. (1978). Renaturationof DNA in ethanol-methanol solvent induced bycomplexation with methyl green. Biopolymers, 17,523-525.

NordeÂn, B., Elvingson, C., Kubista, M., Sjoberg, B.,Ryberg, H., Ryberg, M., Mortensen, K. &Takahashi, M. (1992). Structure of RecA-DNA com-plexes studied by combination of linear dichroismand small-angle neutron scattering measurementson ¯ow-oriented samples. J. Mol. Biol. 226, 1175-1191.

NordeÂn, B., Wittung-Stafshede, P., Ellouze, C., Kim, H.-K., Mortensen, K. & Takahashi, M. (1998). Baseorientation of second DNA in RecA �DNA ®laments.

Page 25: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

DNA Stretching and Compression 1325

Analysis by combination of linear dichroism andsmall angle neutron scattering in ¯ow-orientedsolution. J. Biol. Chem. 273, 15682-15686.

Ogawa, T., Yu, X., Shinohara, A. & Egelman, E. H.(1993). Similarity of the yeast RAD51 ®lament tothe bacterial RecA ®lament. Science, 259, 1896-1899.

Olson, W. K. (1976). The spatial con®guration of orderedpolynucleotide chains. I. Helix formation and stack-ing. Biopolymers, 15, 859-878.

Olson, W. K. (1977). Spatial con®guration of orderedpolynucleotide chains: a novel double helix. Proc.Natl Acad. Sci. USA, 74, 1775-1779.

Olson, W. K. (1981). Understanding the motions ofDNA. In Biomolecular Stereodynamics (Sarma, R. H.,ed.), vol. 1, pp. 327-343, GuilderlandAdenine Press,New York.

Olson, W. K. (1982a). How ¯exible is the furanose ring?2. An updated potential energy estimate. J. Am.Chem. Soc. 104, 278-286.

Olson, W. K. (1982b). Theoretical studies of nucleic acidconformation: potential energies, chain statistics,and model building. In Topics in Nucleic Acid Struc-tures (Neidle, S., ed.), part 2 edit., pp. 1-79,Macmillan Press, London.

Olson, W. K. & Dasika, R. D. (1976). The spatial con-®guration of ordered polynucleotide chains. III.Polycyclonucleotides. J. Am. Chem. Soc. 98, 5371-5380.

Olson, W. K., Srinivasan, A. R., Marky, N. L. & Balaji,V. N. (1983). Theoretical probes of DNA confor-mation: examining the B-to-Z conformational tran-sition. Cold Spring Harbor Symp. Quant. Biol. 47, 229-241.

Olson, W. K., Gorin, A. A., Lu, X.-J., Hock, L. M. &Zhurkin, V. B. (1998). DNA sequence-dependentdeformability deduced from protein-DNA crystalcomplexes. Proc. Natl Acad. Sci. USA, 95, 11163-11168.

Olson, W. K., Kosikov, K. M., Colasanti, A., Gorin, A. A.& Zhurkin, V. B. (1999). Pulling and pushing theDNA double helix. In Proceedings of the Third Sym-posium on Biological Physics (Frauenfelder, H., ed.),American Institute of Physics , Woodbury, NY Inthe press.

Pearlman, D. A., Case, D. A., Caldwell, J. W., Ross,W. S., Cheatham, T. E., III, DeBolt, S., Ferguson, D.,Seibel, G. & Kollman, P. (1995). AMBER, a packageof computer programs for applying molecularmechanics, normal mode analysis, moleculardynamics and free energy calculations to simulatethe structural and energetic properties of molecules.Comput. Phys. Commun. 91, 1-41.

Podyminogin, M. A., Meyer, R. B. & Gamper, H. B.(1995). Sequence-speci®c covalent modi®cation ofDNA by cross-linking oligonucleotides. Catalysis byRecA and implication for the mechanism of synap-tic joint formation. Biochemistry, 34, 13098-13108.

Poltev, V. I. & Shulyupina, N. V. (1986). Simulation ofinteractions between nucleic acid bases by re®nedatom-atom potential functions. J. Biomol. Struct.Dynam. 3, 739-765.

Ramstein, J. & Lavery, R. (1988). Energetic couplingbetween DNA bending and base-pair opening. Proc.Natl Acad. Sci. USA, 85, 7231-7235.

Rich, A. (1959). An analysis of the relation betweenDNA and RNA. Ann. NY Acad. Sci. 81, 709-722.

Rief, M., Clausen-Schaumann, H. & Gaub, H. E. (1999).Sequence-dependent mechanics of single DNA mol-ecules. Nature Struct. Biol. 6, 346-349.

Robert, F., Douzlech, M., Forget, D., Egly, J.-M.,Greenblatt, J., Burton, Z. F. & Coulombe, B. (1998).Wrapping of promoter DNA around the RNA poly-merase II initiation complex induced by TFIIF. Mol.Cell, 2, 341-351.

Roberts, R. J. & Cheng, X. (1998). Base ¯ipping. Annu.Rev. Biochem. 67, 181-198.

Rosenberg, J. M., Seeman, N. C., Day, R. O. & Rich, A.(1976). RNA double helices generated from crystalstructures of double helical dinucleoside phos-phates. Biochem. Biophys. Res. Commun. 69, 979.

Rosenbrock, H. H. (1960). An automated method for®nding the greatest or least value of a function.Comput. J. 3, 175-184.

Satchwell, S. C., Drew, H. R. & Travers, A. A. (1986).Sequence periodicities in chicken nucleosome coreDNA. J. Mol. Biol. 191, 659-675.

Schneider, B., Neidle, S. & Berman, H. M. (1997). Con-formations of the sugar-phosphate backbone in heli-cal DNA crystal structures. Biopolymers, 42, 113-124.

Selsing, E., Arnott, S. & Ratliff, R. L. (1975). Confor-mations of poly(d(A-T-T))-poly(d(A-A-T)). J. Mol.Biol. 98, 243-248.

Smith, S. B., Cui, Y. & Bustamante, C. (1996). Over-stretching B-DNA: the elastic response of individualdouble-stranded and single-stranded DNA mol-ecules. Science, 271, 795-799.

Sprous, D., Young, M. A. & Beveridge, D. L. (1998).Molecular dynamics studies of the conformationalpreferences of a DNA double helix in water and anethanol/water mixture: theoretical considerations ofthe A$ B transition. J. Phys. Chem. B, 102, 4658-4667.

Stasiak, A. & Di Capua, E. (1982). The helicity of DNAin complexes with RecA protein. Nature, 299, 185-186.

Stasiak, A. & Egelman, E. (1988). Structure of helicalRecA-DNA complexes. III. The structural polarity ofRecA ®laments and functional polarity in the RecA-mediated strand-exchange reaction. J. Mol. Biol. 202,659-662.

Stasiak, A., Di Capua, E. & Koller, T. (1981). Elongationof duplex DNA by RecA protein. J. Mol. Biol. 151,557-564.

Steitz, T. A. (1990). Structural studies of protein-nucleicacid interaction: the sources of sequence-speci®cbinding. Quart. Rev. Biophys. 23, 205-280.

Story, R. M., Weber, I. T. & Steitz, T. A. (1992). Thestructure of the E. coli RecA protein monomer andpolymer. Nature, 355, 318-325.

Suzuki, M. & Yagi, N. (1995). Stereochemical basis ofDNA bending by transcription factors. Nucl. AcidsRes. 23, 2083-2091.

Tan, S., Hunziker, Y., Sargent, D. F. & Richmond, T. J.(1996). Crystal structure of a yeast TFIIA/TBP/DNA complex. Nature, 381, 127-134.

Travers, A. A. (1989). DNA conformation and proteinbinding. Annu. Rev. Biochem. 58, 427-452.

von Hippel, P. H. (1998). An integrated model of thetranscription complex in elongation, termination,and editing. Science, 281, 660-665.

Wahl, M. C., Rao, S. T. & Sundaralingham, M. (1996).The structure of r(UUC(GCG)) has 50-UU overhangexhibiting Hoogsten-like trans U �U base-pairs.Nature Struct. Biol. 3, 24-31.

Watson, J. D. & Crick, F. H. C. (1953). A structure fordeoxyribose nucleic acid. Nature, 171, 737-738.

Page 26: DNA Stretching and Compression: Large-scale Simulations …brd/Teaching/Bio/asmb/Papers/JMB/DNA... · DNA Stretching and Compression: Large-scale Simulations of Double Helical Structures

1326 DNA Stretching and Compression

Werner, M. H., Gronenborn, A. M. & Clore, G. M.(1996). Intercalation, DNA kinking, and the controlof transcription. Science, 271, 778-784.

West, S. C. (1992). Enzymes and molecular mechanismsof genetic recombination. Annu. Rev. Biochem. 61,603-640.

Wilkins, M. H. F., Gosling, R. G. & Seeds, W. E. (1951).Nucleic acid: an extensible molecule?. Nature, 167,759-760.

Yang, L. & Pettitt, B. M. (1996). B to A transition ofDNA on the nanosecond time scale. J. Phys. Chem.100, 2564-2566.

Yathindra, N. & Sundaralingam, M. (1976). Analysis ofthe possible helical structures of nucleic acids andpolynucleotides. Application of (n-h) plots. Nucl.Acids Res. 3, 729-747.

Young, M. A., Jayaram, B. & Beveridge, D. L. (1997).Intrusion of counterions into the spine of hydrationin the minor groove of B-DNA: fractional occu-pancy of electronegative pockets. J. Am. Chem. Soc.119, 59-69.

Zhou, X. & Adzuma, K. (1997). DNA strand exchangemediated by the Escherichia coli RecA proteininitiates in the minor groove of double-strandedDNA. Biochemistry, 36, 4650-4661.

Zhurkin, V. B., Lysov, Y. P. & Ivanov, V. I. (1978).Different families of double-stranded conformationsof DNA revealed by computer calculations. Biopoly-mers, 17, 377-412.

Zhurkin, V. B., Lysov, Y. P. & Ivanov, V. I. (1979). Ani-sotropic ¯exibility of DNA and the nucleosomalstructure. Nucl. Acids Res. 6, 1081-1096.

Zhurkin, V. B., Poltev, V. I. & Florent'ev, V. L. (1981).Atom-atom potential functions for conformationalcalculations of nucleic acid. Mol. Biol. USSR (EnglishEdit.), 14, 882-895.

Zhurkin, V. B., Lysov, Y. P., Florentiev, V. L. & Ivanov,V. I. (1982). Torsional ¯exibility of B-DNA asrevealed by conformational analysis. Nucl. AcidsRes. 10, 1811-1830.

Zhurkin, V. B., Ulyanov, N. B., Gorin, A. A. & Jernigan,R. L. (1991). Static and statistical bending of DNAevaluated by Monte Carlo simulations. Proc. NatlAcad. Sci., USA, 88, 7046-7050.

Zhurkin, V. B., Raghunathan, G., Ulyanov, N. B.,Camerini-Otero, R. D. & Jernigan, R. L. (1994a). Aparallel DNA triplex as a model for the intermedi-ate in homologous recombination. J. Mol. Biol. 239,181-200.

Zhurkin, V. B., Raghunathan, G., Ulyanov, N. B.,Camerini-Otero, R. D. & Jernigan, R. L. (1994b).Recombination triple helix, R-form DNA. A stereo-chemical model for recognition and strandexchange. In Structural Biology: The State of the Art(Sarma, R. H. & Sarma, M. H., eds), vol. 2, pp. 43-66, Adenine Press, Schenectady, NY.

Note added in proof

Two studies published after this paper wasaccepted for publication are consistent with themechanism of information readout suggested here.First, model building based on the crystal structureof T7 RNA polymerase with promoter DNA(Cheetham, G. M. T., Jeruzalmi, D. & Steitz, T. A.(1999). Nature 399, 80-83.) implies that the DNAtemplate strand is sharply kinked at the active cen-ter of polymerase. Second, analysis of the Mn(II)binding to ribonucleotides (Garland, C. S., Tarien,E., Nirmala, R., Clark, P., Rifkind, J. & Eichhorn, G.L. (1999). Biochemistry 38, 3421-3425.) suggests thatboth the RNA and DNA strands are noticeablykinked at the elongation site of E. coli RNA poly-merase. Importantly, this kink increases the dis-tance between adjacent bases along the DNA chainand thereby diminishes the probability of misincor-poration in the RNA chain.

Edited by I. Tinoco

(Received 27 October 1998; received in revised form 15 April 1999; accepted 15 April 1999)