New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH...

38
1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella moellendorffii Amy M. James 1,2 , Achala S. Jayasena 1,2, , Jingjing Zhang 1,2 , Oliver Berkowitz 3 , David Secco 2 , Gavin J. Knott 1 , James Whelan 3 , Charles S. Bond 1 , Joshua S. Mylne 1,2* The University of Western Australia, 1 School of Molecular Sciences & 2 The ARC Centre of Excellence in Plant Energy Biology, 35 Stirling Highway, Crawley, Perth 6009, Australia 3 La Trobe University, School of Life Sciences, ARC Centre of Excellence in Plant Energy Biology, Melbourne 3086, Australia * Corresponding Author: [email protected] Short title: Exploring Bowman-Birk Inhibitor Evolution One-sentence summary: A sequence from spikemoss uncovers the true extent of the Bowman-Birk Inhibitor family to be wider than cereals and legumes, solving a long- standing mystery about their origins. The author responsible for distribution of materials integral to the findings presented in this article in accordance with the policy described in the Instructions for Authors (www.plantcell.org) is: Joshua S. Mylne (to [email protected]) ABSTRACT Bowman-Birk Inhibitors (BBIs) are a well-known family of plant protease inhibitors first described 70 years ago. BBIs are known only in the legume (Fabaceae) and cereal (Poaceae) families, but peptides that mimic their trypsin-inhibitory loops exist in sunflowers and frogs. The disparate biosynthetic origins and distant phylogenetic distribution implies these loops evolved independently, but their structural similarity suggests a common ancestor. Targeted bioinformatic searches for the BBI inhibitory loop discovered highly divergent BBI-like sequences in the seedless, vascular spikemoss Selaginella moellendorffii. Using de novo transcriptomics, we confirmed expression of five transcripts in S. moellendorffii whose encoded proteins share homology with BBI inhibitory loops. The most highly expressed, BBI3, encodes a protein that inhibits trypsin. We needed to mutate two lysine residues to abolish trypsin inhibition, suggesting BBI3’s mechanism of double-headed inhibition is shared with BBIs from angiosperms. As Selaginella belongs to the lycopod plant lineage, which diverged ~200-230 million years before the common ancestor of angiosperms, its BBI-like proteins imply there was a common ancestor for legume and cereal BBIs. Indeed, we discovered BBI sequences in six angiosperm families outside the Fabaceae and Poaceae. These findings provide the evolutionary missing links between the well-known legume and cereal BBI gene families. Plant Cell Advance Publication. Published on March 14, 2017, doi:10.1105/tpc.16.00831 ©2017 American Society of Plant Biologists. All Rights Reserved

Transcript of New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH...

Page 1: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

1

RESEARCH ARTICLE

Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginellamoellendorffii

Amy M. James1,2, Achala S. Jayasena1,2,, Jingjing Zhang1,2, Oliver Berkowitz3, David Secco2, Gavin J. Knott1, James Whelan3, Charles S. Bond1, Joshua S. Mylne1,2*

The University of Western Australia, 1 School of Molecular Sciences & 2 The ARC Centre of Excellence in Plant Energy Biology, 35 Stirling Highway, Crawley, Perth 6009, Australia 3La Trobe University, School of Life Sciences, ARC Centre of Excellence in Plant Energy Biology, Melbourne 3086, Australia * Corresponding Author: [email protected]

Short title: Exploring Bowman-Birk Inhibitor Evolution

One-sentence summary: A sequence from spikemoss uncovers the true extent of the Bowman-Birk Inhibitor family to be wider than cereals and legumes, solving a long-standing mystery about their origins.

The author responsible for distribution of materials integral to the findings presented in this article in accordance with the policy described in the Instructions for Authors (www.plantcell.org) is: Joshua S. Mylne (to [email protected])

ABSTRACT

Bowman-Birk Inhibitors (BBIs) are a well-known family of plant protease inhibitors first described 70 years ago. BBIs are known only in the legume (Fabaceae) and cereal (Poaceae) families, but peptides that mimic their trypsin-inhibitory loops exist in sunflowers and frogs. The disparate biosynthetic origins and distant phylogenetic distribution implies these loops evolved independently, but their structural similarity suggests a common ancestor. Targeted bioinformatic searches for the BBI inhibitory loop discovered highly divergent BBI-like sequences in the seedless, vascular spikemoss Selaginella moellendorffii. Using de novo transcriptomics, we confirmed expression of five transcripts in S. moellendorffii whose encoded proteins share homology with BBI inhibitory loops. The most highly expressed, BBI3, encodes a protein that inhibits trypsin. We needed to mutate two lysine residues to abolish trypsin inhibition, suggesting BBI3’s mechanism of double-headed inhibition is shared with BBIs from angiosperms. As Selaginella belongs to the lycopod plant lineage, which diverged ~200-230 million years before the common ancestor of angiosperms, its BBI-like proteins imply there was a common ancestor for legume and cereal BBIs. Indeed, we discovered BBI sequences in six angiosperm families outside the Fabaceae and Poaceae. These findings provide the evolutionary missing links between the well-known legume and cereal BBI gene families.

Plant Cell Advance Publication. Published on March 14, 2017, doi:10.1105/tpc.16.00831

©2017 American Society of Plant Biologists. All Rights Reserved

Page 2: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

2

INTRODUCTION 1

Bowman-Birk Inhibitors (BBIs) were first described 70 years ago and were the subject of 2

classical experiments in biochemistry (Bowman, 1946). BBIs are one of the many 3

different families of plant protease inhibitors reviewed by Habib & Fazili (2007), who 4

used the structural classifications provided by the MEROPS peptidase database 5

(Rawlings et al., 2016). BBIs originally characterized from soybean (Glycine max) are 6

dual inhibitors of both trypsin and chymotrypsin (Birk, 1961; Birk et al., 1963); this 7

duality was later found to be the result of two separate inhibitory loops, each formed by 8

a disulfide bridge (Odani and Ikenaka, 1973). This presence of two spatially separated 9

loops is referred to as a ‘double-headed’ structure (Figure 1). This double-headed 10

structure is predicted to have arisen from an internal gene duplication of a single-11

headed ancestral inhibitor (Mello et al., 2003). The inhibitory loops are bound by 12

proteases in a substrate-like conformation, a standard mechanism for protease 13

inhibition. A double-headed BBI can bind two proteases simultaneously (Song et al., 14

1999). BBIs also typically possess many internal disulfide bonds between conserved 15

Cys residues that provide structural stability and maintain the active conformation of the 16

inhibitory loops. Highlighting this, a recent study showed that mutating a single Cys 17

residue of a typical ‘double-headed’ BBI abolished the activity attributed to both loops of 18

the mutated BBI in seed extracts (Clemente et al., 2015). 19

Since their discovery, BBIs have been shown to be widely distributed in the legume 20

(Fabaceae) and cereal (Poaceae) families (Mello et al., 2003). Although these two plant 21

groups are distantly related (Figure 1b), the high sequence and structural similarity 22

between their BBIs suggests they share a common ancestor (Mello et al., 2003). Cereal 23

BBIs lack the double inhibitory-loop structure common in legume BBIs. This is due to a 24

lack of inhibitory-loop-forming Cys residues in the region where the second loop is 25

located in legumes, resulting in the second loop becoming non-functional (Park et al., 26

2004; Qi et al., 2005). Cereal BBIs differ in size from 8 kDa to 20 kDa, whereas legume 27

BBIs are consistently around 8 kDa in size. The wide size range for cereal BBIs was 28

caused by internal gene duplication events. Although cereal BBIs lack the Cys residues 29

required for a functional second inhibitory loop, the internal duplication events have 30

Page 3: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

3

resulted in multiple inhibitory loops for a single protein (Prakash et al., 1996; Song et al., 31

1999) (Figure 1). 32

The inhibitory loops of BBIs can function independently from the rest of the BBI protein, 33

as demonstrated by analysis of synthetic peptides made for the nine-residue loop alone 34

(Nishino et al., 1977). These small, synthetic peptides were also used to determine the 35

essential residues necessary for specificity to trypsin or chymotrypsin (Terada et al., 36

1978). The sequences of the synthetic nonapeptides analysed were CTKSNPPQC and 37

CALSTPAQC, which correspond to the soybean BBI trypsin inhibitory and chymotrypsin 38

inhibitory loops, respectively (Terada et al., 1978). By mutating the P1 residue 39

(underlined; the first amino acid in the N-terminal direction from the cleaved bond 40

following the notation described by (Schechter and Berger, 1967)), they showed that 41

Lys or Arg at the P1 position confers specificity for trypsin, whereas a Tyr or Leu will 42

confer specificity for chymotrypsin (Terada et al., 1978). 43

Since the first studies on synthetic BBI-loop peptides, every residue in the nonapeptide 44

except the P5' residue, which shows the weakest conservation, has been tested for its 45

functional importance (McBride et al., 2002). Apart from the obviously critical P1 residue 46

and loop forming Cys residues (P3, P6'), the remaining residues also influence BBI 47

structure and activity. For example, the hydroxyl and methyl group of the conserved Thr 48

at position P2 directly interacts with trypsin to stabilize the active site, resulting in 49

reduced hydrolysis (McBride et al., 1998). An Ile at P2' is optimal for inhibition of trypsin 50

(Gariani et al., 1999). The P3' and P4' Pro residues maintain a polyproline II 51

conformation important for structure (Park et al., 2004). Furthermore, the P3' Pro 52

residue must be in a cis conformation for correct positioning of the P1 residue (Brauer et 53

al., 2002). The P1' position is a highly conserved Ser, however, substitution with Ala 54

only slightly reduced BBI inhibitory activity (Brauer and Leatherbarrow, 2003). The low 55

tolerance for substitution in these studies illustrates that the high conservation observed 56

within BBI inhibitory loops is required for the optimal inhibition of trypsin. 57

Two decades after the first analysis of synthetic BBI inhibitory loops, a similar and 58

naturally occurring peptide was discovered in sunflower seeds (Luckett et al., 1999). 59

Sunflower Trypsin Inhibitor-1 (SFTI-1) shares the same function, a nearly identical 60

Page 4: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

4

amino acid sequence, and three-dimensional structural conformation with the trypsin 61

inhibitory loop of BBIs; however, SFTI-1 is composed of only 14 residues and is a head-62

to-tail macrocycle (Figure 1). Originally thought to be the smallest member of the BBI 63

family, SFTI-1 has an altogether different biosynthetic and evolutionary origin. SFTI-1 64

emerged from a protein that is also a precursor for a napin-type, seed storage albumin 65

(Mylne et al., 2011). Evidence suggests SFTI-1 evolved de novo, in a stepwise manner 66

within a standard preproalbumin gene, independently of BBIs (Elliott et al., 2014). A less 67

potent, but similar, inhibitory loop also exists in ORB proteins from frogs (Li et al., 2007), 68

implying BBI-like inhibitory loops have evolved independently multiple times for this 69

function. 70

The existence of a highly conserved sequence in disparate proteins and protein 71

contexts within distinct phylogenetic groups suggests the BBI inhibitory loop is a product 72

of convergent evolution. Here we investigated whether this trypsin inhibitory loop has 73

appeared in additional protein contexts. We searched databases using the query 74

sequence CTKSIPPIC, the sequence shared by SFTI-1 and BBIs, and found an 75

identical match to a predicted protein from the ancient seedless, vascular plant 76

Selaginella moellendorffii. S. moellendorffii belongs to the lycopod lineage, which is 77

estimated to have diverged 400 million years ago, an estimated 200-230 million years 78

before the evolution of angiosperms (Banks, 2009; Bell et al., 2010). Closer analysis 79

revealed that rather than being a new protein context, the Selaginella BBI-like protein 80

represents an ancient member of the BBI protein family, as it shares conserved Cys 81

residues outside the conserved inhibitory loops. Furthermore, a recombinant S. 82

moellendorffii BBI inhibited trypsin, and mutating predicted P1 residues removed its 83

trypsin inhibitory activity, suggesting it uses the same mechanism of inhibition as BBIs 84

in seed plants. Consistent with the hypothesis that the Selaginella BBI-like proteins 85

share a common ancestor with legume and cereal BBIs, we discovered BBIs in six 86

angiosperm families outside the Fabaceae and Poaceae plant families. 87

Page 5: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

5

RESULTS 88

Identification of BBI-like genes in S. moellendorffii 89

Between SFTI-1 and BBIs, only the inhibitory loop is shared. As this loop is sufficient for 90

inhibitory activity, we performed BLAST searches using the sequence CTKSIPPIC as 91

bait to identify other proteins that possess this functional motif. We identified two 92

sequences derived from the S. moellendorffii genome that contained highly similar 93

motifs: BBI1 (NCBI accession: XM_002980953) and BBI2 (NCBI accession: 94

XM_002980955). 95

The proteins encoded by BBI1 and BBI2 showed conservation within the inhibitory 96

motifs, but less sequence similarity outside of these motifs (Figure 1). The translated 97

sequences possessed a high number of Cys residues (13.7% of the complete primary 98

amino acid sequence), which is a characteristic of BBIs. The top plant BLASTp hit in 99

NCBI (NCBI accession: ADV40041) was a predicted BBI from the legume Lathyrus 100

sativus that shares 25.4% identity and 44.9% similarity with BBI1, as calculated in a 101

global alignment. Similarly, the most similar BBI sequence to BBI2 by BLASTp (NCBI 102

accession: NP_001236539) was a BBI from the legume Glycine max that shares 29.7% 103

identity and 50.7% similarity in a global alignment. 104

A S. moellendorffii transcriptome identifies five BBI-like sequences 105

Two BBI-like sequences were found through mining the published S. moellendorffii 106

genome. To determine if these genes are expressed and to determine if BBI-like 107

sequences are conserved within Selaginella, the transcriptomes of S. moellendorffii, 108

S. kraussiana, and S. martensii were assembled from RNA-seq data. Searches within 109

the S. martensii and S. kraussiana transcriptomes found no BBI-like sequences. A 110

search of publicly available RNA-seq data for S. stauntoniana identified a sequence with 111

two CTKSIPPIC-like motifs (CTMSYPPSC and CTKAPPNC). This suggests there are 112

BBI-like sequences present in Selaginella species other than S. moellendorffii, despite 113

being undetectable in S. kraussiana and S. martensii RNA-seq datasets. 114

In total, five BBI-like genes were identified as being expressed in S. moellendorffii 115

(Figure 2; Supplemental Figure 1). All five genes were sequenced from both cDNA 116

Page 6: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

6

and genomic DNA and further validated by mapping RNA-seq reads to the complete 117

open reading frames. 118

Using SignalP 4.1 (Petersen et al., 2011), we found that all encoded BBIs except BBI4 119

had a predicted endoplasmic reticulum (ER) localization signal (Figure 2). BBI4 is 39 120

residues shorter and shows the greatest sequence divergence (38-43% similarity) from 121

the other four BBI-like sequences, which share >89% similarity. The shorter length was 122

verified by 5' RACE. Altogether, these data imply that BBI4, although expressed, 123

probably does not code for a functional BBI and is a pseudo-gene. 124

S. moellendorffii BBI3 is a functional trypsin inhibitor 125

To test if the Selaginella proteins share a similar function with characterized BBIs, we 126

produced recombinant BBI3 in E. coli and purified it for trypsin inhibition assays 127

(Supplemental Figure 2). BBI3 was chosen because it had the highest number of 128

reads mapped to its open reading frame (ORF), suggesting it is the most highly 129

expressed of the five BBI transcripts (Table 1). Trypsin inhibitory ability was determined 130

by end-point analysis after 45 minutes, assuming that the reaction had reached 131

equilibrium at this stage. Therefore, the results will not indicate any intermediate levels 132

of inhibition due to the inhibitor being in molar excess. Comparing BBI3 to soybean BBI 133

in trypsin inhibition assays showed that both were effective trypsin inhibitors, reducing 134

activity by 99.0% and 99.5%, respectively (Supplemental Figure 3). A protein model of 135

BBI3 predicted a double-headed structure similar to that of legume BBIs (Figure 2). 136

Based on the protein model generated, Lys67 and Lys105 were identified as the 137

putative P1 Lys residues (Figure 2). However, a third region within the BBI3 sequence 138

shares similarity with the BBI inhibitory motif and included a third potential inhibitory P1 139

residue, Lys77. In order to provide support to our protein model and to test which 140

predicted P1 residues contribute to the inhibition by BBI3, we generated a series of 141

BBI3 mutants (Figure 3). Mutating Lys67, Lys77 or Lys105 to Ala in isolation had no 142

effect on the ability of BBI3 to inhibit trypsin. Similarly, a double mutant containing 143

Lys77Ala and Lys105Ala did not reduce inhibition of trypsin activity compared to the 144

wildtype control. The inhibitory activity of the Lys67Ala/Lys105Ala BBI3 double mutant, 145

however, was completely abolished (Figure 3). Furthermore, the triple mutant showed 146

Page 7: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

7

no difference in inhibition compared to the Lys67Ala/Lys105Ala double mutant (Figure 147

3). Taken together, these data suggest that a combination of Lys67Ala and Lys105Ala 148

mutations are sufficient to completely abolish inhibitory activity to a level equivalent to 149

that of a negative control and that Lys77 is not an inhibitory residue. 150

BBIs are widely distributed in the plant kingdom 151

The discovery of BBI-like sequences in S. moellendorffii suggests there was an ancient 152

common ancestor for BBIs in legumes and cereals. If so, BBIs should be more widely 153

distributed in the plant kingdom. Comprehensive BLAST searches in published plant 154

genomes identified BBI-like sequences in plant lineages outside legumes and cereals. 155

BBI-like sequences were not identified in the genomes of the two bryophyte species 156

searched. At the time the BLAST searches were completed, there were no published 157

genomes for monilophytes (ferns and horsetails), which diverged after the evolution of 158

lycopods and prior to the evolution of angiosperms. We did not find any sequence in the 159

genome of loblolly pine (Pinus taeda), the only gymnosperm genome searched. All plant 160

genomes searched are included in Figure 4 along with the number of BBIs identified in 161

each genome. 162

To provide experimental support of the publicly available data, we chose to confirm the 163

presence of three of the BBI-like sequences from banana (Musa acuminata), which 164

does not belong to the legume or cereal families. Sequences that matched the publicly 165

available sequences were cloned from banana leaf genomic DNA (Figure 5). This 166

supports the bioinformatics finding that angiosperm-type BBIs are found in plant 167

lineages outside legumes and cereals. As we did not identify BBI-like sequences in the 168

other Selaginella species, S. kraussiana and S. martensii, we assembled a 169

transcriptome of an available lycopod, Isoetes drummondii, and found a BBI-like 170

sequence. The I. drummondii BBI-like sequence was verified by cloning and sequencing 171

the full-length cDNA (Figure 5). Therefore, despite not being expressed in 172

S. kraussiana and S. martensii, BBIs likely are present in the common ancestor of 173

Selaginella and Isoetes. 174

The number of BBIs identified in each plant genome searched was mapped onto a 175

phylogenetic reconstruction derived from trees from Cogepedia and Angiosperm 176

Page 8: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

8

Phylogeny Group III (Figure 4) 177

(http://genomevolution.org/wiki/index.php/Sequenced_plant_genomes;(Angiosperm 178

Phylogeny Group, 2016). BBIs appear to be maintained throughout monocot evolution, 179

with examples of BBI-like sequences found in all monocot species genomes searched 180

with the exception of duckweed (Spirodela polyrhiza). In dicots, the presence and 181

absence of genes indicates a complex pattern of gene loss. The identification of a BBI-182

like sequence in the basal eudicot, the Colorado blue columbine (Aquilegia coerulea), 183

suggests that dicot BBIs share a common ancestor followed by several independent 184

loss events. BBI-like sequences were not found in any asterid species searched, 185

suggesting there was a loss event prior to their divergence. This is true for Malphigiales 186

and rosid clade II as well. The pattern of loss events in rosid clade II is more complex, 187

with BBI-like sequences found in Fabaceae and Rhamnaceae but apparently lost in all 188

other rosid clade II species searched. 189

All dicot sequences analysed had the typical double-headed BBI motif with the 190

exception of one sequence from jujube (Ziziphus jujuba) (XP_015882735.1), which has 191

lost two inhibitory loop forming Cys residues and another that appears to have lost one 192

of the two loop forming residues (XP_015882689.1). As predicted, most monocot 193

sequences lack the conserved Cys residues forming the second inhibitory loop. Three 194

sequences from the monocot species, Musa acuminata, have two inhibitory loops 195

similar to dicot sequences, two of which were confirmed by sequences as well as a third 196

that lacks the two Cys residues similarly to other monocot sequences (Figure 5). This 197

suggests these sequences could represent the ancestral form of monocot BBIs prior to 198

the predicted loss of the two Cys residues of the second BBI loop. One of the BBI-like 199

sequences from Amborella trichopoda is predicted by sequence homology to possess 200

the same double-headed motif as dicots, suggesting this is likely the ancestral structure 201

of angiosperm BBIs. However, four of the five A. trichopoda BBI-like sequences have an 202

additional Cys residue within their first BBI-loop, suggesting they may no longer function 203

as a protease inhibitors and have potentially evolved a new function. 204

BBI-like sequences identified in angiosperms as well as those found in lycopods were 205

used in phylogenetic analysis (Figure 6). The phylogenetic tree of full-length sequences 206

Page 9: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

9

was divided into six clades based on the branching pattern of the tree. The lycopod 207

clade forms a species-specific clade with 100% bootstrap support, strongly suggesting 208

that they share a single common ancestor. All angiosperm BBIs are predicted to have 209

evolved from a single ancestral BBI sequence as they together form a single clade 210

separate from lycopod BBIs. The dicot sequences all fall within a single clade. Aquilegia 211

coerulea and Ziziphus jujuba each form species-specific nodes with strong (≥88.6%) 212

bootstrap support, suggesting the sequences belonging to these nodes probably 213

duplicated following speciation. A duplication event occurring prior to the divergence of 214

Glycine max and Phaseolus vulgaris likely resulted in two paralagous sequences, with 215

one paralog being lost in Phaseolus vulgaris, as all Fabaceae sequences fall into two 216

distinct bootstrap supported (≥63.3%) nodes. 217

Nearly all monocot sequences fall within three clades designated the monocot clade, 218

Poaceae clade I, and Poaceae clade II. The nodes leading to these three clades have 219

weak bootstrap support and therefore conclusions on the relationships between these 220

groups cannot be made. Both Poaceae clades contain nodes with bootstrap support 221

that contain sequences from more than one species, suggesting potential independent 222

duplication events resulting in multiple paralogous sequences within Poaceae. The 223

African oil palm (Elaeis guineensis) and pineapple (Ananas comosus), both sister to 224

Poaceae, each form species-specific nodes with bootstrap support (≥73.5%) within the 225

monocot clade. This suggests that the BBI sequences belonging to each species 226

evolved from a single common ancestral sequence, so all monocot sequences share a 227

common ancestral sequence. 228

Musa acuminata BBI-like sequences showed the highest sequence divergence from 229

other BBIs, with the five sequences analysed falling into three distinct nodes. 230

Potentially, species-specific duplication events occurred followed by sequence 231

divergence. Only one M. acuminata sequence falls into the monocot clade and the other 232

sequences form two distinct nodes from other monocot sequences. Sequences from 233

other species do fall within these nodes, but with very low bootstrap support. Possibly, 234

duplication events leading to the divergent sequences are independent of other 235

monocots and could therefore be a result of the three whole genome duplication events 236

Page 10: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

10

in the Musa lineage (D'Hont et al., 2012). Alternatively, these divergent M. acuminata 237

sequences could be the result of a duplication event in the ancestor of monocots and 238

therefore could be paralogs of sequences found in the monocot clade and Poaceae 239

clade II. Any definitive conclusions cannot be made due to the lack of bootstrap support 240

for the branches leading to these sequences. 241

DISCUSSION 242

BBIs were first studied 70 years ago and since then their structure and function have 243

been well defined (Bowman, 1946). Here we report five BBI genes from the seedless, 244

vascular plant S. moellendorffii. The genes identified are highly divergent from those in 245

legumes and cereals but possess the conserved trypsin inhibitory motif. They also 246

possess a conserved primary protein architecture, sharing a conserved pattern of Cys 247

residues around the inhibitory loop motifs, and all but one S. moellendorffii BBI 248

possesses a predicted ER signal. A predicted structure of a S. moellendorffii BBI 249

modelled against other known BBIs suggests a similar double-headed structure. 250

Together, these results support the notion that the BBI-like genes identified here are 251

related to BBIs in angiosperms. Selaginella represents the most ancient extant lineage 252

of vascular plants and therefore the identification of these BBIs suggests an ancient 253

ancestral origin of BBIs in angiosperms. 254

We have determined that at least one S. moellendorffii BBI protein, BBI3, is a functional 255

trypsin inhibitor. The structure of BBI3 is predicted to be double-headed based on 256

homology-based modelling. However, due to high sequence divergence outside of the 257

predicted inhibitory sequence motifs and the presence of a third potential inhibitory Lys 258

residue, we further tested the predicted double-headed structure by analysing modified 259

proteins. By mutagenesis, we demonstrated that Ala substitutions at both K67 and K105 260

were required to abolish BBI3 inhibitor activity, suggesting that there are two 261

independent sites that bind to and inhibit trypsin. Consistent with this, single Ala 262

substitutions at K67 and K105 did not affect BBI3 inhibition. The residues K67 and K105 263

reside within the short BBI3 sequences that displayed high similarity to the angiosperm 264

BBI inhibitory motifs and are predicted to share a similar loop structure from our protein 265

model. Mutation of a third Lys residue that also fell within a short sequence that shared 266

Page 11: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

11

similarity to BBI inhibitory motifs, but was not predicted to be part of an inhibitory loop 267

from our protein model, had no effect on BBI3 inhibition. Studies of short peptide mimics 268

of the BBI inhibitory loop showed similar results when the P1 residue was mutated 269

(Terada et al., 1978; Domingo et al., 1995). 270

Since BBIs had thus far only been described in legumes and cereals (Mello et al., 2003; 271

Qi et al., 2005), and these two plant families are distantly related, we reasoned that if 272

the BBIs in these two families share an ancient ancestral origin, they are likely more 273

widely distributed in angiosperms. We demonstrated that BBIs are indeed found in 274

angiosperm lineages outside the legumes and grasses. Supporting a hypothesis for a 275

common ancestral origin for angiosperm BBIs, we identified BBI-like sequences in what 276

is considered to be the most basal living representative of angiosperms, Amborella 277

trichocarpa (Soltis et al., 2008). 278

BBIs appear to be widespread throughout the monocot lineage, being absent from only 279

one of the genomes searched. In the evolutionary scheme proposed by Mello et 280

al.(2003), the loss of two Cys residues in the monocot lineage results in the loss of 281

functionality of the second inhibitory loop and the double-headed structure characteristic 282

of dicot BBIs. Our findings support this model, as we observed a loss of Cys residues in 283

monocot BBI-like sequences; however, we also demonstrated that the loss of the two 284

inhibitory loop-forming Cys residues potentially occurred following the divergence of 285

monocots from dicots, as we found double-headed BBI-like sequences in the monocot 286

M. acuminata. 287

Given our identification of the Selaginella BBIs and the now apparent widespread 288

distribution in angiosperms, we expected to find BBI-like sequences in gymnosperms, 289

which diverged after lycopods, but prior to the evolution of angiosperms. However, 290

searches failed to identify any BBI-like sequences in P. taeda. This could be due to the 291

lack of genetic data for other gymnosperm species, or potentially the BBIs might have 292

been lost during evolution. Alternatively, BBI-like proteins might have evolved 293

independently in Selaginella and are not related to those found in angiosperms. 294

However, this is unlikely given the high sequence similarity of the inhibitory motif and 295

shared inhibitory function. 296

Page 12: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

12

BBIs appear to have been lost in several angiosperm lineages. The main mechanisms 297

for gene loss, as reviewed by Albalat and Canestro (2016), are through unequal 298

crossing over during meiosis, the mobilization of a transposable element leading to 299

physical loss of the gene from the genome, or through the introduction of iterative 300

mutations resulting in pseudogenization or new functionality. Examples of BBI gene loss 301

have been observed in pea (Pisum sativum) germplasm collections, with evidence of 302

partial BBI sequences resulting from premature stop codons (Clemente et al., 2015). 303

However, we failed to identify any sequences, including partial sequences, showing 304

similarity to BBIs in whole plant lineages including rosid class II and the asterids, 305

suggesting gene loss occurred by complete gene loss or through the introduction of 306

mutations resulting in the sequence becoming unrecognizable as a BBI. The loss of 307

BBIs is likely prevalent through angiosperm evolution as a result of functional 308

redundancy, as there are several families of protease inhibitors in plants that could 309

compensate for the loss of BBIs (Habib and Fazili, 2007). For example, the serpin family 310

of protease inhibitors is widespread throughout angiosperms, including species lacking 311

BBIs such as Arabidopsis and cucumber (Roberts and Hejgaard, 2008). Species-312

specific gene loss has been observed for other protein-based plant defences such as 313

polyphenol oxidases (Tran et al., 2012). Gene loss is common and occurs more 314

frequently for genes coding for non-essential proteins that have minor influences on 315

plant fitness (Hirsh and Fraser, 2001; Krylov et al., 2003). 316

The evolutionary split between Isoetes and Selaginella is predicted to have occurred 317

370 million years ago (Arrigo et al., 2013). Therefore, the common ancestor of the BBI-318

like sequences we have found in Isoetes and Selaginella is at least this old and pre-319

dates the evolution of angiosperms, which are predicted to have evolved between 167-320

199 million years ago (Bell et al., 2010). Both the Isoetes and Selaginella BBI-like 321

sequences have a double-headed motif that is observed in legume BBIs and therefore it 322

can be predicted that the ancestor of both lycopod and angiosperm BBIs had a double-323

headed motif. This is supported by the identification of BBI-like sequences in the basal 324

angiosperm species A. trichopoda. Although the double-headed motif likely arose from 325

an internal gene duplication of a single inhibitory loop, we found no current day 326

examples of a single-headed BBI other than those found in cereals, whose sequences 327

Page 13: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

13

suggest they lost two Cys residues resulting in the second loop becoming non-328

functional. 329

In conclusion, we have shown that the highly conserved BBI inhibitory motif has been 330

maintained through vascular plant evolution since at least prior to the divergence of 331

Isoetes and Selaginella. This supports the many studies on synthetic peptide mimics of 332

this inhibitory loop showing that the CT[K/R]SIPPXC motif is optimal for trypsin 333

inhibition. Having identified BBIs throughout the angiosperm lineage, it is clear that BBIs 334

in angiosperms share a common ancestor. We propose that the lycopod BBIs identified 335

here share an ancient ancestral origin with the well-characterized BBIs in angiosperms 336

given the sequence similarity at the conserved loops, a shared protein architecture, and 337

the common mode of action. 338

METHODS 339

BLAST searches using the CTKSIPPIC motif as bait 340

To determine whether the sequence CTKSIPPIC existed in different protein contexts 341

other than SFTI-1 and the legume and cereal BBIs, we performed a BLASTp analysis in 342

the BLAST portal of Phytozome v11.0 excluding the genus Helianthus (sunflower SFTI-343

1) and Fabaceae or Poaceae families (well-known BBI-containing families). The top 344

matches in this analysis were two unknown proteins from Selaginella moellendorffii 345

(BBI1 GenBank: XP_002980999, BBI2 GenBank: XP_002981001). To assess the 346

similarity to other BBIs, the BBI-like protein sequences were used as bait sequences in 347

a BLASTp search in the NCBI non-redundant protein database restricted to Fabaceae 348

and Poaceae sequences. The sequence similarity between the S. moellendorffii BBI-like 349

genes and their corresponding BLASTp hits were calculated in a global alignment using 350

the default settings of the ExPASy LALIGN program. 351

Plant tissue 352

Cuttings were taken from S. kraussiana and S. martensii growing at Hilltop Nursery 353

(Menzies Creek, Victoria, Australia), frozen on dry ice, and shipped to Western Australia 354

(entry into WA under Quarantine Inspector’s Direction No. KH090713). Cuttings were 355

taken from S. moellendorffii (Cat. #3228, Plant Delights Nursery Inc., Raleigh, NC USA) 356

Page 14: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

14

growing at the Joint BioEnergy Institute. Plant Delights Nursery was the source of the 357

S. moellendorffii used to generate the published genome sequence (Banks et al., 2011). 358

The S. moellendorffii cuttings were preserved in RNAlater RNA Stabilization Reagent 359

(Ambion) and shipped on ice packs to Western Australia. 360

RNA extraction from S. moellendorffii, S. kraussiana, and S. martensii 361

Tissue from each species was ground with glass beads to a fine powder under liquid 362

nitrogen. Some of the thicker S. martensii stems were removed during the grinding. 363

Total RNA from approximately 0.3 mL of frozen tissue powder was extracted as 364

previously described (Mylne et al., 2012) using phenol:chloroform and a 2 M lithium 365

chloride RNA-selective precipitation. Contaminating genomic DNA was removed by 366

digesting total RNA with DNase and subsequent purification with a NucleoSpin RNA 367

Clean-up kit (Macherey-Nagel). The resulting total RNA was analysed on a NanoDrop 368

Spectrophotometer for its A260/280 and A260/230 ratios, and all three samples 369

displayed distinct banding when run on a 1% agarose gel, implying the mRNA was 370

intact. 371

Transcriptome sequencing, assembly, mining data for BBI-like sequences 372

Sequencing libraries were generated using the TruSeq® Stranded Total RNA LT with 373

Ribo-Zero Plant kit (Illumina) with 300-1000 ng of purified total RNA according to the 374

manufacturer’s instructions. Sequencing was then performed on an Illumina HiSeq 1500 375

instrument as 101 bp single read runs or 2 x 101 bp paired-end read runs. 376

The de novo transcriptome assemblies were done as described (Jayasena et al., 2014). 377

Raw reads were inspected for quality using FastQC 378

(www.bioinformatics.babraham.ac.uk/projects/fastqc/). Quality trimming and filtering 379

was done using the FASTX toolkit (hannonlab.cshl.edu/fastx_toolkit/). Raw reads were 380

trimmed to maintain a phred score of 30, which sets the base call accuracy to be 99.9%, 381

and the minimum length after trimming was set at 50. Trimmed reads were filtered with 382

a quality threshold of 22 and the percentage of bases that match the quality threshold 383

was set to 90. 384

Page 15: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

15

De novo transcriptomes were assembled using CLC Genomics Workbench 7.0 385

(CLCBio). RNA from S. moellendorffii was sequenced only as single reads. S. martensii 386

and S. kraussiana were sequenced as paired-end reads. Each dataset was assembled 387

initially with the default CLC settings (i.e. word size: 23, bubble size: 50) and 388

subsequently with three different word sizes (i.e. 30, 40, and 60), which determines the 389

size the reads are fragmented into before being assembled, keeping all other 390

parameters at default settings. In each case, reads were mapped back to contigs 391

keeping the mapping parameters as follows: mismatch cost: 2, insertion cost: 3, deletion 392

cost: 3, length fraction: 0.8, similarity fraction: 0.8, update contigs: yes. Relevant 393

statistics are presented in Table 2. In this way, four transcriptomes for each of the three 394

species were assembled. 395

Each assembly was queried using tBLASTn for the two BBI-like genes identified from 396

the S. moellendorffii genome. 397

The sequence referred to from S. stauntoniana was found within the NCBI short read 398

archive (Run ERR364347) with the specific sequence ID within the dataset being 399

ERR364347.14864137.1. The pertinent region of the translated sequence encoded 400

TVCTMSYPPSCFCTKAPPNCGVGSSCCSD, in which the CTKSIPPIC-like motifs are 401

underlined. 402

Cloning of S. moellendorffii BBI-like sequences 403

5' and 3' RACE-ready cDNA was generated using the SMARTer RACE cDNA 404

Amplification Kit (Clontech). Primers for 3' and 5' RACE were designed against 405

sequences from the assembled transcriptome (for BBI3 and BBI4) and from the 406

published genome (for BBI1 and BBI2). All primers are included in Supplemental Table 407

1. Sequences were amplified using Taq DNA polymerase. Purified 3' and 5' RACE PCR 408

products were cloned into pGEM-T Easy vector (Promega) and sequenced. Primers 409

designed based on the RACE results were used to clone full-length sequences with Taq 410

DNA polymerase. For each PCR-amplified product, at least three independent clones 411

were sequenced to account for errors induced by Taq DNA polymerase. Because of the 412

high sequence similarity between BBI2 and BBI3, a single primer was designed directly 413

Page 16: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

16

upstream of the polyA tail that matched both the BBI2 and BBI3 3' UTR sequences. 414

BBI2 and BBI3 full-length sequences were obtained with the same primer pair. 415

Sequencing and validation of BBI genes 416

Searches of the S. moellendorffii transcriptome identified two BBI-like sequences (BBI3; 417

BBI4) (Figure 2). These sequences were not initially found in the published genome 418

using BLASTp searches. Using the RNA sequences in BLASTn searches, BBI3 and 419

BBI4 were found in the genome v1.0 (BBI3, gene locus: scaffold 54: 291,062…291,531; 420

BBI4, gene locus: scaffold 53: 389,315...389,727). To verify the sequences identified in 421

both published genome and our transcriptome assembly, the sequences were amplified 422

by 5' and 3' RACE, cloned and sequenced. All sequences were further verified by 423

amplification of an identical sequence from genomic DNA, which also demonstrated the 424

genes for these transcripts are intronless. 425

We cloned a fifth BBI-like sequence (BBI5) using primers originally designed for BBI3 426

(Figure 2). We found BBI5 through searches in the genome v1.0 427

(scaffold 47: 346,127…346,596). Following identification of five BBI-like sequences, the 428

RNA-seq reads were mapped to all five BBI-like sequences (Table 1; Supplemental 429

Figure 1). Reads supporting each sequence indicated all were expressed. In the 430

current model for BBI2 (GenBank: XM_002980955.1), an intron is predicted. However, 431

upon cloning the full length sequence from cDNA we found no intron in the sequence. 432

We verified this by mapping the RNA-seq reads onto the genomic sequence with and 433

without the predicted intron present (Supplemental Figure 4). RNA-seq reads mapped 434

to the predicted intron sequence, but not across the exon-exon junction with the 435

predicted intron manually removed, confirming that BBI2 is actually intronless. 436

S. moellendorffii BBI mRNA abundance 437

To estimate the relative abundance of each BBI gene at the mRNA level, clean 438

RNA-seq reads from S. moellendorffii mRNA (NCBI SRA BioSample Accession: 439

SAMN05958544) were mapped onto the ORF of each gene using CLC Genomics 440

Workbench 6.5.1 (CLC bio). To avoid non-specific reads mapping due to the high 441

sequence similarity between the BBI-like genes, stringent parameters were used. 442

Page 17: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

17

Mapping parameters were set to a length fraction of 1.0 and similarity fraction of 1.0 to 443

allow only perfect matches to map to each gene. 444

To determine if BBI2 was intronless as we predicted, reads were mapped to genomic 445

DNA with and without the currently annotated (GenBank: XP_002981001) intron using 446

the same parameters above (Supplemental Figure 4). 447

Homology modelling of S. moellendorffii BBI3 448

A protein model of BBI3 was generated using EasyModeller 4.0, a graphical user 449

interface for MODELLER. Only the BBI-like domain (BBI346-116) was used for the protein 450

model. The model was built against the following templates: soybean BBI (1BBI), the 451

snail medick (Medicago scutellata) BBI (1MVZ), and cowpea (Vigna unguiculata) BBI 452

(2R33), which show 28%, 31% and 28% sequence similarity with the BBI3 BBI domain, 453

respectively. 454

Recombinant protein expression, purification and mutagenesis 455

The protein encoded by BBI3 was chosen for expression in E. coli as it had the highest 456

number of reads map to its sequence, indicating that it is the most abundant BBI (Table 457

2; Supplemental Figure 1). A synthetic BBI3 ORF that included an N-terminal six-His 458

tag and a TEV protease cleavage site in lieu of its ER signal was designed with optimal 459

codon usage for E. coli (GENEART) (Supplemental Figure 2). The sequence was sub-460

cloned into the BamHI and SalI sites of pQE30 (Qiagen). The pQE30-BBI3 construct 461

and the suppressor plasmid pREP4 (Qiagen) were co-transformed into the E.coli strain 462

Shuffle Express (New England Biolabs, Cat. # C3028H) for protein production. 463

To test the predicted P1 inhibitory residues of BBI3, a series of mutant sequences were 464

generated by site-directed mutagenesis (Zheng et al., 2004). Three BBI3 motifs are 465

similar to the conserved BBI inhibitory motif. In all three cases, the putative P1 Lys was 466

mutated to Ala by site-directed PCR mutagenesis using primers listed in Supplemental 467

Table 1. 468

E. coli expressing BBI3 or BBI3 mutants were grown in 750 mL LB in 2 L flasks at 30°C 469

to an OD600 of 0.8, induced by isopropyl β-D-1-thiogalactopyranoside addition to 1 mM, 470

and incubated overnight at 16°C. Bacterial pellets (~5 mL) were resuspended in 25 mL 471

Page 18: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

18

lysis buffer (50 mM Tris-HCl pH 8.0, 1 M sodium chloride, 30 mM imidazole) and lysed 472

by sonication. After centrifugation, the cleared lysate was incubated with Ni-NTA resin 473

(BioRad) at 4°C overnight with mild agitation. The resin was washed with lysis buffer 474

and eluted in elution buffer (50 mM Tris-HCl pH 8.0, 100 mM sodium chloride, 300 mM 475

imidazole). The protein was further purified by gel-filtration chromatography (BioLogic 476

DuoFlow, Bio-Rad) on a 10/300 Superdex™ 200 Increase column (GE Healthcare) or 477

10/300 Superdex™ 75 GL column (GE Healthcare) in gel filtration buffer (20 mM Tris-478

HCl pH 8.0, 50 mM sodium chloride). 479

Trypsin inhibition assay 480

Inhibition was determined as described (Prasad et al., 2010). Protein concentration was 481

determined with a Pierce™ BCA Protein Assay Kit (Thermo Scientific) using BSA as a 482

standard. BBI from soybean was used as a positive control (Sigma Aldrich, Cat. # 483

T9777). The inhibitors were diluted to 100 µg/mL in gel filtration buffer and 5 µL was 484

added to 20 µL of 25 µg/mL trypsin from bovine pancreas (Sigma Aldrich) dissolved in 485

50 mM Tris-Cl, 20 mM calcium chloride, pH 8.0 and incubated for 15 min at 37°C. 486

Residual trypsin activity was determined by the addition of 125 µL of 1 mM 487

N-α-benzoyl-DL-arginine-p-nitroanilide (BAPNA) substrate (Sigma Aldrich) dissolved in488

99% 50 mM Tris-HCl, 20 mM calcium chloride, pH 8.0, 1% (v/v) dimethyl sulfoxide and 489

incubation for 45 min at 37 °C. The reaction was stopped with the addition of 25 µL of 490

30% (v/v) acetic acid. The absorbance was measured at 410 nm. 491

Comprehensive BLAST searches and phylogenetic analysis 492

BLAST searches to identify BBI-like sequences from other species were performed 493

against the NCBI and phytozome genome databases using BBI sequences from 494

legumes and cereals as well as the Selaginella BBI sequences as baits. A list of 495

genome version numbers used for tBLASTn searches is included in Supplemental 496

Data Set 1. Searches were completed by tBLASTn using an E-value threshold of 50. 497

Sequences that shared similarity around the conserved motif were collected and pooled 498

into a single file. Only full-length sequences were used for analysis. Accession numbers 499

for all sequences are included in Supplemental Data Set 1. These sequences were 500

Page 19: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

19

analysed in a PFAM batch sequence search and each sequence was found to be have 501

homology with the conserved BBI domain. 502

An alignment using the predicted amino acid sequences from all BBI-like sequences 503

identified through tBLASTn searches, as well as those from Selaginella moellendorffii 504

and Isoetes drumondii, was generated with ClustalW (2.0.12) followed by manual 505

editing with BioEdit Sequence Alignment Editor v7.2.5 (Hall, 1999). Other than manually 506

adjusting the alignment, no deletions were made to avoid reducing the accuracy of the 507

analysis. According to a comprehensive study of filtering methods by (Tan et al., 2015), 508

all current filtering methods reduce accuracy of the resulting phylogenetic analysis. A 509

phylogenetic tree was generated from a protein distance matrix using neighbour joining 510

methods using the Jones-Taylor-Thornton model in the Phylip package (v 3.67). Highly 511

similar sequences were not included. Sequences that resulted in very long branch 512

lengths were removed as potential gene model errors. Following the removal of these 513

sequences, the alignment and phylogenetic analysis was repeated with a 105 BBI-like 514

sequences (Supplemental File 1). Bootstrap values were calculated from 1,000 515

phylogenetic constructions using a distance neighbour joining analysis and mapped 516

back onto the original tree. The phylogenetic tree graphical representation was 517

generated with FigTree (v 1.4). 518

Sequencing BBI-like genes from banana and quillwort 519

Leaf tissue was collected from a Cavendish banana (Musa acuminata) tree sourced 520

from Bunnings Warehouse. Live Isoetes drummondii plants were collected by Kingsley 521

Dixon from Alison Baird Reserve, Kenwick in Western Australia. 522

I. drummondii was sequenced by Illumina NextSeq500 as 2×151 pair-end read length523

runs. The transcriptome was assembled as described for Selaginella species. Relevant 524

statistics are presented in Table 2. 525

To confirm the presence of BBI-like sequences in Musa acuminata, genomic DNA was 526

extracted using the DNeasy Plant Minikit (Qiagen). RNA was extracted from 527

I. drummondii as described for Selaginella species. Reverse transcription of this RNA528

was performed using a ProtoScript® II First Strand cDNA Synthesis Kit (NEB). Primers 529

Page 20: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

20

designed to amplify the complete ORF from cDNA or genomic DNA are included in 530

Supplemental Table 1. Purified PCR products were cloned into pGEM-T Easy 531

(Promega) and sequenced. At least two independent clones were used to account for 532

errors introduced by Taq DNA polymerase. 533

Accession numbers 534

Sequence data generated for this article can be found in the NCBI short read archive 535

and GenBank under accession numbers SRP092379 and KY069178-KY069186, 536

respectively. Protein Data Bank codes for data used in figures are: SFTI-1 1JBL; Gmaa 537

1BBI; HvuBBIa 2FJ8; MscBBI 1MVZ. GenBank accession numbers for sequences used 538

in figures are Gmab: ACU13240; HvuBBIb: BAJ91702. 539

Supplemental Data 540

Supplemental Figure 1. Relative BBI transcript abundance in S. moellendorffii tissue 541

shown as the number of clean RNAseq reads that mapped to each open reading frame. 542

Supplemental Figure 2. BBI3 synthetic protein sequence used for recombinant protein 543

production and SDS-PAGE gel of expressed protein before and after TEV cleavage to 544

remove the 6-His tag. 545

Supplemental Figure 3. Trypsin inhibition by S. moellendorffii BBI3 and Glycine max 546

BBI. 547

Supplemental Figure 4. Mapping of RNA-seq reads to genomic DNA sequence shows 548

that BBI2 is intronless. 549

Supplemental Table 1. All primers used in this study. 550

Supplemental Data Set 1. Accession numbers and sequence information for 551

sequences used to generate the phylogeny presented in Figure 5. Sequences used in 552

the phylogeny presented in Figure 5 have a corresponding ID. Assembly version 553

numbers are given for genomes accessed through the Phytozome BLAST portal. 554

Assembly accession numbers are given for genomes accessed through the NCBI 555

Page 21: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

21

BLAST portal. Genomes for which no BBI-like sequences were found are shown in red 556

text. 557

Supplemental File 1. Sequence alignment of full-length BBI sequences used to 558

generate the BBI phylogeny shown in Figure 6. 559

ACKNOWLEDGMENTS 560

The authors thank J. Heazlewood for S. moellendorffii cuttings and staff at Hilltop 561

Nursery for cuttings from S. kraussiana and S. martensii. A.M.J. was supported by a 562

University International Stipend and a Scholarship for International Research Fees from 563

UWA; A.J. was supported by an International Postgraduate Research Scholarship and 564

an Australian Postgraduate Award from UWA; J.Z. was supported by an International 565

Postgraduate Research Scholarship and an Australian Postgraduate Award from UWA; 566

G.J.K. was supported by a Hackett Postgraduate Scholarship from UWA; J.W. was 567

supported by an Australian Research Council (ARC) Centre of Excellence grant 568

CE140100008; J.S.M. was supported by an ARC Future Fellowship (FT120100013). 569

This work was supported by ARC grant DP130101191. 570

AUTHOR CONTRIBUTIONS 571

J.S.M. conceived the study. A.M.J. designed and performed all experimentation with the 572

exception of the following: J.S.M. performed RNA extractions; A.S.J. and J.Z. 573

assembled transcriptomes; D.S., O.B., and J.W. prepared libraries and performed 574

sequencing; all authors analyzed the data; G.J.K. and C.B. optimized methods for 575

protein purification. A.M.J. and J.S.M. wrote the manuscript with contributions from all 576

authors. 577

Table 1. SmoBBI mRNAexpression. 578

The number of RNA-seq clean reads mapping to each SmoBBI open reading frame and 579

the average coverage, which is indicative of expression level. Average coverage 580

represents the average number of reads mapping to each position of the reference 581

gene. Mapped reads are shown as a graphical representation in Supplemental Figure 582

1. 583

Page 22: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

22

Gene Mapped reads

Average Coverage

SmoBBI1 77 20. 7SmoBBI2 1,539 413.2SmoBBI3 2,383 639.7SmoBBI4 30 11.7SmoBBI5 589 158.1

Table 2. De novo transcriptome assembly statistics. 584

The number of contigs, number of raw and clean reads, the word size used to generate 585

each assembly and the N50 statistic (summary of the lengths of the longest contigs until 586

50% of the total contig length is reached) are shown. 587

Species Raw reads Clean reads Word size Contigs N50 S. moellendorffii 63,151,678 56,711,459 40 (Single) 44,564 584 S. martensii 90,146,302 82,241,287 40 (Paired) 51,848 557 S. kraussiana 145,585,252 135,198,965 60 (Paired) 44,996 780 I. drummondii 103,198,884 68,341,138 60 (Paired) 264,900 427 588

589

References 590

Albalat, R., and Canestro, C. (2016). Evolution by gene loss. Nat. Rev. Genet. 17, 379-391. 591 Angiosperm Phylogeny Group. (2016). An update of the Angiosperm Phylogeny Group classification for 592

the orders and families of flowering plants: APG IV. Bot. J. Linn. Soc. 181, 1-20. 593 Arrigo, N., Therrien, J., Anderson, C.L., Windham, M.D., Haufler, C.H., and Barker, M.S. (2013). A total 594

evidence approach to understanding phylogenetic relationships and ecological diversity in 595 Selaginella subg. Tetragonostachys. Am. J. Bot. 100, 1672-1682. 596

Banks, J.A. (2009). Selaginella and 400 million years of separation. Annu. Rev. Plant Biol. 60, 223-238. 597 Banks, J.A., Nishiyama, T., Hasebe, M., Bowman, J.L., Gribskov, M., dePamphilis, C., Albert, V.A., Aono, 598

N., Aoyama, T., Ambrose, B.A., Ashton, N.W., Axtell, M.J., Barker, E., Barker, M.S., Bennetzen, 599 J.L., Bonawitz, N.D., Chapple, C., Cheng, C., Correa, L.G.G., Dacre, M., DeBarry, J., Dreyer, I., 600 Elias, M., Engstrom, E.M., Estelle, M., Feng, L., Finet, C., Floyd, S.K., Frommer, W.B., Fujita, T., 601 Gramzow, L., Gutensohn, M., Harholt, J., Hattori, M., Heyl, A., Hirai, T., Hiwatashi, Y., 602 Ishikawa, M., Iwata, M., Karol, K.G., Koehler, B., Kolukisaoglu, U., Kubo, M., Kurata, T., 603 Lalonde, S., Li, K., Li, Y., Litt, A., Lyons, E., Manning, G., Maruyama, T., Michael, T.P., Mikami, 604 K., Miyazaki, S., Morinaga, S.i., Murata, T., Mueller‐Roeber, B., Nelson, D.R., Obara, M., Oguri, 605 Y., Olmstead, R.G., Onodera, N., Petersen, B.L., Pils, B., Prigge, M., Rensing, S.A., Riaño‐606 Pachón, D.M., Roberts, A.W., Sato, Y., Scheller, H.V., Schulz, B., Schulz, C., Shakirov, E.V., 607 Shibagaki, N., Shinohara, N., Shippen, D.E., Sørensen, I., Sotooka, R., Sugimoto, N., Sugita, M., 608

Page 23: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

23

Sumikawa, N., Tanurdzic, M., Theißen, G., Ulvskov, P., Wakazuki, S., Weng, J.K., Willats, 609 W.W.G.T., Wipf, D., Wolf, P.G., Yang, L., Zimmer, A.D., Zhu, Q., Mitros, T., Hellsten, U., Loqué, 610 D., Otillar, R., Salamov, A., Schmutz, J., Shapiro, H., Lindquist, E., Lucas, S., Rokhsar, D., and 611 Grigoriev, I.V. (2011). The Selaginella genome identifies genetic changes associated with the 612 evolution of vascular plants. Science 332, 960-963. 613

Bell, C.D., Soltis, D.E., and Soltis, P.S. (2010). The age and diversification of the angiosperms re-614 revisited. Am. J. Bot. 97, 1296-1303. 615

Birk, Y. (1961). Purification and some properties of a highly active inhibitor of trypsin and α-616 chymotrypsin from soybeans. Biochim. Biophys. Acta 54, 378-381. 617

Birk, Y., Gertler, A., and Khalef, S. (1963). A pure trypsin inhibitor from soya beans. Biochem. J. 87, 281. 618 Bowman, D.E. (1946). Differentiation of soy bean antitryptic factors. Proc. Soc. Exp. Biol. Med. 63, 547-619

550. 620 Brauer, A.B.E., and Leatherbarrow, R.J. (2003). The conserved P1' Ser of Bowman-Birk-type proteinase 621

inhibitors is not essential for the integrity of the reactive site loop. Biochem. Biophys. Res. 622 Commun. 308, 300-305. 623

Brauer, A.B.E., Domingo, G.J., Cooke, R.M., Matthews, S.J., and Leatherbarrow, R.J. (2002). A 624 conserved cis peptide bond is necessary for the activity of Bowman-Birk inhibitor protein. 625 Biochemistry 41, 10608-10615. 626

Clemente, A., Arques, M.C., Dalmais, M., Le Signor, C., Chinoy, C., Olias, R., Rayner, T., Isaac, P.G., 627 Lawson, D.M., Bendahmane, A., and Domoney, C. (2015). Eliminating anti-nutritional plant 628 food proteins: the case of seed protease inhibitors in pea. PLoS One 10, e0134634. 629

D'Hont, A., Denoeud, F., Aury, J.M., Baurens, F.C., Carreel, F., Garsmeur, O., Noel, B., Bocs, S., Droc, G., 630 Rouard, M., Da Silva, C., Jabbari, K., Cardi, C., Poulain, J., Souquet, M., Labadie, K., Jourda, C., 631 Lengelle, J., Rodier‐Goud, M., Alberti, A., Bernard, M., Correa, M., Ayyampalayam, S., McKain, 632 M.R., Leebens‐Mack, J., Burgess, D., Freeling, M., Mbeguie, A.M.D., Chabannes, M., Wicker, T., 633 Panaud, O., Barbosa, J., Hribova, E., Heslop‐Harrison, P., Habas, R., Rivallan, R., Francois, P., 634 Poiron, C., Kilian, A., Burthia, D., Jenny, C., Bakry, F., Brown, S., Guignon, V., Kema, G., Dita, 635 M., Waalwijk, C., Joseph, S., Dievart, A., Jaillon, O., Leclercq, J., Argout, X., Lyons, E., Almeida, 636 A., Jeridi, M., Dolezel, J., Roux, N., Risterucci, A.M., Weissenbach, J., Ruiz, M., Glaszmann, J.C., 637 Quetier, F., Yahiaoui, N., and Wincker, P. (2012). The banana (Musa acuminata) genome and 638 the evolution of monocotyledonous plants. Nature 488, 213-217. 639

Domingo, G.J., Leatherbarrow, R.J., Freeman, N., Patel, S., and Weir, M. (1995). Synthesis of a mixture 640 of cyclic peptides based on the Bowman-Birk reactive site loop to screen for serine protease 641 inhibitors. Int. J. Pept. Protein Res. 46, 79-87. 642

Elliott, A.G., Delay, C., Liu, H., Phua, Z., Rosengren, K.J., Benfield, A.H., Panero, J.L., Colgrave, M.L., 643 Jayasena, A.S., Dunse, K.M., Anderson, M.A., Schilling, E.E., Ortiz‐Barrientos, D., Craik, D.J., 644 and Mylne, J.S. (2014). Evolutionary origins of a bioactive peptide buried within preproalbumin. 645 Plant Cell 26, 981-995. 646

Engelken, J., Brinkmann, H., and Adamska, I. (2010). Taxonomic distribution and origins of the extended 647 LHC (light-harvesting complex) antenna protein superfamily. BMC Evol. Biol. 10, 233. 648

Gariani, T., McBride, J.D., and Leatherbarrow, R.J. (1999). The role of the P2' position of Bowman-Birk 649 proteinase inhibitor in the inhibition of trypsin. Studies on P2' variation in cyclic peptides 650 encompassing the reactive site loop. Biochim. Biophys. Acta 1431, 232-237. 651

Habib, H., and Fazili, K.M. (2007). Plant protease inhibitors: a defense strategy in plants. Biotechnol. 652 Mol. Biol. Rev. 2, 68-85. 653

Hall, T.A. (1999). BioEdit: a user-friendly biological sequence alignment editor and analysis program for 654 Windows 95/98/NT. Nucleic Acids Symp. S. 41, 95-98. 655

Page 24: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

24

Hazkani‐Covo, E., Altman, N., Horowitz, M., and Graur, D. (2002). The Evolutionary History of 656 Prosaposin: Two Successive Tandem-Duplication Events Gave Rise to the Four Saposin Domains 657 in Vertebrates. J. Mol. Evol. 54, 30-34. 658

Hirsh, A.E., and Fraser, H.B. (2001). Protein dispensability and rate of evolution. Nature 411, 1046-1049. 659 Jayasena, A.S., Secco, D., Bernath‐Levin, K., Berkowitz, O., Whelan, J., and Mylne, J.S. (2014). Next 660

generation sequencing and de novo transcriptomics to study gene evolution. Plant Methods 10, 661 34. 662

Krylov, D.M., Wolf, Y.I., Rogozin, I.B., and Koonin, E.V. (2003). Gene loss, protein sequence divergence, 663 gene dispensability, expression level, and interactivity are correlated in eukaryotic evolution. 664 Genome Res. 13, 2229-2235. 665

Li, J., Zhang, C., Xu, X., Wang, J., Yu, H., Lai, R., and Gong, W. (2007). Trypsin inhibitory loop is an 666 excellent lead structure to design serine protease inhibitors and antimicrobial peptides. FASEB J. 667 21, 2466-2473. 668

Luckett, S., Garcia, R.S., Barker, J.J., Konarev, A.V., Shewry, P.R., Clarke, A.R., and Brady, R.L. (1999). 669 High-resolution structure of a potent, cyclic proteinase inhibitor from sunflower seeds. J. Mol. 670 Biol. 290, 525-533. 671

McBride, J.D., Brauer, A.B.E., Nievo, M., and Leatherbarrow, R.J. (1998). The role of threonine in the P2 672 position of Bowman-Birk proteinase inhibitors: studies on P2 variation in cyclic peptides 673 encompassing the reactive site loop. J. Mol. Biol. 282, 447-457. 674

McBride, J.D., Watson, E.M., Brauer, A.B., Jaulent, A.M., and Leatherbarrow, R.J. (2002). Peptide 675 mimics of the Bowman-Birk inhibitor reactive site loop. Biopolymers 66, 79-92. 676

Mello, M.O., Tanaka, A.S., and Silva‐Filho, M.C. (2003). Molecular evolution of Bowman–Birk type 677 proteinase inhibitors in flowering plants. Mol. Phylogenet. Evol. 27, 103-112. 678

Mylne, J.S., Colgrave, M.L., Daly, N.L., Chanson, A.H., Elliott, A.G., McCallum, E.J., Jones, A., and Craik, 679 D.J. (2011). Albumins and their processing machinery are hijacked for cyclic peptides in 680 sunflower. Nat. Chem. Biol. 7, 257-259. 681

Mylne, J.S., Chan, L.Y., Chanson, A.H., Daly, N.L., Schaefer, H., Bailey, T.L., Nguyencong, P., Cascales, L., 682 and Craik, D.J. (2012). Cyclic peptides arising by evolutionary parallelism via asparaginyl-683 endopeptidase-mediated biosynthesis. Plant Cell 24, 2765-2778. 684

Nishino, N., Aoyagi, H., Kato, T., and Izumiya, N. (1977). Studies on the synthesis of proteinase 685 inhibitors. I. Synthesis and activity of nonapeptide fragments of soybean Bowman-Birk inhibitor. 686 J. Biochem. 82, 901-909. 687

Odani, S., and Ikenaka, T. (1973). Studies on soybean trypsin inhibitors VIII. Disulfide bridges in soybean 688 Bowman-Birk proteinase inhibitor. J. Biochem. 74, 697-715. 689

Park, E.Y., Kim, J.A., Kim, H.W., Kim, Y.S., and Song, H.K. (2004). Crystal structure of the Bowman-Birk 690 inhibitor from barley seeds in ternary complex with porcine trypsin. J. Mol. Biol. 343, 173-186. 691

Petersen, T.N., Brunak, S., von Heijne, G., and Nielsen, H. (2011). SignalP 4.0: discriminating signal 692 peptides from transmembrane regions. Nat. Meth. 8, 785-786. 693

Prakash, B., Selvaraj, S., Murthy, M.R., Sreerama, Y.N., Rao, D.R., and Gowda, L.R. (1996). Analysis of 694 the amino acid sequences of plant Bowman-Birk inhibitors. J. Mol. Evol. 42, 560-569. 695

Prasad, E.R., Merzendorfer, H., Madhurarekha, C., Dutta‐Gupta, A., and Padmasree, K. (2010). 696 Bowman−Birk Proteinase inhibitor from Cajanus cajan seeds: purification, characterization, and 697 insecticidal properties. J. Agric. Food Chem. 58, 2838-2847. 698

Qi, R.F., Song, Z.W., and Chi, C.W. (2005). Structural features and molecular evolution of Bowman-Birk 699 protease inhibitors and their potential application. Acta Biochim. Biophys. Sin. 37, 283-292. 700

Rawlings, N.D., Barrett, A.J., and Finn, R. (2016). Twenty years of the MEROPS database of proteolytic 701 enzymes, their substrates and inhibitors. Nucleic Acids Res. 44, D343-D350. 702

Page 25: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

25

Roberts, T.H., and Hejgaard, J. (2008). Serpins in plants and green algae. Funct. Integr. Genomics 8, 1-703 27. 704

Schechter, I., and Berger, A. (1967). On the size of the active site in proteases. I. Papain. Biochem. 705 Biophys. Res. Commun. 27, 157-162. 706

Soltis, D.E., Albert, V.A., Leebens‐Mack, J., Palmer, J.D., Wing, R.A., dePamphilis, C.W., Ma, H., 707 Carlson, J.E., Altman, N., Kim, S., Wall, P.K., Zuccolo, A., and Soltis, P.S. (2008). The Amborella 708 genome: an evolutionary reference for plant biology. Genome Biol 9, 402. 709

Song, H.K., Kim, Y.S., Yang, J.K., Moon, J., Lee, J.Y., and Suh, S.W. (1999). Crystal structure of a 16 kDa 710 double-headed Bowman-Birk trypsin inhibitor from barley seeds at 1.9 Å resolution. J. Mol. Biol. 711 293, 1133-1144. 712

Tan, G., Muffato, M., Ledergerber, C., Herrero, J., Goldman, N., Gil, M., and Dessimoz, C. (2015). 713 Current Methods for Automated Filtering of Multiple Sequence Alignments Frequently Worsen 714 Single-Gene Phylogenetic Inference. Syst. Biol. 64, 778-791. 715

Terada, S., Sato, K., Kato, T., and Izumiya, N. (1978). Inhibitory properties of nonapeptide loop 716 structures related to reactive sites of soybean Bowman-Birk inhibitor. FEBS Lett. 90, 89-92. 717

Tran, L.T., Taylor, J.S., and Constabel, C.P. (2012). The polyphenol oxidase gene family in land plants: 718 Lineage-specific duplication and expansion. BMC Genomics 13, 395. 719

Zheng, L., Baumann, U., and Reymond, J.L. (2004). An efficient one-step site-directed and site-720 saturation mutagenesis protocol. Nucl. Acids Res. 32, e115. 721

722

Page 26: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

26

723

Figure Captions 724

Figure 1. The CTKSIPPIC motif: structures, sequences and contexts. 725

(A) Structural alignment of SFTI-1 (PDB code: 1JBL), a BBI from soybean (GmaBBIa; 726

PDB code: 1BBI) and a BBI from barley (HvuBBIa; PDB code: 2FJ8). The trypsin 727

inhibitory loop of all three sequences (boxed) share structural similarity. 728

(B) BBIs are common in the phylogenetically separate cereals and legumes marked on 729

an angiosperm phylogeny of rbcL sequences (Elliott et al., 2014). 730

(C) Sequence alignment of SFTI-1, GmaBBI and HvuBBI. The homologous inhibitory 731

‘heads’ are indicated with black brackets. Inhibitory loops are indicated with Roman 732

numerals and the corresponding loops are also labelled in panel A. The soybean 733

sequence exemplifies the “double-headed” structure, with two homologous inhibitory 734

motifs, the second of which has been lost in cereal BBIs as a result of the loss of two 735

Cys residues (inhibitory loop II). Many monocot BBIs have undergone internal gene 736

duplications resulting in multiple inhibitory loops (e.g. inhibitory loop III). 737

Figure 2. S. moellendorffii BBI3 is predicted to be a BBI based on sequence similarity at 738

the inhibitory motifs and shared primary protein architecture. 739

(A) BOXSHADE alignment of S. moellendorffii BBI-like (SmoBBI) protein sequences 740

with soybean BBI (GmaBBIb) and barley BBI (HvuBBIb) shows conservation of the 741

trypsin inhibitory loop (grey box). All proteins, with the exception of SmoBBI4, share a 742

similar protein architecture, with an ER signal (line above sequence) followed by Cys-743

rich sequence with two spatially distinct conserved inhibitory motifs. Cys residues that 744

are conserved in all sequences are indicated with asterisks. 745

(B) Protein homology model of SmoBBI346-116 aligned with Medicago scutellata BBI 746

(PDB code: 1MVZ) is shown along with the sequence alignment. The conserved 747

inhibitory loops are shown as opaque, while the remaining protein is translucent and the 748

corresponding inhibitory motifs are boxed in the alignment. The side chains of the 749

labelled P1 residues (R64, K67, R90, K105) and disulfide bonds are displayed in stick 750

format. 751

Page 27: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

27

Figure 3. Trypsin inhibition by wild-type and mutant S. moellendorffii BBI3. 752

(A) Full-length wild-type BBI3 showing predicted inhibitory residues (bold). 753

(B) Partial BBI3 sequences showing mutations (black highlight). 754

(C) Inhibitors were incubated with trypsin and a colorogenic substrate of trypsin whose 755

activity is measurable as OD410. Error bars represent standard deviation for three 756

technical replicates in a single microtiter plate. Trypsin incubated with BAPNA substrate 757

but no inhibitor was used as a control (NIC). 758

Figure 4. Distribution of BBIs in land plants. 759

The number of BBI-like sequences identified in each plant genome searched. Accession 760

numbers for all sequences are included in Supplemental Data Set 1. Predicted gene 761

loss events are indicated with an X. The phylogeny of the plant genomes investigated is 762

derived from species trees available from Cogepedia and APG III. 763

Figure 5. BBI-like sequences identified from quillwort and banana. 764

Alignment of translated sequences of genes cloned from genomic DNA from banana 765

(Musa acuminata, Mac) and cDNA from quillwort (Isoetes drummondii, Idr). Sequences 766

are aligned with a BBI encoded by a gene from soybean (GmaBBIb) to illustrate 767

conserved regions including the inhibitory loops (grey boxes). Conserved Cys residues 768

are indicated by asterisks. 769

Figure 6. Phylogeny of BBI-like sequences in plants and study of sequence diversity 770

through phylogenetic analysis of BBI ‘heads’. 771

Neighbour-joining tree from a protein distance matrix of 104 BBI-like sequences. 772

Sequences were identified by searches within plant genome assemblies. Major clades 773

are indicated and defined by branching patterns. The five S. moellendorffii sequences 774

and the I. drummondii sequence were used to root the tree with the assumption they 775

were evolutionarily most distant from all other sequences. Percent bootstrap values 776

from 1000 replicates of a distance neighbour joining analysis are indicated at each node 777

(only nodes with >50% bootstrap support are labelled). Identifiers and accession 778

numbers are listed in Supplemental Data Set 1. The scale bar represents number of 779

substitutions per site. 780

Page 28: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

AICTRSNPPTCRCVDEVKKCAPTCKTCLPSRSRPSRRVCIDSYFGPVPPRCTPR124

G1RCTKSIPPICFPD

14

D1DESSKPCCDQCACTKSNPPQCRCSDMRLNSCHSACKSCICALSYPAQCFCVDITDFCYEPCKPSEDDKEN

71

A1GKKRPWKCCDEAVCTRSIPPICTCMDEVFE-CPKTCKSCGPSMGDPSRRICQDQYVGDPGPICRPWECCDK

SFTI-1

GmaBBIa

HvuBBIa

HvuBBIa

III

II

I

III

III

A B

C

cereals

legumes

Figure 1. The CTKSIPPIC motif: structures, sequences and contexts.

(A) Structural alignment of SFTI-1 (PDB code: 1JBL), a BBI from soybean (GmaBBIa; PDB code: 1BBI) and a BBI from barley

(HvuBBIa; PDB code: 2FJ8). The trypsin inhibitory loop of all three sequences (boxed) share structural similarity.

(B) BBIs are common in the phylogenetically separate cereals and legumes marked on an angiosperm phylogeny of rbcL

sequences (Elliott et al., 2014).

(C) Sequence alignment of SFTI-1, GmaBBI and HvuBBI. The homologous inhibitory ‘heads’ are indicated with black brackets.

Inhibitory loops are indicated with Roman numerals and the corresponding loops are also labelled in panel A. The soybean

sequence exemplifies the “double-headed” structure with two homologous inhibitory motifs, the second of which has been lost in

cereal BBIs as a result of the loss of two Cys residues (inhibitory loop II). Many monocot BBIs have undergone internal gene

duplications resulting in multiple inhibitory loops (e.g. inhibitory loop III).

Page 29: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

Q

Q

Q

H

SmoBBI3 NSQCCPNSKFCPKGKFCTICTKSYPPSCFCTKAPPNCGVGSSCCSDGDCPKGKECSVCTKSFPPICRCG--TK

MscBBI TKSTTTACCDFCP-------CTRSIPPQCQCTDVREKC---HSAC--------KSC-LCTRSFPPQCRCYDITDFCYPSCS

A

B

GmaBBIbMELSMKVLVKVASLLFLLEFTATVVDARFDPSSFITQFLPNAEANNYYVKSTTKACCN---SC- CTKSIPPQC CSDIG

HvuBBIb

MRPQVLLVAFAVLAVVAALP--------IAHGQG------ASPWPCCD---KCG CTKSIPPQC CQDVS

SmoBBI1 MAQSIKFAPLIALLLCIAATAEVLDVPGEFFFVDGAARLPSECRPESLCCPNSKFCPK-GKYCT MPSNPPLC CTEAP

SmoBBI2 MAQSIKFAPLIALLLCIAVTAEVLDLPGEFFFVDGAASRPSGCRPNS CCPNSKFCPK-GKYCT CTKSYPPSC CTKAP

SmoBBI3 MAQSIKFAPLIALLLCIAVTAEVLDLPGEFFFVDGAASRPSACRPNS CCPNSKFCPK-GKFCT CTKSYPPSC CTKAP

SmoBBI4 MVCRPNS CCPGSKRCSKAGQFCG CTRSFPPTC CDAPN

SmoBBI5 MAQSIKFAPLIALLLCIAVTAEVLDLPGDFFFVDGAASRPSECRPNS CCPNSKFCPK-GKYCT CTRSNPPSC CTKAP

GmaBBIbET-CHSA-----------CKTC- CTRSIPPQC CSD-ITNFCYEPCNSSETEAH

HvuBBIbPTGCNSA-----------CKSC- -VRSTGGFQ CADSITNFCERRCTPAA

SmoBBI1 PN-CGVGSSCCSDGDCPKGKACS CTKSIPPIC CGPEPKIVLVSE

SmoBBI2 PN-CGVGSSCCSDGDCPKGRECS CTRSIPPIC CGAKRKLVLVSE

SmoBBI3 PN-CGVGSSCCSDGDCPKGKECS TKSFPPIC CGTKPKLVLVSE

SmoBBI4 PKRCSAASQCCSDADCLNGGSCSL TRSIPPIC CFKPGKAKLKL

SmoBBI5 PN-CGVGSSCCSDGDCPKGKECS CTKSFPPIC CGPKPKLVLVSE

I

-V

V

VC

C

V

H

-R

R

R

R

R

P

V

IC

V

I

R

V

R

R

V

F

F

F

F

* **

* * * **

* *

K67

R64

K105R90

64 90

67 105

Figure 2. S. moellendorffii BBI3 is predicted to be a BBI based sequence similarity at the inhibitory motifs and shared primary protein architecture.(A) BOXSHADE alignment of S. moellendorffii BBI-like (SmoBBI) protein sequences with soybean BBI (GmaBBIb) and barley BBI (HvuBBIb) shows conservation of the trypsin inhibitory loop (grey box). All proteins, with the exception of SmoBBI4, share a similar protein architecture with an ER signal (line above sequence) followed by Cys rich sequence with two spatially distinct conserved inhibitory motifs. Cys residues that are conserved in all sequences are indicated with asterisks.(B) Protein homology model of SmoBBI346-116 aligned with Medicago scutellata BBI (PDB code: 1MVZ) is shown along with the sequence alignment. The conserved inhibitory loops are shown as opaque while the remaining protein is translucent and the corresponding inhibitory motifs are boxed in the alignment. The P1 residues and disulfide bonds are represented as sticks.

Page 30: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

0

0.2

0.4

0.6

0.8

1.0

1.2

WT K67A K77A K105A K77A

K105A

K67A

K105A

K67A

K77A

K105A

NIC

Try

psin

activity (

OD

410)

SmoBBI3 WT CTICTKSYPPSCFCTKAPPNC CTKSFPPICRCK67A CTICTASYPPSCFCTKAPPNC CTKSFPPICRCK77A CTICTKSYPPSCFCTAAPPNC CTKSFPPICRCK105A CTICTKSYPPSCFCTKAPPNC CTASFPPICRCK77A K105A CTICTKSYPPSCFCTAAPPNC CTASFPPICRCK67A K105A CTICTASYPPSCFCTKAPPNC CTASFPPICRCK67A K77A K105A CTICTASYPPSCFCTAAPPNC CTASFPPICRC

... ...

...

...

.........

.........

... ...

...

......

...

...

...

...

...

...

67 77 105

A

B

C

10 20 30 40

50 60 70 80 90

100 110 120

| | | |

|||||

| | |

MAQSIKFAPLIALLLCIAVTAEVLDLPGEFFFVDGAASRPSACRPNS

QCCPNSKFCPKGKFCTICTKSYPPSCFCTKAPPNCGVGSSCCSDGDC

PKGKECSVCTKSFPPICRCGTKPKLVLVSE

Figure 3. Trypsin inhibition by S. moellendorffii BBI3 and mutants

(A) Full length wildtype BBI3 showing predicted inhibitory residues (bold).

(B) Partial BBI sequences showing mutations (black highlight).

(C) Inhibitors were incubated with trypsin and a colorogenic substrate of trypsin whose activity is

measurable as OD410

. Error bars represent standard deviation for three technical replicates in a

single microtiter plate. Trypsin incubated with BAPNA substrate but no inhibitor was used as a

control (NIC).

Page 31: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

5

5 basal angiospermgymnosperm

mon

ocot

eudi

cot

rosi

d II

aste

ridro

sid

IM

alpi

ghia

les

lycopod

bryphyte

species # Physcomitrella patens 0 Sphagnum fallax 0 Selaginella moellendorffii

Pinus taeda 0 Amborella trichopoda

Spirodela polyrhiza 0 Phalaenopsis equestris 1 Elaeis guineensis 8 Musa acuminata 10 Brachypodium distachyon 5 Brachypodium stacei 4 Oryza sativa 10 Panicum hallii 2 Setaria viridis 8 Zea mays 10 Sorghum bicolor 10 Ananas comosus 2 Aquilegia coerulea 3 Kalanchoe laxiflora 0 Vitis vinifera 0 Linum usitatissimum 0 Ricinus communis 0 Manihot esculenta 0 Populus trichocarpa 0 Salix purpurea 0 Glycine max 13 Phaseolus vulgaris 3 Medicago truncatula 9 Ziziphus jujuba 11 Fragaria vesca 0 Prunus persica 0 Cucumis sativus 0 Eucalyptus grandis 0 Citrus sinensis 0 Citrus clementina 0 Gossypium raimondii 0 Carica papaya 0 Arabidopsis lyrata 0 Arabidopsis thaliana 0 Boechera stricta 0

0 Capsella grandiflora 0 Capsella rubella

Brassica rapa

0

Eutrema salsugineum

0 Amaranthus hypochondriacus 0 Beta vulgaris 0 Primula vulgaris 0 Solanum lycopersicum 0 Solanum tuberosum 0 Mimulus guttatus 0 Carthamus tinctorius 0

X

X

XX

X

X

X

X

X

Figure 4. Distribution of BBIs in land plants. The number of BBI-like sequences identified in each plant genome searched. Accession numbers for all sequences are included in Supplemental Dataset 1. Predicted gene loss events are indicated with an X. The phylogeny of the plant genomes investigated is derived from species trees available from Cogepedia and APG III.

Page 32: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

E

T

A

A

*

* * * * * * * *

* * * * *GmaBBIb MELSMKVLVKVASLLFLLEFTATVV--DARFDPSSFITQFLPN-------AEANNYYVKSTTKACCNSCP CTKSIPPQC CSD-IGE

IdrBBI MATKMGLLILMAVILLCYKSAAGLVCR-----------------PDSECCPKSKFCEAGKFCT CDRSAHPYC CSG-PSS

MacBBI1 MRSAGLVVAFLALV---FLVSLSAAREDPD----IFLPS-------QGIGEEVGGEKPWACCDSCS CTKSIPPQC CTDQLIG

MacBBI2 MRSGGLVVTFLALV---FLVSLSTARVDPHL--LLLLPS-------QGNGEGLAGEKPWACCDMCL CTRSFPPQC CTDELIG

MacBBI3 MRYNMVVFSLVLMVAAAFFASATTTASSSHPELRSALSTKGHEEDGEGVGERSRQRRTWPCCDRCG CTKS PPQC CQD-MVR

GmaBBIb TCHSACKT-----------C-- CTRSIPPQ CSDITN-FCYEPCNSSETE H

IdrBBI DCRAESQCCSDQDCPHGSSCS- CTKSIPPIC CTGVYLDADN

MacBBI1 GCDPNCKT-----------C-- CTRSYPPKC CYDIINDYCGERCNPEQ

MacBBI2 GCHPNCKN-----------C-- CTRSFPPK CRDIIYEDCGDRCHP

MacBBI3 SCHPSCRH-----------CVR PLSVSPPL CMDRIPNYCRRRCTP PLL Q

R

R

R

R

Q

-

A

-

-

G

I

I

I

H

S

CH

CY

YQ

R

R

Figure 5. BBI-like sequences identified from quillwort and banana.Alignment of translated sequences of genes cloned from genomic DNA of banana (Musa acuminata, Mac) and cDNA of quillwort (Isoetes drummondii, Idr). Sequences are aligned with a BBI encoded by a gene from soybean (GmaBBIb) to illustrate conserved regions including the inhibitory loops (grey boxes). Conserved Cys residues are indicated by asterisks.

Page 33: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

0.4

Zea mays 2

Amborella trichopoda 1

Selaginella moellendorffii 2

Medicago truncatula 7

Amborella trichopoda 3

Brachypodium stacei 3

Medicago truncatula 8

Brachypodium distachyon 3

Selaginella moellendorffii 4

Amborella trichopoda 5

Elaeis guineensis 6Elaeis guineensis 2

Musa acuminata 1

Ziziphus jujuba 10

Musa acuminata 5

Phaseolus vulgaris 1

Medicago truncatula 5

Zea mays 6

Aquilegia coerulea 2

Glycine max10

Oryza sativa 2

Ziziphus jujuba 5

Sorghum bicolor 7

Musa acuminata 3

Medicago truncatula 4

Medicago truncatula 3

Oryza sativa 5

Glycine max12

Sorghum bicolor 12

Setaria viridis 1

Panicum hallii 1

Isoetes drummondii 1

Glycine max 5

Oryza sativa 8

Setaria viridis 3

Zea mays 10

Glycine max 8

Selaginella moellendorffii 1

Glycine max 7

Sorghum bicolor 4

Zea mays 1

Setaria viridis 9

Oryza sativa 3

Selaginella moellendorffii 3

Brachypodium distachyon 1

Sorghum bicolor 2Sorghum bicolor 1

Elaeis guineensis 3

Musa acuminata 4

Ziziphus jujuba 1

Zea mays 3

Brachypodium stacei 2

Panicum hallii 2

Brachypodium distachyon 2

Medicago truncatula 2

Medicago truncatula 6

Phaseolus vulgaris 2

Ananas comosus 1

Brachypodium distachyon 5

Oryza sativa 7

Medicago truncatula 1

Brachypodium stacei 4

Elaeis guineensis 8

Aquilegia coerulea 1

Ananas comosus 2

Musa acuminata 2

Brachypodium distachyon 4

Glycine max 1

Medicago truncatula 9

Zea mays 5

Glycine max 6

Elaeis guineensis 7

Setaria viridis 8

Ziziphus jujuba 11

Phalaenopsis equestris 1

Ziziphus jujuba 8

Setaria viridis 2

Oryza sativa 1

Sorghum bicolor 10

Sorghum bicolor 3

Oryza sativa 4

Setaria viridis 7

Selaginella moellendorffii 5

Ziziphus jujuba 6

Elaeis guineensis 4

Sorghum bicolor 8

Amborella trichopoda 4

Ziziphus jujuba 7

Zea mays 9

Elaeis guineensis 5

Oryza sativa 10

Zea mays 4

Amborella trichopoda 2

Setaria viridis 6

Glycine max13

Oryza sativa 6

Elaeis guineensis 1

Setaria viridis 4

Zea mays 8

Ziziphus jujuba 3

Brachypodium stacei 1

Ziziphus jujuba 2

Ziziphus jujuba 9

Phaseolus vulgaris 3

10051.4

98.0

58.5

97.0

99.599.0

77.7

50.0

10066.1

56.7

99.084.1

61.1

99.873.5

70.7

66.269.7

10096.5

10062.8

67.4

85.0

70.4

68.560.8

56.4

94.3

94.8

85.498.7

99.183.9

99.9

100

99.896.5

66.883.3

63.3

58.794.7

63.2

62.4

79.378.8

88.5

88.6

100 92.897.0

53.081.973.2

10064.8

92.1

60.6

97.7

72.888.4

99.7

88.994.3

56.2

67.3

98.562.3

monocot clade

dicot clade

Poaceae clade II

Poaceae clade I

Amborella trichopoda clade

lycopod clade

Figure 6. Phylogeny of BBI-like sequences in plants. Neighbour-joining tree from a protein distance matrix of 104 BBI-like sequences. Sequences were identified by searches within plant genome assemblies. Major clades are indicated and defined by branching patterns. The five S. moellendorffii sequences and the I. drummondii sequence were used to root the tree with the assumption they were evolutionarily most distant from all other sequences. Percent bootstrap values from 1000 replicates of a distance neighbour joining analysis are indicated at each node (only nodes with >50% bootstrap support are labelled). Identifiers and accession numbers are listed in Supplemental Data Set 1. The scale bar represents number of substitutions per site.

Page 34: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

Parsed Citations

Albalat, R., and Canestro, C. (2016). Evolution by gene loss. Nat. Rev. Genet. 17, 379-391.Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Angiosperm Phylogeny Group. (2016). An update of the Angiosperm Phylogeny Group classification for the orders and families offlowering plants: APG IV. Bot. J. Linn. Soc. 181, 1-20.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Arrigo, N., Therrien, J., Anderson, C.L., Windham, M.D., Haufler, C.H., and Barker, M.S. (2013). A total evidence approach tounderstanding phylogenetic relationships and ecological diversity in Selaginella subg. Tetragonostachys. Am. J. Bot. 100, 1672-1682.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Banks, J.A. (2009). Selaginella and 400 million years of separation. Annu. Rev. Plant Biol. 60, 223-238.Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Banks, J.A., Nishiyama, T., Hasebe, M., Bowman, J.L., Gribskov, M., dePamphilis, C., Albert, V.A., Aono, N., Aoyama, T., Ambrose,B.A., Ashton, N.W., Axtell, M.J., Barker, E., Barker, M.S., Bennetzen, J.L., Bonawitz, N.D., Chapple, C., Cheng, C., Correa, L.G.G.,Dacre, M., DeBarry, J., Dreyer, I., Elias, M., Engstrom, E.M., Estelle, M., Feng, L., Finet, C., Floyd, S.K., Frommer, W.B., Fujita, T.,Gramzow, L., Gutensohn, M., Harholt, J., Hattori, M., Heyl, A., Hirai, T., Hiwatashi, Y., Ishikawa, M., Iwata, M., Karol, K.G., Koehler,B., Kolukisaoglu, U., Kubo, M., Kurata, T., Lalonde, S., Li, K., Li, Y., Litt, A., Lyons, E., Manning, G., Maruyama, T., Michael, T.P.,Mikami, K., Miyazaki, S., Morinaga, S.i., Murata, T., Mueller-Roeber, B., Nelson, D.R., Obara, M., Oguri, Y., Olmstead, R.G., Onodera,N., Petersen, B.L., Pils, B., Prigge, M., Rensing, S.A., Riaño-Pachón, D.M., Roberts, A.W., Sato, Y., Scheller, H.V., Schulz, B., Schulz,C., Shakirov, E.V., Shibagaki, N., Shinohara, N., Shippen, D.E., Sørensen, I., Sotooka, R., Sugimoto, N., Sugita, M., Sumikawa, N.,Tanurdzic, M., Theißen, G., Ulvskov, P., Wakazuki, S., Weng, J.K., Willats, W.W.G.T., Wipf, D., Wolf, P.G., Yang, L., Zimmer, A.D., Zhu,Q., Mitros, T., Hellsten, U., Loqué, D., Otillar, R., Salamov, A., Schmutz, J., Shapiro, H., Lindquist, E., Lucas, S., Rokhsar, D., andGrigoriev, I.V. (2011). The Selaginella genome identifies genetic changes associated with the evolution of vascular plants. Science332, 960-963.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Bell, C.D., Soltis, D.E., and Soltis, P.S. (2010). The age and diversification of the angiosperms re-revisited. Am. J. Bot. 97, 1296-1303.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Birk, Y. (1961). Purification and some properties of a highly active inhibitor of trypsin and a-chymotrypsin from soybeans. Biochim.Biophys. Acta 54, 378-381.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Birk, Y., Gertler, A., and Khalef, S. (1963). A pure trypsin inhibitor from soya beans. Biochem. J. 87, 281.Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Bowman, D.E. (1946). Differentiation of soy bean antitryptic factors. Proc. Soc. Exp. Biol. Med. 63, 547-550.Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Brauer, A.B.E., and Leatherbarrow, R.J. (2003). The conserved P1' Ser of Bowman-Birk-type proteinase inhibitors is not essentialfor the integrity of the reactive site loop. Biochem. Biophys. Res. Commun. 308, 300-305.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Brauer, A.B.E., Domingo, G.J., Cooke, R.M., Matthews, S.J., and Leatherbarrow, R.J. (2002). A conserved cis peptide bond isnecessary for the activity of Bowman-Birk inhibitor protein. Biochemistry 41, 10608-10615.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Clemente, A., Arques, M.C., Dalmais, M., Le Signor, C., Chinoy, C., Olias, R., Rayner, T., Isaac, P.G., Lawson, D.M., Bendahmane, A.,and Domoney, C. (2015). Eliminating anti-nutritional plant food proteins: the case of seed protease inhibitors in pea. PLoS One 10,e0134634.

Pubmed: Author and Title

Page 35: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

CrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

D'Hont, A., Denoeud, F., Aury, J.M., Baurens, F.C., Carreel, F., Garsmeur, O., Noel, B., Bocs, S., Droc, G., Rouard, M., Da Silva, C.,Jabbari, K., Cardi, C., Poulain, J., Souquet, M., Labadie, K., Jourda, C., Lengelle, J., Rodier-Goud, M., Alberti, A., Bernard, M.,Correa, M., Ayyampalayam, S., McKain, M.R., Leebens-Mack, J., Burgess, D., Freeling, M., Mbeguie, A.M.D., Chabannes, M., Wicker,T., Panaud, O., Barbosa, J., Hribova, E., Heslop-Harrison, P., Habas, R., Rivallan, R., Francois, P., Poiron, C., Kilian, A., Burthia, D.,Jenny, C., Bakry, F., Brown, S., Guignon, V., Kema, G., Dita, M., Waalwijk, C., Joseph, S., Dievart, A., Jaillon, O., Leclercq, J., Argout,X., Lyons, E., Almeida, A., Jeridi, M., Dolezel, J., Roux, N., Risterucci, A.M., Weissenbach, J., Ruiz, M., Glaszmann, J.C., Quetier, F.,Yahiaoui, N., and Wincker, P. (2012). The banana (Musa acuminata) genome and the evolution of monocotyledonous plants. Nature488, 213-217.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Domingo, G.J., Leatherbarrow, R.J., Freeman, N., Patel, S., and Weir, M. (1995). Synthesis of a mixture of cyclic peptides based onthe Bowman-Birk reactive site loop to screen for serine protease inhibitors. Int. J. Pept. Protein Res. 46, 79-87.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Elliott, A.G., Delay, C., Liu, H., Phua, Z., Rosengren, K.J., Benfield, A.H., Panero, J.L., Colgrave, M.L., Jayasena, A.S., Dunse, K.M.,Anderson, M.A., Schilling, E.E., Ortiz-Barrientos, D., Craik, D.J., and Mylne, J.S. (2014). Evolutionary origins of a bioactive peptideburied within preproalbumin. Plant Cell 26, 981-995.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Engelken, J., Brinkmann, H., and Adamska, I. (2010). Taxonomic distribution and origins of the extended LHC (light-harvestingcomplex) antenna protein superfamily. BMC Evol. Biol. 10, 233.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Gariani, T., McBride, J.D., and Leatherbarrow, R.J. (1999). The role of the P2' position of Bowman-Birk proteinase inhibitor in theinhibition of trypsin. Studies on P2' variation in cyclic peptides encompassing the reactive site loop. Biochim. Biophys. Acta 1431,232-237.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Habib, H., and Fazili, K.M. (2007). Plant protease inhibitors: a defense strategy in plants. Biotechnol. Mol. Biol. Rev. 2, 68-85.Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Hall, T.A. (1999). BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. NucleicAcids Symp. S. 41, 95-98.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Hazkani-Covo, E., Altman, N., Horowitz, M., and Graur, D. (2002). The Evolutionary History of Prosaposin: Two Successive Tandem-Duplication Events Gave Rise to the Four Saposin Domains in Vertebrates. J. Mol. Evol. 54, 30-34.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Hirsh, A.E., and Fraser, H.B. (2001). Protein dispensability and rate of evolution. Nature 411, 1046-1049.Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Jayasena, A.S., Secco, D., Bernath-Levin, K., Berkowitz, O., Whelan, J., and Mylne, J.S. (2014). Next generation sequencing and denovo transcriptomics to study gene evolution. Plant Methods 10, 34.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Krylov, D.M., Wolf, Y.I., Rogozin, I.B., and Koonin, E.V. (2003). Gene loss, protein sequence divergence, gene dispensability,expression level, and interactivity are correlated in eukaryotic evolution. Genome Res. 13, 2229-2235.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Li, J., Zhang, C., Xu, X., Wang, J., Yu, H., Lai, R., and Gong, W. (2007). Trypsin inhibitory loop is an excellent lead structure to designserine protease inhibitors and antimicrobial peptides. FASEB J. 21, 2466-2473.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Page 36: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

Luckett, S., Garcia, R.S., Barker, J.J., Konarev, A.V., Shewry, P.R., Clarke, A.R., and Brady, R.L. (1999). High-resolution structure ofa potent, cyclic proteinase inhibitor from sunflower seeds. J. Mol. Biol. 290, 525-533.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

McBride, J.D., Brauer, A.B.E., Nievo, M., and Leatherbarrow, R.J. (1998). The role of threonine in the P2 position of Bowman-Birkproteinase inhibitors: studies on P2 variation in cyclic peptides encompassing the reactive site loop. J. Mol. Biol. 282, 447-457.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

McBride, J.D., Watson, E.M., Brauer, A.B., Jaulent, A.M., and Leatherbarrow, R.J. (2002). Peptide mimics of the Bowman-Birkinhibitor reactive site loop. Biopolymers 66, 79-92.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Mello, M.O., Tanaka, A.S., and Silva-Filho, M.C. (2003). Molecular evolution of Bowman-Birk type proteinase inhibitors in floweringplants. Mol. Phylogenet. Evol. 27, 103-112.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Mylne, J.S., Colgrave, M.L., Daly, N.L., Chanson, A.H., Elliott, A.G., McCallum, E.J., Jones, A., and Craik, D.J. (2011). Albumins andtheir processing machinery are hijacked for cyclic peptides in sunflower. Nat. Chem. Biol. 7, 257-259.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Mylne, J.S., Chan, L.Y., Chanson, A.H., Daly, N.L., Schaefer, H., Bailey, T.L., Nguyencong, P., Cascales, L., and Craik, D.J. (2012).Cyclic peptides arising by evolutionary parallelism via asparaginyl-endopeptidase-mediated biosynthesis. Plant Cell 24, 2765-2778.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Nishino, N., Aoyagi, H., Kato, T., and Izumiya, N. (1977). Studies on the synthesis of proteinase inhibitors. I. Synthesis and activity ofnonapeptide fragments of soybean Bowman-Birk inhibitor. J. Biochem. 82, 901-909.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Odani, S., and Ikenaka, T. (1973). Studies on soybean trypsin inhibitors VIII. Disulfide bridges in soybean Bowman-Birk proteinaseinhibitor. J. Biochem. 74, 697-715.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Park, E.Y., Kim, J.A., Kim, H.W., Kim, Y.S., and Song, H.K. (2004). Crystal structure of the Bowman-Birk inhibitor from barley seedsin ternary complex with porcine trypsin. J. Mol. Biol. 343, 173-186.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Petersen, T.N., Brunak, S., von Heijne, G., and Nielsen, H. (2011). SignalP 4.0: discriminating signal peptides from transmembraneregions. Nat. Meth. 8, 785-786.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Prakash, B., Selvaraj, S., Murthy, M.R., Sreerama, Y.N., Rao, D.R., and Gowda, L.R. (1996). Analysis of the amino acid sequences ofplant Bowman-Birk inhibitors. J. Mol. Evol. 42, 560-569.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Prasad, E.R., Merzendorfer, H., Madhurarekha, C., Dutta-Gupta, A., and Padmasree, K. (2010). Bowman-Birk Proteinase inhibitorfrom Cajanus cajan seeds: purification, characterization, and insecticidal properties. J. Agric. Food Chem. 58, 2838-2847.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Qi, R.F., Song, Z.W., and Chi, C.W. (2005). Structural features and molecular evolution of Bowman-Birk protease inhibitors and theirpotential application. Acta Biochim. Biophys. Sin. 37, 283-292.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Rawlings, N.D., Barrett, A.J., and Finn, R. (2016). Twenty years of the MEROPS database of proteolytic enzymes, their substratesand inhibitors. Nucleic Acids Res. 44, D343-D350.

Page 37: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Roberts, T.H., and Hejgaard, J. (2008). Serpins in plants and green algae. Funct. Integr. Genomics 8, 1-27.Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Schechter, I., and Berger, A. (1967). On the size of the active site in proteases. I. Papain. Biochem. Biophys. Res. Commun. 27, 157-162.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Soltis, D.E., Albert, V.A., Leebens-Mack, J., Palmer, J.D., Wing, R.A., dePamphilis, C.W., Ma, H., Carlson, J.E., Altman, N., Kim, S.,Wall, P.K., Zuccolo, A., and Soltis, P.S. (2008). The Amborella genome: an evolutionary reference for plant biology. Genome Biol 9,402.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Song, H.K., Kim, Y.S., Yang, J.K., Moon, J., Lee, J.Y., and Suh, S.W. (1999). Crystal structure of a 16 kDa double-headed Bowman-Birk trypsin inhibitor from barley seeds at 1.9 Å resolution. J. Mol. Biol. 293, 1133-1144.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Tan, G., Muffato, M., Ledergerber, C., Herrero, J., Goldman, N., Gil, M., and Dessimoz, C. (2015). Current Methods for AutomatedFiltering of Multiple Sequence Alignments Frequently Worsen Single-Gene Phylogenetic Inference. Syst. Biol. 64, 778-791.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Terada, S., Sato, K., Kato, T., and Izumiya, N. (1978). Inhibitory properties of nonapeptide loop structures related to reactive sitesof soybean Bowman-Birk inhibitor. FEBS Lett. 90, 89-92.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Tran, L.T., Taylor, J.S., and Constabel, C.P. (2012). The polyphenol oxidase gene family in land plants: Lineage-specific duplicationand expansion. BMC Genomics 13, 395.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Zheng, L., Baumann, U., and Reymond, J.L. (2004). An efficient one-step site-directed and site-saturation mutagenesis protocol.Nucl. Acids Res. 32, e115.

Pubmed: Author and TitleCrossRef: Author and TitleGoogle Scholar: Author Only Title Only Author and Title

Page 38: New Evidence for Ancient Origins of Bowman-Birk Inhibitors from … · 2017. 3. 14. · 1 RESEARCH ARTICLE Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella

DOI 10.1105/tpc.16.00831; originally published online March 14, 2017;Plant Cell

James Whelan, Charles Bond and Joshua S MylneAmy Midori James, Achala S Jayasena, Jingjing Zhang, Oliver Berkowitz, David Secco, Gavin J Knott,

Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella moellendorffii

 This information is current as of April 14, 2021

 

Supplemental Data /content/suppl/2017/04/17/tpc.16.00831.DC2.html /content/suppl/2017/03/14/tpc.16.00831.DC1.html

Permissions https://www.copyright.com/ccc/openurl.do?sid=pd_hw1532298X&issn=1532298X&WT.mc_id=pd_hw1532298X

eTOCs http://www.plantcell.org/cgi/alerts/ctmain

Sign up for eTOCs at:

CiteTrack Alerts http://www.plantcell.org/cgi/alerts/ctmain

Sign up for CiteTrack Alerts at:

Subscription Information http://www.aspb.org/publications/subscriptions.cfm

is available at:Plant Physiology and The Plant CellSubscription Information for

ADVANCING THE SCIENCE OF PLANT BIOLOGY © American Society of Plant Biologists