Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to...

21
Submitted 25 September 2017 Accepted 20 February 2018 Published 6 March 2018 Corresponding author Chonglai Bao, [email protected] Academic editor Gerard Lazo Additional Information and Declarations can be found on page 16 DOI 10.7717/peerj.4481 Copyright 2018 Wei et al. Distributed under Creative Commons CC-BY 4.0 OPEN ACCESS Genome-wide identification and characterization of Dof transcription factors in eggplant (Solanum melongena L.) Qingzhen Wei, Wuhong Wang, Tianhua Hu, Haijiao Hu, Weihai Mao, Qinmei Zhu and Chonglai Bao Institute of Vegetable Research, Zhejiang Academy of Agricultrual Sciences, Hangzhou, Zhejiang, China ABSTRACT Eggplant (Solanum melongena L.) is an important vegetable cultivated in Asia, Africa and southern Europe and, following tomato and pepper, ranks as the third most important solanaceous vegetable crop. The Dof (DNA-binding with one finger) family is a group of plant-specific transcription factors that play important roles in plant growth, development, and response to biotic and abiotic stresses. The genes in the Dof family have been identified and analysed in many plant species, but the information remains lacking for eggplant. In the present study, we identified 29 SmeDof members from the eggplant genome database, which were classifed into nine subgroups. The phylogeny, gene structure, conserved motifs and homologous genes of SmeDof genes were comprehensively investigated. Subsequently, we analysed the expression patterns of SmeDof genes in six different eggplant subspecies. The results provide novel insights into the family of SmeDof genes and will promote the understanding of the structure and function of Dof genes in eggplant, and the role of Dof expression during stress. Subjects Agricultural Science, Bioinformatics, Genomics, Plant Science Keywords Dof genes, Solanum melongena, Transcription factor, Phylogeny INTRODUCTION Eggplant (Solanum melongena) is an economically important vegetable crop in the large family Solanaceae, which contains other widespread crops including tobacco, potato, tomato and pepper. Endemic to the Old World, eggplant was domesticated in the region of India and southeast China and is now cultivated worldwide (Doganlar et al., 2002). However, eggplant is susceptible to many abiotic stresses and diseases such as Fusarium wilt (caused by Fusarium oxysporum) and Verticillium wilt (caused by Verticillium dahlia), thus pose a substantial threat to production and food security. The availability of genome sequences and development of bioinformatics approaches provide a good opportunity to characterize a family of stress resistance-related genes from a whole-genome perspective. The Dof (DNA-binding with one finger) proteins are a set of plant-specific transcription factors (TFs) that typically have 200–400 amino acids and two major domains: a C-terminal domain for transcriptional activation and a highly conserved N-terminal DNA-binding domain (also referred to as the Dof domain).The Dof domain is composed of 52 amino How to cite this article Wei et al. (2018), Genome-wide identification and characterization of Dof transcription factors in eggplant (Solanum melongena L.). PeerJ 6:e4481; DOI 10.7717/peerj.4481

Transcript of Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to...

Page 1: Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif window length from

Submitted 25 September 2017Accepted 20 February 2018Published 6 March 2018

Corresponding authorChonglai Bao, [email protected]

Academic editorGerard Lazo

Additional Information andDeclarations can be found onpage 16

DOI 10.7717/peerj.4481

Copyright2018 Wei et al.

Distributed underCreative Commons CC-BY 4.0

OPEN ACCESS

Genome-wide identification andcharacterization of Dof transcriptionfactors in eggplant (Solanummelongena L.)Qingzhen Wei, Wuhong Wang, Tianhua Hu, Haijiao Hu, Weihai Mao,Qinmei Zhu and Chonglai BaoInstitute of Vegetable Research, Zhejiang Academy of Agricultrual Sciences, Hangzhou, Zhejiang, China

ABSTRACTEggplant (Solanum melongena L.) is an important vegetable cultivated in Asia, Africaand southern Europe and, following tomato and pepper, ranks as the third mostimportant solanaceous vegetable crop. TheDof (DNA-binding with one finger) familyis a group of plant-specific transcription factors that play important roles in plantgrowth, development, and response to biotic and abiotic stresses. The genes in the Doffamily have been identified and analysed in many plant species, but the informationremains lacking for eggplant. In the present study, we identified 29 SmeDof membersfrom the eggplant genome database, which were classifed into nine subgroups. Thephylogeny, gene structure, conserved motifs and homologous genes of SmeDof geneswere comprehensively investigated. Subsequently, we analysed the expression patternsof SmeDof genes in six different eggplant subspecies. The results provide novel insightsinto the family of SmeDof genes and will promote the understanding of the structureand function of Dof genes in eggplant, and the role of Dof expression during stress.

Subjects Agricultural Science, Bioinformatics, Genomics, Plant ScienceKeywords Dof genes, Solanum melongena, Transcription factor, Phylogeny

INTRODUCTIONEggplant (Solanum melongena) is an economically important vegetable crop in the largefamily Solanaceae, which contains other widespread crops including tobacco, potato,tomato and pepper. Endemic to the Old World, eggplant was domesticated in the regionof India and southeast China and is now cultivated worldwide (Doganlar et al., 2002).However, eggplant is susceptible to many abiotic stresses and diseases such as Fusariumwilt (caused by Fusarium oxysporum) and Verticillium wilt (caused by Verticillium dahlia),thus pose a substantial threat to production and food security. The availability of genomesequences and development of bioinformatics approaches provide a good opportunity tocharacterize a family of stress resistance-related genes from a whole-genome perspective.

The Dof (DNA-binding with one finger) proteins are a set of plant-specific transcriptionfactors (TFs) that typically have 200–400 amino acids and twomajor domains: a C-terminaldomain for transcriptional activation and a highly conserved N-terminal DNA-bindingdomain (also referred to as the Dof domain).The Dof domain is composed of 52 amino

How to cite this article Wei et al. (2018), Genome-wide identification and characterization of Dof transcription factors in eggplant(Solanum melongena L.). PeerJ 6:e4481; DOI 10.7717/peerj.4481

Page 2: Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif window length from

acid residues and features a C2C2-type zinc finger motif, some of which could recognizethe specific regulatory elements of AAAG or CTTT in the promoters of target genes(Plesch, Ehrhardt & Mueller-Roeber, 2001; Wray et al., 2003; Yanagisawa, 2004; Chen etal., 2014; Mehrotra et al., 2014), although the pumpkin Dof protein AOBP recognizes anAGTA repeat (Shimofurutani et al., 1998). Similar to other zinc fingers, the Dof domainplays a bi-functional role in DNA-binding and protein-protein interactions (Yanagisawa,2002). The Dof family may have evolved from a common ancestor: a single gene in thegreen unicellular alga Chlamydomonas reinhardtii, with the family then expanding intomosses, ferns and vascular plants by consecutive gene duplications (Moreno-Risueno et al.,2007). Yanagisawa (2002) first identified 37 putative Dof genes in Arabidopsis, which wereclassified into eight groups. Lijavetzky, Carbonero & Vicente-Carbajosa (2003) classifiedDof genes into classes A, B1, B2, C1, C2.1, C2.2, C3, D1 and D2, which is the classificationsystem adopted in the analysis ofDof genes of Chinese cabbage (Ma et al., 2015) and carrot(Huang et al., 2016) or modified such as in tomato (Cai et al., 2013).

Dof transcription factors are associated with a variety of biological processes in plantgrowth and development, including the responses to light and those in defence (Park et al.,2003; Shaw et al., 2009; Zhang et al., 1995), germination and development of seeds (Papiet al., 2002; Dong et al., 2007; Gabriele et al., 2010; Zou et al., 2013), control of flowering(Imaizumi et al., 2005; Fornara et al., 2009), branching of shoots, and development ofpollen, fruit and vascular tissues (Konishi & Yanagisawa, 2007; Chen et al., 2012; Da Silvaet al., 2016; Wu et al., 2016). Dof TFs also participate in carbon metabolism (Yanagisawa,2000) and various physiological processes such as nitrogen assimilation, phytochrome andcytochrome signalling, plant hormonal signalling, and biotic and abiotic stress tolerance(Park et al., 2003; Ward et al., 2005; Kurai et al., 2011; Cai et al., 2013; Corrales et al., 2014;Wen et al., 2016). For example, the newly found Dof protein MaDof23 in banana functionsas a transcriptional repressor and interacts with the transcriptional activatorMaERF29. Thetwo proteins are suggested to control banana fruit ripening by working antagonistically toregulate 10 ripening-related genes involved in cell wall degradation and aroma formation(Feng et al., 2016). In Arabidopsis, Dof6 acts as an opponent of TCP14, which is a positiveregulator of seed germination, to regulate ABA biosynthesis pathway-related genes, andtherefore, seed germination is negatively affected (Rueda-Romero et al., 2012).

Dof TFs have been functionally characterized in many plant species and crops such asArabidopsis, tomato, rice, cucumber and soybean (Papi et al., 2000; Gualberti et al., 2002;Washio, 2003; Cai et al., 2013; Guo & Qiu, 2013; Wen et al., 2016). However, informationon Dof genes continues to be lacking for eggplant. The aim of the present study was toconduct a genome-wide analysis of the Dof family of genes in eggplant. We identified 29SmeDof members in the eggplant genome and classified these proteins into nine subgroups.The physical and chemical characteristics, gene structure, motifs and evolutionary patternsof the SmeDof family of genes were investigated. We also analysed the expression patternsof SmeDof members in different types of eggplant. Based on this study, our understandingof the structure and function of the SmeDof family of genes in eggplants has increased, andthe results will contribute to futher understanding the role of SmeDof genes in tolerancemechanism and the interaction networks of these genes.

Wei et al. (2018), PeerJ, DOI 10.7717/peerj.4481 2/21

Page 3: Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif window length from

MATERIALS AND METHODSIdentification and characterization of the Dof transcription factorsThe genome, genes and corresponding protein sequences of eggplant were downloadedfrom the Eggplant Genome Database (http://eggplant.kazusa.or.jp). The latest Markovmodel for the Dof transcription factors named PF02701.13 was downloaded from the Pfamdatabase (http://pfam.xfam.org/) (Eddy, 2011; Finn et al., 2014). The HMMER programwas used to search for Dof proteins in all eggplant proteins with a cutoff E-value of 1e−4

using PF02701.13 as a query. After a comprehensive check, the candidate proteins that onlycontained fragmental Dof domains were eliminated.

The Dof transcription factor search was also conducted for five other representativespecies in the plant kingdom and for the family Solanaceae. The annotated proteins of analga (C. reinhardtii), moss ( Physcomitrella patens), lycophyte (Selaginella moellendorffii),Arabidopsis thaliana, grape (Vitis vinifera), poplar (Populus trichocarpa) and rice (Oryzasativa) were downloaded from the Pfam database (v10) (Goodstein et al., 2012). Thegenomic information of the gymnospermNorway spruce (Picea abies) (Nystedt et al., 2013)was collected from the CongenieWebsite (http://congenie.org/). The annotated proteins ofpetunia (Petunia axillaris), tomato (Solanum lycopersicum), potato (Solanum tuberosum),pepper (Capsicum annuum), and tobacco (Nicotiana tabacum) were downloaded from theSGN Database FTP Server (ftp://ftp.solgenomics.net/genomes/).

Phylogenetic analysis of the Dof genesThe Dof proteins of C. reinhardtii, P. patens, A. thaliana, O. sativa and eggplant wereselected for phylogenetic analysis in planta. The ClustalX2 program was used to align Dofprotein sequences with the Gonnet protein weight matrix (Larkin et al., 2007). The MEGAprogram (v6.06) was used to construct a neighbour-joining phylogenetic tree using theJones-Taylor-Thornton (JTT) model with 500 bootstrap replicates (Tamura et al., 2013).The uniform rates and homogeneous lineages were adopted, whereas the partial deletionwith a site coverage cutoff of 70% was used for gaps/missing data treatment. To clearlydistinguish the genes in this phylogenetic tree, the term ‘Nta|’ was added as a prefix toindicate the genes were from tobacco. The MEGA program was also used to constructa neighbour-joining phylogenetic tree of Dof proteins in plants of Solanaceae, followingprevious methods. The frequency of each divergent branch was displayed when the valuewas higher than 50%. Adobe Illustrator software was used to clearly show the Dof branchesafter classification of all proteins based on the known background information.

Gene structure and motif analysesThe Gene Structure Display Server tool (http://gsds.cbi.pku.edu.cn/, v2.0) (Hu et al., 2015)was used to analyse gene structure. MEME software (http://meme-suite.org/meme_4.11.0/,v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif windowlength from 8 to 100 bp (Bailey et al., 2009). The default number of motifs to be foundwas set to 15, resulting in identification of 14 motifs with an E-value smaller than the 0.05significance level. The motifs were displayed following a pattern with the most statisticallysignificant (lowest E-value) motifs displayed first.

Wei et al. (2018), PeerJ, DOI 10.7717/peerj.4481 3/21

Page 4: Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif window length from

Identification of orthologous and paralogous genesOrthoMCL (v2.0.3) (Li, Stoeckert Jr & Roos, 2003) was used to search for orthologous,co-orthologous and paralogous genes in eggplant, tomato, Arabidopsis, and rice usingthe entire Dof protein sequence data set. The E-value cutoff of 1.0e−5 was used for theall-against-all BLASTP alignment process, and a match cutoff value of 50 was adopted. Theorthologous and paralogous relationships were gathered and displayed using Cytoscapesoftware (http://www.cytoscape.org, v2.8.3) (Smoot et al., 2011).

Analysis of Dof gene expression in different eggplantsThe genome-wide digital gene expression of SmeDof genes in eggplant varieties and closespecies was assessed as described by Tan et al. (2015). We used RNA-seq transcriptiomeswith the related FPKM (fragments per kilobase per million measure) values to studySmeDof gene expression patterns in six eggplants. Illumina sequencing reads fromRNA-seq data of six types of eggplants with different morphological traits were used(raw data is avalible at NCBI, SRP127743), including two cultivated eggplants Solanummelongena L. (MEL_HZHQ and MEL_LYQ), three closest wild relatives S. Aethiopicum, S.sisymbriifolium and S. integrifolium (AET_SG, SIS_2007 and INT_2006) and a semi-wildrelative S. melongena (MEL_S58). Seeds (10 per species) from six types of eggplant weregerminated. The leaf tissue was collected from the seedlings with three true leaves when14 days post germination. Total RNA of each sample was extracted using an RNAprepPure Plant Kit (DP432, TIANGEN, China. http://www.tiangene.com) according to themanufacturer’s protocol. Illumina sequencing was performed at Novogene BioinformaticsTechnology Co., Ltd., Beijing, China (http://www.novogene.cn). The library preparationswere sequenced on an Illumina Hiseq 2000 platform and 90 bp paired-end reads weregenerated. Approximately 28 million paired reads were generated with 5.6 G data foreach sample. The program Tophat (https://ccb.jhu.edu/software/tophat/) was used tomap the reads to the eggplant genome (Hirakawa et al., 2014); then the expressionprofile of all spinach genes was obtained with FPKM (Fragments Per Kilobase of exonper million fragments Mapped) value using software Cufflinks (http://cole-trapnell-lab.github.io/cufflinks, v2.2.1) under the guidance of annotated gene models with a GFFfile. The SmeDof gene expression profile from each sample was analysed using the HemIprogram (http://hemi.biocuckoo.org/) with the average hierarchical clustering method.

RESULTS AND DISCUSSIONGenome-wide identification of Dof genes in eggplantGene family analysis is an efficient approach to understand the structure, function, andevolution of genes. TheDof genes are plant-specifc transcription factors that are ubiquitousin plant species and participate in various biological processes. To identify SmeDofhomologues in eggplant, we used the HHM profile of the Dof domain (PF02701.13)as a query to perform an HMMER search against the Eggplant Genome Database(http://eggplant.kazusa.or.jp). A total of 85,446 genes were predicted in eggplant genome(versione SME_r2.5.1), including 42,035 predicted genes (38,498 intrinsic genes and 3,537partial genes), 41,048 transposable elements and 2,363 were pseudo or short genes or both.

Wei et al. (2018), PeerJ, DOI 10.7717/peerj.4481 4/21

Page 5: Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif window length from

A total of 29 genes encoding transcription factors of the SmeDof family were identified. Thenumber is similar to that in rice (30OSDof genes;Washio, 2003) but lower than the numberof Dof genes in Arabidopsis (36 At Dof genes; Lijavetzky, Carbonero & Vicente-Carbajosa,2003), soybean (78 GmDof genes; Guo & Qiu, 2013), Chinese cabbage (76 BraDof genes;Ma et al., 2015), potato (35 StDof genes; Venkatesh & Park, 2015), cucumber (36 CsDofgenes; Wen et al., 2016), and tomato (34 SlDof genes; Cai et al., 2013). For convenience,the 29 SmeDof genes were assigned names from SmeDof-01 to SmeDof-29.

The 29 SmeDof genes in eggplant were classified into four groups and nine subgroupsbased on predicted Dof domains: A, B1, B2, C1, C2.1, C2.2, C3, D1 and D2 (Fig. 1, Table 1).In a previous study on Dof transcription factors from green unicellular algae to vascularplants, Dof genes are classified into groups A, B, C, D, E, F and G (Moreno-Risueno etal., 2007), whereas group G is absent from Dof gene analysis between sorghum and rice(Kushwaha et al., 2011). The classification of the 29 SmeDof genes was consistent with thatof Arabidopsis, although the number of members in each subgroup might vary. However,in cucumber, the CsDof genes were classified into eight subgroups, with no genes found insubgroup C3 (Wen et al., 2016). The genome-wide analysis of the Dof transcription factorsclassified the soybean Dof genes into nine different groups: A, B1, B2, C1, C2, C3, D1, D2,and D3 (Guo & Qiu, 2013).

The SmeDof genes had molecular weights ranging from 18.6 to 64.4 kD with pIvalues varying from 4.55 to 9.48. Two eggplant genes Sme2.5_00232.1_g00011 andSme2.5_01519.1_g00001 were identified as candidates with the E-value cutoff of 2.2e–09and 9.7e-06, respectively. However, because 26 and 21 amino acid residues, respectively,aligned to the Dof domain were too short to act as a functional zinc-finger Dof domain,these two genes were excluded. Detail information regarding the SmeDof genes, includingname, coding protein, CDS length, molecular weight and PI value, is shown in Table 1.

Phylogenetic relationship of the DOF proteins in major plant speciesThe formation of the plant-specific Dof transcription factors is associated with theevolutionary process of plants (Taylor & Raes, 2004; Shigyo et al., 2007). To provide insightinto the evolution of the Dof family of genes, we performed phylogenetic analysis acrossrepresentative plant species, including an alga, moss, lycophyte, gymnosperm, eudicot andmonocot. Detail information of the gene family in different plant species is shown in TableS1. However, the lycophyte Dof proteins were removed from the final phylogenetic treebecause of the lack of complete and functional domains. A phylogenetic tree is usuallyconstructed by a neighbour-joining or maximum-likelihood method using sequences ofwhole-length proteins, domains or CDS regions. In this study, the neighbour-joiningmethod with 1,000 bootstraps using the Dof domain sequence was performed to constructthe phylogenetic tree (Fig. 1), which was generally consistent with previous reports (Shigyoet al., 2007;Moreno-Risueno et al., 2007; Ma et al., 2015).

As demonstrated previously, the number of Dof transcription factors varies among landplants (Yang, Tuskan & Cheng, 2006), and the results of this study confirm that conclusion.As shown in Fig. 1, the green algae C. reinhardtii only had one Dof gene in subgroup D1,indicating thatDof members in other groups might originate from subgroup D1members;

Wei et al. (2018), PeerJ, DOI 10.7717/peerj.4481 5/21

Page 6: Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif window length from

D2

D1

B2

C2.1C2.2

A

C1

B2

B1

C3

Pp3

c5 1

4440

Pp3

c6 1

3360

Pp3

c6 1

3355

Pp3

c15

3180

Pp3c

9 49

10Pp

3c16

646

0Pp

3c12

223

90Pp

3c4

850

Pp3c

17 2

3300

Pp3c

14 2

4190

Pp3c

17 3

860

LOC

Os1

2g39

990

LOC

Os0

3g42

200

LOC

Os08

g382

20

LOC

Os09g

2996

0

AT1G

4765

5

SmeD

of-12

LOC O

s04g

5819

0

AT5G66

940-D

2

AT3G50

410-D

2

SmeDof-

28

SmeDof-11

MA 130243g0010

Cre12.g521150

LOC Os01g48290

MA 944867g0010

Pp3c13 21190

Pp3c3 26370

MA 175298g0010

Pp3c26 1490

Pp3c17 7930

MA 172793g0010

MA 133293g0010

Pp3c3 950

Pp3c13 2000

MA 10436933g0030

MA 5642475g0010

MA 10435735g0010MA 10436933g0020AT1G69570-D1SmeDof-23MA 97259g0020LOC Os01g55340AT2G34140-D1AT1G29160-D1SmeDof-22LOC Os10g26620

SmeDof-26LOC Os01g17000

SmeDof-29AT1G26790-D1

LOC Os07g48570

LOC Os03g07360

LOC Os01g15900

SmeDof-21

AT5G62430-D1

AT3G47500-D1

SmeDof-18

AT5G39660-D1

LOC Os01g09720

AT4G38000-B2

AT2G46590-C2.1

AT3G61850-C2.1

AT4G24060-C2.1

LOC O

s04g47990

LOC O

s02g45200

SmeDof-02

SmeD

of-15

SmeD

of-19

SmeD

of-01

LOC

Os10g35300

AT4G00940-C

2.1

AT1G64620-C

2.1

Pp3c22 5870

Pp3c19 19050

LOC

Os02g15350

AT1G

21340-C2.2

AT3G

52440-C2.2

Sm

eDof-25

LOC

Os05g36900

LOC

Os01g64590AT

2G28

510-

C1

MA

1669

2g00

10

MA

5247

4g00

10

MA

1761

4g00

10

AT3G

4561

0-C

1

AT5G

6020

0-C

1

AT5G

6294

0-C

1

SmeD

of-1

3

SmeD

of-0

5

LOC

Os02

g478

10

SmeD

of-0

8

AT5G

6085

0-A

MA 12

3810

g001

0

LOC O

s06g

1741

0

LOC O

s02g

4944

0

AT3G21

270-A

SmeDof-

09

SmeDof-14

AT1G51700-A

AT1G28310-B2SmeDof-03

SmeDof-04

LOC Os07g32510

MA 18267g0010

MA 459791g0010MA 74014g0010

LOC Os03g38870MA 56828g0010AT5G65590-B2SmeDof-07MA 83624g0010LOC Os03g55610SmeDof-16

MA 46520g0010MA 939289g0010

AT3G55370-B1LOC Os03g16850

SmeDof-20

AT2G37590-B1

SmeDof-27

LOC Os03g60630

AT2G28810-B1

AT1G07640-B1

AT5G02460-B1

LOC Os07g13260

LOC Os12g38200

LOC Os05g02150

MA 7537g0010

SmeDof-06

SmeDof-10

SmeDof-24SmeDof-17

MA 248613g0010

AT4G21050-C3

AT4G21040-C3

AT4G21080-C3

AT4G21030-C3

Pp3c10 25160

Pp3c16 6810

Pp3c17 18350

Pp3c17 18340Pp3c2 26790

Pp3c2 26900Pp3c14 21520Pp3c1 1300

Figure 1 The phylogenetic tree of theDof. genes from six plant species. Individual species are dis-tinguished by different shapes with different colours. The prefixes of Cre, Pp3c, MA, LOC_Os, AT andSmeDof indicate these genes were found in C. reinhardtii, P. patens, Norway spruce, rice, Arabidopsis andeggplant, respectively.

Full-size DOI: 10.7717/peerj.4481/fig-1

whereas the number of Dof transcription factors in the moss P. patens enlarged into 27genes. In P. patens, 11, 8, 6 and 2 Dof genes were classified into subgroups C3, D1, D2and C2.2, respectively; thus, these four clades were likely relatively original compared withother clades. In some species, the Dof genes are enriched in certain branches. MultipleDof genes of P. patens in subgroups C3 and D1 were aggregated into small branchesthat split with Dof genes in the other species, which possibly resulted from Dof geneduplication in P. patens. A total of 23 Dof genes were found in Picea abies distributed in allsubgroups. According to the phylogenetic tree, the 29 eggplant Dof genes were distributedin all 9 branches, with 3, 2, 4, 2, 4, 2, 1, 6 and 3 in subgroups A, B1, B2, C1, C2.1, C2.2,

Wei et al. (2018), PeerJ, DOI 10.7717/peerj.4481 6/21

Page 7: Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif window length from

Table 1 Classification and characterization of eggplantDof genes.

SmeDof name Protein name Group CDSlength

Wholegenelength

Molecularweight

pI Instabilityindex

Instabilitystatus

Aliphatic GRAVY

SmeDof-01 Sme2.5_02793.1_g00001.1 C2.1 4,037 302 33,233.76 8.46 47.87 unstable 60 −0.766SmeDof-02 Sme2.5_00047.1_g00017.1 C2.1 2,071 281 30,200.42 9.03 33.68 stable 59.68 −0.622SmeDof-03 Sme2.5_00720.1_g00002.1 B2 3,408 391 42,895.37 6.98 52.72 unstable 62.58 −0.672SmeDof-04 Sme2.5_02054.1_g00006.1 B2 1,242 264 29,975.35 8.16 41.07 unstable 64.92 −0.746SmeDof-05 Sme2.5_00536.1_g00017.1 C1 1,576 288 31,920.38 6.3 41 unstable 52.08 −0.666SmeDof-06 Sme2.5_00019.1_g00022.1 C3 18,793 589 64,449.89 6.14 52.01 unstable 82.61 −0.337SmeDof-07 Sme2.5_02584.1_g00003.1 B2 3,886 360 39,038.66 7.09 48.45 unstable 52.31 −0.736SmeDof-08 Sme2.5_00326.1_g00010.1 A 7,425 386 41,284.63 5.94 44.67 unstable 58.16 −0.581SmeDof-09 Sme2.5_01232.1_g00017.1 A 1,629 182 19,981.01 8.14 56.25 unstable 53.13 −0.815SmeDof-10 Sme2.5_03054.1_g00002.1 C3 2,634 293 32,721.61 7.48 48.1 unstable 59.25 −0.661SmeDof-11 Sme2.5_01191.1_g00008.1 D2 6,024 243 25,553.48 8.72 46.98 unstable 57.08 −0.423SmeDof-12 Sme2.5_18208.1_g00003.1 D2 1,061 232 24,021.45 6.58 49.43 unstable 51.29 −0.582SmeDof-13 Sme2.5_00151.1_g00002.1 C1 2,298 297 32,940.49 6.75 39.71 stable 57.1 −0.718SmeDof-14 Sme2.5_01155.1_g00012.1 A 1,301 217 23,346.49 9.17 53.57 unstable 44.06 −0.872SmeDof-15 Sme2.5_03699.1_g00001.1 C2.1 2,569 267 29,110.24 9.21 48.03 unstable 56.97 −0.676SmeDof-16 Sme2.5_01191.1_g00002.1 B2 1,185 290 31,399.01 9.03 50.92 unstable 65 −0.563SmeDof-17 Sme2.5_04366.1_g00001.1 B1 9,807 261 28,334.34 9.4 35.24 stable 53.1 −0.713SmeDof-18 Sme2.5_01247.1_g00005.1 D1 1,913 371 41,024.83 7.59 37.61 stable 66.98 −0.725SmeDof-19 Sme2.5_00013.1_g00005.1 C2.1 1,607 217 24,352.14 6.98 54.26 unstable 63.73 −0.721SmeDof-20 Sme2.5_00146.1_g00017.1 B1 1,302 301 33,581.31 9.48 61.73 unstable 55.35 −0.744SmeDof-21 Sme2.5_00669.1_g00003.1 D1 2,744 504 54,727.03 5.66 52.16 unstable 56.31 −0.787SmeDof-22 Sme2.5_02533.1_g00002.1 D1 812 166 18,642.8 8.87 40.68 unstable 52.89 −0.873SmeDof-23 Sme2.5_03125.1_g00004.1 D1 4,976 445 49,545.73 6.47 48.04 unstable 54.36 −0.859SmeDof-24 Sme2.5_00720.1_g00003.1 C3 1,335 305 33,223.53 9.43 70.52 unstable 44.46 −0.913SmeDof-25 Sme2.5_09080.1_g00001.1 C2.2 1,225 258 29,384.18 4.55 50.9 unstable 53.33 −0.825SmeDof-26 Sme2.5_00161.1_g00001.1 D1 4,032 467 52,088.65 5.9 43.24 unstable 51.58 −0.846SmeDof-27 Sme2.5_02556.1_g00009.1 B1 1,679 370 39,849.03 8.98 63.55 unstable 52.22 −0.652SmeDof-28 Sme2.5_08918.1_g00002.1 D2 920 205 22,056.75 9.23 55.35 unstable 62.34 −0.42SmeDof-29 Sme2.5_00298.1_g00013.1 D1 2,953 525 57,189.32 6.01 56.78 unstable 56.32 −0.835

C3, D1 and D2, respectively (Fig. 1). In algae, only one Dof factor was identified in thegreen alga C. reinhardtii and none in the red alga Cyanidioschyzon merolae (Yanagisawa,2002; Yanagisawa, 2004; Shigyo et al., 2007). By contrast, Glycine max contains the highnumber of 93 members of Dof family genes (Ma et al., 2015). These results revealed thatthe number of Dof genes increases from algae to higher plants, suggesting an expansion ofDof family transcription factors during the evolution of lower plants to terrestrial plants.This conclusion is consistent with previous study on Dof TFs in Chinese cabbage (Ma etal., 2015). Moreover, the origin of the Dof transcription factors predates the divergencebetween the green algae and the ancestors of land plants (Moreno-Risueno et al., 2007;Shigyo et al., 2007).

Wei et al. (2018), PeerJ, DOI 10.7717/peerj.4481 7/21

Page 8: Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif window length from

The Dof genes in rice were found in 9 branches, but were absent in subgroup C3, whichwas similar to cucumber (Wen et al., 2016). The Dof members in subgroup C3 originatedafter moss and were more divergent than members in other groups, although none wereidentified in rice. Three members of eggplant SmeDof genes were classified into subgroupC3; however, the domain sequences were also disparate with members in Arabidopsis.Gene duplication in a single species is a possible result from genome duplication after thedivergence between asterids and eurosides II clades, which were the ancestors ofArabidopsisand eggplant.

Phylogenetic relationships of the Dof proteins among SolanaceaespeciesWe constructed a phylogenetic tree of the Dof genes from six species of Solanaceae basedon the Dof domains using the neighbour-joining method (Fig. 2, Table S1). As shownin Table 2, five of the six species, S. melongena, S. lycopersicum, S. tuberosum, C . annuumand P. axillaris, had similar numbers of Dof genes ranging from 29 to 33. By contrast, theallotetraploid Nicotiana tabacum had 60 Dof TF coding genes, which was likely becauseof genome duplication events increasing gene copies. The Dof proteins in tobacco wereenriched in 3 subgroups: B1, C2.1, andD1, with 13, 16, and 14 genes, respectively. However,the Dof genes of N. tabacum in subgroup C2.2 did not differ much from those of the otherfive species, except for those of petunia, which had 4 genes in this group. The Dof TFsof eggplant, tomato, potato and pepper in subgroups C1 and D1 were the same, with 2and 6, respectively. Except for the allotetraploid tobacco, Dof genes from different speciesvaried slightly from A to D2 subgroups, indicating an even distribution within the familySolanaceae.

The genome size of eggplant (833.1 Mb, 42,035 coding genes; Hirakawa et al., 2014),tomato (950 Mb, 34,727 coding genes; Tomato Genome Consortium, 2012) and potato(844 Mb, 35,119 coding genes; Potato Genome Sequencing Consortium, 2011) did not varysignificantly, which corresponded to their similar number of Dof transcription factorencoding genes, with 29, 33, and 32 members, respectively. Notably, although the genomesize of hot pepper (3.48 Gb, 34,899 coding genes; Kim et al., 2014) was approximatelyfourfold larger than that of its close relatives tomato and eggplant, the total coding genesand Dof genes did not have proportionally more. We propose that because hot pepper is adiploid species which did not experience the recent whole-genome duplication (Kim et al.,2014), and therefore, no significant enlargement ofDof genes occurred. The genome size ofanother solanaceous species N. tabacum (4.41–4.57 Gb, 85,439 coding genes; Sierro et al.,2014) was almost fivefold larger than that of eggplant, but the number of Dof transcriptionfactors (60 members) was fewer than twice that of eggplant (29 members), which might beattributed to gene loss after the allopolyploidization event.

Gene structure and conserved motifs analyses of the eggplantDof genesThe exon-intron organization can be used as supporting evidence to determine theevolutionary relationships among genes or organisms (Koralewski & Krutovsky, 2011;Malviya et al., 2015). Therefore, structural analysis of gene families with phylogenetic

Wei et al. (2018), PeerJ, DOI 10.7717/peerj.4481 8/21

Page 9: Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif window length from

Ntb

|641

62N

tb|6

4161

Ntb

|641

6067

Ntb

|638

97

83

Sm

eDof

-19

PG

SC

0003

DM

P40

0010

277

99Pe

axi1

62Sc

f000

74g0

0347

SmeD

of-0

1So

lyc0

8g08

2910

.1PG

SC00

03D

MP4

0002

1675

CA0

1g00

490

6768

61

Ntb

|893

47N

tb|3

6076

Ntb

|893

48

85

Solyc

03g0

8284

0.2

PGSC

0003

DMP4

0003

2312

60

Peax

i162

Scf0

0740

g003

28

Ntb|

7447

1CA

03g2

0220

Ntb|

5423

6

51

Ntb|

7868

0Pe

axi16

2Scf0

0345

g001

28

Solyc

10g0

0936

0.2

Ntb|46

686

Ntb|46

685

71

Peaxi1

62Scf0

0087

g022

24

SmeDof-

15

SmeDof-

02

CA00g6

4610

Solyc0

1g09

6120

.2

PGSC0003

DMP4000

0026

6

Ntb|17444

Ntb|53153

Ntb|17445

Ntb|53152

87

Ntb|91427

Solyc04g070960.1

PGSC0003DMP400001503

CA00g84150

Peaxi162Scf00263g00072

67SmeDof-25

86

Peaxi162Scf00582g00084

PGSC0003DMP400037029

CA10g08680

CA00g96300

CA10g08690

67697777SmeDof-13

Solyc03g112930.2

PGSC0003DMP400031524

CA03g2692067

Ntb|72147Ntb|72421

67

63

Peaxi162Scf00009g00023

Ntb|44098Ntb|28121CA06g19670SmeDof-05Solyc06g071480.2PGSC0003DMP40004705868

51

54

Solyc06g075370.2Peaxi162Scf00072g00039SmeDof-06CA06g23590

54

Ntb|40583Ntb|10398352

Peaxi162Scf00389g00621

Ntb|123840Ntb|65436

71

CA02g15180Peaxi162Scf00158g00123

PGSC0003DMP400051750

SmeDof-24Solyc02g077960.1

95

SmeDof-10Solyc06g005130.2

76

PGSC0003DMP400050832

88

CA00g45170

Solyc11g072500.1

PGSC0003DMP400044011

77

PGSC0003DMP400052918

Solyc06g076030.2

CA06g24550

60

Peaxi162Scf00679g00310

56

Ntb|112244

Ntb|111861

Ntb|111860

73

94

Ntb|12852

Ntb|12851

75

Ntb|6500

84

SmeDof-17

PGSC0003DMP400000873

Solyc11g066050.1

7374

CA00g57880

PGSC0003DMP400015688

Solyc09g010680.2

6262

Ntb|46183

Ntb|68319

68Peaxi162Scf00165g01722

Peaxi162Scf00015g10132

Peaxi162Scf00330g00006

PGSC

0003DMP400033281

Solyc10g086440.1

51Sm

eDof-27

55N

tb|109554

Ntb|69483

Ntb|109555

CA10g11420

6351

Ntb|36017

Ntb|74424

85S

olyc00g024680.1

Peaxi162S

cf00832g00315

CA05g18640

Sm

eDof-20

PG

SC

0003DM

P400040516

Solyc05g054510.1

Sm

eDof

-08

PG

SC

0003

DM

P40

0007

249

55

CA0

6g18

250

65

Pea

xi16

2Scf

0001

3g00

084

69

Ntb

|132

561

Ntb

|132

560

90

86

Peax

i162

Scf0

1414

g000

12

SmeD

of-0

7

Soly

c11g

0109

40.1

PGSC

0003

DM

P400

0281

50

Peax

i162

Scf0

0085

g000

45

69

Ntb

|786

01

CA0

2g25

910

Solyc

02g0

9031

0.1

SmeD

of-1

6

PGSC

0003

DMP4

0001

7726

6253

96

61

PGSC

0003

DMP4

0003

8896

CA02

g157

50

Solyc

02g0

7862

0.1

64

Peax

i162S

cf005

24g0

0082

66

Peax

i162S

cf005

30g0

0071

PGSC

0003

DMP4

0003

9559

60

Peax

i162S

cf005

30g0

0002

SmeDof-

04

PGSC0003

DMP4000

2270

8

97

Ntb|12

5947

Ntb|98

005

Ntb|66

911

66

Peaxi1

62Scf0

0158

g001

71

SmeDof-03

Solyc02g077950.1

PGSC0003DMP400051791

CA02g15190

69 63

SmeDof-09

Solyc06g062520.154

CA06g16390

91

Peaxi162Scf00207g01131

50

Ntb|97857Ntb|73364

CA00g78480

Peaxi162Scf00038g01316

PGSC0003DMP400004435SmeDof-14

Solyc03g121400.165 64 68 66 79

Solyc02g090220.2

Peaxi162Scf00993g00011

PGSC0003DMP400017852 57SmeDof-11CA02g2598056

67Solyc02g065290.1SmeDof-28CA02g01550 5696

SmeDof-12Peaxi162Scf01514g0000566

CA04g19530Solyc04g079570.1

PGSC0003DMP400020702 6292

SmeDof-29CA06g18840

PGSC0003DMP400055798

64

Solyc06g069760.2

Peaxi162Scf00013g0042967

63

Ntb|46548

94

Peaxi162Scf00274g00447

Ntb|28690

Ntb|91939

Ntb|2636

Solyc03g115940.2

PGSC0003DMP400033936

SmeDof-21

CA03g2997063

56

Solyc05g007880.2

PGSC0003DMP400032069

SmeDof-23

Peaxi162Scf00222g00057

Ntb|97919

Ntb|36942

69

53

CA11g04250

Ntb|52144

Ntb|122577

SmeDof-18

CA05g08190

52

SmeDof-22

Solyc02g076850.1

PGSC0003DMP400059133

Peaxi162Scf00778g00021

98

Solyc02g067230.2

PGSC0003DMP400043612

84

Ntb|38293

Ntb|67487

CA02g01120

Peaxi162Scf00129g10011Sm

eDof-26

CA02g28150

Solyc02g088070.2

PGSC0003DM

P400002386

65

Ntb|113713N

tb|9941N

tb|9938N

tb|9940

99

C3

B1AB2

AD2

D1C2.2

C1

C2.1

Figure 2 The phylogenetic tree of theDof. genes from six plant species in Solanaceae. Individualspecies were distinguished by different shapes with different colors. Bootstrapping values are indicatedas percentages (when >50%) along the branches. The prefix of SmeDof, Solyc, PGSC, Peaxi, CA and Ntbindicated these genes were found in eggplant, tomato, potato, petunia, pepper and tobacco.

Full-size DOI: 10.7717/peerj.4481/fig-2

Table 2 Detailed information on the Dof members in each subgroup of the six plant species from family Solanaceae.

Plant species of Solanaceae Subgroup Unclassified Total

A B1 B2 C1 C2.1 C2.2 C3 D1 D2

Solanum melongena 3 3 4 2 4 1 3 6 3 0 29Solanum lycopersicum 2 6 4 2 4 1 4 6 3 1 33Solanum tuberosum 2 5 6 2 4 2 3 6 2 0 32Capsicum annuum 3 4 3 2 3 4 3 6 3 2 33Petunia axillaris 3 5 6 1 4 1 3 5 2 3 33Nicotiana tabacum 4 13 4 4 16 1 4 14 0 0 60

Wei et al. (2018), PeerJ, DOI 10.7717/peerj.4481 9/21

Page 10: Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif window length from

0

2 1

0 2

2

0

5'3'

0kb 1kb 2kb 3kb 4kb 5kb 6kb 7kb 8kb 9kb 10kb 11kb 17kb 18kb

Legend: CDS upstream/ downstream Intron 0 1 2: intron phase

A SmeDof-08

SmeDof-09

SmeDof-14

SmeDof-17

SmeDof-20

SmeDof-27

SmeDof-07

SmeDof-16

SmeDof-03

SmeDof-04

SmeDof-05

SmeDof-13

SmeDof-01

SmeDof-19

SmeDof-02

SmeDof-15

SmeDof-25

SmeDof-06

SmeDof-10

SmeDof-24

SmeDof-12

SmeDof-11

SmeDof-28

SmeDof-18

SmeDof-23

SmeDof-22

SmeDof-26

SmeDof-21

SmeDof-29

95

83

79

77

70

63

63

62

61

59

58

50

99

B1

B2

C1

C2.1

C2.2

C3

D1

D2

Figure 3 Gene structure of the eggplantDof genes. Yellow boxes represent CDS, and black and bluelines represent introns and UTRs, respectively. The lengths of the exons, introns and UTRs were drawn ac-cording to the length of sequences.

Full-size DOI: 10.7717/peerj.4481/fig-3

relationships can provide valuable information on duplication events and evolutionarypatterns (Cai et al., 2013). To investigate the mechanisms of structural evolution of SmeDofgenes in eggplant, we compared the exon-intron structures of 29 annotated SmeDof genes.A detailed illustration of gene structure including the CDS, intron and UTR structures isshown in Fig. 3. Similar to the structure of Arabidopsis, rice, cucumber and tomato Dofgenes (Lijavetzky, Carbonero & Vicente-Carbajosa, 2003; Wen et al., 2016; Cai et al., 2013),the eggplant Dof genes had few introns, which ranged in number from 0 to 2 in eachgene (Fig. 3). Three of the SmeDof genes, SmeDof-15, SmeDof-17 and SmeDof-29, hadone intron, whereas SmeDof-02 and SmeDof-07 both had two introns. Moreover, 24 of29 SmeDof genes were found without introns. SmeDof genes with introns showed either‘‘0’’, ‘‘1’’, or ‘‘2’’ intron phases and were in four different subgroups: B1, B2, C2.1, and D2.The gene SmeDof-17 had the longest intron. Of all eggplant Dof genes, SmeDof-06 was thelongest gene with the longest UTRs, whereas the shortest gene was SmeDof-22.

The diversity of motif compositions in SmeDof proteins was assessed using the MEMEprogramme, and a total of 15 conserved motifs were identified. The schematic distributionof these 15 motifs among SmeDof proteins is shown in Figs. 4 and 5. The motif, motif1, wasuniformly observed in all SmeDof proteins andwas confirmed as the conservedDof domain.

Wei et al. (2018), PeerJ, DOI 10.7717/peerj.4481 10/21

Page 11: Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif window length from

Figure 4 Conserved motifs embedded in the eggplantDof genes.Full-size DOI: 10.7717/peerj.4481/fig-4

Similar to the results from Arabidopsis, rice, cucumber and tomato (Lijavetzky, Carbonero& Vicente-Carbajosa, 2003; Wen et al., 2016; Cai et al., 2013), this result suggested thatSmeDof transcription factors were evolutionarily conserved in plants. Groups B2 andC2.2 and six other genes (SmeDof-02, SmeDof-06, SmeDof-08, SmeDof-10, SmeDof-15, andSmeDof-28) had only one conserved motif1 (Fig. 4). All members in group A containedmotif10 except for SmeDof-08, whereas motif13 was in all SmeDof proteins in Group B1.Both Dof proteins in Group C1 contained motif1 and motif14, with SmeDof-05 containingthe additional motif9. Members of group D2 generally had more than 9 motifs exceptfor SmeDof-22, which only contained motif1 and motif4; all other members had motif2,motif3, motif4, motif5, motif6, and motif11. The SmeDof genes in each subgroup hadseveral unique motifs, suggesting that Dof proteins within the same subclusters sharedcertain functional similarities. Distribution of the motifs revealed that the SmeDof geneswere likely conserved during evolution.

Wei et al. (2018), PeerJ, DOI 10.7717/peerj.4481 11/21

Page 12: Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif window length from

Figure 5 Conserved motifs embedded in the eggplantDof genes. Amino acid sequences of each motif.The font size represents the frequency of the respective amino acid.

Full-size DOI: 10.7717/peerj.4481/fig-5

Identification of orthologous, co-orthologous and paralogous Dofgenes in eggplant, tomato, Arabidopsis and riceThe relationships of orthologous, co-orthologous, and paralogous Dof genes amongfour species were investigated. Orthologs are genes evolved by vertical descent from asingle ancestral gene whereas paralogs are originated by duplication. The co-orthologsare subdivided type of orthologs, which are two or more genes in one lineage that are,collectively, orthologous to one or more genes in another lineage due to a lineage-specificduplication(s) (Koonin, 2005). The comparative analysis was performed using OrthoMCLwith default settings, and the results are presented in Fig. 6 and Table S2. Among the 29

Wei et al. (2018), PeerJ, DOI 10.7717/peerj.4481 12/21

Page 13: Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif window length from

Solyc04g070960.1

LOC_Os05g36900

AT3G52440

Solyc11g072500.1LOC_Os01g64590

SmeDof−25

Solyc00g024680.1AT4G24060SmeDof−01

SmeDof−21

AT3G47500 Solyc02g088070.2

Solyc08g082910.1

AT2G46590

Solyc01g096120.2

SmeDof−02

LOC_Os04g47990 LOC_Os02g45200

AT3G61850

Solyc10g086440.1

AT1G51700 LOC_Os03g07360

LOC_Os01g17000 SmeDof−29 Solyc02g067230.2

AT5G65590

AT5G39660

LOC_Os07g48570 Solyc03g115940.2

LOC_Os01g15900

Solyc06g069760.2

SmeDof−26

LOC_Os03g60630

AT2G37590

Solyc06g075370.2

AT5G02460

AT3G45610

SmeDof−27

AT5G60200SmeDof−06

AT2G28510LOC_Os07g13260

SmeDof−13 SmeDof−03SmeDof−10SmeDof−12SmeDof−14SmeDof−15SmeDof−16

Solyc02g077950.1Solyc06g005130.2Solyc04g079570.1Solyc03g112930.2

AT5G66940

AT4G21030

SmeDof−11

AT3G50410AT4G21050

AT4G21040

SmeDof−07Solyc02g090220.2

AT4G21080

Solyc11g010940.1

LOC_Os03g38870

Solyc06g062520.1

SmeDof−23

SmeDof−09

AT3G21270

LOC_Os10g26620

SmeDof−17

Solyc05g007880.2

SmeDof−24

LOC_Os01g09720

SmeDof−19SmeDof−28

AT1G29160

Solyc02g076850.1

AT3G55370

AT2G34140

SmeDof−22

AT5G62940

Solyc06g071480.2

SmeDof−20SmeDof−05

Solyc05g054510.1

Solyc11g066050.1 Solyc03g121400.1Solyc02g090310.1 Solyc10g009360.2Solyc02g077960.1 Solyc08g008500.2Solyc02g065290.1

Arabidopsis gene

Rice geneOrthologous

Coorthologous

ParalogousTomato gene

Eggplant gene

Gene Pairs Relationship

AT2G28810AT1G69570LOC_Os06g17410 LOC_Os09g29960

AT1G07640LOC_Os08g38220 LOC_Os02g49440 AT1G26790

Figure 6 Ortholog, co-ortholog and paralog gene pairs. The ellipse, rectangle, triangle and diamond indicate that a gene belongs to A. thaliana,O. sativa, S. lycopersicum and S. melongena, respectively, The red, purple and green lines indicate orthologous, co-orthologous and paralogous rela-tionships, respectively, and the width of the line is associated with the relationship index produced by OrthoMCL software.

Full-size DOI: 10.7717/peerj.4481/fig-6

SmeDof genes of eggplant and the 33 Dof genes of tomato, 28 orthologous gene pairs wereidentified. Ten orthologous gene pairs were identified among the 29 SmeDof genes and the36 Dof genes of Arabidopsis, whereas only five orthologous gene pairs were found betweeneggplant and rice (30 Dof genes). The most orthologous genes were between eggplant andtomato, which could be explained because both species are in the family Solanaceae. Wealso identified six co-orthologous gene pairs between eggplant and Arabidopsis and sevenpairs between eggplant and rice. Paralogous genes were detected in all four species, with 14pairs in Arabidopsis, nine pairs in rice, one pair in tomato, and one gene pair in eggplant.The genes SmeDof-21 and SmeDof-29 were the only paralogous Dof genes in eggplant.

Expression pattern of the Dof genes among different eggplantvarieties and close speciesTo increase our understanding of the expression profiles of the Dof among differenteggplant varieties and close species, we investigated gene expression for each SmeDof geneusing RNA-seq data of six samples: two cultivated eggplants, three closest wild relatives anda semi-wild relative. The FPKM expression of SmeDof genes in six eggplants is provided

Wei et al. (2018), PeerJ, DOI 10.7717/peerj.4481 13/21

Page 14: Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif window length from

Figure 7 The heat map shows the expression profile of the eggplantDof genes in six eggplants. The x-axis represents the names of the six eggplants, and the y-axis represents different SmeDof genes. The ex-pression levels of SmeDof genes are revealed by different colours, which increase from blue to red.

Full-size DOI: 10.7717/peerj.4481/fig-7

in Table S3. As shown in Table 3 and Fig. 7, the wild species AET_SG and INT_2006 wereclosely related. The alignment rate between RNA-seq reads of the wild SIS_2007 and thepublished eggplant genome sequence was only 13.40%, and therefore, this eggplant was themost distant from domesticated eggplants (Fig. 7). The cultivated eggplant MEL_HZHQwith linear, long fruits was phylogenetically more closely related to the semi-wild speciesMEL_S58 instead of the other cultivated typeMEL_LYQ, which has round fruits.Moreover,MEL_HZHQ is similar to the semi-wild species in morphology, with vine branches andmulti-pistillate flowering. The low expression values of genes in SIS_2007, AET_SG, andINT_2006 were highly supported by that these plant materials were close wild relatives toeggplant, not varieties of eggplant. The left reads mapped ratio were 70%–88% and the

Wei et al. (2018), PeerJ, DOI 10.7717/peerj.4481 14/21

Page 15: Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif window length from

Table 3 The alignment rates of RNA-seq sequences of six eggplants against the eggplant referencegenome.

Sample name Subspecies Eggplanttype

Left readsmapped

Left multimapped

Concordantmapped

AET_SG S. aethiopicum gr. Gilo wild 70.20% 4.70% 56.40%INT_2006 S.intergrifolium wild 71.80% 5.00% 58.40%MEL_HZHQ S. melongena cultivated 87.40% 4.70% 81.70%MEL_LYQ S. melongena cultivated 88.60% 5.00% 82.00%MEL_S58 S. melongena semi-wild 86.90% 5.00% 80.60%SIS_2007 S. sisymbriifolium wild 26.70% 6.30% 13.40%

concordant mapped ratio were 56%–82% except SIS_2007. The mapped ratio were in anormal range of RNA-Seq mapping ratio, as most studies have a concordant mapped ratiowere 50%–90%.

The expression levels of different SmeDof genes in the six types of eggplants arerepresented by different colours and are shown in Fig. 7. SmeDof-25 and SmeDof-04 didnot express in any of the eggplants. The expression of SmeDof genes SmeDof-20, SmeDof-19,and SmeDof-22 was either very low or not detecable in the six types of eggplants, whereasSmeDof-17 was highly expressed in all types of eggplants. The expression patterns ofSmeDof-05, SmeDof-08, and SmeDof-23 in MEL_LYQ were different from those in theother five eggplants; SmeDof-05 and SmeDof-08 were highly expressed in MEL_LYQ, butthe expression was comparatively low in other eggplants. By contrast, SmeDof-23 had highexpression levels in the other five eggplants but did not express in MEL_LYQ. The twogenes SmeDof-21 and SmeDof-27 had different expression in the wild eggplant SIS_2007compared with that in the other types of eggplants. The expression levels of SmeDof-29varied significantly between AET_SG and INT_2006. However, the functions of Dof genesin different ecotypes of eggplants remain to be further investigated.

CONCLUSIONSWe conducted a comprehensive analysis of Dof transpcrition factors in the genome ofeggplant. A total of 29 SmeDof genes encoding Dof transpcrition factors were identifiedfrom eggplant, which were classified into four groups and nine subgroups: A, B1, B2,C1, C2.1, C2.2, C3, D1 and D2. The gene structure, conserved motifs and homologousgenes of Dof genes in eggplant were investigated. All of the SmeDof proteins containedmotif1 which was considered as the conserved Dof domain. A comparative study of theDof family factors between eggplant and other plant species was performed that couldfacilitate the functional analysis of Dof factors in eggplant. Moreover, we also investigatedthe expression patterns of Dof genes in different types of eggplants. Our study providesvaluble information for further understanding of the mechanisms underlying abiotic stressresponse of SmeDof transcription factors.

Wei et al. (2018), PeerJ, DOI 10.7717/peerj.4481 15/21

Page 16: Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif window length from

ACKNOWLEDGEMENTSWe thank Nanjing Huasequen Biotechnologies Co, Ltd., China for assistance onbioinformatics analysis.

ADDITIONAL INFORMATION AND DECLARATIONS

FundingThis work was supported by the Natural Science Foundation of Zhejiang (LQ18C150004),and the New Variety Breeding Project of the Major Science and Technology Projects ofZhejiang (2016C02051-2). The funders had no role in study design, data collection andanalysis, decision to publish, or preparation of the manuscript.

Grant DisclosuresThe following grant information was disclosed by the authors:Natural Science Foundation of Zhejiang: LQ18C150004.Major Science and Technology Projects of Zhejiang: 2016C02051-2.

Competing InterestsThe authors declare there are no competing interests.

Author Contributions• Qingzhen Wei conceived and designed the experiments, performed the experiments,analyzed the data, prepared figures and/or tables, authored or reviewed drafts of thepaper, approved the final draft.

• Wuhong Wang analyzed the data, authored or reviewed drafts of the paper, approvedthe final draft.

• Tianhua Hu, Haijiao Hu, Weihai Mao and Qinmei Zhu analyzed the data, contributedreagents/materials/analysis tools, authored or reviewed drafts of the paper, approved thefinal draft.

• Chonglai Bao conceived and designed the experiments, authored or reviewed drafts ofthe paper, approved the final draft.

Data AvailabilityThe following information was supplied regarding data availability:

The raw data of RNA-seq experiment is now provided and deposited in Sequence ReadArchive (NCBI): SRP127743.

Supplemental InformationSupplemental information for this article can be found online at http://dx.doi.org/10.7717/peerj.4481#supplemental-information.

Wei et al. (2018), PeerJ, DOI 10.7717/peerj.4481 16/21

Page 17: Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif window length from

REFERENCESBailey TL, BodenM, Buske FA, Frith M, Grant CE, Clementi L, Ren J, Li WW, Noble

WS. 2009.MEME SUITE: tools for motif discovery and searching. Nucleic AcidsResearch 37:W202–W208 DOI 10.1093/nar/gkp335.

Cai XF, Zhang YY, Zhang CJ, Zhang TY, Hu TX, Ye J, Zhang JH,Wang TT, Li HX, YeZB. 2013. Genome-wide analysis of plant-specific dof transcription factor family intomato. Journal of Integrative Plant Biology 55:552–566 DOI 10.1111/jipb.12043.

Chen Y, Sun AJ, WangM, Zhu Z, Ouwerkerk P. 2014. Functions of the CCCH type zincfinger protein OsGZF1 in regulation of the seed storage protein GluB-1 from rice.Plant Molecular Biology 84:621–634 DOI 10.1007/s11103-013-0158-5.

Chen XY,Wang DX, Liu C,WangMZ,Wang T, Zhao Q, Yu JJ. 2012.Maize tran-scription factor Zmdof1 involves in the regulation of Zm401 gene. Plant GrowthRegulation 66:271–284 DOI 10.1007/s10725-011-9651-5.

Corrales AR, Nebauer SG, Carrillo L, Fernandez-Nohales P, Marques J, Renau-MorataB, Granell A, Pollmann S, Vicente-Carbajosa J, Molina RV, Medina J. 2014.Characterization of tomato Cycling Dof Factors reveals conserved and new functionsin the control of flowering time and abiotic stress responses. Journal of ExperimentalBotany 65:995–1012 DOI 10.1093/jxb/ert451.

Da Silva DC, Da Silveira Falavigna V, Fasoli M, Buffon V, Porto DD, Pappas Jr GJ,Pezzotti M, Pasquali G, Revers LF. 2016. Transcriptome analyses of the Dof-like gene family in grapevine reveal its involvement in berry, flower and seeddevelopment. Horticultural Research 3:16042 DOI 10.1038/hortres.2016.42.

Doganlar S, Frary A, DaunayMC, Tanksley SD. 2002. A comparative genetic linkagemap of eggplant (Solanum melongena) and its implications for genome evolution inthe solanaceae. Genetics 161(4):1697–1711.

Dong G, Ni Z, Yao Y, Nie X, Sun Q. 2007.Wheat Dof transcription factor WPBFinteracts with TaQM and activates transcription of an alpha-gliadin gene duringwheat seed development. Plant Molecular Biology 63:73–84.

Eddy SR. 2011. Accelerated profile HMM searches. PLOS Computational Biology7(10):e1002195 DOI 10.1371/journal.pcbi.1002195.

Feng BH, Han YC, Xiao YY, Kuang JF, Fan ZQ, Chen JY, LuWJ. 2016. The banana fruitDof transcription factor MaDof23 acts as a repressor and interacts with MaERF9in regulating ripening-related genes. Journal of Experimental Botany 67:2263–2275DOI 10.1093/jxb/erw032.

Finn RD, Bateman A, Clements J, Coggill P, Eberhardt RY, Eddy SR, Heger A,Hetherington K, Holm L, Mistry J, Sonnhammer ELL, Tate J, Punta M. 2014.Pfam: the protein families database. Nucleic Acids Research 42:D222–D230DOI 10.1093/nar/gkt1223.

Fornara F, Panigrahi KC, Gissot L, Sauerbrunn N, Ruhl M, Jarillo JA, Coupland G.2009. Arabidopsis DOF transcription factors act redundantly to reduce CONSTANSexpression and are essential for a photoperiodic flowering response. DevelopmentalCell 17:75–86 DOI 10.1016/j.devcel.2009.06.015.

Wei et al. (2018), PeerJ, DOI 10.7717/peerj.4481 17/21

Page 18: Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif window length from

Gabriele S, Rizza A, Martone J, Circelli P, Costantino P, Vittorioso P. 2010. TheDof protein DAG1 mediates PIL5 activity on seed germination by negativelyregulating GA biosynthetic gene AtGA3ox1. The Plant Journal 61:312–323DOI 10.1111/j.1365-313X.2009.04055.x.

Goodstein DM, Shu S, Howson R, Neupane R, Hayes RD, Fazo J, Mitros T, DirksW, Hellsten U, PutnamN, Rokhsar DS. 2012. Phytozome: a comparativeplatform for green plant genomics. Nucleic Acids Research 40:D1178–D1186DOI 10.1093/nar/gkr944.

Gualberti G, Papi M, Bellucci L, Ricci L, Bouchez D, Camilleri C, Costantino P,Vittorioso P. 2002.Mutations in the Dof zinc finger genes DAG2 and DAG1influence with opposite effects the germination of Arabidopsis seeds. The Plant Cell14(6):1253–1263 DOI 10.1105/tpc.010491.

Guo Y, Qiu LJ. 2013. Genome-wide analysis of the Dof transcription factor gene familyreveals soybean-specific duplicable and functional characteristics. PLOS ONE8(9):e76809 DOI 10.1371/journal.pone.0076809.

Hirakawa H, Shirasawa K, Miyatake K, Nunome T, Negoro S, Ohyama A, YamaguchiH, Sato S, Isobe S, Tabata S, Fukuoka H. 2014. Draft genome sequence of eggplant(Solanum melongena L.): the representative solanum species indigenous to the oldworld. DNA Research 21:649–660 DOI 10.1093/dnares/dsu027.

Hu B, Jin J, Guo AY, Zhang H, Luo J, Gao G. 2015. GSDS 2.0: an upgraded gene featurevisualization server. Bioinformatics 31:1296–1297 DOI 10.1093/bioinformatics/btu817.

HuangW, Huang Y, Li MY,Wang F, Xu ZS, Xiong AS. 2016. Dof transcription factorsin carrot: genome-wide analysis and their response to abiotic stress. BiotechnologyLetters 38:145–155 DOI 10.1007/s10529-015-1966-2.

Imaizumi T, Schultz TF, Harmon FG, Ho LA, Kay SA. 2005. FKF1 F-box proteinmediates cyclic degradation of a repressor of CONSTANS in Arabidopsis. Science309:293–297 DOI 10.1126/science.1110586.

Kim S, ParkM, Yeom SI, Kim YM, Lee JM, Lee HA, Seo E, Choi J, Cheong K, Kim KT,Jung K, Lee GW, Oh SK, Bae C, Kim SB, Lee HY, Kim SY, KimMS, Kang BC, JoYD, Yang HB, Jeong HJ, KangWH, Kwon JK, Shin C, Lim JY, Park JH, Huh JH,Kim JS, Kim BD, Cohen O, Paran I, SuhMC, Lee SB, Kim YK, Shin Y, Noh SJ,Park J, Seo YS, Kwon SY, KimHA, Park JM, KimHJ, Choi SB, Bosland PW, ReevesG, Jo SH, Lee BW, Cho HT, Choi HS, Lee MS, Yu Y, Do Choi Y, Park BS, DeynzeAvan, Ashrafi H, Hill T, KimWT, Pai HS, Ahn HK, Yeam I, Giovannoni JJ, RoseJK, Sorensen I, Lee SJ, Kim RW, Choi IY, Choi BS, Lim JS, Lee YH, Choi D. 2014.Genome sequence of the hot pepper provides insights into the evolution of pungencyin Capsicum species. Nature Genetics 46:270–278 DOI 10.1038/ng.2877.

Konishi M, Yanagisawa S. 2007. Sequential activation of two Dof transcription factorgene promoters during vascular development in Arabidopsis thaliana. Plant Physiol-ogy and Biochemistry 45:623–629 DOI 10.1016/j.plaphy.2007.05.001.

Koonin EV. 2005. Orthologs, paralogs, and evolutionary genomics. Annual Review ofGenetics 39:309–338 DOI 10.1146/annurev.genet.39.073003.114725.

Wei et al. (2018), PeerJ, DOI 10.7717/peerj.4481 18/21

Page 19: Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif window length from

Koralewski TE, Krutovsky KV. 2011. Evolution of exon-intron structure and alternativesplicing. PLOS ONE 6(3):e18055 DOI 10.1371/journal.pone.0018055.

Kurai T, WakayamaM, Abiko T, Yanagisawa S, Aoki N, Ohsugi R. 2011. Intro-duction of the ZmDof1 gene into rice enhances carbon and nitrogen assim-ilation under low-nitrogen conditions. Plant Biotechnol Journal 9:826–837DOI 10.1111/j.1467-7652.2011.00592.x.

Kushwaha H, Gupta S, Singh VK, Rastogi S, Yadav D. 2011. Genome wide identificationof Dof transcription factor gene family in sorghum and its comparative phylogeneticanalysis with rice and Arabidopsis.Molecular Biology Reports 38:5037–5053DOI 10.1007/s11033-010-0650-9.

LarkinMA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliamH,Valentin F, Wallace IM,Wilm A, Lopez R, Thompson JD, Gibson TJ, HigginsDG. 2007. Clustal W and Clustal X version 2.0. Bioinformatics 23:2947–2948DOI 10.1093/bioinformatics/btm404.

Li L, Stoeckert Jr CJ, Roos DS. 2003. OrthoMCL: identification of ortholog groups foreukaryotic genomes. Genome Research 13:2178–2189 DOI 10.1101/gr.1224503.

Lijavetzky D, Carbonero P, Vicente-Carbajosa J. 2003. Genome-wide comparative phy-logenetic analysis of the rice and arabidopsis Dof gene families. BMC EvolutionaryBiology 3:17 DOI 10.1186/1471-2148-3-17.

Ma J, Li MY,Wang F, Tang J, Xiong AS. 2015. Genome-wide analysis of Dof familytranscription factors and their responses to abiotic stresses in Chinese cabbage. BMCGenomics 16(1):33 DOI 10.1186/s12864-015-1242-9.

Malviya N, Gupta S, Singh VK, YadavMK, Bisht NC, Sarangi BK, Yadav D.2015. Genome wide in silico characterization of Dof gene families of pigeon-pea (Cajanus cajan (L) Millsp).Molecular Biology Reports 42(2):535–552DOI 10.1007/s11033-014-3797-y.

Mehrotra R, Jain V, Shekhar C, Mehrotra S. 2014. Genome wide analysis of Arabidopsisthaliana reveals high frequency of AAAGN7CTTT motif.Meta Gene 2:606–615DOI 10.1016/j.mgene.2014.05.003.

Moreno-RisuenoMA,Martinez M, Vicente-Carbajosa J, Carbonero P. 2007. Thefamily of DOF transcription factors: from green unicellular algae to vascular plants.Molecular Genetics and Genomics 277(4):379–390 DOI 10.1007/s00438-006-0186-9.

Papi M, Sabatini S, AltamuraMM, Hennig L, Schafer E, Costantino P, Vittorioso P.2002. Inactivation of the phloem-specific Dof zinc finger gene DAG1 affects responseto light and integrity of the testa of Arabidopsis seeds. Plant Physiology 128:411–417DOI 10.1104/pp.010488.

Papi M, Sabatini S, Bouchez D, Camilleri C, Costantino P, Vittorioso P. 2000.Identification and disruption of an Arabidopsis zinc finger gene controlling seedgermination. Genes and Development 14(1):28–33.

Park DH, Lim PO, Kim JS, Cho DS, Hong SH, NamHG. 2003. The ArabidopsisCOG1 gene encodes a Dof domain transcription factor and negatively regulatesphytochrome signaling. The Plant Journal 34:161–171DOI 10.1046/j.1365-313X.2003.01710.x.

Wei et al. (2018), PeerJ, DOI 10.7717/peerj.4481 19/21

Page 20: Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif window length from

Plesch G, Ehrhardt T, Mueller-Roeber B. 2001. Involvement of TAAAG elementssuggests a role for Dof transcription factors in guard cell-specific gene expression.The Plant Journal 28:455–464.

Potato Genome Sequencing Consortium. 2011. Genome sequence and analysis of thetuber crop potato. Nature 475:189–195 DOI 10.1038/nature10158.

Rueda-Romero P, Barrero-Sicilia C, Gomez-Cadenas A, Carbonero P, Onate-Sanchez L. 2012. Arabidopsis thaliana DOF6 negatively affects germination innon-after-ripened seeds and interacts with TCP14. Journal of Experimental Botany63:1937–1949 DOI 10.1093/jxb/err388.

Shaw LM,McIntyre CL, Gresshoff PM, Xue GP. 2009.Members of the Dof transcriptionfactor family in Triticum aestivum are associated with light-mediated gene regula-tion. Functional and Integrative Genomics 9:485–498 DOI 10.1007/s10142-009-0130-2.

ShigyoM, Tabei N, Yoneyama T, Yanagisawa S. 2007. Evolutionary processes duringthe formation of the plant-specific Dof transcription factor family. Plant and CellPhysiology 48(1):179–185 DOI 10.1093/pcp/pcl044.

Shimofurutani N, Kisu Y, Suzuki M, EsakaM. 1998. Functional analyses of the Dofdomain, a zinc finger DNA-binding domain, in a pumpkin DNA-binding proteinAOBP. Febs Letters 430:251–256 DOI 10.1016/S0014-5793(98)00670-X.

Sierro N, Battey JN, Ouadi S, Bakaher N, Bovet L, Willig A, Goepfert S, PeitschMC,Ivanov NV. 2014. The tobacco genome sequence and its comparison with those oftomato and potato. Nature Communications 5:3833–3842.

Smoot ME, Ono K, Ruscheinski J, Wang PL, Ideker T. 2011. Cytoscape 2.8: newfeatures for data integration and network visualization. Bioinformatics 27:431–432DOI 10.1093/bioinformatics/btq675.

Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. 2013.MEGA6: molecular evolu-tionary genetics analysis version 6.0.Molecular Biology and Evolution 30:2725–2729DOI 10.1093/molbev/mst197.

Tan HW, Song XM, DuanWK,Wang Y, Hou XL. 2015. Genome-wide analysis of theSBP-box gene family in Chinese cabbage (Brassica rapa subsp. pekinensis). Genome58:463–477 DOI 10.1139/gen-2015-0074.

Taylor JS, Raes J. 2004. Duplication and divergence: the evolution of new genes and oldideas. Annual Review of Genetics 38:615–643DOI 10.1146/annurev.genet.38.072902.092831.

Tomato Genome Consortium. 2012. The tomato genome sequence provides insightsinto fleshy fruit evolution. Nature 485:635–641 DOI 10.1038/nature11119.

Venkatesh J, Park SW. 2015. Genome-wide analysis and expression profiling of DNA-binding with one zinc finger (Dof) transcription factor family in potato. PlantPhysiology and Biochemistry 94:73–85 DOI 10.1016/j.plaphy.2015.05.010.

Ward JM, Cufr CA, Denzel MA, Neff MM. 2005. The Dof transcription factor OBP3modulates phytochrome and cryptochrome signaling in Arabidopsis. The Plant Cell17:2614–2614 DOI 10.1105/tpc.105.170960.

Washio K. 2003. Functional dissections between GAMYB and Dof transcriptionfactors suggest a role for protein-protein associations in the gibberellin-mediated

Wei et al. (2018), PeerJ, DOI 10.7717/peerj.4481 20/21

Page 21: Genome-wide identification and characterization of Dof … · 2018. 3. 6. · v4.11.0) was used to search for motifs in all 29 eggplant Dof proteins with a motif window length from

expression of the RAmy1A gene in the rice aleurone. Plant Physiology 133:850–863DOI 10.1104/pp.103.027334.

Wen CL, Cheng Q, Zhao L, Mao A, Yang J, Yu S,Weng Y, Xu Y. 2016. Identificationand characterisation of Dof transcription factors in the cucumber genome. ScientificReports 6:23072 DOI 10.1038/srep23072.

Wray GA, HahnMW, Abouheif E, Balhoff JP, Pizer M, RockmanMV, Romano LA.2003. The evolution of transcriptional regulation in eukaryotes.Molecular Biologyand Evolution 20:1377–1419 DOI 10.1093/molbev/msg140.

WuZM, Cheng JW, Cui JJ, Xu XW, Bang GS, Luo XR, Chen XC, Tang XQ, Hu KL, QinC. 2016. Genome-wide identification and expression profile of Dof transcriptionfactor gene family in pepper (Capsicum annuum L.). Frontiers in Plant Science 7:574DOI 10.3389/fpls.2016.00574.

Yanagisawa S. 2000. Dof1 and Dof2 transcription factors are associated with expressionof multiple genes involved in carbon metabolism in maize. The Plant Journal21:281–288 DOI 10.1046/j.1365-313x.2000.00685.x.

Yanagisawa S. 2002. The Dof family of plant transcription factors. Trends in Plant Science7(12):555–560 DOI 10.1016/S1360-1385(02)02362-2.

Yanagisawa S. 2004. Dof domain proteins: plant-specific transcription factors associatedwith diverse phenomena unique to plants. Plant and Cell Physiology 45(4):386–391DOI 10.1093/pcp/pch055.

Yang X, Tuskan GA, ChengMZ. 2006. Divergence of the Dof gene families in poplar,Arabidopsis, and rice suggests multiple modes of gene evolution after duplication.Plant Physiology 142(3):820–830 DOI 10.1104/pp.106.083642.

Zhang B, ChenW, Foley RC, Buttner M, Singh KB. 1995. Interactions between distincttypes of DNA binding proteins enhance binding to ocs element promoter sequences.The Plant Cell 7:2241–2252 DOI 10.1105/tpc.7.12.2241.

Zou HF, Zhang YQ,WeiW, Chen HW, Song QX, Liu YF, ZhaoMY,Wang F, Zhang BC,Lin Q, ZhangWK,Ma BA, Zhou YH, Zhang JS, Chen SY. 2013. The transcriptionfactor AtDOF4.2 regulates shoot branching and seed coat formation in Arabidopsis.Biochemical Journal 449:373–388 DOI 10.1042/BJ20110060.

Wei et al. (2018), PeerJ, DOI 10.7717/peerj.4481 21/21