Structure homology and interaction redundancy for …...Structure homology and interaction...

7
Structure homology and interaction redundancy for discovering virus–host protein interactions Benoıˆ t de Chassey 1 , Laure`ne Meyniel-Schicklin 2,3 , Anne Aublin-Gex 2,3 , Vincent Navratil 2,3, w , Thibaut Chantier 2,3 , Patrice Andre´ 1,2,3 & Vincent Lotteau 2,3+ 1 Hospices Civils de Lyon, Ho ˆpital de la Croix-Rousse, Laboratoire de virologie, Lyon , 2 Inserm, U1111, Lyon , and 3 Universite ´ de Lyon, Lyon, France Virus–host interactomes are instrumental to understand global perturbations of cellular functions induced by infection and discover new therapies. The construction of such interactomes is, however, technically challenging and time consuming. Here we describe an original method for the prediction of high-confidence interactions between viral and human proteins through a combination of structure and high-quality interactome data. Validation was performed for the NS1 protein of the influenza virus, which led to the identification of new host factors that control viral replication. Keywords: interactome; prediction; protein interaction; structure; virus EMBO reports advance online publication 6 September 2013; doi:10.1038/embor.2013.130 INTRODUCTION Viruses are obligate parasites that rely on cellular functions to replicate. Identifying the cellular functions that a virus must use, counteract or interfere with to assure its replication is therefore a major issue for both basic knowledge and drug discovery. Biological processes being largely dependent on protein–protein interactions, cellular functions involved in viral replication can be identified by mapping virus–host protein–protein interactions and by constructing virus–host interactomes. Until 2007, viral protein– host protein interactions have essentially been identified in low-throughput experiments. Several databases are integrating these data spread in the literature [1–2]. The construction of physical viral ORFeomes and the development of high-throughput technologies, such as yeast two-hybrid (Y2H) and tandem affinity purification coupled to mass spectrometry, led to the publication of the first virus–host interactomes [3–4]. Although incorporation of data from different methods or variation of the same method [5] has improved the quality of the data sets, diversification of methods is still clearly needed to generate high-quality comprehensive virus–host interactomes. In addition, regarding the size of host genome and the huge diversity of viruses, millions of binary interactions remain to be tested. Therefore, accurate and rapid methods for the identification of cellular interactors controlling viral replication is a major issue and a crucial step towards the selection of original therapeutic targets and drug development. This could benefit from predictive methods preselecting subsets of putative interactions. Computational methods are essentially dedicated to predict intraspecies interac- tions [6] but applications to pathogen–host protein inter- actions are now emerging [7]. These methods are generally evaluated by the overlap between predicted and published interaction data sets. Here, we present and experimentally validate an original method for the prediction of virus–host interactions combining protein structure homology and interaction redundancy. RESULTS AND DISCUSSION Principle of the method Several methods have proposed to use structural information to predict protein–protein interactions [8–10]. The method described here relies on the assumption that when two proteins are structurally homologous, they are more likely to have interactors in common (Fig 1A). Using the Structural Classification of Proteins from SCOP database [11] and the high-quality data sets of protein–protein interactions from the VirHostNet database [2], we showed that this assumption is a general feature of human and viral proteins (supplementary information online). Briefly, for a viral protein having a solved structure, structurally homologous human and viral proteins are first selected. Inter- actors of these homologous proteins are then identified. These proteins are considered as putative interactors and ranked according to a score that favors proteins independently identified from multiple structural homologues. 1 Hospices Civils de Lyon, Ho ˆpital de la Croix-Rousse, Laboratoire de virologie, 103 grande rue de la Croix-Rousse, Lyon, F-69004, 2 Inserm, U1111, 21 Avenue Tony Garnier, Lyon, F-69007, 3 Universite ´ de Lyon, Lyon, France + Corresponding author. Tel: þ 33 437262971; Fax: þ 33 437282341; E-mail: [email protected] w Present address: Laboratoire de Biome ´trie et Biologie E ´ volutive, CNRS, Universite ´ Claude Bernard Lyon 1, 43 boulevard du 11 Novembre 1918, 69622 Villeurbanne Cedex, France. Received 14 February 2013; revised 29 July 2013; accepted 31 July 2013; published online 6 September 2013 scientificreport scientific report 1 &2013 EUROPEAN MOLECULAR BIOLOGY ORGANIZATION EMBO reports

Transcript of Structure homology and interaction redundancy for …...Structure homology and interaction...

Page 1: Structure homology and interaction redundancy for …...Structure homology and interaction redundancy for discovering virus–host protein interactions Benoıˆt de Chassey 1,Laure`ne

Structure homology and interaction redundancyfor discovering virus–host protein interactionsBenoıt de Chassey 1, Laurene Meyniel-Schicklin 2,3, Anne Aublin-Gex 2,3, Vincent Navratil 2,3,w,Thibaut Chantier 2,3, Patrice Andre 1,2,3 & Vincent Lotteau 2,3+

1Hospices Civils de Lyon, Hopital de la Croix-Rousse, Laboratoire de virologie, Lyon , 2Inserm, U1111, Lyon , and 3Universite de Lyon,

Lyon, France

Virus–host interactomes are instrumental to understand globalperturbations of cellular functions induced by infection anddiscover new therapies. The construction of such interactomes is,however, technically challenging and time consuming. Here wedescribe an original method for the prediction of high-confidenceinteractions between viral and human proteins througha combination of structure and high-quality interactome data.Validation was performed for the NS1 protein of the influenzavirus, which led to the identification of new host factors thatcontrol viral replication.Keywords: interactome; prediction; protein interaction;structure; virusEMBO reports advance online publication 6 September 2013; doi:10.1038/embor.2013.130

INTRODUCTIONViruses are obligate parasites that rely on cellular functions toreplicate. Identifying the cellular functions that a virus must use,counteract or interfere with to assure its replication is therefore amajor issue for both basic knowledge and drug discovery.Biological processes being largely dependent on protein–proteininteractions, cellular functions involved in viral replication can beidentified by mapping virus–host protein–protein interactions andby constructing virus–host interactomes. Until 2007, viral protein–host protein interactions have essentially been identified inlow-throughput experiments. Several databases are integratingthese data spread in the literature [1–2]. The construction ofphysical viral ORFeomes and the development of high-throughputtechnologies, such as yeast two-hybrid (Y2H) and tandem affinity

purification coupled to mass spectrometry, led to the publicationof the first virus–host interactomes [3–4]. Although incorporationof data from different methods or variation of the same method [5]has improved the quality of the data sets, diversification ofmethods is still clearly needed to generate high-qualitycomprehensive virus–host interactomes. In addition, regardingthe size of host genome and the huge diversity of viruses, millionsof binary interactions remain to be tested. Therefore, accurate andrapid methods for the identification of cellular interactorscontrolling viral replication is a major issue and a crucial steptowards the selection of original therapeutic targets and drugdevelopment. This could benefit from predictive methodspreselecting subsets of putative interactions. Computationalmethods are essentially dedicated to predict intraspecies interac-tions [6] but applications to pathogen–host protein inter-actions are now emerging [7]. These methods are generallyevaluated by the overlap between predicted and publishedinteraction data sets. Here, we present and experimentallyvalidate an original method for the prediction of virus–hostinteractions combining protein structure homology andinteraction redundancy.

RESULTS AND DISCUSSIONPrinciple of the methodSeveral methods have proposed to use structural information topredict protein–protein interactions [8–10]. The method describedhere relies on the assumption that when two proteins arestructurally homologous, they are more likely to have interactorsin common (Fig 1A). Using the Structural Classification of Proteinsfrom SCOP database [11] and the high-quality data sets ofprotein–protein interactions from the VirHostNet database [2], weshowed that this assumption is a general feature of humanand viral proteins (supplementary information online). Briefly,for a viral protein having a solved structure, structurallyhomologous human and viral proteins are first selected. Inter-actors of these homologous proteins are then identified. Theseproteins are considered as putative interactors and rankedaccording to a score that favors proteins independently identifiedfrom multiple structural homologues.

1Hospices Civils de Lyon, Hopital de la Croix-Rousse, Laboratoire de virologie,103 grande rue de la Croix-Rousse, Lyon, F-69004,2Inserm, U1111, 21 Avenue Tony Garnier, Lyon, F-69007,3Universite de Lyon, Lyon, France

+Corresponding author. Tel: þ 33 437262971; Fax: þ 33 437282341;E-mail: [email protected]

wPresent address: Laboratoire de Biometrie et Biologie Evolutive, CNRS, UniversiteClaude Bernard Lyon 1, 43 boulevard du 11 Novembre 1918, 69622 VilleurbanneCedex, France.

Received 14 February 2013; revised 29 July 2013; accepted 31 July 2013;published online 6 September 2013

scientificreportscientific report

1&2013 EUROPEAN MOLECULAR BIOLOGY ORGANIZATION EMBO reports

Page 2: Structure homology and interaction redundancy for …...Structure homology and interaction redundancy for discovering virus–host protein interactions Benoıˆt de Chassey 1,Laure`ne

Prediction of influenza A virus NS1 protein interactionsAs a proof of concept, we applied this method to identify newpartners of the influenza A NS1 protein whose full-length structurehas been solved by x-ray crystallography [12], (Fig 1B). A total of22,052 human protein structures and 4,933 viral protein structuresrespectively corresponding to 3,163 human proteins and 440 viralproteins were extracted from the Protein Data Bank (PDB). Thestandalone version 3.1 of the Dali programme was used to performone-against-all pairwise alignments between NS1 and these largedata sets of viral and human protein structures [13]. NS1 structuralhomologues with a Dali z-score between 2 and 20 were retainedto include weakly homologous proteins and exclude fullyhomologous NS1 proteins [14]. All homologues that are full-length or part of NS1 proteins of influenza A viruses were alsoexcluded. Hence, known NS1–human interactors were not favored.Influenza NS1 protein was found structurally homologous to 344human proteins and 42 viral proteins. Using high-quality data sets

of protein–protein interactions from the VirHostNet database [2],a total of 1,384 human proteins were predicted to interact withNS1- 1,377 interacting with human proteins that are homologousto NS1 and 86 interacting with viral proteins that are homologousto NS1. These 1,384 proteins were then ranked according tothe number and origin of the NS1 homologues weighted bytheir degree. The 5% top-ranked candidates, corresponding to69 proteins, were selected for further validation (Fig 1B,supplementary Table S1 online). In silico mapping of the predictedinteractions can also be performed when the structure of domainsor fragments are available. For NS1, an RNA-binding domainand an effector domain have been structurally defined (PDBidentifiers: 2Z0A and 3D6R, supplementary Table S2 online). Fiftyeight of the 69 predicted interactors for the full-length NS1 proteinwere predicted to interact with the RNA-binding domain whilenone of them were predicted to interact with the effector domain(supplementary Table S1 online).

Protein of interest(full-lengthor domain)

Viral and human3D structural

homologs

Humaninteractors

Best predictedinteractors

New picture ofthe interactome

Scoring

Protein-proteininteraction db(VirHostNet)

VirHostNet72,339 HH ppi

16,187 HH ppihigh–quality dataset

14,422 H proteinstructures

ChainSplitting

22,052 PDBchains (3,163H proteins) 344 homologous

H proteins1,377 predicted

H interactors

PDB

Influenza NS1 full-length structure

(3F5TA)

1/allpairwisestructure

alignments(Dali)

2<z-score<20

Scoring5 %best

69 predictedH interactors

Experimentalbenchmarking

3,130 V proteinstructures

ChainSplitting

4,933 PDBchains (440 V

proteins)

42 homologousV proteins

86 predictedH interactors

Score>3

26 predictedH interactors

VirHostNet4,458 VH ppi

2,539 VH ppi high-quality data set

A

B

Fig 1 | Description of the method. (A) Principle of the method. (B) Detailed steps of the method applied to influenza A virus NS1 protein. The score

favors independent predictions provided by interaction redundancies. Score (putative interactor)¼ [(nH2/kH)þ (nV

2/kV)]� a. With nH and nV number

of distinct human interactors and viral interactors, respectively; kH and kV degree of the protein in the human protein–protein interaction data set and

in the virus–human protein–protein interaction data set, respectively; a¼ 3 for a prediction from both human and virus–human data sets, a¼ 2 from

only virus–human data set, 1 else. H, human; V, virus.

Prediction of virus–host protein interactions

B. de Chassey et alscientificreport

2 EMBO reports &2013 EUROPEAN MOLECULAR BIOLOGY ORGANIZATION

Page 3: Structure homology and interaction redundancy for …...Structure homology and interaction redundancy for discovering virus–host protein interactions Benoıˆt de Chassey 1,Laure`ne

Functional analysis of NS1 interactorsSixty-four of the 69 predicted interactors are known to beexpressed in the normal lung or trachea in accordance withthe tropism of the virus (supplementary Table S1 online). Analysisof enriched gene ontology terms and KEGG pathways corres-ponding to the 69 predicted interactors of NS1 confirms andprovides more molecular supports to the interplay of NS1 withessential cellular functions. A significant proportion of thepredicted interactors being involved in apoptosis (BenjaminiP-value¼ 6.31� 10�6), this new data set might help clarifyingthe complex and controversial role of NS1 and apoptosis in viralreplication [15–18]. In the enriched RIG-I signalling pathway(Benjamini P-value¼ 5.4� 10�3), NS1 is known [19,20] tointerfere with the signalling of viral RNA recognition by thepathogen recognition receptors by interacting with RIG-I andTRIM25 [21,22]. Here, the interaction of NS1 with 4 IKK proteinsfurther supports its ability to interfere with this signalling at a moredownstream step. Predicted interaction of NS1 with nine proteinsof the mitogen-activated protein kinase pathway (BenjaminiP-value¼ 1.9� 10�3) is also demonstrative of the crucial role ofthis pathway whose late activation is required for efficientreplication of the virus [23].

Validation of NS1 interactorsExperimental validation of the predicted interactions wasperformed by Y2H and GST pull down for 32 candidates.Eighty-one percent of the interactions were confirmed by Y2Hand 72% by co-affinity precipitation (supplementary Fig S1online). Interestingly, 100% of the interactions were validatedby at least one method when the predicted interactor had a

score 4¼ 3. The validation rate was 83% for predicted interactorhaving a score o3 (supplementary Table S1 online). Therefore,using a score 4¼ 3 as a threshold to select the best interactors,the prediction method allowed the identification of very high-confidence interactors. Accordingly, from the 69 interactorsinitially predicted, a very high-confidence set of 26 NS1interactors was retained (Figs 1B and 2A). Performance assessmentof the prediction method shows that this score threshold leadsto an optimal success regarding specificity and precision(supplementary information online). Twenty-five out of these 26interactors were new, illustrating the potential of this method topredict novel interactions (supplementary Table S1 online). Fromthe 26 interactors, 24 were predicted to interact with the RNA-binding domain but not the effector domain of NS1 (supplementaryTable S1 online). This was confirmed by GST pull down for theNS1–BID interaction further validating the method (Fig 2B).

As the overlap of predicted interactors for full-length NS1 andthe RBD is not 100%, both domains and full-length proteinsare essential to explore the full potential of this method.

Impact of silencing NS1 interactors on viral replicationThe 26 high-confidence NS1 predicted interactors were furthertested for their ability to control influenza virus replication.Systematic gene silencing was performed with three independentsmall interfering RNA per gene and virus production was firstevaluated 24 h post infection by measuring neuraminidase activityin the culture supernatant. When viral production was inhibited orenhanced by at least twofold with at least two short interferingRNAs (siRNAs) in at least two out of three experiments, viralproduction was further quantified with a plaque-forming unit

Final predictedcellular interactor

Interactionvalidation

siRNAassay

Ensembl IDOfficialname Y2H GST pull

downNeuraminidase

activity PFU

ENSG00000002330

ENSG00000153094

ENSG00000107223

ENSG00000110324

ENSG00000169047

ENSG00000143393

ENSG00000132155

ENSG00000160691

ENSG00000136560

ENSG00000175564

ENSG00000168646

ENSG00000087088

ENSG00000015475

ENSG00000179151

ENSG00000030110

ENSG00000157764

ENSG00000101224

ENSG00000189229

ENSG00000107566

ENSG00000143466

ENSG00000185950

ENSG00000155506

ENSG00000116191

ENSG00000147133

ENSG00000103197

ENSG00000138592

BAD

BCL2L11

EDF1

IL10RA

IRS1

PIK4B

RAF1

SHC1

TANK

UCP3

AXIN2

BAX

BID

EDC3

BAK1

BRAF

CDC25B

CDK11B

CHUK

IKBKE

IRS2

LAFP1

RALGPS2

TAF1

TSC2

USP8

×

××××××××××

××

×××××××

×

×

×

×

× ×

× ×

× ×

× ×× ×× ×× ×

× ×

× ×

GS

T

GS

T–N

S1

GS

T–N

S1–

RB

D

GS

T–N

S1–

effe

ctor

IB: myc

Pulldown

Input

IB: GST

IB: myc

22.020.018.016.014.012.010.08.06.04.02.00.0

CTRL

BID

BCL2L1

1TA

F1

Rep

licat

ion

fold

incr

ease

1.2

1

0.8

0.6

0.4

0.2

0Rep

licat

ion

fold

incr

ease

CTRL

IL10R

A

LARP1

BRAFBAX

RAF1

AXIN2

EDC3

A BC

D

Fig 2 | Functional characterization of the 26 best-predicted interactors. (A) The physical and genetic interactions have been tested (white) or not (grey).

Essential host factors are in green and restriction host factors are in red. (B) BID–NS1 interaction mapping. Interaction of myc–BID with GST–NS1

and GST–NS1 RBD but not with GST alone or GST–NS1 effector domain. (C,D) Impact of silencing indicated genes on viral replication. Values are the

number of PFUs normalized to control siRNA. The mean values ±- s.d. from three independent experiments are shown. (C) Restriction host factors.

(D) Essential host factors. GST, glutathione S-transferase; PFU, plaque-forming unit; RBD, RNA-binding domain; siRNA, short interfering RNA.

Prediction of virus–host protein interactions

B. de Chassey et al scientificreport

3&2013 EUROPEAN MOLECULAR BIOLOGY ORGANIZATION EMBO reports

Page 4: Structure homology and interaction redundancy for …...Structure homology and interaction redundancy for discovering virus–host protein interactions Benoıˆt de Chassey 1,Laure`ne

assay. Using this criteria, 10 new cellular interactors of NS1 werefound to directly modulate the replication of influenza A virus—three were functioning as restriction host factors and seven asessential host factors (Figs 2A,C,D). None of the tested siRNAsaffected cell survival (supplementary Fig S2 online). Therefore,the method generated a data set that is strongly enrichedin modulators of replication—43.5 versus 0.05–1.2% forsiRNA pangenomic screens.

Proteome-wide prediction of virus–host interactionsHaving validated the method, predictions were extended to otherproteins of the same virus to construct a predicted influenza–human interactome. Structural information was available for13 structures corresponding to nine viral proteins. Following theprocess described above for NS1, 108 interactions werepredicted connecting 41 cellular proteins to six viral proteins(Fig 3, supplementary Tables S3 and S4 online). The low overlapof our set of interactions with literature reflects the incomplete-ness of previously reported influenza virus–host proteininteraction data (supplementary information online). In addition,structural and interactomic data availability restricts the list ofpredictable interactors, as illustrated for NS1 in supplementaryinformation online. Some human proteins are predicted to

interact with several viral proteins. Multi-targeting of specifichost factors by different proteins of the same virus has beenobserved previously, highlighting the importance of suchproteins for the virus [24]. Interestingly, viral proteins sharingcommon interactors are those displaying a certain degree ofstructural homology (supplementary Table S5 online). Finally, anup-to-date influenza–human interactome combining literatureand predicted data are presented in Fig 4.

METHODSRetrieval and annotation of viral and human structures. Allsequences from viral and human protein structures stored at thePDB were retrieved in a Fasta format and blasted to be annotatedwith a GenBank identification for viral proteins and Ensemblprotein identification for human proteins as it is the case in theVirHostNet database. A total of 4,933 PDB sequences (viralGenBank identification) and 22,052 PDB sequences (humanEnsembl protein identification) have been retrieved and assignedto 440 distinct viral proteins and 3,163 distinct human proteins.For the full-length influenza A virus NS1 protein, the chain A ofthe 3F5T PDB structure has been used.High-quality protein–protein interaction data from VirHostNet.Protein–protein interaction data were retrieved from the VirHostNet

Viral protein

Human protein

Human protein/host factor

Influenza-humanprotein–protein interaction

CHUK BBC3

BAXCDK11B

BADBNIP1

CDC25BIL10RA

BID

IKBKE

LIFREDF1

BAK1 TANK

TAF6BRAF

CBL

AXIN2

NAP1L1

GRB10 RAF1

BCL2L11

UCP3EDC3

RALGPS2LARP1

TAF1

PI4KBUSP8

TSC2

BRD8

MED1

NCOA2PNRC2

NRBF2

NR0B2

MECR

FKBP8

IRS1

SHC1

IRS2

PA

PB1

NS1

NP

M1

HA

Fig 3 | Predicted influenza–human protein–protein interaction network. Network composed of 108 interactions involving 6 influenza viral proteins and

41 human proteins. Twelve of 41 human proteins have been identified to be host factors for influenza virus replication (from literature and this work).

The CHUK-NS1 interaction was already reported in the literature.

Prediction of virus–host protein interactions

B. de Chassey et alscientificreport

4 EMBO reports &2013 EUROPEAN MOLECULAR BIOLOGY ORGANIZATION

Page 5: Structure homology and interaction redundancy for …...Structure homology and interaction redundancy for discovering virus–host protein interactions Benoıˆt de Chassey 1,Laure`ne

database [2]. A high-quality data set of human protein–proteininteractions were defined by selecting interactions identifiedby two different methods or in two independent papers.Indeed, the human interactome is an integration of data fromseveral databases, and might be polluted by low-confidenceinteractions [25]. A high-quality data set of virus–human protein–protein interactions were constructed with exclusively manuallycurated interactions. To experimentally validate the predictedinteractions, an additional level of stringency was added byfocusing on interactions described by classical laboratory tech-nologies. Altogether, the predictions are on the basis of 16,187human protein–protein interactions involving 6,378 distinct humanproteins and 2,539 virus–host protein–protein interactions involving434 viral proteins and 1,395 human proteins (supplementaryTables S6 and S7 online).Score definition. All putative interactors were individuallyscored according to several criteria. First, an interactor is moreconfident when predicted by several independent NS1 homo-logues. Interactors independently predicted by both viral and

human homologues are favored. The prediction from viralhomologues only is also better considered than the predictionfrom human homologues only. Finally, as a correlation wasobserved between the score and the degree of a putative interactor(that is, the number of partners in the protein interaction data set),the score was weighted by the degree. Altogether, the scoreformula is as follows: score (putative interactor¼ [(nH

2/KH)þ (nV2/

kV)]� a, with: (i) nH and nV number of distinct human interactorsand viral interactors, respectively. (ii) kh and kV degree of theprotein in the human protein–protein interaction data set and inthe virus–human protein–protein interaction data set, respectively.(iii) a¼ 3 for a prediction from both human and virus–human datasets, a¼ 2 from only virus–human data set, 1 else. (iv) H: human,V: virus.Cells and virus. A549 human lung epithelial cell line and Madin-Darby canine kidney (MDCK) cell line were grown in DMEMmedia supplemented with 100 U ml�1 penicillin/streptomycinand 10% fetal calf serum at 37 1C and 5% CO2. The epidemicA/H1N1/New Caledonia/2006 strain was propagated in MDCK

Viral protein

Human protein

Human protein

Human protein

Human protein/ host factor

Human protein/ host factor

Influenza-humanprotein–protein interaction

Influenza-humanprotein–protein interaction

Influenza-humanprotein–protein interaction

Literature

Prediction

Both

PPP2R5C

NUP54

NR0B2

GLYAT SSBP2

ACOT9SNAPC4

DNM2ATM

LY6D

KIAA0586

UROS NCOA2

PCBD1YIPF6

BCAP29OLA1

PNRC2MCM2

MED1

ABLIM1LRRFIP1

MCM5GSN

RNF5MCM7

C7

UXS1SLPIPTPMT1

C14orf166

POLR2A SECISBP2 MCM3 NQO2

NRBF2

GLYR1MECR COMMD1MCM4C10orf35

NUP214 EEF1D CHD6

CHMP6 ATP6V1G1

PIGQ

PCBP1 DST

NCAPH2

CDC42EP4

PSMA7MRI1

TMEM86B

PA

COL4A3BP

NA

CBS

SETBP1

CRX

BECN1

CREB3

MT2A

NOMO3

GLUL MAVS

MZ HSP90AA1

RBPMSTCF1Z

TRAF1MIPOL1MAGEA2B

TFCP2EXOSC8

USHBP1

MAGEA11

CEP70 GMCL1

DVL3ZBTB25

MPIMEOX2HA

CBLLNX2

DDB1DVL2

NPM1 FXR2 BANP

BHLHE40KPNA1

KPNA6

KPNA2

MAPK9

TUBA1A

DOCK8OCDC33

XPO1

ATL1

HTATSF1 SHC1KPNA4

BRAF

IGF2BP3

HNRNPA2B1KHDRBS3

KIAA1967

AKT2

LGALS3BP

NCL LINC00525

ISG15

CRKL

NBPF8

CRKELAVL1

CSDA

HNRNPUL1

KPNA5

HRNR

YBX1

EDF1HNRNPH1

UPF1

CTSBDHX9

ERH

GOLGA2

CPSF4

MOV1O

RPS5

RPLP1

TARBP2RPS24GAS8

WDR6

HNRNPA0

PCID2

HLA-B

PABPC4

STX5

CSAG2YWHAQ

IRS4 YWHAEHNRNPA1SLC25A5SEPT1

MTAP NEFHHNRNPHNRNPTDABPC1

C22orf2SLC25ARALY

NS1

MATR3

PIK3R2

HNRNPRRPS10

PIK3R1RBMX

TANKHNRNPH3LZTS2

HNRNPUHNRNPAB

YWHABPTBP1RAE1

PIK3CA

TUBB2OUMO1

YWHAZ

NISCHRP52

ILF2IKBKB

EIF2AK2TRIM25

ATXN2L

KIAA1143

HNRPDL

YWHAG

HNRNPA3

NXTJPIK3CBAXIN2

DDX58

HNRNPLTUBB

EEF1A2

STAU1

SLC25A4SSX2IP

IGF2BP2HNRNPFDHX15

HIST1H4BPCBP2

IGF2BP1DHX30

LYPLA1HNRNPKRPS3

IL6R

SP100

HMBOX1PIK3R3TRIM27

NXF1SYNCRIP

ZMAT3

PABPN1

IR51

ILF3UBE2I

KHDRBS1 IRS2 RAF1ZNF346EEF1A1

BAXBID

BAK1 CDC25B

CDK11BCHUKBAD

TSC2 IKBKE

TAF1ZMAT4DDX39B

M1

KPNA3

ACTB IL10RA

MLH1C16orf45

CLNS1A

BBC3

UCP3

PRKRA

LIFR

USP8LARP1

PI4KB

BCL2L11PB1

RALGPS2EDC3FKBP8

MAGED1

CEP152

PA2G4

SIAH1IPO5

TRIP6

PLAC8

FTH1SLC16A9

IKZF3

GABPB1

BRD8

DDX5 RPL11RP59 BNIP1

SDCBP2ELP4

ZBTB1 IPO9

DAZAP2 TTC12

CD74

PPIAP22 CDC42

C1QAVPS28

HSPA8ITM2B

GNB2L1

NBPF22P

FUSIGHM

NRF1 RPL8

TAF6 GRB10NAP1L1

NP

ZXDCHSPD1

POLR3F TXNIP STIP1

PB2OCT2

CCDC172

TACC1

CAPN2C9orf135

DNAJB13

GCOM1

NPC2

CES1

HSP90AB1

DUSP14UBC

ASXL1

FAM156BC1QBP

HOOK1

FAM96B

PPIA

NSMCE4A

TRAF2

ALDH1L1

POMPDCAF6

DNAJB5MYCDDX54

AKAP1

MVPANXA1

FANCGC1orf94

ZNF251 FLOT1

ANKRD18CPCHAF1A

TSNAXIP1 EIF3L

MCM8CDK9

CMTM5

POLR2B

KIF6DNAJB1

CALCOCO1PCNAPLA2G4A

PNMA1NF74

TRIM29

BCRDCTN1

HSPA1A

PCCASMURF2ZOCHC17

CRYABNCC27

NMIFKBP5RABGEF1

COPS5KCNRG

BPIFB1

MNAT1

PSMD8

SMARCB1BLZF1

NCOA7COL7A1

NS2

DYNLL2

AIMP2

NUP98

Fig 4 | Influenza–human interactome combining predicted and literature data. Network composed of 515 interactions involving 10 influenza viral

proteins and 364 human proteins. One-hundred and six interactions were obtained from predictions, 407 from literature and 2 from both. Forty-seven

of 364 human proteins have been identified to be host factors directly controlling viral replication (literature and this work).

Prediction of virus–host protein interactions

B. de Chassey et al scientificreport

5&2013 EUROPEAN MOLECULAR BIOLOGY ORGANIZATION EMBO reports

Page 6: Structure homology and interaction redundancy for …...Structure homology and interaction redundancy for discovering virus–host protein interactions Benoıˆt de Chassey 1,Laure`ne

cells in DMEM supplemented with 1 mg ml�1 modifiedtrypsin TPCK in absence of fetal calf serum. Virus stocks weretitrated by standard plaque assay on MDCK cells using an agaroverlay medium.GST pull down. Cellular proteins (corresponding to the 32 cDNAsavailable in orfeotech ORFeome v3.1 [26], description of thehORFeome v3.1 in supplementary information online) wereamplified from the MGC cDNA plasmid collection using KODpolymerase (Toyobo), cloned by recombinational cloninginto pDONR207 (Life technologies) and transferred to pDEST-myc or pCIneo3xFlag (kind gift of Y. Jacob). H5N1 NS1 wastransferred into pDEST27 (GST fusion in N-term). A total of 4.105

HEK293T cells were then co-transfected (6 ml JetPei, Polyplus)with 1.5 mg of each pair of plasmid. Two days after transfection,cells were harvested and lysed (0.5% NP-40, 20 mM Tris–HCl (pH8.0), 150 mM NaCl, 1 mM EDTA and Roche complete proteaseinhibitor cocktail). Soluble protein complexes were purified byincubating 300 mg of cleared cell lysate with 40 ml glutathionesepharose 4B beads (GE Healthcare). A total of 50 mg of clearedcell lysate was separated on SDS–PAGE and transferred tonitrocellulose membrane. GST-tagged viral proteins and Myc or3XFlag-tagged cellular proteins were detected using standardimmunoblotting techniques with anti-GST (Covance), anti-Myc oranti-Flag (Sigma) monoclonal antibodies.Yeast two-hybrid. Pairwise Y2H interactions were analysedby yeast mating, using Y187 and AH109 yeast strains (Clontech).Prey and bait vectors were transformed into Y187 and AH109.Bait and prey strains were mated in an all-against-all array andplated on a selective medium lacking histidine and supplementedwith increasing concentrations of 3-amino-triazole to test theinteraction-dependent transactivation of HIS3 reporter gene.Some of the preys are membrane proteins but reported in theliterature to give positive results in Y2H (for example, BCL2L11,BAD) or proteins whose signal peptide is likely to be masked(supplementary information online). Therefore, we did notexclude them from the Y2H.siRNA screening. Five picomoles of each siRNA (stealth selectRNAi, Invitrogen) was arrayed in 96 plates in 10 ml of OptiMEM(GibCo). Ten microliters per well of a mix lipofectamineRNAiMAX (Invitrogen)-OptiMEM (0.2ml of lipofectamine RNAi-MAX in 10 ml of OptiMEM) was added. After 20 min roomtemperature-incubation, siRNA–lipofectamine mixes were addedto 30� 103 A549 suspension cells. Cells were incubated for 48 hat 37 1C and 5% CO2 before influenza virus infection atmultiplicity of infection 0.5. Forty-eight hours after infection,supernatants were titrated.Virus infection. siRNA-transfected cells were washed twicewith Dulbecco’s phosphate-buffered saline 1� and infectedwith H1N1 influenza strain at multiplicity of infection 0.5 inDulbecco’s modified Eagle medium supplemented with0.2 mg ml�1 trypsin TPCK (infection medium). After 60 min at37 1C, the inoculum was discarded and cells were washed againand incubated for 48 h in infection medium at 37 1C and 5% CO2.Titre measure by neuraminidase activity. Influenza virusneuraminidase is able to cleave the methyl-umbelliferyl-N-acetylneuraminic acid (4-MUNANA, Sigma) modifying itsemission wavelength in a dose-dependent manner. In 96-blackplate, 25ml infection supernatants were diluted in 25 mlDulbecco’s phosphate-buffered saline 1� containing calcium

and magnesium and 50 ml of 20 mM 4-MUNANA. After 1 hincubation at 37 1C, 100 ml of glycine 0.1 M 25% ethanolpH 10.7 was added. Measures were done with TECANinfinite M1000 instrument at 365 nm excitation and 450 nmemission wavelengths.

Plaque-forming unit assay. To quantify infectious virus particlesin infected cell culture supernatants, 300,000 MDCK cellswere seeded in 6-well plates. Three days later, cells were washedtwice with DMEM supplemented with 100 U ml� 1 penicillin/streptomycin. Dilutions of infected cell culture supernatants weredispensed on the cells. After 1 h 30 min incubation at 37 1C and5% CO2 cells were washed again and covered with overlaymedium containing MEM 1� , 1% agarose (Lonza) and modifiedtrypsin TPCK 1 mg ml� 1. Plates were incubated upside down at37 1C and 5% CO2 up to 72 h. Cells were then fixed with formol10% and coloured with crystal violet 0.3% in methanol 20%and lysis plaques were counted.

Functional annotation. DAVID functional annotation chart toolwas used to perform enrichment analysis, in which an annotationterm was considered as significant with a Benjamini–Hochbergcorrected P-value smaller than 0.05 [27].

Supplementary information is available at EMBO reports online(http://www.emboreports.org).

ACKNOWLEDGEMENTSThe authors thank Liisa Holm for her technical help on Dali.This work was supported by the European Union’s 7th Program(FP7/2007–2013) under grant agreement no. 267429 (SysPatho) andEuropean Community’s 7th Framework Program (FP7/2007–2013)and Cosmetics Europe under grant agreement no.266753 (SCR&Tox).

Author contributions: B.d.C conceived the method; B.d.C. and V.L.supervised the experiments; B.d.C., L.M.-S. and V.N. designed andperformed bioinformatic studies; A.A.-G., T.C. and B.d.C. designedand performed biological studies; P.A., A.A.-G., B.d.C., V.L. andL.M.-S. analysed the data; B.d.C., V.L. and L.M.-S. wrote the manuscript.

CONFLICT OF INTERESTThe authors declare that they have no conflict of interest.

REFERENCES1. Chatr-aryamontri A et al (2009) VirusMINT: a viral protein interaction

database. Nucleic Acids Res 37: D669–D6732. Navratil V, De Chassey B, Meyniel L, Delmotte S, Gautier C, Andre P,

Lotteau V, Rabourdin-Combe C (2009) VirHostNet: a knowledge base forthe management and the analysis of proteome-wide virus-host interactionnetworks. Nucleic Acids Res 37: D661–D668

3. de Chassey B et al (2008) Hepatitis C virus infection protein network. MolSyst Biol 4: 230

4. Jager S et al (2012) Global landscape of HIV-human protein complexes.Nature 481: 365–370

5. Braun P et al (2009) An experimentally derived confidence score forbinary protein-protein interactions. Nat Methods 6: 91–97

6. Jessulat M et al (2011) Recent advances in protein-protein interactionprediction: experimental and computational methods. Expert Opin DrugDiscov 6: 921–935

7. Mukhopadhyay A, Maulik U, Bandyopadhyay S (2012) A novelbiclustering approach to association rule mining for predictingHIV-1-human protein interactions. PLoS One 7: e32289

8. Aloy P, Russell RB (2002) Interrogating protein interaction networksthrough structural biology. Proc Natl Acad Sci USA 99: 5896–5901

9. Davis FP, Braberg H, Shen MY, Pieper U, Sali A, Madhusudhan MS(2006) Protein complex compositions predicted by structural similarity.Nucleic Acids Res 34: 2943–2952

Prediction of virus–host protein interactions

B. de Chassey et alscientificreport

6 EMBO reports &2013 EUROPEAN MOLECULAR BIOLOGY ORGANIZATION

Page 7: Structure homology and interaction redundancy for …...Structure homology and interaction redundancy for discovering virus–host protein interactions Benoıˆt de Chassey 1,Laure`ne

10. Tuncbag N, Gursoy A, Nussinov R, Keskin O (2011) Predicting protein-protein interactions on a proteome scale by matching evolutionary and

structural similarities at interfaces using PRISM. Nat Protoc 6: 1341–135411. Murzin AG, Brenner SE, Hubbard T, Chothia C (1995) SCOP: a structural

classification of proteins database for the investigation of sequences and

structures. J Mol Biol 247: 536–54012. Bornholdt ZA, Prasad BV (2008) X-ray structure of NS1 from a highly

pathogenic H5N1 influenza virus. Nature 456: 985–98813. Holm L, Park J (2000) DaliLite workbench for protein structure

comparison. Bioinformatics 16: 566–56714. Holm L, Kaariainen S, Wilton C, Plewczynski D (2006) Using Dali for

structural comparison of proteins. Curr Protoc Bioinformatics Chapter 5

Unit 5.515. Schultz-Cherry S, Dybdahl-Sissoko N, Neumann G, Kawaoka Y,

Hinshaw VS (2001) Influenza virus ns1 protein induces apoptosis in

cultured cells. J Virol 75: 7875–788116. Zhirnov OP, Konakova TE, Wolff T, Klenk HD (2002) NS1 protein of

influenza A virus down-regulates apoptosis. J Virol 76: 1617–162517. Ehrhardt C, Wolff T, Pleschka S, Planz O, Beermann W, Bode JG,

Schmolke M, Ludwig S (2007) Influenza A virus NS1 protein activates the

PI3K/Akt pathway to mediate antiapoptotic signaling responses. J Virol

81: 3058–306718. Lam WY, Tang JW, Yeung AC, Chiu LC, Sung JJ, Chan PK (2008) Avian

influenza virus A/HK/483/97(H5N1) NS1 protein induces apoptosis in

human airway epithelial cells. J Virol 82: 2741–2751

19. Lu Y, Wambach M, Katze MG, Krug RM (1995) Binding of the influenzavirus NS1 protein to double-stranded RNA inhibits the activation of theprotein kinase that phosphorylates the elF-2 translation initiation factor.Virology 214: 222–228

20. Talon J, Horvath CM, Polley R, Basler CF, Muster T, Palese P,Garcia-Sastre A (2000) Activation of interferon regulatory factor 3 isinhibited by the influenza A virus NS1 protein. J Virol 74: 7989–7996

21. Pichlmair A, Schulz O, Tan CP, Naslund TI, Liljestrom P, Weber F,Reis e Sousa C (2006) RIG-I-mediated antiviral responses tosingle-stranded RNA bearing 5’-phosphates. Science 314:997–1001

22. Gack MU et al (2009) Influenza A virus NS1 targets the ubiquitin ligaseTRIM25 to evade recognition by the host viral RNA sensor RIG-I.Cell Host Microbe 5: 439–449

23. Pleschka S (2008) RNA viruses and the mitogenic Raf/MEK/ERK signaltransduction cascade. Biol Chem 389: 1273–1282

24. Meyniel-Schicklin L, de Chassey B, Andre P, Lotteau V (2012) Virusesand interactomes in translation. Mol Cell Proteomics 11: 014738

25. Cusick ME et al (2009) Literature-curated protein interaction datasets.Nat Methods 6: 39–46

26. Lamesch P et al (2007) hORFeome v3.1: a resource of human openreading frames representing over 10,000 human genes. Genomics 89:307–315

27. Huang da W, Sherman BT, Lempicki RA (2009) Bioinformaticsenrichment tools: paths toward the comprehensive functional analysisof large gene lists. Nucleic Acids Res 37: 1–13

Prediction of virus–host protein interactions

B. de Chassey et al scientificreport

7&2013 EUROPEAN MOLECULAR BIOLOGY ORGANIZATION EMBO reports