Text mining to support the evaluation of research grant ... · JRC Competence centre on text mining...

12
Text mining to support the evaluation of research grant applications Olivier Eulaerts Text Mining & Analysis Competence Centre DG Joint Research Centre European Commission

Transcript of Text mining to support the evaluation of research grant ... · JRC Competence centre on text mining...

Page 1: Text mining to support the evaluation of research grant ... · JRC Competence centre on text mining Support policy makers with text mining tools and services across policy fields.

Text mining to support the evaluation of research grant applications

Olivier EulaertsText Mining & Analysis Competence CentreDG Joint Research CentreEuropean Commission

Page 2: Text mining to support the evaluation of research grant ... · JRC Competence centre on text mining Support policy makers with text mining tools and services across policy fields.

European Commission's science and knowledge service

Support EU policies with independent evidence throughout the whole policy cycle.

Contributing to e.g. a healthy and safe environment, secure energy supplies, sustainable mobility and consumer health and safety.

Page 3: Text mining to support the evaluation of research grant ... · JRC Competence centre on text mining Support policy makers with text mining tools and services across policy fields.

JRC Competence centre on text mining

Support policy makers with text mining tools and services across policy fields.

Text miningData harvesting, processing and visualisation

Computational linguisticScientometrics

IT product development.

Page 4: Text mining to support the evaluation of research grant ... · JRC Competence centre on text mining Support policy makers with text mining tools and services across policy fields.

Europe Media Monitor - see, explore and understand current news reported by world’s online media -monitoring >8000+ news sources - 70 languages -advanced information extraction techniques -automatically determines what is being reported, where things are happening, who is involved, what they said.

EMM OSINT Suite - desktop software application - find, acquire, extract and analyse information from the Internet and local sources - contains tools to automate various tasks in the process of gathering intelligence from open sources.

TIM innovation suite – tools to explore and map technological development – bridging patent data, scientific publications data, grant data – technology monitoring and detection of trends.

Page 5: Text mining to support the evaluation of research grant ... · JRC Competence centre on text mining Support policy makers with text mining tools and services across policy fields.

https://conservationbytes.com/2015/05/04/twenty-tips-for-writing-a-research-proposal/

What is the issue?

Page 6: Text mining to support the evaluation of research grant ... · JRC Competence centre on text mining Support policy makers with text mining tools and services across policy fields.

What is the issue?

Duplication of research grants

Difficulty to detect scientific overlap in research grant applications and grants

No means to detect applications for grants submitted to two or more different funding sources

No means to spot resubmission of past failed applications

Page 7: Text mining to support the evaluation of research grant ... · JRC Competence centre on text mining Support policy makers with text mining tools and services across policy fields.

What is the issue?

Study in US

Funding agencies urged to check for duplicate grants, Nature, January 2013, volume 493.

Reviewing US grant applications in publicly accessible databases. 1,300 applications with potential overlap (over 850,000 applications). 167 pairs very similar.

~$70 million in overlapping funds may have been awarded over the period 2002-2012

Europe?

No such study. Or is there?

European context

28 (fragmented) public funding systems for research in Member States,

Funding for research at international level (fragmented).

14 (fragmented) public funding systems in H2020 associated countries

Page 8: Text mining to support the evaluation of research grant ... · JRC Competence centre on text mining Support policy makers with text mining tools and services across policy fields.

JRC contribution

Semantic similarity platform for research grants applications

To give evaluators the means to compare incoming applications to a corpus of grants and other relevant documents

• Increase quality of applications• Decrease duplication

To give applicants the possibility to retrieve previous similar grants

• Increase quality of incoming applications

To give non-public funding entities means to better assess the quality of submitted projects

To support the dialogue between the European Commission and funding agencies on data standards

Using (open) grant data from MS, Commission, and other sources.(Legal support from Central IP service of the Commission).

Page 9: Text mining to support the evaluation of research grant ... · JRC Competence centre on text mining Support policy makers with text mining tools and services across policy fields.

Evaluation process

Semantic comparison Module + Entity matching module

Index of grants data + other relevant data

English text in user interface

Indexing

Grants data from funding agencies

Translation

Expert evaluation

Application

Semantically similar grants, publications…

+

Flagging of application/grant pair with similar applying entities

Page 10: Text mining to support the evaluation of research grant ... · JRC Competence centre on text mining Support policy makers with text mining tools and services across policy fields.

Technical feasibility

It is working on my machine…

Most similar patents to a grant on hydraulic actuators.(Grant from National Research, Development And Innovation Office of Hungary).

Most similar EU grants to a grant related to nanotechnology. (Grant from FP7).

Most similar publications with a proposal on oxidative catalysis using metalloenzyme.(Grant from National Research, Development And Innovation Office of Hungary).

Page 11: Text mining to support the evaluation of research grant ... · JRC Competence centre on text mining Support policy makers with text mining tools and services across policy fields.

What is next?

Now: Finalising proposal before submitting for ISA² funding

Public funding agencies from Spain (FECYT, SEIDI) and Hungary (NKFIH) ready to take part.

Looking for 2 additional MS to be represented for proof-of-concept phase.

2018: Proof-of-concept

User requirements (MS funding agencies + EC) Development pilot platform + networking Evaluation by evaluators Go/no-go

(2019-2020: Full deployment)

Page 12: Text mining to support the evaluation of research grant ... · JRC Competence centre on text mining Support policy makers with text mining tools and services across policy fields.

On JRC: https://ec.europa.eu/jrc/en

On TIM: www.timanalytics.eu

On EMM: http://emm.newsbrief.eu/NewsBrief/clusteredition/en/latest.html

For any inquiry: [email protected]

Or

[email protected]

Thank you for your attention!