Implementing Open Science in...
Transcript of Implementing Open Science in...
@openaire_eu
Implementing Open Science in EOSCPutting the puzzle together
Creating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
Natalia ManolaOpenAIRE Managing Director
Athena Research & Innovation Center
Paolo ManghiOpenAIRE Technical Director
CNR-ISTI
Open Access to publicationsOpen / FAIR dataOpen SoftwareLinked Open Science (Provenance)Open methodology (Open peer review)Access to resources for analyticsAccess by non-academics
Open Science
Creating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
… practice science in such a way that others can collaborate and contribute, where research
data, lab notes and other research processes are freely
available, under terms that enable reuse, redistribution and reproduction of the research and its underlying data and methods.
open and reproducible science
scientific/scholarly communication
data infrastructuresocial + technical links
service + data interoperability
A key pillar of EOSC
Bridging the worlds where science is performed and
science is published
Creating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
Scholarly communication services
(sharing, evaluating, monitoring science)
Research infrastructure services, i.e. digital labs
(performing science)
E-infrastructure(enabling digital services for science)
EOSC as a facilitator of Open Science
?? ?
?Services
Actors
Creating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
Scholarly communication services
(sharing, evaluating, monitoring science)
Research infrastructure services. i.e. digital labs
(performing science)
E-infrastructure(enabling digital services for science)
EOSC as a facilitator of Open Science
Architecture
Functionality
Participation rules
Practices
Quality
Interoperability
Economy of scale Sustainability
Scholarly communication services
(sharing, evaluating, monitoring science)
Creating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
EOSC, Open Science and data
Creating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
Small data, Big data
Creating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
Small Data Big Data
Data Source Accessible, informative, actionable
No traditional data processing
Volume < 1 TB Terra and Exascale
Velocity Controlled and steady data flow
Very fast SpeedFast accumulation
Variety Structured data High Variety Data Sets
Veracity Less noise as controlled collection
Rigorous data validation required before processing
Value Business intelligence, analysis, reporting
Data Mining for prediction, pattern finding, etc.
Time Variance Historical data equal valued In some cases data gets old
Data Location Databases, local servers Distributed storages on Cloud
Infrastructure Predictable resource allocation
Agile Infra, with horizontally scalable architecture
Differences in: Collection, Processing, Scalability, Modeling, Storage & Computation Coupling, Data Science, Data Security
…small data will increasingly be made more big data-like through the development of new data infrastructures that pool, scale and link small data in order to create larger datasets,encourage sharing and reuse, and open them up to combination with big data and analysis using big data analytics
Small data combined needs big data infrastructure
EOSC deconstructed
Creating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
Network
Storage
Compute
Data Management
Analytics
ACCESS LAYER FAIR data
RegistriesIdentifiersPapersFundingServicesPeopleFacilities
MonitoringKPIsCitationsUsage Stats
Actors
Publishing- Sharing
Interoperability Layer
AAI
ServiceManagem
ent
Data Access
Research in Context
Research Assessment in the heart of Open
Science
Policies Training
Services
Connectingo Establishing
infrastructureo Added value
services
Empowering• Open Science• Open Access• Policies• Services
3 pillars of action
Aligningo Standardo Guidelineso Practiceso Workflows
1. ServicesProviding the glue via
scholarly/scientific communication
Making small data big
Researchdata
Research Software
e-infra Tools &Services
Researchdata
Research process
Research literature:Articles, docs, white papers
011010100110000111010010
011010100110000111010010
Scholarly Communication InfrastructureResearch Infrastructures
Publishing all kinds of products
Enabling Reproducibility
(R*)
Fully-fledged assessment of
science
Fully-fledged scientific reward
Enabling Monitoring
Bridging RIs and OS publishing
practices
Scholarly Communication transition to Open Science
Creating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
Open Science and Scholarly Communication
Researchdata
Research Software
e-infra Tools & Services
Researchdata
Research process
Research literature:Articles, docs, white papers
011010100110000111010010
011010100110000111010010
Scholarly Communication Infrastructure
LiteratureRepository
011010100110000111010010
DataRepository
SoftwareRepository
011010100110000111010010
011010100110000111010010
“Experiment”Repository
citation
part
Of
part
Of
Provenance: e.g. created by
Research Infrastructures
Creating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
OpenAIRE ResearchGraph
Materializing the Open Research Graph
Project community
FunderFunding
Product
PublicationResearch
DataSoftware
Organization
Source
Other res. products
MiningHarvestingDeduplication
• Harvested data sources10K +
• Harvested records450Mi +
• Publication full-texts10.5Mi+
• Harvested/mined links340Mi +
Creating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
PeopleServicesFacilities…Including CitationsUsage Stats
Providing an open metadata research graph of interlinked
scientific products, with Open Access information, linked to
funding information and researchcommunities
The OpenAIRE research graphOpen
Complete
De-duplicated
Transparent
Participatory
Decentralized
TrustedCreating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
Added value servicesDiscovery, monitoring, assessment of researchLinks to non-academic infras
Strategic for Open Science Making the research graphan EOSC resourceOpen, Trusted, Complete, De-duplicated, Participatory, Transparent, Decentralized
ActorsInstitutions, research organizations, funders, content
providers, researchers, SMEs, etc.
Creating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
Complete aggregationcoverage
Academic Graph
Project community
FunderFunding
Product
PublicationResearch
DataSoftware
Organization
Source
Other res. products
… and more… and more
… and more
… and more
… and more
… and more
Creating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
Transition from OA content acquisition policies to OS content acquisition policiesnumbers from: explore.openaire.eu and beta.explore.openaire.eu
literature-researchdata links
Open Access PDFs for mining
120Mi
10Mi+0
10000000
20000000
30000000
40000000
50000000
60000000
70000000
80000000
90000000
100000000
old CAP new CAP
literature
0
500000
1000000
1500000
2000000
2500000
3000000
3500000
4000000
4500000
5000000
old CAP new CAP
research data
0
20000
40000
60000
80000
100000
120000
140000
160000
old CAP new CAP
software
0
500000
1000000
1500000
2000000
2500000
3000000
3500000
4000000
4500000
5000000
old CAP new CAP
other
26Mi
94Mi
1M
8Mi
95K
192K3.6Mi
7.5Mi
225Mi inferred links:Article-projectArticle-article
Article-softwareArticle-community
Ecc.Creating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
Services for all stakeholders
Creating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
Funders, institutions, RIs, initiatives, 3rd parties
Content providers, Research Infras
Researchers, scientists
Support
Accelerate
Monitor
2. Support and trainingProviding the human aspects
Making the local global
Creating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
National
Global
Thematic
Social/Training Technical/Services
3 levels of operation
Creating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
34 countriesà Key national organizations
4 regional area coordinators3 coordinators for
o Policieso RDMo Legal
National Open Access Desks (NOADs) A pan-European network to address diversity in culture & maturity of national/local infras
National Strategy
Outreach Support Training Policy
NOADs: A key vehicle in policies and training
Creating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
Ground work for OS and EOSC10 national workshops 1048 participants170 conferences attended, presented in 969 funder mandates4109 repositories, 1720 OA journals contacted
2018
HELPDESK• Ask a question• FAQs
RESOURCES• OA guides• Copyright issues• Factsheets
TRAINING• Webinars• Workshops
Support and Training
Creating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
Distributed and hierarchical training: train-the-trainersNOADs ⇢ National / research infras, organizations ⇢ Researchers
45 webinars 2790 participants55 f2f training events 1637 participants
8 train-the-trainer events 155 OS trainers
2018
• Rules: Open Science policies• Practices: Openness and FAIRness RDM• Technical: APIs (ResourseSync, schema.org),
OpenAIRE Guidelines for Content Providers (metadata)
Cross infrastructure OS trainingIt’s all about synergies
Creating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
Community of practice for training the trainers
Thank you!
Creating Platform-Driven E-Infrastructure Innovation On EOSC | July 10, 2019
Natalia [email protected]