Ict4 d rhul talk

Research Data Science and its potential impact in Low and Middle Income countries Hugh Shanahan [email protected] @hughshanahan


Page 1: Ict4 d rhul talk

Research Data Science and its potential impact in Low and Middle Income countries

Hugh Shanahan [email protected]

@hughshanahan

Page 2: Ict4 d rhul talk

Opportunities for LMIC Researchers from the Data Deluge

In many research fields, data becoming more open

Public data gradually becoming more open as well

Placement of sites in LMIC - Square Kilometre Array

Page 3: Ict4 d rhul talk

Example - Bioinformatics

PBytes of Omic data freely available

Good basic Science can be done just by analysing these data sets

Clear results from test species could be applied to local species

Page 4: Ict4 d rhul talk

Square Kilometre Array

Array of Radio Telescopes

Most of these to be in South Africa

South Africa wants to own this, not just be the site

1 PByte of data per day
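
To put the 1 PByte per day figure in perspective, the following is a rough back-of-envelope sketch, not part of the original talk. It assumes decimal units (1 PByte = 10**15 bytes); the function names and the 1 Gbit/s link speed are hypothetical and chosen only for illustration.

```python
# Back-of-envelope: what does "1 PByte of data per day" imply?
# Assumption: decimal units, 1 PByte = 10**15 bytes.
SECONDS_PER_DAY = 24 * 60 * 60
PBYTE = 10**15


def sustained_rate_gbytes_per_s(pbytes_per_day: float) -> float:
    """Average throughput (GBytes/s) needed to keep up with a daily volume."""
    return pbytes_per_day * PBYTE / SECONDS_PER_DAY / 10**9


def days_to_transfer(pbytes: float, link_gbits_per_s: float) -> float:
    """Days needed to move a volume over a link of the given speed (Gbit/s)."""
    bytes_per_s = link_gbits_per_s * 10**9 / 8
    return pbytes * PBYTE / bytes_per_s / SECONDS_PER_DAY


rate = sustained_rate_gbytes_per_s(1.0)
print(f"Sustained rate: {rate:.1f} GBytes/s")

# Hypothetical 1 Gbit/s link, for illustration only.
days = days_to_transfer(1.0, 1.0)
print(f"One day's data over a 1 Gbit/s link: {days:.0f} days")
```

Under these assumptions, a single day's output would take roughly three months to move over a 1 Gbit/s link, which is one way to see why the analysis needs to sit close to the data.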

Page 5: Ict4 d rhul talk

Challenges

Absence/deficit of infrastructure

Equipment / Electricity / IT Support

Internet access - though improving

Online collaboration and communities

Absence/deficit of awareness (leading to funding gap)

Absence/deficit of education and training

Page 6: Ict4 d rhul talk

Solutions

Provide training through a Summer School system to create a cohort of professionals who are cognisant of Research Data Science

Provide access to cloud computing resources that have the data

Page 7: Ict4 d rhul talk

Organisations

Co-chair of Working Group RDA/CODATA Summer Schools in Data Science and Cloud Computing in the Developing World

RDA - young organisation (<3 years) for Data sharing

CODATA - 40 year old organisation with deep interest in developing world research

Page 8: Ict4 d rhul talk

Schools in Research Data Science

Give attendees an introduction to principles behind Data Science and how it can be applied.

Aim to make this a professional qualification

Focus on standards - avoid reinvention of the wheel

sharing data

Page 9: Ict4 d rhul talk

Outline

Vanilla

Flavour

Flavour

Page 10: Ict4 d rhul talk

Curriculum

Vanilla covers the basics for anyone with a BSc/BA

Machine Learning/Statistics

Software Carpentry

Data Carpentry

Infrastructures

Visualisation

Page 11: Ict4 d rhul talk

Flavoured schools

Cover topics that represent issues specific to disciplines

Examples:

Extreme Data

Life Sciences

Databases/Geospatial

Page 12: Ict4 d rhul talk

Partners so far

Page 13: Ict4 d rhul talk

Cloud Computing

Technical solution to infrastructure problem

Cost for an individual LMIC researcher is a barrier

Free-at-the-point-of-use cloud facilities

Page 14: Ict4 d rhul talk

Use case - North African Bioinformatics

Potential user community based in Morocco, Tunisia and Egypt

Suggestion - Federated clouds to find resources in “the cracks”.