SolGS Hyderabad conference 2016

53
solGS: A Web-based Genomic Selection Analysis Tool Isaak Y Tecle, Naama Menda, Guillaume Bauchet, Lukas Mueller

Transcript of SolGS Hyderabad conference 2016

Page 1: SolGS Hyderabad conference 2016

solGS: A Web-based Genomic Selection Analysis

Tool Isaak Y Tecle, Naama Menda,

Guillaume Bauchet, Lukas Mueller

Page 2: SolGS Hyderabad conference 2016

Websites with solGS…

Page 3: SolGS Hyderabad conference 2016
Page 4: SolGS Hyderabad conference 2016
Page 5: SolGS Hyderabad conference 2016

Phenotyped &

genotyped individuals

Genomic selection…

Prediction model

Predicted breeding

Values (GEBVs)

Genotyped selection candidates

Training population

Page 6: SolGS Hyderabad conference 2016

Genomic Selection advantages… Little or no phenotyping

reduced cost Shorter breeding cycles Higher selection gain per unit time

Increased prediction accuracy

Page 7: SolGS Hyderabad conference 2016

Genomic Selection challenges…

‘Big data’ Data organization, cleaning, imputation

Data storage and accessibility Raw data and results visualization and sharing

Statistical analysis complexity

Page 8: SolGS Hyderabad conference 2016

solGShttp://cassavabase.org/solgs

Page 9: SolGS Hyderabad conference 2016

Data storage…

Jung et.al., 2011. Database.

Chado schema

Page 10: SolGS Hyderabad conference 2016

Data access interfaces

Search wizard

Page 11: SolGS Hyderabad conference 2016

pre-modeling data processing

Page 12: SolGS Hyderabad conference 2016

Phenotype data processing…

Missing phenotype data handling

Adjusts phenotype means for environmental effects lme4

Combines multiple trials

Page 13: SolGS Hyderabad conference 2016

Genotype data processing Filters out

monomorphic markers markers with > 60% missing values markers with MAF < 5% individuals with > 80% missing values

Imputes missing marker data Median substitution

Genotype coding [-1, 0, 1], [0, 1, 2]

Isaak Tecle
Page 14: SolGS Hyderabad conference 2016

Prediction modeling

Page 15: SolGS Hyderabad conference 2016

statistical modeling

Univariate Two-stage analysis RR-BLUP

Endelman, Plant Genome (2010) GBLUP

Marker-based realized relationship matrix

Prediction accuracy Based on 10-fold cross-validation

Isaak Tecle
Page 16: SolGS Hyderabad conference 2016

Use case

Page 17: SolGS Hyderabad conference 2016

Creating a training dataset

Page 18: SolGS Hyderabad conference 2016
Page 19: SolGS Hyderabad conference 2016

Creating a custom training dataset…

Page 20: SolGS Hyderabad conference 2016
Page 21: SolGS Hyderabad conference 2016
Page 22: SolGS Hyderabad conference 2016
Page 23: SolGS Hyderabad conference 2016

Building a prediction model

Page 24: SolGS Hyderabad conference 2016
Page 25: SolGS Hyderabad conference 2016
Page 26: SolGS Hyderabad conference 2016

Exploring model input

Page 27: SolGS Hyderabad conference 2016
Page 28: SolGS Hyderabad conference 2016
Page 29: SolGS Hyderabad conference 2016
Page 30: SolGS Hyderabad conference 2016

Exploring model accuracy

Page 31: SolGS Hyderabad conference 2016
Page 32: SolGS Hyderabad conference 2016

Exploring model output

Page 33: SolGS Hyderabad conference 2016
Page 34: SolGS Hyderabad conference 2016
Page 35: SolGS Hyderabad conference 2016
Page 36: SolGS Hyderabad conference 2016

Estimating breeding values of selection

candidates

Applying the model…

Page 37: SolGS Hyderabad conference 2016
Page 38: SolGS Hyderabad conference 2016
Page 39: SolGS Hyderabad conference 2016

Selection gain?

Page 40: SolGS Hyderabad conference 2016
Page 41: SolGS Hyderabad conference 2016

Prediction modeling for multiple traits

Page 42: SolGS Hyderabad conference 2016
Page 43: SolGS Hyderabad conference 2016
Page 44: SolGS Hyderabad conference 2016

Estimating breeding values of a selection candidates for multiple traits

Applying the models…

Page 45: SolGS Hyderabad conference 2016
Page 46: SolGS Hyderabad conference 2016

Estimating genetic correlations

Page 47: SolGS Hyderabad conference 2016
Page 48: SolGS Hyderabad conference 2016

Calculating selection indices

Page 49: SolGS Hyderabad conference 2016
Page 50: SolGS Hyderabad conference 2016

To sum up…solGS Stores data Builds prediction models Estimates breeding values Additional analyses:

Correlation analysis Population structure Selection indices Genetic gain

Open source Organism agnostic

Page 51: SolGS Hyderabad conference 2016

Thanks to…

Page 52: SolGS Hyderabad conference 2016
Page 53: SolGS Hyderabad conference 2016

Many thanks!!