Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS,...

20
Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich

Transcript of Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS,...

Page 1: Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.

Analysis of the bread wheat genome using whole-genome shotgun sequencing

Manuel SpannaglMIPS, Helmholtz Center Munich

Page 2: Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.

Wheat - why bother?

① Many varieties incl. bread wheat, durum („pasta“) wheat…

② Third most-produced cereal with 651 millions tons (2010), cultivated worldwide in different climates

③ Leading source of vegetable protein in human food

Page 3: Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.

The Challenge

Page 4: Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.

Wheat – a WGS approachAims and Goals

Page 5: Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.

① 5x 454 WGS sequencing => 85 Gb sequence, 220 million reads

② ~79% of reads repeat-related

③ direct Low-copy-number genome assembly (LCG, Newbler) => collapses many homologous gene sequences

④ to prevent collapsing of homologous gene sequences and reduce complexity => orthologous group assembly at high stringency

Wheat – a WGS approach

Page 6: Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.

① Use fully sequenced and analysed reference genomes (rice, Brachypodium, sorghum)

② Group genes into families (Orthologous Groups)

③ Use the orthologous group representatives as sequence baits to capture corresponding sequence reads.

④ Do sub-assembly for each „orthologous bin“ seperately

WGS assembly using „in silico exon capture“

Page 7: Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.

Bread Wheat Genaology

Page 8: Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.

Ortholome directed assembly circumvents limitations faced by WGS assembly

Page 9: Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.

The ortholome directed assembly delivers ordered segments

Page 10: Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.

The ortholome directed assembly delivers ordered segments II

1 32

Page 11: Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.

Gene Copy Retention after Polyploidization- Calibration of the method-

97% 99% 100%

Maize

Hexaploid Rice„TRice“

Page 12: Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.

Gene Copy Retention after Polyploidization

Page 13: Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.

Gene Copy Retention after Polyploidization

Page 14: Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.

Expanded Wheat Gene Families

Page 15: Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.

Shotguns (Illumina 80x (T.monococcum)) and 454 (3x (Ae.tauschii))

cDNA seq‘s from the Ae. speltoides group (B)

Can A and D genome shotgun data be used to dissect the ABD of wheat?

The Three Nephews: the A, B and D‘s of wheat

Page 16: Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.

The Three Nephews: Similarity on a Sequence Basis

Page 17: Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.

Wheat A, B and D Assignment using Machine Learning (SVM)

Page 18: Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.

Particular Gene Categories are preferentially retained

Page 19: Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.

Franz Marc„Hocken im Schnee“

Almost full gene complement detected and structured

10000s of pseudogenes detected

Separation of A, B and D using machine learning with > 75% accuracy

Complementary to chromosome sorting approaches

Applicable to polyploids in general to get genome overview

Rapid and economic approach to pragmatically cope with limitations in sequence technology

Summary

Page 20: Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.

acknowledgements

MIPSMatthias PfeiferKlaus MayerAll other group members

The UK Wheat ConsortiumMike BevanNeil HallAnthony HallKeith EdwardsRachel Brenchley

CSHLDick McCombie

UC Davis & USDA AlbanyJan DvorakMincheng LuoOlin Anderson

Kansas State UniversityBikram GillSunish Segal

EBIPaul KerseyDan Bolser