![Page 1: Use of genotyping technologies to identify tree variabilityPINUS GENOMES Nuclear: • All diploids (n=12) • Massive genomes: 28 pg (Arabidopsis 0.2 pg) • High levels of intra-specific](https://reader033.fdocuments.net/reader033/viewer/2022042221/5ec76408b075612ca66dd882/html5/thumbnails/1.jpg)
Use of genotyping technologies to
identify tree variability
Valerie Hipkins, NFGEL Director
The National Forest Genetics Laboratory
![Page 2: Use of genotyping technologies to identify tree variabilityPINUS GENOMES Nuclear: • All diploids (n=12) • Massive genomes: 28 pg (Arabidopsis 0.2 pg) • High levels of intra-specific](https://reader033.fdocuments.net/reader033/viewer/2022042221/5ec76408b075612ca66dd882/html5/thumbnails/2.jpg)
Outline
• Ponderosa Pine (Pinus ponderosa)
  - Source ID
  - Building a database
  - Modeling unsampled areas
• Golden Chinquapin (Chrysolepis chrysophylla)
  - Assessing rangewide variation
  - Building database for management decisions to drive conservation
  - Marker choice
• Mulanje Cedar (Widdringtonia whytei)
  - Matching seed to origin
  - Marker choice
  - DNA isolation from wood
![Page 3: Use of genotyping technologies to identify tree variabilityPINUS GENOMES Nuclear: • All diploids (n=12) • Massive genomes: 28 pg (Arabidopsis 0.2 pg) • High levels of intra-specific](https://reader033.fdocuments.net/reader033/viewer/2022042221/5ec76408b075612ca66dd882/html5/thumbnails/3.jpg)
CONTEXT TO ALL PROJECTS
• QUESTION TO BE ANSWERED
Species
Scope (taxonomic, source, individual ID)
Application (what, where, when)
Use (access over time, # users)
• AVAILABLE SAMPLE MATERIAL
DNA Isolation (quantity and quality)
• MARKERS (existence and appropriateness)
• REFERENCE MATERIAL (databases; maps)
• TIME (inquiry to answer)
• COST
• FACILITY/EXPERTISE
The National Forest Genetics Lab (NFGEL)
![Page 4: Use of genotyping technologies to identify tree variabilityPINUS GENOMES Nuclear: • All diploids (n=12) • Massive genomes: 28 pg (Arabidopsis 0.2 pg) • High levels of intra-specific](https://reader033.fdocuments.net/reader033/viewer/2022042221/5ec76408b075612ca66dd882/html5/thumbnails/4.jpg)
Ponderosa Pine: Looking to its Genes to Ensure its Future
US Forest Service National Forest Genetics Lab, USFS Region 1, and Bureau of Land Management (BLM)
Ponderosa pine is vulnerable to climate shifts, wildfires, bark beetle attack, and loss of habitat due to development. Genetic diversity is an essential component of long-term forest health.
[Photos: local seed source; offsite seed source]
![Page 5: Use of genotyping technologies to identify tree variabilityPINUS GENOMES Nuclear: • All diploids (n=12) • Massive genomes: 28 pg (Arabidopsis 0.2 pg) • High levels of intra-specific](https://reader033.fdocuments.net/reader033/viewer/2022042221/5ec76408b075612ca66dd882/html5/thumbnails/5.jpg)
Identifying Original Seed Source of Plantations
• Reference Material: 114 stands, over 3,000 trees; needle collections
  - Samples received from May 2001 to Aug 2012
• Markers: allozymes; microsatellites (nuclear and cp); mtDNA
• Database Use: over 10 years of lab work; $550,000+ (federal government funding)
  - 4 peer-reviewed publications
  - source ID of known stands
Potter, Hipkins, Mahalovich, and Means. 2013. AJB 100: 1562
Potter, Hipkins, Mahalovich, and Means. 2015. Tree Genetics & Genomes 11: 38
Willyard, Gernandt, Potter, Hipkins, Marquardt, Mahalovich, Langer, Telewski, Cooper, Douglas, Finch, Karemera, Lefler, Lea, and Wofford. 2017. AJB 104: 161
![Page 6: Use of genotyping technologies to identify tree variabilityPINUS GENOMES Nuclear: • All diploids (n=12) • Massive genomes: 28 pg (Arabidopsis 0.2 pg) • High levels of intra-specific](https://reader033.fdocuments.net/reader033/viewer/2022042221/5ec76408b075612ca66dd882/html5/thumbnails/6.jpg)
PONDEROSA PINE SOURCE ID
Probability of Occurrence
![Page 7: Use of genotyping technologies to identify tree variabilityPINUS GENOMES Nuclear: • All diploids (n=12) • Massive genomes: 28 pg (Arabidopsis 0.2 pg) • High levels of intra-specific](https://reader033.fdocuments.net/reader033/viewer/2022042221/5ec76408b075612ca66dd882/html5/thumbnails/7.jpg)
Relative likelihood of origin for the Round Mountain stand based on mitochondrial haplotypes.
Dr. Brook Milligan, Department of Biology, New Mexico State University
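A relative likelihood of origin from haplotype data can be sketched as follows. This is a minimal illustration that assumes a simple multinomial model of haplotype counts; the haplotype names, reference regions, and frequencies are made-up values, not data from the study, and the analysis actually used for the Round Mountain stand may differ.

```python
import math

def relative_likelihoods(stand_counts, reference_freqs, floor=1e-3):
    """Return normalized likelihoods that a stand came from each reference source.

    stand_counts: haplotype -> number of trees in the unknown stand (hypothetical).
    reference_freqs: source -> {haplotype -> frequency} (hypothetical).
    """
    log_liks = {}
    for source, freqs in reference_freqs.items():
        # Floor the frequency so a haplotype unseen in a reference
        # does not produce log(0).
        log_liks[source] = sum(
            n * math.log(max(freqs.get(hap, 0.0), floor))
            for hap, n in stand_counts.items()
        )
    # Subtract the maximum before exponentiating for numerical stability.
    best = max(log_liks.values())
    weights = {s: math.exp(ll - best) for s, ll in log_liks.items()}
    total = sum(weights.values())
    return {s: w / total for s, w in weights.items()}

# Illustration values only.
stand = {"H1": 8, "H2": 2}
references = {
    "region_A": {"H1": 0.8, "H2": 0.2},
    "region_B": {"H1": 0.2, "H2": 0.8},
    "region_C": {"H1": 0.5, "H2": 0.5},
}
result = relative_likelihoods(stand, references)
for source, value in sorted(result.items(), key=lambda kv: -kv[1]):
    print(f"{source}: {value:.3f}")
```

The values sum to 1 across the candidate sources, so they rank candidates against each other rather than giving an absolute probability of origin.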
![Page 8: Use of genotyping technologies to identify tree variabilityPINUS GENOMES Nuclear: • All diploids (n=12) • Massive genomes: 28 pg (Arabidopsis 0.2 pg) • High levels of intra-specific](https://reader033.fdocuments.net/reader033/viewer/2022042221/5ec76408b075612ca66dd882/html5/thumbnails/8.jpg)
PINUS GENOMES
Nuclear:
• All diploids (n=12)
• Massive genomes: 28 pg (Arabidopsis 0.2 pg)
• High levels of intra-specific diversity
• Incomplete lineage sorting
• Draft genome (haploid DNA from seed megagametophyte)
• P. taeda; sister to subsection Ponderosae
Organelles:
• Plastids paternally inherited: pollen vs. seed movement (cp and mtDNA)
• Ancient plastid transfer within taxonomic subsection
• Plastid Simple Sequence Repeats (SSRs)
• Mitochondrial haplotypes - complex minisatellite
Target-Enriched High-Throughput Sequencing
(Ann Willyard, Hendrix College, USA)
![Page 9: Use of genotyping technologies to identify tree variabilityPINUS GENOMES Nuclear: • All diploids (n=12) • Massive genomes: 28 pg (Arabidopsis 0.2 pg) • High levels of intra-specific](https://reader033.fdocuments.net/reader033/viewer/2022042221/5ec76408b075612ca66dd882/html5/thumbnails/9.jpg)
Target-Enriched High-Throughput Sequencing
(Ann Willyard, Hendrix College, USA)
Use HTS to obtain several hundred gene trees to build a good species tree.
Considered Low Copy Nuclear Genes:
• 11,396 low-copy expressed genes
• 6 Sanger-sequenced for previous phylogeny
• 3 conserved ortholog set
Chose 1045 genes to target:
• Longer than 850 bp
• 96 linkage mapped
Preliminary results: nearly complete plastomes; alignments for 285 nuclear genes; alignments for 5 mitochondrial genes.
Issues: ambiguous calls; missing samples at some genes; messy/missing data.
![Page 10: Use of genotyping technologies to identify tree variabilityPINUS GENOMES Nuclear: • All diploids (n=12) • Massive genomes: 28 pg (Arabidopsis 0.2 pg) • High levels of intra-specific](https://reader033.fdocuments.net/reader033/viewer/2022042221/5ec76408b075612ca66dd882/html5/thumbnails/10.jpg)
Golden Chinquapin (Chrysolepis chrysophylla)
• Beech family (Fagaceae)
• Endemic to western U.S.
• Evergreen tree about 20–40 m in height (coastal, low elevation)
• Sensitive species in WA
John Ruter, University of Georgia, Bugwood.org
Pete Veilleux, East Bay Wilds. USFS
Objectives:
• What is the level and distribution of genetic diversity within the species?
• How different are the northern-most disjunct populations from the rest of the range?
• Are there varietal-level genetic differences between the ‘shrub’ and ‘tree’ forms?
Project cooperators: Andy Bower, Ann Willyard, Eli
Meyer, Jennifer DeWoody, Jacob Snelling
![Page 11: Use of genotyping technologies to identify tree variabilityPINUS GENOMES Nuclear: • All diploids (n=12) • Massive genomes: 28 pg (Arabidopsis 0.2 pg) • High levels of intra-specific](https://reader033.fdocuments.net/reader033/viewer/2022042221/5ec76408b075612ca66dd882/html5/thumbnails/11.jpg)
Marker Choice for Chinquapin
Allozymes: low variation obtained in tests
Microsatellites (SSRs): pulled over from other Fagaceae (oaks, European chestnut, etc.)
  - 16 loci, 716 trees, 23 stands
  - ~$20,000; 1 yr
2b-RAD: Next Generation Sequencing approach for genome-wide genotyping (restriction site-associated DNA (RAD), based on sequencing fragments produced by type IIB restriction endonucleases)
  - 628 samples, 2021 SNPs with coverage > 20X
  - ~$75,000; 2.5 yrs
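The trade-off between the two marker systems can be made concrete with some simple arithmetic on the figures above. The sketch below uses only the approximate costs, sample counts, and marker counts quoted on the slide; it ignores missing data, and the per-sample and per-genotype breakdown is an illustration rather than a figure reported by the project.

```python
# Approximate project figures as quoted on the slide.
projects = {
    "SSR": {"cost_usd": 20_000, "samples": 716, "markers": 16, "years": 1.0},
    "2b-RAD": {"cost_usd": 75_000, "samples": 628, "markers": 2021, "years": 2.5},
}

def cost_summary(project):
    """Return (cost per sample, cost per sample-by-marker genotype) in USD."""
    per_sample = project["cost_usd"] / project["samples"]
    # Upper bound on genotypes: assumes every marker is scored in every sample.
    per_genotype = project["cost_usd"] / (project["samples"] * project["markers"])
    return per_sample, per_genotype

summaries = {name: cost_summary(p) for name, p in projects.items()}
for name, (per_sample, per_genotype) in summaries.items():
    print(f"{name}: ${per_sample:.2f} per sample, ${per_genotype:.4f} per genotype")
```

On these numbers 2b-RAD costs more per sample but far less per genotype, which is the usual pattern when moving from a few loci to genome-wide markers.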
![Page 12: Use of genotyping technologies to identify tree variabilityPINUS GENOMES Nuclear: • All diploids (n=12) • Massive genomes: 28 pg (Arabidopsis 0.2 pg) • High levels of intra-specific](https://reader033.fdocuments.net/reader033/viewer/2022042221/5ec76408b075612ca66dd882/html5/thumbnails/12.jpg)
SNP Genotyping Using 2b-RAD
Genome reduction by type IIB RE
• Cuts in 2 sites, producing even-sized fragment with sticky ends
• Reduction using selective amplification (think AFLP)
• Produces equal-sized fragments for sequencing
• De novo reference construction based on overlapping tags
Comparison to RAD, GBS
• Flexible reduction (selection)
• Faster bench protocol
• But some questions about use in highly complex genomes
Wang et al. (2012) Nat. Meth.
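The idea that a type IIB enzyme yields equal-sized tags can be illustrated with a small in silico digest. This is a minimal sketch: the recognition motif and the flank lengths are assumed parameters chosen for illustration (check the documentation of the enzyme actually used), only the forward strand is scanned, and the function name is hypothetical.

```python
import re

def extract_tags(sequence, motif=r"AC[ACGT]{5}CTCC", left=12, right=10):
    """Return fixed-length tags around each motif occurrence (forward strand only).

    motif, left, and right are illustrative assumptions, not a verified
    specification of any particular enzyme.
    """
    tags = []
    # A lookahead lets overlapping motif occurrences be found as well.
    for match in re.finditer(rf"(?=({motif}))", sequence):
        start = match.start(1) - left
        end = match.end(1) + right
        # Skip sites too close to the sequence ends to give a full-length tag.
        if start >= 0 and end <= len(sequence):
            tags.append(sequence[start:end])
    return tags

# Toy sequence with a single recognition site in the middle.
toy = "G" * 15 + "ACTTTTTCTCC" + "A" * 15
tags = extract_tags(toy)
print(tags, [len(t) for t in tags])
```

Because the enzyme cuts at a fixed distance on both sides of its site, every tag has the same length (here, left flank + motif + right flank), which is what makes the fragments uniform for sequencing.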
![Page 13: Use of genotyping technologies to identify tree variabilityPINUS GENOMES Nuclear: • All diploids (n=12) • Massive genomes: 28 pg (Arabidopsis 0.2 pg) • High levels of intra-specific](https://reader033.fdocuments.net/reader033/viewer/2022042221/5ec76408b075612ca66dd882/html5/thumbnails/13.jpg)
2b-RAD: Sequencing Results
Example Sequencing & Processing Stats (single lane):
• Mean raw reads: 1.63M per sample
• Mean HQ reads: 1.59M per sample
• Mean mapped HQ reads: 1.21M per sample
• Mean N loci genotyped: 96,351
From reads to SNP matrix:
• 628 samples
• 2,021 SNPs with coverage ≥20X
• Mean 25% missing data per individual (range 10-62%)
• SNP frequency over entire data set: 2.05% = 1 SNP every 48 bases
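The read-retention rates implied by these statistics can be checked with a few lines of arithmetic. The sketch below uses only the per-sample means and SNP frequency quoted on the slide; the variable names are illustrative.

```python
# Per-sample means quoted on the slide (single lane).
raw_reads = 1.63e6
hq_reads = 1.59e6
mapped_hq_reads = 1.21e6
snp_frequency = 0.0205  # 2.05% over the entire data set

hq_fraction = hq_reads / raw_reads            # share of raw reads passing quality filters
mapped_fraction = mapped_hq_reads / hq_reads  # share of HQ reads mapped to the reference
bases_per_snp = 1 / snp_frequency             # average spacing between SNPs, in bases

print(f"HQ reads retained: {hq_fraction:.1%}")
print(f"HQ reads mapped:   {mapped_fraction:.1%}")
print(f"Bases per SNP:     {bases_per_snp:.1f}")
```

Roughly 98% of raw reads pass quality filtering and about three quarters of those map, while a 2.05% SNP frequency works out to a spacing between 48 and 49 bases, in line with the "1 SNP every 48 bases" on the slide.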
![Page 14: Use of genotyping technologies to identify tree variabilityPINUS GENOMES Nuclear: • All diploids (n=12) • Massive genomes: 28 pg (Arabidopsis 0.2 pg) • High levels of intra-specific](https://reader033.fdocuments.net/reader033/viewer/2022042221/5ec76408b075612ca66dd882/html5/thumbnails/14.jpg)
[Figure: stand labels on axis: Hood Canal, LWS, Happy Camp, Lost Lake, Cascade-FL, Brookings, MacKenzie, Detroit, Fremont-Winema, Clackamas, Mt. Fuji Rd, Walker, Mary’s Peak, Six Rivers, Green Butte, Lacks Creek, Little River, Montana de Oro, Blodgett, Sturgis Creek, Pebble Beach, Point Reyes, Kings Range]
![Page 15: Use of genotyping technologies to identify tree variabilityPINUS GENOMES Nuclear: • All diploids (n=12) • Massive genomes: 28 pg (Arabidopsis 0.2 pg) • High levels of intra-specific](https://reader033.fdocuments.net/reader033/viewer/2022042221/5ec76408b075612ca66dd882/html5/thumbnails/15.jpg)
2b-RAD: Population STRUCTURE
SSR: Population STRUCTURE
K=2
![Page 16: Use of genotyping technologies to identify tree variabilityPINUS GENOMES Nuclear: • All diploids (n=12) • Massive genomes: 28 pg (Arabidopsis 0.2 pg) • High levels of intra-specific](https://reader033.fdocuments.net/reader033/viewer/2022042221/5ec76408b075612ca66dd882/html5/thumbnails/16.jpg)
Mulanje Cedar (Widdringtonia whytei)
Matching Seed to Origin
Background
• Malawi’s national tree
• occurs naturally only in Mulanje Mtn Reserve
• critically endangered
• over-exploitation, theft, and fire
• discrete Regions/Basins/Stands
• 3 offsite stands producing seed; origin unknown
• match seed to appropriate planting source
• BGCI (Botanic Gardens Conservation International); Forestry Research Institute of Malawi; Mulanje Mountain Conservation Trust; USFS
Reference Database
• single native stand; plantations
Constraints
• inquiry: February 2017
• 700 wood samples, 150 des. tissue samples
• no markers
• final data requested by December 2017 (10 months)
• $7,000
@tapaculo99
![Page 17: Use of genotyping technologies to identify tree variabilityPINUS GENOMES Nuclear: • All diploids (n=12) • Massive genomes: 28 pg (Arabidopsis 0.2 pg) • High levels of intra-specific](https://reader033.fdocuments.net/reader033/viewer/2022042221/5ec76408b075612ca66dd882/html5/thumbnails/17.jpg)
Mulanje Cedar (Widdringtonia whytei)
Matching Seed to Origin
DNA Isolation and Markers
• 12 Microsatellites: “SSR markers from a thousand plant transcriptomes”. Hodel et al. 2016. Applications in Plant Sciences 4: 1600024; Hodel et al. 2016. Dryad Digital Repository. From Widdringtonia cedarbergensis.
![Page 18: Use of genotyping technologies to identify tree variabilityPINUS GENOMES Nuclear: • All diploids (n=12) • Massive genomes: 28 pg (Arabidopsis 0.2 pg) • High levels of intra-specific](https://reader033.fdocuments.net/reader033/viewer/2022042221/5ec76408b075612ca66dd882/html5/thumbnails/18.jpg)
Mulanje Cedar (Widdringtonia whytei)
Matching Seed to Origin
[Figure: Principal Coordinates Analysis (PCoA): 10 SSR loci; by basin; 190 trees]
Axes: Coord. 1 (52.2%); Coord. 2 (39.1%)
Legend: Chambe, Chinzama, Lichenya, Madzeka, Sombani, Thuchila, unknown
![Page 19: Use of genotyping technologies to identify tree variabilityPINUS GENOMES Nuclear: • All diploids (n=12) • Massive genomes: 28 pg (Arabidopsis 0.2 pg) • High levels of intra-specific](https://reader033.fdocuments.net/reader033/viewer/2022042221/5ec76408b075612ca66dd882/html5/thumbnails/19.jpg)
Mulanje Cedar (Widdringtonia whytei)
Matching Seed to Origin
[Figure: Principal Coordinates Analysis (PCoA): 10 SSR loci; by stand; 190 trees]
Axes: Coord. 1 (60.9%); Coord. 2 (18.0%)
Legend: Chambe, Chicangawa, Chinzama, Khungubwe, Lichenya, Limbe Transect, Lukhubla valley, Madzeka Valley, Malosa, Minunu, Mzimba, Nkandanyalungwe, Ntayamoyo-1, Plot 2, Sombani, Sombani Research Plot, Tanzania, Zomba, Zomba 1, Zomba 2
![Page 20: Use of genotyping technologies to identify tree variabilityPINUS GENOMES Nuclear: • All diploids (n=12) • Massive genomes: 28 pg (Arabidopsis 0.2 pg) • High levels of intra-specific](https://reader033.fdocuments.net/reader033/viewer/2022042221/5ec76408b075612ca66dd882/html5/thumbnails/20.jpg)
To identify tree variability...
DEFINE FOCUS AND NEED
(species, scope, application)
REFERENCE DATABASES
(new technologies)
COOPERATION