Planar cell polarity gene expression correlates with tumor cell ...
Analysis of Gene Expression at the Single-Cell Level
description
Transcript of Analysis of Gene Expression at the Single-Cell Level
Analysis of Gene Expressionat the Single-Cell Level
Guo-Cheng Yuan
Department of Biostatistics and Computational Biology
Dana-Farber Cancer Institute
Harvard School of Public Health Bioconductor, July 31st, 2014
bioconductor
Methods to sequence the DNA and RNA of single cells are poised to transform many areas of biology and medicine.
--- Nature Methods
• “Recent technical advances have enabled RNA sequencing (RNA-seq) in single cells. Exploratory studies have already led to insights into the dynamics of differentiation, cellular responses to stimulation and the stochastic nature of transcription. We are entering an era of single-cell transcriptomics that holds promise to substantially impact biology and medicine.”– R. Sandberg, 2014. Nature Methods
Cell-type A
Cell-type B
Cell-type C
Cell-type DCell-type E
Cell-type F
Cell D
ivis
ion
R. Sandberg, 2014. Nature Methods
Challenges in single-cell data analysis
• Characterize and distinguish technical/biological variability
• Identify new and meaningful cell clusters.
• Identify the lineage relationship between different cell clusters.
• Characterize the dynamic process during cell-state transitions.
• Elucidate the transition of regulatory networks.
• Distinguish stochastic vs real variation
CMP GMP
MEP CLPMEP
Guoji Guo, Eugenio Marco
SPADE: a density-normalized, spanning tree model
Qiu et al. 2011Nat Biotech, p886
Down-sample
Clustering,Spanning-tree
Visualization
Log2(CMP1/CMP2)CD55 7.87ICAM4 3.98CD274 3.32MPL 3.19TEK 2.83
Cancer Stem Cells
• Each cancer contains a highly heterogeneous cell population.
• Clonal evolution contributes to cancer heterogeneity
• Cancer cells are hierarchically organized and maintained by cancer stem cells
• How are the leukemia stem cells related to normal blood cell lineage? How do they differ?
Single cell analysis of the mouse MLL-AF9 acute myeloid leukemia cells
Compilation of mouse cell surface antigens (Lai et al., 1998; eBioscience website)
Primer design for 300 multiplexed PCR (collaboration with Helen Skaletsky)
Micro-fluidic high-throughput realtime PCR (96.96 Array)
Guoji Guo, Assieh Saadatpour
t-SNE analysis identifies similarities between cell-types
• t-SNE is a nonlinear dimension reduction method, and can identify patterns undetectable by PCA
• t-SNE minimizes the divergence between distributions over pairs of points.
• Leukemia cells are more similar to GMPs than to HSCs
• Leukemia cells are highly heterogeneous.
Mapping leukemia cells to normal hematopoietic cell hierarchy
• Use 33 common genes to map cell hierarchy.
• Mapping identifies two subtypes of leukemia cells.
• These cells are similar but not identical to their corresponding normal lineages.
All Leukemia
Leukemia 1 Leukemia 2
GMP
Coexpression networks are different among subtypes
Surani and Tischler, Nature 2012
Guo et al. Dev Cell 2010
Dynamic clusteringT = 1 T = 2 T = 3 T = 4
Eugenio Marco, Bobby Karp, Lorenzo Trippa, Guoji Guo
Maximizing the penalized log-likelihood.2
)()|(log c
cacP x
Identifying bifurcation points and directions
>80% variance increase during bifurcation is attributed to a single (bifurcation) direction.
ICM
TE
EPI
PE
Modeling dynamics by bifurcation analysis
U(x)
U(x)
0274 23 ba
0274 23 ba
bxaxx
xU 24
)(24
I)
II)
dtxUdx )(
)()( tdWdtxUdx
0274 23 ba
0274 23 ba
U(x)
U(x)
I)
II)
Modeling dynamics by bifurcation analysis
Noise level s has large impact on lineage biases
s = 1s = 0.5s = 2
Control Perturbation
Lineage bias due to perturbation of TF activity
Predicted lineage bias due to 2 fold decrease of TF level
U(x) U(x)
Nanog
PE
EPI
Experimental validation using Nanog mutant
How do we infer dynamics without temporal information?
Characterization of early bipotential progeny ofLgr5+ intestinal stem cells
Tae-Hee Kim, Assieh SaadatpourCrosnier 2006. Nature Review
Principal Curve Analysis Reconstruct Temporal Information
t-SNE plot indicates two distinct clusters, linked a small number of transitional cells
Principal Curve Analysis Reconstruct Temporal Information
t-SNE plot indicates two distinct clusters, linked a small number of transitional cells
Principal curve analysis captures the overall trend of cell-state transition
Inferred dynamic gene expression profileO
lfm4
Rn
f43
Gap
dhC
dca
7A
ctb
Pcn
aT
af1d
Sm
oc2-
2C
d24
Itgav
Vil1
Pro
m1
CD
44G
cnt3
His
t1h1
eA
xin2
Sm
oc2-
1S
ox9-
1C
nn3
Cd
x2C
cl9
Acs
m3
Kitl
Gpl
d1 Vdr
Ho
pxH
es1
-2F
gfr4
Cd
kn1b
Nr4
a2V
egfc
Tex
9Z
nrf3
Lrp1
Cd
9D
pp4
Far
p1C
cnl1
Asc
l2S
ox9-
2S
lc14
a1Lr
ig1
No
p2C
cnd1
Lcp1
Hp
rtLy
75Il6
stR
hob
tb3
Cd
x1T
yms
Bm
i1F
osN
fat5
Atf3
Rn
ase1 Dll1
Cd
14Z
fp36
Cd
82G
emin
4S
ycn
Hig
2 Il6R
fc4
Cd
k1D
efa
5C
cnb1
Rrm
2P
lk1
Ang
4C
asp
12 Tlr1 Lc
tK
itA
poa1 Lifr
Tdg
f1D
ctM
uc2
Spd
efN
upr
1D
ll4T
ert-
1N
eur
od1
Ter
t-2
Clc
a3Li
peA
toh1
-1T
bx3
Du
sp1
JunB
Psr
c1C
dkn
2bB
mp4
He
s3T
reh
Re
p15
Ch
ga Alp
iP
ax4
Ato
h1-2
Cck
Cd
kn1a
Ne
urog
3C
cna2
Mki
67D
efa
3C
ach
d1E
phb3
Fst
l1C
enp
eS
hisa
2N
fatc
3Ju
nP
relp
No
tch1
He
s1-1
Fgf
r2C
es1
dA
pex1
Abc
g2Ig
f1r
Lgr5
Agr
3A
rg2
Na
p1l1
Clc
a1H
eph
Sem
a4d
No
tch2
Wdr
90 Ppi
fT
ac1
Egr
2C
cne1
Sel
mG
as6
Sox
4H
beg
fC
d38
Fos
bD
bd2
Lyz1
Clc
a2Ig
fbp4
Ra
sa3
Msi
Cd
55A
spa
Ifnar
1T
cof1
Du
sp4
Mm
p7K
cnq1 Syp
Dcl
k1C
hgb
Ra
d51l
1G
FP
He
s5E
2F1
Egr
1H
ey1
Insm
1G
m61
4H
ey2
Sp5
Btla
Lyz2
He
ylC
cl6
Re
g4M
ycH
es7
Cre
b3l4
Cd
83F
cgbp
Cd
kn1c
Zic
2H
epa
cam
2G
fi1N
kx2.
2G
ram
d1a
Gfi1
bC
dkn
2aT
ff3D
iap3
Cd
kn3
1
2
3
4
5
6
7
8
9
10
Use the principal curve coordinate as a proxy for temporal evolution.
Conclusions
• Single-cell genomics is a powerful technology for understanding cellular heterogeneity and hierarchy.
• Single-cell gene expression data analysis present many new methodological challenges.
• It is a great time to develop algorithms and software for single cell data analysis.
Acknowledgement
Eugenio MarcoAssieh Saadatpour
Bobby Karp
Lorenzo Trippa
Paul Robson
Stuart OrkinGuoji Guo
Ramesh ShivdasaniTae-Hee Kim
Funding fromNIH, HSCI