Comparative genomic hybridization of precipitated DNA · Comparative genomic hybridization of...
Transcript of Comparative genomic hybridization of precipitated DNA · Comparative genomic hybridization of...
1
Supplementary methods and information
Comparative genomic hybridization of precipitated DNA
The karyotypes of two normal leukocytes and two MCF-7 breast cancer cells, the DNA of
which had been precipitated are compiled in Supplementary Figure 1. While the profiles of
the normal cells showed no significant deviation from the midline, the profiles of the MCF-7
cells exhibit multiple yet almost identical genomic aberrations (e.g., on chromosome 8) that
are congruent with previously published data. Therefore, the experiments with single cells of
a defined karyotype – i.e. single leukocytes and MCF-7 breast cancer cells show that cellular
DNA of a single cell can be completely precipitated.
Supplementary Figure 1. CGH profiles of two normal leukocytes (red) and two MCF-7 breast
cancer cells (blue). Genomic DNA was isolated from the supernatant after mRNA isolation.
2
The ratio profiles of normal cells lie within the dashed lines that indicate the threshold of
significance. The profiles of both cancer cells show multiple and similar chromosomal
deletions and amplifications.
3
Generation and global amplification of single cell cDNA.
The following conditions were essential to obtain sufficient sensitivity and reliability. Firstly,
tRNA and rRNA had to be removed to improve PCR amplification. Secondly, we found that
random priming requires 2000-8000-times higher concentrations of primers for cDNA
synthesis than required with previously described techniques using oligo-dT primers at 10 nM
concentrations1-3. Supplementary Fig. 2A shows that higher concentrations of random primers
for cDNA synthesis lead to increased detection rates of specific transcripts (e.g. KRAS).
Surplus primer, being an effective competitor of the subsequent tailing and amplification
reaction, had to be removed before both steps. Equally, high dNTP concentrations improved
cDNA synthesis but interfered with the subsequent tailing reaction and also needed to be
removed. Capturing of mRNA on oligo-dT coated magnetic beads provided for simple
handling during mRNA isolation and buffer exchange steps. The second primer binding site
was added by homopolymer tailing. Here, poly-dC primers binding to poly-dG tails were
found to give at least a 100-fold higher sensitivity than poly-dT primers on poly-dA tails
(Supplementary Fig. 2B, compare lanes 3, 4 to 5, 6). Tailing efficiency (in Supplementary
Fig. 2B) as well as the sensitivity of the subsequent PCR of poly-dA- and poly-dG-tailed
sequences was assessed by adding a homopolymer tail of either poly-dA or poly-dG to a
defined PCR fragment. The poly-dA- and poly-dG-tailed fragments were diluted and then
amplified by PCR with poly-dT and poly-dC primers, respectively.
The most dramatic improvement was obtained when only one primer (Supplementary
Fig. 2D) was used for global PCR instead of two (Supplementary Fig. 2C). To prove this
point we isolated, diluted, reverse-transcribed and dG-tailed mRNA of HT29 cell line cells.
The cDNA synthesis primer consisted of a 3’ random hexamer (N6) and a 5’ flanking region
4
of either a poly-dC stretch (CFL5c-N6) or a sequence of all four bases (FL4-N6), introducing
either the FL4 or poly-dC sequence at the 5’ end of the generated cDNAs. Following dG-
tailing,the cDNAs were amplified with the respective primers. Use of the FL4 primer in
combination with the poly-dC binding primers (CP2, CP3) prevented amplification,
regardless of the annealing temperature chosen (Supplementary Fig. 2C, lanes 1, 2 and 4, 5).
PCR with the CP2 primer alone resulted in amplification of a wide size range of cDNA
molecules (0.2 – 3kb; Supplementary Fig. 2D, lanes 3, 4). Even highly diluted cDNA (1:200)
was still sufficient for global amplification (Supplementary Fig. 2D, lanes 1, 2). Random
hexamers (N6), octamers (N8), alone or combined with oligo-dT (dT)15 , sharing the same
poly-dC flanking region were compared. All worked well and reliably. The best results with
single cells were obtained with a combination of poly-dC-N8 (CFL5c8) and poly-dC-(dT)15
(CFL5cT).
5
Supplementary Figure 2. Experimental conditions determining amplification success. (A)
Twenty HT29 colon carcinoma cells (lanes 1-20) were individually isolated and processed.
After cell lysis in cDNA synthesis buffer containing the detergent Igepal, groups of five single
M
80 µM 8 µM 0.8 µM 0.08 µM A
A-tail
G-tail
10-5 10-3 - B
1 2 3 4 5 6 1 2 3 4 5 CP3-FL4 CP2-FL4 1:200 pure
C D
- + 1 2 3 4 5 11 6 7 8 9 10 12 13 14 15 16 17 19 18 20
3 4 5 621
6
cells were formed and reverse transcribed with four different concentrations of random cDNA
synthesis primers. Efficacy of cDNA synthesis was tested with KRAS specific PCR for each
concentration. (lanes 1-5, 80 µM; lanes 6-10, 8µM; lanes 11-15, 0.8 µM; lanes 16-20, 0.08
µM). (B) Effect of homopolymer tails on sensitivity. A 350 bp TGFA cDNA fragment was
isolated, diluted and either dA or dG tailed. Serial dilutions were tested by PCR using poly-dT
or poly-dC containing primers, respectively, and an additional primer from within the TGFA
sequence. The informative dilutions are shown in duplicates. (lanes 1+2, negative control;
lanes 3+4, 10-3 dilution; lanes 5+6, 10-5 dilution). (C) FL4-N6 primed and reverse transcribed
mRNA was dG-tailed and amplified using the CP3 + FL4 primers (lanes 1-3) or CP2 + FL4
primers at different annealing temperatures (lane 1+4, 68°C, lane 2+5 65°C, lane 3+6,
negative control). (D) An identical amount of mRNA as in C) was reverse transcribed using
the CFL5cN6 primer, and amplified with the CP2 primer (lane 3+4), which resulted in
amplification of a wide range of cDNA fragments. Even a mRNA dilution of 1:200 (lane 1+2)
resulted in a similar smear regardless of annealing temperatures (lane 1+3, 68°C; lane 2+4,
65°C; lane 5, negative control).
Primers used and mentioned in Supplementary Fig.2C and D were:
Cfl5cN6 5’-(CCC)5 GTC TAG ANN (N)6-3’
FL4N6 5’-TTT CTC CTT AAT GTC ACA GAT CTC GAG GAT TTC (N)6-3’)
CP3 5’- GCT GAA GTG GCG AAT TCC GAT GCC (C)12-3’
FL4 5’-CTC CTT AAT GTC ACA GAT CTC GAG GAT TTC-3’.
7
Comparison of single cells of different histogenetic origin
We tested the PCR amplificates from single cells for suitability of cDNA array
analysis. The specificity of the hybridisation of digoxigenin-labelled probes is depicted in
Supplementary Table 1, where the expression patterns of genes from single cells of different
histogenetic origin are shown. Only the MCF-7 and A431 cell expressed the cytokeratin
genes, markers for their epithelial origin, whereas the erythroleukemia K562 cell and EBV-
transformed B cell JY expressed genes of a hematopoetic origin, including CD33, CD37,
CD38, and kappa light chain (IGKC) in the B cell. In addition, the testis- and tumour-specific
MAGE genes were highly expressed in all cancer cells but not the virally transformed B cell.
These data show that the amplificates obtained from single cell PCR are suitable for cDNA
array analysis and produce cell type-specific gene expression patterns of single cells.
Supplementary Table 1. Expression of histogenetically informative genes by single cells
derived from different tissues.
MCF-7 A431 K562 JY ACTB + + + + EEF1A1 + + + + KRT7 + + - - KRT10 - + - - KRT13 - + - - KRT18 + + - - KRT19 + + - - EpCAM + + - - CD33 - - + + CD37 - - + + CD38 - - + - IGKC - - - + VIM - + + -
8
ITGA6 + - - - ITGB1 + - - - ITGB2 - - - + ITGB4 - - + - ITGB7 - - - + PTK2 + - - - MageA1 + - + - MageA2 + + + - MageA3 + - + - MageA6 + - + - MageA12 + + + -
Array analysis of MCF-7 breast cancer cells
Expression patterns of four MCF-7 cells were compared using a cDNA array with 109
different genes (Supplementary Table 2). 46 genes (42%) were expressed in at least one cell,
and 63 (58%) were negative for all four cells. Eighteen of the 46 (39%) expressed genes were
detected in all four cells whereas the remaining 29 (61%) were found to be heterogeneously
expressed. All housekeeping genes (EEF1A1, GHPDH, ACTB) belonged to the genes
detected in all four cells but also structural genes like the cytokeratins (KRT) 7, 8, 18, 20 and
adhesion molecules like Integrin α6 and β1 (ITGA6, ITGB1). Among the genes that were
found only in one of the cells many are regulated within the cell cycle such as CDKN2A,
CDKN1A, CDKN1B, ING1, MKI67, TK1, but also cytokeratin family members that are
usually not expressed in breast tissue (KRT10, KRT13).
Supplementary Table 2. Commonly and differentially detected transcripts of four single
MCF-7 cells.
4/4* 3/4 2/4 1/4 EEF1A1 GHPDH ACTB
KRT19 TIMP1 CTSB
ITGB4 ITGB5 TP53
KRT10 KRT13 ADAM9
9
KRT7 KRT8 KRT18 KRT20 ITGA6 ITGB1 PTK2 BSG PLAUR MMP7 CCND1 EPHA2 M4S1 (EpCAM) ABCC1 PHLDH1
CTSD CTSL ADAM10 MYC
CKM
ADAM15 ADAM17 CDKN2A CDKN1A CDKN1B ING1 MKI67 TK1 CDH1 IGF1R IGF2R TGFB VEGF DSP
* Listed are the transcripts that were detected in all four single cells (4/4), three of four (3/4),
two of four (2/4), and one of four (1/4).
10
Primer information of the gene specific PCRs
Supplementary Table 3: Sequences of oligonucleotide primers for the RT-PCR amplification
of the different genes.
MAGE 2 sense antisense
CAT TGA AGG AGA AGA TCT GCC T CAG GCT TGC AGT GCT GAC TC
MAGE 3/6 sense antisense
GGC TCG GTG AGG AGG CAA G GAT GAC TCT GGT CAG GGC AA
MAGE 4 sense antisense
CAC CAA GGA GAA GAT CTG CCT CAG GCT TGC AGT GCT GAC TCT
β-Actin sense antisense
GTG GGG CGC CCC AGG CAC CA CTC CTT AAT GTC ACG CAC GAT TTC
EF-1α sense antisense
TGC CCC AGG ACA CAG AGA CT CTG TGT CGG GGT TGT AGC CA
TNF-α sense antisense
CAG AGG GAA GAG TTC CCC AG CCT TGG TCT GGT AGG AGA CG
EMMPRIN sense antisense
CAT GCT GGT CTG CAA GTC A GTT CTC AAT GTG TAG CTC TG
EpCAM sense antisense
GCG AGT GAG AA CTA CTG CAC ATC AGC TAT GTC CAC ATC
Ki-ras sense antisense
TAT AAA CTT GTG GTA GTT GGA GC CTC TTG ACC TGC TGT GTC GA
11
Sequence-independent amplification
In order to assess the sensitivity, the potential systematic failure to amplify “difficult”
sequences and the influence of other technical factors on transcript detection we tested the
differences between cDNA preparations of single A431 cells that were split into two halves
before global amplification. In the first step, we performed gene-specific PCRs with the
amplified cDNAs of the halved cells (Supplementary Fig. 3). For comparison, we diluted
unamplified cDNA isolated from a pool of 500,000 A431 cells to such an extent that the
intensity of the β-actin band was similar to that obtained with 50% of cDNA prepared from a
single cell. After 32 cycles and with a cDNA amount corresponding to about 10,000 cells,
both β-actin signals that of the pooled control DNA as well as that of 50% of the single cell
cDNA reached the plateau phase of amplification. As shown in Supplementary Figure 3, the
variation between two cDNA halves of the same cell was very low. In two independent
experiments, each half (a+b) from six single A431 cells yielded β-actin bands of similar
intensity.
Because PCR amplification is known to be sequence-dependent, we also tested the
amplification of MAGE transcripts, very demanding with regard to primer design. Kufer et al.
had to test almost 800 primer combinations to achieve successful amplification of MAGE
genes from less than five cells demonstrating the magnitude of the problem5. MAGE genes
belong to a family of genes expressed in spermatogonal cells and exclusively in various
cancer cells, hence the designation cancer-testis genes. MAGE transcripts could be detected in
all single cells. After the same number of cycles in the specific PCR as for β-actin, the level
of MAGE expression was consistently found to be lower than that of β-actin, and the
concentration of MAGE transcripts in split single cell samples after global PCR amplification
(Supplementary Fig. 3, lanes 2-4 and 6-8) compared well to that of the control sample from
12
pooled cells (Supplementary Fig. 3, +). Although the MAGE transcripts were only weakly
expressed in the pooled cDNA from 10,000 cells (Supplementary Fig.3, +), the data obtained
from single cells were identical in 4 out of 6 paired halves of the cDNA. The failure to detect
any MAGE transcript in cell half 7a and 8b (Supplementary Fig.3) is most likely due to an
unequal distribution of the beads prior to amplification between the two halves indicating thus
the detection limit of the method. The semi-quantitative character of our amplification method
is strongly suggested by the reproducibly similar signal intensity of all β-actin and MAGE
transcripts: No artificial increase in intensity is observed in any divided cell and the intensities
of the β-actin and MAGE band closely resemble the positive control. As the yield (y) of a
PCR amplification follows the equation y = ((1+R)n -2n)x, with n being the number of cycles,
x the copy number of the original template and (1+R) the amplification efficiency of a any
sequence (R ≤ 1), Supplementary Figure 3 suggests that R is the same for all sequences under
the applied conditions. Consequently, although an exponential amplification takes place, the
ratios of the different sequences remain very similar.
Supplementary Figure 3. Gene specific PCR for β-actin and various MAGE transcripts using
unamplified pooled cDNA of A431 cells as positive control (+) and amplificates of single
76 5321 4 8 M
Actin
Mage
Mage
Mage
M - + a b a b a b a b a b a b a b a b
13
A431 cells (lane 2-4 and 6-8) that were divided into two halves (a+b) before global PCR. Two
independent experiments were performed (lane 1-4 and 5-8) with lane 1 and lane 5 being the
negative controls for the global PCR.
14
Evaluation of array hybridization of split single cells
The high sensitivity of the procedure and the diversity of the isolated and amplified
cDNA sequences was also seen with the cDNA of four halves labeled and hybridized to a
cDNA array representing 193 different gene sequences. Most transcripts could be detected in
both halves of the single cell PCR products (Supplementary Fig.4). Of the total of 148
observed signals, 95 (64%) were found in the corresponding halves, whereas 53 (36%) were
found in only one half. Figure 4 shows the distribution of the signals of the two halves from
one experiment. Hybridization intensity of the two halves for the spotted cDNAs was
classified from 0 (no signal), 1 (weak) over 2 (middle) to 3 (strong). It can be seen that for
strong signals both halves correlated well (correlation coefficient r= 0.81). Also the signals
with intermediate strength were in most cases found in both halves. Weak signals gave the
least concordant results between the two halves of a cell. This can be explained by the
stochastic nature of the cDNA priming events, an uneven distribution of low copy number
transcripts before the PCR is performed and the undefined but real detection-limit of the
method. As many transcripts are estimated to be present in a cell in copy numbers of about 5-
15 (ref. 4), it is very likely that such transcripts are particularly subjected to effects leading to
discordant results when cDNA of a single cell is divided.
15
Supplementary Figure 4. Hybridization results of two independently amplified cDNA halves
of a single cell. Each spot on the array was classified as strong (3), middle (2), weak (1) and
no signal (0). The signal intensities for each spot were compared between the two halves. X-
and y-axis give the signal intensities for each half of the cDNA of a single cell and the z-axis
the number of cDNAs. The column at position 3/3 for example means that 10 transcripts were
expressed equally strong in both halves. Ideally, the columns would form a diagonal from 3/3
over 2/2 and 1/1 to 0. However, it can be seen that the light blue columns are of similar height
at position 0/1, 1/1, 1/0, indicating that no bipartition was achieved for weak signals as was
for the strong signals. (Red columns, at least one half produced a strong signal; yellow, at
least one half produced an intermediate signal; light blue, at least one half resulted in a weak
signal; dark blue, no signal in both halves).
16
Hybridization of exemplary micrometastatic cells to Clontech Cancer1.2 and own array
From the three micrometastatic cells C, B, L mRNA was isolated and prepared as
described. As control, the procedure was performed without the addition of a cell. Seven
genes on the Clontech arrays had to be excluded from analysis because a signal was obtained
in at least one of the negative controls. These genes were coding for VBP1, CASP10, TGFB1,
HBA1, ROS1, TDGF1, and TNFR2 . The number of positive signals ranged from 5.3 %
(70/1313), 7.0 % (92/1313) to 11.8 % (155/1313) for cells from patients B, C, and L
respectively. Although, these numbers are lower than those obtained by conventional
transcriptome analysis with millions of cells, one should not forget that the number of
attributes of an individual cell is necessarily lower than that of a population. The hybridization
results are shown in Supplementary Figure 5.
Hugo B C L Hugo B C L Hugo B C L
ABCC1 HBA1 PDGFAP
ACTB hBAP PDPK1
ACTB HDAC3 PGF
ACTB HDGF PHB
ADAM15 HINT PHKB
ADAM17 HLA PHLDA1
ADAM8 HLAC PIM1
ADPRT HLADPB1 PIN1
ADSL HLADQ1B PKUA
ADSS HLADRB1 PLA2
AHR HMG2 PLAU
AK1 HMGIY PLAUR
ANT2 HNRPK PM5
17
API4 HSPA5 PPM1G
ARAF1 IFI27 PRIM1
ARHA IFIT1 PRKACB
ARHB IFNAR1 PRKDC
ARHGDIB IFNB1 PRKM3
BAD IGF1 D123
BARK1 IGF2R PSCD2
BCGF1 IL10 PSME1
BCL7B IL12B PTB
BENE IL1B PTK2
BLMH IL1RN PTP4A1
BPGF1 IMPDH1 RAC1
BSG IMPDH2 RACK7
C5 ING1 RAD23A
CALM1 IRF1 RALB
CASP6 ITGA3 RAN
CCNA ITGAV RAP1A
CD27 ITGB2 RBBP4
CD36 ITGB3 RBL2
CD59 ITGB7 RFC2
CD74 JAG1 RFC3
CD9 JUN ROCK1
CDC25A KAI1 RPL10
CDC37 KIAA0324 RPL13A
CDC42 KRAS2 RPL32
CDC7L1 KRT10 RPL5
CDH1 KRT11 RPS16
CDK10 KRT13 RPS21
CDK5 KRT18 RPS5
CDKN1A KRT19 RPS9
18
CDKN2A KRT2E RXRB
CENPF KRT6A S100A4
CHD3 KRT7 SFRS7
CSBP1 KRT8 SFRS8
CSTB LAMC1 SGK
CTNNB LDHB SHH
CTSB LIG1 SP100
CTSD LY84L SPTB
CTSL M4S1 STI3
DDIT1 MAD STK12
DES MAGEA12 STK3
DSG2 MGMT TAPBP
DYRK4 MIC2 TDGF1
E2F1 MKI67 TGFA
EB1 MLF2 TGFB1
EEF1A1 MLH1 THBS1
EEF2 MMP7 TIEG
EGF MT1-MMP TK1
EGFR MT2-MMP TNFRSF1B
EIF2A MYBL2 TOP2A
EIF3S9 MYC TP53
EPHA2 NCK1 TRAM
ERCC1 NEDD5 TRAP1
ETV4 NFKBIA TRIP
FAU NME1 TUBA1
FBP3 NME2 TYMS
FGF6 NME3 UB
FRA1 NME4 UBL1
G1P3 NOTCH1 UMK
GHPDH NTHL1 UMPS
19
GLG1 OSMR VEGF
GMFB PA2G4 VIM
GRP58 PAK2 VRK2
GST PCTK1
GUK1 PCTK2
HAI2 PCTK3
PDGFA
Supplementary Figure 5. Hybridization results of cell B, C, and L. Three categories of signal intensity were formed: Strong (black), middle (middle gray) and weak (light gray).
20
References
1. Brady, G. & Iscove, N.N. Construction of cDNA libraries from single cells. Methods
Enzymol 225, 611-623 (1993).
2. Brail, L.H. et al. Gene expression in individual cells: analysis using global single cell
reverse transcription polymerase chain reaction (GSC RT-PCR). Mutat-Res 406, 45-54
(1999).
3. Trumper, L.H. et al. Single-cell analysis of Hodgkin and Reed-Sternberg cells:
molecular heterogeneity of gene expression and p53 mutations. Blood 81, 3097-3115
(1993).
4. Alberts, B. Molecular biology of the cell. (Garland Publishing, New York & London;
1994).
5. Kufer, P. et al. Heterogeneous Expression of MAGE-A Genes in Occult Disseminated
Tumor Cells: A Novel Multimarker Reverse Transcription-Polymerase Chain Reaction
for Diagnosis of Micrometastatic Disease. Cancer Res 62, 251-261. (2002).