Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard...

37
Article Nudt21 Controls Cell Fate by Connecting Alternative Polyadenylation to Chromatin Signaling Graphical Abstract Highlights d shRNA screen identifies mRNA processing factor Nudt21 as a regulator of cell fate d Nudt21 knockdown enhances reprogramming but disrupts ESC/myeloid differentiation d Nudt21 suppression induces alternative polyadenylation for hundreds of transcripts d Chromatin regulators are enriched among Nudt21 targets important for reprogramming Authors Justin Brumbaugh, Bruno Di Stefano, Xiuye Wang, ..., Guang Hu, Yongsheng Shi, Konrad Hochedlinger Correspondence [email protected] (Y.S.), [email protected] (K.H.) In Brief Alternative polyadenylation exerts post- transcriptional control over cell fate decisions and pluripotency. Control KD Brumbaugh et al., 2018, Cell 172, 106–120 January 11, 2018 ª 2017 Elsevier Inc. https://doi.org/10.1016/j.cell.2017.11.023

Transcript of Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard...

Page 1: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

Article

Nudt21 Controls Cell Fate by Connecting Alternative

Polyadenylation to Chromatin Signaling

Graphical Abstract

Control KD

Highlights

d shRNA screen identifies mRNA processing factor Nudt21 as

a regulator of cell fate

d Nudt21 knockdown enhances reprogramming but disrupts

ESC/myeloid differentiation

d Nudt21 suppression induces alternative polyadenylation for

hundreds of transcripts

d Chromatin regulators are enriched among Nudt21 targets

important for reprogramming

Brumbaugh et al., 2018, Cell 172, 106–120January 11, 2018 ª 2017 Elsevier Inc.https://doi.org/10.1016/j.cell.2017.11.023

Authors

Justin Brumbaugh, Bruno Di Stefano,

Xiuye Wang, ..., Guang Hu,

Yongsheng Shi, Konrad Hochedlinger

[email protected] (Y.S.),[email protected] (K.H.)

In Brief

Alternative polyadenylation exerts post-

transcriptional control over cell fate

decisions and pluripotency.

Page 2: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

Article

Nudt21 Controls Cell Fateby Connecting Alternative Polyadenylationto Chromatin SignalingJustin Brumbaugh,1,2,3,4,5,11 Bruno Di Stefano,1,2,3,4,5,11 Xiuye Wang,6,12 Marti Borkent,1,2,3,4,5,12 Elmira Forouzmand,6

Katie J. Clowers,7 Fei Ji,1 Benjamin A. Schwarz,1,2,3,4,5 Marian Kalocsay,7 Stephen J. Elledge,8 Yue Chen,9

Ruslan I. Sadreyev,1,3 Steven P. Gygi,7 Guang Hu,10 Yongsheng Shi,6,* and Konrad Hochedlinger1,2,3,4,5,13,*1Department of Molecular Biology, Massachusetts General Hospital, Boston, MA 02114, USA2Center for Regenerative Medicine, Massachusetts General Hospital, Boston, MA 02114, USA3Cancer Center, Massachusetts General Hospital, Boston, MA 02114, USA4Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA 02138, USA5Harvard Stem Cell Institute, Cambridge, MA 02138, USA6Department of Microbiology and Molecular Genetics, School of Medicine, University of California, Irvine, Irvine, CA 92697, USA7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA8Howard Hughes Medical Institute, Brigham and Women’s Hospital and Department of Genetics, Harvard Medical School, Boston,

MA 02115, USA9Department of Biochemistry, Molecular Biology, and Biophysics, College of Biological Sciences, University of Minnesota, Saint Paul,

MN 55018, USA10Epigenetics andStemCell Biology Laboratory, National Institute of Environmental Health Sciences, Research Triangle Park, NC 27709, USA11These authors contributed equally12These authors contributed equally13Lead Contact

*Correspondence: [email protected] (Y.S.), [email protected] (K.H.)https://doi.org/10.1016/j.cell.2017.11.023

SUMMARY

Cell fate transitions involve rapid gene expressionchanges and global chromatin remodeling, yet theunderlying regulatory pathways remain incompletelyunderstood. Here, we identified the RNA-processingfactor Nudt21 as a novel regulator of cell fate changeusing transcription-factor-induced reprogramming asa screening assay. Suppression of Nudt21 enhancedthe generation of induced pluripotent stemcells, facil-itated transdifferentiation into trophoblast stem cells,and impaired differentiation of myeloid precursorsand embryonic stem cells, suggesting a broader rolefor Nudt21 in cell fate change. We show that Nudt21directs differential polyadenylation of over 1,500 tran-scripts in cells acquiring pluripotency, although only afraction changed protein levels. Remarkably, theseproteins were strongly enriched for chromatin regula-tors, and their suppression neutralized the effect ofNudt21 during reprogramming. Collectively, our datauncover Nudt21 as a novel post-transcriptional regu-lator of cell fate and establish a direct, previously un-appreciated link between alternative polyadenylationand chromatin signaling.

INTRODUCTION

A key property of stem and progenitor cells is the capacity to

differentiate into committed cell types. These transitions prompt

106 Cell 172, 106–120, January 11, 2018 ª 2017 Elsevier Inc.

widespread changes in gene expression programs that require

multiple levels of regulation to specify and ultimately restrict

cell fate. Cell identity can be experimentally modulated through

the ectopic expression of key transcription factors. For example,

forced expression of Oct4, Klf4, Sox2, and c-Myc (OKSM) in

somatic cells gives rise to induced pluripotent stem cells (iPSCs)

within 1–3 weeks and at an efficiency of 1%–3% (Takahashi and

Yamanaka, 2006). The slow rate and low efficiency of this

process have been attributed to somatic barriers that are

established during development (Apostolou and Hochedlinger,

2013). Thus, iPSC generation represents a tractable system to

identify these barriers and resolve general mechanisms that

control cell fate. While several studies have focused on the

role of direct chromatin and transcriptional regulators during

induced or physiological cell fate transitions, post-transcriptional

mechanisms of cell fate control remain relatively unexplored

(Ye and Blelloch, 2014).

Recently, alternative polyadenylation (APA) has emerged as a

fundamental mediator of gene expression. In mammals, �70%

of genes harbor multiple polyA sites, which yield distinct

mRNA isoforms that differ in the length of their 30 untranslatedregions (UTRs) (Mayr, 2017; Shi, 2012; Tian and Manley, 2017).

Transcripts with proximal polyA sites and therefore shorter

30 UTRs are generally thought to produce increased protein

levels due to the exclusion of regulatory sequences that mediate

degradation, export to the cytoplasm, or translational effi-

ciency of mRNAs (Mayr and Bartel, 2009; Sandberg et al.,

2008). However, several studies have reported that the effect

of APA on mRNA stability and ribosome loading is marginal

and may depend on cell-type-specific expression of microRNAs

and RNA-binding proteins (Gruber et al., 2014; Gupta et al.,

Page 3: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

100

200

300

400

500

200

400

600

500

1000

1500

0

0

0

50

100

150

0

50

100

150

200

0

500

1000

1500

0

shRNA

Dox Sort GFP+ cells/purify gDNA

Amplify and cloneshRNA

Enriched pool of shRNAs

60,642 hairpins18,464 genes

A B

10

20

30

40

50

Uni

que

shR

NA

s (X

103 )

Start 1 2Round

30

E F

Con

trol

siR

NA

OKSM Removal

Nud

t21

siR

NA

Day3 Day4 Day5 Day6 Day7 Day8

SSEA1

THY

1

Control siRNA

9.26

Nudt21 siRNA

42.6

EPCAM

THY

1

Control siRNA

8.81

Nudt21 siRNA

48.9

OCT4-GFP

THY

1

Nudt21 siRNA

38.7

Control siRNA

0.95

0 3 6 9 12 iPS

0 3 6 9 12 iPS

0 3 6 9 12 iPS

Epcam

Esrrb

Nanog

Rel

ativ

e E

xpre

ssio

nR

elat

ive

Exp

ress

ion

Rel

ativ

e E

xpre

ssio

n

Time (Days)

H I J

0 3 6 9 12 iPS

Pou5f1

0 3 6 9 12 iPS

Sall4

0 3 6 9 12 iPS

Cdh1

Time (Days)

0

10

20

30

40

% S

SE

A1+

Day3 Day6 Day9 Day12MEFs

Nudt21 siRNAControl siRNA

****

**** **

***

0

10

20

30

40

Day3 Day6 Day9 Day12

% O

CT4

-GFP

+

MEFs

***

**

**

0

10

30

50

% E

PC

AM

+

Day3 Day6 Day9 Day12MEFs

20

40

****

**

***

***

**

***

*

***

***

****

**

****

****

**

**

*****

DC

Nudt21siRNA

ControlsiRNA

0

50

100

150

200

250

300

350

Nudt21siRNA

Control siRNA

AP

+ co

loni

es

OKSM****

MEFs iPScells

THY1

SSEA1

EPCAM

OCT4-GFP

ReprogrammingIntermediates

G

β-ACTIN50

Nudt21

siRNA

NUDT2125

50c-MYC

Contro

l siR

NA

KDa

Figure 1. A Serial shRNA Screen Identifies Nudt21 as a Potent Barrier to Reprogramming

(A) A schematic of the serial enrichment shRNA screen.

(B) shRNA library complexity during hairpin enrichment.

(C) AP staining of transgene-independent iPSC colonies. Cells were induced with dox for 12 days, followed by 4 days of dox withdrawal.

(D) Quantification of AP staining.

(E) Western blot analysis showing Nudt21 knockdown with siRNA at day 3 of reprogramming.

(F) Dox withdrawal assay. Cells were induced for the indicated time period after which dox was replaced with ESC media until analysis at day 15.

(G) A schematic of the markers used to determine the progression of reprogramming.

(H) Flow cytometry analysis of intermediate reprogramming markers, SSEA1, EPCAM, and OCT4-GFP at day 12 of induction.

(I) Time course quantification of flow cytometry analysis.

(legend continued on next page)

Cell 172, 106–120, January 11, 2018 107

Page 4: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

2014; Spies et al., 2013). Thus, the role of APA in different

cell contexts and under different conditions remains to be

determined.

The polyadenylation of mRNAs requires four distinct protein

complexes, comprising the cleavage and polyadenylation spec-

ificity factor (CPSF), cleavage stimulation factor (CstF), cleavage

factor I (CFIm), and cleavage factor II (CFIIm) (Tian and Manley,

2017). Perturbation of complex subunits leads to variable shifts

of polyA site usage, implying a direct role for these factors in

APA (Lackford et al., 2014; Li et al., 2015; Martin et al., 2012;

Masamha et al., 2014). Interestingly, components of CPSF and

CstF display increased expression during reprogramming to plu-

ripotency, which is suggested to modulate polyadenylation site

usage to match patterns observed in pluripotent cells (Ji et al.,

2009). In general, undifferentiated and proliferative cells are

enriched for mRNAs using proximal polyA sites, which may

play a role in cell fate control (Ji et al., 2009; Lackford et al.,

2014). For example, the use of proximal polyA sites within the

Pax3 30UTR renders satellite cells insensitive to miR-206-depen-

dent degradation, impacting muscle progenitor function and

regenerative capabilities (Boutet et al., 2012). These studies sug-

gest that APA patterns are dynamically regulated during devel-

opment and cellular differentiation. However, the relevance of

polyadenylation factors and their functions during cell fate tran-

sitions remain poorly defined.

In this study, we performed a genome-wide, unbiased short

hairpin RNA (shRNA) screen during the induction of iPSCs from

fibroblasts in order to identify novel regulators of cell fate

change. This effort uncovered Nudt21, a component of the

CFIm complex (Ruegsegger et al., 1998), as a potent barrier to

reprogramming and a regulator of widespread APA patterns in

somatic cells acquiring pluripotency.

RESULTS

A Serial shRNA Screen Identifies Nudt21 as a PotentBarrier to ReprogrammingTo identify barriers to reprogramming, we conducted a non-

saturated shRNA enrichment screen during the conversion of

murine embryonic fibroblasts (MEFs) into iPSCs using a previ-

ously described approach (Borkent et al., 2016). Transgenic

MEFs were derived that carried a doxycycline (dox)-inducible

polycistronic transgene encoding Oct4, Klf4, Sox2, and c-Myc

(OKSM) and an OCT4-EGFP reporter (Stadtfeld et al., 2010) (Fig-

ure S1A). These cells were transduced with a lentiviral library

containing 60,642 hairpins targeting 18,464 genes and treated

with dox to induce reprogramming (Figure 1A). Emerging

OCT4-GFP+ cells were purified by fluorescence-activated cell

sorting and hairpins enriched in reprogrammed cells were iso-

lated from genomic DNA. The purified hairpins were then ampli-

fied and re-cloned into lentiviral backbones in order to perform

additional rounds of infection, reprogramming, and hairpin

enrichment. The complexity of the shRNA library decreased

(J) Time course qRT-PCR quantification of gene expression for select pluripotenc

false discovery rate.

*p < 0.05; **p < 0.01; ***p < 0.001; ****p < 0.0001, unpaired Student’s t test. n = 3

See also Figure S1.

108 Cell 172, 106–120, January 11, 2018

with each round of the screen and several hairpins became pro-

gressively enriched, suggesting that they confer a selective

advantage for the generation of iPSCs (Figure 1B). Among the

top scoring hits, Nudt21 shRNAs emerged as the strongest

and most consistent enhancers of reprogramming across

replicates (Figures S1B and S1C). Nudt21 (also called CFIm25

or Cpsf5) is a component of the pre-mRNA cleavage and polya-

denylation complex and has been implicated in alternative poly-

adenylation (APA) of mRNA (Li et al., 2015; Martin et al., 2012;

Masamha et al., 2014). However, Nudt21 has not previously

been associated with cellular reprogramming or mammalian

cell fate regulation. We therefore focused on this molecule for

the remainder of this study.

To confirm the effect of Nudt21 shRNAs on reprogramming, we

transfected reprogrammable MEFs with pooled small interfering

RNAs (siRNAs) targeting Nudt21 and observed an �30-fold in-

crease in the formation of alkaline phosphatase (AP)-positive,

transgene-independent iPSC colonies (Figures 1C and 1D). We

confirmed that Nudt21 protein levels were downregulated in

Nudt21 siRNA-treated cells, indicating specificity of the pheno-

type and efficiency of knockdown (Figure 1E). Importantly,

suppression of Nudt21 did not impact transgene expression

(Figure 1E), excluding the possibility that Nudt21 knockdown

enhances reprogramming by increasing levels of exogenous

OKSM.Nudt21 also dramatically increased iPSC colony numbers

when exogenous c-Myc was omitted from the reprogramming

cocktail, demonstrating that Nudt21 knockdown greatly en-

hances cell fate change in systems with an inherently low reprog-

ramming efficiency (Figures S1D and S1E). Finally, we ensured

that transient Nudt21 suppression does not compromise pluripo-

tency by generating well-differentiated teratomas from iPSCs

derived from reprogrammable MEFs treated with Nudt21

siRNAs (Figure S1F).

To determine whether Nudt21 suppression accelerates the

rate of reprogramming, we exposed reprogrammable MEFs to

dox for different lengths of time before withdrawing dox to quan-

tify transgene-independent iPSC colonies at day 15 (Figure 1F).

While stable iPSC colonies first emerged after 7–8 days of

OKSM expression in control siRNA-treated cultures, consistent

with a prior study (Stadtfeld et al., 2008), Nudt21 siRNA-treated

cells gave rise to iPSC colonies after as little as 4–5 days of

OKSM expression (Figures 1F and S1G). To further refine the

kinetics of reprogramming, we examined cell surface markers

and a reporter allele, which identify early (SSEA1), mid (EpCAM),

or late (OCT4-GFP) stages of reprogramming (Figure 1G) (Polo

et al., 2012; Stadtfeld et al., 2008). Strikingly, levels of SSEA1

were 4- to 5-fold higher in cultures treated with Nudt21 siRNAs

relative to control siRNAs after just 3 days of dox induction and

remained high throughout the course of reprogramming

(Figures 1H and 1I). Similarly, EPCAM and OCT4-GFP were

expressed at earlier time points and at much higher levels

compared to control. By day 12 of induction, up to 40% of cells

in the Nudt21 knockdown condition had attained OCT4-GFP

y genes. Statistical significance was determined using multiple t test with 1%

independent experiments, mean ± SD.

Page 5: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

F G

BA C D

E

K L M

H I J

TAU-GFP

TAU

-GFP

SOX2-GFP CDX2-GFP SO

X2

LYZ-

GFP LY

Z-G

FP

(legend on next page)

Cell 172, 106–120, January 11, 2018 109

Page 6: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

expression compared to only 1% in the control. Moreover, we

noticed that transcription factors and epithelial proteins associ-

ated with pluripotency such as Nanog, Esrrb, and Cdh1 were

induced more rapidly in reprogramming cultures with Nudt21

siRNAs and reached the expression levels observed in iPSCs in

just 9 days (Figure 1J). We excluded that the observed accelera-

tion of reprogramming and enhanced marker expression are due

to an increase in proliferation with Nudt21 knockdown using both

CFSE assays and cell counting (Figures S1H and S1I). Nudt21

knockdown also did not appreciably change apoptosis rates as

judged by AnnexinV staining (Figure S1J).Moreover, suppression

of Nudt21 in uninduced MEFs did not alter their growth rate or

viability (Figures S1H–S1J) and had no discernable impact on

the expression of transcripts related to MEF and iPSC identity

(Figures 1K and 1L). Together, these results indicate that

Nudt21 suppression aids the establishment of the pluripotency

network and generates bona fide iPSCs with reduced latency

but does not induce cell fate changes on its own.

Nudt21 Suppression Promotes Alternative Cell FateTransitionsWe next investigated whether Nudt21 acts as a barrier to cell fate

transitions that do not involve a pluripotent state.We first focused

on transdifferentiation of pre-B cells to macrophages using

ectopic expression of the myeloid transcription factor C/EBPa

(Bussmann et al., 2009) (Figure 2A). By tracking downregulation

of the B cell surface marker CD19 and reciprocal upregulation

of the macrophage surface marker MAC1 (CD11b), we observed

conversion of pre-B cells into macrophages with the expected

increase in size and granularity (Figure S2A) as well as frequency.

However, we were unable to detect an enhancement of transdif-

ferentiation efficiency after suppression of Nudt21 (Figures 2B

and S2A–S2C). Similarly, transdifferentiation of MEFs into

TAU1-GFP+ induced neurons (iNs) (Vierbuchen et al., 2010) was

unchanged with Nudt21 knockdown upon ectopic expression

of the neuronal transcription factor Ascl1, alone or in combination

with Brn2 and Myt1l (Figures 2C, 2D, S2D, and S2E).

Transdifferentiation into macrophages or iNs produces cells

with fundamentally different cell-cycle characteristics compared

to iPSCs (post-mitotic versus proliferative) and differentiation

potentials (terminally differentiated versus pluripotent). We

therefore hypothesized that Nudt21 suppression may uniquely

Figure 2. Nudt21 Suppression Enhances the Transdifferentiation of ME

(A) A schematic of B cell-to-macrophage transdifferentiation.

(B) Time course analysis for lineage markers during B cell-to-macrophage transd

(C) A schematic of MEF to neuron transdifferentiation.

(D) Quantification of neural transdifferentiation.

(E) A schematic of MEF to trophoblast stem cell transdifferentiation.

(F) Immunofluorescence images showing staining for SOX2-GFP and CDX2. Sca

(G) Quantification of iTSC colonies following transdifferentiation.

(H) Bright field images of EBs after 6 days of differentiation. Scale bar, 200 mm.

(I) Quantification of EB diameters for each condition in technical replicate from th

The center bar and whiskers represent the mean and SD of the mean, respective

(J) qRT-PCR for lineage markers at day 6 of induction.

(K) A schematic of the Hoxb8 differentiation system.

(L) Flow cytometry analysis showing LYZ-GFP levels for cells under differentiatio

(M) Quantification for differentiation from progenitor cells to myeloid lineages.

*p < 0.05; **p < 0.01; ***p < 0.001; ****p < 0.0001, unpaired Student’s t test. n = 3

See also Figure S2.

110 Cell 172, 106–120, January 11, 2018

facilitate cell fate transitions that pass through a self-renewing

stem cell state. To test this hypothesis, we performed Nudt21

knockdown during the conversion of MEFs into induced tropho-

blast stem cells (iTSCs) using a previously reported protocol

(Benchetrit et al., 2015; Kubaczka et al., 2015) (Figure 2E).

Briefly, we infected SOX2-GFP MEFs with dox-inducible lentivi-

ral vectors carrying open reading frames for the transcription

factors Tfap2c, Eomes, and Gata3. After 20 days of transcription

factor expression and 10 days of transgene-independent

growth, we detected SOX2-GFP+ iTSC-like colonies that resem-

bled bona fide trophoblast stem cells at the expected efficiency.

These iTSC colonies exhibited a flat, cuboidal morphology, ex-

pressed CDX2 protein, and were surrounded by differentiated

progeny that appeared similar to trophoblast giant cells (Figures

2F, S2F, and S2G). Notably, suppression of Nudt21 by siRNAs

enhanced the formation of SOX2-GFP+ iTSCs by 5-fold, indi-

cating that Nudt21 resists transdifferentiation into this alterna-

tive, non-pluripotent stem cell type (Figure 2G).

Based on the finding that Nudt21 suppression facilitates the

acquisition of early embryonic stem cell states, we explored

whether Nudt21 also plays a role during ESC differentiation. To

this end, we introduced a lentiviral shRNA targeting Nudt21 in

ESCs and evaluated their differentiation potential by generating

embryoid bodies (EBs). Notably, after 6 days of differentiation,

Nudt21-depleted cells gave rise to significantly smaller EBs

compared to control ESCs despite equal numbers of input cells

(p < 0.001) (Figures 2H and 2I). Moreover, Nudt21 knockdown

EBs showed a significant reduction in the expression of neuronal

marker genes (Pax6 and Sox1) but expressed mesoderm- and

endoderm-specific genes at similar levels as control-infected

EBs (Figure 2J). Thus, Nudt21 suppression impairs the potential

of ESCs to differentiate into the ectodermal lineage.

To test whether Nudt21 suppression also affects adult progen-

itor differentiation, we performed shRNA-mediated knockdown

of Nudt21 in a myeloid progenitor cell line carrying a Lysozyme

(LYZ)-GFP reporter, which is silenced in progenitors and acti-

vated in derivative macrophages and neutrophils (Figure 2K).

This cell line is reversibly locked into a self-renewing, undifferen-

tiated state using expression of an estradiol-dependent Hoxb8

transgene, providing a powerful in vitro assay to study progenitor

cell maintenance and differentiation (Sykes et al., 2016). With-

drawal of estradiol in control shRNA transduced progenitors

Fs to iTSCs Yet Delays ESC and Myeloid Differentiation

ifferentiation.

le bar, 100 mm. Arrowheads indicate trophoblast giant cells.

ree independent experiments (Control shRNA, n = 36; Nudt21 shRNA, n = 58).

ly.

n conditions.

independent experiments, mean ± SD.

Page 7: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

A

F

B C

D E

HG

I J

1,562

Controls

s

Phc1 Mgea5 Atm

Figure 3. Nudt21 Suppression Facilitates APA and Gene Expression Changes

(A) A correlation plot comparing polyA site usage for iPS cells versus MEFs.

(B)Acorrelationplot comparing polyAsite usageatDay3of reprogramming forNudt21 siRNAversuscontrol siRNA.KD=knockdown;P/D=proximal todistal ratio.

(C) Multi-dimensional scaling analysis for APA.

(D) Gene tracks showing alternative polyadenylation for Nudt21 targets. Control and Nudt21 knockdown samples were analyzed at day 3 of reprogramming.

(legend continued on next page)

Cell 172, 106–120, January 11, 2018 111

Page 8: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

led to the activation of LYZ-GFP and differentiation in the major-

ity of cells (91%), as expected (Figures 2L and 2M). By contrast,

myeloid progenitors with Nudt21 knockdown activated the

LYZ-GFP reporter in a significantly smaller fraction of cells

(61%), indicating that their differentiation is delayed (Figures 2L

and 2M). Altogether, these results demonstrate that Nudt21

suppression not only facilitates the acquisition of an embryonic

pluripotent (iPSC) and multipotent (iTSC) stem cell state but

also impairs the differentiation of ESCs and an adult hematopoi-

etic progenitor cell type.

Nudt21 Suppression Triggers Alternative PolyA SiteUsage of Hundreds of TargetsIn order to identify potential targets of Nudt21 and thus gain

insight into the mechanisms by which Nudt21 modulates cell

fate change, we performed polyA site-sequencing (PAS-seq)

(Shepard et al., 2011) of untreated MEFs, MEFs expressing

OKSM in the presence of either Nudt21 or control siRNAs,

SSEA1-enriched reprogramming intermediates, and transgene-

independent iPSCs. We found that in iPSCs, twice as many

transcripts utilized proximal polyA sites (117) compared to

MEFs (55), consistent with the notion that undifferentiated,

proliferative cells preferentially use proximal polyA sites

(Ji et al., 2009; Sandberg et al., 2008; Shepard et al., 2011) (Fig-

ure 3A). Remarkably, however, suppression of Nudt21 in MEFs

expressing OKSM for 3 days led to a massive shift of transcripts

from distal to proximal polyA sites compared to the control

(1,562 versus 59) (Figure 3B). Multi-dimensional scaling

(MDS) analysis of PAS-seq data suggested a progressive

reprogramming trajectory from MEFs, through day 3 and 6

control siRNA-treated cells, to SSEA1+ intermediates and ulti-

mately iPSCs (Figure 3C, gray line; Table S1). Interestingly, cells

depleted for Nudt21 were molecularly closer to iPSCs along the

first dimension but distinct from both iPSCs and MEFs along the

second dimension (Figure 3C). This suggests that Nudt21 knock-

down facilitates the acquisition of an iPSC-like polyadenylation

pattern but often exceeds the iPSC phenotype or induces poly-

adenylation changes on transcripts that usually remain stable

during reprogramming. Indeed, while some transcripts showed

APA patterns that were more similar to SSEA1+ intermediates

and iPSCs (e.g., Phc1), others showed enhanced or ectopic

APA patterns (e.g., Atm,Mgea5) upon Nudt21 suppression (Fig-

ure 3B, 3D and S3A). Further supporting the MDS results, gene

ontology analysis of Nudt21-dependent APA changes revealed

enrichment for several categories that have been implicated in

reprogramming such as integrin and MAPK signaling pathways,

but also highlighted cancer-associated signaling and protein

ubiquitination, pointing to the complexity of biological functions

influenced by Nudt21 (Figure 3E).

(E) Ingenuity pathway analysis of APA at day 3 of reprogramming for Nudt21 siR

(F) UGUA distribution around polyA sites.

(G) APstainingof transgene-independent iPScell colonies following reprogrammin

(H) Quantification of AP staining of transgene-independent iPS cell colonies from

(I) Multi-dimensional scaling analysis for gene expression.

(J) Heatmaps for select MEF and pluripotency genes.

*p < 0.05; ****p < 0.0001, unpaired Student’s t test. n = 3 independent experime

See also Figure S3 and Table S1.

112 Cell 172, 106–120, January 11, 2018

To better understand the mechanisms by which Nudt21

modulates APA, we examined the distribution of the Nudt21

binding motif, UGUA, within the 30 UTR of mRNAs (Brown

and Gilmartin, 2003; Martin et al., 2012). Notably, we found

significant enrichment for UGUA motifs near distal polyA sites

compared to the proximal polyA sites within the 30 UTR of

genes that exhibit APA changes following Nudt21 knockdown

(i.e., ‘‘target genes’’) (Figure 3F, left panel). By contrast, UGUA

distribution was similar between proximal and distal polyA

sites of genes whose APA profiles were not affected, termed

‘‘non-target genes’’ (Figure 3F, right panel). To confirm that

transcripts exhibiting APA changes upon Nudt21 knockdown

are direct targets of the CFIm complex, we compared our

dataset with published cross-linking immunoprecipitation

sequencing (CLIP-seq) data for NUDT21 (CFIm25) and its

cofactor CFIm68 in human cells. Our results indicate that

Nudt21 and CFIm68 show higher and more concentrated

binding at distal polyA sites of target genes whereas CLIP an-

alyses showed similar distribution patterns at the proximal

and distal polyA sites of non-target genes (Figure S3B). This

binding pattern is consistent with the observed distribution

pattern of the UGUA motif (Figure 3F) and supports a model

whereby Nudt21 is directed to distal sites to facilitate

polyadenylation.

Nudt21 functions in complex with other factors that impact

APA (Shi, 2012; Tian and Manley, 2017). To corroborate the

finding that Nudt21 enhances reprogramming through an

APA-dependent function, we performed immunoprecipitation

for Nudt21 followed by mass-spectrometry using HEK293T

cells. Gene ontology analysis based on all detected Nudt21-

associated proteins revealed ‘‘RNA processing,’’ ‘‘30-end pro-

cessing,’’ and ‘‘pre-mRNA cleavage’’ among the top-enriched

categories, supporting the conclusion that Nudt21 predomi-

nantly functions at the level of mRNA processing (Figure S3C).

Of note, our analysis confirmed direct interaction between

Nudt21 and other components of the CFIm complex, including

CFIm68 and CFIm59 (Figure S3D).

To test whether suppression of additional components of the

CFIm complex mirror the effect of Nudt21 deficiency, we

depleted CFIm68 during the generation of iPSCs. Previous

studies showed that suppression of CFIm68 also causes 30

UTR shortening in different cell types, although fewer transcripts

are affected compared to Nudt21 suppression (Li et al., 2015;

Martin et al., 2012). Accordingly, CFIm68 knockdown led to a

>7-fold increase in reprogramming efficiency, as judged

by transgene-independent, AP-positive colonies (Figures 3G,

3H and S3E). By contrast, knockdown of the CFIIm subunit

Pcf11, which has previously been shown to cause 30 UTR length-

ening (Li et al., 2015), produced a reciprocal phenotype,

NA versus control siRNA.

g.Cellswere inducedwithdox for 12days, followedby4daysof doxwithdrawal.

(E).

nts, mean ± SD.

Page 9: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

decreasing reprogramming efficiency >2-fold (Figures 3G, 3H,

and S3E). These data demonstrate that suppression of alterna-

tive factors that positively or negatively impact 30UTR length

have opposing effects on reprogramming, further strengthening

our conclusion that Nudt21 influences cell fate through modula-

tion of APA patterns.

Proteomics Analysis Reveals Enrichment for ChromatinRegulators among Upregulated Nudt21 TargetsWe next examined mRNA expression levels for the abovemen-

tioned PAS-seq samples. MDS analysis again showed a clear

trajectory of reprogramming (Figure 3I, gray line; Table S1).

Interestingly, Nudt21 knockdown samples fell along this

trajectory and were more advanced than control samples, sug-

gesting that Nudt21 knockdown accelerates acquisition of a

transcriptional state resembling iPSCs (Figure 3I). Consistent

with this observation, we found that established pluripotency

transcripts such as Nanog, Utf1, and Zfp296 were upregulated

whereas MEF-associated transcripts such as Snai1, Pdgfrb,

and Fibin were downregulated in Nudt21-depleted cells relative

to controls (Figure 3J). We conclude that the transcriptome of

Nudt21-depleted cultures is more similar to iPSCs even though

the extent of distal to proximal polyA shifts is enhanced

compared to unperturbed reprogramming intermediates and

iPSCs (Figure 3C).

Messenger RNAs that shift polyA site usage from distal to

proximal sites are generally thought to yield increased mRNA

or protein levels due to the inability of miRNA to bind to 30UTRsand destabilize transcripts or block translation (Shi, 2012; Tian

and Manley, 2017). However, it remains unclear how universal

this observation is on a genome-wide scale and in different

cellular contexts. To address these questions in our system,

we first compared APA changes with mRNA levels in Nudt21-

depleted cells undergoing reprogramming. We observed a pos-

itive, albeit modest, correlation between APA and mRNA levels,

indicating that 30UTR shortening increases corresponding tran-

script levels in Nudt21-depleted cells (Figure S3F). To determine

the effect of APA on protein levels, we performed large-scale

quantitative proteomic analysis of MEFs expressing OKSM for

3 days in the presence of either Nudt21 or control siRNAs. In

total, we quantified 8,187 proteins and selected the subset that

demonstrated statistically significant changes of 1.2-fold or

more for further analysis (n = 3 replicates per sample) (Figure 4A;

Table S2). We observed a good overall correlation between

mRNA and protein changes in Nudt21 knockdown cells,

suggesting that altered protein levels are generally due to

increased or decreased mRNA levels (Figure 4B). However, we

noticed only a subtle, positive correlation (correlation coeffi-

cient = 0.11) between APA and protein levels when comparing

Nudt21 versus control siRNA-treated cells (Figure 4C), which is

consistent with recent literature (Gruber et al., 2014; Spies

et al., 2013). Overall, only 208 transcripts undergoing APA led

to increased protein levels (12.8%) whereas 74 transcripts led

to decreased protein levels (4.5%) (Figure 4D). We surmise that

the regulatory effect of APA on the remaining transcripts may

depend on additional, cell context-specific factors that are

absent in our system (miRNAs, RNA binding proteins, etc.).

Nevertheless, these data highlight that only a small fraction of

Nudt21-dependent APA changes results in altered protein levels

(17%) that ultimately enhance reprogramming. We therefore

sought to characterize these proteins further.

Strikingly, gene ontology analysis of Nudt21 targets that

change protein levels (n = 282) revealed that upregulated

proteins were enriched for categories related to chromatin regu-

lationwhile downregulated proteins were enriched for categories

related to somatic cell function, including mesenchymal- and

extracellular matrix-related categories (Figures 4E and S4A).

Among the upregulated chromatin factors, we found multiple

components of the polycomb complex (BCOR, RYBP, SFMBT1,

PHC1), chromatin remodelers (CHD1, SS18, CHD9) and chro-

matin readers (WDR5, NSD1) (Figure 4F). We confirmed

increased protein expression for several of these proteins via

western blot analysis (Figure S4B). These results show that

Nudt21 suppression affects APA of a large set of genes, yet

only a subset of resultant transcripts lead to elevated expression

of proteins, which are strongly associated with chromatin

signaling.

Nudt21 Suppression Relieves miRNA-MediatedDegradationAnalysis of Nudt21 targets that increase protein expression

revealed that the 30 UTRs of corresponding transcripts were

enriched for binding sites of several miRNAs. This finding raises

the intriguing possibility that relief from miRNA-mediated

degradation is a contributing mechanism by which Nudt21

knockdown increases reprogramming (Figure S4C). In agree-

ment with this notion, we found that regions of the 30UTRthat are lost during the shift from distal to proximal polyA sites

show enriched binding for the miRNA effector AGO2 in ESCs

(Figure 5A). Moreover, several miRNAs whose binding sites

are lost within the 30UTR of Nudt21 targets have previously

been implicated in reprogramming and pluripotency. In partic-

ular, miR-29a and miR-34c were shown to lower reprogram-

ming efficiency through interactions with p53 or p21 (Choi

et al., 2011; Fraguas et al., 2017) and their expression gradually

diminishes during iPSC generation (Figure S4D, red line). Closer

examination of Jmjdc1, Rybp, and Wdr5 transcripts revealed

that miR34c and miR-29a target sequences were eliminated

when transcripts shifted from distal to proximal polyA sites

with Nudt21 knockdown (Figures 5B and S4E). Of note, previ-

ous work demonstrated that Rybp is directly regulated by

miR-29 (Zhou et al., 2012).

To determine whether APA of Jmjdc1, Rybp, and Wdr5

mRNAs have a direct effect on protein expression, we cloned

the corresponding short (polyadenylated at the proximal polyA

site) and long (distal polyA site) 30 UTR isoforms for each gene

into a luciferase reporter construct (Figure 5C). Luciferase activ-

ity diminished significantly for Jmjdc1 andRybp distal constructs

compared to the proximal constructs but was not significantly

different for Wdr5 (Figure 5D). Notably, co-transfection with

miRNAmimics led to a pronounced reduction in luciferase activ-

ity for Jmjdc1 and Rybp distal constructs (�40% and 30%,

respectively) and a highly significant decrease for theWdr5 distal

construct (46%) relative to proximal constructs (Figure 5D).

These findings confirm that APA eliminates miRNA seed

sequences that directly regulate key targets of Nudt21.

Cell 172, 106–120, January 11, 2018 113

Page 10: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

A Control siRNA Nudt21 siRNA

D

Proteins Up(208 genes)

Nudt21-regulated APA(1623 genes)

12.8%

4.5%

Proteins down(74 genes)

FE

p-va

lue

(-lo

g10)

Reg

. of h

isto

ne a

cety

latio

n

His

tone

ubi

quiti

natio

n

His

tone

mod

ifica

tion

Reg

. of h

isto

ne m

odifi

catio

n

Reg

. of h

isto

ne m

ethy

latio

n

Chr

omat

in m

odifi

catio

n

6

4

2

0

Enrichment AnalysisProteins Up

−1.5

−1

−0.5

0

0.5

1

1.5

ANP32EMECP2CTCFCASC5SETSPIN1BRD3BCORRYBPSFMBT1PHC1POLE3POLE4CHD1SS18CHD9RTF1WDR5NSD1JMJD1C

Control siRNA Day 3

Nudt21 siRNA Day 3

Chromatin readers

Polycombcomponents

Chromatin remodelers

Effectors of active chromatin

NUDT21−1.5

0

1.5

B

C

Pro

tein

(log

10(N

udt2

1 si

RN

A/C

ontro

l))

0.4

0.2

0

-0.2

-0.4

APA (log10(Nudt21 siRNA[P/D]/Control[P/D]))-2 -1 0 1 2

Corr. Coefficient=0.11Chromatin factors

Figure 4. Large-Scale Quantitative Prote-

omics Analysis Reveals a Subset of Upregu-

lated Nudt21 Targets that Are Enriched for

Chromatin Modifiers

(A) A heatmap for differentially expressed proteins

exhibiting a 1.2-fold or greater difference between

Nudt21 and control siRNA at day 3 of reprogram-

ming.

(B) A correlation plot (p value = 0) comparing mRNA

to protein expression at day 3 of reprogramming.

DEGs = differentially expressed genes (>2-fold).

(C) A correlation plot (p value = 8.84 3 10�5)

comparing APA to protein expression at day 3 of

reprogramming. P/D = proximal to distal ratio.

(D) A pie chart showing the percent of Nudt21 tar-

gets that change protein levels 1.2-fold or greater

by day 3 of reprogramming.

(E) Gene Ontology analysis for Nudt21 target pro-

teins that increase expression 1.2-fold or greater by

day 3 of reprogramming.

(F) A heatmap of chromatin modifiers that change

both protein levels and polyadenylation site usage.

See also Figure S4 and Table S2.

To assess whether miR-34c and miR-29a have a functional

effect in our system, we transfected miRNA inhibitors into

MEFs and induced reprogramming. These inhibitors reduced

the expression of each miRNA to <20% of the levels observed

in control samples (Figure S4F). Consistent with previous

reports, we observed a 2- to 3-fold increase in reprogramming

efficiency with miRNA inhibition (Choi et al., 2011; Fraguas

et al., 2017) (Figures 5E and 5F). Collectively, these observations

provide a direct, mechanistic link between APA, the elimination

of miRNA-mediated regulation, and cell fate changes that occur

with Nudt21 knockdown.

114 Cell 172, 106–120, January 11, 2018

Chromatin Regulators Targeted byNudt21 Are Key Mediators ofReprogrammingTo determine whether candidate chro-

matin regulators contribute to enhanced

reprogramming in Nudt21-depleted cells,

we performed co-suppression experi-

ments using siRNAs targeting Nudt21

and individual chromatin factors. We

initially focused on the H3K4me3 reader

WDR5 and the PRC1 component RYBP,

as they have previously been implicated

in pluripotency and reprogramming (Ang

et al., 2011; Li et al., 2017b). Western

blot analysis or qRT-PCR confirmed that

WDR5 and Rybp were upregulated in

Nudt21-depleted cells (Figures S5A and

S5E). Notably, while suppression of

Nudt21 alone led to the expected in-

crease in AP+ colonies, co-suppression

of Nudt21 and either Wdr5 or Rybp

neutralized this effect, suggesting that

upregulation of these factors is involved

in Nudt21’s effect on reprogramming

(Figures S5B–S5F). In further support of the molecular connec-

tion between Nudt21 and Rybp, we found that genes directly

suppressed by RYBP (Li et al., 2017a) including Bmp4, Tril,

and Jun were transcriptionally downregulated upon Nudt21

suppression (Figure S5G). Importantly, co-suppression of

Nudt21 with Rybp or Wdr5 did not increase cell death or alter

proliferation (Figures S5H and S5I), thus excluding the possibil-

ity that knockdown of these factors decreased reprogramming

efficiency by impairing cell viability or growth.

We next examined Nudt21-dependent chromatin factors with

no previous association to induced pluripotency. We chose two

Page 11: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

Rybp

[0 - 325]

500 bp

MEFs

ControlsiRNA

Nudt21siRNA

iPS cells

miR-29a

U U UC G G G C GGA A AC C CG C

-5’3’-

B [0 - 476]

Jmjd1c

500 bp

miR-34cU U CA C G C A5’- -3’

AG A GG C GU..

D

MEFs

ControlsiRNA

Nudt21siRNA

iPS cells

Control miRinhibitor

miR-34cinhibitor

**80

60

40

20

0

Contro

l miR

inhib

itor

miR-34

c inh

ibitor

AP

+ co

loni

es

Reprogrammingefficiency

**60

40

20

0

Contro

l miR

inhib

itor

miR-29

a inh

ibitor

AP

+ co

loni

es

Reprogrammingefficiency

Control miRinhibitor

miR-29ainhibitor

E F

*******

Rybp

**

****

Jmjd1c1.5

1.0

0.5

0Fold

-cha

nge

norm

aliz

ed

luci

fera

se a

ctiv

ity

Wdr5

***

N.S.

Proxim

al+miR

-29a

Proxim

al

Distal+

miR-34

cDist

al

Distal+

miR-29

aDist

al

Proxim

al

1.5

1.0

0.5

0

1.5

1.0

0.5

0

1.5

1.0

0.5

0

1.5

1.0

0.5

0

1.5

1.0

0.5

0

Proxim

al+miR

-34c

Distal

Distal+

miR-29

a

Proxim

al

Proxim

al+miR

-29a

C

Distal polyAconstruct

Luc

Proximal polyAconstruct

Luc

A

AG

O2

CLI

P-s

eq s

igna

l on

prox

imal

-di

stal

regi

on (l

og2(R

PK

M)) -2

-4

-6

-8

-10

p-value: 1.11e-8

AGO2 CLIP-seqMEFs

iPS cells

[0-935]

Wdr5

500 bp

miR-29aU G GA U G U C U

5’- -3’GC C CA C AU A

ControlsiRNA

Nudt21siRNA

APA genesDOWN

APA genesUP

Figure 5. Nudt21 Targets Are Regulated by miRNAs

(A) AGO2 CLIP-seq enrichment on transcripts that change polyadenylation following Nudt21 knockdown. The center bar, boxes, and whiskers represent the

median, first to third quartile, and minimum/maximum values, respectively.

(B) Gene tracks showing APA for Jmjd1c, Rybp, and Wdr5. The inset shows microRNA seed sequences in bold for each transcript. Control and Nudt21

knockdown samples were analyzed at day 3 of reprogramming.

(C) A schematic of the luciferase assay to assess the effect of 30 UTR length on protein expression. Arrows represent polyA sites and the red box represents a

predicted miRNA seed sequence.

(D) Normalized luciferase assay for the indicated genes with or without miRNA mimics.

(E) AP staining of transgene-independent iPS cell colonies generated with miR-34c inhibitor or miR inhibitor control. Cells were induced with dox for 12 days,

followed by 4 days of dox withdrawal.

(F) AP staining of transgene-independent iPS cell colonies generated with miR-29a inhibitor or miR inhibitor control. Cells were induced with dox for 12 days,

followed by 4 days of dox withdrawal.

**p < 0.01; ***p < 0.001; ****p < 0.0001, unpaired Student’s t test. n = 3 independent experiments, mean ± SD, N.S., not significant.

See also Figures S4 and S5.

Cell 172, 106–120, January 11, 2018 115

Page 12: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

A B

C D E

THY

1

MEFsMEFs

MEFsMEFs

log 2(C

hIP

/Inpu

t)lo

g 2(ChI

P/In

put)

10

10

Figure 6. Chromatin Modifiers Targeted by Nudt21 Mediate Cell Fate Change(A) Flow cytometry analysis showingOCT4-GFPpercentages at day 9 of reprogramming, following double knockdownof Nudt21 and the indicated chromatin factors.

(B) Quantification of transgene-independent iPS cell colonies, following double knockdown of Nudt21 and the indicated chromatin factors.

(C) Heatmaps of ATAC-seq read density for peaks at MEF-specific enhancers (left) and ESC-specific enhancers (right), colored by the ratio to the highest

coverage for each enhancer.

(D) H3K27me3 and H3K4me3 coverage at TSSs of genes proximal to the indicated enhancers. The center bar, boxes, and whiskers represent the median, first to

third quartile, and minimum/maximum values, respectively.

(E) Pathway enrichment analysis for genes associated with the MEF- and ESC-specific enhancer.

*p < 0.05; **p < 0.01; ***p < 0.001; ****p < 0.0001, unpaired Student’s t test. n = 3 independent experiments, mean ± SD.

See also Figure S6.

chromatin remodelers, CHD9 and ANP32E, one component of

the polycomb repressive complex 1 (PRC1), PHC1, and the

chromatin readers, RTF1 and SPIN1. To determine their contri-

bution to the enhanced reprogramming observed with Nudt21

knockdown, we performed double transfection with each of

the respective siRNAs (Figures S6A and S6B). Remarkably, co-

suppression of Nudt21 and any of these 4 chromatin factors

led to a significant decrease in the fraction of OCT4-GFP+ cells,

abrogating the positive effect of Nudt21 suppression on reprog-

ramming (Figures 6A and S6C). We also observed a significant

116 Cell 172, 106–120, January 11, 2018

decrease in the number of transgene-independent iPS cell

colonies as judged by AP staining, and decreased expression

of pluripotency-related genes (Nanog, Sall4, Esrrb, and Pou5f1)

after 9 days of reprogramming (Figures 6B, S6D, and S6E).

Cell viability and proliferation were once again unchanged by

co-transfection with siRNAs targeting Nudt21 and the chromatin

modifiers (Figures S6F and S6G). We therefore conclude that a

select set of chromatin readers, writers, and remodelers are

important for reprogramming and likely contribute to enhanced

iPS cell generation in cells with Nudt21 knockdown.

Page 13: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

Nudt21 KDUb

H2BH2A

H3 H4

Me

Me

Me

3’ UTRshortening

Chromatin writer

Chromatin reader

Chromatin remodeler

Control KD

Increased levelsof key proteins

A B

Stem/progenitor cell

Differentiatedcell

Stem/progenitor cell

Differentiatedcell

CHD9

Transcript Protein Cell fate

RYBP

WDR5

RTF1

SPIN1

ANP32E

PHC1

Figure 7. Nudt21 Alters Polyadenylation Site Usage and Modulates Cell Fate Transitions(A) A model showing the effect of Nudt21 knockdown.

(B) A model showing the molecular mechanism of Nudt21 knockdown.

The observation that APA regulates multiple chromatin factors

raises the question of whether Nudt21 depletion alters the chro-

matin landscape during reprogramming. To explore this possibil-

ity, we performed assay for transposase-accessible chromatin

combined with deep sequencing (ATAC-seq) (Buenrostro et al.,

2013) at various time points during iPSC induction. In total, we

identified 16,661 accessible regions in MEFs, which progres-

sively closed during the reprogramming time course and

13,672 loci that were open specifically in iPSCs when compared

to MEFs. The majority of regions that changed chromatin acces-

sibility mapped to enhancer regions (79%), suggesting that chro-

matin remodeling preferentially occurs at enhancers during the

initial phases of reprogramming. Strikingly, Nudt21 knockdown

prompted rapid closure of chromatin at MEF enhancers after

as little as 3 days of reprogramming, while the same regions

remained accessible in control cells (Figures 6C and S6H).

Concomitantly, pluripotency-associated enhancers exhibited

increased chromatin accessibility in cells depleted for Nudt21

compared to control cells at days 3 and 6 of reprogramming

(Figures 6C and S6H).

Many of the chromatin modifiers targeted by APA are compo-

nents of the polycomb and trithorax complexes. We therefore

hypothesized that these complexes are involved in the observed

changes in chromatin accessibility. To examine this prospect,

we identified closest proximal transcription start sites (TSSs)

for each ATAC-seq peak and overlapped them with published

chromatin immunoprecipitation sequencing (ChIP-seq) data for

the trithorax mark, H3K4me3, and the polycomb repressive

mark, H3K27me3 (Kundu et al., 2017). Indeed, we observed

high levels of H3K27me3 and H3K4me3 at TSSs associated

with MEF- and ESC-associated enhancer regions, respectively

(Figure 6D). Pathway enrichment analysis for genes adjacent to

ATAC-seq regions that close during reprogramming revealed

enrichment for ‘‘TGF-b pathway’’ and ‘‘senescence,’’ while

genes associated with loci that progressively open during

reprogramming were enriched for ‘‘histone modification,’’

‘‘cytosine methylation,’’ and ‘‘pluripotency’’ (Figure 6E).

Together, these results reinforce the link between APA-mediated

regulation of chromatin factors and the increased reprogram-

ming efficiency observed with Nudt21 knockdown.

DISCUSSION

Proper establishment and maintenance of cell fate relies on key

regulatory factors. To date, our understanding of these pro-

cesses has focused largely on the direct transcriptional regula-

tors; yet, post-transcriptional and translational mechanisms of

cell fate control remain underappreciated. Our results suggest

that post-transcriptional regulation plays a key role in reprog-

ramming, transdifferentiation, and stem/progenitor cell differen-

tiation. Specifically, we have identified the RNA-binding protein

Nudt21 as a novel regulator of cell fate using induced pluripo-

tency as a discovery tool. Expanding upon this finding, we

dissected mechanism of action for Nudt21 and extended its

role to alternative cell fate transitions, including the generation

of iTSCs and the differentiation of ESCs and myeloid progeni-

tors. Thus, our results reveal that Nudt21 suppression facilitates

both the acquisition andmaintenance of distinct stem or progen-

itor cell states (Figure 7A).

Mechanistically, Nudt21 suppression exerts its effect on cell

fate by inducing a widespread switch of APA patterns in over

1,500 transcripts. This, in turn, relieves miRNA-mediated degra-

dation, revealing a simple yet effective post-transcriptional

mechanism to influence gene expression. Our finding that only

a fraction of Nudt21-dependent APA changes lead to altered

protein levels underscores the importance of measuring protein

levels in parallel with APA patterns to identify meaningful targets.

This result is in good accordance with several recent studies that

reported only modest differences in transcript stability, transla-

tional efficiency, and corresponding protein abundance for

mRNAs with proximal polyadenylation and shorter 30 UTRs

(Gruber et al., 2014; Spies et al., 2013). Despite the low correla-

tion between proximal polyadenylation and protein abundance,

it has been suggested that tissue- or cell-type-specific APA

Cell 172, 106–120, January 11, 2018 117

Page 14: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

patterns are critical for establishing requisite gene expression

programs (Neve et al., 2016).

Here, we demonstrate for the first time that APA indeed regu-

lates a pivotal subset of genes that potently influence cell fate

transitions. Interestingly, this subset was strongly enriched for

chromatin-related genes (Figure 7B). Some of these genes

(Wdr5, Jmjd1c, Chd1, and Rybp) have previously been impli-

cated in iPSC reprogramming (Ang et al., 2011; Gaspar-Maia

et al., 2009; Li et al., 2017b; Shakya et al., 2015), hematopoietic

progenitor maintenance, leukemia (Grebien et al., 2015; Zhu

et al., 2016), and TSC formation (Pirity et al., 2005) while other

factors such as Anp32e, Chd9, Phc1, Rtf1, and Spin1 are novel

regulators of cell fate. Importantly, simultaneous knockdown of

each of these genes with Nudt21 abrogated its effect on reprog-

ramming, suggesting that these are critical downstream targets

of Nudt21 responsible for safeguarding cell identity. The exten-

sive closing of MEF-enhancers and the accelerated opening of

ESC-specific regions in Nudt21-depleted cells undergoing

reprogramming support a model in which APA regulates the

expression of chromatin factors that in turn alter the epigenomic

landscape favoring cell fate transition. However, it remains to be

tested whether the same or different sets of Nudt21 targets are

involved in alternative cell fate transitions such as ESC and

myeloid differentiation and iTSC generation. We also recognize

that some of the �1,500 Nudt21 targets that do not change at

the protein level may contribute to cell fate change by alternative

mechanisms that remain to be elucidated (Mayr, 2017).

Based on the finding that Nudt21 suppression delays progen-

itor cell differentiation, it will also be interesting to test whether

its perturbation contributes to tumorigenesis by modulating

cell proliferation and/or rewiring cell identity. Indeed, Nudt21

suppression was previously shown to promote the growth of a

glioblastoma cell line, although possible changes to cell fate

were not examined (Masamha et al., 2014). Interestingly, reanal-

ysis of this study reveals that Anp32e, Wdr5, and Rybp are also

targets of Nudt21 in the glioblastoma model, suggesting that

reprogramming to pluripotency and malignant transformation

may utilize common pathways (Apostolou and Hochedlinger,

2013) (Figure S6I). Our observation that cancer-associated

pathways are generally enriched among the Nudt21-repsonsive

transcripts further supports a possible role of APA in driving

tumorigenesis. Finally, our findingsmay have relevance in poten-

tial therapeutic settings as themodulation of Nudt21 levels could

be exploited to expand desired progenitor cell populations or

drive cancer cells into differentiation.

STAR+METHODS

Detailed methods are provided in the online version of this paper

and include the following:

d KEY RESOURCES TABLE

d CONTACT FOR REAGENT AND RESOURCE SHARING

d EXPERIMENTAL MODEL AND SUBJECT DETAILS

118 C

B Derivation of mouse embryonic fibroblasts

B Induction of pluripotency

B Transdifferentiation assays

B Teratoma assays

ell 172, 106–120, January 11, 2018

B Embryoid body generation

d METHOD DETAILS

B Serial enrichment shRNA screen

B Cell cycle analysis

B RNA preparation

B qRT-PCR analyses

B Vectors and virus production and infection

B siRNA and miRNA transfection

B Immunofluorescence assays

B Western blot

B Poly(A) site mapping

B ATAC-seq

B Flow cytometry

B Cell lysis and protein digestion

B Protein digestion and peptide labeling

B Offline basic pH reversed-phase (BPRP) fractionation

B Liquid chromatography and tandem mass

spectrometry

B IP mass spectrometry analysis

B Luciferase assays

d QUANTIFICATION AND STATISTICAL ANALYSIS

B Statistical Analysis

B Bioninformatics analysis

B Large-scale Proteomic Data Analysis

d DATA AND SOFTWARE AVAILABILITY

B Deposition of sequencing data

SUPPLEMENTAL INFORMATION

Supplemental Information includes six figures and three tables and can be

found with this article online at https://doi.org/10.1016/j.cell.2017.11.023.

AUTHOR CONTRIBUTIONS

J.B., B.D.S., Y.S., and K.H. conceived the study and wrote the manuscript.

J.B., B.D.S., E.F., M.B., X.W., and B.A.S. performed the experiments and

analyzed the data. G.H. and S.J.E. constructed the shRNA library. G.H. and

M.B. conducted the shRNA screen. K.J.C., M.K., and S.P.G. performed prote-

omic analysis. Y.C. performed mass spectrometry to identify Nudt21 binding

partners. F.J. and R.I.S. contributed bioinformatics analysis.

ACKNOWLEDGMENTS

We thankMaris Handley, AmyGalvin, MarianneGesner, and Eric Surette of the

Harvard Stem Cell Institute flow cytometry core. We are grateful to David

Sykes for sharing HOXB8-ER/LYZ-GFP cells. We thank Aaron Goldstrohm

and members of the Hochedlinger lab for discussions. K.H. was supported

by funds from the MGH, NIH (R01 HD058013-06, P01GM099134-06), and

the Gerald and Darlene Jordan Chair in Regenerative Medicine. J.B. is grateful

for support from the NIH (1F32HD078029-01A1). B.D.S. was supported by an

EMBO long-term fellowship (ALTF 1143-2015). Y.S. was supported by NIH

grants GM090056 and CA17488. G.H. was supported by the NIH Intramural

Research Program (Z01ES102745). S.J.E. was supported by the NIH

(AG11085).

Received: June 14, 2017

Revised: October 8, 2017

Accepted: November 10, 2017

Published: December 14, 2017

Page 15: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

REFERENCES

Ang, Y.S., Tsai, S.Y., Lee, D.F., Monk, J., Su, J., Ratnakumar, K., Ding, J., Ge,

Y., Darr, H., Chang, B., et al. (2011). Wdr5 mediates self-renewal and reprog-

ramming via the embryonic stem cell core transcriptional network. Cell 145,

183–197.

Apostolou, E., and Hochedlinger, K. (2013). Chromatin dynamics during

cellular reprogramming. Nature 502, 462–471.

Arnold, K., Sarkar, A., Yram, M.A., Polo, J.M., Bronson, R., Sengupta, S.,

Seandel, M., Geijsen, N., and Hochedlinger, K. (2011). Sox2(+) adult stem

and progenitor cells are important for tissue regeneration and survival of

mice. Cell Stem Cell 9, 317–329.

Bar-Nur, O., Brumbaugh, J., Verheul, C., Apostolou, E., Pruteanu-Malinici, I.,

Walsh, R.M., Ramaswamy, S., and Hochedlinger, K. (2014). Small molecules

facilitate rapid and synchronous iPSC generation. Nat. Methods 11,

1170–1176.

Beausoleil, S.A., Villen, J., Gerber, S.A., Rush, J., and Gygi, S.P. (2006). A

probability-based approach for high-throughput protein phosphorylation anal-

ysis and site localization. Nat. Biotechnol. 24, 1285–1292.

Benchetrit, H., Herman, S., van Wietmarschen, N., Wu, T., Makedonski, K.,

Maoz, N., Yom Tov, N., Stave, D., Lasry, R., Zayat, V., et al. (2015). Extensive

nuclear reprogramming underlies lineage conversion into functional tropho-

blast stem-like cells. Cell Stem Cell 17, 543–556.

Borkent, M., Bennett, B.D., Lackford, B., Bar-Nur, O., Brumbaugh, J., Wang,

L., Du, Y., Fargo, D.C., Apostolou, E., Cheloufi, S., et al. (2016). A serial shRNA

screen for roadblocks to reprogramming identifies the protein modifier

SUMO2. Stem Cell Reports 6, 704–716.

Boutet, S.C., Cheung, T.H., Quach, N.L., Liu, L., Prescott, S.L., Edalati, A., Iori,

K., and Rando, T.A. (2012). Alternative polyadenylation mediates microRNA

regulation of muscle stem cell function. Cell Stem Cell 10, 327–336.

Brown, K.M., and Gilmartin, G.M. (2003). A mechanism for the regulation of

pre-mRNA 30 processing by human cleavage factor Im. Mol. Cell 12,

1467–1476.

Buenrostro, J.D., Giresi, P.G., Zaba, L.C., Chang, H.Y., and Greenleaf, W.J.

(2013). Transposition of native chromatin for fast and sensitive epigenomic

profiling of open chromatin, DNA-binding proteins and nucleosome position.

Nat. Methods 10, 1213–1218.

Bussmann, L.H., Schubert, A., VuManh, T.P., De Andres, L., Desbordes, S.C.,

Parra, M., Zimmermann, T., Rapino, F., Rodriguez-Ubreva, J., Ballestar, E.,

and Graf, T. (2009). A robust and highly efficient immune cell reprogramming

system. Cell Stem Cell 5, 554–566.

Chen, E.Y., Tan, C.M., Kou, Y., Duan, Q.,Wang, Z., Meirelles, G.V., Clark, N.R.,

and Ma’ayan, A. (2013). Enrichr: interactive and collaborative HTML5 gene list

enrichment analysis tool. BMC Bioinformatics 14, 128.

Choi, Y.J., Lin, C.P., Ho, J.J., He, X., Okada, N., Bu, P., Zhong, Y., Kim, S.Y.,

Bennett, M.J., Chen, C., et al. (2011). miR-34 miRNAs provide a barrier for

somatic cell reprogramming. Nat. Cell Biol. 13, 1353–1360.

Elias, J.E., and Gygi, S.P. (2010). Target-decoy search strategy for mass

spectrometry-based proteomics. Methods Mol. Biol. 604, 55–71.

Fraguas, M.S., Eggenschwiler, R., Hoepfner, J., Schiavinato, J.L., Haddad, R.,

Oliveira, L.H., Araujo, A.G., Zago, M.A., Panepucci, R.A., and Cantz, T. (2017).

MicroRNA-29 impairs the early phase of reprogramming process by targeting

active DNA demethylation enzymes and Wnt signaling. Stem Cell Res. (Amst.)

19, 21–30.

Gaspar-Maia, A., Alajem, A., Polesso, F., Sridharan, R., Mason, M.J.,

Heidersbach, A., Ramalho-Santos, J., McManus, M.T., Plath, K., Meshorer,

E., and Ramalho-Santos, M. (2009). Chd1 regulates open chromatin and

pluripotency of embryonic stem cells. Nature 460, 863–868.

Grebien, F., Vedadi, M., Getlik, M., Giambruno, R., Grover, A., Avellino, R.,

Skucha, A., Vittori, S., Kuznetsova, E., Smil, D., et al. (2015). Pharmacological

targeting of the Wdr5-MLL interaction in C/EBPa N-terminal leukemia. Nat.

Chem. Biol. 11, 571–578.

Gruber, A.R., Martin, G., Muller, P., Schmidt, A., Gruber, A.J., Gumienny, R.,

Mittal, N., Jayachandran, R., Pieters, J., Keller, W., et al. (2014). Global 30

UTR shortening has a limited effect on protein abundance in proliferating

T cells. Nat. Commun. 5, 5465.

Gupta, I., Clauder-Munster, S., Klaus, B., Jarvelin, A.I., Aiyar, R.S., Benes, V.,

Wilkening, S., Huber, W., Pelechano, V., and Steinmetz, L.M. (2014). Alterna-

tive polyadenylation diversifies post-transcriptional regulation by selective

RNA-protein interactions. Mol. Syst. Biol. 10, 719.

Ji, Z., Lee, J.Y., Pan, Z., Jiang, B., and Tian, B. (2009). Progressive lengthening

of 30 untranslated regions of mRNAs by alternative polyadenylation during

mouse embryonic development. Proc. Natl. Acad. Sci. USA 106, 7028–7033.

John, S., Sabo, P.J., Thurman, R.E., Sung, M.H., Biddie, S.C., Johnson, T.A.,

Hager, G.L., and Stamatoyannopoulos, J.A. (2011). Chromatin accessibility

pre-determines glucocorticoid receptor binding patterns. Nat. Genet. 43,

264–268.

Kim, D., Pertea, G., Trapnell, C., Pimentel, H., Kelley, R., and Salzberg, S.L.

(2013). TopHat2: accurate alignment of transcriptomes in the presence of

insertions, deletions and gene fusions. Genome Biol. 14, R36.

Kubaczka, C., Senner, C.E., Cierlitza, M., Arauzo-Bravo, M.J., Kuckenberg, P.,

Peitz, M., Hemberger, M., and Schorle, H. (2015). Direct induction of

trophoblast stem cells from murine fibroblasts. Cell Stem Cell 17, 557–568.

Kundu, S., Ji, F., Sunwoo, H., Jain, G., Lee, J.T., Sadreyev, R.I., Dekker, J., and

Kingston, R.E. (2017). Polycomb repressive complex 1 generates discrete

compacted domains that change during differentiation. Mol. Cell 65, 432–446.

Lackford, B., Yao, C., Charles, G.M., Weng, L., Zheng, X., Choi, E.A., Xie, X.,

Wan, J., Xing, Y., Freudenberg, J.M., et al. (2014). Fip1 regulates mRNA

alternative polyadenylation to promote stem cell self-renewal. EMBO J. 33,

878–889.

Lengner, C.J., Camargo, F.D., Hochedlinger, K., Welstead, G.G., Zaidi, S.,

Gokhale, S., Scholer, H.R., Tomilin, A., and Jaenisch, R. (2007). Oct4

expression is not required for mouse somatic stem cell self-renewal. Cell

Stem Cell 1, 403–415.

Leung, A.K., Young, A.G., Bhutkar, A., Zheng, G.X., Bosson, A.D., Nielsen,

C.B., and Sharp, P.A. (2011). Genome-wide identification of Ago2 binding sites

from mouse embryonic stem cells with and without mature microRNAs. Nat.

Struct. Mol. Biol. 18, 237–244.

Li, H. (2012). Exploring single-sample SNP and INDEL calling with whole-

genome de novo assembly. Bioinformatics 28, 1838–1844.

Li, H., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer, N., Marth, G.,

Abecasis, G., and Durbin, R.; 1000 Genome Project Data Processing

Subgroup (2009). The sequence alignment/map format and SAMtools.

Bioinformatics 25, 2078–2079.

Li, W., You, B., Hoque, M., Zheng, D., Luo, W., Ji, Z., Park, J.Y., Gunderson,

S.I., Kalsotra, A., Manley, J.L., and Tian, B. (2015). Systematic profiling of

poly(A)+ transcripts modulated by core 30 end processing and splicing factors

reveals regulatory rules of alternative cleavage and polyadenylation. PLoS

Genet. 11, e1005166.

Li, H., Lai, P., Jia, J., Song, Y., Xia, Q., Huang, K., He, N., Ping, W., Chen, J.,

Yang, Z., et al. (2017a). RNA Helicase DDX5 inhibits reprogramming to

pluripotency by miRNA-based repression of RYBP and its PRC1-dependent

and -independent functions. Cell Stem Cell 20, 462–477.

Li, H., Lai, P., Jia, J., Song, Y., Xia, Q., Huang, K., He, N., Ping, W., Chen, J.,

Yang, Z., et al. (2017b). RNA helicase DDX5 inhibits reprogramming to

pluripotency by miRNA-based repression of RYBP and its PRC1-dependent

and -independent functions. Cell Stem Cell 20, 571.

Martin, G., Gruber, A.R., Keller, W., and Zavolan, M. (2012). Genome-wide

analysis of pre-mRNA 30 end processing reveals a decisive role of human

cleavage factor I in the regulation of 30 UTR length. Cell Rep. 1, 753–763.

Masamha, C.P., Xia, Z., Yang, J., Albrecht, T.R., Li, M., Shyu, A.B., Li, W., and

Wagner, E.J. (2014). CFIm25 links alternative polyadenylation to glioblastoma

tumour suppression. Nature 510, 412–416.

Cell 172, 106–120, January 11, 2018 119

Page 16: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

Mayr, C. (2017). Regulation by 3’-untranslated regions. Annu. Rev. Genet.

Published online August 30, 2017. https://doi.org/10.1146/annurev-genet-

120116-024704.

Mayr, C., and Bartel, D.P. (2009). Widespread shortening of 3’UTRs by alterna-

tive cleavage and polyadenylation activates oncogenes in cancer cells. Cell

138, 673–684.

McAlister, G.C., Nusinow, D.P., Jedrychowski, M.P., Wuhr, M., Huttlin, E.L.,

Erickson, B.K., Rad, R., Haas, W., and Gygi, S.P. (2014). MultiNotch MS3

enables accurate, sensitive, and multiplexed detection of differential

expression across cancer cell line proteomes. Anal. Chem. 86, 7150–7158.

Neve, J., Burger, K., Li, W., Hoque, M., Patel, R., Tian, B., Gullerova, M., and

Furger, A. (2016). Subcellular RNA profiling links splicing and nuclear DICER1

to alternative cleavage and polyadenylation. Genome Res. 26, 24–35.

Paulo, J.A., O’Connell, J.D., Everley, R.A., O’Brien, J., Gygi, M.A., and Gygi,

S.P. (2016). Quantitative mass spectrometry-based multiplexing compares

the abundance of 5000 S. cerevisiae proteins across 10 carbon sources.

J. Proteomics 148, 85–93.

Pirity, M.K., Locker, J., and Schreiber-Agus, N. (2005). Rybp/DEDAF is

required for early postimplantation and for central nervous system develop-

ment. Mol. Cell. Biol. 25, 7193–7202.

Polo, J.M., Anderssen, E., Walsh, R.M., Schwarz, B.A., Nefzger, C.M., Lim,

S.M., Borkent, M., Apostolou, E., Alaei, S., Cloutier, J., et al. (2012). A molec-

ular roadmap of reprogramming somatic cells into iPS cells. Cell 151,

1617–1632.

Quinlan, A.R., and Hall, I.M. (2010). BEDTools: a flexible suite of utilities for

comparing genomic features. Bioinformatics 26, 841–842.

Ramırez, F., Ryan, D.P., Gruning, B., Bhardwaj, V., Kilpert, F., Richter, A.S.,

Heyne, S., Dundar, F., and Manke, T. (2016). deepTools2: a next generation

web server for deep-sequencing data analysis. Nucleic Acids Res. 44 (W1),

W160-5.

Robinson, M.D., McCarthy, D.J., and Smyth, G.K. (2010). edgeR: a

Bioconductor package for differential expression analysis of digital gene

expression data. Bioinformatics 26, 139–140.

Ruegsegger, U., Blank, D., and Keller, W. (1998). Human pre-mRNA cleavage

factor Im is related to spliceosomal SR proteins and can be reconstituted

in vitro from recombinant subunits. Mol. Cell 1, 243–253.

Sandberg, R., Neilson, J.R., Sarma, A., Sharp, P.A., and Burge, C.B. (2008).

Proliferating cells express mRNAs with shortened 30 untranslated regions

and fewer microRNA target sites. Science 320, 1643–1647.

Shakya, A., Callister, C., Goren, A., Yosef, N., Garg, N., Khoddami, V., Nix, D.,

Regev, A., and Tantin, D. (2015). Pluripotency transcription factor Oct4 medi-

ates stepwise nucleosome demethylation and depletion. Mol. Cell. Biol. 35,

1014–1025.

120 Cell 172, 106–120, January 11, 2018

Shen, Y., Yue, F., McCleary, D.F., Ye, Z., Edsall, L., Kuan, S., Wagner, U.,

Dixon, J., Lee, L., Lobanenkov, V.V., and Ren, B. (2012). A map of the

cis-regulatory sequences in the mouse genome. Nature 488, 116–120.

Shepard, P.J., Choi, E.A., Lu, J., Flanagan, L.A., Hertel, K.J., and Shi, Y. (2011).

Complex and dynamic landscape of RNA polyadenylation revealed by PAS-

Seq. RNA 17, 761–772.

Shi, Y. (2012). Alternative polyadenylation: new insights from global analyses.

RNA 18, 2105–2117.

Spies, N., Burge, C.B., and Bartel, D.P. (2013). 30 UTR-isoform choice has

limited influence on the stability and translational efficiency of most mRNAs

in mouse fibroblasts. Genome Res. 23, 2078–2090.

Stadtfeld, M., Maherali, N., Breault, D.T., and Hochedlinger, K. (2008). Defining

molecular cornerstones during fibroblast to iPS cell reprogramming in mouse.

Cell Stem Cell 2, 230–240.

Stadtfeld, M., Maherali, N., Borkent, M., and Hochedlinger, K. (2010). A

reprogrammable mouse strain from gene-targeted embryonic stem cells.

Nat. Methods 7, 53–55.

Sykes, D.B., Kfoury, Y.S., Mercier, F.E., Wawer, M.J., Law, J.M., Haynes,

M.K., Lewis, T.A., Schajnovitz, A., Jain, E., Lee, D., et al. (2016). Inhibition of

dihydroorotate dehydrogenase overcomes differentiation blockade in acute

myeloid leukemia. Cell 167, 171–186.

Takahashi, K., and Yamanaka, S. (2006). Induction of pluripotent stem cells

from mouse embryonic and adult fibroblast cultures by defined factors. Cell

126, 663–676.

Tian, B., and Manley, J.L. (2017). Alternative polyadenylation of mRNA precur-

sors. Nat. Rev. Mol. Cell Biol. 18, 18–30.

Tucker, K.L., Meyer, M., and Barde, Y.A. (2001). Neurotrophins are required for

nerve growth during development. Nat. Neurosci. 4, 29–37.

Vierbuchen, T., Ostermeier, A., Pang, Z.P., Kokubu, Y., Sudhof, T.C., and

Wernig, M. (2010). Direct conversion of fibroblasts to functional neurons by

defined factors. Nature 463, 1035–1041.

Wang, G.G., Calvo, K.R., Pasillas, M.P., Sykes, D.B., Hacker, H., and Kamps,

M.P. (2006). Quantitative production of macrophages or neutrophils ex vivo

using conditional Hoxb8. Nat. Methods 3, 287–293.

Ye, J., and Blelloch, R. (2014). Regulation of pluripotency by RNA binding

proteins. Cell Stem Cell 15, 271–280.

Zhou, L., Wang, L., Lu, L., Jiang, P., Sun, H., and Wang, H. (2012). A novel

target of microRNA-29, Ring1 and YY1-binding protein (Rybp), negatively

regulates skeletal myogenesis. J. Biol. Chem. 287, 25255–25265.

Zhu, N., Chen, M., Eng, R., DeJong, J., Sinha, A.U., Rahnamay, N.F., Koche,

R., Al-Shahrour, F., Minehart, J.C., Chen, C.W., et al. (2016). MLL-AF9- and

HOXA9-mediated acute myeloid leukemia stem cell self-renewal requires

JMJD1C. J. Clin. Invest. 126, 997–1011.

Page 17: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

STAR+METHODS

KEY RESOURCES TABLE

REAGENT or RESOURCE SOURCE IDENTIFIER

Antibodies

Mouse monoclonal anti-Nudt21 Santa Cruz Biotechnology Cat#2203C3

Rabbit monoclonal anti-beta-Actin Cell Signaling Technology Cat#4970 (13E5)

Rabbit polyclonal anti-Wdr5 Bethyl Labs Cat#A302-430A

Rabbit monoclonal anti-Rtf1 Cell Signaling Technology Cat#14737 (D7V3W)

Mouse monoclonal anti-Phc1 Cell Signaling Technology Cat#13768 (1F3F3)

Rabbit monoclonal anti-c-Myc Cell Signaling Technology Cat#5605 (D84C12)

Mouse monoclonal anti-BrdU Agilent Cat#M074401-8

Mouse monoclonal anti-CD19 BD Biosciences Cat#561738 (1D3)

Rat monoclonal anti-CD11b BD Biosciences Cat#561098 (M1/17)

Mouse monoclonal anti-SSEA1 eBiosciences Cat#eBioMC-480

Rat monoclonal anti-Thy1 eBiosciences Cat#48-0902-82 (53-2.1)

Rat monoclonal anti-EpCAM eBiosciences Cat#25-5791-80 (G8.8)

Goat polyclonal anti-Cdx2 Santa Cruz Biotechnology Cat#SC-19478 (c-20)

Rabbit polyclonal anti-GFP Novus Biologicals Cat#NB-600-308

Rabbit polyclonal anti-Mouse IgG Thermo Fisher Scientific Cat#31450

Goat polyclonal anti-Rabbit IgG Thermo Fisher Scientific Cat#31460

Goat polyclonal anti-Rabbit IgG Thermo Fisher Scientific Cat#R37116

Goat polyclonal anti-Mouse IgG Thermo Fisher Scientific Cat#R37121

Anti-SSEA-1 (CD15) MicroBeads, human and mouse Miltenyi Biotec Cat#130-094-530

Bacterial and Virus Strains

ElectroMAX DH10B Cells Thermo Fisher Scientific Cat#18290015

Chemicals, Peptides, and Recombinant Proteins

Doxycycline hyclate Sigma Aldrich Cat#D9891-100G

N-2 Thermo Fisher Scientific Cat#17502-048

Fetal Bovine Serum GE Healthsciences Cat#SH30396.03

Beta-estradiol Sigma Aldrich Cat#E2758

IL3 Peprotech Cat#213-13

FGF4 Peprotech Cat#100-31

CSF Peprotech Cat#300-03

Heparin Stem Cell Technologies Cat#07980

L-Ascorbic Acid Sigma Aldrich Cat#A92902

Carbenicillin, Disodium Salt Sigma Aldrich Cat#10177012

2,2,2-tribromoethanol Sigma Aldrich Cat#T48402-5G

Tert-amyl alcohol Sigma Aldrich Cat#19954

5-Bromo-20-deoxyuridine (BrdU) Sigma Aldrich Cat#B5002-100MG

Sodium tetraborate decahydrate (Na2B4O7$10H2O) Sigma Aldrich Cat#221333

Lipofectamine 2000 Thermo Fisher Scientific Cat#11668027

holo-Transferrin bovine Sigma Aldrich Cat#T1283-50MG

Pierce Lys-C Protease, MS Grade Thermo Fisher Scientific Cat#90051

Pierce Trypsin Protease, MS Grade Thermo Fisher Scientific Cat#90057

EPPS Sigma Aldrich Cat#E9502-10G

cOmplete, Mini Protease Inhibitor Cocktail Sigma Aldrich Cat#11836153001

TCEP Sigma Aldrich Cat#C4706

(Continued on next page)

Cell 172, 106–120.e1–e10, January 11, 2018 e1

Page 18: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

Continued

REAGENT or RESOURCE SOURCE IDENTIFIER

Iodoacetamide Sigma Aldrich Cat#I6125

Dithiotreitol Sigma Aldrich Cat#233155

Hydroxylamine Sigma Aldrich Cat#159417

Critical Commercial Assays

Nextera DNA library preparation kit Illumina Cat#IP-202-1012

QIAGEN MinElute reaction clean up kit QIAGEN Cat#28204

QIAquick Gel Extraction Kit QIAGEN Cat#28704

miRNeasy Mini Kit QIAGEN Cat#217004

CircLigase II ssDNA Ligase Epicenter Cat#CL9021K

VECTOR Red Alkaline Phosphatase (Red AP) Substrate Kit Vector Laboratories Cat#SK-5100

TMT10plex Isobaric Label Reagent Set Thermo Fisher Scientific Cat#90111

Q5� High-Fidelity DNA Polymerase New England Biolabs Cat#M0491S

GenElute HP Plasmid Maxiprep Kit Sigma Aldrich Cat#NA0310-1KT

High Capacity RNA-to-cDNA kit Applied Biosystems Cat#4387406

TaqMan MicroRNA Reverse Transcription Kit Applied Biosystems Cat#4366596

Brilliant III SYBR Master Mix Agilent Cat#600882

TaqMan Universal PCR Master Mix (no AmpErase UNG) Applied Biosystems Cat#4324018

RNA Fragmentation Reagent Ambion Cat#AM8740

Buffer E (from 4-CORE� Buffer Pack) Promega Cat#R9921

Ampure XP beads Beckman Coulter Cat#A63880

Pierce BCA Protein Assay Kit Thermo Fisher Scientific Cat#23225

Pierce Quantitative Colorimetric Peptide Assay Thermo Fisher Scientific Cat#23275

50 mg Sep-Pak cartridges Waters Cat#WAT054960

Deposited Data

ATAC-seq This study GSE104529

PAS-seq This study GSE99922

Large-scale proteomics This study PXD008078

Nudt21 IP-MS This study PXD008108

Experimental Models: Cell Lines

Mouse embryonic fibroblasts from

B6;129S4- Gt(ROSA)26Sortm1(rtTA*M2)Jae

Col1a1tm1(tetO-Pou5f1,-Klf4,-Sox2,-Myc)Hoch Pou5f1tm2Jae/J

This paper N/A

Mouse embryonic fibroblasts from

B6;129S- Gt(ROSA)26Sortm1(rtTA*M2)Jae Sox2tm2Hoch/J

This paper N/A

Mouse embryonic fibroblasts from

B6.129S4- Gt(ROSA)26Sortm1(rtTA*M2)Jae (Cg)-Mapttm1(EGFP)Klt/J

This paper N/A

C10 pre B cell line Bussmann et al., 2009 N/A

Induced pluripotent stem cell line 2D-iPS This paper N/A

HEK293 cells ATCC Cat#CRL3216

LYZ-GFP/HOXB8-ER cell line Wang et al., 2006 N/A

Experimental Models: Organisms/Strains

Mouse: B6;129S4-

Gt(ROSA)26Sortm1(rtTA*M2)Jae

Col1a1tm1(tetO-Pou5f1,-Klf4,-Sox2,-Myc)Hoch Pou5f1tm2Jae/J

This paper N/A

Mouse: B6;129S-

Gt(ROSA)26Sortm1(rtTA*M2)Jae Sox2tm2Hoch/J

This paper N/A

Mouse: B6.129S4-

Gt(ROSA)26Sortm1(rtTA*M2)Jae (Cg)-Mapttm1(EGFP)Klt/J

This paper N/A

(Continued on next page)

e2 Cell 172, 106–120.e1–e10, January 11, 2018

Page 19: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

Continued

REAGENT or RESOURCE SOURCE IDENTIFIER

Recombinant DNA

FUW-TetO-Ascl1 Vierbuchen et al., 2010 Addgene #27150

FUW-TetO-Myt1l Vierbuchen et al., 2010 Addgene #27151

FUW-TetO-Brn2 Vierbuchen et al., 2010 Addgene #27151

pLV-TetO-Gata3 Kubaczka et al., 2015 Addgene #70270

pLV-TetO-Eomes Kubaczka et al., 2015 Addgene #70271

pLV-TetO-Tfap2c Kubaczka et al., 2015 Addgene #70269

FLAG-CFIm25-pCDNA3.1 This paper N/A

pGL3-Rybp-distal This paper N/A

pGL3-Rybp-proximal This paper N/A

pGL3-Jmjd1c-distal This paper N/A

pGL3-Jmjd1c-proximal This paper N/A

pMIR-REPORT-Wdr5-distal This paper N/A

pMIR-REPORT-Wdr5-proximal This paper N/A

pHAGE-Mir Pooled shRNA library This paper N/A

pHAGE-Mir-shNudt21 This paper N/A

pHAGE-Mir-shRenilla This paper N/A

Oligonucleotides

pHAGE-Mir-PCR: 50- gcaaactggggcacagatgatgcgg This paper N/A

BC1R-L: 50- CGCCTCCCCTACCCGGTAGA This paper N/A

mir30-EcoRI: 50-TAGCCCCTTGAATTCC

GAGGCAGTAGGCA

This paper N/A

p5-miSeq: 50-ATGATACGGCGACCACCGAG

ATCTACACCTAAAGTAGCCCCTTGAATTC

This paper N/A

p7-miSeq-1: 50-CAAGCAGAAGACGGCATA

CGAGACGATAGTGAAGCCACAGATGTA

This paper N/A

p7-miSeq-2: 50-CAAGCAGAAGACGGCATAC

GAGACACTAGTGAAGCCACAGATGTA

This paper N/A

p7-miSeq-3: 50-CAAGCAGAAGACGGCATAC

GAGACTATAGTGAAGCCACAGATGTA

This paper N/A

p7-miSeq-4: 50-CAAGCAGAAGACGGCATACG

AGACCTTAGTGAAGCCACAGATGTA

This paper N/A

PASSEQ7-2 RT oligo: [phos]NNNNAGATCGGAAGAGC

GTCGTGTTCGGATCCATTAGGATCCGAGACGTGTGCT

CTTCCGATCTTTTTTTTTTTTTTTTTTTT[V-Q]

This paper N/A

See also Table S3 This paper N/A

Software and Algorithms

FlowJo V10.2 N/A https://www.flowjo.com/

imageJ N/A https://imagej.nih.gov/ij/download.html

GraphPad Prism N/A https://www.graphpad.com/

DESeq2 1.6.3 N/A https://bioconductor.org/packages/release/

bioc/html/DESeq2.html

RStudio 1.0.14 N/A https://www.rstudio.com/

EnrichR Chen et al., 2013 http://amp.pharm.mssm.edu/Enrichr/

Tophat Kim et al., 2013 https://ccb.jhu.edu/software/tophat/index.shtml

edgeR package Robinson et al., 2010 http://bioconductor.org/packages/release/bioc/

html/edgeR.html

BWA-MEM Li, 2012 http://bio-bwa.sourceforge.net/

HOTSPOT2 John et al., 2011 https://github.com/Altius/hotspot2

(Continued on next page)

Cell 172, 106–120.e1–e10, January 11, 2018 e3

Page 20: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

Continued

REAGENT or RESOURCE SOURCE IDENTIFIER

BEDTools Quinlan and Hall, 2010 http://bedtools.readthedocs.io/en/latest/

SciPy N/A https://www.scipy.org/

DeepTools Ramırez et al., 2016 https://deeptools.readthedocs.io/en/latest/

Python 2.7 N/A https://www.python.org/download/releases/2.7/

SAMtools v1.1 N/A http://samtools.sourceforge.net/

Diva v6.1.2 N/A http://www.bdbiosciences.com/us/instruments/

research/software/flow-cytometry-acquisition/

bd-facsdiva-software/m/111112/overview

ReAdW.exe N/A http://www.ionsource.com/functional_reviews/

readw/t2x_update_readw.htm

SEQUEST-based in-house software pipeline McAlister et al., 2014 N/A

Other

CFIM68 PAR-CLIP in HEK293T Martin et al., 2012 GEO: GSE37401

NUDT21 PAR-CLIP in HEK293T Martin et al., 2012 GEO: GSE37401

AGO2 CLIP-seq in ESCs Leung et al., 2011 GEO: GSE25310

H3K4me3 in ESCs Kundu et al., 2017 GEO: GSE89949

H3K27me3 in ESCs Kundu et al., 2017 GEO: GSE89949

CONTACT FOR REAGENT AND RESOURCE SHARING

Further information and requests for resources and reagents should be directed to and will be fulfilled by the Lead Contact, Konrad

Hochedlinger ([email protected]).

EXPERIMENTAL MODEL AND SUBJECT DETAILS

Derivation of mouse embryonic fibroblastsMouse embryonic fibroblasts (MEFs) were generated essentially as described (Bar-Nur et al., 2014). Following timed mating,

embryos were dissected from pregnant females at E13.5. The head, limbs, spinal cord, gonads, and internal organs were removed

and the remaining tissue was minced in 200 mL of Trypsin-EDTA (Life Technologies). Following a 5 minutes incubation at 37�C, theTrypsin-EDTA was quenched in 10 mL of MEF growth media (500 mL DMEM (Life Technologies) 10% fetal bovine serum (Hyclone),

1X non-essential amino acids (Life Technologies), 1X Glutamax (Life Technologies), 55 mM beta-mercaptoethanol (Sigma)) and

cultured at 37�C at 4% oxygen. For reprogramming experiments, MEFs were derived from mice carrying an inducible, polycistronic

OKSM cassette in the 30 UTR of Col1a1, the M2-rtTA transactivator at the Rosa26 locus, and an EGFP reporter construct under

control of Pou5f1 regulatory elements (Lengner et al., 2007). For transdifferentiation experiments to neurons, MEFs derived from

mice carrying a TAU-GFP reporter were used (Tucker et al., 2001) and MEFs established from mice harboring a SOX2-GFP reporter

were used for transdifferentiation to induced trophoblast stem cells (Arnold et al., 2011). All procedures involvingmice adhered to the

guidelines of the approved Massachusetts General Hospital Institutional Animal Care and Use Committee (IACUC) protocol no.

2006N000104.

Induction of pluripotencyTo induce expression of the pluripotency factors, reprogrammableMEFs were seeded at a density of 15,000 cells per well of a 12well

plate coated with 0.2% gelatin (Sigma). Induction media contained ES culture media (500 mL KO-DMEM (Life Technologies) 15%

fetal bovine serum (Hyclone), 1X non-essential amino acids (Life Technologies), 1XGlutamax (Life Technologies), 1000U/ml leukemia

inhibitory factor, 55 mM beta-mercaptoethanol (Sigma)) supplemented with 50 mg/ml ascorbic acid (Sigma) and 2 mg/ml doxycycline

hyclate (Sigma). Unless otherwise indicated, reprogrammable MEFs were induced for 12 days, followed by 4 days of doxycycline

withdrawal to ensure transgene independence. Alkaline phosphatase staining was performed using a Vector Red kit (Vector Labs)

according to the manufacturer’s recommendation.

Transdifferentiation assaysFor direct conversion of fibroblasts into neurons, Nudt21 and control siRNAs were transfected into TAU-GFP/rtTA MEFs using

Lipofectamine-2000 per the manufacturer’s recommendation. The cells were infected with doxycycline-inducible lentiviral vectors

(FUW-TetO-Ascl1, FUW-TetO-Myt1l and FUW-TetO-Brn2) in MEF growthmedia. Two days after infection, themediumwas changed

e4 Cell 172, 106–120.e1–e10, January 11, 2018

Page 21: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

to neuron induction medium (DMEM-F12, 1X N2 supplement, 1X Glutamax, 25 mg/ml insulin (Sigma)) supplemented with 2 mg/ml of

doxycycline (Sigma). Media was changed every two days. Cultures were analyzed for TAU-GFP expression by flow cytometry

after 7 days of conversion.

For direct conversion of B cells intomacrophages, pre-B cells (C10 line) (Bussmann et al., 2009) were infectedwith lentiviral vectors

carrying shRNA for Nudt21 or shRNA Renilla as a control. The cells were cultured in RPMI Medium, 10% charcoal stripped FBS, 1X

glutamax, 1X penicillin/streptomycin, 55 mM beta-mercaptoethanol. To induce macrophage transdifferentiation, C10 cells were

plated at 50,000 cells/cm2 in media supplemented with b-estradiol (Sigma, E2758) and macrophage cytokines (10 ng/ml IL3 and

10 ng/ml CSF, both from Peprotech). The indicated time points were analyzed for CD19 and MAC-1 expression by flow cytometry.

For direct conversion of MEFs into trophoblast stem cells, Sox2-GFP/rtTAMEFswere infectedwith doxycycline-inducible lentiviral

vectors (pLV-TetO-Gata3, pLV-TetO-Eomes and pLV-TetO-Tfap2c) in MEF media. Eighteen hours later, media was changed to TSC

reprogramming medium (RPMI supplemented with 20% FBS, 0.1 mM b-mercaptoethanol, 1X glutamax, 25 ng/ml human recombi-

nant FGF4 (PeproTech), 1 mg/ml heparin (StemCell Technologies), and 2 mg/ml doxycycline (Sigma)). TSC reprogramming medium

was changed every other day for 20 days, followed by 10 days in TSC culture medium. TSC culture medium contained 30% RPMI

supplemented with 20% FBS, 1% non-essential amino acids, 2 mM L-glutamine, 25 ng/ml human recombinant FGF4 (PeproTech)

and 1 mg/ml heparin (Sigma-Aldrich), and 70% MEF conditioned media with the same supplements. Ten days after doxycycline

removal, plates were screened for Sox2-GFP+/Cdx2+ primary iTSC colonies.

LYZ-GFP/HOXB8-ER cells (Wang et al., 2006) were maintained in RPMI medium supplemented with 10% FBS, 1X glutamax, 1X

penicillin/streptomycin and SCF. The source of SCF was conditioned media generated from a Chinese hamster ovary (CHO) cell line

that stably secretes SCF. Conditioned medium was added at a final concentration of 2%. b-estradiol (Sigma, E2758) was added to a

final concentration of 0.5 mM from a 10-mM stock dissolved in 100% ethanol. To induce differentiation, LYZ-GFP/HOXB8-ER cells

were plated at 50,000 cells/cm2 in RPMI medium supplemented with 10% FBS, 1X glutamax, 1X penicillin/streptomycin and SCF

without b-estradiol. LYZ-GFP expression was analyzed by flow cytometry after 3 days of differentiation.

All the products for cells culture were obtained from Thermo Fisher Scientific unless otherwise specified.

Teratoma assaysFor teratoma generation, iPS cell lines were harvested and resuspended in 600 mL of ES culture media per confluent 6-well dish.

Recipient mice were anesthetized with Avertin (1.61 g/ml 2,2,2-tribromoethanol (Sigma Aldrich) in tert-amyl alcohol (Sigma Aldrich))

and injected with 150 mL of cell suspension subcutaneously. Tumors were harvested 3 to 4 weeks after injection and analyzed

histologically.

Embryoid body generationEmbryoid bodies were derived using the hanging drop method. Briefly, iPS cells were trypsinized and pre-plated directly onto plastic

tissue culture dishes for one hour in ES growthmedia to remove fibroblasts. Following centrifugation, the cells were resuspended at a

density of 13,000 cells/ml in EB media (15% fetal bovine serum (Hyclone), 1X non-essential amino acids (Life Technologies), 1X Glu-

tamax (Life Technologies), 0.4% 1-thioglycerol (Sigma-Aldrich), 1 mM Sodium Pyruvate (Life Technologies), 10 mg/ml Iron-saturated

transferrin (Sigma-Aldrich), 50 mg/ml ascorbic acid (Sigma-Aldrich)) supplemented with 2 mg/ml doxycycline (Sigma-Aldrich). The

cells were then cultured in 30 mL hanging drops for three days. At that time, EBs were collected and seeded into non-adherent plates

and incubated with constant agitation for an additional 3 days.

METHOD DETAILS

Serial enrichment shRNA screenReprogrammable MEFs were infected with a pooled shRNA library (621,000 shRNAs) in MEF growth media supplemented with

10 mg/ml polybrene (EMD-Millipore). Following a 48-hour recovery period, the infected cells were passaged onto gelatinized

15 cm2 cell culture dishes in induction media for 10 days, followed by 4 days of doxycycline withdrawal. The cells were then

harvested, pooled, and purified using SSEA1-linked magnetic beads and an AutoMACS sorter according to the manufacturer’s

recommendations (Miltenyi). SSEA1-enriched cells were FACS-sorted for OCT4-GFP and lysed in 10 mM Tris-HCl pH 8.0, 10 mM

EDTA, 10 mM NaCl, 0.5% SDS. To obtain genomic DNA, the lysates were treated with 0.1 mg/ml RnaseA at 37�C for 30 minutes,

0.5 mg/ml Proteinase K at 55�C for 2 hours, and then subjected to phenol-chloroform extraction with ethanol precipitation. The

resulting genomic DNA was used as template to amplify the resident shRNA vectors via PCR. Each 50 mL PCR reaction contained

2.5 mg genomic DNA template, 200 mM dNTPs, 400 nM of each PCR primer (pHAGE-Mir-PCR: 50- GCAAACTGGGGCACAGAT

GATGCGG; BC1R-L: 50- CGCCTCCCCTACCCGGTAGA), 1X Q5 high GC buffer, and 0.5 mL Q5 polymerase (New England Biolabs).

PCR was performed with the following program: 94�C for 4 minutes (1 cycle), 94�C for 30 s, 60�C for 30 s, 72�C for 45 s (35 cycles),

72�C for 10 minutes (1 cycle). PCR products for each sample were pooled, ethanol precipitated, resuspended, and gel-purified using

theQIAquick Gel Extraction Kit (QIAGEN). The purified shRNAPCRproducts were used to generate sub-libraries for the next round of

shRNA library screens and also to generate sequencing libraries for Solexa sequencing.

For sub-library generation, the purified PCR product was digested with NotI and MluI, and then gel-purified using the QIAquick

Gel Extraction Kit (QIAGEN). Approximately 50 ng of the purified shRNA fragment and 125-250 ng of NotI/MluI pHAGE-Mir vector

Cell 172, 106–120.e1–e10, January 11, 2018 e5

Page 22: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

backbone were ligated in 5 ul ligation reaction using T4 ligase (New England Biolabs). Then, 1 mL ligation reaction was used to trans-

form 20 mL Electromax competent cells DH10b (Life Technology) via electroporation. Tomaintain the representation of the library, we

required at least 100X coverage (i.e., the number of transformed colonies must be at least 100-fold greater than the number of

shRNAs in the library). Transformed colonies were pooled and cultured in 300 mL LB-Carbenicillin (100 mg/ml) and grown at 30�Cfor 2-3 hr. The bacteria were collected and the DNA library was extracted using a Genelute Maxiprep kit per the manufacturer’s

recommendations (Sigma).

For Solexa sequencing, purified shRNA PCR product was used as template for further PCR that included 500 ng of purified

shRNA PCR product, 200 mM dNTPs, 2 mM of PCR primer (p5 and p7-see below), 1x Q5 high GC buffer, and 1 mL Q5 polymerase

(New England Biolabs). PCR was performed with the following program: 94�C for 4 minutes (1 cycle), 94�C for 30 s, 50�C for

20 s, 72�C for 30 s (2 cycles), 94�C for 30 s, 60�C for 20 s, 72�C for 30 s (20 cycles), 72�C 10 minutes (1 cycle). PCR products

were gel-purified using the QIAquick Gel Extraction Kit (QIAGEN) and submitted for Solexa sequencing using a custom sequencing

primer, mir30-EcoRI.

mir30-EcoRI: 50-TAGCCCCTTGAATTCCGAGGCAGTAGGCA

p5-miSeq:

50-ATGATACGGCGACCACCGAGATCTACACCTAAAGTAGCCCCTTGAATTC p7-miSeq-1:

50-CAAGCAGAAGACGGCATACGAGACGATAGTGAAGCCACAGATGTA

p7-miSeq-2:

50-CAAGCAGAAGACGGCATACGAGACACTAGTGAAGCCACAGATGTA

p7-miSeq-3:

50-CAAGCAGAAGACGGCATACGAGACTATAGTGAAGCCACAGATGTA

p7-miSeq-4:

50-CAAGCAGAAGACGGCATACGAGACCTTAGTGAAGCCACAGATGTA

Single-end 51 bp reads were obtained using the Illumina HiSeq or MiSeq instrument. Successful sequencing yielded 22 nucle-

otides that identify the shRNA, followed by a constant region and a 2 nucleotide barcode to identify the sample. Successful reads

were extracted from the sequencing data and matched to the shRNA library annotation file and 2-nucleotide sample bar code. The

total number of reads that were identified for each shRNA, sample, and round, were counted. The counts were normalized against

the total number of counts for that sample and round and then multiplied by the total number of shRNAs in the initial library. A

pseudocount of 1 was added to each normalized count to downweight enrichment derived from low read counts and to avoid

division by zero in calculating fold-changes. The enrichment for individual shRNAs in each round was calculated as the log2fold change of the Oct4-GFP+ normalized counts over the maximum of the normalized counts of the controls (T0, No-Dox,

and Oct4-GFP-). The cumulative enrichment for each shRNA in each round was calculated as the sum of the log2 fold changes

for that round and all previous rounds. The overall enrichment of each shRNA was defined as the maximum of the cumulative

enrichment scores among all rounds.

Cell cycle analysisTo determine cell cycle dynamics, reprogrammable MEFs were treated with doxycycline for 48h and subsequently exposed to

20uM BrdU (Sigma) for 30 minutes in ES culture media. The cells were then trypsinized and kept on ice in 100 mL of PBS (Life

Technologies). Next, two milliliters of cold ethanol were added dropwise to cells, which were then incubated for 30 minutes

on ice. Two milliliters 4N HCl were added for an additional 30 minutes. Cells were then centrifuged at 500 RPM for 5 minutes

at 4�C and resuspended in 1ml of 0.1N Sodium tetraborate decahydrate (Sigma), pH 8.5 and washed with staining buffer (2%

FBS and 0.5% Tween-20 in PBS). Anti-BrdU antibody (mouse, Agilent M074401-8) was added to cells for 30 minutes at 23�Cat a concentration of 5 mg/ml. The cells were then washed three times in PBS incubated with anti-mouse FITC secondary anti-

body (BD Biosciences, 55434) for 30 minutes at 23�C at a dilution of 1:100. After 3 additional washes with PBS, the cells were

resuspended for analysis in PBS containing 2% FBS and 5ug/ml propidium iodide. The samples were then analyzed immediately

on a MACSQuant cytometer.

Cell Trace experiments were conducted according to the manufacturer’s recommendation (Thermo Fisher Scientific).

RNA preparationTotal RNA isolation was performed using themiRNeasy mini kit (QIAGEN). RNAwas eluted from the columns using RNase-free water

and quantified using a Nanodrop ND-1000. cDNA was produced with the High Capacity RNA-to-cDNA kit (Applied Biosystems). For

detection of maturemiRNAs, RNAwas retrotranscribed using the TaqManMicroRNAReverse Transcription Kit (Applied Biosystems)

following the manufacturer instructions.

qRT-PCR analysesFor gene expression analysis, qRT-PCR reactions were set up in triplicate with the Brilliant III SYBR Master Mix (Agilent Genomics)

with primers listed in Table S3. Reactions were run on LightCycler 480 (Roche) PCR machine with 40 cycles of 30 s at 95�C, 30 s at

e6 Cell 172, 106–120.e1–e10, January 11, 2018

Page 23: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

60�C and 30 s at 72�C. For mature miRNA detection, qRT-PCR reactions were set up in triplicate with the TaqMan Universal PCR

Master Mix (no AmpErase UNG, Applied Biosystems) and the miRNA specific TaqMan MicroRNA assays (hsa-miR-34c assay ID

000428 (4427975), mmu-miR-34c* assay ID 002584 (4427975), hsa-miR-29a assay ID: 002112 (4427975) hsa-miR-29a* assay

ID: 002447 (4427975)). Reactions were run on LightCycler 480 (Roche) PCR machine with 40 cycles of 15 s at 95�C, and 1 minute

at 60�.

Vectors and virus production and infectionFUW-TetO-Ascl1, FUW-TetO-Myt1l, FUW-TetO-Brn2, pLV-TetO-Gata3, pLV-TetO-Eomes, and pLV-TetO-Tfap2c lentiviral vectors

have been described previously (Vierbuchen et al., 2010; Kubaczka et al., 2015). For virus production, HEK293T cells were co-trans-

fected with vector plasmid and packaging plasmids (pVSVG and pD8.9) using calcium phosphate transfection. Viral supernatants

were harvested 48 hours later and concentrated by ultracentrifugation at 20,000 X g for 2 hours at 20�C. Viral concentrates were

re-suspended in PBS and stored at �80�C.

siRNA and miRNA transfectionFor knockdown experiments, cells were transfected with pooled siRNA at a final concentration of 15-20 nM, using Lipofectamine-

2000 per the manufacturer’s recommendation (Thermo Fisher Scientific). Briefly, Lipofectamine-2000 and siRNA were added to

separate aliquots of OptiMEM and incubated for 5 minutes at 23�C. The siRNA and transfection reagent were then combined and

incubated for an additional 15 minutes at 23�C. Trypsinized cells were seeded in 750 mL of media at a density of 20,000 cells per

one well of a 12-well dish and the transfection mixture was added immediately. Cell culture media was replaced 6 hours after

transfection. Nudt21 and Wdr5 pooled esiRNAs were purchased from Sigma-Aldrich. All other pooled siRNAs were purchased

from GE-Dharmacon.

FormiRNA knockdown experiments, cells were transfected as described above except thatmiR-29a andmiR-34cwere added to a

final concentration of 4 nM and 2 nM, respectively. Control miRNA inhibitors were added to the same final concentration for each

corresponding experiment. All miRNA inhibitors were purchased from GE-Dharmacon.

Immunofluorescence assaysCells were fixed with 4% paraformaldehyde, blocked in PBS containing 10% goat serum and 0.1% triton and incubated with primary

antibodies overnight at 4�C. On the next day, the cells were exposed to secondary antibodies (all Alexa Fluor from Thermo Fisher

Scientific) at 23�C for one hour. The primary antibodies used were CDX2 (clone C-20, Santa Cruz biotechnology) and GFP

(NB600-308). Nuclear staining was performed using DAPI solution (564907, BD Biosciences).

Western blotThe following antibodies were used for western blot: Nudt21 (Santa Cruz Biotechnology, 2203C3); ACTIN (Cell Signaling Technology,

13E5); WDR5 (Bethyl Labs, A302-430A); RTF1 (Cell Signaling Technology, D7V3W); PHC1 (Cell Signaling Technology, 1F3F3); and

c-MYC (Cell Signaling Technology, D84C12).

Poly(A) site mappingPoly(A) site mapping was carried out as previously described (Spies et al., 2013). Total RNA was extracted with Trizol as per the

manufacturer’s recommendations (Life Technologies). Then, 10 mg of total RNA was fragmented with fragmentation reagent

(Ambion) at 70�C for 10 minutes followed by precipitation with ethanol. After centrifugation, RNA was dissolved and reverse

transcription was performed using Superscript III (Thermo Fisher Scientific) using a custom primer, PASSEQ7-2 RT oligo:

[phos]NNNNAGATCGGAAGAGCGTCGTGTTCGGATCCATTAGGATCCGAGACGTGTGCTCTTCCGATCTTTTTTTTTTTTTTTTTTTT

[V-Q]. cDNA was recovered by ethanol precipitation and centrifugation. cDNA ranging from 120-200 basepairs was gel-purified

and eluted from 8% Urea-PAGE. Recovered cDNA was circularized with Circligase II (Epicenter) at 60�C overnight. Buffer E

(Promega) was added to cDNA and heated at 95�C for 2 minutes, and then slowly cooled to 37�C. Circularized cDNA was

linearized by adding BamHI (Promega). cDNA was centrifuged after ethanol precipitation. PCR was carried out with primers

PE1.0 and PE2.0 containing index. Around 200 basepair of PCR product was gel-purified and submitted for sequencing (single

end, 100 nucleotides).

ATAC-seqATAC-seq was performed as previously described (Buenrostro et al., 2013). 60,000 cells were washed once with 100ml PBS and

resuspended in 50ml lysis buffer (10mM Tris-HCl pH 7.4, 10mM NaCl, 3mMMgCl2, 0.2% IGEPAL CA-630). The suspension of nuclei

was then centrifuged for 10min at 500 g at 4�C, followed by the addition of 50ml transposition reaction mix (25ml TD buffer, 2.5ml Tn5

Transposase and 22.5ml Nuclease Free H2O) and incubation at 37�C for 30min. DNA was isolated using MinElute Kit (QIAGEN).

Libraries were amplified by PCR (13 cycles). After the PCR reaction, the library was selected for fragments between 100 and

1000bp with AmpureXP beads (Beckman Coulter). Libraries were purified with Qiaquick PCR (QIAGEN) and integrity checked on

a Bioanalyzer before sequencing.

Cell 172, 106–120.e1–e10, January 11, 2018 e7

Page 24: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

Flow cytometryGFP and cell surface marker expression was analyzed with an LSR II FACS (BD Biosciences) using Diva v6.1.2 (BD Biosciences)

and FlowJo software v10.2 (TreeStar). Primary antibodies used were CD19 (clone 1D3, BD Biosciences), MAC-1 (clone M1/70,

BD Biosciences), SSEA1 (clone eBioMC-480, eBiosciences), THY1 (clone 53-2.1, eBiosciences), and EPCAM (clone G8.8,

eBiosciences).

Cell lysis and protein digestionCells were suspended in buffer containing 8M urea, 200 mM EPPS pH 8.5 (Sigma Aldrich), and protease inhibitors (Sigma Aldrich)

and syringe lysed 10 times. After centrifugation, clarified lysates were transferred to new tubes. Bicinchoninic acid (BCA) protein

assay (Thermo Fischer Scientific) was performed to determine protein concentration. Proteins were then subjected to disulfide

reduction with 5mM tris (2 carboxyethyl) phosphine (TCEP; Sigma Aldrich) at 23�C for 30 minutes, followed by alkylation with

10 mM iodoacetamide (Sigma Aldrich) at 23�C for 30 minutes in the dark, and 15 mM dithiotreitol (Sigma Aldrich) was used to

quench excess iodoacetamide 23�C for 15 minutes in the dark. Proteins (200 mg) were then chloroform/methanol precipitated

and washed with methanol prior to air drying. Samples were resuspended in 8 M urea (Sigma Aldrich), 50 mM EPPS, pH 8.5,

and then diluted to < 1M urea with 50mM EPPS, pH 8.5.

Protein digestion and peptide labelingProteins were digested for 16 hours with LysC (1:100 enzyme:protein ratio) at 23�C, followed by trypsin (1:100 enzyme:protein

ratio) for 6 hours at 37�C. Peptides were quantified using Pierce Quantitative Colorimetric Peptide Assay. TMT reagents

(0.8 mg; Thermo Fisher Scientific) were dissolved in 40 mL anhydrous acetonitrile (Sigma Aldrich), and 7 mL was used to label

70 mg peptides in 30% (v/v) acetonitrile. Labeling continued for 1 hour at 23�C, until reaction was quenched using 7 mL 5%

hydroxylamine (Sigma Aldrich). TMT-labeled peptides were pooled, vacuum centrifuged, and cleaned using 50 mg Sep-Pak

cartridges (Waters).

Offline basic pH reversed-phase (BPRP) fractionationThe pooled TMT-labeled peptide sample was fractionated using BPRP HPLC. We used an Agilent 1260 Infinity pump equipped

with a degasser and a single wavelength detector (set at 220 nm). Peptides were subjected to a 50 minute linear gradient from

8% to 40% acetonitrile in 10 mM ammonium bicarbonate pH 8 at a flow rate of 0.6 mL/min over an Agilent 300Extend C18

column (3.5 mm particles, 4.6 mm ID and 250 mm in length). We fractionated into a total of 96 fractions, then consolidated

samples into 24 fractions and vacuum centrifuged to near dryness. Twelve fractions were acidified to 1% formic acid

(Sigma Aldrich), desalted via StageTip, dried via vacuum centrifugation, and reconstituted in 5% acetonitrile, 5% formic acid

for LC-MS/MS processing.

Liquid chromatography and tandem mass spectrometryMass spectrometry data were collected using an Orbitrap Fusion mass spectrometer (Thermo Fischer Scientific) equipped with

a Proxeon EASY-nLC 1000 liquid chromatography (LC) system (Thermo Fisher Scientific). Peptides were separated on a 100 mm

inner diameter microcapillary column packed with �35 cm of Accucore C18 resin (2.6 mm, 150 A, Thermo Fisher Scientific).

We loaded �2 mg sample onto the column. Peptides were separated using a 3 hour gradient of acidic acetonitrile. We used

the multinotch MS3-based TMT method (McAlister et al., 2014). The scan sequence began with a MS1 spectrum (Orbitrap

analysis; resolution 120,000; mass range 400�1400 Th). MS2 analysis followed collision-induced dissociation (CID, CE = 35)

with a maximum ion injection time of 150 ms and an isolation window of 0.7 Da. The 10 most abundant MS1 ions of charge

states 2-6 were selected for MS2/MS3 analysis. To obtain quantitative information, MS3 precursors were fragmented by

high-energy collision-induced dissociation (HCD, CE = 65) and analyzed in the Orbitrap (resolution was 60,000 at 200 Th)

with a maximum ion injection time of 2 ms and a charge state-dependent variable isolation window of 0.7 to 1.2 Da

(Paulo et al., 2016).

IP mass spectrometry analysisThe Flag-CFIm25-pCDNA3.1 plasmid was transfected into HEK293 cells with Lipofectamine 2000 per manufacturer’s instruction

(Invitrogen) and single colonies of stable transfectants were selected with G418. Nuclear extracts from control HEK293 cells or a

Flag-CFIm25 cell line were used for immunoprecipitation with anti-Flag antibodies (M2, Sigma). Immunoprecipitated proteins

were eluted from the beads with 3xFlag peptides and eluted proteins were analyzed by mass spectrometry.

Luciferase assaysHEK293 cells were transfected at 50% confluency using Lipofectamine-2000 per the manufacturer’s recommendation. Briefly,

0.25 mg of luciferase constructs and 0.025 mg of renilla control plasmid were added to OptiMEM and incubated for 5 minutes at

23�C. The plasmids and transfection reagent were then combined and incubated for an additional 15 minutes at 23�C, after which

the solution was added dropwise to the cells. Assays including miRNA mimics were transfected similarly, except that mimics were

mixed with the luciferase constructs at a final concentration of 10 nM prior to the addition of Lipofectamine-2000. ThemiRNAmimics

e8 Cell 172, 106–120.e1–e10, January 11, 2018

Page 25: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

for miR-29a and miR34c were purchased from GE-Dharmacon. Luciferase activity was detected using a Dual-Glo Luciferase Assay

Kit according to the manufactuer’s protocol (Promega Corporation).

Oligos for cloning 30 UTRs for reporter assays (Table S3):

Rybp-F ACATCTAGAGATTGCACATGGAATTGTGAAAC

Rybp-dist-rev ACACTCGAGAAATTTTACTATTTTATTTGTGAAAAAACTAC

Rybp-prox-rev ACACTCGAGATGTACATGGAAAATTGTGCAC

Jmjd1c-F ACATCTAGATGCGGTTGGAACTGGGATGC

Jmjd1c-dist-rev ACACTCGAGTCCAGCTTCTTGATAAAGTCTTTTAATG

Jmjd1c-prox-rev ACACTCGAGAGAATTTCTTGGCACTGATGG

Wdr5-F gagctcGTCCTGGCTCCATGGGAGAC

Wdr5-dist-rev aagcttTACAACTTACAACCTTTCTG

Wdr5-prox-rev aagcttTTACAAGGCATGAAAATCTT

QUANTIFICATION AND STATISTICAL ANALYSIS

Statistical AnalysisStatistical analyses were performed using Prism software (GraphPad). Details for statistical analyses, including replicate numbers,

are included in figure legends.

Bioninformatics analysisFor the proteomic, scaling and statistical analysis were performed using the Rpackage DESeq2 (1.6.3). Differential expressed pro-

teins between siCtrl and siNudt21 samples were selected based on the nbinomWaldTest (FDR < 0.01) and 1.2-fold change cut-off.

Heatmaps were generated using RStudio (Version 1.0.14). Gene ontology, miRNA enrichment analysis and pathway enrichment

analysis were performed using EnrichR (Chen et al., 2013).

For PAS-seq analyses, raw reads (single end, 100 nucleotides) were scanned for consecutive adenine nucleotides (at least 15) and

then the polyA and downstream sequences were trimmed. The remaining sequences weremapped to themouse genome (construct

mm9), using TopHat (v2.1.0) with -g 1 parameter (Kim et al., 2013). Alignment with the possibility of internal priming was removed if a

read had 6 or more consecutive adenine nucleotides or more than 7 adenine nucleotides in the 10 nucleotide downstream of the

polyA site. From the remaining reads, those with the 30 end in the interval of �40nt to +40nt of potential polyA sites were used to

generate the count table. To find the polyA sites which were differentially used, we adapted diffSpliceDGE and topSpliceDGE

from edgeR package(v3.8.5) (Robinson et al., 2010). These functions, primarily used to find differential exon usage, generated a

list of sites with significant difference between samples after modeling the polyA sites read counts and comparing the log fold change

of each polyA site to the log fold change of the entire gene. This set was then filtered based on FDR (< 0.05) and the observed shift in

the ratio of polyA site read counts to gene read counts (> 15%). Finally, from the remaining sites, the top two were chosen based on p

value and marked distal or proximal based on their relative location within the gene.

For the analysis of ATAC-seq data, sequencing reads were first mapped to mm10 reference genome using BWA-MEM (Li, 2012)

with default parameters, followed by calling peaks using HOTSPOT2 (John et al., 2011). As a result, at each time point we identified

30,000-50,000 peaks, which showed high consistency between biological duplicates. The union of these peak sets at all time points

(82309 peaks total) was used to calculate the ATAC-seq coverage over each peak region across all samples. We identified 16661

MEF-specific peaks with the strongest coverage in MEFs and > 5-fold lower coverage in iPSC, and 13672 iPSC-specific peaks

with the strongest coverage in iPSCs and > 5-fold lower coverage in MEFs. Only a minority (2.5%) of MEF-specific peaks overlapped

with TSS, whereas the majority (79%) of these peaks overlapped with annotated enhancer regions (Shen et al., 2012), defined as the

1Kb-vicinity of the reported center coordinate. The ATAC-seq read densities over enhancer regions were compared across all sam-

ples to identify MEF- and iPSC-specific enhancers.

For UGUA distributions analysis, the sequence around distal and proximal poly(A) sites were extracted using BEDTools (v2.25.0)

(Quinlan andHall, 2010) for alternatively polyadenylated sites and the same number of sites with no significant changes between con-

trol and experiment chosen randomly. UGUA distributions were extracted from these sequences in the format of a histogram with

20bps bin size. The smooth underlying function of the normalized histogram was then generated using interp1d class in SciPy library

(https://www.scipy.org/) and then visualized.

For CLIP-seq data analysis, PAR-CLIP signals fromGSE37401 (Martin et al., 2012) were normalized at proximal and distal PASs for

target and non-target genes to count the binding frequency per transcript. For proximal PAS, CLIP read counts were divided by the

PAS-seq read counts of that PAS plus all downstream PASs, and for distal PAS the CLIP read counts were divided by the PAS-seq

read counts of the distal PAS. Wig files were converted to bigwigs, and the CLIP signals on �100nt to 100nt region around poly(A)

Cell 172, 106–120.e1–e10, January 11, 2018 e9

Page 26: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

sites were extracted by deepTools (v2.4) (Ramırez et al., 2016) using those bigwig files, separately for each strand. Signals were com-

bined, normalized as described above, and averaged in Python. The generated curves for each set of 200nt intervals (proximal and

distal sites in ‘‘target’’ and ‘‘nontarget’’ genes) were then scaled by their own total coverage, to make the comparison of distributions

easier.

AGO2CLIP-Seq was downloaded fromGSE25310 (Leung et al., 2011). Mapping was done after 30s linker was removed. The reads

aligned to the same exact region were counted as one. The number of readsmapped on the region between proximal and distal polyA

sites of alternatively polyadenylated genes were normalized by the number of mapped reads in each sample. To have a per transcript

coverage, this value was divided by PAS-Seq read counts at distal site. The distribution of this value for alternatively polyadenylated

and differentially expressed genes was illustrated as separate boxplots for genes with repressed and enhanced expressions in

Figure 5A.

For Multidimensional Scaling (MDS) analyses were done in R. Correlation between observed pairwise distances and

pairwise distances after dimension reduction was calculated and the two dimensions that lead to the highest correlation

were chosen.

The computational analyses and visualization if not specified otherwise, were done in Python 2.7. Where necessary, conversion

between BAM and BED files were done using BEDTools (v2.25.0) (Quinlan and Hall, 2010) and BAM files were sorted or indexed

via SAMtools (v1.1) (Li et al., 2009). Kolmogorov–Smirnov test, implemented in Scipy library, was used in multiple cases to

determine if two samples are from the same distribution. The generated p value quantifies the significance of the observations

coming from different distributions. To calculate the correlation coefficient for scatterplots, the stats.pearsonr method from Scipy

library was used.

Large-scale Proteomic Data AnalysisMass spectra were processed using a SEQUEST-based in-house software pipeline (McAlister et al., 2014). Spectra were converted

to mzXML using a modified version of ReAdW.exe. Database searching used the mouse proteome downloaded from Uniprot (http://

www.uniprot.org/downloads) in both forward and reverse directions. Common contaminating protein sequences were included as

well. Searches were performed using peptide mass tolerance of 20 ppm, and a fragment ion tolerance of 0.9 Da. These wide mass

tolerance windows were chosen to maximize sensitivity in conjunction with SEQUEST searches and linear discriminant analysis

(Beausoleil et al., 2006). TMT tags on lysine residues and peptide N termini (+229.163 Da) and carbamidomethylation of cysteine res-

idues (+57.021 Da) were set as static modifications, while oxidation of methionine residues (+15.995 Da) was set as a variable

modification.

Peptide-spectrum matches (PSMs) were adjusted to a 1% false discovery rate (FDR) (Elias and Gygi, 2010). Linear discriminant

analysis was used to filter PSMs, as described previously (Elias and Gygi, 2010), while considering the following parameters: XCorr,

DCn, missed cleavages, adjusted PPM, peptide length, fraction of ions matched, charge state, and precursor mass accuracy. PSMs

were identified, quantified, and collapsed to a 1%peptide false discovery rate (FDR) and then collapsed further to a final protein-level

FDR of 1%. PSMs were quantified from MS3 scans; those with poor quality, MS3 spectra with total TMT reporter signal-to-noise

ratio that is < 200, or no MS3 spectra were excluded from quantitation. Protein quantitation was performed by summing the

signal-to-noise for all peptides for a given protein. Each TMT channel was summed across all quantified proteins and normalized

assuming equal protein loading of all 10 channels.

DATA AND SOFTWARE AVAILABILITY

Deposition of sequencing dataThe accession numbers for the genome-wide sequencing data reported in this paper are GEO: GSE104529 (ATAC-seq) and

GSE99922 (PAS-seq). The accession numbers for the mass spectrometry proteomics data reported in this paper are

ProteomeXchange Consortium via the PRIDE partner repository: PXD008078 and PXD008108.

e10 Cell 172, 106–120.e1–e10, January 11, 2018

Page 27: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

Supplemental Figures

Nud

t21

siR

NA

Nud

t21

siR

NA

MEF iPS

DOX

rtTA

Col1A1

GFP

Rosa26

tetO

OKSM

Pou5f1

A

shN

udt2

1 iP

S c

ells

Mesoderm EndodermEctodermF

Time (days)

AP

+ co

loni

es

GD

Nudt21siRNA

Luc. siRNA

0

40

80

120

160

Nudt21siRNA

AP

+ co

loni

es

Luc.siRNA

EOKS

****

0

50

100

150

200

250

300

350

3 4 5 6 7 8

Zmat5

Klhl2

Csrnp3

Ptprg

AK018240

Hint1

Mbd4

XM_288760

Gm12057

AK004256

Adamts9

Eif2a

Izumo4

Nudt21

0

1

2

3

4

Fold

cha

nge

AP

+ co

loni

es

B

Cell Trace Violet (CFP channel)

Dox

Nudt21 siRNAMock transfect.

No Dox

Cel

l cou

nt (N

orm

aliz

ed T

o M

ode)

Day

1D

ay 2

Day

3D

ay 4

Day

0

No dyeDye

H

0

1

2

3

4 %

DA

PI/A

nnex

inV

+ cel

ls

+Dox No Dox

JNudt21 shRNAControl shRNA

Nudt21 shRNAControl shRNA

0

4

8

12

16

1 2 3 4

Cel

l cou

nt (N

orm

aliz

ed to

Day

1)

+Dox

No Dox

Time (Days)

INudt21 shRNAControl shRNA

0

10

20

30

40

50

60

Scr

ambl

e

1 2 3 4

shNudt21

AP

+ co

loni

es

C

Con

trol s

iRN

AC

ontro

l siR

NA

K L

0

100

200

300

400

500

NanogNanog

Contro

l siR

NA

Nudt21

siRNA

iPSCs

Rel

ativ

e ex

pres

sion

0

50

100

150

200

250

Pou5f1

Contro

l siR

NA

Nudt21

siRNA

iPSCs

Snai2

0.0

0.5

1.0

1.5

Contro

l siR

NA

Nudt21

siRNA

iPSCs

Fbn1 Twist1

0.0

0.5

1.0

1.5

Contro

l siR

NA

Nudt21

siRNA

iPSCs

Contro

l siR

NA

Nudt21

siRNA

iPSCs0.0

0.5

1.0

1.5

(legend on next page)

Page 28: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

Figure S1. Nudt21 Knockdown Dramatically Increases Reprogramming Efficiency to Bona Fide Induced Pluripotent Stem Cells, Related to

Figure 1

(A) A schematic representation of the secondary system used for reprogramming.

(B) Reprogramming efficiency validation for individual candidate genes. Error bars represent standard deviation of the mean for three independent experiments.

(C) Reprogramming efficiency for MEFs treated with independent shRNAs targeting Nudt21. Cells were induced with dox for 12 days, followed by 4 days of dox

withdrawal.

(D) Alkaline phosphatase staining of transgene-independent iPS colonies following reprogramming with OKS. Cells were induced with dox for 14 days, followed

by 4 days of dox withdrawal.

(E) Quantification of alkaline phosphatase staining of transgene-independent iPS colonies following reprogramming with OKS. Error bars represent standard

deviation of the mean for three independent experiments. Statistical significance was determined using a two-tailed unpaired Student’s t test (****p < 0.0001).

(F) Teratomas derived from iPS cells generated with Nudt21 knockdown.

(G) Quantification of the dox withdrawal assay with Nudt21 knockdown.

(H) A cell trace time course for proliferation with Nudt21 knockdown at the indicated time points.

(I) Cell proliferation analysis with Nudt21 knockdown.

(J) AnnexinV staining for cells treated with Nudt21 or control siRNAs.

(K) Bright field images showing uninduced MEF with Nudt21 siRNA or control siRNA. Scale bar: 10 mm.

(L) qRT-PCR quantification for MEF and pluripotency related genes in uninduced MEFs with Nudt21 siRNA or control siRNA.

Page 29: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

(legend on next page)

Page 30: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

Figure S2. Nudt21 Knockdown Mediates Cell Fate Transitions, Related to Figure 2

(A) Flow cytometry analysis showing size and granularity differences for pre-B cell to macrophage transdifferentiation.

(B) qRT-PCR quantification of Nudt21 knockdown in pre-B cells.

(C) Flow cytometry analysis showing lineage marker transitions for pre-B cell to macrophage transdifferentiation.

(D) A western blot at day 3 of transdifferentiation showing Nudt21 knockdown in MEFs undergoing transifferentiation to iNs.

(E) Flow cytometry analysis of TAU-GFP for iN transdifferentiation.

(F) A western blot showing Nudt21 knockdown at day 3 during iTSC transdifferentiation.

(G) Immunofluorescence for iTSC markers. Scale bar: 100 mm.

Page 31: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

A

Phc1

ControlsiRNA-D3

MEF

iPS cells

SSEA1+ D6

SSEA1+ D9

Nudt21siRNA-D3

[0-800]

500 bp Mgea5

[0-550]

500 bp Atm

[0-350]

500 bp

ControlsiRNA-D3

MEF

iPS cells

SSEA1+ D6

SSEA1+ D9

Nudt21siRNA-D3

ControlsiRNA

D3

MEF

iPScells

SSEA1+

D6

SSEA1+

D9

Nudt21siRNA

D3

Corr Coef:0.165APA Genes

APA:log2((Prox/Dist)siNudt21/(Prox/Dist)siCtrl)

mR

NA

:log 2(s

iNud

t21/

siC

trl)

1 2 3 4 5 6 70-3

-2

-1

0

1

2

3Correlation APA vs mRNAs Day3

B

E F

0.05

0.10 DistalProximal

Nudt21 targets

Non-targetsDistalProximal

0.05

0.10

CFIm68 CLIP-seq

-100 1000

Non-targets

NUDT21 CLIP-seq

Nor

mal

ized

Cov

erag

e D

istri

butio

n

0.05

0.10

Nor

mal

ized

Cov

erag

e D

istri

butio

n

0.05

0.10

-100 1000

Nudt21 targets

Contro

l siR

NA

CFIm68

siRNA

0.0

0.5

1.0

Rel

ativ

e ex

pres

sion

CFIm68

0.0

0.5

1.0

Contro

l siR

NA

Pcf11 s

iRNA

Pcf11

Nor

mal

ized

Cov

erag

e D

istri

butio

nN

orm

aliz

ed C

over

age

Dis

tribu

tion

mR

NA

:log 2(s

iNud

t21/

siC

trl)

-3

-2

-1

0

1

2

3

APA:log2((Prox/Dist)siNudt21/(Prox/Dist)siCtrl)-4 -2 0 2 4 6-6

Corr Coef:0.333APA Genes

Correlation APA vs mRNAs Day6

-30-20-100

RNA processing

RNA secon.struct. unwinding

RNA splicing

mRNA 3'-end processing

mRNA processing

p-value (-log10)

GO Nudt21 IP

Log2 (Fold Change)

Fold change >2FDR= 0.05Nudt21CFIm68CFIm59

-10 -5 0 5 10

-Log

2 (P

-Val

ue)

80

60

40

20

C D

Significant proteins

(legend on next page)

Page 32: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

Figure S3. Nudt21 Knockdown Elicits Alternative Polyadenylation on Key Genes, Resembling Profiles of Progressing Reprogramming In-

termediates, Related to Figure 3

(A) Gene tracks showing PAS-seq for Nudt21 targets.

(B) CLIP-seq signal around polyA sites for Nudt21 (left panels) and CFIm68 (right panels).

(C) Gene ontology analysis for Nudt21 interacting proteins.

(D) Volcano plot representation of Nudt21 immunoprecipitation mass-spectrometry data. Grey dots: non-significantly enriched proteins (FDR > 0.05). Black dots:

significantly enriched proteins (FDR < 0.05). Nudt21, CFIm68 and CFIm58 are highlighted in color.

(E) qRT-PCR analysis for CFIm68 and Pcf11 knockdown at day 3 of reprogramming.

(F) Correlation plot for APA versus mRNA at day3 and day6 of reprogramming.

Page 33: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

C

D E

Jmjd1c

[0 - 884]

500 bpWdr5

[0-935]

500 bp

ControlsiRNA-D3

MEF

iPS cells

SSEA1+ D6

SSEA1+ D9

Nudt21siRNA-D3

Rybp

[0-325]

500 bp

ControlsiRNA

D3

MEF

iPScells

SSEA1+

D6

SSEA1+

D9

Nudt21siRNA

D3

ControlsiRNA-D3

MEF

iPS cells

SSEA1+ D6

SSEA1+ D9

Nudt21siRNA-D3

F

miRNA target sites enriched in “Protein Up”

miR-29a/b/c

miR-34b

miR-493

miR-181a/b/c/d

4 60 2p-value (-log10)

miR-520b

PHC1150

Contro

l siR

NANud

t21 si

RNA

RTF1100

β-ACTIN50

NUDT2125

A

Positive regulation of EMT

Chondrocyte development

Collagen fibril organization

Extracellular structure organization

Reg. of cell morph. involved in diff.

Extracellular matrix organization

Enrichment AnalysisProteins Down

p-value (-log10)6420

B

miR-34

c Inh

ibitor

0.0

0.5

1.0

Rel

ativ

e ex

pres

sion

miR34c expression

Contro

l miR

inhib

itor

miR-29

a Inh

ibitor

0.0

0.5

1.0

mir29a expression

Contro

l miR

inhib

itor

0

1

-20 3 6 9 12

-1

miR-34c

Time (days)

SSEA1+SSEA1-

0

1

-1

-2

-3

miR-29a

Exp

ress

ion

SSEA1+SSEA1-

Exp

ress

ion

Figure S4. Knockdown of Nudt21 Eliminates miRNA Seed Sequences via APA, Related to Figures 4 and 5

(A) Gene Ontology analysis for Nudt21 target proteins that decrease expression 1.2-fold or greater by day 3 of reprogramming.

(B) Western blot analysis for RTF1, PHC1, and NUDT21.

(C) ‘‘TargetScan microRNA’’ enrichment analysis for miRNA binding within Nudt21 targets that change protein level.

(D) miR-29a and miR-34c expression during reprogramming (Polo et al., 2012).

(E) Gene tracks showing PAS-seq for chromatin factors targeted by Nudt21.

(F) qRT-PCR for miR-34c and miR-29a in MEFs transfected with miR-34c, miR-29a inhibitor, or miR inhibitor control at day 3 of reprogramming.

Page 34: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

100 101 102 103 104 105

Cell Trace

Nudt21+Control siRNA

Control

Nudt21+Rybp siRNA

Day0-No Dye

Nudt21+Wdr5 siRNA

Nudt21+Control siRNA

Control

Nudt21+Rybp siRNA

Nudt21+Wdr5 siRNA

Nudt21+Control siRNA

Control

Nudt21+Rybp siRNA

Nudt21+Wdr5 siRNA

Nudt21+Control siRNA

Control

Nudt21+Rybp siRNA

Day0

Nudt21+Wdr5 siRNA

Day

2D

ay 3

Day

4D

ay 1

N.S.

Contro

l siR

NA

Nudt21

+con

trol s

iRNA

Cell death

Nudt21

+Wdr5

siRNA

Nudt21

+Ryb

p siR

NA

8

6

4

2

0

%D

API/A

nnex

inV+ c

ells

10

D

H

D

Contro

l siR

NA

Wdr5

siRNA

0

20

40

60

80

100

120

160

AP

+ co

loni

es

Nudt21

siRNA

Nudt21

+Wdr5

siRNA

140

Reprogramming efficiency

****

Bmp450

40

30

20

0

10mR

NA

exp

ress

ion

Tril

0

5

10

15

20 Jun

100

200

300

400

0

G

Control siRNA

Nudt21siRNA

Control siRNA

Nudt21siRNA

Control siRNA

Nudt21siRNA

E

0

0.5

1

1.5

2

2.5

3

%O

CT4

-GFP

+/E

PC

AM

+

Reprogramming efficiencyF

Rel

ativ

e E

xpre

ssio

n

0

0.2

0.4

0.6

0.8

1

1.2

1.4

1.6

1.8

2

2.2

Rybp

Contro

l siR

NA

Rybp s

iRNA

Nudt21

siRNA

Nudt21

+Ryb

p

siRNA

Contro

l siR

NA

Rybp s

iRNA

Nudt21

siRNA

Nudt21

+Ryb

p

siRNA

**

I

37 WDR5

Nud

t21

siR

NA

Wdr

5 si

RN

A

Nud

t21+

Wdr

5 si

RN

A

β-ACTIN37

CA B

37 WDR5

Nud

t21

siR

NA

Con

trol s

iRN

A

25 NUDT21

β-ACTIN50

Control siRNA

Wdr5siRNA

Nudt21 siRNA +Wdr5 siRNANudt21 siRNA

(legend on next page)

Page 35: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

Figure S5. Wdr5 and Rybp Are Regulated by Nudt21 and Impact Reprogramming Efficiency, Related to Figure 5

(A) A western blot showing WDR5 levels with Nudt21 knockdown at day 3 of reprogramming.

(B) A western blot showing WDR5 knockdown with and without Nudt21 knockdown at day 3 of reprogramming.

(C) Alkaline phosphatase staining for transgene independent iPS colonies with simultaneous knockdown of Wdr5 and Nudt21. Cells were induced with dox for

12 days, followed by 4 days of dox withdrawal.

(D) Quantification of alkaline phosphatase staining for transgene independent iPS colonies with simultaneous knockdown of Wdr5 and Nudt21. Error bars

represent standard deviation of the mean for three independent experiments. Statistical significance was determined using a two-tailed unpaired Student’s t test

(****p < 0.0001).

(E) qRT-PCR analysis with the indicated knockdown conditions.

(F) Reprogramming efficiency based on OCT4-GFP for double knockdown of Nudt21 and RYBP at day 6 of reprogramming. Error bars represent standard

deviation of the mean for three independent experiments. Statistical significance was determined using a two-tailed unpaired Student’s t test (**p < 0.01).

(G) mRNA levels for three targets of RYBP with Nudt21 knockdown at day 3 of reprogramming.

(H) AnnexinV staining for cells co-transfected with the indicated siRNAs.

(I) A cell trace time course for proliferation with co-knockdown of Nudt21 with Rybp or Wdr5 at the indicated time points.

Page 36: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

Zfp42 Thy1

Figure S6. Key Chromatin Modifiers Targeted by Nudt21 Mediate Reprogramming, Related to Figure 6

(A) qRT-PCR analysis at day 3 of reprogramming showing the efficiency of knockdown for chromatin factors with simultaneous knockdown of Nudt21.

(B) qRT-PCR analysis at day 3 of reprogramming showing the efficiency of knockdown for Nudt21 with simultaneous knockdown of chromatin factors.

(legend continued on next page)

Page 37: Nudt21 Controls Cell Fate by Connecting Alternative ......7Department of Cell Biology, Harvard Medical School, Boston, MA 02115, USA 8Howard Hughes Medical Institute, Brigham and Women’s

(C) Quantification of OCT4-GFP+ cells at nine days of reprogramming, following double knockdown of Nudt21 and the indicated chromatin factors. Error bars

represent standard deviation of the mean for three independent experiments. Statistical significance was determined using two-tailed unpaired Student’s t test

(***p < 0.001; ****p < 0.0001).

(D) Representative images of alkaline phosphatase staining following double knockdown of Nudt21 and the indicated chromatin factors. Cells were induced with

dox for 12 days, followed by 4 days of dox withdrawal.

(E) Gene expression analysis by qRT-PCR of pluripotency factors at day 9 of reprogramming, following double knockdown of Nudt21 and the indicated chromatin

factors.

(F) A cell trace time course for proliferation with co-knockdown of Nudt21 with chromatin factors at the indicated time points.

(G) AnnexinV staining for cells co-transfected with the indicated siRNAs.

(H) Genome browser screenshots showing ATAC-seq data for the Thy-1 and Zfp42 loci.

(I) Gene ontology analysis of APA changes identified in a previous study (Masamha et al., 2014).