1234567890():,;

ARTICLE

DOI: 10.1038/s41467-017-02643-5

OPEN

CUG initiation and frameshifting enable production

of dipeptide repeat proteins from ALS/FTD

C9ORF72 transcripts

Ricardos Tabet1,2, Laure Schaeffer3, Fernande Freyermuth1,2, Melanie Jambeau1,2, Michael Workman1, Chao-Zong Lee1, Chun-Chia Lin1, Jie Jiang4, Karen Jansen-West5, Hussein Abou-Hamdan6, Laurent Désaubry6, Tania Gendron5, Leonard Petrucelli5, Franck Martin 3 & Clotilde Lagier-Tourenne 1,2

Expansion of G4C2 repeats in the C9ORF72 gene is the most prevalent inherited form of amyotrophic lateral sclerosis and frontotemporal dementia. Expanded transcripts undergo repeat-associated non-AUG (RAN) translation producing dipeptide repeat proteins from all reading frames. We determined cis-factors and trans-factors inﬂuencing translation of the human C9ORF72 transcripts. G4C2 translation operates through a 5′–3′ cap-dependent scanning mechanism, requiring a CUG codon located upstream of the repeats and an initiator Met-tRNAMeti. Production of poly-GA, poly-GP, and poly-GR proteins from the three frames is inﬂuenced by mutation of the same CUG start codon supporting a frameshifting mechanism. RAN translation is also regulated by an upstream open reading frame (uORF) present in mis-spliced C9ORF72 transcripts. Inhibitors of the pre-initiation ribosomal complex and RNA antisense oligonucleotides selectively targeting the 5′-ﬂanking G4C2 sequence block ribosomal scanning and prevent translation. Finally, we identiﬁed an unexpected afﬁnity of expanded transcripts for the ribosomal subunits independently from translation.

1 Department of Neurology, MassGeneral Institute for Neurodegenerative Disease (MIND), Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA. 2 Broad Institute of Harvard University and MIT, Cambridge, MA 02142, USA. 3 Architecture et Réactivité de l’ARN, UPR 9002, Université de Strasbourg, CNRS, F-67000 Strasbourg, France. 4 Ludwig Institute for Cancer Research, University of California San Diego, La Jolla, CA 92093, USA. 5 Department of Neuroscience, Mayo Clinic, Jacksonville, FL 32224, USA. 6 Therapeutic Innovation Laboratory (UMR 7200), Faculty of Pharmacy,
CNRS/University of Strasbourg, 67401 Illkirch, Cedex, France. Correspondence and requests for materials should be addressed to
F.M. (email: f.martin@ibmc-cnrs.unistra.fr) or to C.L.-T. (email: clagier-tourenne@mgh.harvard.edu)

NATURE COMMUNICATIONS | (2018)9:152

| DOI: 10.1038/s41467-017-02643-5 | www.nature.com/naturecommunications

1

ARTICLE

NATURE COMMUNICATIONS | DOI: 10.1038/s41467-017-02643-5

Amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD) are devastating neurodegenerative disorders with a considerable clinical and pathological

overlap, which is further substantiated by the discovery of

C9ORF72 repeat expansions as the most frequent genetic cause for both diseases1,2. Indeed, expansion of a G4C2 hexanucleotide repeat in the ﬁrst intron of the C9ORF72 gene is identiﬁed in

~40% and ~25% of familial ALS and FTD, respectively, as well as 5% of sporadic patients3. The number of G4C2 repeats is normally lower than 30 and can extend to several hundred repeats in

patients. As in other microsatellite diseases, C9ORF72 expansions

are transcribed from both sense and antisense strands (reviewed in ref. 4). Bidirectional transcription of the C9ORF72 locus results

in the production of transcripts containing either repeats that accumulate into RNA foci1,5–10. The

G4C2 or C4G2 G4C2-contain-

ing RNAs were proposed to form G-quadruplex secondary

structures and sequester several RNA-binding proteins (RBPs)

including hnRNP H1/F, ALYREF, SRSF2, hnRNPA1, hnRNPA3, ADARB2, Pur-α, and Nucleolin (reviewed in ref. 4). In addition,

C9ORF72 expanded transcripts are translated into dipeptide

repeat (DPR) proteins through unconventional translation, known as repeat-associated non-AUG (RAN) translation11. RAN

translation occurs in absence of an AUG start codon, in multiple

reading frames of the same repeat-containing transcript, and within coding as well as non-coding regions12. This mechanism

has now been described in several microsatellite expansion diseases, including spinocerebellar ataxia type 8 (SCA8)11, myotonic dystrophy (DM1 and DM2)11,13, Huntington’s disease (HD)14, fragile X-associated tremor/ataxia syndrome (FXTAS)15, spinocerebellar ataxia type 3116, and C9ORF72 ALS/FTD10,17–20. Both

G4C2 sense and C4G2 antisense transcripts are translated from the three coding frames into ﬁve DPR proteins, which aggregate in C9ORF72 ALS/FTD patients10,13,18–21. Poly-Glycine-Alanine

(poly-GA) and poly-Glycine-Arginine (poly-GR) are translated

from the sense strand G4C2 transcripts, while poly-ProlineAlanine (Poly-PA) and poly-Proline-Arginine (poly-PR) are

produced from the antisense strand C4G2 RNA. Poly-GlycineProline (poly-GP) may be produced from both RNA strands.

These DPR proteins are the main components of cytoplasmic

p62-positive, TDP-43-negative aggregates that represent a unique pathological hallmark in C9ORF72 ALS/FTD patients22,23. Evi-

dence supports that DPR proteins, in particular arginine-rich

poly-GR and poly-PR proteins, are toxic and play a central role in

neurodegeneration due to C9ORF72 expansions (reviewed in ref. 24).

However, how RAN translation of C9ORF72 expanded tran-

scripts occurs and which factors are required is unknown.

Translation initiation of canonical mRNAs is a complex process

that requires numerous eukaryotic initiation factors (eIFs) and is

crucial for regulation of gene expression. The 40S ribosomal

subunit binds to the 5′ cap and then scans along the mRNA until

encountering an initiation codon. Most of the regulation is

exerted at the ﬁrst stage, where the AUG start codon is identiﬁed

and decoded by the methionyl-tRNA specialized for initiation (Met-tRNAMeti)25. The efﬁciency of start codon selection is strongly inﬂuenced by surrounding sequences and the recruit-

ment of eIFs. Certain viral and cellular messenger RNAs escape

the canonical translation pathway to attract the ribosomes in a

cap-independent scanning mechanism. These RNAs contain

highly structured sequence, called internal ribosome entry site

(IRES), mimicking initiation factors to directly recruit the ribosome at the start codon26,27. Repeat-containing RNAs may also

adopt stable structures, such as stem loops or G-quadruplexes and

an IRES-like mechanism could be at the origin of RAN translation in microsatellite expansion diseases12,28–32. Against this

hypothesis, RAN translation of CGG repeats associated with

FXTAS was recently shown to involve a canonical cap-dependent scanning mechanism33. The cis-factors and trans-factors inﬂuencing the translation of the human C9ORF72 expansion transcripts are not yet identiﬁed. Determining whether
hexanucleotide G4C2 transcripts recruit the ribosome following the canonical translation initiation or using an IRES mechanism
is a crucial step for the development of therapeutic approaches targeting RAN translation in C9ORF72 ALS/FTD patients.
Herein, we provide mechanistic insights delineating the different steps needed to recruit the ribosome and initiate RAN
translation from G4C2 expansions to produce poly-GA, GP, and GR proteins. Similar to a canonical mechanism of translation34,
the production of DPR proteins from expanded transcripts requires a 5′cap insertion, involves the initiator methionine and
strongly relies on sequences upstream of the repeat. G4C2 RAN translation proceeds by a 5′–3′ canonical scanning mechanism to
start translation at a near-cognate CUG codon and produce DPR proteins by frameshifting. Consistent with this mechanism, we
also demonstrate that G4C2 RAN translation is downregulated by an upstream open reading frame (uORF) present in abnormally spliced C9ORF72 transcripts35. Inhibitors of the pre-initiation ribosomal complex and RNA antisense oligonucleotides (ASOs)
targeting the sequence upstream of the repeats inhibit G4C2 RAN translation, conﬁrming a scanning-dependent mechanism that
may be targeted for therapeutic intervention. Finally, G4C2-containing RNAs are found to be associated with ribosomal subunits
in a translation independent manner supporting a new RNA gain of function mechanism in C9ORF72 disease.

Results

Translation efﬁciency of G4C2 RAN translation. To identify cisfactors and trans-factors inﬂuencing the translation of G4C2
repeats in the context of the C9ORF72 gene, we used a construct

containing 66 repeats that was shown to undergo RAN transla-

tion in all three frames when expressed in cultured cells and in the mouse central nervous system20,36. This construct was

modiﬁed to generate a series of vectors with different sequences

ﬂanking the repeat at the human C9ORF72 locus (Supplementary

Fig. 1 and Table 1). Sequences encoding for a speciﬁc tag in each

of the three reading frames were inserted downstream of the

repeat to monitor the production of poly-GA (HA in the +1

frame), poly-GP (His in the +2 frame), and poly-GR (FLAG in

the +3 frame). RAN translation from all three reading frames is

therefore examined from the same G4C2 construct. A cell-free translation assay based on rabbit reticulocyte lysates

(RRL) was developed to monitor RAN translation efﬁciency from

C9ORF72 transcripts. In vitro RAN translation was observed in

all three frames from capped RNAs with 66 repeats (Fig. 1a–c,

Supplementary Fig. 2a). To accurately compare the translation

efﬁciency of the repeat in each frame, we used as reference Renilla

luciferases with either HA, His, or FLAG tags under the control of

the intergenic region (IGR) IRESs from cricket paralysis virus

(CrPV). IRES are structural RNA elements that allow ribosome

hi-jacking and trigger translation in a cap-independent manner26,27. Among them, IGR promotes highly efﬁcient translation

without any AUG start codon, does not need eIF or the initiator

tRRRNLA39M. eItni3d7e,3e8d, ,

and was canonical

shown to be efﬁciently translated translation under the 5’UTR of the

in β-

globin is only one fold more efﬁcient than translation under the

control of the IGR (Supplementary Fig. 2b–d). We compared the

efﬁciency of C9ORF72 RAN translation in the three reading

frames with the translation of tagged-luciferase reporter mRNAs

that are controlled by the CrPV IGR. A striking difference in

translation efﬁciency was observed between the three frames.

Indeed, translation of the capped (G4C2)66 mRNA in the +1 poly-

2

NATURE COMMUNICATIONS | (2018)9:152

| DOI: 10.1038/s41467-017-02643-5 | www.nature.com/naturecommunications

NATURE COMMUNICATIONS | DOI: 10.1038/s41467-017-02643-5

ARTICLE

a
Repeats

Uncapped 66 30

Capped 66 30

Uncapped
IGR-luciferase M HA His FLAG

d
18 16

(G4C2)66 translation efficiency (relative to IGR)

GA (20.5 kDa)
GA (14.5 kDa)
Anti- HA

b
Repeats

Uncapped 66 30

kDa 35 25
15

Capped

Uncapped

66 30 IGR-luciferase HA His FLAG

e

GP (20.5 kDa)

kDa 35 25
15 f

Anti- His
c Uncapped
Repeats 66 30

Capped

Uncapped

66 30 IGR-luciferase HA His M FLAG

g

GR (20.5 kDa) GR (14.5 kDa)
Anti- FLAG

kDa 35 25
15

GR translation efficiency (relative to capped 66 repeats)

GP translation efficiency (relative to capped 66 repeats)

GA translation efficiency (relative to capped 66 repeats)

2 1

0 GA GP GR IGR

1.2 *
1 0.8 0.6 0.4 0.2
0

**

1.2 ** ***
1 0.8 0.6 0.4 0.2
0

2*
1.6
1.2
0.8 0.4
0 Unc Capped Unc Capped 30 Repeats 66 Repeats

Fig. 1 G4C2 RAN translation is length dependent and displays different efﬁciencies across reading frames. RNA transcripts with (G4C2)30 or (G4C2)66 repeats were transcribed in vitro with T7 RNA polymerase, capped or not capped and subjected to translation in rabbit reticulocyte lysate (RRL) system. Increasing RNA concentrations (100 and 200 nM) were used for translation in RRL. RAN translation was probed on immunoblot with antibodies to (a) HA tag in the +1 poly-GA frame, (b) His tag in the +2 poly-GP frame, and (c) FLAG tag in the + 3 poly-GR frame. Schematics of constructs with 30 repeats (#3) and 66 repeats (#4) are shown in Figure S1. (d) Efﬁciencies of RAN translation in the different frames were measured relatively to Renilla Luciferase with the corresponding tags driven by the intergenic region (IGR) IRES from the cricket paralysis virus. The efﬁciencies of RAN translation from capped RNAs were compared to uncapped RNAs at 100 nM for (e) poly-GA, (f) poly-GP, (g) poly-GR with 30 or 66 G4C2 repeats, relatively to the capped 66 repeats. Graphs represent mean ± SEM, n = 3. Student’s t-test, *p ≤ 0.05; **p ≤ 0.01; ***p ≤ 0.001

GA frame was 17 times more efﬁcient than the IGR-luciferase reporter (Fig. 1a, d). In contrast, translation efﬁciency from polyGP in the +2 frame and poly-GR in the +3 frame was equivalent to the translation of IGR-luciferase (Fig. 1b–d). Notably, poly-GA aggregates are the most prevalent DPR proteins accumulated in post-mortem brain samples from C9ORF72 ALS/FTD patients (Supplementary Fig. 3)17,40 supporting that translation of the C9ORF72 repeat is most efﬁcient in the +1 frame both in vitro and in vivo.
We also uncovered that the size of the expansion does not equally inﬂuence translation of the different frames. Production of poly-GP in the +2 frame was strongly inﬂuenced by the size of the repeat when comparing 30 and 66 repeats (Fig. 1b, f, Supplementary Fig. 1; constructs #3 vs. #4). In contrast, no

signiﬁcant difference was observed for poly-GA or poly-GR, which were equally expressed from both 30 and 66 G4C2 repeats (Fig. 1a, c, e, g).
Cap-dependent G4C2 translation initiates with methionine. Our in vitro assay provided the opportunity to determine whether RAN translation of the C9ORF72 repeat depends on the presence of a 5′m7G cap. Levels of poly-GA produced from 66 repeats increased more than ﬁve times when transcripts were capped (Fig. 1a, e) and poly-GP/GR syntheses were strongly repressed in absence of the cap, supporting a canonical cap-dependent mechanism of translation for all three DPR proteins (Fig. 1a, b, c, e, f, g).

NATURE COMMUNICATIONS | (2018)9:152

| DOI: 10.1038/s41467-017-02643-5 | www.nature.com/naturecommunications

3

ARTICLE

NATURE COMMUNICATIONS | DOI: 10.1038/s41467-017-02643-5

a
Constructs Cap 113 nt
#3

HA His FLAG (GGGGCC)
30

d

Canonical cap-dependent initiation Met

#4 113 nt (GGGGCC)

4G 4A

66 +1 +2 +3
(GA) (GP) (GR)

UAC 4E

(GGGGCC)30 or 66

b
Capped

Capped

4E 4E

40S

Competition assay

No 66 RNA

66

30 Repeats [RNA]

4E

4E Cap analog

(m7GpppG)

Inactive cap (ApppG)

35S-Met Lipoxygenase

*

35S-Met DPRs
35S-Met globin

*

e
kDa
35 Competition 25 assay
15 35S-Met DPRs

Capped 66 Repeats

Control (ApppG)

Cap analog (m7GpppG)

–

35S-Met autoradiography
c
Capped 66 Repeats (GA)HA

IP – +
35S-Met DPRs

kDa 25

Fold enrichment

7.15

35S-Met autoradiography

f
Competition assay
GA

35S-Met autoradiography

Capped 66 Repeats

Control (ApppG)

Cap analog (m7GpppG)

–

Immunoblot with Anti-HA

kDa 25
kDa 25

Fig. 2 G4C2 RAN translation is cap-dependent and initiates with a methionine. (a) Schemes of the RNA with (G4C2)30 (#3) or (G4C2)66 (#4) repeats that were transcribed in vitro with T7 RNA polymerase, capped and subjected to translation in RRL. (b) Translation was performed in the presence of [35S]-
methionine and capped RNA #3 or #4 at 100 and 200 nM. RAN translation products were detected by autoradiography. Asterisk indicates bands in the stacking gel. (c) Translation was performed in presence of [35S]-methionine and capped RNA #4 followed by immunoprecipitation with antibody against HA-tag and detection of immunoprecipitated [35S]-methionine proteins by autoradiography. (d) Scheme of the canonical translation involving the cap-
binding protein eukaryotic initiation factor 4E (eIF4E), the protein platform (eIF4G) and the helicase (eIF4A) that recruit the 40S ribosomal subunit. This
pre-initiation complex scans the 5′ of the transcript for an appropriate start codon. Compounds used for the competition assay in (e) and (f) are represented by dark circles and squares for the cap analog (m7GpppG) and the inactive form (ApppG), respectively. (e–f) Translation was performed in presence of [35S]-methionine, capped (G4C2)66 RNA #4 and an increased concentration of inactive cap (control, ApppG) or cap analog (competitor of the cap, m7GpppG). [35S]-methionine RAN translation products and poly-GA were detected by (e) autoradiography and (f) immunoblot with an antibody
against HA-tag, respectively

Canonical translational initiation consists of base-pairing between the initiator Met-tRNAMeti anticodon and the AUG start codon. The incorporation of [35S]-methionine during the translation of transcripts expressing 30 repeats (#3) or 66 repeats (#4) was measured to determine whether RAN translation requires Met-tRNAMeti for the production of DPR proteins (Fig. 2a, b). Notably, the sequence of the transcripts #3 and #4 do not contain any AUG codon and the presence of [35S]methionine in RAN products cannot derive from the incorporation of an internal methionine (Supplementary Fig. 1 and Table 1). A speciﬁc [35S]-methionine band was detected at the expected 14.5 and 20.5 kDa molecular weight from constructs expressing 30 and 66 repeats, respectively (Fig. 2b). The level of [35S]-methionine labeled polypeptide(s) was proportional to RNA concentration indicating that RAN translation is observed in subsaturating conditions. Immunoprecipitation of poly-GA products with a HA-speciﬁc antibody conﬁrmed that RAN translation initiates with the incorporation of a methionine residue (Fig. 2c). To further demonstrate that G4C2 RAN translation starts with a

methionine, we inhibited the activity of the methionylated initiator tRNAMet carrier eIF2 by inducing the phosphorylation of its α subunit with poly(I:C)/salubrinal treatment as previously described41 (Supplementary Fig. 4a). While this treatment did not have any impact on the non-canonical translation of IGR-renilla luciferase, it inhibited the translation of a capped-dependent renilla luciferase reporter and the incorporation of [35S]methionine in DPR products (Supplementary Fig. 4b).
To gain further insights in the mechanism of RAN translation, we investigated the requirement of eukaryotic initiation factor eIF4E. In canonical translation the cap binding protein eIF4E is part of a larger complex called eIF4F, which also contains the platform protein eIF4G and the RNA helicase eIF4A34 (Fig. 2d). To test whether eIF4E is involved in the RAN translation of G4C2 repeats we monitored the translation efﬁciency in the presence of an excess of cap analog (m7GpppG) or its non-functional ApppG counterpart. The competition assay was performed in RRL (Fig. 2e, f) and wheat germ extract (WGE) (Supplementary Fig. 4c, d), a highly cap-dependent system42. Increasing

4

NATURE COMMUNICATIONS | (2018)9:152

| DOI: 10.1038/s41467-017-02643-5 | www.nature.com/naturecommunications

NATURE COMMUNICATIONS | DOI: 10.1038/s41467-017-02643-5

ARTICLE

a Identification of translation initiation codon
Met

4G 4A
UAC 4E
40S

? START codon

b
Constructs
Cap #4
86 nt

Near-cognate start codon in perfect Kozak sequence
–3 +4

#5

#6

#7

#8 6 nt
#9
#10

#11

ce

– 24 nt – 13 nt
Constructs 4 5 6 7 8 9 kDa

Kozak –3G/+4G –3U/+4C –3G/+4G – 24 nt AUG CUG CUG Constructs 10 11 4 kDa

66 35S-Met DPRs

25 35S-Met DPRs

25

35S-Met autoradiography

35S-Met autoradiography

dHA His FLAG

– 24 nt

66 – 13 nt

Constructs 4 5 6 7 8

66 Anti-HA (GA)

66

f

9 kDa 25

Kozak –3G/+4G –3U/+4C –3G/+4G – 24 nt AUG CUG CUG
Constructs 10 11 4

kDa

Anti-HA (GA)

25

66
Anti-His (GP)
66

* 25
Anti-His (GP)

* 25

66 Anti-FLAG (GR)
66
66 +1 +2 +3
(GA)(GP)(GR)

Immunoblot

* 25
Anti-FLAG (GR)

* 25
Immunoblot

Fig. 3 G4C2 RAN translation of all reading frames initiates at the same near-cognate CUG start codon in RRL. (a) Scheme of the pre-initiation complex loaded at the 5′cap and the 40S ribosomal subunit ready to scan toward the start codon. RAN translation occurs in absence of an AUG codon. (b) Schemes of the transcripts #4 to #11 showing mutations in the 5′ ﬂanking sequence of (G4C2)66 used in (c-f). Construct #4 contains the native sequence of 113 nucleotides upstream of the G4C2 repeat. Construct #5 has a CUG > CCG mutation (blue nucleotide) in a near-cognate start codon located in a perfect Kozak sequence 24 nucleotides upstream of the repeat. Construct #6 has CUG > CCG and GAG > GGG (blue nucleotides) mutations in two potential start codons located 24 and 13 nucleotides upstream of the repeat, respectively. Construct #7 contains a GAG > GGG mutation in a potential near-cognate start codon located 13 nucleotides upstream of the repeat. Constructs #8 and #9 harbor a deletion leaving 33 nucleotides (including CUG and GAG codons) and
eight nucleotides (deleting both CUG and GAG codons) upstream of the repeat, respectively. Construct #10 has a CUG > AUG mutation in the nearcognate start codon. Construct #11 has GCUCUGG > UCUCUGC mutations in the Kozak sequence. (c–f) Translation was performed in presence of [35S]methionine using each RNA variant separately (#4 to #11). (c and e) [35S]-methionine RAN translation products were detected by autoradiography.
(d and f) Poly-GA, poly-GP, and poly-GR were detected by immunoblot using antibodies against HA-tag, His-tag, and FLAG-tag, respectively. Asterisk
indicates unspeciﬁc proteins translated in the RRL system

concentrations of cap analog, but not ApppG, lead to eIF4E titration thereby affecting the efﬁciency of eIF4F-dependent translation. The levels of [35S]-methionine-DPRs (Fig. 2e, Supplementary Fig. 4c) and poly-GA accumulation (Fig. 2f, Supplementary Fig. 4d) were reduced by increased concentrations of cap analog, demonstrating the role of the canonical initiation factor eIF4E in C9ORF72 RAN translation.
G4C2 translation initiates at a near-cognate CUG start codon. We next sought to identify the codon(s) used to initiate translation of C9ORF72 transcripts (Fig. 3a). The presence of a single band on SDS-PAGE for the different DPR products (Fig. 1), corroborated by [35S]-methionine labeling (Fig. 2), suggests that the translation of G4C2 starts at a speciﬁc position. In addition, in vitro RAN translation products obtained from 66 repeats had the same estimated molecular weight of 20.5 kDa in all three frames (Fig. 1a–c) suggesting that translation in the different frames is initiated from a single or neighboring start codons.
A candidate start site is a near-cognate CUG codon located 24 nucleotides upstream of the repeats in the +1 frame and embedded in a perfect Kozak sequence43 (G/A in −3 and G in +4) (Supplementary Fig. 1, Fig. 3b, and Supplementary Table 1). Site-directed mutagenesis of this codon from CUG to CCG was sufﬁcient to abolish the production of [35S]-methionine labeled DPR proteins in RRL, demonstrating that this CUG is used as

start codon to translate C9ORF72 G4C2 repeats (Fig. 3b, c; construct #4 vs. #5). In contrast, a point mutation from GAG to GGG in another putative start site located 13 nucleotides upstream of the repeats in the +2 frame only slightly reduced the level of [35S]-methionine DPR proteins (Fig. 3b, c; construct #4 vs. #7). Transcripts containing mutations at both putative start codons conﬁrmed the necessity of the CUG codon to initiate RAN translation of the C9ORF72 repeat (Fig. 3b, c; construct #4 vs. #6). This was further corroborated by using constructs with 5′ truncations either preserving the near cognate CUG codon (#8) or deleting the entire region (#9) (Fig. 3b, c; construct #8 vs. #9). Importantly, immunoblot analyses revealed that syntheses of all three DPRs, poly-GA, poly-GP, and poly-GR, are equally disabled by the CUG mutation located 24 nucleotides upstream of the repeats (Fig. 3d; construct #4 vs. #5). In contrast, mutation of the GAG codon located 13 nucleotides upstream of the repeats reduced the levels of the three DPRs without abolishing their production (Fig. 3d; construct #4 vs. #7). Site-directed mutagenesis of the near cognate CUG codon to a canonical start codon AUG increases the incorporation of [35S]-methionine in DPR products (Fig. 3b, e; construct #10 vs. #4) and concomitantly the level of DPRs from all three frames, poly-GA, poly-GP, and polyGR (Fig. 3b, f; construct #10 vs. #4). Interestingly, mutating the Kozak sequence inhibits the production of DPR proteins detected by [35S]-methionine autoradiography (Fig. 3b, e; construct #11 vs.

NATURE COMMUNICATIONS | (2018)9:152

| DOI: 10.1038/s41467-017-02643-5 | www.nature.com/naturecommunications

5

ARTICLE

NATURE COMMUNICATIONS | DOI: 10.1038/s41467-017-02643-5

#4), as well as immunoblots for poly-GA, poly-GP, and poly-GR

(Fig. 3b, f; construct #11 vs. #4). This striking result demonstrates

that RAN translation producing DPR proteins from the three

frames starts at the same CUG codon, and implies that

production of poly-GP and poly-GR requires frameshifting

events, +1 and −1, respectively. An additional smaller poly-GA

product was translated from construct #7 suggesting that

mutation of GAG to GGG induces another translation initiation

event further downstream in frame +1 that is less efﬁcient than

initiation at CUG. The frameshifting necessary to produce +2

(poly-GP) and +3 (poly-GR) DPR proteins might explain the

yield of DPR productions observed in Fig. 1 and patient tissues

(Supplementary Fig. 3). Indeed, poly-GA translated from the +1

frame is the predominant DPR protein, poly-GP and poly-GR

require one frameshifting event (−1 or + 1) and are therefore

signiﬁcantly less produced.

The crucial role of the CUG translation initiation codon

located 24 nucleotides upstream of the C9ORF72 repeat was

further conﬁrmed in vivo by expressing 66 repeats with either a

CUG codon (construct #4) or its mutated CCG version (construct #5) in human neural progenitor cells (ReNcell VM)44, mouse

motor neuron-like cells (NSC-34), and human embryonic kidney

293T cells (HEK293T) (Fig. 4). Immunoblots using antibodies

that recognize each DPR protein identiﬁed products at a

comparable molecular weight in the three cell types and RRL

demonstrating similar RAN translation of the wild-type construct

(#4) in all systems (Figs. 3 and 4). RAN translation of poly-GA

and poly-GP was abolished by mutation of the CUG codon in

human neural progenitors (Fig. 4b, c, Supplementary Fig. 5a, b;

construct #4 vs. #5) and motor neuron-like cells (Fig. 4d, e;

construct #4 vs. #5), conﬁrming results observed in RRL (Fig. 3;

construct #4 vs. #5). RAN translation of poly-GR could not be

detected with any of the constructs in these cell lines. As shown in

RRL experiments (Fig. 3), G4C2 RAN translation in the poly-GA +1 frame and the poly-GR +3 frame was also abolished by

mutation of the CUG codon in HEK293T cells (Fig. 4f, g,

Supplementary Fig. 5d; construct #4 vs. #5). However, in contrast

to RRL and the two neuronal models, mutating the CUG codon

did not inhibit production of poly-GP in HEK293T cells but

instead induced a 20% increase detected by antibodies recogniz-

ing either poly-GP (Fig. 4f, g; Supplementary Fig. 5e; construct #4

vs. #5) or the HIS tag (Supplementary Fig. 5c). This observation

supports that poly-GP translation from an alternative start site

may be inﬂuenced in HEK293T by additional trans-acting factor

(s) that are absent in RRL, motor neuron-like NSC-34 and neural

progenitor cells. Overall, these results identify a mechanism

wrehqeurierescaMp-edt-etpReNndAeMnettittroaninsliatitaiotentroafnsthlaetioCn9OinRaFll7r2eaGd4iCng2

repeat frames

at a near-cognate CUG codon located upstream of the expansion.

An uORF represses G4C2 translation. Recently, Niblock et al. identiﬁed poly-adenylated C9ORF72 RNA species that retain the
repeat-containing intron 1 and in which downstream exons are correctly spliced35. This ﬁnding opens the possibility that G4C2 RAN translation occurs from a C9ORF72 mRNA variant with an enlarged 5′-untranslated region containing the G4C2 repeats. Notably, retention of intron 1 creates a potential uORF with 55 codons ﬂanked by an AUG start codon and two consecutive stop
codons (UGA and UAA) (Supplementary Fig. 1 and Table 1).
Emerging evidence suggests that the presence of uORF may regulate the expression of downstream ORF25,45,46. Indeed, translation of uORFs located in the 5′UTRs of transcripts often
inhibits translation of the downstream ORF likely by reducing its accessibility to the preinitiation complex47,48. Hence, we tested
whether the uORF created by the retention of intron 1 in

C9ORF72 transcripts may inﬂuence RAN translation of DPR proteins (Fig. 5a). We generated a construct with 66 repeats and the entire 5′ end sequence of C9ORF72 starting with exon 1A (Fig. 5b, Supplementary Fig. 1; construct #1). The uORF was found to strongly repress RAN translation in all frames of C9ORF72 repeat. Indeed, [35S]-methionine labeled DPR proteins were not detected in presence of the uORF (Fig. 5c, construct #1 vs. #4). Immunoblot analysis conﬁrmed the inﬂuence of the uORF with a severe reduction of poly-GA (+1 frame) levels and non-detectable poly-GP (+2) and poly-GR (+3) products (Fig. 5d; construct #1 vs. #4). Mutation of the uORF AUG start codon into CGG (Fig. 5b, construct #2) restored G4C2 RAN translation from all reading frames conﬁrming its role in repressing RAN translation (Fig. 5c, d; construct #2 vs. #1). Overall, these ﬁndings strongly support that RAN translation operates through a 5′–3′ scanning mechanism and is regulated by an uORF in C9ORF72 transcripts that retain intron 1.

5′–3′ scanning-dependent mechanism for G4C2 translation. To further demonstrate that RAN translation uses a canonical 5′–3′

scanning mechanism we investigated whether the eIF4A, an RNA

helicase required for efﬁcient scanning during translation initia-

teiIoFn4,Ai-sspinecvioﬁlvceidnhinibiGto4rC492,

RAN translation (Fig. 6a–e). FL3, an was found to reduce RAN translation

in RRL as demonstrated by the levels of [35S]-methionine labeled

DPR proteins (Supplementary Fig. 6a,b; construct #4) and poly-

GA (Fig. 6b, Supplementary Fig. 6c; construct #4) generated from

two different concentrations of expanded RNAs. Consistently,

FL3 treatment signiﬁcantly reduced the levels of poly-GA, poly-

GP, and poly-GR in HEK293T (Fig. 6c, d) without affecting the

level of the repeat-containing transcripts (Fig. 6e). To conﬁrm the

role of eIFs and a 5′–3′scanning mechanism in RAN translation,

we used a longer transcript that includes exon 1a and the entire

intronic region upstream of the C9ORF72 repeat with a AUG >

CGG mutation in the uORF start codon (Supplementary Fig. 6a;

construct #2). Consistent with our previous results, production of

DPR proteins was partially restored in presence of the mutated

uORF, but was strongly inhibited after treatment with FL3

(Supplementary Fig. 6a–c; construct #2). Another important

component for scanning is the platform eIF4G, which links the

cap binding factor eIF4E with the small ribosomal subunit

(Fig. 6a). To investigate whether eIF4G is required for G4C2 RAN translation we used 4EIRCat, an inhibitor that prevents the direct interaction between eIF4E and eIF4G50. Consistently, synthesis of

poly-GA from two different RNA concentrations was also

reduced by 4EIRCat (Fig. 6b). Finally, we found that both edeine

and cycloheximide completely inhibited the RAN translation

from all three reading frames (Fig. 6b, Supplementary Fig. 6b–d).

Edeine is a translation inhibitor that prevents the interaction of Met-tRNAMeti anticodon with the start codon in the P site of the ribosome (Fig. 6a). Cycloheximide binds between the E site and

P site of the ribosome and thereby blocks translocation to the next codon (Fig. 6a)51. The profound effect of these inhibitors on

RAN translation is consistent with our previous results showing

that G4C2 mechanism

RAN translation uses a canonical translation and initiates at a CUG codon with Met-tRNAMeti

anticodon interaction in the P site of the ribosome (Figs. 2–4).

Overall, the effect of speciﬁc translation inhibitors on the pro-

duction of DPR proteins demonstrate that G4C2 RAN translation requires eIF4F components (4E, 4G and 4A) to promote efﬁcient

cap-dependent 5′–3′ scanning.

Inhibition of G4C2 translation by RNA ASOs. We previously showed that DNA ASOs targeting sense strand G4C2-containing transcripts mediate their cleavage through action of the primarily

6

NATURE COMMUNICATIONS | (2018)9:152

| DOI: 10.1038/s41467-017-02643-5 | www.nature.com/naturecommunications

NATURE COMMUNICATIONS | DOI: 10.1038/s41467-017-02643-5

ARTICLE

a
Constructs
#4 86 nt GCU CUG G –3 +4

HA His FLAG
(GGGGCC)
66
+1 +2 +3 (GA) (GP) (GR)

#5

86 nt GCU CCG G –3 +4

bc

Neural progenitor cells

Neural progenitor cells

CUG 4
GA

CCG 5

22

1.5 1.3 1.1

*** ***

HA His FLAG (GGGGCC)
66
+1 +2 +3 (GA) (GP) (GR)
GA GP

DPR level (normalized to β-Actin)

GP β-Actin

Immunoblot

22 46

0.9
0.6
0.3
0 CUG

CCG

de

Motor neuron-like cell (NSC-34)

Motor neuron-like cell (NSC-34)

CUG

CCG

45

GA 22

1.5 1.3 1.1

*** ***

GA GP

DPR level (normalized to HSP90)

GP HSP90

Immunoblot

22 90

0.9 0.6 0.3

f
HEK293T CUG 4
GA
GP
GR

0 CUG

CCG

CCG 5

–

g
kDa

HEK293T

22 1.5

22 1.3
1.1 22
0.9

***
** ***

GA GP GR

DPR level (normalized to GFP)

GFP HSP90

Immunoblot

32 0.6 100 0.3
0

CUG

CCG

Fig. 4 Poly-GA, poly-GP, and poly-GR RAN translation initiate at the near-cognate CUG start codon in cells. (a) Schematic representations of constructs #4
and #5 containing the near cognate start codon CUG or mutant CCG upstream of (G4C2)66 repeats. These constructs are driven by a CMV early enhancer/chicken β actin (CAG) promoter. Human neural progenitor cells (b–c), mouse motor neuron like cells (NSC-34) (d–e) and human HEK293T cells (f–g) were co-transfected with the constructs #4 or #5 along with a GFP plasmid reporter. GFP, Hsp90 or β-Actin proteins were analyzed by immunoblot to control for the transfection efﬁciency and protein loading. Poly-GA, poly-GP, and poly-GR proteins were identiﬁed by immunoblot using antibodies raised against poly-GA, poly-GP, and poly-GR. Levels of the different DPR proteins were quantiﬁed and normalized to GFP, HSP90 or β-Actin. Error bars indicate SEM of three independent transfections. Student’s t- test, ** and *** indicate p < 0.01 and p < 0.001, respectively

nuclear enzyme RNase H, reducing the level of RNA foci and
DPR proteins in a C9ORF72 transgenic mouse model and patient ﬁbroblasts7,52. To determine whether RNA ASOs targeting the 5′ ﬂanking G4C2 sequence can block the scanning of ribosomes and inhibit RAN translation without inducing RNAse-H-dependent
degradation, we generated ASOs selectively targeting the region
upstream of the repeats and tested their potency in inhibiting G4C2 RAN translation in RRL system (Fig. 6f–h). One RNA

C9ORF72 ASO (RNA-ASO1) was complementary to a sequence that overlaps the near-cognate CUG codon, and two ASOs (RNAASO2, RNA-ASO3) were chosen to cover sequences located at 41 and 82 nucleotides distal from the repeats, respectively (Fig. 6f). Corresponding RNA sense oligonucleotides (RNA-SOs) were used as controls (RNA-SO1, RNA-SO2, and RNA-SO3, Fig. 6f). All three RNA-ASOs induced a dose-dependent reduction of DPR proteins produced from the capped G4C2 66 repeats RNAs

NATURE COMMUNICATIONS | (2018)9:152

| DOI: 10.1038/s41467-017-02643-5 | www.nature.com/naturecommunications

7

ARTICLE

NATURE COMMUNICATIONS | DOI: 10.1038/s41467-017-02643-5

a
Regulation by a short upstream open reading frame
Enhancing or inhibiting RAN translation

c
35S-Met DPRs

AUG CGG ΔAUG 1 2 4 kDa
25

Cap

AUG uORF UGA 80S

b
Constructs

Cap #1

1a AUG uORF UGA
171 nt 55 nt

? CUG

(GGGGCC)66

d

35S-Met autoradiography

124

kDa

1b HA His FLAG

Anti-HA (GA)

25

66

#2 CGG uORF UGA

66

Anti-His (GP)

25

#4

113 nt

66

+1 +2 +3 (GA) (GP) (GR)

Anti-FLAG (GR)

25

Immunoblot
Fig. 5 RAN translation of G4C2 repeats is down-regulated by a short upstream open reading frame (uORF). (a) Retention of intron 1 in C9ORF72 repeatcontaining transcripts creates an uORF located 226 nucleotides upstream of the start CUG codon. This uORF may inhibit or enhance G4C2 RAN translation. (b) To interrogate the regulation of RAN translation by this uORF, RNAs harboring the 5′ full-length sequence including C9ORF72 exon 1A (#1) and a AUG > CCG mutation in the uORF start codon (#2) were compared to RAN translation from RNA without the uORF (#4). Black boxes represent exons 1a and 1b and the gray box represents the uORF overlapping exon 1a and intron 1. (c) Translation in RRL system was performed in presence of [35S]methionine and capped RNA (#1, #2, or #4) followed by detection of [35S]-methionine proteins by autoradiography. (d) Poly-GA, poly-GP, and poly-GR were detected by immunoblots using antibodies against HA-tag, His-tag, and FLAG-tag, respectively

as measured by [35S]-methionine-labeling (Fig. 6g) and immu-
noblot (Fig. 6h). In contrast, SO controls did not affect the levels
of DPR proteins. These results demonstrate that RNA ASOs targeting the 5′ ﬂanking G4C2 sequence are sufﬁcient to block RAN translation independently of C9ORF72 RNAs degradation and identify the 5′–3′ scanning of ribosomes as a potential ther-
apeutic target in C9ORF72 ALS/FTD.

G4C2 RNAs bind ribosomes independently from translation. To assess ribosome loading onto (G4C2)exp RNAs, we performed sucrose gradient analysis with radiolabeled capped RNAs con-

taining either 30 or 66 repeats. As a control for canonical

translation we used radiolabelled capped human β-globin mRNA.

Radiolabeled capped RNAs with 66 antisense C4G2 repeats were also used as control for RAN translation (Fig. 7a). Sucrose gra-

dient analysis with 30 and 66 G4C2 repeat transcripts showed that RNA-containing repeats are mainly associated with polysomes

(Fig. 7a, b green graph, Supplementary Fig. 6e orange graph;

heavy fractions 0–20). Only a small proportion of RNAs was free

(RNP; ribonucleoproteins), associated with the ribosomal small

subunit in complex with initiation factors (48S) or with mono-

somes (80S), which is consistent with active RAN translation

(Fig. 1). Since transcripts containing expanded repeats, including

G4C2 RNAs, were recently transition and form gel-like

shown to structures

undergo abnormal phase in vitro53, we determined

whether the presence of radiolabeled G4C2 RNAs in the heavy fractions could be due to self-aggregation rather than association

with polyribosomes. Against this possibility, G4C2-free RNAs remained in the light fractions of sucrose gradients strongly

supporting that expanded RNAs associate with polyribosomes in

RRL. Contrary to the sense (G4C2)66 RNAs, transcripts

containing the antisense (C4G2)66 repeat sedimented mainly in the light fractions or were associated to monosomes, consistent with a low translation efﬁciency of the antisense transcripts (Fig. 7a, b; blue graph)40. Unexpectedly, treatment with edeine, that blocks the translation (Fig. 6b) and lead to the accumulation of β-globin mRNA in the light fractions (Fig. 7c, Supplementary Fig. 6f; light fractions 20–40, red graphs), did not prevent loading
of polysomes on transcripts with 66 or 30 G4C2 repeats (Fig. 7c, Supplementary Fig. 6f; heavy fractions 0–20, green and orange
graphs). The same abnormal sedimentation of G4C2 transcripts in heavy fractions was observed after treatment with GMP-PNP, a
non-hydrolysable GTP analog that normally leads to the accumulation of the transcripts in the fraction corresponding to the
48S particles, showing that G4C2 RNAs can recruit ribosomes in a translation-independent manner (Supplementary Fig. 6g). As
expected, blocking ribosomal translocation with cycloheximide induced the accumulation of the control β-globin mRNAs in the
fraction corresponding to monosomes 80S that are prevented from translocating after assembly (Fig. 7d, Supplementary Fig. 6h;
red graphs). In contrast, inhibiting RAN translation with cycloheximide (Fig. 6b, Supplementary Fig. 6b–d) did not prevent
ribosomal loading on expanded transcripts with 30 or 66 repeats (Fig. 7d, Supplementary Fig. 6h; heavy fractions 0–20, green and
orange graphs). As expected the 80S peak was slightly increased consistent with a small proportion of expanded G4C2 RNAs being associated with monosomes after cycloheximide treatment, but most transcripts remained present in the heavy fractions despite
cycloheximide blockage of translation. Notably, radiolabeled (G4C2)66 transcripts were more abundant in heavy fractions when they were folded in presence of K+ ions that stabilize Gquadruplex structures, comparatively to Na+ and Li+ ions (Supplementary Fig. 6i). Finally, to conﬁrm that G4C2 RNAs recruit

8

NATURE COMMUNICATIONS | (2018)9:152

| DOI: 10.1038/s41467-017-02643-5 | www.nature.com/naturecommunications

NATURE COMMUNICATIONS | DOI: 10.1038/s41467-017-02643-5

ARTICLE

a
Canonical 5′-3′ scanning-dependent mechanism

Met FL3

Met

4EIRCat 4E

4G UAC

4A

40S

Edeine
U AC C UG 80S

CHX (GGGGCC)66

b
RRL
Control [RNA] Anti-HA (GA)

Capped 66 repeats

CHX

FL3 4EIRCat Edeine
kDa 25

Immunoblot

c
GA GP GR HSP90

HEK293T

d

66 repeats

DMSO FL3 kDa 25

25

25

90 Immunoblot

HEK293T

e

1.2 ***

***

***

1

0.8

0.6

0.4

0.2

0 DMSO FL3 DMSO FL3 DMSO FL3

GA GP GR

HEK293T
1.2 ns 1
0.8 0.6 0.4 0.2
0

DPR level (normalized to HSP90)
66 repeats RNA (normalized to Rplp0) DMSO FL3

f RNA ASOs targeting the 5′ flanking sequence of G4C2

Construct Cap
#4

RNA-SO1 RNA-ASO1

RNA-SO2 RNA-ASO2

RNA-SO3

80S

RNA-ASO3

HA His FLAG
66
+1 +2 +3 (GA) (GP) (GR)

g
[Oligo] – ASO1 SO1 ASO2 SO2 ASO3 SO3

h
– ASO1 SO1 ASO2 SO2 ASO3 SO3 [Oligo]

35S-Met DPRs

35S-Met autoradiography

kDa Anti-HA 25 (GA)
Anti-FLAG (GR)

Immunoblot

kDa 25
25

Fig. 6 Inhibition of RAN translation by eIFs inhibitors and RNA ASOs support a 5′–3′ scanning-dependent mechanism. (a) Illustration of translation inhibitors used to delineate the recruitment of the ribosome at the CUG start codon: 4EIRCat prevents the interaction between eIF4E (4E) and eIF4G (4G). FL3 inhibits RNA helicase eIF4A (4A). Edeine blocks the codon–anticodon interaction. Cycloheximide (CHX) blocks the translational elongation. (b) Translation was performed in presence of CHX, FL3, 4EIRCat, or Edeine in RRL followed by immunoblot detection of anti-HA (poly-GA) antibody. (c–e) HEK293T cells were transfected with the construct #4 expressing 66 G4C2 repeats and treated with FL3 or DMSO control. (c) Immunoblots using antibodies against poly-GA, poly-GP, poly-GR, and HSP-90 proteins. (d) Levels of poly-GA, poly-GP, and poly-GR after FL3 treatment were quantiﬁed and normalized to HSP90 and DMSO-treated cells. Graphs represent mean ± SEM, n = 5. Student’s t-test, *** indicate p < 0.001. (e) Levels of repeatcontaining transcripts determined by quantitative RT-PCR and normalized to the Rplp0 transcripts and DMSO treated cells. (f) Schematic representations of construct #4 with sequences of sense (RNA-SO) and antisense (RNA-ASO) RNA oligonucleotides used to inhibit RAN translation. (g–h) Translation of capped (G4C2)66 RNAs (construct #4) was performed in RRL in presence of two concentrations of sense or antisense RNA oligonucleotides. (g) [35S]methionine RAN translation products were detected by autoradiography. (h) Poly-GA and poly-GR were detected by immunoblot using anti-HA (Poly-GA)
and -FLAG (Poly-GR) antibodies, respectively

the ribosome independently from DPR translation, we performed sucrose gradient analysis with puriﬁed ribosomal 40S and 60S.
Expanded transcripts with 30 repeats were able to recruit and load several 40S and 60S ribosomal subunits without the need of 5′-cap and any other initiation factors (Fig. 7e). Overall, we demonstrate here that G4C2 repeat-containing transcripts associate with ribosomal subunits independently of translational factors.

Discussion
G4C2 hexanucleotide expansions in the C9ORF72 gene were recently discovered as the major genetic cause of ALS and FTD, two fatal neurodegenerative disorders1,2. Emerging evidence supports pathogenic RNA gain-of-function mechanisms, where expanded G4C2 transcripts form RNA foci sequestering RNAbinding proteins in the nuclei or undergo RAN translation to produce toxic DPR proteins4. We developed robust assays to

NATURE COMMUNICATIONS | (2018)9:152

| DOI: 10.1038/s41467-017-02643-5 | www.nature.com/naturecommunications

9

ARTICLE

NATURE COMMUNICATIONS | DOI: 10.1038/s41467-017-02643-5

a
Radioactive RNA profiles on sucrose gradients
Capped β-globin mRNA + RRL (G4C2)30 RNA + RRL (G4C2)66 Free RNA (G4C2)66 RNA + RRL (C4G2)66 antisense RNA + RRL

Uncapped (G4C2)30 RNA

HA His FLAG
#3 30
+1 +2 +3 (GA) (GP) (GR)

Capped (G4C2)66 RNA

Cap #4

HA His FLAG
66 +1 +2 +3 (GA) (GP) (GR)

Capped β-globin mRNA

Cap β-globin

Capped antisense (C4G2)66 RNA

Cap

FLAG HA

66
+1 +2 +3 (PR) (GP) (PA)

d
Cycloheximide (CHX)
Capped β−globin mRNA

b
No inhibitor
Capped β−globin mRNA canonical translation

Capped (G4C2)66 RNA RAN translation

Total RNA (%)

20 18 Polysomes 80S 48S RNP

16

14

12

10

8

6

4

2

0

0 10 20 30 40

Bottom

Fractions

Top

c

20 18 Polysomes 80S 48S RNP 16 14 12 10
8 6 4 2
0 0 10 20 30 40

Total RNA (%)

Edeine Capped β−globin mRNA

20 18 16 14 12 10
8 6 4 2 0
0 10
Bottom

43S RNP

20

18

16

14

12

10

8

6

4

2

0

20 30 40

0

Fractions

Top

Capped (G4C2)66 RNA 43S RNP
10 20 30 40

Capped (G4C2)66 RNA

e
Purified 40S
Uncapped (G4C2)30 RNA

Total RNA (%)

Total RNA (%)

Capped antisense (C4G2)66 RNA RAN translation RNP
20 18 Polysomes 80S 48S 16 14 12 10
8 6 4 2 0
0 10 20 30 40
Capped antisense (C4G2)66 RNA
RNP 20 18 43S 16 14 12 10
8 6 4 2 0
0 10 20 30 40
Purified 60S
Uncapped (G4C2)30 RNA

Total RNA (%)

20 18 16 Polysomes 14 12 10
8 6 4 2 0
0 10 20

80S 48S RNP
30 40

20

18 80S

16 Polysomes

48S RNP

14

12

10

8

6

4

2

0 0 10 20 30 40

Bottom Fractions

Top

Total RNA (%)

20 18 16 14

(40S)3 (40S)2 40S Free RNA

20 18

(60S)4(60S)3(60S)2 60S

16

14

Free RNA

12 12

10 10

88

66

44

22

00

0 10 20 30 40

0 10 20 30 40

Bottom Fractions

Top

Fig. 7 G4C2 containing transcripts have intrinsic ribosome binding capacity independently of their translation. (a) Scheme of the capped (G4C2)66 RNA (#4) and uncapped (G4C2)30 transcripts (#3) used for translation in RRL and polyribosome fractionation on sucrose gradients. As controls, capped βglobin and capped (C2G4)66 antisense repeat RNAs were used in the same system. (b–d) Radiolabeled capped (G4C2)66 RNA proﬁle by polyribosome fractionation in RRL comparatively to capped β-globin mRNA and (C2G4)66 antisense RNAs. Fractionation on sucrose gradients was performed without inhibitor (b), in presence of Edeine (c) or CHX (d). (e) Sucrose gradient fractionation of radiolabeled uncapped (G4C2)30 transcripts (#3) was performed in presence of puriﬁed 40S or 60S ribosomal subunits

study RAN translation and determine speciﬁc cis-requirements
and trans-requirements for expanded G4C2 translation. G4C2 RAN translation was found to share many aspects with canonical translation initiation, including the requirement of a 5′ cap structure, methionylated initiator tRNAMet, and the recruitment
of the 40S subunit by the eIF4F complex (eIF4A, E, and G) to begin scanning toward the start codon (Fig. 8a, b). These ﬁndings
are consistent with mechanisms involved in RAN translation of CGG triplet repeats in the fragile X FMR1 gene which also depends on a cap-dependent scanning mechanism15,33,54. Since eIF4F’s functions were shown to be critical in dysregulation
of the translational machinery in cancers, major efforts have been undertaken to develop speciﬁc compounds directed against its components for therapeutic purposes55. Our work highlights the importance of eIF4F in ALS/FTD pathogenesis, thereby opening

the potential for new therapeutic strategies using existing eIF4F inhibitors to mitigate the effects of this neurodegenerative disease.
Ribosome proﬁling on higher eukaryotes showed that translation occurs on numerous ORFs without an AUG-initiator but operates with near-cognate start codons (CUG > GUG > UUG > ACG > others)56,57. We discovered that the CUG codon located 24 nucleotides upstream of the G4C2 repeat, in the +1 (GA) frame and in an optimal Kozak sequence, is utilized as a start codon to produce DPR proteins. Mutations of this CUG codon or the Kozak sequence abolish production of all three DPR proteins in RRL supporting a frameshifting model where the ribosome starts at the CUG and slips to translate GP (+2) and GR (+3) (Fig. 8b). As in RRL, RAN translation in all three frames was affected by mutation of the CUG codon in human neural progenitor, mouse motor neuronal cells and HEK293T cells. However, while poly-

10

NATURE COMMUNICATIONS | (2018)9:152

| DOI: 10.1038/s41467-017-02643-5 | www.nature.com/naturecommunications

NATURE COMMUNICATIONS | DOI: 10.1038/s41467-017-02643-5

ARTICLE

a
Canonical translation initiation
Met

Met

Scanning 4G 4A
UAC 4E
40S

U AC A UG
80S

b
RAN translation starting at the near-cognate CUG codon in +1 frame

Met Met

Scanning

4G 4A UAC
4E
40S

UAC CUG
80S

c
Translation of an uORF in C9ORF72 retained intron 1 inhibits RAN translation

Met

ORF

3′

RAN translation products

+1 M-

exp

Frameshifting to produce GP & GR

+2 M

----

exp

80S +3 M

----

exp

exp 3′

Scanning

4G 4A
UAC 4E Exon1a
40S

AUG

uORF UGA 80S

CUG 80S

exp 3′

d
G4C2-containing transcripts associate with ribosomal subunits independently of translation

60S 60S

60S

60S

40S 5′

40S 40S

exp 3′

Fig. 8 Model of translation mechanisms associated with G4C2 expansions in C9ORF72 ALS/FTD. (a) Pre-Initiation ribosomal complex (PIC) assembles on the 5′ cap of mRNA by interacting with eIF4F complex formed by the cap binding factor eIF4E, the platform eIF4G and the RNA helicase eIF4A. The PIC complex scans the 5′ end for an appropriate AUG start codon, where the 60S ribosomal subunit joins the 40S to form a functional 80S ribosome ready to translate the coding sequence. (b) G4C2 RAN translation initiation shares the same pathway as the canonical one to translate poly-GA dipeptides, including the need of 5′ cap, eIF4E, eIF4G, eIF4A, initiator methionyl-tRNA, and the scanning mechanism. However, it initiates on a near-cognate CUG codon embedded in a perfect Kozak sequence, in frame with poly-GA, instead of a canonical AUG start codon. The ability of G4C2 expansions to form stable Gquaduplex structures forces the ribosome to occasionally undergo frameshifting to translate poly-GP and poly-GR in the +2 and +3 frames, respectively. (c) When G4C2 repeats are expanded, a subset of C9ORF72 mRNA is mis-spliced retaining intron 1 with the repeats35. RAN translation from these RNAs is inhibited by a uORF that is translated canonically. (d) G4C2 expanded transcripts associate with ribosomal subunits independently from their translation

GP translation was prevented by mutation of the CUG repeat in RRL and the two neuronal models, poly-GP levels were slightly increased in HEK293T cells supporting a context-dependent regulation that differs between the three frames when the CUG is absent. The presence of speciﬁc RNA helicases might explain the differences on poly-GP translation between the different cell types, such as DDX21 recently shown to unfold RNA Gquadruplex structures in HEK293T58. Notably, an UAG stop codon in phase with the poly-GP frame is present at the beginning of the G4C2 repeats (UAG GGG CC sequence, Supplementary Fig. 1), indicating that the ribosome must initiate in another reading frame and then frameshift to produce poly-GP or directly initiate within the repeat. As we observed a single band on SDS-PAGE with comparable molecular weight between all reading frames and systems used (Figs. 3 and 4), initiation further

downstream inside the repeats is less likely to occur from (G4C2)66 transcripts. When comparing translation efﬁciencies for the three reading frames, poly-GA (+1) is predominant, followed by poly-GP (+2) and poly-GR (+3), which is in agreement with a
frameshifting model. This is also consistent with staining and immunoassay from human post-mortem tissues, where poly-GA
accumulates at higher levels than poly-GP and poly-GR (Supplementary Fig. 3)17,40,59,60.
G4C2 RAN translation initiation is inﬂuenced by repeat length, with different sensitivity among the three reading frames. While RAN translation efﬁciency is only reduced in the +1 poly-GA and +3 poly-GR frames with shorter repeat length, it is completely
abolished for poly-GP at 30 comparatively to 66 repeats (Fig. 1). This repeat length dependence could reﬂect secondary structures,
which differentially affect ribosomal scanning, translation

NATURE COMMUNICATIONS | (2018)9:152

| DOI: 10.1038/s41467-017-02643-5 | www.nature.com/naturecommunications

11

ARTICLE

NATURE COMMUNICATIONS | DOI: 10.1038/s41467-017-02643-5

elongation or force the ribosome to undergo a frameshifting. Indeed, G4C2 expansions can adopt RNA G-quadruplexes28–32, a structure that was recently demonstrated to induce frameshifting during translation61,62. These RNA secondary structures are stable in presence of monovalent cations, in the order of K+ > Na+ > Li+ 63. Thus, variations of ion concentration in the cell or speciﬁc RNA binding proteins58 may modulate the presence of G-
quadruplex structure in G4C2-containing transcripts and could inﬂuence frameshifting or initiation at non-AUG start codon.
Another major ﬁnding is the down-regulation of G4C2 RAN translation by a short uORF. Indeed, in mis-spliced C9ORF72
transcripts that retain intron 1, an uORF is present with an AUG
and two in-frame stop codons located 76 nucleotides upstream of
the G4C2 repeats. Notably, the AUG codon in exon 1A is the only AUG identiﬁed in the 5′end of the mis-spliced RNA. Upstream
ORF are cis-acting elements that regulate the expression of downstream protein coding sequences25,45,46. We demonstrated that mutating the AUG start codon of the uORF is sufﬁcient to
increase G4C2 RAN translation in all three reading frames, conﬁrming that this uORF is efﬁciently used by the ribosome during 5′–3′ scanning and is therefore inhibiting the translation of the
downstream G4C2 repeat (Figs. 5 and 8c). It is noteworthy that translation of synaptic mRNA(s) was shown to be downregulated by uORF(s) located in their 5′UTR, but upregulated upon metabotropic glutamate receptor activation64–66. Thus, it will be
important to determine whether the uORF in mis-spliced C9ORF72 transcripts inﬂuences G4C2 RAN translation level upon synaptic activation or external stimuli in neurons.
Notably, ASOs directing RNase-H-dependent degradation of C9ORF72 transcripts are under therapeutic development5–7,52. The identiﬁcation of sequences upstream of the repeat that inﬂuence RAN translation (CUG near-cognate start codon and
uORF) opens the possibility of using alternative strategies based
on ASOs that modulate translation without reducing transcript levels67,68. In agreement, we demonstrated that several RNA ASOs speciﬁcally targeting the region immediately upstream of the repeats block ribosomal scanning and efﬁciently reduce the level of RAN translation products (Fig. 6f–h).
Finally, we show that G4C2 repeat transcripts unexpectedly associate with ribosomal subunits in a translation independent
manner (Fig. 8d). Indeed, blocking cap initiation factors, codon–anticodon interaction, 80S ribosome assembly and ribo-
somal elongation did not avert the sedimentation of radiolabeled
G4C2 RNAs in the heavy fractions of sucrose gradients (Fig. 7). In addition, removing the 5′cap, shortening the repeat size, or using puriﬁed ribosomal subunits did not prevent the assembly of the
transcript to multiple ribosomal subunits. On the contrary,
antisense transcripts with C4G2 repeat did not associate with the ribosome. This striking ﬁnding supports a RNA gain-of-function
mechanism, independent from RAN translation and DPR pro-
teins accumulation. Ribosomal subunits are assembled in the
nucleolus and exported to the cytoplasm by multiple export receptors69. It will be important to determine whether seques-
tration of ribosomal subunits by expanded repeats and disruption of nucleocytoplasmic transport recently identiﬁed in C9ORF72 disease4 negatively impact overall translation in cells with
C9ORF72 expansions.
Overall, we provide new insights into RAN translation of
C9ORF72 G4C2 repeat which uses a cap-dependent mechanism initiating at a near-cognate CUG codon. A novel mechanism of
toxicity associated to C9ORF72 expansion is supported by the
association of G4C2 transcripts with ribosomal subunits independently of their translation. Importantly, this work identiﬁes sequences upstream of the G4C2 repeats and speciﬁc initiation factors as possible therapeutic targets to inhibit RAN translation
in C9ORF72 ALS/FTD patients.

Methods
Generation of C9ORF72 constructs with G4C2 repeats. To generate the different constructs listed in Supplementary Fig. 1 and Table 1, a plasmid pAG3 containing 66 repeats20,36 was ﬁrst digested with restriction sites BssHII and SacI to isolate the intronic region of human C9ORF72 with (G4C2)66, including 8 bp of 5′, 99 bp of 3′ ﬂanking sequences and three tags in frame with DPR proteins. BssHII is a restriction site naturally present in the human C9ORF72 gene located two nucleotides upstream of the repeats. Second, pUC18 (ThermoFisher, # SD0051) was modiﬁed to contain the three HindIII, BssHII, and SacI restriction sites, enabling the insertion of the digested BssHII/SacI C9ORF72 insert and the addition of any 5′end sequence between the HindIII and BssHII sites. After cloning the C9ORF72 insert in modiﬁed pUC18 with BssHII and SacI, primers listed in Supplementary Table 2 were used to generate the different constructs listed in Supplementary Fig. 1. Primers were designed to add the T7 Promoter for in vitro transcription (construct #9), followed immediately by 113 bp of 5′ ﬂanking G4C2 sequence with CUG > CCG mutation (construct #5), GAG > GGG mutation (construct #6) and double mutations CUG > CCG + GAG > GGG (construct #7). Also, primers were designed to add T7 promoter followed by 320 bp of 5′ sequence (construct #1), 320 bp with AUG > CGG mutations (construct #2) and to generate a short 5′end by adding T7 promoter with 33 bp (construct #8). All primers were designed to harbor the HindIII restriction site at the 5′ end and BssHII site at the 3′ end. After phosphorylation with T4 Polynucleotide Kinase (ThermoFisher, #EK0031) of the primers at the 5′end and hybridization of corresponding forward and reverse primers, the generated inserts were cloned in HindIII-BssHII pUC18 with (G4C2)66 repeats. The original plasmid was modiﬁed to contain T7 promoter by cloning using the HindIII restriction site (construct #4). Construct #3 with 30 G4C2 repeats was generated by expansion retraction during ampliﬁcation of the construct #4 with 66 repeats. Finally, construct #5 containing CUG > CCG mutation was digested with HindIII and NotI to be cloned in pAG3 downstream of the CMV early enhancer/chicken β-actin (CAG) promoter for human cell transfection.
The C4G2 antisense construct used as control in Fig. 7 was cloned by digesting pAG3 containing 66 repeats20,36 with restriction sites BssHII and NotI to isolate the intronic region of human C9ORF72 with (G4C2)66 and cloning it into puc18 harboring T7 promoter in antisense direction. This construct was designed to harbor Flag tag in poly-PR +1 frame and HA tag in +3 poly-PA frame.
In vitro transcription. The different variants of C9ORF72 (G4C2)exp constructs were cloned downstream of T7 promoter in pUC18 as detailed in Supplementary Fig. 1 and Table 1. Vectors were digested by XhoI and used for run-off in vitro transcription with T7 RNA polymerase. Uncapped RNAs were separated on denaturing PAGE (4%) and RNA were recovered from the gel slices by electroelution. The resulting pure RNA transcripts were capped at their 5′ end with the ScriptCap m7G Capping System (Epicenter Biotechnologies).
In vitro translation in RRL. Translation reactions were performed in self-made rabbit reticulocyte lysate system (RRL) as previously described42, without RNase treatment (used in commercially available extracts) that was shown to be detrimental to the translation efﬁciency from extracts, especially for cap-dependent translation70. Brieﬂy, reactions were incubated at 30 °C for 60 min and included 100 and 200 nM of each transcript and 10.8 µCi [35S]Met. Aliquots of translation reactions were analyzed by 15% SDS-PAGE and Western Blots. The cap dependency was analyzed by preincubation of increasing m7GpppG concentrations ranging from 0.5 to 1.5 mM for 5 min at room temperature. The experiments were performed in the presence of MgCl2 at a constant [MgCl2]/[cap analog] ratio of 0.8. For translation in presence of RNA sense (RNA-SO) and antisense (RNA-ASO) oligonucleotides (Supplementary Table 3) were annealed to 100 nM capped 66 repeat RNA (construct #4) in 20 mM Hepes-K (pH 7.6) and 100 mM KC1 for 5 min at 65 °C and 20 min at room temperature with a 10 or 50 fold molar excess of oligonucleotides over construct #4. This annealing mixture was kept on ice before addition to the translation reaction. RRL were incubated 5 min at 30 °C in presence of the different translational inhibitors at the following concentrations: 150 ng mL −1 for the polyI:C, 15 μM for salubrinal, 4.5 mg mL−1 cycloheximide, 10 μM edeine, 15 μM FL3, and 5 μM 4E1RCat.
Sucrose-gradient analysis. For sucrose-gradient analysis, 5′-32P-labeled or 3′-32Plabeled mRNA were incubated in RRL or with puriﬁed 40S and 60S ribosomal subunits, in the presence of speciﬁc inhibitors (Edeine leads to 43S accumulation, GMP-PNP leads to 48S formation, cycloheximide blocks translocation and leads to 80S accumulation) or without inhibitor to assemble polysomes. Translational inhibitors were incubated with RRL 5 min prior to addition of radiolabeled mRNAs. The translation initiation complexes were separated on a 7–47% linear sucrose gradient in buffer (25 mM Tris–HCl [pH 7.4], 50 mM KCl, 5 mM MgCl2, 1 mM DTT). The reactions were loaded on the gradients and spun (23,411×g for 2.5 h at 4 °C) in a SW41 rotor. mRNA sedimentation on sucrose gradients was monitored by Cerenkov counting after fractionation. In Supplementary Fig. 6i, capped (G4C2)66 transcripts were folded in presence of KCl, NaCl or LiCl at 195 mM, by denaturating 1 min at 95 °C, followed by 5 min at 20 and 4 °C until adding the RRL (75 mM ﬁnal ion concentrations).

12

NATURE COMMUNICATIONS | (2018)9:152

| DOI: 10.1038/s41467-017-02643-5 | www.nature.com/naturecommunications

NATURE COMMUNICATIONS | DOI: 10.1038/s41467-017-02643-5

ARTICLE

Cell culture and plasmid transfection. The HEK293T cells were cultured in DMEM 10% (v/v) FBS and penicillin/streptomycin. ReNcell VM human neural progenitors (Millipore; Catalog number SCC008) were maintained in high-glucose DMEM/F12 (ThermoFisher Scientiﬁc) media supplemented with 2 μg mL−1 heparin (StemCell Technologies, #07980), 2% (v/v) B27 neural supplement (ThermoFisher Scientiﬁc, #175004044), 20 μg mL−1 hEGF (Sigma-Aldrich, #E9644), 20 μg mL−1 bFGF (Stemgent, #03-0002) and 1% penicillin/streptomycin (ThermoFisher Scientiﬁc) and were plated onto BD Matrigel (BD Biosciences)coated cell culture ﬂasks with B27, EGF, FGF, and heparin on precoated Matrigel dishes. The NSC-34 cells (CELLutions Biosystems Inc; Catalog number—CLU140) were grown in DMEM supplemented with 10% FBS, 100 U mL−1 penicillin, and 100 μg mL−1 streptomycin at 37 °C in a humidiﬁed atmosphere of 5% CO2. HEK293T were plated 24 h prior transfection with different C9ORF72 (G4C2)66 expansion constructs (Supplementary Fig.1 and Table 1) and a reporter pGFPmax (Lonza) expressing GFP using a contruct:pGFPmax ratio of 5:1. The lipofectamine 2000 reagent was used according to manufacturer instruction (Invitrogen) for HEK293T and NSC-34 transfections. Nucleofection using Nucleofector kit (Lonza, #VPG 1005) was used for neural progenitor cell to achieve high efﬁciency of transfection of plasmids. Twenty-four hours after transfection, the cells were washed with PBS 1X and collected for RNA and protein extractions.
FL3 treatment in cells. HEK293T were cultured 24 h prior treatment into 10 cm dish, following by transfection with lipofectamine 2000 of construct #4 as described previously. After 4 h of incubation in the transfection reagents, cells were treated with 10 μM FL3 for 24 h and collected for immunoblot analysis
Immunoblotting. The cell pellets were re-suspended in 400 μl of 2X Laemmli sample buffer (Biorad #1610737). The proteins were homogenized with pestle, then denatured at 95 °C for 10 min. The total protein extract was separated on gradient 4–20% SDS-PAGE gels and 18% SDS-PAGE gels, transferred onto PVDF membranes, blocked with 5% (v/v) non fat dry milk (NFM) in Tris–buffered saline (TBS) pH 7.5. The membranes were incubated with primary antibodies (Supplementary Table 4) overnight at 4 °C in TBS and 5% (v/v) NFM, washed with TBSTween 20 0.1%, incubated with horseradish peroxidase (HRP)-conjugated secondary antibodies (donkey anti-rabbit GE Healthcare Life Sciences #NA934, sheep anti-mouse GE Healthcare Life Sciences #NA931, goat anti-rat abcam #97057), washed with TBS-Tween 20 0.1% and signal was revealed with autoradiographic ﬁlms.
Immunoﬂuorescence. HEK293T cells were cultured on 24-well plates prior transfection with lipofectamine 2000, following the recommendations of supplier. Twenty-four hours after transfection, the cells were ﬁxed in 4% paraformaldehyde and washed twice with PBS. Cells were permeabilized with 0.1% Triton X-100 for 10 min at room temperature. They were washed twice again with PBS and blocked with 1% bovine albumin in PBS for 1 h at room temperature. Cells were incubated at 4 °C for 24 h with primary antibodies anti-GA or anti-GP (Supplementary Table 4) at 1:500 dilution in the blocking solution supplemented with 0.02% Tween-20. Rabbit ﬂuorescently tagged secondary antibody conjugated to Alexa 595 (ThermoFisher Scientiﬁc) was incubated for 1 h at room temperature in the blocking buffer. The nuclei were stained with ProLong™ Gold Antifade Mountant with DAPI (ThermoFisher, # P36935) and mounted on slides for confocal microscopy.
Immunohistochemistry of human brain sections. Parafﬁn sections (8 μm) from the cerebellum were deparafﬁnized with CitriSolv (Thermo Fisher Scientiﬁc, #04355-121) and incubated in 100% EtOH, 90% EtOH, 70% EtOH, 50%, and Milli-Q® water. Sections were incubated in 0.6% H2O2 in methanol at room temperature for 15 min, treated with antigen unmasking solution (Vector Laboratories, #H-3300) in the steam chamber for 45 min, and blocked at room temperature with 1% FBS/ 0.1% Triton X-100/PBS for 25 min. Sections were then incubated at 4 °C overnight with anti-GA rabbit antibody (Rb4334) (1:1000), anti-GR rabbit antibody (Rb4995) (1:1000), or anti-GP rabbit antibody (Rb7633) (1:1000)52 diluted in 1% FBS/PBS. Next, sections were stained with secondary antibody ImmPRESSTM HRP (peroxidase) anti-Rabbit IgG Reagent (Vector Laboratories, #MP-7401) at room temperature for 1 h, developed with VECTOR NovaRED Peroxidase (HRP) Substrate Kit (Vector Laboratories, #SK-4800), treated with hematoxylin stain solution (RICCA, #3530-32) and bluing reagent Scott’s tap water substitute (Leica Biosystems, #3802901), and mounted with Richard-Allan ScientiﬁcTM Mounting Medium (Thermo Fisher Scientiﬁc, #4112).
Data availability. The data that support the ﬁndings of this study are available from the corresponding author upon request. All constructs and reagents generated in this study will be shared upon request.
Received: 28 April 2017 Accepted: 14 December 2017

References
1. DeJesus-Hernandez, M. et al. Expanded GGGGCC hexanucleotide repeat in non-coding region of C9ORF72 causes chromosome 9p-linked frontotemporal dementia and amyotrophic lateral sclerosis. Neuron 72, 245–256 (2011).
2. Renton, A. E. et al. A hexanucleotide repeat expansion in C9ORF72 is the cause of chromosome 9p21-linked ALS-FTD. Neuron 72, 257–268 (2011).
3. Majounie, E. et al. Frequency of the C9orf72 hexanucleotide repeat expansion in patients with amyotrophic lateral sclerosis and frontotemporal dementia: a cross-sectional study. Lancet Neurol. 11, 323–330 (2012).
4. Gitler, A. D. & Tsuiji, H. There has been an awakening: emerging mechanisms of C9orf72 mutations in FTD/ALS. Brain Res. 1647, 19–29 (2016).
5. Donnelly, C. J. et al. RNA toxicity from the ALS/FTD C9ORF72 expansion is mitigated by antisense intervention. Neuron 80, 415–428 (2013).
6. Sareen, D. et al. Targeting RNA foci in iPSC-derived motor neurons from ALS patients with a C9ORF72 repeat expansion. Sci. Transl. Med 5, 208ra149 (2013).
7. Lagier-Tourenne, C. et al. Targeted degradation of sense and antisense C9orf72 RNA foci as therapy for ALS and frontotemporal degeneration. Proc. Natl Acad. Sci. USA 110, E4530–E4539 (2013).
8. Mizielinska, S. et al. C9orf72 frontotemporal lobar degeneration is characterised by frequent neuronal sense and antisense RNA foci. Acta Neuropathol. 126, 845–857 (2013).
9. Mori, K. et al. hnRNP A3 binds to GGGGCC repeats and is a constituent of p62-positive/TDP43-negative inclusions in the hippocampus of patients with C9orf72 mutations. Acta Neuropathol. 125, 413–423 (2013).
10. Zu, T. et al. RAN proteins and RNA foci from antisense transcripts in C9ORF72 ALS and frontotemporal dementia. Proc. Natl Acad. Sci. USA 110, E4968–E4977 (2013).
11. Zu, T. et al. Non-ATG-initiated translation directed by microsatellite expansions. Proc. Natl Acad. Sci. USA 108, 260–265 (2011).
12. Cleary, J. D. & Ranum, L. P. New developments in RAN translation: insights from multiple diseases. Curr. Opin. Genet. Dev. 44, 125–134 (2017).
13. Zu, T. et al. RAN translation regulated by muscleblind proteins in myotonic dystrophy type 2. Neuron 95, 1292–1305.e1295 (2017).
14. Banez-Coronel, M. et al. RAN translation in Huntington disease. Neuron 88, 667–677 (2015).
15. Todd, P. K. et al. CGG repeat-associated translation mediates neurodegeneration in fragile X tremor ataxia syndrome. Neuron 78, 440–455 (2013).
16. Ishiguro, T. et al. Regulatory role of RNA chaperone TDP-43 for RNA misfolding and repeat-associated translation in SCA31. Neuron 94, 108–124.e107 (2017).
17. Mori, K. et al. The C9orf72 GGGGCC repeat is translated into aggregating dipeptide-repeat proteins in FTLD/ALS. Science 339, 1335–1338 (2013).
18. Mori, K. et al. Bidirectional transcripts of the expanded C9orf72hexanucleotide repeat are translated into aggregating dipeptide repeat proteins. Acta Neuropathol. 126, 881–893 (2013).
19. Ash, P. E. A. et al. Unconventional translation of C9ORF72 GGGGCC expansion generates insoluble polypeptides speciﬁc to c9FTD/ALS. Neuron 77, 639–646 (2013).
20. Gendron, T. F. et al. Antisense transcripts of the expanded C9ORF72 hexanucleotide repeat form nuclear RNA foci and undergo repeat-associated non-ATG translation in c9FTD/ALS. Acta Neuropathol. 126, 829–844 (2013).
21. Kearse, M. G. & Todd, P. K. Repeat-associated non-AUG translation and its impact in neurodegenerative disease. Neurotherapeutics 11, 721–731 (2014).
22. Mackenzie, I. R. et al. Dipeptide repeat protein pathology in C9ORF72 mutation cases: clinico-pathological correlations. Acta Neuropathol. 126, 859–879 (2013).
23. Mahoney, C. J. et al. Frontotemporal dementia with the C9ORF72 hexanucleotide repeat expansion: clinical, neuroanatomical and neuropathological features. Brain 135, 736–750 (2012).
24. Freibaum, B. D. & Taylor, J. P. The role of dipeptide repeats in C9ORF72related ALS-FTD. Front. Mol. Neurosci. 10, 35 (2017).
25. Hinnebusch, A. G., Ivanov, I. P. & Sonenberg, N. Translational control by 5′untranslated regions of eukaryotic mRNAs. Science 352, 1413–1416 (2016).
26. Filbin, M. E. & Kieft, J. S. Toward a structural understanding of IRES RNA function. Curr. Opin. Struct. Biol. 19, 267–276 (2009).
27. Jackson, R. J., Hellen, C. U. & Pestova, T. V. The mechanism of eukaryotic translation initiation and principles of its regulation. Nat. Rev. Mol. Cell Biol. 11, 113–127 (2010).
28. Grigg, J. C., Shumayrikh, N. & Sen, D. G-quadruplex structures formed by expanded hexanucleotide repeat RNA and DNA from the neurodegenerative disease-linked C9orf72 gene efﬁciently sequester and activate heme. PLoS One 9, e106449 (2014).
29. Fratta, P. et al. C9orf72 hexanucleotide repeat associated with amyotrophic lateral sclerosis and frontotemporal dementia forms RNA G-quadruplexes. Sci. Rep. 2, 1016 (2012).
30. Reddy, K., Zamiri, B., Stanley, S. Y., Macgregor, R. B. Jr & Pearson, C. E. The disease-associated r(GGGGCC)n repeat from the C9orf72 gene forms tract length-dependent uni- and multimolecular RNA G-quadruplex structures. J. Biol. Chem. 288, 9860–9866 (2013).

NATURE COMMUNICATIONS | (2018)9:152

| DOI: 10.1038/s41467-017-02643-5 | www.nature.com/naturecommunications

13

ARTICLE

NATURE COMMUNICATIONS | DOI: 10.1038/s41467-017-02643-5

31. Haeusler, A. R. et al. C9orf72 nucleotide repeat structures initiate molecular cascades of disease. Nature 507, 195–200 (2014).
32. Conlon E.G. et al. The C9ORF72 GGGGCC expansion forms RNA Gquadruplex inclusions and sequesters hnRNP H to disrupt splicing in ALS brains. Elife 5, e17820 (2016).
33. Kearse, M. G. et al. CGG repeat-associated non-AUG translation utilizes a capdependent scanning mechanism of initiation to produce toxic proteins. Mol. Cell 62, 314–322 (2016).
34. Hinnebusch, A. G. The scanning mechanism of eukaryotic translation initiation. Annu. Rev. Biochem. 83, 779–812 (2014).
35. Niblock, M. et al. Retention of hexanucleotide repeat-containing intron in C9orf72 mRNA: implications for the pathogenesis of ALS/FTD. Acta Neuropathol. Commun. 4, 18 (2016).
36. Chew, J. et al. Neurodegeneration. C9ORF72 repeat expansions in mice cause TDP-43 pathology, neuronal loss, and behavioral deﬁcits. Science 348, 1151–1154 (2015).
37. Murray J. et al. Structural characterization of ribosome recruitment and translocation by type IV IRES. Elife 5, e13567 (2016).
38. Fernandez, I. S., Bai, X. C., Murshudov, G., Scheres, S. H. & Ramakrishnan, V. Initiation of translation by cricket paralysis virus IRES requires its translocation in the ribosome. Cell 157, 823–831 (2014).
39. Pestova, T. V. & Hellen, C. U. Translation elongation after assembly of ribosomes on the Cricket paralysis virus internal ribosomal entry site without initiation factors or initiator tRNA. Genes Dev. 17, 181–186 (2003).
40. Mackenzie, I. R. et al. Quantitative analysis and clinico-pathological correlations of different dipeptide repeat protein pathologies in C9ORF72 mutation carriers. Acta Neuropathol. 130, 845–861 (2015).
41. Namer, L. S. et al. An ancient pseudoknot in TNF-α Pre-mRNA activates PKR, inducing eIF2α phosphorylation that potently enhances splicing. Cell Rep. 20, 188–200 (2017).
42. Martin, F. et al. Cap-assisted internal initiation of translation of histone H4. Mol. Cell 41, 197–209 (2011).
43. Kozak, M. Point mutations deﬁne a sequence ﬂanking the AUG initiator codon that modulates translation by eukaryotic ribosomes. Cell 44, 283–292 (1986).
44. Choi, S. H. et al. A three-dimensional human neural cell culture model of Alzheimer’s disease. Nature 515, 274–278 (2014).
45. Chew, G. L., Pauli, A. & Schier, A. F. Conservation of uORF repressiveness and sequence features in mouse, human and zebraﬁsh. Nat. Commun. 7, 11663 (2016).
46. Liang, X. H. et al. Translation efﬁciency of mRNAs is increased by antisense oligonucleotides targeting upstream open reading frames. Nat. Biotechnol. 34, 875–880 (2016).
47. Barbosa, C., Peixeiro, I. & Romao, L. Gene expression regulation by upstream open reading frames and human disease. PLoS Genet. 9, e1003529 (2013).
48. Calvo, S. E., Pagliarini, D. J. & Mootha, V. K. Upstream open reading frames cause widespread reduction of protein expression and are polymorphic among humans. Proc. Natl Acad. Sci. USA 106, 7507–7512 (2009).
49. Boussemart, L. et al. eIF4F is a nexus of resistance to anti-BRAF and anti-MEK cancer therapies. Nature 513, 105–109 (2014).
50. Robert, F. et al. Translation initiation factor eIF4F modiﬁes the dexamethasone response in multiple myeloma. Proc. Natl Acad. Sci. USA 111, 13421–13426 (2014).
51. Garreau de Loubresse, N. et al. Structural basis for the inhibition of the eukaryotic ribosome. Nature 513, 517–522 (2014).
52. Jiang, J. et al. Gain of toxicity from ALS/FTD-linked repeat expansions in C9ORF72 is alleviated by antisense oligonucleotides targeting GGGGCCcontaining RNAs. Neuron 90, 535–550 (2016).
53. Jain, A. & Vale, R. D. RNA phase transitions in repeat expansion disorders. Nature 546, 243–247 (2017).
54. Sellier, C. et al. Translation of expanded CGG repeats into FMRpolyG is pathogenic and may contribute to fragile X tremor ataxia syndrome. Neuron 93, 331–347 (2017).
55. Bhat, M. et al. Targeting the translation machinery in cancer. Nat. Rev. Drug Discov. 14, 261–278 (2015).
56. Ingolia, N. T., Lareau, L. F. & Weissman, J. S. Ribosome proﬁling of mouse embryonic stem cells reveals the complexity and dynamics of mammalian proteomes. Cell 147, 789–802 (2011).
57. Jackson, R. & Standart, N. The awesome power of ribosome proﬁling. RNA 21, 652–654 (2015).
58. McRae, E. K. S. et al. Human DDX21 binds and unwinds RNA guanine quadruplexes. Nucleic Acids Res. 45, 6656–6668 (2017).
59. Gendron, T. F. et al. Cerebellar c9RAN proteins associate with clinical and neuropathological characteristics of C9ORF72 repeat expansion carriers. Acta Neuropathol. 130, 559–573 (2015).
60. Davidson, Y. et al. Neurodegeneration in frontotemporal lobar degeneration and motor neurone disease associated with expansions in C9orf72 is linked to

TDP-43 pathology and not associated with aggregated forms of dipeptide repeat proteins. Neuropathol. Appl. Neurobiol. 42, 242–254 (2016). 61. Yu, C. H., Teulade-Fichou, M. P. & Olsthoorn, R. C. Stimulation of ribosomal frameshifting by RNA G-quadruplex structures. Nucleic Acids Res. 42, 1887–1892 (2014). 62. Kapur, M., Monaghan, C. E. & Ackerman, S. L. Regulation of mRNA translation in neurons—a matter of life and death. Neuron 96, 616–637 (2017). 63. Millevoi, S., Moine, H. & Vagner, S. G-quadruplexes in RNA biology. Wiley Interdiscip. Rev. RNA 3, 495–507 (2012). 64. Di Prisco, G. V. et al. Translational control of mGluR-dependent long-term depression and object-place learning by eIF2alpha. Nat. Neurosci. 17, 1073–1082 (2014). 65. Bal, N. V. et al. Upstream open reading frames located in the leader of protein kinase Mzeta mRNA regulate its translation. Front. Mol. Neurosci. 9, 103 (2016). 66. Lee, J. et al. An upstream open reading frame impedes translation of the huntingtin gene. Nucleic Acids Res. 30, 5110–5119 (2002). 67. Kole, R., Krainer, A. R. & Altman, S. RNA therapeutics: beyond RNA interference and antisense oligonucleotides. Nat. Rev. Drug Discov. 11, 125–140 (2012). 68. Johansson, H. E., Belsham, G. J., Sproat, B. S. & Hentze, M. W. Target-speciﬁc arrest of mRNA translation by antisense 2’-O-alkyloligoribonucleotides. Nucleic Acids Res. 22, 4591–4598 (1994). 69. Kohler, A. & Hurt, E. Exporting RNA from the nucleus to the cytoplasm. Nat. Rev. Mol. Cell Biol. 8, 761–773 (2007). 70. Soto Rifo, R., Ricci, E. P., Decimo, D., Moncorge, O. & Ohlmann, T. Back to basics: the untreated rabbit reticulocyte lysate as a competitive system to recapitulate cap/poly(A) synergy and the selective advantage of IRES-driven translation. Nucleic Acids Res. 35, e121 (2007).
Acknowledgements
We thank Dr Doo Yeon Kim for his support in culturing ReNcell VM, Amélie Laugel, Michael Baughn, Dr Anna-Claire Devlin, and Dr Ying Sun for technical assistance, Dr Gilbert Eriani, members of Dr Brian J. Wainger and Dr Mark W. Albers laboratories, Dr Merit Cudkowicz, Dr Shuying Sun, Dr Raymond Kaempfer, Dr Frank Rigo, and Dr Don W. Cleveland for helpful discussions and continuous support. R.T. was supported by a grant from the Philippe Foundation. This work was supported by CNRS, Université de Strasbourg, a grant from the ANR to F.M. (ANR-11-SVSE802501). C.L.-T. was supported by the Department of Neurology at the Massachusetts General Hospital and grants from Target ALS (13-04827) and from NINDS/NIH (R01NS087227).
Author contributions
R.T., F.M. and C.L-T. designed research; R.T., F.F., F.M. and C.L.-T. analyzed the data; R. T., L.S., F.M., F.F., M.J., M.W., C.-Z.L., C.-C.L. and T.G. performed research; J.J, L.D., H. A.-H., K.J.-W., T.G. and L.P. contributed key reagents and methodology. R.T., F.F., F.M. and C.L.-T. wrote the manuscript.
Additional information
Supplementary Information accompanies this paper at https://doi.org/10.1038/s41467017-02643-5.
Competing interests: The authors declare no competing ﬁnancial interests.
Reprints and permission information is available online at http://npg.nature.com/ reprintsandpermissions/
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional afﬁliations.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/ licenses/by/4.0/.
© The Author(s) 2018

14

NATURE COMMUNICATIONS | (2018)9:152

| DOI: 10.1038/s41467-017-02643-5 | www.nature.com/naturecommunications