- Journal List
- HHS Author Manuscripts
- PMC4268322
Programmable RNA recognition and cleavage by CRISPR/Cas9
Mitchell R. O’Connell
1Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
Benjamin L. Oakes
1Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
Samuel H. Sternberg
2Department of Chemistry, University of California, Berkeley, California 94720, USA
Alexandra East-Seletsky
1Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
Matias Kaplan
3Howard Hughes Medical Institute, University of California, Berkeley, California 94720, USA
Jennifer A. Doudna
1Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
2Department of Chemistry, University of California, Berkeley, California 94720, USA
3Howard Hughes Medical Institute, University of California, Berkeley, California 94720, USA
4Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
Associated Data
- Supplementary Materials
-
NIHMS621290-supplement-1.jpg (1.4M)GUID: CEDD3DEE-35A4-45DB-8B53-9E7B6C3D456FGUID: EC53C40E-5750-482A-882F-1F7B85CED029NIHMS621290-supplement-2.jpg (760K)GUID: 5C04C4EA-38B7-491E-A5BA-BAFF3AEE97E0NIHMS621290-supplement-3.jpg (138K)GUID: 7D43731A-F72E-4DAB-B6CA-DC7CC4330D92NIHMS621290-supplement-4.jpg (408K)GUID: D5F80904-8E12-448A-824C-D58C20532043NIHMS621290-supplement-5.jpg (308K)GUID: 5E8ACDDD-16F5-4D48-A78E-9BA0F20C0C13NIHMS621290-supplement-6.jpg (771K)GUID: 529F7B2A-7D62-4AF3-9E7E-810A14004EF9NIHMS621290-supplement-7.jpg (612K)GUID: 1F5FD0BC-1C6D-4996-9132-C11387C4ECCANIHMS621290-supplement-8.jpg (103K)GUID: 12257A66-EEEA-4D85-B69E-A55B890ECE87GUID: 96EE017C-4A22-4D5A-86EA-2DC3D10A596A
Abstract
The CRISPR-associated protein Cas9 is an RNA-guided DNA endonuclease that uses RNA:DNA complementarity to identify target sites for sequence-specific doublestranded DNA (dsDNA) cleavage1-5. In its native context, Cas9 acts on DNA substrates exclusively because both binding and catalysis require recognition of a short DNA sequence, the protospacer adjacent motif (PAM), next to and on the strand opposite the 20-nucleotide target site in dsDNA4-7. Cas9 has proven to be a versatile tool for genome engineering and gene regulation in many cell types and organisms8, but it has been thought to be incapable of targeting RNA5. Here we show that Cas9 binds with high affinity to single-stranded RNA (ssRNA) targets matching the Cas9-associated guide RNA sequence when the PAM is presented in trans as a separate DNA oligonucleotide. Furthermore, PAM-presenting oligonucleotides (PAMmers) stimulate site-specific endonucleolytic cleavage of ssRNA targets, similar to PAM-mediated stimulation of Cas9-catalyzed DNA cleavage7. Using specially designed PAMmers, Cas9 can be specifically directed to bind or cut RNA targets while avoiding corresponding DNA sequences, and we demonstrate that this strategy enables the isolation of a specific endogenous mRNA from cells. These results reveal a fundamental connection between PAM binding and substrate selection by Cas9, and highlight the utility of Cas9 for programmable and tagless transcript recognition.
CRISPR–Cas immune systems must discriminate between self and non-self to avoid an autoimmune response9. In type I and II systems, foreign DNA targets which contain adjacent PAM sequences are targeted for degradation, whereas potential targets in CRISPR loci of the host do not contain PAMs and are avoided by RNA-guided interference complexes3,5,6,10. Single-molecule and bulk biochemical experiments showed that PAMs act both to recruit Cas9–guide RNA complexes (Cas9–gRNA) to potential target sites and to trigger nuclease domain activation7. Cas9 from Streptococcus pyogenes recognizes a 5’-NGG-3’ PAM on the non-target (displaced) DNA strand4,6, suggesting that PAM recognition may stimulate catalysis through allosteric regulation. Moreover, the HNH nuclease domain of Cas9, which mediates target strand cleavage4,5, is homologous to other HNH domains that cleave RNA substrates11,12. Based on the observation that single-stranded DNA (ssDNA) targets can be activated for cleavage by a separate PAMmer oligonucleotide7, and that similar HNH domains can cleave RNA, we wondered whether a similar strategy would enable Cas9 to cleave ssRNA targets in a programmable fashion (Fig. 1a).
Using S. pyogenes Cas9 and dual-guide RNAs (Methods), we performed in vitro cleavage experiments using a panel of RNA and DNA targets (Fig. 1b and Extended Table 1). Deoxyribonucleotide-comprised PAMmers specifically activated Cas9 to cleave ssRNA (Fig. 1c), an effect that required a 5’-NGG-3’ or 5’-GG-3’ PAM. RNA cleavage was not observed using ribonucleotide-based PAMmers, suggesting that Cas9 may recognize the local helical geometry and/or deoxyribose moieties within the PAM. Consistent with this idea, dsRNA targets were not cleavable, and RNA–DNA heteroduplexes could only be cleaved when the non-target strand was composed of deoxyribonucleotides. Interestingly, we found that Cas9 cleaved the ssRNA target strand between positions 4 and 5 of the base-paired guide RNA-target RNA hybrid (Fig. 1d), in contrast to the cleavage between positions 3 and 4 observed for dsDNA3-5 likely due to subtle differences in substrate positioning. However, we did observe a significant reduction in the pseudo-first order cleavage rate constant of PAMmeractivated ssRNA as compared to ssDNA7 (Extended Data Fig. 1).
We hypothesized that PAMmer nuclease activation would depend on the stability of the hybridized PAMmer–ssRNA duplex and tested this by varying PAMmer length. As expected, ssRNA cleavage was lost when the predicted melting temperature for the duplex decreased below the temperature used in our experiments (Fig. 1e). In addition, large molar excesses of di- or tri-deoxyribonucleotides in solution were poor activators of Cas9 cleavage (Extended Data Fig. 2). Collectively, these data demonstrate that hybrid substrate structures composed of ssRNA and deoxyribonucleotide-based PAMmers that anneal upstream of the RNA target sequence can be cleaved efficiently by RNA-guided Cas9.
We investigated the binding affinity of catalytically inactive (dCas9; D10A/H840A) dCas9–gRNA for ssRNA targets with and without PAMmers using a gel mobility shift assay. Intriguingly, while our previous results showed that ssDNA and PAMmer-activated ssDNA targets are bound with indistinguishable affinity7, PAMmer-activated ssRNA targets were bound >500-fold tighter than ssRNA alone (Fig. 2a,b). A recent crystal structure of Cas9 bound to a ssDNA target revealed deoxyribose-specific van der Waals interactions between the protein and the DNA backbone13, suggesting that energetic penalties associated with ssRNA binding must be attenuated by favorable compensatory binding interactions with the provided PAM. The equilibrium dissociation constant measured for a PAMmer–ssRNA substrate was within 5-fold of that for dsDNA (Fig. 2b), and this high-affinity interaction again required a cognate deoxyribonucleotide-comprised 5’-GG-3’ PAM (Fig. 2a). Tight binding also scaled with PAMmer length (Fig. 2c), consistent with the cleavage data presented above.
It was known that Cas9 possesses an intrinsic affinity for RNA, but sequence specificity of the interaction was not explored5. Thus, to verify the programmable nature of PAMme-rmediated ssRNA cleavage by Cas9–gRNA, we prepared three distinct guide RNAs (λ2, λ3, and λ4) and showed that their corresponding ssRNA targets could be efficiently cleaved using complementary PAMmers without any detectable cross-reactivity (Fig. 3a). This result indicates that complementary RNA–RNA base-pairing is critical in these reactions. Surprisingly, though, dCas9 programmed with the λ2 guide RNA bound all three PAMmer–ssRNA substrates with similar affinity (Fig. 3b). This observation suggests that high-affinity binding in this case may not require correct base-pairing between the guide RNA and the ssRNA target, particularly given the compensatory role of the PAMmer.
During dsDNA targeting by Cas9–gRNA, duplex melting proceeds directionally from the PAM and strictly requires formation of complementary RNA–DNA base-pairs to offset the energetic costs associated with dsDNA unwinding7. We therefore wondered whether binding specificity for ssRNA substrates would be recovered using PAMmers containing 5’-extensions that create a partially double-stranded target region requiring unwinding (Fig. 3c). Indeed, we found that use of a 5’-extended PAMmer enabled dCas9 bearing the λ2 guide sequence to bind sequence-selectively to the λ2 PAMmer–ssRNA target. The λ3 and λ4 PAMmer–ssRNA targets were not recognized (Fig. 3d and Extended Data Fig. 3), although we did observe a 10-fold reduction in overall ssRNA substrate binding affinity. By systematically varying the length of the 5’ extension, we found that PAMmers containing 2–8 additional nucleotides upstream of the 5’-NGG-3’ offer an optimal compromise between gains in binding specificity and concomitant losses in binding affinity and cleavage efficiency (Extended Data Fig. 4).
Next we investigated whether nuclease activation by PAMmers requires base-pairing between the 5’-NGG-3’ and corresponding nucleotides on the ssRNA. Prior studies showed that DNA substrates containing a cognate PAM that is mismatched with the corresponding nucleotides on the target strand are cleaved as efficiently as a fully base-paired PAM4. Importantly, this could enable targeting of RNA while precluding binding or cleavage of corresponding genomic DNA sites lacking PAMs (Fig. 4a). To test this possibility, we first demonstrated that Cas9–gRNA cleaves PAMmer–ssRNA substrates regardless of whether or not the PAM is base-paired (Fig. 4b, c). When Cas9–RNA was incubated with both a PAMmer–ssRNA substrate and the corresponding dsDNA template containing a cognate PAM, both targets were cleaved. In contrast, when a dsDNA target lacking a PAM was incubated together with a PAMmer-ssRNA substrate bearing a mismatched 5’-NGG-3’ PAM, Cas9–gRNA selectively targeted the ssRNA for cleavage (Fig. 4c). The same result was obtained using a mismatched PAMmer with a 5’ extension (Fig. 4c), demonstrating that this general strategy enables the specific targeting of RNA transcripts while effectively eliminating any targeting of their corresponding dsDNA template loci.
We next explored whether Cas9-mediated RNA targeting could be applied for tagless transcript isolation from HeLa cells (Fig. 4d). The immobilization of Cas9 on a solid-phase resin is described in Methods (see also Extended Data Fig. 5). As a proof of concept, we first isolated GAPDH mRNA from HeLa total RNA using biotinylated dCas9, gRNAs and PAMmers (Extended Table 2) that target four non-PAM-adjacent sequences within exons 5–7 (Fig. 4e). We observed a substantial enrichment of GAPDH mRNA relative to a control β-actin mRNA by Northern blot analysis, but saw no enrichment using a non-targeting gRNA or dCas9 alone (Fig. 4f).
We then used this approach to isolate endogenous GAPDH transcripts from HeLa cell lysate under physiological conditions. In initial experiments, we found that Cas9–gRNA captured two GAPDH-specific RNA fragments rather than the full-length mRNA (Fig. 4g). Based on the sizes of these bands, we hypothesized that RNA:DNA heteroduplexes formed between the mRNA and PAMmer were cleaved by cellular RNase H. Previous studies have shown that modified DNA oligonucleotides can abrogate RNase H activity14, and therefore we investigated whether Cas9 would tolerate chemical modifications to the PAMmer. We found that a wide range of modifications (locked nucleic acids, 2’-OMe and 2’-F ribose moieties) still enabled PAMmer-mediated nuclease activation (Extended Data Fig. 6). Importantly, by varying the pattern of 2’-OMe modifications in the PAMmer, we could completely eliminate RNase H-mediated cleavage during the pull-down and successfully isolate intact GAPDH mRNA (Fig. 4g,h). Interestingly, we consistently observed specific isolation of GAPDH mRNA in the absence of any PAMmer, albeit with lower efficiency, suggesting that Cas9–gRNA can bind to GAPDH mRNA through direct RNA:RNA hybridization (Fig. 4f,g and Extended Data Fig. 7). These experiments demonstrate that RNA-guided Cas9 can be used to purify endogenous untagged RNA transcripts. In contrast to current oligonucleotide-mediated RNA-capture methods, this approach works well under physiological salt conditions and does not require crosslinking or large sets of biotinylated probes15-17.
Here we have demonstrated the ability to re-direct the dsDNA targeting capability of CRISPR/Cas9 for RNA-guided ssRNA binding and/or cleavage (RCas9). Programmable RNA recognition and cleavage has the potential to transform the study of RNA function much as site-specific DNA targeting is changing the landscape of genetic and genomic research8 (Extended Data Fig. 8). Although certain engineered proteins such as PPR proteins and Pumilio/FBF (PUF) repeats show promise as platforms for sequence-specific RNA targeting18- 22, these strategies require re-designing the protein for every new RNA sequence of interest. While RNA interference has proven useful for manipulating gene regulation in certain organisms23, there has been a strong motivation to develop orthogonal nucleic acid-based RNA recognition systems, such as the CRISPR/Cas Type III-B Cmr complex24-28 and the atypical Cas9 from Francisella novicida29,30. In contrast to these systems, the molecular basis for RNA recognition by RCas9 is now clear and requires only the design and synthesis of a matching gRNA and complementary PAMmer. The ability to recognize endogenous RNAs within complex mixtures with high affinity and in a programmable manner paves the way for direct transcript detection, analysis and manipulation without the need for genetically encoded affinity tags.
METHODS
Cas9 and nucleic acid preparation
Wild-type Cas9 and catalytically inactive dCas9 (D10A/H840A) from S. pyogenes were purified as previously described4. crRNAs (42 nt) were either ordered synthetically (Integrated DNA Technologies) or transcribed in vitro with T7 polymerase using single-stranded DNA templates, as described31. tracrRNA was transcribed in vitro and contained nucleotides 15–87 following the numbering scheme used previously4. λ-targeting sgRNAs were in vitro transcribed from linearized plasmids and contain full-length crRNA and tracrRNA connected via a GAAA tetraloop insertion. GAPDH mRNA-targeting sgRNAs were in vitro transcribed from dsDNA PCR products based on an optimized sgRNA design32. Target ssRNAs (55–56 nt) were in vitro transcribed using single-stranded DNA templates. Sequences of all nucleic acid substrates used in this study can be found in Extended Data Tables 1 & 2.
All RNAs were purified using 10–15% denaturing polyacrylamide gel electrophoresis (PAGE). crRNA–tracrRNA duplexes were prepared by mixing equimolar concentrations of each RNA in hybridization buffer (20 mM Tris-HCl, pH 7.5, 100 mM KCl, 5 mM MgCl2), heating to 95 °C for 30 s and slow cooling. Fully double-stranded DNA/RNA substrates (substrates 1, 8–10 in Fig. 1 and substrates 1–2 in Fig. 4) were prepared by mixing equimolar concentrations of each nucleic acid strand in hybridization buffer, heating to 95 °C for 30 s, and slow cooling. RNA, DNA, and chemically modified PAMmers were synthesized commercially (Intergrated DNA Technologies). DNA and RNA substrates were 5’-radiolabeled using [γ-32P]-ATP (PerkinElmer) and T4 polynucleotide kinase (New England Biolabs). dsDNA and dsRNA substrates (Fig. 1c, ,4c)4c) were 5’-radiolabeled on both strands, whereas only the target ssRNA was 5’-radiolabeled in other experiments.
Cleavage assays
Cas9–gRNA complexes were reconstituted before cleavage experiments by incubating Cas9 and the crRNA–tracrRNA duplex for 10 min at 37 °C in reaction buffer (20 mM Tris-HCl, pH 7.5, 75 mM KCl, 5 mM MgCl2, 1 mM dithiothreitol (DTT), 5% glycerol). Cleavage reactions were conducted at 37 °C and contained ~1 nM 5′-radiolabeled target substrate, 100 nM Cas9–RNA, and 100 nM PAMmer, where indicated. Aliquots were removed at each time point and quenched by the addition of RNA gel loading buffer (95% deionized formamide, 0.025% (w/v) bromophenol blue, 0.025% (w/v) xylene cyanol, 50 mM EDTA (pH 8.0), 0.025% (w/v) SDS). Samples were boiled for 10 min at 95 °C prior to being resolved by 12% denaturing PAGE. Reaction products were visualized by phosphorimaging and quantified with ImageQuant (GE Healthcare).
RNA cleavage site mapping
A hydrolysis ladder (OH-) was obtained by incubating ~25 nM 5′-radiolabeled λ2 target ssRNA in hydrolysis buffer (25 mM CAPS (N-cyclohexyl-3- aminopropanesulfonic acid), pH 10.0, 0.25 mM EDTA) at 95 °C for 10 min, before quenching on ice. An RNase T1 ladder was obtained by incubating ~25 nM 5′-radiolabeled λ2 target ssRNA with 1 Unit RNase T1 (NEB) for 5 min at 37 °C in RNase T1 buffer (20 mM sodium citrate, pH 5.0, 1 mM EDTA, 2 M urea, 0.1 mg mL-1 yeast tRNA). The reaction was quenched by phenol/chloroform extraction before adding RNA gel loading buffer. All products were resolved by 15% denaturing PAGE.
Electrophoretic mobility shift assays
In order to avoid dissociation of the Cas9–gRNA complex at low concentrations during target ssRNA binding experiments, binding reactions contained a constant excess of dCas9 (300 nM), increasing concentrations of sgRNA, and 0.1–1 nM of target ssRNA. The reaction buffer was supplemented with 10 μg ml-1 heparin in order to avoid non-specific association of apo-dCas9 with target substrates7. Reactions were incubated at 37 °C for 45 min before being resolved by 8% native PAGE at 4 °C (0.5× TBE buffer with 5 mM MgCl2). RNA and DNA were visualized by phosphorimaging, quantified with ImageQuant (GE Healthcare), and analyzed with Kaleidagraph (Synergy Software).
Cas9 Biotin Labeling
To ensure specific labeling at a single residue on Cas9, two naturally occurring cysteine residues were mutated to serine (C80S and C574S) and a cysteine point mutant was introduced at residue M1. To attach the biotin moiety, 10 μM WT Cas9 or dCas9 was reacted with a 50-fold molar excess of EZ-Link® Maleimide-PEG2-Biotin (Thermo Scientific) at 25 °C for 2 h. The reaction was quenched by the addition of 10 mM DTT, and unreacted Maleimide-PEG2-Biotin was removed using a Bio-Gel® P-6 column (Bio-Rad). Labeling was verified using a streptavidin bead binding assay, where 8.5 pmol of biotinylated Cas9 or non-biotinylated Cas9 was mixed with either 25 μL streptavidin-agarose (Pierce Avidin Agarose; Thermo Scientific) or 25 μL streptavidin magnetic beads (Dynabeads MyOne Streptavidin C1; Life Technologies). Samples were incubated in Cas9 reaction buffer at RT for 30 min, followed by three washes with Cas9 reaction buffer and elution in boiling SDS-PAGE loading buffer. Elutions were analysed using SDS-PAGE. Cas9 M1C biotinylation was also confirmed using mass spectroscopy performed in the QB3/Chemistry Mass Spectrometry Facility at UC Berkeley. Samples of intact Cas9 proteins were analyzed using an Agilent 1200 liquid chromatograph equipped with a Viva C8 (100 mm × 1.0 mm, 5 μm particles, Restek) analytical column and connected in-line with an LTQ Orbitrap XL mass spectrometer (Thermo Fisher Scientific). Mass spectra were recorded in the positive ion mode. Mass spectral deconvolution was performed using ProMass software (Novatia).
GAPDH mRNA pull-down
HeLa-S3 cell lysates were prepared as previously described33. Total RNA was isolated from HeLa-S3 cells using Trizol reagent according to the manufacturer’s instructions (Life Technologies). Cas9–sgRNA complexes were reconstituted before pull-down experiments by incubating a two-fold molar excess of Cas9 with sgRNA for 10 min at 37 °C in reaction buffer. HeLa total RNA (40 μg) or HeLa lysate (~5×106 cells) was added to reaction buffer with 40 U RNasin (Promega), PAMmer (5 μM) and the biotin-dCas9 (50 nM):sgRNA (25 nM) in a total volume of 100 μL and incubated at 37 °C for 1 h. This mixture was then added to 25 μL magnetic streptavidin beads (Dynabeads MyOne Streptavidin C1; Life Technologies) pre-equilibrated in reaction buffer and agitated at 4 °C for 2 h. Beads were then washed six times with 300 μL wash buffer (20 mM Tris-HCl, pH 7.5, 150 mM NaCl, 5mM MgCl2, 0.1% Triton X-100, 5% glycerol, 1mM DTT, 10 μg ml-1 heparin). Immobilized RNA was eluted by heating beads at 70 °C in the presence of DEPC-treated water and a phenol/chloroform mixture. Eluates were then treated with an equal volume of glyoxal loading dye (Life Technologies) and heated at 50 °C for 1 h before separation via 1% BPTE agarose gel (30 mM Bis-Tris, 10 mM PIPES, 10 mM EDTA, pH 6.5). Northern blot transfers were carried out according to Chomczynski et al.34. Following transfer, membranes were crosslinked using UV radiation and incubated in pre-hybridization buffer (UltraHYB® Ultrasensitive Hybridization Buffer; Life Technologies) for 1 h at 46 °C prior to hybridization. Radioactive Northern probes were synthesized using random priming of GAPDH and β-actin partial cDNAs (for cDNA primers, see Extended Data Table 2) in the presence of [α-32P]-dATP (PerkinElmer), using a Prime-It II Random Primer Labeling kit (Agilent Technologies). Hybridization was carried out for 3 h in pre-hybridization buffer at 46 °C followed by two washes with 2×SSC (300 mM NaCl, 30 mM trisodium citrate, pH 7, 0.5% (w/v) SDS) for 15 min at 46 °C. Membranes were imaged using a phosphorscreen.
Extended Data
Extended Data Figure 1
Extended Data Figure 2
Extended Data Figure 3
Extended Data Figure 4
Extended Data Figure 5
Extended Data Figure 6
Extended Data Figure 7
Extended Data Figure 8
Extended Data Table 1
NA, not applicable.
PS: phosphorothioate bond
LNA: locked nucleic acid
Extended Data Table 2
NA, not applicable.
Supplementary Material
1
10
2
3
4
5
6
7
8
9
Acknowledgments
We thank B. Staahl and K. Zhou for technical assistance, A. Iavarone for assistance with mass spectrometry measurements, Integrated DNA Technologies for the synthesis of DNA and RNA oligonucleotides, and members of the Doudna laboratory and J. Cate for helpful discussions and critical reading of the manuscript. S.H.S. acknowledges support from the National Science Foundation and National Defense Science & Engineering Graduate Research Fellowship programs. A.E.S and B.L.O acknowledge support from NIH NRSA trainee grants. Funding was provided NIH-funded Center for RNA Systems Biology (5P50GM102706-03). J.A.D. is an Investigator of the Howard Hughes Medical Institute.
Footnotes
M.R.O. and S.H.S. conceived the project. M.R.O., B.L.O., S.H.S., A.E.S and M.K. conducted experiments. All authors discussed the data, and M.R.O, S.H.S, B.L.O and J.A.D wrote the manuscript.
J.A.D., M.R.O, B.L.O, and S.H.S are inventors on a related patent.