AU2004298527A1 - RNAi potency prediction method - Google Patents

RNAi potency prediction method Download PDF

Info

Publication number
AU2004298527A1
AU2004298527A1 AU2004298527A AU2004298527A AU2004298527A1 AU 2004298527 A1 AU2004298527 A1 AU 2004298527A1 AU 2004298527 A AU2004298527 A AU 2004298527A AU 2004298527 A AU2004298527 A AU 2004298527A AU 2004298527 A1 AU2004298527 A1 AU 2004298527A1
Authority
AU
Australia
Prior art keywords
rnai
potency
reagent
previous
target gene
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
AU2004298527A
Inventor
Jonathan Hall
Dieter Huesken
Jorg Bernd Lange
Francois Jean-Charles Natt
Mischa Werner Henri Marie Reinhardt
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Novartis AG
Original Assignee
Novartis AG
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Novartis AG filed Critical Novartis AG
Publication of AU2004298527A1 publication Critical patent/AU2004298527A1/en
Abandoned legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/111General methods applicable to biologically active non-coding nucleic acids
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P25/00Drugs for disorders of the nervous system
    • A61P25/28Drugs for disorders of the nervous system for treating neurodegenerative disorders of the central nervous system, e.g. nootropic agents, cognition enhancers, drugs for treating Alzheimer's disease or other forms of dementia
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/20Supervised data analysis
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2310/00Structure or type of the nucleic acid
    • C12N2310/10Type of nucleic acid
    • C12N2310/14Type of nucleic acid interfering N.A.
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2320/00Applications; Uses
    • C12N2320/10Applications; Uses in screening processes
    • C12N2320/11Applications; Uses in screening processes for the determination of target sites, i.e. of active nucleic acids

Description

WO 2005/059132 PCT/EP2004/014037 RNAI POTENCY PREDICTION METHOD Background of the Invention The present invention relates generally to the field of RNA interference (RNAi) and particularly to methods for obtaining RNAi reagents of high RNAi potency using an algorithm trained by an artificial neuronal network. Field of the Invention Since Elbashir et al. have demonstrated the ability of synthetic small interfering RNA (siRNA) to mediate, through RNA interference (RNAi) mechanism, specific mRNA down regulation in mammalian cells [see, for instance, Elbashir et al., Nature, Vol. 411, pp. 494-498 (2001); and Caplen et al., PNAS, Vol. 98, No. 17, pp. 9742-9747 (2001)], this technique has been increasingly used as a research tool to investigate gene function by exploring the phenotypes induced by the specific down-regulation of a given gene. In particular, as gene silencing experiment by mean of an siRNA or RNA-interference type of reagent requires only the knowledge of a part of the nucleotide sequence of the targeted gene, it is envisageable, once the genome of a particular organism is known, to design for every gene a RNAi experiment and screen for phenotypes relevant for instance for therapeutic targets discovery. This genome-wide approach dictates the specifications for the design of the RNAi reagents. Indeed, specificity parameters avoiding off-target silencing become of particular importance. Also, the characterization of individual siRNAs for potency is no longer envisageable. Consequently, potency prediction algorithms are required for the application of RNA gene silencing experiment to genome-wide phenotypic screens. It has been suggested that similarly to antisense, target accessibility plays an important role in the potency of siRNA. See Kretschmer-Kazemi et al., Nucleic Acids Res., Vol. 31, No. 15, pp. 4417-4424 (2003). Recently another study [see Anastasia Khvorova, Cell, Vol. 115, pp. 209-216 (2003)] revealed some sequence requirements triggering siRNA and miRNA potency such as notably a low internally stability of the dsRNA at 5' of the antisense strand as well as in the region 9-14 of the antisense strand. These findings were obtained by a statistical analysis of the relationship between siRNA potency and duplex internal WO 2005/059132 PCT/EP2004/014037 -2 thermodynamic stabilities. The study was based on siRNA potencies of 375 randomly selected siRNA against three different targets. Given the fact that the RNAi mechanism is not fully characterized and that many additional parameters may have an impact on siRNA potency, it would be beneficial to acquire larger functional data sets in order to have a better understanding of the sequence-activity relationship. Summary of the Invention In one aspect, the present invention pertains to a method of making an algorithm for the prediction of the RNAi potency of a RNAi reagent comprising: a) determining experimentally the potency of a plurality of RNAi reagent to down regulate a reporter protein readout; and b) using said potency data set to train a artificial neuronal network. In another aspect, the present invention pertains to an algorithm obtained by a method according to the present invention. In another aspect, the present invention pertains to a method for predicting the RNAi potency of a RNAi reagent comprising: a) providing a plurality of RNAi reagent sequences comprising a region complementary to a given target gene; b) applying a trained artificial neuronal network according to the present invention to said RNAi reagent sequences; and c) selecting the RNAi reagent sequence(s) which are predicted to be potent. In another aspect, the present invention pertains to a method for inhibiting the expression of a given target gene, comprising: a) providing a plurality of RNAi reagent sequences comprising a region complementary to a given target gene; b) applying a trained artificial neuronal network according to the present invention to a said RNAi reagent sequences; c) selecting the RNAi reagent sequence(s) which are predicted to be potent; WO 2005/059132 PCT/EP2004/014037 -3 d) synthesizing the RNAi reagent(s) selected in c); e) exposing cells expressing the target gene with the RNAi reagent(s) of d); and f) measuring the activity of the RNAi reagent or other phenotypes which may be induced by the down regulation of the target gene. Description of the Fiqures Figure 1: An example of a normalized data set. 79 siRNA targeting a 3'-UTR insert of a YFP mRNA were co-transfected with the reporter fusion mRNA (H1299, 50 nM, readout at 50h). Grey bars are 3'-UTR targeting siRNA, black bars are positive and negative controls. Negative control was arbitrary set at 10% potency and positive control at 90% potency. Each siRNA potencies were normalized according to these controls. Figure 2: Illustrates the filtering of screening data. Figure 3: Comparison of prediction versus screening measures for the training set. Figure 4: Comparison of prediction versus screening measures for the testing set. Figure 5: Dependence of the prediction - measure correlation (on test set) from the size of the training set. Detailed Description of the Invention All patent applications, patents and literature references cited herein are hereby incorporated by reference in their entirety. As used herein, the terms "RNAi reagent" and "oligoribonucleotide" are used interchangeably and mean an oligomer or polymer of ribonucleic acid (RNA) or deoxyribonucleic acid (DNA) or a mimetic thereof. The RNAi reagent may also comprise modified ribonucleotide residues. Suitable modifications are known in the art. See, for instance, Uhlmann, Current Opin. Drug Discovery Dev., Vol. 3, No. 2, pp. 203-213 (2000); and Uhlmann and Peyman, Chem. Rev., Washington, DC, Vol. 90, No. 4, pp. 543-584 (1990). The term RNAi reagent encompasses both single-stranded and double-stranded nucleic acid molecules. Double-stranded nucleic acid molecules may be composed of two WO 2005/059132 PCT/EP2004/014037 -4 separate strands or of one strand comprising two regions which can form a double-stranded structure and a spacer region between the two regions forming a hairpin loop. In the context of RNAi, the RNAi reagents are preferably of double-stranded structure and comprise a sequence complementary to a target gene. However, the present invention is not limited to double stranded structures and includes also single-stranded RNAi reagents capable of inducing RNAi. See Schwarz et al., Mol. Cell, Vol. 10, No. 3, pp. 537-548 (2002). In a first aspect, the present invention provides a method of making an algorithm for the prediction of the RNAi potency of a RNAi reagent comprising: a) determining experimentally the potency of a plurality of RNAi reagent to down regulate a reporter protein readout; and b) using said potency data set to train a artificial neuronal network. In one embodiment, the method of making an algorithm for the prediction of the RNAi potency of a RNAi reagent comprises the steps of: a) determining experimentally the RNAi potency of a plurality of RNAi reagents comprising a sequence complementary to at least one target gene; b) generating a data set of the potency of said RNAi reagents with the experimentally determined RNAi potency of a), wherein said data set though obtained and different targets (reporter fusion-mRNA) has a common protein read-out which is normalizable by mean reporter specific positive and negative controls; and c) training an artificial neuronal net using said read-out. In the context of the present invention, the term algorithm means a set of equations and a set of rules which can be applied to the data automatically and can be implemented as a code which executes on a computer. "RNAi potency" or "potency" is a term of the art and means the relative ability of a given siRNA to down-regulate a given protein or mRNA upon its transfection in a cellular assay. Typically, potency of a siRNA is determined by measuring the level of expression of the target mRNA or protein and is usually expressed as a percentage of the negative control. Thus, a high potency means that the RNAi reagent is capable of efficiently inhibiting, i.e., decreasing, the expression of a target gene, whereas a low potency means that the expression of the target gene is not or only little inhibited. A potent RNAi reagent inhibits the WO 2005/059132 PCT/EP2004/014037 -5 expression of a target gene more than 50% as compared to negative control, preferably more than 60%, more than 70%, more than 80%, most preferably more than 90%. The RNAi reagent in accordance with the present invention is a RNAi reagent suitable for RNAi experimentation. Various types of RNAi reagents suitable for RNAi are known in the art. See Dykxhoorn et al., Nature Rev., Vol. 4, pp. 457-467 (2003). Such a RNAi reagent comprises a sequence that is complementary to a target gene. Complementary to a target gene within the context of the present invention means that the sequence is complementary to an RNA transcribed from the DNA sequence of the target gene, including pre-mRNA, mRNA, cDNA. The term "target gene" is meant to include any DNA sequence which is expressed, i.e., transcribed to RNA, in a cell, tissue or organism. The expressed sequence needs not necessarily be translated into a protein and includes for instance also pre-mRNAs, regulatory RNAs, rRNAs, etc. The sequence complementary to the target gene typically is from about 19-23 nucleotides long, but may also be longer. Preferably, the complementary sequence is less than 50 nucleotides long, more preferably from 15-35 nucleotides long or from 18-25 nucleotides long. The complementary sequence is preferably 100% identical to the corresponding sequence of the target gene, i.e., there is no mismatch between the complementary sequence and the corresponding sequence of the target gene. In some embodiments, the complementary sequence may comprise one, two, three, four, five or more mismatches, if these mismatches do not abolish the RNAi activity of the RNAi reagent. The RNAi reagent used for RNAi is preferably double-stranded and may be composed of two separate strands, but may also be composed of one strand forming a hairpin loop. The double-strand RNA region of the RNAi reagent may contain mismatches and function in a miRNA like mechanism. Examples for types of RNAi reagents that mediate RNAi are for instance siRNAs or miRNA (microRNA) or small hairpin RNAs (shRNAs). In a one step, the RNA potency of a plurality of RNAi reagents is experimentally determined in accordance with the methods of the present invention. In general, it is advantageous to provide a large number of RNAi reagents, as there is a positive correlation between the number of RNAi reagents that are provided and the quality of the algorithm. However, for practical reasons as RNAi reagent synthesis and the experimental determination of RNAi potency is costly and time-consuming, it may be desirable to keep the number of RNAi reagents for experimental determination of RNA potency as low as possible. However, the number of RNAi reagents should not be below a minimum number, below which the WO 2005/059132 PCT/EP2004/014037 -6 algorithm is not properly trainable. In a preferred embodiment, at least 10 RNAi reagents, at least 50 RNAi reagents or at least 100 RNAi reagents are provided, in a more preferred embodiment at least 200 RNAi reagents, at least 500 RNAi reagents, at least 1000 RNAi reagents or at least 2000 RNAi reagents are provided. In another preferred embodiment, less than 10000 RNAi reagents, preferably less than 5000 RNAi reagents or more preferably less than 3000 RNAi reagents are provided. In another preferred embodiment, the RNAi reagents are randomly selected. The RNAi reagents may be overlapping or not overlapping, in a preferred embodiment, the RNAi reagents are not overlapping. The RNAi reagents comprise a region complementary to a target gene. Several RNAi reagents may comprise a region complementary to the same target gene, overlapping or not overlapping. In one particular embodiment, all RNAi reagents comprise a region complementary to the same target gene, overlapping or not overlapping. In another embodiment, the RNAi reagents comprise a region complementary to more than one target gene, preferably to at least 2 target genes, at least 5 target genes or at least 10 target genes. In another preferred embodiment of the present invention, the RNAi reagent sequences provided in step a) were pre-screened for RNAi specificity. "RNAi specificity" or "specificity" is a term of the art and refers in the context of the present invention to the selectivity of a RNAi reagent, i.e., to the capability of a RNAi reagent to selectively inhibit or decrease the expression of a given target gene without inhibiting or decreasing the expression of other genes expressed in a cell, tissue or organism. Ideally, specific RNAi reagents only inhibit the expression of the target gene and leave the expression of all other genes expressed in a cell, tissue or organism unaffected. For this purpose, the specific RNAi reagents advantageously comprise a fully complementary sequence to the target gene, i.e., a complementary sequence with no mismatch to the target sequence, but no fully complementary sequence to all other genes expressed in a cell or an organism, i.e., said complementary sequences have at least one mismatch, preferably at least two mismatches, more preferably at least three mismatches to the all sequences expressed in a cell, tissue or organism other than the target gene. Pre-selection of the specificity of the RNAi reagents can for instance be done by computationally comparing the sequence of the RNAi reagents of interest to all known expressed sequences available in the databases for a given cell, tissue or organism using a suitable software for sequence comparison.
WO 2005/059132 PCT/EP2004/014037 -7 Numerous methods suitable for experimentally determining RNAi potency of a RNAi reagent are known in the art. Generally, a double-stranded RNAi reagent comprising a complementary region to a given target gene is transfected into a cell expressing the target gene. For transfection, different methods, such as for instance electroporation, use of cationic lipids or cationic polymers as helpers for transfection may be used. The cells are then incubated under suitable conditions allowing the expression of the target gene. The expression of the target gene is subsequently measured using a suitable technique, such as, for instance, RT-PCR or measuring the amount of a reporter protein. In a preferred embodiment, a fusion-mRNA coding for a reporter is co-transfected with the siRNA. Preferably, the target nucleotide sequence is inserted in 3'-UTR of the reporter coding sequence. As such, the target will not be translated and its down regulation will have no biological impact. In a preferred embodiment the reporter protein is a Yellow Fluorescent protein (YFP). In another embodiment, negative and reporter specific controls, i.e., a siRNA targeting the coding region of the reporter protein and thereby silencing the reporter protein with a similar potency independently of the 3'-UTR insert, are used and allow the comparison of the expression level of the reporter protein obtained with each siRNA with the expression levels of the negative control and the reporter specific positive control. As such expression levels measured for all of the siRNAs can be compared and pooled in one single homogeneous data set. Basically, any kind of cell may be used for transfection, however, in a preferred embodiment, the cell is a eukaryotic cell, preferably an animal cell, more preferably a mammalian cell and most preferably a human cell. Upon comparison of the inhibition observed for each siRNA with the positive and negative controls (also mentioned as normalization), an experimental potency score correlating to the experimentally determined RNAi potency will be created for each RNAi reagent leading to a read-out which can be compiled in a data set of the experiments. In a preferred embodiment, the experimental scores are obtained by measurement of the reporter protein. Preferably all data of the experimental read-out come from a single type of experimental setting under homogenous conditions. The data is preferably based on a measurement on protein level and not mRNA level, i.e., the amount of protein expressed by the target gene is measured upon exposure with the RNAi reagent and not the amount of mRNA. A typical experimental setup for experimentally determining the RNAi potency in accordance with the present invention is described hereinbelow in the Examples.
WO 2005/059132 PCT/EP2004/014037 -8 In a preferred embodiment of the present invention, a reporter assay is used for the experimental determination of RNAi activity. The use of a reporter assay in accordance with the present invention allows to screen a large number of siRNA against a broad panel of targets with a common experimental read-out. Such an assay is described in Hjsken et al., Nucleic Acids Res., Vol. 31, No. 17, p. e102 (2003). Briefly, an mRNA fusion transcript construct comprising of a full-length reporter gene with a target region of interest inserted into the 3'-untranslated region is provided. For instance, Luciferase and Fluorescent reporter genes can be used in the constructs. The RNAi reagents to be tested for RNAi potency comprise a sequence complementary of interest inserted into the 3'-untranslated region. The RNAi reagent is then transfected with a suitable transfection method into cells expressing transiently or constitutively the reporter gene construct and cultured under suitable conditions allowing the expression of the reporter gene. The level of protein expressed by the reporter gene is subsequently measured. Such an assay allows measuring the RNAi potency of large numbers of RNAi reagents at the protein level against a broad panel of targets with a common read-out. As such, the generated data set is homogeneous and all potency data are comparable each together. The above described data set is used for training of an artificial neuronal network. Artificial neuronal networks are known in the art, see, for instance, Zell, Simulation neuronaler Netzwerke, Addison Wesley (1994); and Rumelhart and McLelland, Parallel Distributed Processing, Vol.1, MIT Press, Cambridge, MA (1986), and available, for instance, on http://www-ra.informatik.uni-tuebingen.de/SNNS. Statistical information will be extracted from the siRNA sequence antisense strand by artificial neuronal networks and correlated with the screening measures. Eventually, a trained network can be applied to an arbitrary input sequence to give an estimate of the 'would-be' screening measure. For instance, for the purpose of illustration, a 3 layer feed-forward network with back propagation training 7-8 can be used. The input layer consists of 4 lanes of ordered nodes, see Figure 1. There is one lane per nucleic base type and always 4 nodes of different base type referring to the same input sequence position. The number of positions is the length of the input sequence. At any given time, during training and/or application, exactly one node at any given position is activated. The activities along the ordered lanes then represent the input sequence. The signals of activated nodes are propagated from the input layer into the second layer, also called layer of hidden units. During this propagation the signals of the input layer, either 0 or WO 2005/059132 PCT/EP2004/014037 -9 1, are weighted differently, summed into the signals of the hidden units. Similarly the signals of the hidden units progress into a single output node of the third and last layer. The weights are the memory elements to represent the statistical knowledge. Initially the weights are randomly set, and result in output signals for siRNA antisense sequence which deviate from the true screening signals. The difference between current network output signal and the experimental is used to change all weights to reduce the differences. Back propagation comes into place by the network for having a 'true' target signal for hidden units in the second layer. Another aspect of the present invention provides a computer system comprising computer hardware and the algorithm of the present invention. A further aspect of the present invention provides a computer readable medium comprising the algorithm of the present invention. Another aspect of the present invention provides methods for obtaining RNAi reagents with an enhanced likelihood of RNAi potency against a given target gene, i.e., an enhanced likelihood of inhibiting the expression of a pre-selected target gene. Thus, for a given number of RNAi reagents designed, a higher percentage will be potent using this method to design RNAi reagents than if the RNAi reagents were designed randomly. Conversely, fewer RNAi reagents have to be designed and screened to find a given number of RNAi reagents of high RNAi potency useful for specifically inhibiting the expression of a target gene. In one embodiment, the method according to the present invention comprises the steps of: a) providing a plurality of RNAi reagent sequences comprising a region complementary to a given target gene; b) applying the trained algorithm according to the present invention to said RNAi reagent sequences using a neuronal network; and c) selecting the RNAi reagent sequence(s) which are predicted to be potent. In a first step, a plurality of candidate RNAi reagents sequences comprising a sequence complementary to the gene to be targeted, i.e. to the gene to be inhibited by RNAi, is chosen. The RNAi reagents may be prescreened for specificity or for the presence or absence of certain sequence motives. They may be overlapping or non-overlapping. In one embodiment, the candidate RNAi reagents are randomly chosen. In a second step, the WO 2005/059132 PCT/EP2004/014037 - 10 potency of the RNAi reagents provided in the first step is predicted using the neuronal network which is trained with an algorithm in accordance with the present invention. In a next step, the RNAi reagent sequences which are predicted to be potent are selected. For instance, the three or five or 10 most potent RNAi reagents can be chosen. Alternatively, all RNAi reagent sequences with a prediction score above a certain threshold score are chosen. In a preferred embodiment, the threshold score is at least 0.7, at least 0.75, at least 0.8 or at least 0.85. The selected RNAi reagents can now be experimentally assayed for their RNAi potency. Thus, in a next step, RNAi reagents suitable for RNAi comprising the sequences which were predicted to be active are synthesized. In a preferred embodiment, the RNAi reagents are chemically synthesized. The skilled person is familiar with chemical methods for the synthesis of such oligonucleotides, for instance, through the well-known technique of solid phase synthesis. However, the RNAi reagents may also be synthesized using biochemical methods such as in vitro transcription or vector based systems. Suitable cells expressing the target gene can now be exposed to the synthesized RNAi reagents (or to the vector comprising the sequence of interest in case of a vector based system), incubated under suitable conditions and the expression level of the target gene can be measured with suitable methods. As a control, the expression level of the target gene of cells which were not exposed to the sequence of interest can be compared. The following examples are included to demonstrate preferred embodiments of the invention and are not intended to limit the invention. EXAMPLES In the approach illustrating this invention more than 3,000 siRNA targeting 34 different mRNA have been screened for potency in a cellular assay. One feature of this study has been to generate a homogeneous data set for subsequent analysis of potency-sequence relationship. This has been possible by the use of a fusion-mRNA reporter assay. See HOsken et al. (2003), supra. In this assay, a plasmid encoding a reporter fusion-mRNA for a reporter protein where the target sequence has been inserted in 3'-UTR of the reporter mRNA is transfected followed by the transfection of the RNAi reagent. As such: a) the result of down regulation of the target sequence has no biological consequences; WO 2005/059132 PCT/EP2004/014037 -11 b) in all assays, the potency readout is at protein level and as the same reporter protein is the whole study, the potency data are not biased by different read-outs; and c) the use of common positive and negative controls in all assays allows the normalization of all potency data. A artificial neuronal network has been used to investigate the sequence potency relationship of this homogeneous potency data se. As a result, the trained artificial neuronal network is able to predict the potency of any siRNA solely based on its nucleotide sequence. As the sequence requirements for potency of gene silencing reagents acting through RNA interference pathway, this approach will be applicable to other RNAi reagents like, for instance, shRNAs or miRNAs. RNAi Reagents The RNAi reagents used in that study were 21-mer double-stranded RNAi reagents with a 19 base-pairing RNA region and dideoxynucleotide overhangs on 3' of each strand. The overhang of the sense strand was systematically a dithymidine whereas the overhangs of the antisense was a dideoxynucleotide designed to be complementary to the target. Screening Methods: eYFP mRNA-Fusion Reporter Assay Construction of reporter expression clones The enhanced cyan and yellow fluorescent protein (eCFP, eYFP) dual reporter based vector pNAS-092 (described in HOsken et al. (2003), supra), was constructed to contain a multiple cloning site after the stop codon of the eYFP for inserting the appropriate cDNAs or ESTs of interest. The eCFP reporter serves for normalization measurements driven under the elongation factor 1 alpha (EF-1 ca promoter and the eYFP reporter was used to monitor activity of siRNA driven under the CMV promoter. The origin of the vector was a plasmid pBudCE4 (Invitrogen) which contains a hCMV and a EF-lca promoter. pNAS-092 was generated by inserting the eCFP gene derived from peCFP-N1 (Clontech) and by transferring the eYFP gene derived from peFP-N1 (Clontech) together with a synthetic DNA fragment bearing the cloning site (EcoRV, Notl, Hindlll, Kpnl, Xbal). The synthetic DNA that was used in the pNAS-092 was confirmed by sequencing. For alternative cloning strategies pNAS-092 was converted to a GatewayTM destination vector pNAS-097 by inserting the WO 2005/059132 PCT/EP2004/014037 -12 attR1 and attR2 cloning sites according to the manufacture protocol (Invitrogen) after the eYFP stop codon. All plasmids used for the final reporter assay were constructed by inserting into the cloning site the c-DNA via ligation (pNAS-092) or recombination (pNAS-097). Cell Lines and Cell Culture The human non-small cell lung carcinoma cell line H-1299 (CRL-5803) was purchased from ATCC (Rockville, MD). H-1299 cells was maintained in 5% humidified CO 2 atmosphere at 37 0 C in RPMI 1640 medium (Life Technologies) containing 10% fetal bovine serum plus 1% L-Glutamine. H1299 cells were split 48 hours prior to transfection reaching a 80% subconfluent stage. Cells were trypsinized, washed and equally dispensed (50 pL) in black 96-well assay plates (Costar, clear bottom) a day before the transfection. Fluorescent protein assays with dual reporter construct carrying a reference gene (intracellular normalization). Plasmid Transfection Lipofectamine-PLUS reagent was incubated with the plasmid diluted in OptiMEM-I (22 ng/pL plasmid, 4.4 mL/mL Lipofectamine-PLUS) and then this mix was diluted 11-fold with OptiMEM-1. Lipofectamine was 1.3-fold pre-diluted with HEPES (20 mM, pH 7.2) and diluted further 28.6-fold with OptiMEM-1 (26.6 pL/mL Lipofectamine) and kept for 15 minutes. Both mixtures were combined 1:1 and incubated for 15 minutes, further diluted 10-fold with OptiMEM-I. The medium was aspirated and 100 pL was added to the cells (0.2 pL/mL Lipofectamine-PLUS, 13.3 pL/mL Lipofectamine, 1 ng/pL plasmid). After 2 hours, 50 pL siRNA transfection mixture was added to the cells which was then further incubated for 2 hours. siRNA Transfection Oligofectamin was diluted with OptiMEM-1 (60 pL/mL) were mixed and incubated for 30 minutes at room temperature. The siRNA was diluted to 600 pM from the hybridized stock solution with hybridization buffer (30 mM HEPES, 100 mM Potassium Acetate, 2 mM Magnesium Acetate: pH 7.63 at room temperature. Annealing 2 minutes at 900C followed by 1 hour at 37°C). Diluted Oligofectamin and siRNA were combined 2 volumes plus WO 2005/059132 PCT/EP2004/014037 -13 1 volume, and incubated for 15 minutes. The siRNA-Oligofectamin mix was further diluted 1:1 with OptiMEM-I and transferred onto the cells (50 pL) (final conc. 0.7 ng/pL plasmid, 10 pL/mL Oligofectamin, 50 nM siRNA). The medium was removed and replaced with 100 pL standard RPMI medium without phenol red containing 10% fetal bovine serum plus 1% L-Glutamine and incubated for 3 days at 37°C. Fluorescence was measured in 24-hour intervals. The fluorescence of eCFP and eYFP was measured with the excitation filter of 436/20 nm and the emission filter of 480/30 nm and the excitation filter of 500/25 nm and the emission filter of 535/30 nm, respectively. The quotient of eYFP/eCFP fluorescence counts expresses the eYFP activity per cell number equivalent. For this data collection with the eYFP reporter assay all siRNA treatments of the positive standard (YFP-specific siRNA NAS-12842/58) and negative standard (luciferase siRNA NAS-8548/9) were done in triplicate. The standard deviation from the mean of the standard siRNAs NAS-12842/58 treatments was calculated and found on average to be 9.1%. Target Selection Firstly, reporter plasmids with 34 different inserts were selected. Size of the inserts varied between 344 nucleotides and 3784 nucleotides. siRNA Sequence Design With 79 siRNA per plate we have 3160 in total. The sequences were designed to walk the inserts randomly, allowing for overlap of varying size (0-20 bases). With an insert size of 27k bases even regularly chosen positions of the 3160 siRNA would result in significant overlap (13 bases) in the siRNA sequences. Sequences with long polynucleotide stretch (five or more consecutive nucleotides) were not considered. In cases of long inserts, two sets of 79 siRNA were designed. The total set of 3160 siRNA sequences was checked for its nucleic content. It was found that all possible motifs up to tetra nucleotides were present in the siRNA sequence set. Screening Format Each siRNA plate contains 79 siRNA, a negative control (anti-luciferase siRNA NAS-8549), two reporter specific siRNA (anti-YFP siRNA NAS-12842 and NAS-12847). Control siRNA WO 2005/059132 PCT/EP2004/014037 -14 were pipetted in from one single batch in triplicates. Eight wells were left empty and will be used for "plasmid only" negative controls. Filtering and Normalization of siRNA Activity Data (see Figure 21 Each plate was assayed in duplicates and YFP level was measured by fluorimetry at two time points. For each plate, linear correlation within the duplicate was checked, as well as the level of inhibition of the positive and negative controls. Data were accepted when a correlation within the duplicate was superior to 0.7 and when the positive control was down regulating YFP to at least 60% as compared to negative control (see Figure 3). Data set from five assays were rejected according to that filter. Remaining data set (2717 sequences) was divided in a training set and a testing set and further filtered by checking each siRNA individually and excluding siRNA displaying a variation higher than 30% within the duplicate. As such, approximately 15% of data were further removed. The remaining duplicate data points were averaged to result in a data set with reduced noise characteristics. The final data set contained 2109 sequences in training set and 234 sequences in testing set. All data points S(i), i indexing the data points, were normalized by the affine system A T(i) = A(S(i)) = (T_high-Tlow) / (S_high-Slow) * (S(i)-S_Iow) + Tlow, in which the original signal of the negative control which is S_low and shall be normalized to give TIow (we set T_low = 0.1 = 10%). Similarly defined is the positive control signal Shigh and its transformed T_high (set to 0.9 = 90%). Artificial Neuronal Network Training (see Figure 3) The siRNA sequence data is presented to the input layer and the screening reporter signals are used to adjust the weights between the network nodes. Each siRNA sequence and its screening measure is presented a total of 10 times. After a one time presentation of all data points the weights of the network are updated synchronously with a learning rate of 0.1 and a momentum of 0.1. See Zell, Simulation Neuronaler Netzwerke, Addison Wesley (1994). Based on five different initializations of weights, the resulting five trained networks weights differed but all five networks showed consistently only slightly varying prediction output. A final output was achieved by averaging the signals of the respective output node of all five WO 2005/059132 PCT/EP2004/014037 -15 networks. We will refer to the average output as 'the' output of 'the' network instead of looking at properties of any of the individual networks for simplicity. Evaluation of Performances of Predictor (see Figure 4) The trained network predicted the experimental inhibition activity with a correlation of 0.63 when applied to the testing set, while it showed moderately higher correlation of 0.665 when applied to the training set. Figure 3 graphs the concordance of the two. In addition to the correlation between prediction and experimental values, the performance of the algorithm can be evaluated by setting a threshold for experimentally active siRNA and for predicted active siRNA. The threshold for experimentally active siRNA was set at 75% normalized potency (0.75). The threshold for predicted active siRNA was set at a score above 0.8. These thresholds form four quadrants containing true negatives (predicted inactive and inactive), false negative (predicted inactive but active), false positive (predicted active but inactive) and true positive (predicted active and active) sequences. The predictor performance can be measured by its sensitivity and selectivity. Predictor sensitivity = true positive / (true positive + false negative) = 0.26 Predictor selectivity = true positive / (true positive + false positive) = 0.71 These values indicate that the predicted sequences have a 71% probability to be active (as defined above). This value has to be compared to the hit rate observed on the whole testing set where 35% of the sequences were active. Also, the predictor will identify 26% of the active sequences. Influence of the Size of the Training Set on the Predictor's Performance (see Figure 5) The training data does not need to be taken completely for training, which allows investigating the degradation of the BIOpred prediction performance with fading set size. The size of the testing set was constant. The correlation, see Figure 5, consistently degrades slowly with reduced training sets. The correlation for a training data set as small as 265 data points still gives correlations of about 0.53.

Claims (18)

1. A method of making an algorithm for the prediction of the RNAi potency of a RNAi reagent comprising: a) determining experimentally the RNAi potency of a plurality of RNAi reagents comprising a sequence complementary to at least one target gene; and b) training an artificial neuronal net using said data set.
2. A method according to claim 1, wherein the RNAi potency is determined by measuring the amount of protein encoded by the target gene.
3. A method according to any of the previous claims, wherein the RNAi potency is determined by a reporter gene assay.
4. A method according to any of the previous claims, wherein the RNAi reagent is targeting the 3'-UTR of the fusion-mRNA encoding for the reporter protein.
5. A method according to any of the previous claims, wherein the data from different fusion-mRNA can be normalized.
6. A method according to any of the previous claims, wherein the potency of at least 100 RNAi reagents, preferably at least 1000 RNAi reagents is determined.
7. A method according to any of the previous claims, wherein the sequences of said RNAi reagents are randomly chosen.
8. A method according to any of the previous claims wherein the sequences of said RNAi reagents have a region between 15 and 30 nucleotides long with sufficient complementary to bind to target mRNA.
9. A method according to any of the previous claims wherein the complementary region of the RNAi reagent contains one or several mismatches to the corresponding region of the target gene.
10. A method according to any of the previous claims wherein the RNAi reagent is a siRNA. WO 2005/059132 PCT/EP2004/014037 -17
11. A method according to any of the previous claims wherein the RNAi reagent is a shRNA.
12. A method according to any of the previous claims wherein the RNAi reagent is a miRNA.
13. An algorithm obtained by a method according to any of the previous claims.
14. A computer readable storage medium comprising an algorithm according to Claim 13.
15. A computer system comprising an algorithm according to Claim 13 and computer hardware.
16. A method for predicting the RNAi potency of a RNAi reagent comprising: a) providing a plurality of RNAi reagent sequences comprising a region complementary to a given target gene; b) running the trained algorithm according to Claim 13 to said RNAi reagent sequences using a neuronal network; and c) selecting the RNAi reagent sequence(s) which are predicted to be potent.
17. A method for inhibiting the expression of a given target gene, comprising: a) providing a plurality of RNAi reagent sequences comprising a region complementary to a given target gene; b) running the trained algorithm according to Claim 13 to said RNAi reagent sequences using a neuronal network; c) selecting the RNAi reagent sequence(s) which are predicted to be potent; d) synthesizing the RNAi reagent(s) selected in c); and e) exposing cells expressing the target gene with the RNAi reagent(s) of d).
18. A method according to Claim 16 or 17, wherein the RNAi reagents that are selected in c) are above a given threshold value.
AU2004298527A 2003-12-10 2004-12-09 RNAi potency prediction method Abandoned AU2004298527A1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US52843003P 2003-12-10 2003-12-10
US60/528,430 2003-12-10
PCT/EP2004/014037 WO2005059132A1 (en) 2003-12-10 2004-12-09 Rnai potency prediction method

Publications (1)

Publication Number Publication Date
AU2004298527A1 true AU2004298527A1 (en) 2005-06-30

Family

ID=34699869

Family Applications (1)

Application Number Title Priority Date Filing Date
AU2004298527A Abandoned AU2004298527A1 (en) 2003-12-10 2004-12-09 RNAi potency prediction method

Country Status (9)

Country Link
EP (1) EP1694843A1 (en)
JP (1) JP2007523632A (en)
KR (1) KR20060126499A (en)
CN (1) CN1890370A (en)
AU (1) AU2004298527A1 (en)
BR (1) BRPI0417489A (en)
CA (1) CA2545675A1 (en)
RU (1) RU2006124515A (en)
WO (1) WO2005059132A1 (en)

Families Citing this family (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE102006016365B4 (en) * 2006-04-05 2015-08-13 Qiagen Gmbh RNA interference tags
WO2008071771A2 (en) * 2006-12-15 2008-06-19 Novartis Ag Inhibition of gpr4
WO2008122314A1 (en) * 2007-04-10 2008-10-16 Qiagen Gmbh Rna interference tags
EP2260102A1 (en) 2008-03-25 2010-12-15 Novartis Forschungsstiftung, Zweigniederlassung Friedrich Miescher Institute For Biomedical Research Treating cancer by down-regulating frizzled-4 and/or frizzled-1
WO2010100247A1 (en) 2009-03-06 2010-09-10 Novartis Forschungsstiftung, Zweigniederlassung, Friedrich Miescher Institute For Biomedical Research Novel therapy for anxiety
EP2241323A1 (en) 2009-04-14 2010-10-20 Novartis Forschungsstiftung, Zweigniederlassung Friedrich Miescher Institute For Biomedical Research Tenascin-W and brain cancers
CN101625732B (en) * 2009-08-03 2011-11-30 杭州电子科技大学 Forecasting method of water level of potamic tidewater
EP2292266A1 (en) 2009-08-27 2011-03-09 Novartis Forschungsstiftung, Zweigniederlassung Treating cancer by modulating copine III
EP2480573A1 (en) 2009-09-22 2012-08-01 Novartis Forschungsstiftung, Zweigniederlassung Friedrich Miescher Institute For Biomedical Research Treating cancer by modulating mex-3
WO2011045352A2 (en) 2009-10-15 2011-04-21 Novartis Forschungsstiftung Spleen tyrosine kinase and brain cancers
EP2542578A1 (en) 2010-03-05 2013-01-09 Novartis Forschungsstiftung, Zweigniederlassung Friedrich Miescher Institute For Biomedical Research Smoc1, tenascin-c and brain cancers
WO2011131611A1 (en) 2010-04-19 2011-10-27 Novartis Forschungsstiftung, Zweigniederlassung Friedrich Miescher Institute For Biomedical Research Modulating xrn1
US20130089538A1 (en) 2010-06-10 2013-04-11 Novartis Forschungsstiftung, Zweigniederlassung Friedrich Miescher Institute forBiomedical Researh Treating cancer by modulating mammalian sterile 20-like kinase 3
WO2012168259A1 (en) 2011-06-06 2012-12-13 Novartis Forschungsstiftung, Zweigniederlassung Protein tyrosine phosphatase, non-receptor type 11 (ptpn11) and triple-negative breast cancer
EP2776022A1 (en) 2011-11-08 2014-09-17 Novartis Forschungsstiftung, Zweigniederlassung Friedrich Miescher Institute For Biomedical Research New treatment for neurodegenerative diseases
KR20140091695A (en) 2011-11-15 2014-07-22 노파르티스 아게 Combination of a phosphoinositide 3-kinase inhibitor and a modulator of the janus kinase 2-signal transducer and activator of transcription 5 pathway
EP2800583A1 (en) 2012-01-02 2014-11-12 Novartis AG Cdcp1 and breast cancer
WO2013144240A1 (en) 2012-03-29 2013-10-03 Friedrich Miescher Institute For Biomedical Research Inhibition of interleukin- 8 and/or its receptor cxcrl in the treatment her2/her3 -overexpressing breast cancer
WO2014001482A1 (en) 2012-06-29 2014-01-03 Novartis Forschungsstiftung, Zweigniererlassung, Friedrich Miescher Institute For Biomedical Research Treating diseases by modulating a specific isoform of mkl1
EP2870242A1 (en) 2012-07-05 2015-05-13 Novartis Forschungsstiftung, Zweigniederlassung Friedrich Miescher Institute For Biomedical Research New treatment for neurodegenerative diseases
WO2014006115A1 (en) 2012-07-06 2014-01-09 Novartis Ag Combination of a phosphoinositide 3-kinase inhibitor and an inhibitor of the il-8/cxcr interaction
US20160178610A1 (en) 2013-08-07 2016-06-23 Friedrich Miescher Institute For Biomedical Research New screening method for the treatment Friedreich's ataxia
WO2015189816A1 (en) 2014-06-13 2015-12-17 Friedrich Miescher Institute For Biomedical Research New treatment against influenza virus
US20170165261A1 (en) 2014-07-01 2017-06-15 Brian Arthur Hemmings Combination of a brafv600e inhibitor and mertk inhibitor to treat melanoma
WO2016046768A1 (en) 2014-09-24 2016-03-31 Friedrich Miescher Institute For Biomedical Research Lats and breast cancer
US20180348224A1 (en) 2015-10-28 2018-12-06 Friedrich Miescher Institute For Biomedical Resear Ch Tenascin-w and biliary tract cancers
CN109360604B (en) * 2018-11-21 2021-09-24 南昌大学 Ovarian cancer molecular typing prediction system

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB0212302D0 (en) * 2002-05-28 2002-07-10 Isis Innovation Method of selecting targets for gene silencing by RNA interference

Also Published As

Publication number Publication date
CN1890370A (en) 2007-01-03
JP2007523632A (en) 2007-08-23
BRPI0417489A (en) 2007-05-22
KR20060126499A (en) 2006-12-07
CA2545675A1 (en) 2005-06-30
WO2005059132A1 (en) 2005-06-30
EP1694843A1 (en) 2006-08-30
RU2006124515A (en) 2008-01-20

Similar Documents

Publication Publication Date Title
AU2004298527A1 (en) RNAi potency prediction method
Elbashir et al. Analysis of gene function in somatic mammalian cells using small interfering RNAs
Paddison et al. Short hairpin activated gene silencing in mammalian cells
Chang et al. Lessons from Nature: microRNA-based shRNA libraries
Suzuki et al. MCPIP1 ribonuclease antagonizes dicer and terminates microRNA biogenesis through precursor microRNA degradation
Shirane et al. Enzymatic production of RNAi libraries from cDNAs
Salmanidis et al. Direct transcriptional regulation by nuclear microRNAs
Dai et al. RNAi-induced targeted silencing of developmental control genes during chicken embryogenesis
Falschlehner et al. High‐throughput RNAi screening to dissect cellular pathways: A how‐to guide
DE69933856D1 (en) USE OF RIBOZYMES FOR DETERMINING THE FUNCTION OF GENES
Taylor et al. MBNL splicing activity depends on RNA binding site structural context
EP1693450A1 (en) METHOD OF EVALUATING RNAi ACTIVITY AND miRNA ACTIVITY
Akashi et al. Gene discovery by ribozyme and siRNA libraries
Miyagishi et al. Generation of an shRNAi expression library against the whole human transcripts
JP2011004763A (en) Method and composition for rna interference
Scherer et al. Optimization and characterization of tRNA-shRNA expression constructs
WO2007147067A2 (en) Methods and compositions for regulating cell cycle progression
EP2235180B1 (en) Apoptosis inducing positive control for expression modulating experiments
WO2005038054A1 (en) METHOD OF MEASURING THE EFFICACY OF siRNA MOLECULES
CA2587719A1 (en) Method of screening compound regulating the translation of specific mrna
Muller et al. An Efficient method for electroporation of small interfering RNAs into ENCODE project tier 1 GM12878 and K562 cell lines
JP2007082436A (en) METHOD FOR PREDICTING OR IDENTIFYING TARGET mRNA CONTROLLED BY FUNCTIONAL RNA, AND APPLICATION THEREOF
Yau et al. A cellular high-throughput screening approach for therapeutic trans-cleaving ribozymes and RNAi against arbitrary mRNA disease targets
Liang et al. A gene expression restriction network mediated by sense and antisense Alu sequences located on protein-coding messenger RNAs
Strat et al. Specific and nontoxic silencing in mammalian cells with expressed long dsRNAs

Legal Events

Date Code Title Description
MK5 Application lapsed section 142(2)(e) - patent request and compl. specification not accepted