WO2024124041A1 - Multiplexed detection of nucleic acid targets - Google Patents
Multiplexed detection of nucleic acid targets Download PDFInfo
- Publication number
- WO2024124041A1 WO2024124041A1 PCT/US2023/082958 US2023082958W WO2024124041A1 WO 2024124041 A1 WO2024124041 A1 WO 2024124041A1 US 2023082958 W US2023082958 W US 2023082958W WO 2024124041 A1 WO2024124041 A1 WO 2024124041A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- target
- amplifier
- nucleic acid
- sgcs
- detectable labels
- Prior art date
Links
- 150000007523 nucleic acids Chemical class 0.000 title claims abstract description 286
- 102000039446 nucleic acids Human genes 0.000 title claims abstract description 280
- 108020004707 nucleic acids Proteins 0.000 title claims abstract description 280
- 238000001514 detection method Methods 0.000 title abstract description 84
- 239000000523 sample Substances 0.000 claims abstract description 711
- 238000000034 method Methods 0.000 claims abstract description 219
- 238000009396 hybridization Methods 0.000 claims abstract description 35
- 238000009739 binding Methods 0.000 claims description 217
- 230000027455 binding Effects 0.000 claims description 213
- 230000000295 complement effect Effects 0.000 claims description 60
- 239000012445 acidic reagent Substances 0.000 claims description 59
- 108090000623 proteins and genes Proteins 0.000 claims description 46
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 claims description 44
- 230000003321 amplification Effects 0.000 claims description 42
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 42
- 239000003153 chemical reaction reagent Substances 0.000 claims description 20
- 102000004169 proteins and genes Human genes 0.000 claims description 17
- 235000011054 acetic acid Nutrition 0.000 claims description 14
- 238000003776 cleavage reaction Methods 0.000 claims description 13
- 238000011282 treatment Methods 0.000 claims description 13
- MUBZPKHOEPUJKR-UHFFFAOYSA-N Oxalic acid Chemical compound OC(=O)C(O)=O MUBZPKHOEPUJKR-UHFFFAOYSA-N 0.000 claims description 12
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 claims description 12
- 230000007017 scission Effects 0.000 claims description 12
- FERIUCNNQQJTOY-UHFFFAOYSA-N Butyric acid Chemical compound CCCC(O)=O FERIUCNNQQJTOY-UHFFFAOYSA-N 0.000 claims description 8
- OFOBLEOULBTSOW-UHFFFAOYSA-N Malonic acid Chemical compound OC(=O)CC(O)=O OFOBLEOULBTSOW-UHFFFAOYSA-N 0.000 claims description 8
- XBDQKXXYIPTUBI-UHFFFAOYSA-N dimethylselenoniopropionate Natural products CCC(O)=O XBDQKXXYIPTUBI-UHFFFAOYSA-N 0.000 claims description 8
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 claims description 8
- NQPDZGIKBAWPEJ-UHFFFAOYSA-N valeric acid Chemical compound CCCCC(O)=O NQPDZGIKBAWPEJ-UHFFFAOYSA-N 0.000 claims description 8
- BJEPYKJPYRNKOW-REOHCLBHSA-N (S)-malic acid Chemical compound OC(=O)[C@@H](O)CC(O)=O BJEPYKJPYRNKOW-REOHCLBHSA-N 0.000 claims description 4
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 claims description 4
- FEWJPZIEWOKRBE-JCYAYHJZSA-N Dextrotartaric acid Chemical compound OC(=O)[C@H](O)[C@@H](O)C(O)=O FEWJPZIEWOKRBE-JCYAYHJZSA-N 0.000 claims description 4
- KDYFGRWQOYBRFD-UHFFFAOYSA-N Succinic acid Natural products OC(=O)CCC(O)=O KDYFGRWQOYBRFD-UHFFFAOYSA-N 0.000 claims description 4
- FEWJPZIEWOKRBE-UHFFFAOYSA-N Tartaric acid Natural products [H+].[H+].[O-]C(=O)C(O)C(O)C([O-])=O FEWJPZIEWOKRBE-UHFFFAOYSA-N 0.000 claims description 4
- BJEPYKJPYRNKOW-UHFFFAOYSA-N alpha-hydroxysuccinic acid Natural products OC(=O)C(O)CC(O)=O BJEPYKJPYRNKOW-UHFFFAOYSA-N 0.000 claims description 4
- KDYFGRWQOYBRFD-NUQCWPJISA-N butanedioic acid Chemical compound O[14C](=O)CC[14C](O)=O KDYFGRWQOYBRFD-NUQCWPJISA-N 0.000 claims description 4
- 235000015165 citric acid Nutrition 0.000 claims description 4
- 235000019253 formic acid Nutrition 0.000 claims description 4
- 239000001630 malic acid Substances 0.000 claims description 4
- 235000011090 malic acid Nutrition 0.000 claims description 4
- 235000006408 oxalic acid Nutrition 0.000 claims description 4
- 235000019260 propionic acid Nutrition 0.000 claims description 4
- IUVKMZGDUIUOCP-BTNSXGMBSA-N quinbolone Chemical compound O([C@H]1CC[C@H]2[C@H]3[C@@H]([C@]4(C=CC(=O)C=C4CC3)C)CC[C@@]21C)C1=CCCC1 IUVKMZGDUIUOCP-BTNSXGMBSA-N 0.000 claims description 4
- 239000011975 tartaric acid Substances 0.000 claims description 4
- 235000002906 tartaric acid Nutrition 0.000 claims description 4
- 229940005605 valeric acid Drugs 0.000 claims description 4
- 239000000203 mixture Substances 0.000 abstract description 26
- 239000012472 biological sample Substances 0.000 abstract description 11
- 238000010195 expression analysis Methods 0.000 abstract description 2
- 210000004027 cell Anatomy 0.000 description 74
- 210000001519 tissue Anatomy 0.000 description 54
- 238000003556 assay Methods 0.000 description 50
- 238000002372 labelling Methods 0.000 description 47
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 42
- 230000000875 corresponding effect Effects 0.000 description 31
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 26
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 26
- 210000004556 brain Anatomy 0.000 description 26
- -1 i.e. Substances 0.000 description 26
- 239000002253 acid Substances 0.000 description 25
- 102000040430 polynucleotide Human genes 0.000 description 25
- 108091033319 polynucleotide Proteins 0.000 description 25
- 239000002157 polynucleotide Substances 0.000 description 25
- 125000003729 nucleotide group Chemical group 0.000 description 20
- 239000000758 substrate Substances 0.000 description 20
- 102000004190 Enzymes Human genes 0.000 description 19
- 108090000790 Enzymes Proteins 0.000 description 19
- 229940088598 enzyme Drugs 0.000 description 19
- 238000002474 experimental method Methods 0.000 description 19
- 238000007901 in situ hybridization Methods 0.000 description 18
- 108020004414 DNA Proteins 0.000 description 17
- 210000004369 blood Anatomy 0.000 description 17
- 239000008280 blood Substances 0.000 description 17
- 239000002773 nucleotide Substances 0.000 description 16
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 15
- 239000007850 fluorescent dye Substances 0.000 description 15
- 230000014509 gene expression Effects 0.000 description 15
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 14
- 229960000583 acetic acid Drugs 0.000 description 14
- 239000012491 analyte Substances 0.000 description 14
- 238000003384 imaging method Methods 0.000 description 13
- 108020004999 messenger RNA Proteins 0.000 description 13
- 238000013459 approach Methods 0.000 description 12
- 108091005804 Peptidases Proteins 0.000 description 11
- 102000035195 Peptidases Human genes 0.000 description 11
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 10
- 230000005284 excitation Effects 0.000 description 10
- 125000005647 linker group Chemical group 0.000 description 10
- 239000000463 material Substances 0.000 description 10
- FWBHETKCLVMNFS-UHFFFAOYSA-N 4',6-Diamino-2-phenylindol Chemical compound C1=CC(C(=N)N)=CC=C1C1=CC2=CC=C(C(N)=N)C=C2N1 FWBHETKCLVMNFS-UHFFFAOYSA-N 0.000 description 9
- 239000004365 Protease Substances 0.000 description 9
- 239000003795 chemical substances by application Substances 0.000 description 9
- 238000011534 incubation Methods 0.000 description 9
- 238000005406 washing Methods 0.000 description 9
- 230000002380 cytological effect Effects 0.000 description 8
- 125000001153 fluoro group Chemical group F* 0.000 description 8
- 238000011065 in-situ storage Methods 0.000 description 8
- 239000011780 sodium chloride Substances 0.000 description 8
- 101100371340 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) TYE7 gene Proteins 0.000 description 7
- 230000008901 benefit Effects 0.000 description 7
- 230000000903 blocking effect Effects 0.000 description 7
- 238000012545 processing Methods 0.000 description 7
- 108091032955 Bacterial small RNA Proteins 0.000 description 6
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 6
- MHAJPDPJQMAIIY-UHFFFAOYSA-N Hydrogen peroxide Chemical compound OO MHAJPDPJQMAIIY-UHFFFAOYSA-N 0.000 description 6
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 6
- 108091034117 Oligonucleotide Proteins 0.000 description 6
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 6
- 238000001574 biopsy Methods 0.000 description 6
- 108091070501 miRNA Proteins 0.000 description 6
- 239000002679 microRNA Substances 0.000 description 6
- 235000019419 proteases Nutrition 0.000 description 6
- 101150047761 sdhA gene Proteins 0.000 description 6
- 239000000126 substance Substances 0.000 description 6
- PZBFGYYEXUXCOF-UHFFFAOYSA-N TCEP Chemical compound OC(=O)CCP(CCC(O)=O)CCC(O)=O PZBFGYYEXUXCOF-UHFFFAOYSA-N 0.000 description 5
- 150000007513 acids Chemical class 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 239000000090 biomarker Substances 0.000 description 5
- 239000000872 buffer Substances 0.000 description 5
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 230000009467 reduction Effects 0.000 description 5
- 108020004418 ribosomal RNA Proteins 0.000 description 5
- ZAINTDRBUHCDPZ-UHFFFAOYSA-M Alexa Fluor 546 Chemical compound [H+].[Na+].CC1CC(C)(C)NC(C(=C2OC3=C(C4=NC(C)(C)CC(C)C4=CC3=3)S([O-])(=O)=O)S([O-])(=O)=O)=C1C=C2C=3C(C(=C(Cl)C=1Cl)C(O)=O)=C(Cl)C=1SCC(=O)NCCCCCC(=O)ON1C(=O)CCC1=O ZAINTDRBUHCDPZ-UHFFFAOYSA-M 0.000 description 4
- 238000011740 C57BL/6 mouse Methods 0.000 description 4
- 102000053602 DNA Human genes 0.000 description 4
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 4
- 101000611202 Homo sapiens Peptidyl-prolyl cis-trans isomerase B Proteins 0.000 description 4
- 206010028980 Neoplasm Diseases 0.000 description 4
- 108091028043 Nucleic acid sequence Proteins 0.000 description 4
- 238000002509 fluorescent in situ hybridization Methods 0.000 description 4
- 238000002844 melting Methods 0.000 description 4
- 230000008018 melting Effects 0.000 description 4
- 229910052751 metal Inorganic materials 0.000 description 4
- 239000002184 metal Substances 0.000 description 4
- 239000003068 molecular probe Substances 0.000 description 4
- 210000000056 organ Anatomy 0.000 description 4
- 244000052769 pathogen Species 0.000 description 4
- 239000001488 sodium phosphate Substances 0.000 description 4
- 229910000162 sodium phosphate Inorganic materials 0.000 description 4
- 235000011008 sodium phosphates Nutrition 0.000 description 4
- 238000002560 therapeutic procedure Methods 0.000 description 4
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 4
- BCHIXGBGRHLSBE-UHFFFAOYSA-N (4-methyl-2-oxochromen-7-yl) dihydrogen phosphate Chemical compound C1=C(OP(O)(O)=O)C=CC2=C1OC(=O)C=C2C BCHIXGBGRHLSBE-UHFFFAOYSA-N 0.000 description 3
- OXEUETBFKVCRNP-UHFFFAOYSA-N 9-ethyl-3-carbazolamine Chemical compound NC1=CC=C2N(CC)C3=CC=CC=C3C2=C1 OXEUETBFKVCRNP-UHFFFAOYSA-N 0.000 description 3
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 3
- 239000012103 Alexa Fluor 488 Substances 0.000 description 3
- 239000012118 Alexa Fluor 750 Substances 0.000 description 3
- AVXURJPOCDRRFD-UHFFFAOYSA-N Hydroxylamine Chemical compound ON AVXURJPOCDRRFD-UHFFFAOYSA-N 0.000 description 3
- 102000057297 Pepsin A Human genes 0.000 description 3
- 108090000284 Pepsin A Proteins 0.000 description 3
- 102100040283 Peptidyl-prolyl cis-trans isomerase B Human genes 0.000 description 3
- KWYUFKZDYYNOTN-UHFFFAOYSA-M Potassium hydroxide Chemical compound [OH-].[K+] KWYUFKZDYYNOTN-UHFFFAOYSA-M 0.000 description 3
- 210000004958 brain cell Anatomy 0.000 description 3
- 210000005013 brain tissue Anatomy 0.000 description 3
- 238000004422 calculation algorithm Methods 0.000 description 3
- 201000011510 cancer Diseases 0.000 description 3
- 238000004113 cell culture Methods 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000004040 coloring Methods 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 239000003599 detergent Substances 0.000 description 3
- 230000029087 digestion Effects 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 238000009826 distribution Methods 0.000 description 3
- GRWZHXKQBITJKP-UHFFFAOYSA-L dithionite(2-) Chemical compound [O-]S(=O)S([O-])=O GRWZHXKQBITJKP-UHFFFAOYSA-L 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 239000003814 drug Substances 0.000 description 3
- 239000000975 dye Substances 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 238000000684 flow cytometry Methods 0.000 description 3
- 239000011521 glass Substances 0.000 description 3
- 238000010438 heat treatment Methods 0.000 description 3
- 239000000543 intermediate Substances 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 108091064355 mitochondrial RNA Proteins 0.000 description 3
- 238000002156 mixing Methods 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 239000002777 nucleoside Substances 0.000 description 3
- 230000001717 pathogenic effect Effects 0.000 description 3
- 229940111202 pepsin Drugs 0.000 description 3
- KHIWWQKSHDUIBK-UHFFFAOYSA-N periodic acid Chemical compound OI(=O)(=O)=O KHIWWQKSHDUIBK-UHFFFAOYSA-N 0.000 description 3
- 230000008823 permeabilization Effects 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 108090000765 processed proteins & peptides Proteins 0.000 description 3
- 239000000047 product Substances 0.000 description 3
- 229910052761 rare earth metal Inorganic materials 0.000 description 3
- 150000002910 rare earth metals Chemical class 0.000 description 3
- PYWVYCXTNDRMGF-UHFFFAOYSA-N rhodamine B Chemical class [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 235000000346 sugar Nutrition 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- QEQDLKUMPUDNPG-UHFFFAOYSA-N 2-(7-amino-4-methyl-2-oxochromen-3-yl)acetic acid Chemical compound C1=C(N)C=CC2=C1OC(=O)C(CC(O)=O)=C2C QEQDLKUMPUDNPG-UHFFFAOYSA-N 0.000 description 2
- IOOMXAQUNPWDLL-UHFFFAOYSA-N 2-[6-(diethylamino)-3-(diethyliminiumyl)-3h-xanthen-9-yl]-5-sulfobenzene-1-sulfonate Chemical compound C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=C(S(O)(=O)=O)C=C1S([O-])(=O)=O IOOMXAQUNPWDLL-UHFFFAOYSA-N 0.000 description 2
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 2
- JLDSMZIBHYTPPR-UHFFFAOYSA-N Alexa Fluor 405 Chemical compound CC[NH+](CC)CC.CC[NH+](CC)CC.CC[NH+](CC)CC.C12=C3C=4C=CC2=C(S([O-])(=O)=O)C=C(S([O-])(=O)=O)C1=CC=C3C(S(=O)(=O)[O-])=CC=4OCC(=O)N(CC1)CCC1C(=O)ON1C(=O)CCC1=O JLDSMZIBHYTPPR-UHFFFAOYSA-N 0.000 description 2
- WEJVZSAYICGDCK-UHFFFAOYSA-N Alexa Fluor 430 Chemical compound CC[NH+](CC)CC.CC1(C)C=C(CS([O-])(=O)=O)C2=CC=3C(C(F)(F)F)=CC(=O)OC=3C=C2N1CCCCCC(=O)ON1C(=O)CCC1=O WEJVZSAYICGDCK-UHFFFAOYSA-N 0.000 description 2
- WHVNXSBKJGAXKU-UHFFFAOYSA-N Alexa Fluor 532 Chemical compound [H+].[H+].CC1(C)C(C)NC(C(=C2OC3=C(C=4C(C(C(C)N=4)(C)C)=CC3=3)S([O-])(=O)=O)S([O-])(=O)=O)=C1C=C2C=3C(C=C1)=CC=C1C(=O)ON1C(=O)CCC1=O WHVNXSBKJGAXKU-UHFFFAOYSA-N 0.000 description 2
- IGAZHQIYONOHQN-UHFFFAOYSA-N Alexa Fluor 555 Chemical compound C=12C=CC(=N)C(S(O)(=O)=O)=C2OC2=C(S(O)(=O)=O)C(N)=CC=C2C=1C1=CC=C(C(O)=O)C=C1C(O)=O IGAZHQIYONOHQN-UHFFFAOYSA-N 0.000 description 2
- 239000012110 Alexa Fluor 594 Substances 0.000 description 2
- 239000012114 Alexa Fluor 647 Substances 0.000 description 2
- 239000012117 Alexa Fluor 700 Substances 0.000 description 2
- 208000005443 Circulating Neoplastic Cells Diseases 0.000 description 2
- 102100032218 Cytokine-inducible SH2-containing protein Human genes 0.000 description 2
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 2
- 108010067770 Endopeptidase K Proteins 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- 239000004366 Glucose oxidase Substances 0.000 description 2
- 108010015776 Glucose oxidase Proteins 0.000 description 2
- 101000943420 Homo sapiens Cytokine-inducible SH2-containing protein Proteins 0.000 description 2
- 108090001060 Lipase Proteins 0.000 description 2
- 102000004882 Lipase Human genes 0.000 description 2
- 239000004367 Lipase Substances 0.000 description 2
- WMFOQBRAJBCJND-UHFFFAOYSA-M Lithium hydroxide Chemical compound [Li+].[OH-] WMFOQBRAJBCJND-UHFFFAOYSA-M 0.000 description 2
- 108010053229 Lysyl endopeptidase Proteins 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- 108700011259 MicroRNAs Proteins 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- 102000003992 Peroxidases Human genes 0.000 description 2
- 108091028664 Ribonucleotide Proteins 0.000 description 2
- 108020004682 Single-Stranded DNA Proteins 0.000 description 2
- 229920004890 Triton X-100 Polymers 0.000 description 2
- 239000013504 Triton X-100 Substances 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- 102000006943 Uracil-DNA Glycosidase Human genes 0.000 description 2
- 108010072685 Uracil-DNA Glycosidase Proteins 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 150000001298 alcohols Chemical class 0.000 description 2
- FOYVTVSSAMSORJ-UHFFFAOYSA-N atto 655 Chemical compound OC(=O)CCCN1C(C)(C)CC(CS([O-])(=O)=O)C2=C1C=C1OC3=CC4=[N+](CC)CCCC4=CC3=NC1=C2 FOYVTVSSAMSORJ-UHFFFAOYSA-N 0.000 description 2
- RIIWUGSYXOBDMC-UHFFFAOYSA-N benzene-1,2-diamine;hydron;dichloride Chemical compound Cl.Cl.NC1=CC=CC=C1N RIIWUGSYXOBDMC-UHFFFAOYSA-N 0.000 description 2
- 239000013060 biological fluid Substances 0.000 description 2
- 239000002981 blocking agent Substances 0.000 description 2
- 238000012512 characterization method Methods 0.000 description 2
- 238000003508 chemical denaturation Methods 0.000 description 2
- 239000003638 chemical reducing agent Substances 0.000 description 2
- 239000003593 chromogenic compound Substances 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 238000004163 cytometry Methods 0.000 description 2
- 230000018044 dehydration Effects 0.000 description 2
- 238000006297 dehydration reaction Methods 0.000 description 2
- 239000005547 deoxyribonucleotide Substances 0.000 description 2
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 208000035475 disorder Diseases 0.000 description 2
- 230000002708 enhancing effect Effects 0.000 description 2
- 210000003743 erythrocyte Anatomy 0.000 description 2
- 239000000834 fixative Substances 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 2
- 229940116332 glucose oxidase Drugs 0.000 description 2
- 235000019420 glucose oxidase Nutrition 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 238000003505 heat denaturation Methods 0.000 description 2
- 125000000623 heterocyclic group Chemical group 0.000 description 2
- 238000010191 image analysis Methods 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 235000019421 lipase Nutrition 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 238000000386 microscopy Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- WIXAQXKQLNRFDF-UHFFFAOYSA-N n-[4-[7-(diethylamino)-4-methyl-2-oxochromen-3-yl]phenyl]-2-iodoacetamide Chemical compound O=C1OC2=CC(N(CC)CC)=CC=C2C(C)=C1C1=CC=C(NC(=O)CI)C=C1 WIXAQXKQLNRFDF-UHFFFAOYSA-N 0.000 description 2
- 230000007935 neutral effect Effects 0.000 description 2
- JPXMTWWFLBLUCD-UHFFFAOYSA-N nitro blue tetrazolium(2+) Chemical compound COC1=CC(C=2C=C(OC)C(=CC=2)[N+]=2N(N=C(N=2)C=2C=CC=CC=2)C=2C=CC(=CC=2)[N+]([O-])=O)=CC=C1[N+]1=NC(C=2C=CC=CC=2)=NN1C1=CC=C([N+]([O-])=O)C=C1 JPXMTWWFLBLUCD-UHFFFAOYSA-N 0.000 description 2
- 108091027963 non-coding RNA Proteins 0.000 description 2
- 102000042567 non-coding RNA Human genes 0.000 description 2
- 150000003833 nucleoside derivatives Chemical class 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 210000003463 organelle Anatomy 0.000 description 2
- 239000005022 packaging material Substances 0.000 description 2
- 239000012188 paraffin wax Substances 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 108040007629 peroxidase activity proteins Proteins 0.000 description 2
- KJFMBFZCATUALV-UHFFFAOYSA-N phenolphthalein Chemical compound C1=CC(O)=CC=C1C1(C=2C=CC(O)=CC=2)C2=CC=CC=C2C(=O)O1 KJFMBFZCATUALV-UHFFFAOYSA-N 0.000 description 2
- 239000010452 phosphate Substances 0.000 description 2
- 238000011176 pooling Methods 0.000 description 2
- 235000019833 protease Nutrition 0.000 description 2
- 230000006337 proteolytic cleavage Effects 0.000 description 2
- 150000003212 purines Chemical class 0.000 description 2
- 150000003220 pyrenes Chemical class 0.000 description 2
- 150000003230 pyrimidines Chemical class 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 239000002336 ribonucleotide Substances 0.000 description 2
- 125000002652 ribonucleotide group Chemical group 0.000 description 2
- 239000001509 sodium citrate Substances 0.000 description 2
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 238000004611 spectroscopical analysis Methods 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 230000002123 temporal effect Effects 0.000 description 2
- MPLHNVLQVRSVEE-UHFFFAOYSA-N texas red Chemical compound [O-]S(=O)(=O)C1=CC(S(Cl)(=O)=O)=CC=C1C(C1=CC=2CCCN3CCCC(C=23)=C1O1)=C2C1=C(CCC1)C3=[N+]1CCCC3=C2 MPLHNVLQVRSVEE-UHFFFAOYSA-N 0.000 description 2
- 238000001269 time-of-flight mass spectrometry Methods 0.000 description 2
- DZGWFCGJZKJUFP-UHFFFAOYSA-N tyramine Chemical compound NCCC1=CC=C(O)C=C1 DZGWFCGJZKJUFP-UHFFFAOYSA-N 0.000 description 2
- YPFDHNVEDLHUCE-UHFFFAOYSA-N 1,3-propanediol Substances OCCCO YPFDHNVEDLHUCE-UHFFFAOYSA-N 0.000 description 1
- 229940035437 1,3-propanediol Drugs 0.000 description 1
- JNGRENQDBKMCCR-UHFFFAOYSA-N 2-(3-amino-6-iminoxanthen-9-yl)benzoic acid;hydrochloride Chemical compound [Cl-].C=12C=CC(=[NH2+])C=C2OC2=CC(N)=CC=C2C=1C1=CC=CC=C1C(O)=O JNGRENQDBKMCCR-UHFFFAOYSA-N 0.000 description 1
- WZIMSXIXZTUBSO-UHFFFAOYSA-N 2-[[bis(carboxymethyl)amino]methyl-(carboxymethyl)amino]acetic acid Chemical compound OC(=O)CN(CC(O)=O)CN(CC(O)=O)CC(O)=O WZIMSXIXZTUBSO-UHFFFAOYSA-N 0.000 description 1
- JRYMOPZHXMVHTA-DAGMQNCNSA-N 2-amino-7-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-1h-pyrrolo[2,3-d]pyrimidin-4-one Chemical compound C1=CC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O JRYMOPZHXMVHTA-DAGMQNCNSA-N 0.000 description 1
- BNBQQYFXBLBYJK-UHFFFAOYSA-N 2-pyridin-2-yl-1,3-oxazole Chemical class C1=COC(C=2N=CC=CC=2)=N1 BNBQQYFXBLBYJK-UHFFFAOYSA-N 0.000 description 1
- UAIUNKRWKOVEES-UHFFFAOYSA-N 3,3',5,5'-tetramethylbenzidine Chemical compound CC1=C(N)C(C)=CC(C=2C=C(C)C(N)=C(C)C=2)=C1 UAIUNKRWKOVEES-UHFFFAOYSA-N 0.000 description 1
- HSTOKWSFWGCZMH-UHFFFAOYSA-N 3,3'-diaminobenzidine Chemical compound C1=C(N)C(N)=CC=C1C1=CC=C(N)C(N)=C1 HSTOKWSFWGCZMH-UHFFFAOYSA-N 0.000 description 1
- ZTOJFFHGPLIVKC-UHFFFAOYSA-N 3-ethyl-2-[(3-ethyl-6-sulfo-1,3-benzothiazol-2-ylidene)hydrazinylidene]-1,3-benzothiazole-6-sulfonic acid Chemical compound S1C2=CC(S(O)(=O)=O)=CC=C2N(CC)C1=NN=C1SC2=CC(S(O)(=O)=O)=CC=C2N1CC ZTOJFFHGPLIVKC-UHFFFAOYSA-N 0.000 description 1
- 108010091324 3C proteases Proteins 0.000 description 1
- CXNVOWPRHWWCQR-UHFFFAOYSA-N 4-Chloro-ortho-toluidine Chemical compound CC1=CC(Cl)=CC=C1N CXNVOWPRHWWCQR-UHFFFAOYSA-N 0.000 description 1
- PSGQCCSGKGJLRL-UHFFFAOYSA-N 4-methyl-2h-chromen-2-one Chemical group C1=CC=CC2=C1OC(=O)C=C2C PSGQCCSGKGJLRL-UHFFFAOYSA-N 0.000 description 1
- YUDPTGPSBJVHCN-CHUNWDLHSA-N 4-methylumbelliferyl alpha-D-galactoside Chemical compound C1=CC=2C(C)=CC(=O)OC=2C=C1O[C@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O YUDPTGPSBJVHCN-CHUNWDLHSA-N 0.000 description 1
- XZKIHKMTEMTJQX-UHFFFAOYSA-L 4-nitrophenyl phosphate(2-) Chemical compound [O-][N+](=O)C1=CC=C(OP([O-])([O-])=O)C=C1 XZKIHKMTEMTJQX-UHFFFAOYSA-L 0.000 description 1
- WNDDWSAHNYBXKY-UHFFFAOYSA-N ATTO 425-2 Chemical compound CC1CC(C)(C)N(CCCC(O)=O)C2=C1C=C1C=C(C(=O)OCC)C(=O)OC1=C2 WNDDWSAHNYBXKY-UHFFFAOYSA-N 0.000 description 1
- YIXZUOWWYKISPQ-UHFFFAOYSA-N ATTO 565 para-isomer Chemical compound [O-]Cl(=O)(=O)=O.C=12C=C3CCC[N+](CC)=C3C=C2OC=2C=C3N(CC)CCCC3=CC=2C=1C1=CC(C(O)=O)=CC=C1C(O)=O YIXZUOWWYKISPQ-UHFFFAOYSA-N 0.000 description 1
- PWZJEXGKUHVUFP-UHFFFAOYSA-N ATTO 590 meta-isomer Chemical compound [O-]Cl(=O)(=O)=O.C1=2C=C3C(C)=CC(C)(C)N(CC)C3=CC=2OC2=CC3=[N+](CC)C(C)(C)C=C(C)C3=CC2=C1C1=CC=C(C(O)=O)C=C1C(O)=O PWZJEXGKUHVUFP-UHFFFAOYSA-N 0.000 description 1
- SLQQGEVQWLDVDF-UHFFFAOYSA-N ATTO 610-2 Chemical compound [O-]Cl(=O)(=O)=O.C1=C2CCC[N+](CCCC(O)=O)=C2C=C2C1=CC1=CC=C(N(C)C)C=C1C2(C)C SLQQGEVQWLDVDF-UHFFFAOYSA-N 0.000 description 1
- 239000012109 Alexa Fluor 568 Substances 0.000 description 1
- 239000012111 Alexa Fluor 610 Substances 0.000 description 1
- 239000012112 Alexa Fluor 633 Substances 0.000 description 1
- 239000012113 Alexa Fluor 635 Substances 0.000 description 1
- 239000012115 Alexa Fluor 660 Substances 0.000 description 1
- 239000012116 Alexa Fluor 680 Substances 0.000 description 1
- 239000012099 Alexa Fluor family Substances 0.000 description 1
- 239000004382 Amylase Substances 0.000 description 1
- 108010065511 Amylases Proteins 0.000 description 1
- 102000013142 Amylases Human genes 0.000 description 1
- 101710170230 Antimicrobial peptide 1 Proteins 0.000 description 1
- 101710170231 Antimicrobial peptide 2 Proteins 0.000 description 1
- 108050001427 Avidin/streptavidin Proteins 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- HIHVCASFAJDRKI-UHFFFAOYSA-N C1=CC=C2C(O)=C(Cl)C=CC2=C1.C1=CC=C2C(O)=CCC(Cl)(C#N)C2=C1 Chemical compound C1=CC=C2C(O)=C(Cl)C=CC2=C1.C1=CC=C2C(O)=CCC(Cl)(C#N)C2=C1 HIHVCASFAJDRKI-UHFFFAOYSA-N 0.000 description 1
- 206010008263 Cervical dysplasia Diseases 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 108020004394 Complementary RNA Proteins 0.000 description 1
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 1
- QRLVDLBMBULFAL-UHFFFAOYSA-N Digitonin Natural products CC1CCC2(OC1)OC3C(O)C4C5CCC6CC(OC7OC(CO)C(OC8OC(CO)C(O)C(OC9OCC(O)C(O)C9OC%10OC(CO)C(O)C(OC%11OC(CO)C(O)C(O)C%11O)C%10O)C8O)C(O)C7O)C(O)CC6(C)C5CCC4(C)C3C2C QRLVDLBMBULFAL-UHFFFAOYSA-N 0.000 description 1
- 108010013369 Enteropeptidase Proteins 0.000 description 1
- 102100029727 Enteropeptidase Human genes 0.000 description 1
- 108090000371 Esterases Proteins 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 102000002464 Galactosidases Human genes 0.000 description 1
- 108010093031 Galactosidases Proteins 0.000 description 1
- 102000004366 Glucosidases Human genes 0.000 description 1
- 108010056771 Glucosidases Proteins 0.000 description 1
- SXRSQZLOMIGNAQ-UHFFFAOYSA-N Glutaraldehyde Chemical compound O=CCCCC=O SXRSQZLOMIGNAQ-UHFFFAOYSA-N 0.000 description 1
- 241000430519 Human rhinovirus sp. Species 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- 235000000391 Lepidium draba Nutrition 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- IXQIUDNVFVTQLJ-UHFFFAOYSA-N Naphthofluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C(C=CC=1C3=CC=C(O)C=1)=C3OC1=C2C=CC2=CC(O)=CC=C21 IXQIUDNVFVTQLJ-UHFFFAOYSA-N 0.000 description 1
- 102000005348 Neuraminidase Human genes 0.000 description 1
- 108010006232 Neuraminidase Proteins 0.000 description 1
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 1
- 108091005461 Nucleic proteins Proteins 0.000 description 1
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 1
- 108091093037 Peptide nucleic acid Proteins 0.000 description 1
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 1
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- 108020004459 Small interfering RNA Proteins 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 108010076818 TEV protease Proteins 0.000 description 1
- 108091046869 Telomeric non-coding RNA Proteins 0.000 description 1
- 108090000190 Thrombin Proteins 0.000 description 1
- 101150058395 US22 gene Proteins 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 150000001299 aldehydes Chemical class 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- WLDHEUZGFKACJH-UHFFFAOYSA-K amaranth Chemical compound [Na+].[Na+].[Na+].C12=CC=C(S([O-])(=O)=O)C=C2C=C(S([O-])(=O)=O)C(O)=C1N=NC1=CC=C(S([O-])(=O)=O)C2=CC=CC=C12 WLDHEUZGFKACJH-UHFFFAOYSA-K 0.000 description 1
- 150000001412 amines Chemical class 0.000 description 1
- 239000003708 ampul Substances 0.000 description 1
- 235000019418 amylase Nutrition 0.000 description 1
- 238000000149 argon plasma sintering Methods 0.000 description 1
- MHHMNDJIDRZZNT-UHFFFAOYSA-N atto 680 Chemical compound OC(=O)CCCN1C(C)(C)C=C(CS([O-])(=O)=O)C2=C1C=C1OC3=CC4=[N+](CC)CCCC4=CC3=NC1=C2 MHHMNDJIDRZZNT-UHFFFAOYSA-N 0.000 description 1
- 238000011888 autopsy Methods 0.000 description 1
- OHDRQQURAXLVGJ-HLVWOLMTSA-N azane;(2e)-3-ethyl-2-[(e)-(3-ethyl-6-sulfo-1,3-benzothiazol-2-ylidene)hydrazinylidene]-1,3-benzothiazole-6-sulfonic acid Chemical compound [NH4+].[NH4+].S/1C2=CC(S([O-])(=O)=O)=CC=C2N(CC)C\1=N/N=C1/SC2=CC(S([O-])(=O)=O)=CC=C2N1CC OHDRQQURAXLVGJ-HLVWOLMTSA-N 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 239000012620 biological material Substances 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 208000007951 cervical intraepithelial neoplasia Diseases 0.000 description 1
- 239000002738 chelating agent Substances 0.000 description 1
- VYXSBFYARXAAKO-WTKGSRSZSA-N chembl402140 Chemical compound Cl.C1=2C=C(C)C(NCC)=CC=2OC2=C\C(=N/CC)C(C)=CC2=C1C1=CC=CC=C1C(=O)OCC VYXSBFYARXAAKO-WTKGSRSZSA-N 0.000 description 1
- 239000013043 chemical agent Substances 0.000 description 1
- ZPUCINDJVBIVPJ-LJISPDSOSA-N ***e Chemical compound O([C@H]1C[C@@H]2CC[C@@H](N2C)[C@H]1C(=O)OC)C(=O)C1=CC=CC=C1 ZPUCINDJVBIVPJ-LJISPDSOSA-N 0.000 description 1
- 238000002573 colposcopy Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 230000001268 conjugating effect Effects 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 150000004775 coumarins Chemical class 0.000 description 1
- 238000004132 cross linking Methods 0.000 description 1
- 230000009260 cross reactivity Effects 0.000 description 1
- 108010048032 cyclophilin B Proteins 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 125000001295 dansyl group Chemical group [H]C1=C([H])C(N(C([H])([H])[H])C([H])([H])[H])=C2C([H])=C([H])C([H])=C(C2=C1[H])S(*)(=O)=O 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000008021 deposition Effects 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 125000000664 diazo group Chemical group [N-]=[N+]=[*] 0.000 description 1
- UVYVLBIGDKGWPX-KUAJCENISA-N digitonin Chemical compound O([C@@H]1[C@@H]([C@]2(CC[C@@H]3[C@@]4(C)C[C@@H](O)[C@H](O[C@H]5[C@@H]([C@@H](O)[C@@H](O[C@H]6[C@@H]([C@@H](O[C@H]7[C@@H]([C@@H](O)[C@H](O)CO7)O)[C@H](O)[C@@H](CO)O6)O[C@H]6[C@@H]([C@@H](O[C@H]7[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O7)O)[C@@H](O)[C@@H](CO)O6)O)[C@@H](CO)O5)O)C[C@@H]4CC[C@H]3[C@@H]2[C@@H]1O)C)[C@@H]1C)[C@]11CC[C@@H](C)CO1 UVYVLBIGDKGWPX-KUAJCENISA-N 0.000 description 1
- UVYVLBIGDKGWPX-UHFFFAOYSA-N digitonine Natural products CC1C(C2(CCC3C4(C)CC(O)C(OC5C(C(O)C(OC6C(C(OC7C(C(O)C(O)CO7)O)C(O)C(CO)O6)OC6C(C(OC7C(C(O)C(O)C(CO)O7)O)C(O)C(CO)O6)O)C(CO)O5)O)CC4CCC3C2C2O)C)C2OC11CCC(C)CO1 UVYVLBIGDKGWPX-UHFFFAOYSA-N 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 150000002009 diols Chemical class 0.000 description 1
- BNIILDVGGAEEIG-UHFFFAOYSA-L disodium hydrogen phosphate Chemical compound [Na+].[Na+].OP([O-])([O-])=O BNIILDVGGAEEIG-UHFFFAOYSA-L 0.000 description 1
- 229910000397 disodium phosphate Inorganic materials 0.000 description 1
- 235000019800 disodium phosphate Nutrition 0.000 description 1
- 150000002019 disulfides Chemical class 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 150000002170 ethers Chemical class 0.000 description 1
- 210000001808 exosome Anatomy 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 239000011888 foil Substances 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 230000002496 gastric effect Effects 0.000 description 1
- 230000007274 generation of a signal involved in cell-cell signaling Effects 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 239000012362 glacial acetic acid Substances 0.000 description 1
- 150000002334 glycols Chemical class 0.000 description 1
- 125000005843 halogen group Chemical group 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 238000003702 image correction Methods 0.000 description 1
- 238000010166 immunofluorescence Methods 0.000 description 1
- 238000003364 immunohistochemistry Methods 0.000 description 1
- 238000012296 in situ hybridization assay Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 210000001153 interneuron Anatomy 0.000 description 1
- 238000011005 laboratory method Methods 0.000 description 1
- 229910052747 lanthanoid Inorganic materials 0.000 description 1
- 150000002602 lanthanoids Chemical class 0.000 description 1
- 210000000265 leukocyte Anatomy 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- DLBFLQKQABVKGT-UHFFFAOYSA-L lucifer yellow dye Chemical compound [Li+].[Li+].[O-]S(=O)(=O)C1=CC(C(N(C(=O)NN)C2=O)=O)=C3C2=CC(S([O-])(=O)=O)=CC3=C1N DLBFLQKQABVKGT-UHFFFAOYSA-L 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 239000006249 magnetic particle Substances 0.000 description 1
- 229940107698 malachite green Drugs 0.000 description 1
- FDZZZRQASAIRJF-UHFFFAOYSA-M malachite green Chemical compound [Cl-].C1=CC(N(C)C)=CC=C1C(C=1C=CC=CC=1)=C1C=CC(=[N+](C)C)C=C1 FDZZZRQASAIRJF-UHFFFAOYSA-M 0.000 description 1
- 238000012083 mass cytometry Methods 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- WSFSSNUMVMOOMR-NJFSPNSNSA-N methanone Chemical compound O=[14CH2] WSFSSNUMVMOOMR-NJFSPNSNSA-N 0.000 description 1
- 210000000274 microglia Anatomy 0.000 description 1
- 238000001823 molecular biology technique Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 210000000337 motor cortex Anatomy 0.000 description 1
- 238000007837 multiplex assay Methods 0.000 description 1
- 150000002790 naphthalenes Chemical class 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 125000006501 nitrophenyl group Chemical group 0.000 description 1
- 239000002853 nucleic acid probe Substances 0.000 description 1
- 125000003835 nucleoside group Chemical group 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 239000006174 pH buffer Substances 0.000 description 1
- 238000009595 pap smear Methods 0.000 description 1
- 244000045947 parasite Species 0.000 description 1
- 230000001575 pathological effect Effects 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 210000005259 peripheral blood Anatomy 0.000 description 1
- 239000011886 peripheral blood Substances 0.000 description 1
- 210000003819 peripheral blood mononuclear cell Anatomy 0.000 description 1
- 230000035699 permeability Effects 0.000 description 1
- 239000008191 permeabilizing agent Substances 0.000 description 1
- 239000012466 permeate Substances 0.000 description 1
- 150000002989 phenols Chemical class 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 229920000166 polytrimethylene carbonate Polymers 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 125000006239 protecting group Chemical group 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 239000001397 quillaja saponaria molina bark Substances 0.000 description 1
- 239000012048 reactive intermediate Substances 0.000 description 1
- 229940043267 rhodamine b Drugs 0.000 description 1
- 150000003291 riboses Chemical class 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 229930182490 saponin Natural products 0.000 description 1
- 150000007949 saponins Chemical class 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 210000000582 semen Anatomy 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 150000003457 sulfones Chemical class 0.000 description 1
- COIVODZMVVUETJ-UHFFFAOYSA-N sulforhodamine 101 Chemical compound OS(=O)(=O)C1=CC(S([O-])(=O)=O)=CC=C1C1=C(C=C2C3=C4CCCN3CCC2)C4=[O+]C2=C1C=C1CCCN3CCCC2=C13 COIVODZMVVUETJ-UHFFFAOYSA-N 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- WGTODYJZXSJIAG-UHFFFAOYSA-N tetramethylrhodamine chloride Chemical compound [Cl-].C=12C=CC(N(C)C)=CC2=[O+]C2=CC(N(C)C)=CC=C2C=1C1=CC=CC=C1C(O)=O WGTODYJZXSJIAG-UHFFFAOYSA-N 0.000 description 1
- HODRFAVLXIFVTR-RKDXNWHRSA-N tevenel Chemical compound NS(=O)(=O)C1=CC=C([C@@H](O)[C@@H](CO)NC(=O)C(Cl)Cl)C=C1 HODRFAVLXIFVTR-RKDXNWHRSA-N 0.000 description 1
- 238000011285 therapeutic regimen Methods 0.000 description 1
- 229960004072 thrombin Drugs 0.000 description 1
- 238000003777 tissue processing method Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- GPRLSGONYQIRFK-MNYXATJNSA-N triton Chemical compound [3H+] GPRLSGONYQIRFK-MNYXATJNSA-N 0.000 description 1
- 229960003732 tyramine Drugs 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
- 239000011534 wash buffer Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
- C12Q1/6816—Hybridisation assays characterised by the detection means
- C12Q1/682—Signal amplification
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
- C12Q1/6841—In situ hybridisation
Definitions
- the present disclosure provides methods, compositions, and kits for the multiplexed detection and corresponding expression analysis of nucleic acids in biological samples.
- RNA in situ hybridization is a molecular biology technique widely used to measure and localize specific RNA sequences, for example, messenger RNAs (mRNAs), long non-coding RNAs (IncRNAs), and microRNAs (miRNAs) within cells, such as circulating tumor cells (CTCs) or tissue sections, while preserving the cellular and tissue context.
- RNA ISH therefore provides for spatial-temporal visualization as well as quantification of gene expression within cells and tissues. It has wide applications in research and in diagnostics (Hu et al., Biomark. Res. 2(1):1-13, doi: 10.1186/2050-7771-2-3 (2014); Ratan et al., Cureus 9(6):el325.
- Fluorescent RNA ISH utilizes fluorescent dyes and fluorescent microscopes for RNA labeling and detection, respectively. Fluorescent RNA ISH typically provides for limited multiplexing of four to five target sequences. The limited multiplexing capability is largely due to the small number of spectrally distinct fluorescent dyes that can be distinguished by the optical systems of the fluorescence microscope. Higher level of multiplexing is highly desirable in areas such as generating cell and tissue maps to understand complex biological systems, particularly in human health and disease.
- a method for detecting a plurality of target nucleic acids in a sample comprising: contacting a sample comprising a plurality of target nucleic acids with a plurality of target probe sets, wherein each target probe set is complementary to a target nucleic acid; contacting the sample with a first plurality of signal generating complexes (SGCs) capable of hybridizing to at least one target probe set, wherein each SGC comprises a plurality of spatially distributed label probes comprising detectable labels; obtaining a first independent identifier corresponding to at least one target nucleic acid, wherein the first independent identifier is based on the number of distinct detectable labels present in each SGC of the first plurality of SGCs; removing signals generated by the detectable labels from the sample; contacting the sample with at least a second plurality of SGCs capable of hybridizing to at least one target probe set, wherein each SGC comprises a plurality of spatially distributed label probes comprising detectable labels; and
- the method further comprises identifying a target nucleic acid of the plurality of target nucleic acids based on a combination of the first and second independent identifiers.
- the target nucleic acid is identified independent of the order by which the first and second independent identifiers are obtained.
- the first and second independent identifiers are based on the same number of distinct detectable labels present in the first and second plurality of SGCs, respectively. In some embodiments, the first and second independent identifiers are based on a different number of distinct detectable labels present in the first and second plurality of SGCs, respectively. In some embodiments, the independent identifier is based on the number of distinct detectable labels present in a combination of at least two SGCs.
- the spatially distributed label probes comprise two distinct detectable labels. In some embodiments, the spatially distributed label probes comprise three distinct detectable labels. In some embodiments, the spatially distributed label probes comprise four distinct detectable labels. In some embodiments, the spatially distributed label probes comprise five distinct detectable labels. In some embodiments, the spatially distributed label probes comprise six distinct detectable labels.
- obtaining an independent identifier comprises obtaining an image of the sample and detecting signals generated by the detectable labels.
- the method further comprises: removing the signals generated by the detectable labels of the second plurality of SGCs from the sample; contacting the sample with at least a third plurality of SGCs capable of hybridizing to at least one target probe set, wherein each SGC comprises a plurality of spatially distributed label probes comprising detectable labels; and obtaining a third independent identifier corresponding to at least one target nucleic acid, wherein the third independent identifier is based on the number of distinct detectable labels present in each SGC of the third plurality of SGCs.
- each target probe set comprises two or more target probes capable of hybridizing to a target nucleic acid.
- each target probe in the target probe set comprises a T section complementary to a region of a target nucleic acid and an L section complementary to a region of an SGC.
- each T section is complementary to a non-overlapping region of a target nucleic acid and each L section is complementary to a non-overlapping region of an SGC.
- the T section of at least one of the target probes in the target probe set is 3’ of its L section. In some embodiments, the T section of at least one of the target probes in the target probe set is 5 ’ of its L section.
- each SGC comprises one or more of an amplifier, a preamplifier, and a pre-pre-amplifier.
- each label probe in the plurality of label probes comprises a binding site for the amplifier.
- the amplifier comprises at least one binding site for the pre-amplifier and a plurality of binding sites for the plurality of label probes.
- the pre-amplifier comprises at least one binding site for the L section of the target probe and at least one binding site for the amplifier.
- the pre-amplifier comprises at least one binding site for the pre-pre- amplifier and at least one binding side for the amplifier.
- the at least one binding site for the L section of the target probe in the pre-amplifier or the pre-pre-amplifier is distinct for each SGC corresponding to a target nucleic acid. In some embodiments, the at least one binding site for the L section of the target probe in the pre-amplifier or the pre-pre-amplifier is the same for each SGC corresponding to a target nucleic acid.
- removing the signals generated by the detectable labels comprises removing the SGCs.
- removing the SGCs comprises treatment with an acid reagent that disrupts hybridization between the SGCs and the target nucleic acids.
- the acid reagent comprises formic acid, acetic acid, propionic acid, butyric acid, valeric acid, oxalic acid, malonic acid, succinic acid, malic acid, tartaric acid, citric acid.
- the second plurality of SGCs are hybridized to the same target probe set.
- the second plurality of SGCs are hybridized to a different target probe set.
- removing the signals generated by the detectable labels comprises removing the detectable labels from the plurality of label probes using a cleavage reagent. In some embodiments, after the detectable labels are removed, the second plurality of SGCs are hybridized to a different target probe set.
- the method further comprises a step of contacting the sample with a second plurality of target probe sets, wherein each target probe set in the second plurality of target probe sets is complementary to a target nucleic acid.
- the sample comprises a cell. In some embodiments, the method further comprises fixing and/or permeabilizing the cell.
- the target nucleic acid is RNA. In some embodiments, the method further comprises detecting at least one non-nucleic acid target in the sample. In some embodiments, the non-nucleic acid target is a protein.
- a method for detecting a plurality of target nucleic acids in a sample comprising: contacting a sample comprising a plurality of target nucleic acids with a plurality of target probe sets, wherein each target probe set is complementary to a target nucleic acid; contacting the sample with a plurality of amplification complexes capable of hybridizing to at least one target probe set, wherein each amplification complex comprises a plurality of spatially distributed binding sites for label probes; contacting the sample with a first plurality of label probes comprising detectable labels, which hybridize to binding sites on the amplification complexes to generate a first plurality of signal generating complexes (SGCs); obtaining a first independent identifier corresponding to at least one target nucleic acid, wherein the first independent identifier is based on the number of distinct detectable labels present in each SGC of the first plurality of SGCs; removing signals generated by the detectable labels from the sample; contacting the sample with
- the method further comprises identifying a target nucleic acid of the plurality of target nucleic acids based on a combination of the first and second independent identifiers.
- the target nucleic acid is identified independent of the order by which the first and second independent identifiers are obtained.
- the first and second independent identifiers are based on the same number of distinct detectable labels present in the first and second plurality of SGCs, In some embodiments, the first and second independent identifiers are based on a different number of distinct detectable labels present in the first and second plurality of SGCs, respectively. In some embodiments, the independent identifier is based on the number of distinct detectable labels present in a combination of at least two SGCs.
- the spatially distributed label probes comprise two distinct detectable labels. In some embodiments, the spatially distributed label probes comprise three distinct detectable labels. In some embodiments, the spatially distributed label probes comprise four distinct detectable labels. In some embodiments, the spatially distributed label probes comprise five distinct detectable labels. In some embodiments, the spatially distributed label probes comprise six distinct detectable labels.
- obtaining an independent identifier comprises obtaining an image of the sample and detecting signals generated by the detectable labels.
- the method further comprises: removing the signals generated by the detectable labels of the second plurality of SGCs from the sample; contacting the sample with at least a third plurality of label probes comprising detectable labels, which hybridize to binding sites on the amplification complexes to generate a third plurality of SGCs; and obtaining a third independent identifier corresponding to at least one target nucleic acid, wherein the third independent identifier is based on the number of distinct detectable labels present in each SGC of the third plurality of SGCs.
- each target probe set comprises two or more target probes capable of hybridizing to a target nucleic acid.
- each target probe in the target probe set comprises a T section complementary to a region of a target nucleic acid and an L section complementary to a region of an amplification complex.
- each T section is complementary to a non-overlapping region of a target nucleic acid and each L section is complementary to a non-overlapping region of an amplification complex.
- the T section of at least one of the target probes in the target probe set is 3 ’ of its L section. In some embodiments, the T section of at least one of the target probes in the target probe set is 5 ’ of its L section.
- each amplification complex comprises one or more of an amplifier, a pre-amplifier, and a pre-pre-amplifier.
- each label probe in the plurality of label probes comprises a binding site for the amplifier.
- the amplifier comprises at least one binding site for the pre-amplifier and a plurality of binding sites for the plurality of label probes.
- the pre-amplifier comprises at least one binding site for the L section of the target probe and at least one binding site for the amplifier.
- the pre-amplifier comprises at least one binding site for the pre-pre-amplifier and at least one binding side for the amplifier.
- the at least one binding site for the L section of the target probe in the pre-amplifier or the pre-pre- amplifier is distinct for each amplification complex corresponding to a target nucleic acid. In some embodiments, the at least one binding site for the L section of the target probe in the preamplifier or the pre-pre-amplifier is the same for each amplification complex corresponding to a target nucleic acid.
- removing the signals generated by the detectable labels comprises removing the detectable labels from the plurality of label probes using a cleavage reagent.
- the second plurality of label probes are hybridized to different amplification complexes to generate the second plurality of SGCs.
- the method further comprises a step of contacting the sample with a second plurality of target probe sets, wherein each target probe set in the second plurality of target probe sets is complementary to a target nucleic acid.
- the sample comprises a cell. In some embodiments, the method further comprises fixing and/or permeabilizing the cell.
- the target nucleic acid is RNA. In some embodiments, the method further comprises detecting at least one non-nucleic acid target in the sample. In some embodiments, the non-nucleic acid target is a protein. In one aspect, disclosed herein is a kit for carrying out any of the methods disclosed herein.
- FIGS. 1A-1B show exemplary configurations of probes for detecting a target nucleic acid.
- each individual target probe has a target (T) segment complementary to the target nucleic acid (i.e. , a segment of the target probe that can hybridize to the target nucleic acid) and a label (L) segment complementary to a component of a Signal Generating Complex (SGC) (i.e., a segment of the target probe that can hybridize to a component of the SGC).
- Each SGC comprises multiple layers of components, such as amplifiers (AP) and pre-amplifiers (PA) that assemble into a tree-like structure which is capable of carrying many label probes (LP) on its “branches.”
- AP amplifiers
- PA pre-amplifiers
- FIG. IB if the target sequence is sufficiently long, many TP sets and associated SGCs can be assembled on the target nucleic acid to generate a detectable signal that appears in an imaging system as a discrete “dot.”
- FIGS. 2A-2B show two exemplary configurations of probes for detecting a target nucleic acid.
- the configurations utilize SGCs comprising LPs, APs, PAs, TPs, as shown in FIGS. 1 A-1B, except that assembly of the SGC utilizes a Collaboration Amplifier (COM).
- the COMs bind to two pre-amplifiers (PAs) and to an amplifier (AP) for assembly of the SGC.
- FIGS. 2A-2B show two different configurations of the target probe binding to the target nucleic acid.
- the configurations allow more LPs to be incorporated into one SGC. Such configurations are suitable, e.g., for detecting short target sequences, because a detectable signal can be generated with a single SGC.
- FIG. 3 shows an exemplary configuration for multiplex detection of target nucleic acids.
- each SGC carries the same LPs with a specific label, where different SGCs carry distinct labels.
- Components of each SGC (such as PAs, APs, LPs, etc.) are uniquely associated with the SGC.
- the SGCs are designed so that the components of a target-specific SGC hybridize to each other to assemble the SGC but cannot cross-hybridize to any components of any other SGC.
- two target nucleic acids are shown bound to the respective SGCs.
- SGCs for the target comprise four labels (LI, L2, L3, and L4 that are specific to this SGC).
- FIGS. 4A-4G illustrate various exemplary embodiments for multiplex detection of target nucleic acids.
- FIG. 4A illustrates one embodiment of sub-SGC implementation, where the SGC ID code is implemented on the amplifier (AP) molecule.
- an AP has one region designed to bind to the amplifier anchor (AA) on the PA molecule (i.e. , binding site for the amplifier on the pre-amplifier) and another region comprising multiple segments of label probe anchors (LAs) (i.e., binding sites for the label probes).
- AA amplifier anchor
- LAs label probe anchors
- a mixture of different LAs are designed according to the unique identification code of the SGC. For example, if the identifier of the SGC is based on three labels, then equal number of LAs for each of the three labels are made on the AP molecule, which will bind a designed number of desired LPs to generate the identifier in the assay.
- the code for the SGC shown uses labels LI, L3, and L4 (alphanumeric code C, see Table la).
- FIG. 4B illustrates another embodiment of sub-SGC implementation, where the SGC ID code is implemented on the pre-amplifier (PA) molecule.
- PA pre-amplifier
- N “pure” AP molecules are made, each carrying the same type of LP.
- a PA molecule has one region designed to bind to the SA (i.e., binding site for the PA on the TP; see FIG. 1A) of a TP set and another region comprising multiple segments of AAs (i.e., binding sites for amplifiers on the pre-amplifier).
- a mixture of different AAs is designed according to the unique identification code of the SGC.
- the ID code of the SGC includes the two labels L2 and L4, then equal number of AAs for APs carrying LP2 and LP4 are made on the PA molecule, which will bind a designed number of desired LPs to generate the identifier in the assay.
- the identifier of the SGC is shown using LI, L3, and L4 (alphanumeric code C, see Table la).
- FIG. 4C illustrates another embodiment of sub-SGC implementation, where the SGC ID code is implemented on the LP molecule.
- LP molecules binding to the same SGC can be mixtures of LPs each conjugated to a different label according to a predefined code book.
- SGC5 LPs are a mixture of LPs conjugated to three different labels (LI, L3, and L4).
- An advantage of this embodiment is that the “coloring” of the SGC complex by the LPs will be completely randomized, which can further help to reduce coding errors.
- a partial LP mixing code book is shown on the right of FIG. 4C, with 7 different exemplary SGC identifiers shown using 4 labels.
- FIG. 4D shows an embodiment of FIG. 4C in more detail.
- a specific label anchor LA, the binding site on the amplifier for the label probe
- LPs label probes
- SGC5 is illustrated showing that the amplifiers comprise a plurality of identical LAs, labeled “E.”
- SGC5 is coded with three distinct labels conjugated to label probes (L 1 , L3 , and L4), all of which have the same binding site for the plurality of “E” LAs on the corresponding amplifier. Therefore, all label probes specific to SGC5 are bound to the amplifiers of SGC5.
- FIG. 4E shows an embodiment of FIG. 4C in more detail.
- the SGC5 of FIG. 4D is shown bound to its respective target nucleic acid, with the label probes having an “E” binding site bound to the respective “E” LAs of the SGC5 amplifiers.
- two additional exemplary SGCs bound to their respective target nucleic acids SGC 1 , coded as shown in FIG. 4D, comprises a plurality of identical LAs, labeled “A.”
- SGC1 is coded with one label (LI), which has the binding site for the plurality of “A” LAs of the SGC1 amplifiers.
- LI one label
- SGC3 coded as shown in FIG. 4D, comprises a plurality of identical LAs, labeled “C.”
- SGC3 is coded with two distinct label probes (L 1 and L2), both of which have the same binding site for the plurality of “C” LAs on the corresponding SGC3 amplifiers. Therefore, both label probes are bound to the amplifiers of SGC3, thereby labeling the SGC3 target nucleic acid with the independent identifier based on LI and L2 (alphanumeric code A, see Table la).
- FIG. 4F shows an embodiment of FIG. 4C in more detail.
- SGC2 comprises amplifiers with “B” LAs and is coded using label probe L2.
- the same SGC assembly can be used with respect to the target probes, pre-amplifier, and amplifiers with “B” LAs, but instead of using “B” LA-binding label probes with only L2 as in FIG.
- “B” LA-binding label probes can be used that have a mixture of labels L2 and L3, such that SGC2 is now coded with both labels (alphanumeric code 8, see Table la).
- L2 and L3 are bound to “B” LAs on the SGC2 amplifiers.
- SGC5 comprising amplifiers with “E” LAs is now coded in FIG. 4F by using label probes with “E” LA-binding label probes that have a mixture of labels L2, L3, and L4 (alphanumeric code E, see Table la), instead of labels LI, L3, and L4 (alphanumeric code C, see Table la) as shown in FIG. 4D.
- FIG. 4G shows an embodiment of FIG. 4C in more detail.
- Two target nucleic acids are shown with two bound SGCs, SGC2 and SGC5.
- SGC2 is coded with an identifier based on labels L2 and L3 (alphanumeric code 8, see Table la)
- SGC5 is coded with an identifier based on labels L2, L3, and L4 (alphanumeric code E, see Table la).
- the number of respective labels that can bind to SGC2 will be higher than the number of respective labels that bind to SGC5 (i.e., the 2 distinct labels for SGC2 and the 3 distinct labels for SGC5 will be bound to the same number of sites, resulting in a higher number of L2 and L3 being bound to SGC2 than SGC5 since some of the SGC5 sites are occupied by L4).
- the number of labels can be normalized by including “blank” label probes, i.e., probes having a binding site for the respective LAs (in this case “B” for SGC2 and “E” for SGC5) but without a label.
- “blank” label probes i.e., probes having a binding site for the respective LAs (in this case “B” for SGC2 and “E” for SGC5) but without a label.
- 1/3 “blank” label probes can be included with the mixture of “B” LA-specific probes so that the intensity of L2 and L3 will be the same on both SGCs (i.e., 1/3 of SGC2 occupied by “blank” label probes and 1/3 of SGC5 occupied by L4).
- the assay can be performed so that the same proportion of “blank” label probes are included in the label probe sets using less than 4 labels, for example, 1/2 “blank” label probes can be included with the SG2-specific label probes and 1/4 “blank” label probes to be included with the SGC5-specific label probes so that the amount of each distinct label probe bound to the respective SGCs is the same on each SGC.
- EIG. 5 shows two configurations of assembly of SGCs on a target nucleic acid.
- the SGCs providing the same label are shown binding in a group next to each other.
- the SGCs providing different labels are shown with binding sites on the target nucleic acid intermingled or intertwined.
- the intermingling of target probe binding sites on the target nucleic acid for different labels are advantageous because, if different SGC types are positioned apart, in separate groups, a certain section of the target may be blocked or masked, thereby preventing attachment of one specific SGC type, which will result in miscoding.
- FIGS. 6A-6B demonstrate configurations for reducing miscoding for multiplex detection of target nucleic acids.
- the particular SGC is miscoded because the PA is truncated, which could occur during manufacturing of the PA.
- L4 is not bound due to truncation.
- the same truncation will not cause miscoding if the labels are intertwined or intermingled on the PA. Arranging different labels into alternating positions reduces the chance of miscoding.
- FIGS. 7A-7B show a method to minimize potential miscoding caused by truncation by randomizing the position of different labels on the AP or PA. As illustrated in FIGS.
- the multiplexing channel ID is encoded on the AP molecule.
- different label probes are positioned on each AP in exactly the same way, that is, each amplifier in the SGC is the same. Truncation of some of the APs can cause substantial reduction in certain labels being bound to the target nucleic acid compared to other labels on different positions of the AP. This imbalance increases the chance of miscoding. In the most severe case, truncation could cause the loss of all copies of one certain label, leading to an outright miscode.
- locations of different labels on the APs are randomized.
- the APs are provided as a plurality of amplifiers, where a mix of non-identical amplifiers is included, where the position of LAs for specific label probes are distributed differently and can be randomized on the non-identical amplifiers. Truncation therefore does not cause a large bias in the number of labels in the SGC.
- FIG. 8 shows an exemplary embodiment of the methods disclosed herein for multiplexed detection of target nucleic acids, by generation of two separate independent identifiers of each target nucleic acid using two separate rounds of labeling and detection.
- the identifiers are incorporated at the label-probe level, using the same regions of each target analyte in each round.
- each analyte i.e. “gene” in this figure
- one type of the unique SGCs i.e. “trees” in this figure
- each SGC includes three different label probes, but not the fourth.
- Round 2 can target either the same sequence of the target as in the first round, or a different sequence. If the same sequence is targeted, the T-sections of the target probes can be the same as those used in the previous round, and the entire signal-generating complex is removed between each round. If a different sequence is targeted, the T-sections of the target probes will be different, and only the fluorophore need be cleaved from the SGCs between each round.
- FIG. 9 shows a representative illustration of an embodiment using different combinations of labels at the SGC level.
- each analyte i.e. “gene” in this figure
- the unique SGCs i.e. “trees” in this figure
- SGC “a” contains the first signal only but not others.
- the expanded combinatorial code can then be used to identify the analyte.
- Round 2 can target either the same sequence of the target as in the first round, or a different sequence. If the same sequence is targeted, the T- sections of the target probes can be either the same as those used in the previous round or different from those used in the previous round, and the entire signal-generating complex is removed between each round. If a different sequence is targeted, the T-sections of the target probes will be different, and only the fluorophore needs be cleaved from the SGCs between each round.
- FIGS. 10A-10C show an exemplary assay workflows for the methods disclosed herein.
- FIG. 11 shows a representative image demonstrating detection of a single gene (RNA transcript), Ubc, in a two-cycle experiment according to the methods disclosed herein.
- the independent identifier in this experiment was based on a single fluorescent label.
- the signals detected in the first round are indicated by triangles in the image, and signals detected in the second round are indicated by squares. Locations with overlapping triangles and squares are the locations where the target gene, was detected.
- FIGS. 12A-12C show representative images from a single two-cycle experiment to detect three genes (RNA transcripts Gadph, Sdha, and Ubc). Each target was assigned the same first independent identifier based on a single fluorescent label, and each target was assigned a different second independent identifier based on a single fluorescent label.
- FIG. 12A shows detection of Gadph
- FIG. 12B shows detection of Sdha
- FIG. 12C shows detection of Ubc.
- the signals detected in the first cycle represent all three genes, and are shown in triangles.
- signals detected in the second cycle are indicated by circles, and locations with overlapping triangles and circles are the locations where the combined first and second identifiers show that Gadph was detected.
- FIG. 12A shows detection of Gadph
- FIG. 12B shows detection of Sdha
- FIG. 12C shows detection of Ubc.
- the signals detected in the first cycle represent all three genes, and are shown in triangles.
- signals detected in the second cycle are indicated
- signals detected in the second cycle are indicated by crosses, and locations with overlapping triangles and crosses are the locations where the combined first and second identifiers show that Sdha was detected.
- signals detected in the second cycle are indicated by squares, and locations with overlapping triangles and squares are the locations where the combined first and second identifiers show that Ubc was detected.
- FIGS. 13A-13C show representative images from a first two-cycle experiment to detect one gene (RNA transcript Hs-PPIB), with the first and second identifiers each based on two fluorescent labels. Further details can be found in Example 3.
- FIG. 13A shows images from the first round of labeling and detection, with panels showing detection of DAPI (upper left), Dylight 550 (upper middle, no label detected), Dylight 650 (upper right, label detected), Alexa Fluoro 488 (lower left, no label detected), Alexa Fluoro 750 (lower middle, label detected), and an overlay (lower right).
- FIG. 13A shows images from the first round of labeling and detection, with panels showing detection of DAPI (upper left), Dylight 550 (upper middle, no label detected), Dylight 650 (upper right, label detected), Alexa Fluoro 488 (lower left, no label detected), Alexa Fluoro 750 (lower middle, label detected), and an overlay (lower right).
- DAPI upper left
- Dylight 550
- FIG. 13B shows images from the second round of labeling and detection, with panels showing detection of DAPI (upper left), Dylight 550 (upper middle, label detected), Dylight 650 (upper right, label detected), Alexa Fluoro 750 (lower left, no label detected), Alexa Fluoro 488 (lower middle, no label detected), and an overlay (lower right).
- FIG. 13C is an overlay of all signals detected in the first and second rounds, with shaded circles showing “dots” having the complete identifier “code” for the target.
- FIGS. 14A-14C show representative images from a second two-cycle experiment to detect one target nucleic acid (RNA transcript Hs-PPIB), with the first and second identifiers each based on two fluorescent labels, where the order of the identifiers was reversed compared to those in the experiments shown in FIGS. 13A-13C. Further details can be found in Example 3.
- FIG. 14A shows images from the first round of labeling and detection, with panels showing detection of DAPI (upper left), Dylight 550 (upper middle, label detected), Dylight 650 (upper right, label detected), Alexa Fluoro 488 (lower left, no label detected), Alexa Fluoro 750 (lower middle, no label detected), and an overlay (lower right).
- FIG. 14A shows images from the first round of labeling and detection, with panels showing detection of DAPI (upper left), Dylight 550 (upper middle, label detected), Dylight 650 (upper right, label detected), Alexa Fluoro 488 (lower left, no label detected), Alexa Fluor
- FIG. 14B shows images from the second round of labeling and detection, with panels showing detection of DAPI (upper left), Dylight 550 (upper middle, no label detected), Dylight 650 (upper right, label detected), Alexa Fluoro 750 (lower left, label detected), Alexa Fluoro 488 (lower middle, no label detected), and an overlay (lower right).
- FIG. 14C is an overlay of all signals detected in the first and second rounds, with shaded circles showing “dots” having the complete identifier “code” for the target.
- FIGS. 15A-15C show representative images from a single ISH labeling experiment to detect one target nucleic acid (RNA transcript Hs-PPIB) in Hela cells, to demonstrate that individual PPIB signals (coded with alphanumeric Code 8) can be detected in any order to generate an independent identifier, as further described in Example 4.
- FIG. 15A shows images detected using two different Excitation/Emission filter sequences.
- FIG. 15B shows an overlay of the images from each sequence.
- FIG. 15C presents overlays showing individual “dots” for detection of PPIB.
- FIGS. 16A-16B show exemplary embodiments of the methods disclosed herein for multiplexed detection of target nucleic acids, by generation of two separate independent identifiers of each target nucleic acid using two separate rounds of labeling and detection.
- the identifiers are incorporated at the label-probe level, using the same regions of each target analyte in each round.
- each analyte i.e. , “gene” in this figure
- is labeled with one type of the unique SGCs i.e., “trees” in this figure
- the SGC includes three different label probes (L4, L3, and L2), but not the fourth (LI).
- L4, L3, and L2 the label probes
- LI the fourth
- the signals are detected and recorded to generate the first independent identifier of each target gene
- the signals are removed, and the analyte is labeled with another type of SGC.
- the second set of signals is then detected and recorded. This procedure can continue with more rounds.
- the expanded code can then be used to identify the analyte.
- Round 2 can target either the same sequence of the target as in the first round, or a different sequence. If the same sequence is targeted, the T-sections of the target probes can be the same as those used in the previous round, and the entire signal-generating complex is removed between each round.
- FIG. 16B shows a representative illustration of an embodiment using different combinations of labels at the SGC level.
- each analyte i.e., “gene” in this figure
- one combination of the unique SGCs i.e., “trees” in this figure
- SGC “a” contains the first signal only but not others.
- the first independent identifier i.e., the first independent identifier
- either the signals or the SGCs are removed, and analyte is labeled with another combination of unique SGCs for the second round.
- the second set of the combinatorial code is then detected and recorded. This procedure can continue with more rounds.
- the expanded combinatorial code can then be used to identify the analyte.
- Round 2 can target either the same sequence of the target as in the first round, or a different sequence. If the same sequence is targeted, the T-sections of the target probes can be either the same as those used in the previous round or different from those used in the previous round, and the entire signal-generating complex is removed between each round. If a different sequence is targeted, the T-sections of the target probes will be different, and only the fluorescent signals need to be removed from the SGCs between each round.
- FIG. 17 shows representative graphs of spatial transcriptomic profiling in a 1 month old C57BL/6 mouse brain (top, motor cortex) and a 1 year old C57BL/6 mouse brain (bottom, lateral cortex).
- Each of the 100 target genes is differentially expressed by a unique cell type in the mouse brain and thus can be used to mark the particular cell type.
- the correlation coefficient (R value) ranges from 0.6-0.81, demonstrating that the disclosed method (referred to as “Ultraplex assay”) can effectively profile the gene expressions of mouse brain cells at single-cell level.
- FIG. 18 shows representative graphs of data pertaining to minor gene expression discrepancies between the Ultraplex assay and the control assay tested in FIG. 17. The data indicated that the Ultraplex assay resulted in more accurate measurements of the gene expressions in mouse brain (as suggested by the Hiplex assay performed in the same mouse brain tissues against these disputed genes).
- FIG. 19 shows representative graphs correlating gene expression data from the Ultraplex assay with annotated cell types from control assays. Between 10-16 major cell types in the different mouse brain regions were correctly identified using the Ultraplex assay, indicating consistent spatial distribution across the mouse brain datasets.
- the disclosed methods allow the detection of multiple target nucleic acids within the same sample and within the same cell.
- Each target nucleic acid is labeled with one or more detectable labels that generate a first independent identifier of the target nucleic acid. Removal of the signals followed by a second round of labeling generates a second independent identifier of the target nucleic acid.
- a combination of the first and second independent identifiers allows for identification of a target nucleic acid, independent of the order by which the first and second independent identifiers are obtained.
- exogenous refers to the substances originating from within an organism.
- exogenous refers to the substances originating from outside an organism.
- nucleic acid and “polynucleotide” are used interchangeably herein to describe a polymer of any length composed of nucleotides, e.g., deoxyribonucleotides or ribonucleotides, or compounds produced synthetically, which can hybridize with naturally occurring nucleic acids in a sequence specific manner analogous to that of two naturally occurring nucleic acids, e.g. , can participate in Watson-Crick base pairing interactions.
- bases are synonymous with “nucleotides” (or “nucleotide”), i.e., the monomer subunit of a polynucleotide.
- nucleoside and nucleotide are intended to include those moieties that contain not only the known purine and pyrimidine bases, but also other heterocyclic bases that have been modified. Such modifications include methylated purines or pyrimidines, acylated purines or pyrimidines, alkylated riboses or other heterocycles.
- nucleoside and nucleotide include those moieties that contain not only conventional ribose and deoxyribose sugars, but other sugars as well. Modified nucleosides or nucleotides also include modifications on the sugar moiety, e.g. , wherein one or more of the hydroxyl groups are replaced with halogen atoms or aliphatic groups, or are functionalized as ethers, amines, or the like.
- Analogues refer to molecules having structural features that are recognized in the literature as being mimetics, derivatives, having analogous structures, or other like terms, and include, for example, polynucleotides incorporating non-natural nucleotides, nucleotide mimetics such as 2’- modified nucleosides, peptide nucleic acids, oligomeric nucleoside phosphonates, and any polynucleotide that has added substituent groups, such as protecting groups or linking moieties.
- first polynucleotide and a second polynucleotide are complementary if they bind to each other in a hybridization assay under stringent conditions, e.g. , if they produce a given or detectable level of signal in a hybridization assay.
- Portions of polynucleotides are complementary to each other if they follow conventional base-pairing rules, e.g., A pairs with T (or U) and G pairs with C, although small regions e.g., fewer than about 3 bases) of mismatch, insertion, or deleted sequence may be present.
- sample as used herein relates to a material or mixture of materials containing one or more components of interest.
- sample includes “biological sample” which refers to a sample obtained from a biological subject, including a sample of biological tissue or fluid origin, obtained, reached, or collected in vivo or in situ.
- a biological sample also includes samples from a region of a biological subject containing precancerous or cancer cells or tissues. Such samples can be, but are not limited to, organs, tissues, cells, and exosomes isolated from a mammal.
- Exemplary biological samples include but are not limited to cell lysate, a cell culture, a cell line, a tissue, oral tissue, gastrointestinal tissue, an organ, an organelle, a biological fluid, a blood sample, a urine sample, a skin sample, and the like.
- Preferred biological samples include, but are not limited to, whole blood, partially purified blood, PBMC, tissue biopsies, and the like.
- probe refers to a capture agent that is directed to a specific target mRNA sequence. Accordingly, each probe of a probe set has a respective target mRNA sequence.
- the probe provided herein is a “nucleic acid probe” or “oligonucleotide probe” which refers to a nucleic acid capable of binding to a target nucleic acid of complementary sequence, such as the mRNA biomarkers provided herein, usually through complementary base pairing by forming hydrogen bond.
- a probe may include natural (e.g., A, G, C, or T) or modified bases (7-deazaguanosine, inosine, etc.).
- the bases in a probe may be joined by a linkage other than a phosphodiester bond, so long as it does not interfere with hybridization.
- the probes can be directly or indirectly labeled with tags, for example, chromophores, lumiphores, or chromogens. By assaying for the presence or absence of the probe, one can detect the presence or absence of a target mRNA biomarker of interest.
- a plurality can refer to, for example, 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, 10 or more, 11 or more, 12 or more, 13 or more, 14 or more, 15 or more, 16 or more, 17 or more, 18 or more, 19 or more, 20 or more, 21 or more, 22 or more, 23 or more, 24 or more, 25 or more, 26 or more, 27 or more, 28 or more, 29 or more, 30 or more, 31 or more, 32 or more, 33 or more, 34 or more, 35 or more, 36 or more, 37 or more, 38 or more, 39 or more, 40 or more, 41 or more, 42 or more, 43 or more, 44 or more, 45 or more, 46 or more, 47 or more, 48 or more, 49 or more, 50 or more, 55 or more, 60 or more, 65 or more, 70 or more, 75 or more, 80 or more, 85 or more, 90
- one or more refers to, for example, 1 or more, 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, 10 or more, 11 or more, 12 or more, 13 or more, 14 or more, 15 or more, 16 or more, 17 or more, 18 or more, 19 or more, 20 or more, 21 or more, 22 or more, 23 or more, 24 or more, 25 or more, 26 or more, 27 or more, 28 or more, 29 or more, 30 or more, 31 or more, 32 or more, 33 or more, 34 or more, 35 or more, 36 or more, 37 or more, 38 or more, 39 or more, 40 or more, 41 or more, 42 or more, 43 or more, 44 or more, 45 or more, 46 or more, 47 or more, 48 or more, 49 or more, 50 or more, 55 or more, 60 or more, 65 or more, 70 or more, 75 or more, 80 or more, 85 or more, 90 or more, 95 or more,
- each individual target probe has a target section (T) complementary to the target and a label section (L) complementary to a component of a Signal Generating Complex (SGC).
- SGC Signal Generating Complex
- the T sections of the one or more target probe(s) are complementary to non-overlapping regions of the target nucleic acid
- the L sections of the one or more target probe(s) are complementary to non-overlapping regions of the nucleic acid component of the SGC.
- Each SGC comprises multiple layers of components, such as Amplifiers (AMP) and Pre-Amplifiers (PA), which assemble into tree-like structures capable of carrying many Label Probes (LP) on their “branches.” As shown in FIG.
- FIG. 2 shows two additional different configurations of the SGC, in which additional layer(s) of amplification molecules, such as a Collaboration Amplifier (COM), are incorporated to carry more LPs in one SGC.
- additional layer(s) of amplification molecules such as a Collaboration Amplifier (COM)
- COM Collaboration Amplifier
- each target has unique TP sets but all TP sets have the same L sections, and can therefore be bound to the same SGCs. In this way, a signal is detected when any one of the multiple targets is present.
- the pooling approach is useful when a group of targets has the same clinical utility or biological functionality.
- Other methods use a multiplexing approach, in which each target has its own unique SGC that does not cross-hybridize, generating a unique identifiable signal for each target when it is present. The multiplexing approach is useful when each target in the group provides a different clinical or biological indication alone or in combination.
- each unique signal is generated by a large number of LPs carrying the same label.
- a drawback to this approach is that there are usually a limited number of uniquely identifiable labels.
- fluorescent detection modalities for example, four to six fluorophores at different wavelengths are commonly used. More than six fluorophores in an imaging-based multiplexing system is possible, but can become challenging due to bandwidth limits and cross-talk between wavelengths. This limitation can impose a limit on the number of targets that can be multiplexed in an assay.
- the methods disclosed herein allow for multiplexed detection of large numbers of targets in a single sample.
- Each target in the sample is detected by generating at least two separate independent identifiers of each target in the sample (e.g., two, three, or more independent identifiers), and then combining the separate independent identifiers to obtain a unique identifier for each target in the sample.
- the combination of multiple independent identifiers allows for simultaneous detection of large numbers of different targets in the same sample.
- Each independent identifier is based on the number of distinct detectable labels associated with each target.
- each target can be identified independent of the order by which the independent identifiers are obtained.
- the targets are nucleic acids, such as RNA.
- the method comprises first contacting the sample with a plurality of target probe sets, wherein each target probe set is complementary to a target nucleic acid. This step is followed by contacting the sample with a first plurality of signal generating complexes (SGCs), each capable of hybridizing to a target probe set, and each comprising a plurality of spatially distributed label probes (LPs) comprising detectable labels. A first independent identifier is then obtained for each target nucleic acid, based on the number of distinct detectable labels present in each SGC of the first plurality of SGCs.
- the method comprises first contacting the sample with a plurality of target probe sets, wherein each target probe set is complementary to a target nucleic acid.
- This step is followed by contacting the sample with a plurality of amplification complexes capable of hybridizing to at least one target probe set, wherein each amplification complex comprises a plurality of spatially distributed binding sites for label probes, and then contacting the sample with a first plurality of label probes comprising detectable labels, which hybridize to binding sites on the amplification complexes to generate a first plurality of signal generating complexes (SGCs).
- SGCs signal generating complexes
- any from zero to ten different LPs bearing different detectable labels can form combinations to generate each independent identifier. Examples are shown in Table la, where four different labels (LI, L2, L3 and L4) can generate 16 unique combinations for the first independent identifier. Further examples are shown in Table lb, where five different labels (LI, L2, L3, L4, and L5) can generate 32 unique combinations for the first independent identifier. Use of six different labels would produce yet additional unique combinations.
- fluorophores that can be used in combination to generate four, five, or six LPs include those with excitation maxima at or near at 488 nm, 550 nm, 594 nm, 650 nm, 700 nm, and/or 750 nm.
- the independent identifiers can each be easily identified, e.g., using an alphanumeric code, to generate a first independent identifier of each target. Specific methods and probe configurations for associating these different combinations of spatially distributed LPs with each target nucleic acid will be further discussed below.
- Table la Combinations of labels based on four distinct labels to generate a first independent identifier of each target (shown using an alphanumeric code).
- Table lb Combinations of labels based on four distinct labels to generate a first independent identifier of each target (shown using an alphanumeric code).
- the method comprises contacting the sample with a second plurality of SGCs, each capable of hybridizing to a target probe set, and each comprising a plurality of spatially distributed label probes (LPs) comprising detectable labels.
- this step is preceded by contacting the sample with a second plurality of target probe sets, wherein each target probe set in the second plurality of target probe sets is complementary to a target nucleic acid.
- a second independent identifier is obtained for each target nucleic acid, based on the number of distinct detectable labels present in each SGC of the second plurality of SGCs.
- the method further comprises identifying a target nucleic acid of the plurality of target nucleic acids based on a combination of the first and second independent identifiers (e.g., via generation of a “combined” identifier). Examples are shown in Table 2, where 16 target genes all have the same first independent identifier (alphanumeric code 1, in this example), and a second round of labeling provides a second independent identifier using four different labels (LI, L2, L3 and L4) that can generate 16 unique combinations for the second independent identifier.
- first and second independent identifiers e.g., via generation of a “combined” identifier. Examples are shown in Table 2, where 16 target genes all have the same first independent identifier (alphanumeric code 1, in this example), and a second round of labeling provides a second independent identifier using four different labels (LI, L2, L3 and L4) that can generate 16 unique combinations for the second independent identifier.
- the method further comprises another round of labeling to generate a third independent identifier. Adding a third independent identifier further increases the number of independent targets that can be identified.
- additional rounds of labeling can further increase the number of target nucleic acids that can be detected. Of course, fewer than the maximum total number of targets can be detected in any given assay.
- the disclosed methods comprise one or more steps of contacting a sample with a plurality of target probe sets, wherein each target probe set is complementary to a target nucleic acid.
- each individual set of target probes is specific for (i.e., specifically hybridizes to) each target nucleic acid in the plurality of target nucleic acids being detected in the method.
- Each target probe set comprises one or more pairs of individual target probes.
- a “target probe” refers to a polynucleotide that is capable of hybridizing to a target (e.g., a target nucleic acid) and capturing or binding a signal-generating complex (SGC) component to that target.
- a target e.g., a target nucleic acid
- SGC signal-generating complex
- the target probe can hybridize to an amplifier, a pre-amplifier or a pre-pre-amplifier in an SGC.
- the target probe thus includes a first polynucleotide sequence that is complementary to a polynucleotide sequence of the target nucleic acid, and a second polynucleotide sequence that is complementary to a polynucleotide sequence of a component of a SGC (amplifier, pre-amplifier, pre-pre-amplifier, or the like).
- the target probe is generally single stranded so that the complementary sequence is available to hybridize with a corresponding target, amplifier, pre-amplifier, pre-pre-amplifier, or the like.
- Each target probe comprises a target (T) section and a label (L) section, wherein the T section is a nucleic acid sequence complementary to a section on the target nucleic acid and the L section is a nucleic acid sequence complementary to a section on the nucleic acid component of the SGC.
- the T sections are complementary to non-overlapping regions of the target nucleic acid
- the L sections are complementary to non-overlapping regions of the nucleic acid component(s) of the SGC.
- the two target probes can have the same directionality, or opposite directionalities.
- T section of each target probe in the target probe pair is 3 ’ of its L section.
- the T section of each target probe in the target probe pair is 5’ of its L section.
- the T section of one target probe in the target probe pair is 3’ of its L section, and the T section of the other target probe in the target probe pair is 5 ’ of its L section.
- each target probe comprises a non-binding portion separating its T section from its L section (e.g., a spacer or linker region).
- this nonbinding portion does not bind to either the target nucleic acid or the nucleic acid component of the SGC, but is useful in forming the SGC.
- this non-binding region can form the unpaired loop section of a hairpin loop.
- none of the target probes contains a non-binding portion.
- one pair of target probes is used to detect each target nucleic acid.
- two or more pairs of target probes are used to detect each target nucleic acid.
- the pairs of target probes in the target probe set specific for a particular target nucleic acid bind to different and non-overlapping sequences of the target nucleic acid.
- the molecule that binds to the target probe pairs i. e. , an amplifier, a pre-amplifier, or a pre-pre-amplifier
- the molecule that binds to the target probe pairs i. e. , an amplifier, a pre-amplifier, or a pre-pre-amplifier
- the target probe pairs that bind to the same target nucleic acid can be designed to include the same binding site for the molecule in the SGC that binds to the target probe pairs (e.g., a pre-amplifier or pre-pre-amplifier).
- the use of multiple target probe pairs to detect a target nucleic acid can provide for a higher signal associated with the assembly of multiple SGCs on the same target nucleic acid.
- the number of target probe pairs used for binding to the same target nucleic acid are in the range of 1-10, 1-20, 1- 30, 1-40, 1-50, 1-60, 1-70, 1-80, 1-90, 1-100, 1-110, 1-120, 1-130, 1-140, 1-150, 1-160, 1-170, 1-180, 1-190, or 1-200 pairs per target, or larger numbers of pairs, or any integer number of pairs in between, for example, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20,
- Each target probe pair can be designed to bind to immediately adjacent segments of the target nucleic acid, or on segments that have one to a number of bases between the target probe binding sites of the target probe pair.
- target probe pairs are designed for binding to the target nucleic acid such that there are generally between 0 to 500 bases between the binding sites on the target nucleic acid, for example, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 120, 140, 160, 180, 200, 220, 240, 260, 280, 300, 320, 340, 360, 380, 400, 420, 440, 460, 480, or 500 bases, or any integer length in between.
- the binding sites for the pair of target probes are between 0 to 100, 0 to 200, or 0 to 300 bases, or any integer length in between.
- more than one target probe pair is used in a target probe set to bind to the same target, and where there is a gap in the binding sites between a pair of target probes, it is understood that the binding sites of different target probe pairs do not overlap.
- some overlap between different target probe pairs can occur, so long as the target probe pairs are able to concurrently bind to the respective binding sites of the double stranded target nucleic acid.
- the disclosed methods comprise one or more steps of contacting a sample with a plurality of signal generating complexes (SGCs) capable of hybridizing to at least one target probe set, wherein each SGC comprises a plurality of spatially distributed label probes comprising detectable labels.
- the disclosed methods comprise one or more steps of contacting a sample with a plurality of amplification complexes capable of hybridizing to at least one target probe set, wherein each amplification complex comprises a plurality of spatially distributed binding sites for label probes; subsequently, the sample is contacted with a plurality of label probes comprising detectable labels, which hybridize to binding sites on the amplification complexes to generate a plurality of SGCs .
- amplification complexes and SGCs can be employed in the above methods, to allow for binding of 0, 1, 2, 3, or 4 different labels to each target nucleic acid, to generate each independent identifier for the target nucleic acid (e.g., a first independent identifier, a second independent identifier, a third independent identifier, etc.)
- label probe refers to an entity that binds to a target molecule, directly or indirectly, generally indirectly, and allows the target to be detected.
- a label probe (or “LP”) contains a nucleic acid binding portion that is typically a single stranded polynucleotide or oligonucleotide that comprises one or more labels which directly or indirectly provides a detectable signal.
- the label can be covalently attached to the polynucleotide, or the polynucleotide can be configured to bind to the label.
- a biotinylated polynucleotide can bind a streptavidin-associated label.
- the label probe can hybridize to a nucleic acid that is in turn hybridized to the target nucleic acid or to one or more other nucleic acids that are hybridized to the target nucleic acid.
- the label probe can comprise at least one polynucleotide sequence that is complementary to a polynucleotide sequence in an amplifier, pre-amplifier, or pre-pre-amplifier in an amplification complex or SGC.
- the amplification complexes and SGCs provided herein comprise components such an amplifier, a pre-amplifier, and/or a pre-pre-amplifier.
- an “amplifier” is a molecule, typically a polynucleotide, that is capable of hybridizing to multiple label probes.
- the amplifier hybridizes to multiple identical label probes, or multiple different label probes.
- the amplifier can also hybridize to a target nucleic acid, to at least one target probe of a pair of target probes, to both target probes of a pair of target probes, or to nucleic acid bound to a target probe such as a pre-amplifier or pre-pre-amplifier.
- the amplifier can hybridize to at least one target probe and to a plurality of label probes, or to a pre-amplifier and a plurality of label probes.
- the amplifier can be, for example, a linear, forked, comb-like, or branched nucleic acid.
- the amplifier can include modified nucleotides and/or nonstandard intemucleotide linkages as well as standard deoxyribonucleotides, ribonucleotides, and/or phosphodiester bonds. Suitable amplifiers are described, for example, in U.S. Patent Nos. 5,635,352, 5,124,246, 5,710,264, 5,849,481, and 7,709,198 and U.S. publications 2008/0038725 and 2009/0081688, each of which is incorporated by reference.
- a “pre-amplifier” is a molecule, typically a polynucleotide, that serves as an intermediate binding component between one or more target probes and one or more amplifiers. Typically, the pre-amplifier hybridizes simultaneously to one or more target probes and to a plurality of amplifiers. Exemplary pre-amplifiers are described, for example, in U.S. Patent Nos. 5,635,352, 5,681,697 and 7,709,198 and U.S. publications 2008/0038725, 2009/0081688 and 2017/0101672, each of which is incorporated by reference.
- a “pre-pre-amplifier” is a molecule, typically a polynucleotide, that serves as an intermediate binding component between one or more target probes and one or more pre-amplifiers. Typically, the pre-pre-amplifier hybridizes simultaneously to one or more target probes and to a plurality of pre-amplifiers. Exemplary pre-pre-amplifiers are described, for example, in U.S. Patent No. 11,078,528, which is incorporated by reference.
- Each SGC comprises a plurality of spatially distributed label probes comprising detectable labels.
- the labeling scheme can be implemented in different ways. For example, in some embodiments, the labeling scheme can be employed at the SGC level. As shown in FIG. 3, for generation of an independent identifier with N unique labels, N unique label-specific SGCs can be made. Each SGC carries the same LP with a specific label. Components of each SGC (such as PAs, APs, LPs, etc.) are uniquely associated with the SGC. They are specially designed to hybridize to each other to assemble the SGC but cannot cross-hybridize to any components of any other SGC. In the exemplary embodiment shown in FIG. 3, two target nucleic acids are shown bound to the respective SGCs.
- the SGCs for the target comprise four labels (LI, L2, L3, and L4).
- the SGCs for the target nucleic acid comprise two labels (L2 and L4).
- the plurality of SGCs comprise: (a) a set of preamplifiers, wherein the pre-amplifier set comprises one or more subsets of pre-amplifiers, wherein the one or more pre-amplifier subsets comprise a pre-amplifier specific for each of the target probe pairs in the plurality of target probe sets, wherein each pre-amplifier comprises binding sites for the pair of target probes of one of the target probe sets and a plurality of binding sites for an amplifier; (b) a set of amplifiers, wherein the amplifier set comprises one or more subsets of amplifiers specific for each pre-amplifier subset, wherein each amplifier subset comprises a plurality of amplifiers, wherein the amplifiers of one of the amplifier subsets comprise a binding site for the pre-amplifiers of one of the pre-amplifier subsets and a plurality of binding sites for a label probe; and (c) a set of label probes, wherein the label probe set comprises one or more subsets of
- the labeling scheme can also be implemented at the component level within an SGC (i.e. , sub-SGC level).
- SGC i.e. , sub-SGC level
- Such a system comprises N different, label specific LPs and 2N-1 unique, target-specific SGCs.
- Each LP has a segment designed to hybridize to a label probe anchor (LA) on an AP molecule (i.e., the LA being the binding site on the amplifier for the label probe) in the SGC.
- LA label probe anchor
- AP AP molecule
- a mixture of different LAs are designed and made to bind a set of pre-determined, different LPs, generating a unique combination of detectable labels that are used to identify the target.
- the sub-SGC level implementation can be advantageous when the target sequence is shorter. In addition, the probability of miscoding is reduced, as discussed below.
- FIG. 4A illustrates one embodiment of sub-SGC implementation, where the above mentioned SGC ID code is implemented on the AP molecule.
- an AP has one region designed to bind to the amplifier anchor (AA) on the PA (i.e., the AA being the binding site on the pre-amplifier for the amplifier) and another region comprising multiple segments of LAs (i.e., the binding sites on the amplifier for the label probes).
- the amplifier used repeats of the same LA, that is, the amplifier had a plurality of binding sites for the same label probe.
- An embodiment using repeats of the same LA for binding a plurality of the same LPs to the amplifier is shown in FIG. 3.
- a mixture of different LAs are designed according to the unique identifier of the SGC. For example, if the independent identifier of the SGC includes three separate labels (e.g., LI, L3, and L4, as shown in FIG. 4A), then an equal number of LAs for LPs with LI , L3, and L4 are made on the AP molecule, which will bind a designed number of desired LP to generate the independent identifier.
- the plurality of SGCs comprise: (a) a set of preamplifiers, wherein the pre-amplifier set comprises a plurality of pre-amplifiers, wherein the pre-amplifiers comprise binding sites for the pairs of target probes and a plurality of binding sites for an amplifier; (b) a set of amplifiers, wherein the amplifier set comprises a plurality of amplifiers, wherein the amplifiers comprise a binding site for the pre-amplifiers and a plurality of binding sites for a label probe or two or more distinct label probes; and (c) a set of label probes, wherein the label probe set comprises a label probe or two or more distinct label probes, wherein each label probe comprises a label and a binding site for the amplifiers, wherein the labels in each distinct label probe are distinguishable between the distinct label probes; wherein the amplifier in each probe subset specific for a target nucleic acid comprises a binding site for a label or a combination of two or more distinct labels that is different for each probe
- the label probe set comprises two or more distinct label probes, wherein the amplifier set comprises a plurality of non-identical amplifiers, and wherein the binding sites for the two or more distinct label probes on each non-identical amplifier are in a different order on each non-identical amplifier (see FIG. 7B).
- This embodiment can be used to reduce miscoding, as described below in more detail.
- FIG. 4B illustrates another embodiment of sub-SGC implementation, where the SGC ID code is implemented on the PA molecule.
- N “pure” AP molecules are made, each carrying the same LA for the same type of LP.
- a PA molecule has one region designed to bind to the L-section of the target probe (i.e., the segment of the TP that binds to the pre-amplifier; see FIG. 1A) of a target probe pair, and another region comprising multiple segments of AAs (amplifier anchors, i.e., the segments on the pre-amplifier that bind to the amplifiers). In previously disclosed methods, these are repeats of the same AA.
- FIG. 1A the segment of the TP that binds to the pre-amplifier
- IB a mixture of different AAs are designed according to the unique identification code of the SGC.
- the independent identifier of the SGC is based on two labels
- equal number of AAs for APs carrying the two LPs are made on the PA molecule, which will bind a designed number of desired LPs to generate the identifier in the assay.
- the code for the SGC shown uses three labels (LI, L3, and L4).
- the plurality of SGCs comprise: (a) a set of preamplifiers, wherein the pre-amplifier set comprises a plurality of pre-amplifiers, wherein the pre-amplifiers comprise binding sites for the pairs of target probes and a plurality of binding sites for amplifiers; (b) a set of amplifiers, wherein the amplifier set comprises a plurality of amplifiers, wherein the plurality of amplifiers comprise an amplifier comprising a binding site for the pre-amplifiers and a plurality of binding sites for a label probe, or wherein the plurality of amplifiers comprise two or more distinct amplifiers, wherein each distinct amplifier comprises a binding site for the pre-amplifiers and a plurality of binding sites for a distinct label probe; and (c) a set of label probes, wherein the label probe set comprises a label probe or two or more distinct label probes, wherein the label probe comprises a label and a binding site for the amplifier, or wherein the two or more distinct label probes comprise a label
- the plurality of amplifiers comprise two or more distinct amplifiers, and wherein the binding sites on the pre-amplifier for the distinct amplifiers are intermingled (see FIG. 6B). As described below in more detail, this embodiment can be used to reduce miscoding.
- FIG. 4C illustrates another embodiment of sub-SGC implementation, where the independent identifier is implemented on the LP molecule.
- the LP sequences bound to a single SGC all carry the same label.
- the embodiment of FIG. 3 also utilizes the same labels on a given SGC.
- LP molecules binding to the same SGC can be a mixture of LPs each conjugated to a different label according to a predefined code book.
- SGC5 LPs are a mixture of LPs conjugated to three different labels.
- this scheme provides the flexibility to assign different identifiers to different SGCs in different assay configurations simply by devising a different code book on the fly.
- the mixing of LPs can be made in unequal amounts to normalize the labeling intensities across the N labels, which again can help in reducing encoding/decoding errors.
- the disadvantage of this embodiment is that M N - 1 LP and LA sequences will be required to uniquely encode and decode each SGC. In comparison, in the embodiment shown in FIG. 4A, only N unique LPs and LAs are required.
- the SGC ID code is implemented on the LP molecule.
- LP molecules binding to the same SGC can be mixtures of LPs each conjugated to a different label according to a predefined code book.
- SGC5 LPs are a mixture of LPs conjugated to three different labels, corresponding to labels LI, L3, and L4.
- An advantage of this embodiment is that the “coloring” of the SGC complex by the LPs will be completely randomized, which can further help to reduce coding errors.
- a partial LP mixing code book is shown on the right of FIG. 4C, with 7 different exemplary SGC codes shown using 4 labels.
- FIG. 4D shows an embodiment of FIG. 4C in more detail.
- a specific label anchor LA, the binding site on the amplifier for the label probe
- LPs label probes
- SGC5 is illustrated showing that the amplifiers comprise a plurality of identical LAs, labeled “E.”
- SGC5 is coded with 3 distinct label probes (LI, L3, and L4), all of which have the same binding site for the plurality of “E” LAs on the corresponding amplifiers. Therefore, all three label probes are bound to the amplifiers of SGC5.
- FIG. 4E shows an embodiment of FIG. 4C in more detail.
- the SGC5 of FIG. 4D is shown bound to its respective target nucleic acid, with the label probes having an “E” binding site bound to the respective “E” LAs of the SGC5 amplifiers (as in FIG. 4D).
- SGC 1 coded as shown in FIG. 4D, comprises a plurality of identical LAs, labeled “A.”
- SGC1 is coded with one label probe (LI), which has the binding site for the plurality of “A” LAs of the SGC1 amplifiers.
- LI label probe
- SGC3 coded as shown in FIG. 4D, comprises a plurality of identical LAs, labeled “C.”
- SGC3 is coded with 2 probes corresponding to 2 distinct label probes (LI and L2), both of which have the same binding site for the plurality of “C” LAs on the corresponding SGC3 amplifiers. Therefore, both label probes are bound to the amplifiers of SGC3, thereby labeling the SGC3 target nucleic acid.
- FIG. 4F shows an embodiment of FIG. 4C in more detail.
- FIG. 4F illustrates that, once an SGC for a particular target nucleic acid has been designed, the actual coding for the target nucleic acid can be readily modified simply by changing the labels on the label probes that bind to the amplifiers of a particular SGC.
- SGC2 comprises amplifiers with “B” LAs and is coded using L2.
- the same SGC assembly can be used with respect to the target probes, pre-amplifier, and amplifiers with “B” LAs, but instead of using “B” LA-binding label probes with only L2 as in FIG.
- “B” LA-binding label probes can be used that have a mixture of L2 and L3 such that SGC2 is now coded with both labels.
- labels L2 and L3 are bound to “B” LAs on the SGC2 amplifiers.
- SGC5 comprising amplifiers with “E” LAs is now coded in FIG. 4F by using label probes with “E” LA-binding label probes that have a mixture of labels L2, L3, and L4, instead of labels LI, L3, and L4 as shown in FIG. 4D.
- SGCs for different target nucleic acids will have different numbers of labels in the code.
- the SGC coded with only one label will have a higher number of bound labels than the number of corresponding probes bound to an SGC coded with four labels; for example an SGC coded with only LI, LI can bind to all of the sites on one of the SGCs, but for an SGC coded with LI, L2, L3, and L4, LI can only bind to % of the sites on the SGC.
- two target nucleic acids are shown with two bound SGCs, SGC2 and SGC5.
- SGC2 is coded with L2 and L3
- SGC5 is coded with L2, L3, and L4.
- the number of respective labels that can bind to SGC2 will be higher than the number of respective labels that bind to SGC5 (i.e., the 2 distinct labels for SGC2 (L2 and L3) and the 3 distinct labels for SGC5 (L2, L3, and L4) will be bound to the same number of sites, resulting in a higher number of L2 and L3 being bound to SGC2 than SGC5 since some of the SGC5 sites are occupied by L4).
- the number of labels can be normalized by including “blank” label probes, i.e., probes having a binding site for the respective LAs (in this case “B” for SGC2 and “E” for SGC5) but without a label.
- “blank” label probes i.e., probes having a binding site for the respective LAs (in this case “B” for SGC2 and “E” for SGC5) but without a label.
- “blank” label probes” can be included with the mixture of “B” LA-specific probes so that the intensity of L2 and L3 will be the same on both SGCs (i.e., 1/3 of SGC2 occupied by “blank” label probes and 1/3 of SGC5 occupied by L4).
- the assay can be performed so that the same proportion of “blank” label probes are included in the label probe sets using less than 4 labels, for example, 1/2 “blank” label probes can be included with the SG2-specific label probes coded by 2 distinct labels, and 1/4 “blank” label probes can be included with the SGC5-specific label probes coded by 3 distinct labels, so that the amount of each distinct label probe (LI, L2, L3, and L4) bound to the respective SGCs is the same on each SGC.
- Such “blank” label probes can also be used in combination with distinct label probes to provide for a desired proportion of respective labels, such as a desired ratio of label probes on an SGC.
- the plurality of SGCs comprises: (a) a set of pre-amplifiers, wherein the pre-amplifier set comprises a plurality of pre-amplifiers, wherein the preamplifiers comprise binding sites for the pairs of target probes and a plurality of binding sites for an amplifier; (b) a set of amplifiers, wherein the amplifier set comprises a plurality of amplifiers, wherein the amplifiers comprise a binding site for the pre-amplifiers and a plurality of identical binding sites for a label probe; and (c) a set of label probes, wherein the label probe set comprises a label probe or two or more distinct label probes, wherein each label probe comprises a label and a binding site for the amplifiers, wherein the binding site for the amplifier is the same for each label probe, wherein the labels in each distinct label probe are distinguishable between the distinct label probes; wherein the amplifier in each probe subset specific for a target nucleic acid comprises a binding site for a label or a combination of two or more distinct labels
- Similar approaches can be implemented on other components of the SGC and the two embodiments shown in FIG. 4A and FIG. 4B can be used in combination to ensure the predetermined mixture of LPs can be assembled onto the SGC to generate the independent identifier.
- the methods of the disclosure can be modified to reduce miscoding. Miscoding can occur when the signal from a type of LP designed to be present with a particular target is undetectable, or background noise is misinterpreted as a signal from a particular LP. Errant positive signals can be largely eliminated by setting up an appropriate threshold. Signal level from the area surrounding the image dot as well as the global background level can be used to reference the background. Methods for reducing errant undetected signal-type miscoding are described herein.
- FIG. 4B is implemented as shown in FIG. 6B.
- a similar strategy can be used in a configuration with AP level coding, as illustrated in FIG. 4A.
- An additional method to minimize the potential miscoding caused by truncation is to randomize the position of different labels on the AP or PA that encodes the target specific code, as illustrated in FIG. 7, in which the multiplexing channel ID is encoded on the AP molecule.
- FIG. 7A different label probes are positioned on each AP in exactly the same way, that is, each amplifier is the same. Truncation of the AP, for example, during manufacturing, may cause substantial reduction in certain label compared to others. This imbalance increases the chance of miscoding.
- truncation may cause the loss of all copies of one certain label, leading to an outright miscode.
- locations of different labels on the AP are purposefully randomized. Truncation therefore does not cause a large bias in the numbers and types of labels in the SGC.
- the APs are provided as a plurality of amplifiers, where a mix of non-identical amplifiers is included, where the position of LAs for specific label probes are distributed differently and can be randomized on the non-identical amplifiers. Randomization of different labels in the SGCs can be achieved by using one or combinations of the embodiments described herein.
- miscoding occurs because a label that should be present is absent, the target may be mis-identified as another target with fewer labels in its ID code. In most situations, the probability of miscoding is low (e.g., ⁇ 5%). When the quantities of targets are in a similar level, such miscoding does not significantly impact the results. Miscoding can have a significant impact if one target is present at a significantly higher quantity than the other target that it miscoded into (i.e., one target is miscoded to be misread as another target due to differences in amounts of the two targets). Therefore, one important method to reduce the impact of miscoding is to assign ID codes with fewer labels to higher quantity targets if the relative quantities of the targets are known.
- the independent identifier of gene targets 2-5 are each generated using a single label
- the independent identifier in gene targets 2-6 are each generated using a single label.
- a parity check can ensure accurate data transmission during communication.
- a parity bit is appended to the original data bits to make an even or odd number of total data bits.
- the signal from one of the N Labels can be used as a “parity check” bit (LI in the example codebook in Table 3).
- the bit will be made “X” (present) or (absent) to make the total number of labels in the N label system odd (i.e., odd parity check) or even (i.e., even parity check).
- the detected target may or may not be counted.
- This parity check scheme can detect single or odd number of bit errors but cannot detect double or even number bit errors. This can substantially reduce the probability of miscoding. For example, if the chance of single bit error is 5%, the chance of double bit error is theoretically 0.25%.
- the price for using such a parity check is that the number of multiplexing channels is reduced to 2 N 1 -1 or 2 N 1 if an even or odd parity check scheme is adopted, respectively (e.g., 7 or 8 SGC codes in Table 3 compared to 15 SGC codes in Table la).
- a code book can be designed to include an odd (or even) number of labels. In the example shown in Table la, for an odd number, the allowed channels from Table la would be those for gene targets 2, 3, 4, 5, 12, 13, 14, and 15. If one of the other channels is detected, then it must be an error.
- a specific target nucleic acid can be labeled with more than one distinct label.
- a single “dot” will be comprised of two or more distinct labels.
- a dot comprising more than one label can be deconvoluted to identify the individual labels in the dot using well known methods.
- well known methods include the Richardson-Lucy deconvolution algorithm (see Example I) as described previously (Biggs et al., Applied Optics, Vol. 36, No. 8, (1997); Hanisch et al., “Deconvolutions of Hubble Space Telescope Images and Spectra, Deconvolution of Images and Spectra,” Ed. P.A. Jansson, 2nd ed., Academic Press CA, (1997)).
- deconvolution includes, but are not limited to, Wiener deconvolution, regularized filter deconvolution, and the like (Gonzalez et al. , “Digital Image Processing,” Addison-Wesley Publishing Company, Inc. (1992)).
- FIG. 3-7 show SGCs with preamplifiers, amplifiers and label probes. It is understood that the same principles can be applied to an SGC where a pre-pre-amplifier component is included in the SGC, as disclosed herein (see, for example, FIGS. 5B, 5C, 6B and 6C for examples of SGCs with a pre-pre-amplifier layer).
- the SGCs that can be used in the disclosed methods comprise: (a) a set of pre-pre-amplifiers, wherein the pre-pre-amplifier set comprises one or more subsets of pre-pre-amplifiers, wherein the one or more pre-pre-amplifier subsets comprise a pre-pre- amplifier specific for each of the target probe pairs in the one or more target probe sets, wherein each pre-pre-amplifier comprises binding sites for the pair of target probes of one of the target probe sets and a plurality of binding sites for a pre-amplifier; (b) a set of pre-amplifiers, wherein the pre-amplifier set comprises one or more subsets of pre-amplifiers, wherein the one or more pre-amplifier subsets comprise a pre-amplifier specific for the pre-pre-amplifiers in the one or more pre-pre-amplifier subsets, wherein each pre-amplifier comprises binding sites for the pre- pre-amplifiers of one of the pre
- the set of target probes, pre-amplifiers, amplifiers and label probes each comprise two or more subsets.
- the set of target probes, pre-amplifiers, amplifiers and label probes each comprise three or more subsets.
- the set of target probes, pre-amplifiers, amplifiers and label probes each comprise four or more subsets.
- the target probe binding sites for the two or more subsets are intermingled on the target nucleic acid (see FIG. 5, top panel, except a pre-pre-amplifier is included in the SGC).
- the SGCs that can be used in the disclosed methods comprise: (a) a set of pre-pre-amplifiers, wherein the pre-pre-amplifier set comprises a plurality of pre- pre-amplifiers, wherein the pre-pre-amplifiers comprise binding sites for the pairs of target probes and a plurality of binding sites for a pre-amplifier; (b) a set of pre-amplifiers, wherein the pre-amplifier set comprises a plurality of pre-amplifiers, wherein the pre-amplifiers comprise binding sites for the pre-pre-amplifiers and a plurality of binding sites for an amplifier; (c) a set of amplifiers, wherein the amplifier set comprises a plurality of amplifiers, wherein the amplifiers comprise a binding site for the pre-amplifiers and a plurality of binding sites for a label probe or two or more distinct label probes; and d) a set of label probes, wherein the label probe set comprises a label probe or two or more distinct label probe
- the label probe set comprises two or more distinct label probes, wherein the amplifier set comprises a plurality of non-identical amplifiers, and wherein the binding sites for the two or more distinct label probes on each non-identical amplifier are in a different order on each non-identical amplifier (similar to FIG. 7B except with a pre-pre- amplifier in the SGC).
- the SGCs that can be used in the disclosed methods comprise: (a) a set of pre-pre-amplifiers, wherein the pre-pre-amplifier set comprises a plurality of pre- pre-amplifiers, wherein each pre-pre-amplifier comprises binding sites for the pairs of target probes and a plurality of binding sites for a pre-amplifier; (b) a set of pre-amplifiers, wherein the pre-amplifier set comprises a plurality of pre-amplifiers, wherein the pre-amplifiers comprise binding sites for the pre-pre-amplifiers and a plurality of binding sites for amplifiers; (c) a set of amplifiers, wherein the amplifier set comprises a plurality of amplifiers, wherein the plurality of amplifiers comprise an amplifier comprising a binding site for the preamplifiers and a plurality of binding sites for a label probe, or wherein the plurality of amplifiers comprise two or more distinct amplifiers, wherein each distinct amplifier comprises a binding site for the pre-ampli
- the plurality of amplifiers comprise two or more distinct amplifiers, and wherein the binding sites on the pre-amplifier for the distinct amplifiers are intermingled (similar to FIG. 6B except with a pre-pre-amplifier layer in the SGC).
- the SGCs that can be used in the disclosed methods comprise: (a) a set of pre-pre-amplifiers, wherein the pre-pre-amplifier set comprises a plurality of pre- pre-amplifiers, wherein the pre-pre-amplifiers comprise binding sites for the pairs of target probes and a plurality of binding sites for a pre-amplifier or for two or more distinct preamplifiers; (b) a set of pre-amplifiers, wherein the pre-amplifier set comprises a plurality of pre-amplifiers, wherein the plurality of pre-amplifiers comprise a pre-amplifier comprising a binding site for the pre-pre-amplifiers and a plurality of binding sites for an amplifier, or wherein the plurality of pre-amplifiers comprise two or more distinct pre-amplifiers, wherein each distinct pre-amplifier comprises a binding site for the pre-pre-amplifiers and a plurality of binding sites for a distinct amplifier; (c) a set of amplifiers
- This embodiment is similar to FIG. 4B, except that the combinatorial labeling is implemented at the level of one or more distinct pre-amplifiers binding to the pre-pre-amplifier, rather than at the level of one or more distinct amplifiers binding to the pre-amplifier, as shown in FIG. 4B.
- the plurality of pre-amplifiers comprise two or more distinct preamplifiers, and wherein the binding sites on the pre-pre-amplifier for the distinct pre-amplifiers are intermingled.
- This embodiment is similar to FIG. 6B except that the combinatorial labeling is implemented at the level of one or more distinct pre-amplifiers binding to the pre-pre- amplifier, rather than at the level of one or more distinct amplifiers binding to the pre-amplifier, as shown in FIG. 6B.
- the SGCs that can be used in the disclosed methods comprise: (a) a set of pre-pre-amplifiers, wherein the pre-pre-amplifier set comprises a plurality of pre- pre-amplifiers, wherein the pre-pre-amplifiers comprise binding sites for the pairs of target probes and a plurality of binding sites for a pre-amplifier; (b) a set of pre-amplifiers, wherein the pre-amplifier set comprises a plurality of pre-amplifiers, wherein the pre-amplifiers comprise binding sites for the pairs of target probes and a plurality of binding sites for an amplifier; (c) a set of amplifiers, wherein the amplifier set comprises a plurality of amplifiers, wherein the amplifiers comprise a binding site for the pre-amplifiers and a plurality of identical binding sites for a label probe; and (d) a set of label probes, wherein the label probe set comprises a label probe or two or more distinct label probes, wherein each label probe comprises
- the components of the SGC are generally bound directly to each other.
- the binding reaction is generally by hybridization.
- the binding between the components is direct.
- an intermediary component can be included such that the binding of one component to another is indirect, for example, the intermediary component contains complementary binding sites to bridge two other components.
- the segments of complementary nucleic acid sequence between the components is generally in the range of 10 to 50 nucleotides, or greater, for example, 16 to 30 nucleotides, such as 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 nucleotides, or greater.
- the segments of complementary nucleic acid sequence between the components is generally in the range of 5 to 18 nucleotides, for example, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17 or 18 nucleotides.
- the nucleotide lengths can be somewhat shorter or longer for a stable or unstable hybridization, depending on the sequence (for example, GC content) and the conditions employed in the assay.
- modified nucleotides such as Locked Nucleic Acid (LNA) or Bridged Nucleic Acid (BNA) can be used to increase the binding strength at the modified base, thereby allowing length of the binding segment to be reduced.
- nucleic acid segments that are complementary to other nucleic acid segments can be reduced further, if desired.
- a person skilled in the art can readily determine appropriate probe designs, including length, the presence of modified nucleotides, and the like, to achieve a desired interaction between nucleic acid components.
- the complementary sequences can optionally be designed to maximize the difference in melting temperature (dTm). This can be done by using melting temperature calculation algorithms known in the art (see, for example, SantaLucia, Proc. Natl. Acad. Sci. U.S.A. 95: 1460-1465 (1998)).
- artificial modified bases such as Locked Nucleic Acid (LNA) or bridged nucleic acid (BNA) and naturally occurring 2'-O-methyl RNA are known to enhance the binding strength between complementary pairs (Petersen and Wengel, Trends Biotechnol. 21:74-81 (2003); Majlessi et al., Nucl. Acids Res. 26:2224-2229 (1998)). These modified bases can be strategically introduced into the binding site between components of an SGC, as desired.
- LNA Locked Nucleic Acid
- BNA bridged nucleic acid
- 2'-O-methyl RNA naturally occurring 2'-O-methyl RNA
- LNA, BNA or 2'-O-methyl RNA modified nucleotides
- each modified base can increase the melting temperature, the length of binding regions between two nucleic acid sequences (i.e., complementary sequences) can be substantially shortened.
- the binding strength of a modified base to its complement is stronger, and the difference in melting temperatures (dTm) is increased.
- Yet another embodiment is to use three modified bases (for example, three LNA, BNA or 2'-O-methyl RNA bases, or a combination of two or three different modified bases) in the complementary sequences of a nucleic acid component or between two nucleic acid components, for example of a signal generating complex (SGC), that are to be hybridized.
- SGC signal generating complex
- Such components can be, for example, a pre-pre- amplifier, a pre-amplifier, an amplifier, a label probe, or a pair of target probes.
- the modified bases can be used in the segments of selected components of SGC, in particular those mediating binding between nucleic acid components, which increases the binding strength of the base to its complementary base, allowing a reduction in the length of the complementary segments (see, for example, Petersen and Wengel, Trends Biotechnol. 21:74-81 (2003); US Patent No. 7,399,845).
- Artificial bases that expand the natural 4-letter alphabet such as the Artificially Expanded Genetic Information System (AEGIS; Yang et al., Nucl. Acids Res. 34 (21): 6095-6101 (2006)) can be incorporated into the binding sites among the interacting components of the SGC. These artificial bases can increase the specificity of the interacting components, which in turn can allow lower stringency hybridization reactions to yield a higher signal.
- each label probe comprises a segment that is detectable.
- a “label” is a moiety that facilitates detection of a molecule. Common labels include fluorescent, luminescent, light-scattering, and/or colorimetric labels. Suitable labels include enzymes, and fluorescent and chromogenic moieties, as well as radionuclides, substrates, cofactors, inhibitors, chemiluminescent moieties, magnetic particles, rare earth metals, metal isotopes, and the like. In a particular embodiment, the label is an enzyme.
- Exemplary enzyme labels include, but are not limited to horseradish peroxidase (HRP), alkaline phosphatase (AP), P-galactosidase, glucose oxidase, and the like, as well as various proteases.
- Other labels include, but are not limited to, fluorophores, dinitrophenyl (DNP), and the like. Labels are known to those skilled in the art, as described, for example, in Hermanson, Bioconjugate Techniques, Academic Press, San Diego (1996), and U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; and 4,366,241.
- the enzyme can utilize a chromogenic or Anorogenic substrate to produce a detectable signal, as described herein.
- Exemplary labels are described herein. Any of a number of enzymes or non-enzyme labels can be utilized so long as the enzymatic activity or non-enzyme label, respectively, can be detected.
- the enzyme thereby produces a detectable signal, which can be utilized to detect a target nucleic acid.
- Particularly useful detectable signals are chromogenic or Anorogenic signals.
- particularly useful enzymes for use as a label include those for which a chromogenic or Anorogenic substrate is available. Such chromogenic or Anorogenic substrates can be converted by enzymatic reaction to a readily detectable chromogenic or Auorescent product, which can be readily detected and/or quantified using microscopy or spectroscopy.
- Such enzymes are known to those skilled in the art, including but not limited to, horseradish peroxidase, alkaline phosphatase, -galactosidase, glucose oxidase, and the like (see Hermanson, Bioconjugate Techniques, Academic Press, San Diego (1996)).
- chromogenic or Auorogenic substrates include various peptidases, where chromogenic or Auorogenic peptide substrates can be utilized to detect proteolytic cleavage reactions.
- the use of chromogenic and Auorogenic substrates is also known in bacterial diagnostics, including but not limited to the use of a- and -galactosidase, P-glucuronidase, 6-phospho-P-D-galactoside 6-phosphogalactohydrolase, P-glucosidase, a-glucosidase, amylase, neuraminidase, esterases, lipases, and the like (ManaA eta/., Microbiol. Rev. 55:335-348 (1991)), and such enzymes with known chromogenic or Auorogenic substrates can readily be adapted for use in methods provided herein.
- chromogenic or Auorogenic substrates to produce detectable signals are known to those skilled in the art and are commercially available.
- Exemplary substrates that can be utilized to produce a detectable signal include, but are not limited to, 3, 3 '-diaminobenzidine (DAB), 3, 3 ’,5, 5 ’-tetramethylbenzidine (TMB), chloronaphthol (4-CN)(4-chloro-l -naphthol), 2,2'-azino-bis(3-ethylbenzothiazoline-6-sulphonic acid) (ABTS), o-phenylenediamine dihydrochloride (OPD), and 3-amino-9-ethylcarbazole (AEC) for horseradish peroxidase; 5- bromo-4-chloro-3-indolyl-l -phosphate (BCIP), nitroblue tetrazolium (NBT), Fast Red (Fast Red TR/AS-MX), and
- Exemplary Auorogenic substrates include, but are not limited to, 4- (triAuoromethyl)umbelliferyl phosphate for alkaline phosphatase; 4-methylumbelliferyl phosphate bis (2-amino- 2-methyl- 1,3 -propanediol), 4-methylumbelliferyl phosphate bis (cyclohexylammonium) and 4-methylumbelliferyl phosphate for phosphatases; QuantaBluTM and Quintolet for horseradish peroxidase; 4-methylumbelliferyl P-D-galactopyranoside, fluorescein di(P-D-galactopyranoside) and naphthofluorescein di-(P-D-galactopyranoside) for P-galactosidase; 3-acetylumbelliferyl P-D-glucopyranoside and 4-methylumbelliferyl-P- D- glucopyranoside for P-glucosidase; and 4-methylumbelliferyl-a
- Exemplary enzymes and substrates for producing a detectable signal are also described, for example, in US publication 2012/0100540.
- Various detectable enzyme substrates including chromogenic or Anorogenic substrates, are known and commercially available (Pierce, Rockford IL; Santa Cruz Biotechnology, Dallas TX; Invitrogen, Carlsbad CA; 42 Life Science; Biocare).
- the substrates are converted to products that form precipitates that are deposited at the site of the target nucleic acid.
- exemplary substrates include, but are not limited to, HRP-Green (42 Life Science), Betazoid DAB, Cardassian DAB, Romulin AEC, Bajoran Purple, Vina Green, Deep Space BlackTM, Warp RedTM, Vulcan Past Red and Ferangi Blue from Biocare (Concord CA; biocare.net/products/detection/chromogens).
- Exemplary rare earth metals and metal isotopes suitable as a detectable label include, but are not limited to, lanthanide (III) isotopes such as 141 Pr, 142 Nd, 143 Nd, 144 Nd, 145 Nd, 146 Nd, 147 Sm, 148 Nd, 149 Sm, 150 Nd, 151 Eu, 152 Sm, 153 Eu, 154 Sm, 155 Gd, 156 Gd, 158 Gd, 159 Tb, 160 Gd, 161 Dy, 162 Dy, 163 Dy, 164 Dy, 165 Ho, 166 Er, 167 Er, 168 Er, 169 Tm, 170 Er, 171 Yb, 172 Yb, 173 Yb, 174 Yb, 175 Lu, and 176 Yb.
- III lanthanide
- Metal isotopes can be detected, for example, using time-of-flight mass spectrometry (TOF-MS) (for example, Fluidigm Helios and Hyperion systems, fluidigm.com/systems; South San Francisco, CA).
- TOF-MS time-of-flight mass spectrometry
- Biotin-avidin (or biotin-streptavidin) is a well-known signal amplification system based on the fact that the two molecules have extraordinarily high affinity to each other, and that one avidin/streptavidin molecule can bind four biotin molecules.
- Antibodies are widely used for signal amplification in immunohistochemistry and ISH.
- Tyramide signal amplification (TSA) is based on the deposition of a large number of haptenized tyramide molecules by peroxidase activity. Tyramine is a phenolic compound.
- HRP horseradish peroxidase
- the activated substrate molecules then very rapidly react with and covalently bind to electron-rich moieties of proteins, such as tyrosine, at or near the site of the peroxidase binding site.
- proteins such as tyrosine
- many hapten molecules conjugated to tyramide can be introduced at the hybridization site in situ.
- the deposited tyramide-hapten molecules can be visualized directly or indirectly.
- a detection system is described in more detail, for example, in U.S. Patent No. 8,658,361.
- Embodiments described herein can utilize enzymes to generate a detectable signal using appropriate chromogenic or fluorogenic substrates.
- a label probe can have a detectable label directly coupled to the nucleic acid portion of the label probe.
- exemplary detectable labels are known to those skilled in the art, including but not limited to chromogenic or fluorescent labels (see Hermanson, Bioconjugate Techniques, Academic Press, San Diego (1996)).
- fluorophores useful as labels include, but are not limited to, rhodamine derivatives, for example, tetramethylrhodamine, rhodamine B, rhodamine 6G, sulforhodamine B, Texas Red (sulforhodamine 101), rhodamine 110, and derivatives thereof such as tetramethylrhodamine-5-(or 6), lissamine rhodamine B, and the like; 7-nitrobenz-2- oxa- 1,3 -diazole (NBD); fluorescein and derivatives thereof; naphthalenes such as dansyl (5- dimethylaminonapthalene-1 -sulfonyl); coumarin derivatives such as 7-amino-4- methylcoumarin-3 -acetic acid (AMCA), 7-diethylamino-3-[(4'-(iodoacetyl)amino)phenyl]-4--
- Particular dyes include ATTO 390, DyLight 395XL, Alexa Fluor 405, ATTO 425, Alexa Fluor 430, DyLight430, ATTO 465, ATTO 488, Alexa Fluor 488, ATTO 490LS, ATTO 495, ATTO 514, ATTO 520, ATTO 532, Alexa Fluor 532, ATTO Rho6G, ATTO 542, Alexa Fluor 546, ATTO 550, DyLight 550, Alexa Fluor 555, ATTO 565, Alexa Fluor 568, ATTO Rho3B, ATTO Rhol l, ATTO Rhol2, ATTO Thiol2, ATTO RholOl, ATTO 590, ATTO 594, Alexa Fluor 594, Dylight 594, ATTO Rhol3, ATTO 610, Alexa Fluor 610, ATTO 620, ATTO Rhol4, ATTO 633, Alexa Fluor 633, Alexa Fluor 635, ATTO 643, ATTO 647, Alexa Fluor 647,
- Exemplary chromophores include, but are not limited to, phenolphthalein, malachite green, nitroaromatics such as nitrophenyl, diazo dyes, dabsyl (4- dimethylaminoazobenzene-4'-sulfonyl), and the like.
- the methods provided herein can be used for concurrent detection of multiple target nucleic acids.
- the fluorophores to be used for detection of multiple target nucleic acids are selected so that, within each round of labeling and detection, each of the fluorophores is distinguishable and can be detected concurrently.
- Such fluorophores are selected to have spectral separation of the emissions so that distinct labeling of the target nucleic acids can be detected concurrently.
- 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 distinguishable fluorophores can be used.
- selection of one to six fluorophores with excitation maxima at or near 488 nm, 550 nm, 594 nm, 650 nm, 700 nm, and 750 nm may be suitable in combination.
- fluorophores with excitation maxima at or near 488 nm include, e.g., Alexa Fluor 488, DyLight 488, ATTO 488, and Cy2.
- fluorophores with excitation maxima at or near 550 nm include, e.g., DyLight 550, Alexa Fluor 546, ATTO 550, Cy3, and rhodamine.
- fluorophores with excitation maxima at or near 594 nm include, e.g., Dylight 594, Alexa Fluor 594, ATTO 594, and Texas Red.
- fluorophores with excitation maxima at or near 650 nm include, e.g., DyLight 650, ATTO 647, Alexa Fluor 647, ATTO 647N, and ATTO 655.
- fluorophores with excitation maxima at or near 700 nm include, e.g., ATTO 700, DyLight700, and AlexaFluor700.
- fluorophores with excitation maxima at or near 750 nm include, e.g., Alexa Fluor 750, Cy7, and DyLight 755, and ATTO 740.
- Methods such as microscopy, cytometry e.g., mass cytometry, cytometry by time of flight (CyTOF), flow cytometry), or spectroscopy can be used to visualize chromogenic, fluorescent, or metal detectable signals associated with the respective target nucleic acids.
- cytometry e.g., mass cytometry, cytometry by time of flight (CyTOF), flow cytometry
- spectroscopy can be used to visualize chromogenic, fluorescent, or metal detectable signals associated with the respective target nucleic acids.
- cytometry e.g., mass cytometry, cytometry by time of flight (CyTOF), flow cytometry
- spectroscopy can be used to visualize chromogenic, fluorescent, or metal detectable signals associated with the respective target nucleic acids.
- chromogenic substrates or Anorogenic substrates, or chromogenic or Auorescent labels, or rare earth metal isotopes will be utilized for a particular assay, if different labels are
- the label can be designed such that the labels are optionally cleavable.
- a “cleavable label” refers to a label that is attached or conjugated to a label probe so that the label can be removed, for example, in order to use the same label in a subsequent round of labeling and detecting of target nucleic acids.
- the cleavable labels are conjugated to the label probe by a chemical linker that is cleavable.
- Cleavable chemical linkers can include a cleavable chemical moiety, such as disulfides, which can be cleaved by reduction, glycols or diols, which can be cleaved by periodate, diazo bonds, which can be cleaved by dithionite, esters, which can be cleaved by hydroxylamine, sulfones, which can be cleaved by base, and the like (see Hermanson, supra, 1996).
- One particularly useful cleavable linker is a linker containing a disulfide bond, which can be cleaved by reducing the disulfide bond.
- the linker can include a site for cleavage by an enzyme.
- the linker can contain a proteolytic cleavage site.
- a cleavage site is for a sequence-specific protease.
- proteases include, but are not limited to, human rhinovirus 3C protease (cleavage site LEVLFQ/GP), enterokinase (cleavage site DDDDK/), factor X a (cleavage site IEGR/), tobacco etch virus protease (cleavage site ENLYFQ/G), and thrombin (cleavage site LVPR/GS) (see, for example, Oxford Genetics, Oxford, UK).
- cleavable moiety can be, for example, uracil-DNA (DNA containing uracil), which can be cleaved by uracil-DNA glycosylase (UNG) (see, for example, Sidorenko et al., FEBS Lett. 582(3 ):410—404 (2008)).
- UNG uracil-DNA glycosylase
- the cleavable labels can be removed by applying an agent, such as a chemical agent or light, to cleave the label and release it from the label probe.
- agent such as a chemical agent or light
- useful cleaving agents for chemical cleavage include, but are not limited to, reducing agents, periodate, dithionite, hydroxylamine, base, and the like (see Hermanson, supra, 1996).
- One useful method for cleaving a linker containing a disulfide bond is the use of tris(2-carboxyethyl)phosphine (TCEP) (see Moffitt et al., Proc. Natl. Acad. Sci. USA 113: 11046-11051 (2016)).
- TCEP tris(2-carboxyethyl)phosphine
- Methods disclosed herein include a step of removing signals generated by the detectable labels from the sample.
- the step of removing signals generated by the detectable labels comprises removing the labels from the label probes, e.g., by cleaving cleavable labels as discussed above.
- the step of removing the signals generated by the detectable labels comprises removing the SGCs from the sample, e.g., by disrupting binding of the SGCs bound to the target nucleic acid.
- the SGCs are removed from the sample by contacting the sample with an acid reagent that disrupts hybridization between the SGCs and the target nucleic acid.
- the acid reagent comprises an acid.
- Exemplary acids suitable for use in an acid reagent include, but are not limited to, formic acid, acetic acid, propionic acid, butyric acid, valeric acid, oxalic acid, malonic acid, succinic acid, malic acid, tartaric acid, citric acid, and the like.
- the acid reagent generally includes an acid at a concentration of about 5-40% acid (vol/vol). In some embodiments, the acid reagent comprises an acid at a concentration of about 20-30% (vol/vol).
- the acid reagent comprises an acid at a concentration of about 5%, about 6%, about 7%, about 8%, about 9%, about 10%, about 11%, about 12%, about 13%, about 14%, about 15%, about 16%, about 17%, about 18%, about 19%, about 20%, about 21%, about 22,%, about 23%, about 24%, about 25%, about 26%, about 27%, about 28%, about 29%, about 30%, about 31%, about 32%, about 33%, about 34%, about 35%, about 36%, about 37%, about 38%, about 39%, or about 40% acid (vol/vol).
- the acid reagent comprises acetic acid at a concentration of about 5- 40%, or about 20-30%, e.g., about 5%, about 6%, about 7%, about 8%, about 9%, about 10%, about 11%, about 12%, about 13%, about 14%, about 15%, about 16%, about 17%, about 18%, about 19%, about 20%, about 21%, about 22,%, about 23%, about 24%, about 25%, about 26%, about 27%, about 28%, about 29%, about 30%, about 31%, about 32%, about 33%, about 34%, about 35%, about 36%, about 37%, about 38%, about 39% or 40% acetic acid (% vol/vol), or any concentration therebetween.
- acetic acid % vol/vol
- the acid reagent further comprises one or more components selected from salts, chelating agents, buffers, and any combination thereof.
- the acid reagent further comprises saline sodium citrate (SSC), where 20X SSC corresponds to 3.0 M NaCl and 0.3 M sodium citrate, at pH 7.0 (see Sambrook et al., Molecular Cloning: A Laboratory Manual, Third Ed., Cold Spring Harbor Laboratory, New York (2001); Ausubel et al., Current Protocols in Molecular Biology, John Wiley and Sons, Baltimore, MD (1999)).
- the acid reagent further comprises IX SSC to 13X SSC.
- the acid reagent further comprises SSC at IX, 1.1X, 1.2X, 1.3X, 1.4X, 1.5X, 1.6X, 1.7X, 1.8X, 1.9X, 2X, 2.1X, 2.2X, 2.3X, 2.4X, 2.5X, 2.6X, 2.7X, 2.8X, 2.9X, 3X, 3. IX, 3.2X, 3.3X, 3.4X, 3.5X, 3.6X, 3.7X, 3.8X, 3.9X, 4X, 4. IX, 4.2X, 4.3X, 4.4X, 4.5X, 4.6X, 4.7X, 4.8X, 4.9X, 5X, 5.
- the acid reagent further comprises saline sodium phosphate EDTA (SSPE), which is a mixture of sodium chloride, sodium phosphate, and ethylenediaminetetraacetic acid (EDTA), where 20X SSPE corresponds to 3.0 M sodium chloride, 0.2 M sodium hydrogen phosphate (NaEEPC ), and 0.02 M ethylenediaminetetraacetic acid (EDTA), pH 7.4 (see Sambrook et al., Molecular Cloning: A Laboratory Manual, Third Ed., Cold Spring Harbor Laboratory, New York (2001)).
- the acid reagent further comprises IX SSPE to 13X SSPE.
- the acid reagent comprises SSPE at IX, 1.1X, 1.2X, 1.3X, 1.4X, 1.5X, 1.6X, 1.7X, 1.8X, 1.9X, 2X, 2. IX, 2.2X, 2.3X, 2.4X, 2.5X, 2.6X, 2.7X, 2.8X, 2.9X, 3X, 3. IX, 3.2X, 3.3X, 3.4X, 3.5X, 3.6X, 3.7X, 3.8X, 3.9X, 4X, 4. IX, 4.2X, 4.3X, 4.4X, 4.5X, 4.6X, 4.7X, 4.8X, 4.9X, 5X, 5.
- the acid reagent further comprises a phosphate buffer, e.g., 10- 500 mM sodium phosphate, with a pH in the range of about 7 to about 8 (e.g., a pH of about 7.8).
- the acid reagent further comprises sodium phosphate at about 10 mM, about 20 mM, about 30 mM, about 40 mM, about 50 mM, about 60 mM, about 70 mM, about 80 mM, about 90 mM, about 100 mM, about 110 mM, about 120 mM, about 130 mM, about 140 mM, about 150 mM, about 160 mM, about 170 mM, about 180 mM, about 190 mM, about 200 mM, about 210 mM, about 220 mM, about 230 mM, about 240 mM, about 250 mM, about 260 mM, about 270 mM, about 280
- the acid reagent further comprises sodium chloride (NaCl), e.g., at a concentration of about 10 mM to about 6 M.
- the acid reagent comprises sodium chloride at a concentration of about 10 mM, about 50 mM, about 100 mM, about 150 mM, about 200 mM, about 250 mM, about 300 mM, about 350 mM, about 400 mM, about 450 mM, about 500 mM, about 550 mM, about 600 mM, about 650 mM, about 700 mM, about 750 mM, about 800 mM, about 850 mM, about 900 mM, about 950 mM, about 1 M, about 1.1 M, about 1.2 M, about 1.3 M 1.4 M, about 1.5 M, about 1.6 M, about 1.7 M, about 1.8 M, about 1.9 M, about 2 M, about 2.1 M, about 2.2 M, about 2.3 M,
- the acid reagent comprises 5-40% acid and 1X-12.8X SSC, wherein the acid and the SSC can each be present independently at any concentration disclosed herein.
- the acid reagent comprises 20-30% acid and 3.2X- 12.8X SSC.
- the acid reagent comprises 20% acid and 3.2X SSC.
- the acid reagent comprises 20% acid and 6.4X SSC.
- the acid reagent comprises 20% acid and 12.8X SSC.
- the acid reagent comprises 30% acid and 3.2X SSC.
- the acid reagent comprises 30% acid and 6.4X SSC.
- the acid reagent comprises 30% acid and 12.8X SSC.
- the acid reagent comprises 5-40% acetic acid and 1X-12.8X SSC, wherein the acetic acid and the SSC can each be present independently at any concentration disclosed herein.
- the acid reagent comprises 20% acetic acid and 3.2X SSC.
- the acid reagent comprises 20% acetic acid and 6.4X SSC.
- the acid reagent comprises 20% acetic acid and 12.8X SSC.
- the acid reagent comprises 30% acetic acid and 3.2X SSC.
- the acid reagent comprises 30% acetic acid and 6.4X SSC.
- the acid reagent comprises 30% acetic acid and 12.8X SSC.
- the step of removing SGCs from a sample by adding an acid reagent to effect disruption of hybridization between the SGCs and the target nucleic acids in the sample is carried out at room temperature. In other embodiments, this step can be conducted at a temperature just below or above room temperature.
- the acid reagent can be applied to a sample, for example, at temperature of about 4°C to about 40°C.
- the methods can be carried out at a temperature of about 4°C, about 5°C, about 6°C, about 7°C, about 8°C, about 9°C, about 10°C, about 11°C, about 12°C, about 13°C, about 14°C, about 15°C, about 16°C, about 17°C, about 18°C, about 19°C, about 20°C, about 21°C, about 22°C, about 23 °C, about 24 °C, about 25 °C, about 26 °C, about 27 °C, about 28 °C, about 29 °C, about 30 °C, about 31 °C, about 32 °C, about 33 °C, about 34 °C, about 35 °C, about 36 °C, about 37 °C, about 38 °C, about 39 °C, about 40 °C, or any range therebetween.
- the step of removing SGCs from a sample by adding an acid reagent to effect disruption of hybridization between the SGCs and the target nucleic acids in the sample is carried out for a period of time of about 1 minute to about 30 minutes, or about 3 minutes to about 10 minutes, for example, about 1 minute, about 2 minute, about 3 minute, about 4 minute, about 5 minute, about 6 minute, about 7 minute, about 8 minute, about 9 minute, about 10 minutes, about 11 minutes, about 12 minutes, about 13 minutes, about 14 minutes, about 15 minutes, about 16 minutes, about 17 minutes, about 18 minutes, about 19 minutes, about 20 minutes, about 21 minutes, about 22 minutes, about 23 minutes, about 24 minutes, about 25 minutes, about 26 minutes, about 27 minutes, about 28 minutes, about 29 minutes, about 30 minutes, or any range therebetween.
- the acid reagent treatment is repeated 1 to 10 times.
- the step of removing SGCs from a sample by adding an acid reagent to effect disruption of hybridization between the SGCs and the target nucleic acids in the sample is carried out 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or 11 times (i.e., repeated up to 10 times). In some embodiments, this step is carried out 1, 2, 3, 4, 5 or 6 times (i.e., repeated up to 5 times). In some embodiments, this step is carried out 1, 2, or 3 times (i.e., repeated up to 2 times).
- the acid reagent is contacted with the sample sequentially without removing the acid reagent (for example, by aspirating the acid reagent from the sample) or washing the sample (for example, washing the sample between applications of the acid reagent).
- the acid reagent can be removed from contact with the sample, for example, by aspirating the acid reagent away from the sample or washing the sample with a suitable buffer.
- Suitable wash buffers include, but are not limited to, a buffer used routinely in in situ hybridization assays.
- the conditions for removal of probes bound to target nucleic acids in a sample can be readily determined depending on the components and concentration of components of the acid reagent, the time of incubation of the acid reagent with the sample, and the number of times the incubation is repeated, as disclosed herein.
- the effectiveness of the removal of the probes from a sample can be readily determined by analyzing the sample using the same method used to detect a target nucleic acid to see if residual probe can be detected (see Examples I and II). If residual probe is still present, the acid reagent treatment merely needs to be repeated until the previously detected probes are no longer detected or are detected at a sufficiently low level to permit detectable labeling of a target nucleic acid with the same label in a subsequent round of labeling.
- the number of times that a sample can be treated with the acid reagent while preserving cell morphology and integrity of the nucleic acids in the sample to permit subsequent detection of nucleic acids can be readily determined by performing repeated acid reagent treatments of a sample for a given acid reagent and under a given set of conditions and determining whether or not target nucleic acids can still be detected, for example, by determining the ability to detect a positive control nucleic acid in the sample or determining that a similar cell morphology can be detected in the sample after the sample has been treated one or more times with the acid reagent (see Examples I and II).
- the target nucleic acids detected by the methods of the invention can be any nucleic acid present in the cell sample, including but not limited to, RNA, including messenger RNA (mRNA), micro RNA (miRNA), ribosomal RNA (rRNA), mitochondrial RNA, non-coding RNA, and the like, or DNA, and the like.
- the nucleic acid is RNA.
- the target nucleic acids can independently be DNA or RNA.
- the target nucleic acids to be detected can be, but are not necessarily, the same type of nucleic acid.
- the target nucleic acids to be detected in an assay of the invention can be DNA and RNA.
- the target nucleic acids can independently be selected from the group consisting of messenger RNA (mRNA), micro RNA (miRNA), ribosomal RNA (rRNA), mitochondrial RNA, and non-coding RNA.
- mRNA messenger RNA
- miRNA micro RNA
- rRNA ribosomal RNA
- mitochondrial RNA mitochondrial RNA
- non-coding RNA non-coding RNA.
- the target nucleic acids can independently be DNA or any type of RNA.
- ISH in situ hybridization
- the probe types can be double stranded DNA (dsDNA), single stranded DNA (ssDNA), single stranded complimentary RNA (sscRNA), messenger RNA (mRNA), micro RNA (miRNA), ribosomal RNA, mitochondrial RNA, and/or synthetic oligonucleotides.
- dsDNA double stranded DNA
- ssDNA single stranded DNA
- sscRNA single stranded complimentary RNA
- mRNA messenger RNA
- miRNA micro RNA
- ribosomal RNA mitochondrial RNA
- synthetic oligonucleotides synthetic oligonucleotides.
- FISH fluorescent in situ hybridization
- CISH chromogenic in situ hybridization
- ISH, FISH and CISH methods are well known to those skilled in the art (see, for example, Staler, Clinics in Laboratory Medicine 10(l):215-236 (1990); In situ hybridization. A practical approach, Wilkinson, ed., IRL Press, Oxford (1992); Schwarzacher and Heslop- Harrison, Practical in situ hybridization, BIOS Scientific Publishers Ltd, Oxford (2000)).
- the sample comprises cells that are optionally fixed and/or permeabilized before hybridization of the target probes.
- Fixing and permeabilizing cells can facilitate retaining the nucleic acid targets in the cell and permit the target probes, label probes, amplifiers, pre-amplifiers, pre-pre-amplifiers, and so forth, to enter the cell and reach the target nucleic acid molecule.
- the cell is optionally washed to remove materials not captured to a nucleic acid target.
- the cell can be washed after any of various steps, for example, after hybridization of the target probes to the nucleic acid targets to remove unbound target probes, after hybridization of the pre-pre-amplifiers, preamplifiers, amplifiers, and/or label probes to the target probes, and the like.
- Methods for fixing and permeabilizing cells for in situ detection of nucleic acids, as well as methods for hybridizing, washing and detecting target nucleic acids are also well known in the art (see, for example, US 2008/0038725; US 2009/0081688; Hicks et al., J. Mol. Histol. 35:595-601 (2004); Staler, Clinics in Laboratory Medicine 10(l):215-236 (1990); In situ hybridization.
- Exemplary fixing agents include, but are not limited to, aldehydes (formaldehyde, glutaraldehyde, and the like), acetone, alcohols (methanol, ethanol, and the like).
- Exemplary permeabilizing agents include, but are not limited to, alcohols (methanol, ethanol, and the like), acids (glacial acetic acid, and the like), detergents (Triton, NP-40, TweenTM 20, and the like), saponin, digitonin, LeucopermTM (BioRad, Hercules, CA), and enzymes (for example, lysozyme, lipases, proteases and peptidases). Permeabilization can also occur by mechanical disruption, such as in tissue slices.
- the sample is treated to denature the double stranded nucleic acids in the sample to provide accessibility for the target probes to bind by hybridization to a strand of the target double stranded nucleic acid.
- Conditions for denaturing double stranded nucleic acids are well known in the art, and include heat and chemical denaturation, for example, with base (NaOH), formamide, dimethyl sulfoxide, and the like (see Wang et al., Environ. Health Toxicol.
- tissue specimens immobilized on a glass slide on single cells in suspension such as peripheral blood mononucleated cells (PBMCs) isolated from blood samples, and the like.
- Tissue specimens include, for example, tissue biopsy samples.
- Blood samples include, for example, blood samples taken for diagnostic purposes.
- the blood can be directly analyzed, such as in a blood smear, or the blood can be processed, for example, lysis of red blood cells, isolation of PBMCs or leukocytes, isolation of target cells, and the like, such that the cells in the sample analyzed by the disclosed methods are in a blood sample or are derived from a blood sample.
- tissue specimen can be processed, for example, the tissue specimen minced and treated physically or enzymatically to disrupt the tissue into individual cells or cell clusters.
- a cytological sample can be processed to isolate cells or disrupt cell clusters, if desired.
- the tissue, blood and cytological samples can be obtained and processed using methods well known in the art. The disclosed methods can be used in diagnostic applications to identify the presence or absence of pathological cells based on the presence or absence of a nucleic acid target that is a biomarker indicative of a pathology.
- the sample is a biological sample or tissue sample.
- a biological sample can be obtained from a biological subject, including a sample of biological tissue or fluid origin that is collected from an individual or some other source of biological material such as biopsy, autopsy or forensic materials.
- a biological sample also includes samples from a region of a biological subject containing or suspected of containing precancerous or cancer cells or tissues, for example, a tissue biopsy, including fine needle aspirates, blood sample or cytological specimen.
- tissue biopsy including fine needle aspirates, blood sample or cytological specimen.
- Such samples can be, but are not limited to, organs, tissues, tissue fractions and/or cells isolated from an organism such as a mammal.
- Exemplary biological samples include, but are not limited to, a cell culture, including a primary cell culture, a cell line, a tissue, an organ, an organelle, a biological fluid, and the like. Additional biological samples include but are not limited to a skin sample, tissue biopsies, including fine needle aspirates, cytological samples, stool, bodily fluids, including blood and/or serum samples, saliva, semen, and the like. Such samples can be used for medical or veterinary diagnostic purposes. A sample can also be obtained from other sources, for example, food, soil, surfaces of objects, and the like, and other materials for which detection of target nucleic acids is desired.
- the disclosed methods can be used for detection of one or more pathogens, such as a virus, a bacterium, a fungus, a single celled organism such as a parasite, and the like, from a biological sample obtained from an individual or other sources.
- pathogens such as a virus, a bacterium, a fungus, a single celled organism such as a parasite, and the like.
- the cytological sample is a cervical sample, for example, a pap smear.
- the sample is a fine needle aspirate.
- the sample is a tissue specimen or is derived from a tissue specimen.
- the tissue specimen is a formalin-fixed paraffin-embedded (FFPE) sample.
- the tissue specimen is fresh frozen.
- the tissue specimen is prepared with a fixative.
- the tissue specimen is prepared with a crosslinking fixative.
- the sample is a blood sample or is derived from a blood sample.
- the sample is a cytological sample or is derived from a cytological sample.
- the method for detecting a target nucleic acid in a cell comprises a pretreatment step before hybridization of the target probe sets.
- the pretreatment step comprises a blocking step where certain blocking agent(s) is/are applied to block certain endogenous components of the cell thus reducing assay background.
- certain blocking agent(s) is/are applied to block certain endogenous components of the cell thus reducing assay background.
- hydrogen peroxide is a blocking agent when horseradish peroxidase (HRP) is used as detection enzyme in the later steps. Hydrogen peroxide is added to inactivate the endogenous HRP activity in the sample, thus reducing assay background.
- this blocking step is added as the first step in the pretreatment right after deparaffmization.
- the pretreatment step comprises an epitope retrieval step, where certain epitope retrieval buffer(s) can be added to unmask the target nucleic acid.
- the epitope retrieval step comprises heating the sample.
- the epitope retrieval step comprises heating the sample to 50 °C to 100 °C.
- the epitope retrieval step comprises heating the sample to about 88°C.
- the pretreatment step comprises a permeabilization step to retain the nucleic acid targets in the cell and to permit the target probe(s), signal-generating complex, etc. to enter the cell.
- the permeabilization step comprises a digestion with a protease.
- Detergents e.g., Triton X-100 or SDS
- Proteinase K can also be used to increase the permeability of the fixed cells.
- Detergent treatment usually with Triton X-100 or SDS, is frequently used to permeate the membranes by extracting the lipids.
- Proteinase K is a nonspecific protease that is active over a wide pH range and is not easily inactivated. It is used to digest proteins that surround the target mRNA. Optimal concentrations and durations of treatment can be experimentally determined as is known in the art.
- a cell washing step can follow, to remove the dissolved materials produced in the any steps in the pretreatment step.
- the sample is in a formalin-fixed paraffin embedded tissue, a deparaffmization step is needed, when paraffin is removed. e. Codetection Methods
- the methods described herein generally relates to detection of multiple target nucleic acids in a sample. It is understood that the disclosed can additionally be applied to detecting multiple target nucleic acids and optionally other molecules in the sample, in particular in the same cell as the target nucleic acid.
- proteins expressed in a cell can also concurrently be detected using a similar rationale as described herein for detecting target nucleic acids.
- one or more rounds of detection of multiple target nucleic acids and optionally one or more proteins expressed in a cell can be detected, for example, by using a detectable label to detect the protein.
- the protein can be detected with a cleavable label, similar to that used for detecting target nucleic acids. If the protein is being detected in the last round of detection, the label does not need to be cleavable.
- Detection of proteins in a cell are well known to those skilled in the art, for example, by detecting the binding of protein-specific antibodies using any of the well-known detection systems, including those described herein for detection of target nucleic acids. Detection of target nucleic acids and protein in the same cell has been described (see also Schulz et al., Cell Syst. 6(l):25-36 (2016)).
- proteins can be co-detected using antibodies (or fragments thereof) that are conjugated to an oligonucleotide, using the same types of SGCs that are disclosed herein. Such methods are disclosed in U.S. Provisional Patent Application No. 63/301,711, which is incorporated herein by reference.
- Embodiments of the present disclosure also include a kit for carrying out the methods disclosed herein.
- the kit can include the components described herein for performing a multiplexed in situ hybridization reaction, e.g., sets of target probes, SGCs (including label probes and amplifiers, pre-amplifiers, and/or pre-pre-amplifiers). Such components have been discussed extensively herein.
- the kit further comprises at least one compound or composition for removing signals generated by the detectable labels from the sample, to facilitate additional rounds of labeling and detection as discussed herein.
- the kit further comprises a reagent for cleaving labels from a label probe, such as a reducing agent, periodate, dithionite, hydroxylamine, base, or the like.
- a reagent for cleaving labels from a label probe such as a reducing agent, periodate, dithionite, hydroxylamine, base, or the like.
- the cleaving reagent is tris(2-carboxyethyl)phosphine (TCEP).
- the kit further comprises a reagent that disrupts hybridization between SGCs and a target nucleic acid, such as an acid reagent.
- a reagent that disrupts hybridization between SGCs and a target nucleic acid such as an acid reagent.
- Exemplary acids suitable for use in an acid reagent include, but are not limited to, formic acid, acetic acid, propionic acid, butyric acid, valeric acid, oxalic acid, malonic acid, succinic acid, malic acid, tartaric acid, citric acid, and the like.
- the acid reagent generally includes an acid at a concentration of about 5-40% acid (vol/vol). Further information regarding acid reagents is provided above; any such reagents can be included in the kits.
- the kit further comprises other agents or materials for performing RNA ISH, including fixing agents and agents for treating samples for preparing hybridization, agents for washing samples, and so on.
- the kit may further comprise packaging material, which refers to a physical structure housing the components of the kit.
- packaging material can maintain the components sterilely, and can be made of material commonly used for such purposes (e.g. , paper, corrugated fiber, glass, plastic, foil, ampules, vials, tubes, etc.).
- Kits provided herein can include labels or inserts.
- Labels or inserts include information on a condition, disorder, disease, or symptom for which the kit component may be used for.
- Labels or inserts can include instructions for a clinician or for a subject to use one or more of the kit components in a method, treatment protocol, or therapeutic regimen.
- the kit can be used for identification of tissues and cell types.
- the kit can be used for identification of different stages of development.
- the kit can be used for detection of clinical biomarkers for cancers.
- the kit can be used for diagnosing a disease or disorder based on the expression of one or more altered small RNAs or the presence of pathogen-associated small RNAs.
- the kit can be used for characterization of adult tissue. In some embodiments, the kit can be used for detection of clinical biomarkers for pathogen diagnosis. In some embodiments, the kit can be used for detection and characterization of small RNA- based therapies. In some embodiments, the kit can be used for confirmation of the initial efficiency of small RNA-based therapies. In some embodiments, the kit can be used to continue monitoring the efficiency of small RNA-based therapies. In some embodiments, the kit can be used for determining the efficiency of small RNA-based therapies. In some embodiments, the kit can be used for detecting the presence, localizing, and quantifying siRNAs. In some embodiments, the kit can be used for detecting the presence, localizing, and quantifying ASO molecules. In some embodiments, the kit can be used for detection and identification of pathogen-derived small RNAs.
- Embodiments of the present disclosure also include a method for enhancing detection of a target (e.g., a target nucleic acid).
- the method includes an image processing method, such as the methods described in International Patent Application PCT/US22/24975, which is herein incorporated by reference.
- the method is implemented at least in part with a computer having corresponding instructions stored on a memory (i.e., a non- transitory computer readable medium).
- the final images, and in some embodiments the intermediate images, from the method are stored in a memory.
- the memory is accessible by a network.
- user input or instructions are receivable or accessible over the network.
- the method includes imaging a sample with a target signal to create a probe image and imaging a sample with no target signal to create a background image (i.e., “blank image”).
- a “blank image” is an image that includes RNA blocking molecules of the present disclosure.
- a “blank image” is an image that does not include RNA blocking molecules of the present disclosure.
- the imaging utilizes a fluorescent microscope coupled to a computer via a network.
- the target signal is obtained by subjecting the sample to a fluorescent in situ hybridization assay and/or an immunofluorescence assay.
- the background image with no target signal is obtained by removing the target signal from the sample (i.e., by a cleaving process). In other embodiments, the background image with no target signal is obtained before the assay is performed.
- the target signal comprises a fluorescent label bound to a target nucleic acid. In other embodiments, the target signal comprises a fluorescent label bound to a target peptide or polypeptide.
- the method can also include registering the probe image and the background image (e.g., with or without RNA blocking molecules). Potential background fluorescence discrepancy between the probe image and the background image creates spatial pattern mismatches that occur due to whole sample movement between different rounds of image acquisition. To remove such discrepancies, image registration techniques (e.g., phase correlation) are utilized. Robust image registration utilizes detection and matching of image features to compensate for any global sample movement (i.e., translation and rotation).
- image registration techniques e.g., phase correlation
- the method further includes modifying the background image (e.g., with or without RNA blocking molecules) to create an adjusted background image (e.g., transformed, intensity- adjusted blank image) based on at least one image metric.
- the at least one image metric is a ratio factor, a multiplication factor, a local maximum value transform, and any other suitable metric.
- the method includes a single image metric. In other embodiments, the method includes a combination of image metrics.
- the method further includes subtracting the adjusted background image (e.g., with or without RNA blocking molecules) from the probe image to create a final image comprising an enhanced target signal.
- the modified (i.e., transformed, adjusted, scaled, etc.) blank image is used in the subtracting step instead of the original blank image.
- the enhanced target signal includes enhanced contrast.
- the method includes displaying the final image on a display (e.g., a computer display). The final image may be saved to a memory and may be accessible by a user, for example, over a network. As such, the method provides improved signal detection in the presence of a background with tissue autofluorescence.
- the image metric is a ratio factor to account for intensity differences in background between the blank image and the probe image. Intensity differences can occur when image acquisition settings are different or from photobleaching during fluorophore excitation.
- the method includes determining a ratio factor that compares the overall background intensity of the probe image versus the blank image.
- the pixel locations of the probe are estimated. The probe locations in the probe image are estimated using, for example, the White Top Hat algorithm (Gonzalez & Woods, 2008, Digital Image Processing), bandpass filtering (Shenoi, 2006, Introduction to Digital Signal Processing and Filter Design), or any combination of suitable methods.
- the method After determining an estimated location of the target signals in the probe image, the pixels at the estimated probe locations are excluded from both the probe image and the blank image, resulting in background-pixel-only images (i.e., background-only images).
- the method includes removing the estimated location from the probe image to create a first background-only image and removing the estimated location from the blank image (background image) to create a second background-only image.
- the method includes determining a ratio factor.
- a ratio factor is utilized in some embodiments to modify the background image to create an adjusted background image.
- modifying the background image to create an adjusted background image can include, in some embodiments, scaling the background image by the ratio factor.
- the at least one image metric is a ratio factor of the first background-only image and the second background-only image.
- the ratio factor in some embodiments is a first intensity to a second intensity, with the first intensity is determined from the first background-only image and the second intensity is determined from the second background-only image.
- the first and second intensities used in the ratio factor are statistical metrics such as a statistical mean, median, or a combination of both for any portion of (including all) the intensity values in an image.
- the first intensity is the mean of a plurality of pixel intensity values in the first background-only image and the second intensity is the mean of a plurality of pixel intensity values in the second background-only image.
- the mean is of all the pixel intensity values in the image.
- the first intensity is the median of a plurality of pixel intensity values in the first background-only image
- the second intensity is the median of a plurality of pixel intensity values in the second background- only image.
- the median is of all the pixel intensity values in the image.
- the first intensity is the mean of a central approximately 80% of all the pixel intensity values (i.e., excluding the approximate top 10% and the approximate bottom 10%) in the first background-only image
- the second intensity is the mean of a central approximately 80% of all the pixel intensity values in the second background-only image.
- the image metric is a multiplication factor to account for potential local intensity differences between the blank image and the probe image.
- the method includes determining the multiplication factor.
- the multiplication factor is within a range of approximately 1.0 to approximately 1.2. In other embodiments, the multiplication factor is within a range of approximately 1.0 to approximately 1.1.
- the multiplication factor is utilized in some embodiments to modify the background image to create an adjusted background image. In other words, modifying the background image to create an adjusted background image can include, in some embodiments, scaling the background image by the multiplication factor.
- the image metric is a local maximum value transform.
- the method includes transforming the blank image with a local maximum value transform. Even after global image registration, there may remain local background patern mismatches that from, for example, image acquisition at different focal planes, or samples not firmly attached to the supporting material (e.g., glass slides) and partially moving between imaging sessions. To resolve this issue, local mismatches are compensated with a transform.
- pixel of interest a neighborhood of a pre-defined radius surrounding the pixel of interest is searched. The search process will find the pixel of maximum intensity, and this maximum intensity is assigned to that pixel of interest.
- the pre-defmed radius (“match distance”) is adjustable.
- the pre-defined radius used in the local maximum valve transform is within a range of approximately 0 to approximately 5 pixels.
- the local maximum value transform includes a search radius within a range of approximately 0 to 5 pixels.
- a pre-defined radius of 0 pixels is utilized, for example, when there is no noticeable local background pattern mismatch.
- the search area is simplified to reduce computational time by using eight angularly equally spaced lines (i.e., 45 degrees apart), each with a single-pixel width, radiating from the pixel of interest.
- the image metric is a block-matching transform.
- the method includes a step to transform the blank image with a blockmatching transform.
- the block-matching transform is used in place of the local maximum value transform to resolve the issue of local mismatches.
- a block (“block of interest”) is used with a pre-defined block size (e.g., a 3-pixel- by-3-pixel block). Each block in the blank image is compared with blocks of the same size in the probe image in nearby locations (i.e., within a pre-defined block search size). The search determines the nearby block that is most similar to the block of interest.
- a similarity metric is utilized to measure the similarity of the blocks, and the searched nearby block with the highest similarity metric is determined to be the target block. Then, the block of interest is moved to the corresponding location of the target block.
- the similarity metric is a mean absolute difference, a sum of absolute difference, a mean squared difference, or a sum of squared difference, wherein the differences are the pixel intensity differences between the two blocks being compared.
- the block-matching transform is performed for each block of interest, searching its corresponding neighborhood in the probe image and moving its location accordingly, to form a transformed blank image. In some embodiments, this transformed blank image is used instead of the original blank image in later subtracting steps.
- the pre-defined block size and the pre-defined block search size are adjustable.
- the pre-defined block size used in the block-matching transform is within a range of approximately 1 to approximately 10 pixels. In other words, the block-matching transform includes a block size within a range of approximately 1 to 10 pixels.
- the pre-defined block search size used in the block matching transform is within a range of approximately 1 to approximately 10 pixels. In other words, the blockmatching transform includes a block search size within a range of approximately 1 to 10 pixels.
- the method for enhancing detection of a target includes any combination of the steps described herein, in various orders. In some embodiments, steps may be omitted. Further, the order of the steps may be reversed, altered, or performed simultaneously.
- the electronic-based aspects of the method may be implemented in software (e.g., stored on non-transitory computer-readable medium) executable by a computer with one or more processing units, such as a microprocessor and/or application specific integrated circuits (“ASICs”).
- ASICs application specific integrated circuits
- Some embodiments may include hardware, software, and electronic components or modules. As such, it should be noted that a plurality of hardware and software-based devices, as well as a plurality of different structural components, may be utilized to implement the embodiments.
- Mouse brain sections (5um thick) were collected from a fresh-frozen BL6 mouse brain from a 1 month old mouse and a 12-month-old mouse. The tissues were embedded in OCT, snap-frozen, and sectioned coronally from the brain block with a cryotome. Brain sections were fixed in 10% NBF and dehydrated in Ethanol before the assay.
- Target probe selections selections, synthesis, and formulation.
- One hundred mouse genes were selected as the target of transcriptomic profiling. These genes were examined for their spatial expression pattern in the mouse brains, their expression levels, and their specificity for brain cell types using single-cell RNAseq datasets from public sc-RNA seq data deposits. Multiplex and Hiplex assays were also performed on the coronal mouse brain sections to further verify the spatial expression patterns of some high-expression genes. The goal of these screening steps was to avoid any spatial crowding in the final FISH images, which would otherwise defeat the decoding process.
- All target probes were designed by ACD probe design team and were checked to ensure no potential cross-reactivity to other RNA species. All DNA oligo probes were formulated at lOuM concentration and were pooled in ACD hybridization buffer A at 1:500 dilution. 100 target probes were designed and synthesized for the first cycle, and another set of 100 target probes for the second cycle of the assay.
- the assay is built on a codebook that is composed of 10 fundamental codes. Each fundamental code has 5 bits and two fluorescence signals (e.g., ‘ 10100’ has an AF488 signal and a Dy594 signal). A pool of 5 unique fluorophores was used to generate various combinations of two fluorophores, which include: AF488, Dy550, Dy650, Dy594, and AF750. The resultant 5-bit codes are listed below: Table 4. The list of the 10 fundamental codes used in an exemplary lOOplex Ultraplex assay.
- Each code has 5 bits and contains two fluorescence signals.
- each RNA transcript was first hybridized a specific double-Z target probe designed by ACD.
- the target probe was then annealed with 3 tiers of branching DNA oligos (i.e. , AMP1, AMP2, and AMP3).
- the third AMP oligo contained a unique binding sequence that compliments the corresponding fluorophore-conjugated label oligos (label probe, or LP).
- label probe or LP
- the development of signal generating configuration was completed upon incubating the branching DNA structure with its corresponding label probes.
- the branching DNA structure it hybridized to was annealed with its complementary label probes that were conjugated with either fluorophore A or fluorophore B (50%:50% LP-A /LP-B in the final solution), as predetermined in the codebook (Table 5).
- RNA targets included in the panel were encoded twice through two independent barcoding cycles.
- the first cycle of barcoding provides the first 5 bits of the code
- the second cycle provides the last 5 bits of the code.
- all 100 genes were simultaneously labeled with their corresponding 5-bit codes, which does not require any specific temporal barcoding order (FIGS. 16A-16B).
- the two cycles were independent of each other.
- Each barcoding cycle can be performed either first or last. In each cycle, any prior branching DNA structures were disrupted and stripped off from the RNA molecules using the HiplexUP reagent, and the RNA molecules were subject to a new signal generation configuration, beginning with a new target probe hybridization.
- the ACD PT3, PT4 or Sigma ready-to-use pepsin solution (cat No. R2283) were used, depending on the tissue types, tissue processing methods, and extent of formalin fixation.
- the tissue was incubated with 300uL of pepsin at room temperature for lOmins.
- Pepsin offers some unique benefits over the PT3 or PT4, including reduction of autofluorescent background in the FFPE tissues, removal of >90% red blood cells, and removal of non-specific signals in the Ultraplex assay, which all contribute to a higher signal-to-noise ratio.
- Final assay outputs from the pipeline include: a cell x feature matrix, cell segmentation boundaries, field of view ID, the x and y coordinates of all detected transcripts in the tissue, and x and y coordinates of the centroids of each cell in the tissue.
- RNA transcript A single gene (RNA transcript), Ubc, was detected in a two-cycle experiment according to the methods disclosed herein. In each round of labeling, the independent identifier was based on a single fluorescent label, corresponding to Code 3 in Table la.
- Samples fresh-frozen Balb/c mouse brain sections from Acepix were fixed 10% neutral buffered formalin (NBF) for 1 hour at room temperature, subjected to serial dehydration in 50%, 80%, and 100% ethanol, and then subjected to protease-IV digestion for 30 minutes at room temperature.
- NAF neutral buffered formalin
- target probe sets were hybridized for 2 hours at 40 °C, followed by hybridization of amplification reagents, and incubation with label probes (based on Dy650) for 15 minutes at 40 °C.
- the fluorophores from the label probes were cleaved, and the amplification trees were removed using the HiplexUP reagent (Advanced Cell Diagnostics, Inc.) (3 incubations at 5 minutes each, room temperature).
- HiplexUP reagent Advanced Cell Diagnostics, Inc.
- a second round of target probe sets were then hybridized for 2 hours at 40 °C, followed by hybridization of amplification reagents, incubation with label probes (based on Dy650) for 15 minutes at 40 °C, DAPI counterstaining, mounting, and imaging (Akoya Polaris microscope, 40x). Standard washing steps were used between the main hybridization steps.
- a resulting combined image from the two rounds of detection is shown in FIG. 11.
- the signals detected in the first round are indicated by triangles, and signals detected in the second round are indicated by squares. Locations with overlapping triangles and squares are the locations where the target gene, Ubc, was detected.
- RNA transcripts Gadph, Sdha, and Ubc Three genes (RNA transcripts Gadph, Sdha, and Ubc) were detected in a two-cycle experiment according to the methods disclosed herein. Each target was assigned the same first independent identifier based on a single fluorescent label (Code 3, see Table 6), and each target was assigned a different second independent identifier based on a single fluorescent label (Codes 1, 2, and 3 respectively, see Table 6).
- Samples fresh-frozen Balb/c mouse brain sections from Acepix
- NMF neutral buffered formalin
- target probe sets were hybridized for 2 hours at 40 °C, followed by hybridization of amplification reagents, and incubation with label probes (based on Dy650) for 15 minutes at 40 °C.
- label probes based on Dy650
- DAPI counterstaining, mounting, and imaging Alkoya Polaris microscope, 40x
- the fluorophores from the label probes were cleaved, and the amplification trees were removed using the HiplexUP reagent (Advanced Cell Diagnostics, Newark, CA) (3 incubations at 5 minutes each, room temperature).
- a second round of target probe sets were then hybridized for 2 hours at 40 °C, followed by hybridization of amplification reagents, incubation with label probes (based on Dy650, Dy550, and AF488) for 15 minutes at 40 °C, DAPI counterstaining, mounting, and imaging (Akoya Polaris microscope, 40x). Standard washing steps were used between the main hybridization steps.
- FIG. 12A shows detection of Gadph
- FIG. 12B shows detection of Sdha
- FIG. 12C shows detection of Ubc.
- the signals detected in the first cycle i.e., all three targets
- the signals detected in the second cycle are shown in triangles.
- signals detected in the second cycle are indicated by circles, and locations with overlapping triangles and circles are the locations where the combined first and second identifiers (i.e., 31) show that Gadph was detected.
- signals detected in the second cycle are indicated by crosses, and locations with overlapping triangles and crosses are the locations where the combined first and second identifiers (i.e., 32) show that Sdha was detected.
- signals detected in the second cycle are indicated by squares, and locations with overlapping triangles and squares are the locations where the combined first and second identifiers (i.e., 33) show that Ubc was detected.
- Hs-PPIB human peptidylprolyl isomerase B
- target probes were hybridized to the target for 2 hours at 40 °C, followed by treatment with amplification reagents for 30 minutes each at 40 °C, and then with the appropriate label probes for 30 minutes at 40 °C.
- DAPI counterstaining, mounting, and imaging Alkoya Polaris microscope, 40x, images shown in FIG. 13 A
- the labels were cleaved and the signalgenerating complexes were removed using a cleaving reagent and HiplexUP reagent (Advanced Cell Diagnostics, Newark, CA), respectively.
- the second round of labeling and detection was conducted in a manner similar to the first round. Standard washing steps were used between the main hybridization steps. Images are shown in FIG. 13B, and an overlay of the signals detected in the first and second rounds of labeling and imaging is shown in FIG. 13C, with shaded circles showing locations in which the full target identifier (A8) was detected.
- the second experiment was conducted with a reversed coding approach. Images from each round of labeling are shown in FIGS. 14A and 14B, and an overlay is shown in FIG. 14C.
- FIG. 15 A Images were obtained on a Leica DM2000 epifluorescence microscope using two different filter sequences; the first being DAPI-GFP- Cy3-Cy5-Cy7, and the second being GFP-DAPI-Cy7-Cy5-Cy3.
- Individual images are shown in FIG. 15 A, overlay images are shown in FIG. 15B, and overlays showing individual “dots” for detection of PPIB RNA are shown in FIG. 15C.
- each of the 100 target genes is differentially expressed by a unique cell type in the mouse brain and thus can be used to mark the particular cell type.
- these single-cell gene expression results were significantly correlated with the corresponding control data sets (FIG. 17).
- the correlation coefficient (R value) ranges from 0.6-0.81, indicating that the Ultraplex assay can effectively profile the gene expressions of mouse brain cells at singlecell level.
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Immunology (AREA)
- Physics & Mathematics (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- Biophysics (AREA)
- Analytical Chemistry (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The present disclosure provides methods, compositions, and kits for the multiplexed detection and corresponding expression analysis of nucleic acids in biological samples. The methods comprise hybridizing to the targets a plurality of target probe sets and a plurality of signal generating complexes (SGCs), each comprising a plurality of detectable labels, and determining a first identifier based on the number of distinct detectable labels in each SGC. The signals are then removed from the sample and the hybridization cycle is repeated with a second plurality of probes or SGCs to determine a second identifier.
Description
MULTIPLEXED DETECTION OF NUCLEIC ACID TARGETS
CROSS REFERENCE TO RELATED APPLICATIONS
This application claims priority to and the benefit of U.S. Provisional Patent Application No. 63/386,425 filed December 7, 2022, which is incorporated herein by reference in its entirety and for all purposes.
FIELD
The present disclosure provides methods, compositions, and kits for the multiplexed detection and corresponding expression analysis of nucleic acids in biological samples.
BACKGROUND
RNA in situ hybridization (ISH) is a molecular biology technique widely used to measure and localize specific RNA sequences, for example, messenger RNAs (mRNAs), long non-coding RNAs (IncRNAs), and microRNAs (miRNAs) within cells, such as circulating tumor cells (CTCs) or tissue sections, while preserving the cellular and tissue context. RNA ISH therefore provides for spatial-temporal visualization as well as quantification of gene expression within cells and tissues. It has wide applications in research and in diagnostics (Hu et al., Biomark. Res. 2(1):1-13, doi: 10.1186/2050-7771-2-3 (2014); Ratan et al., Cureus 9(6):el325. doi: 10.7759/cureus.l325 (2017); Weier et al., Expert Rev. Mol. Diagn. 2(2): 109- 119 (2002))). Fluorescent RNA ISH utilizes fluorescent dyes and fluorescent microscopes for RNA labeling and detection, respectively. Fluorescent RNA ISH typically provides for limited multiplexing of four to five target sequences. The limited multiplexing capability is largely due to the small number of spectrally distinct fluorescent dyes that can be distinguished by the optical systems of the fluorescence microscope. Higher level of multiplexing is highly desirable in areas such as generating cell and tissue maps to understand complex biological systems, particularly in human health and disease.
SUMMARY
In one aspect, disclosed herein is a method for detecting a plurality of target nucleic acids in a sample, the method comprising: contacting a sample comprising a plurality of target nucleic acids with a plurality of target probe sets, wherein each target probe set is complementary to a target nucleic acid;
contacting the sample with a first plurality of signal generating complexes (SGCs) capable of hybridizing to at least one target probe set, wherein each SGC comprises a plurality of spatially distributed label probes comprising detectable labels; obtaining a first independent identifier corresponding to at least one target nucleic acid, wherein the first independent identifier is based on the number of distinct detectable labels present in each SGC of the first plurality of SGCs; removing signals generated by the detectable labels from the sample; contacting the sample with at least a second plurality of SGCs capable of hybridizing to at least one target probe set, wherein each SGC comprises a plurality of spatially distributed label probes comprising detectable labels; and obtaining a second independent identifier corresponding to at least one target nucleic acid, wherein the second independent identifier is based on the number of distinct detectable labels present in each SGC of the second plurality of SGCs.
In some embodiments, the method further comprises identifying a target nucleic acid of the plurality of target nucleic acids based on a combination of the first and second independent identifiers.
In some embodiments, the target nucleic acid is identified independent of the order by which the first and second independent identifiers are obtained.
In some embodiments, the first and second independent identifiers are based on the same number of distinct detectable labels present in the first and second plurality of SGCs, respectively. In some embodiments, the first and second independent identifiers are based on a different number of distinct detectable labels present in the first and second plurality of SGCs, respectively. In some embodiments, the independent identifier is based on the number of distinct detectable labels present in a combination of at least two SGCs.
In some embodiments, the spatially distributed label probes comprise two distinct detectable labels. In some embodiments, the spatially distributed label probes comprise three distinct detectable labels. In some embodiments, the spatially distributed label probes comprise four distinct detectable labels. In some embodiments, the spatially distributed label probes comprise five distinct detectable labels. In some embodiments, the spatially distributed label probes comprise six distinct detectable labels.
In some embodiments, obtaining an independent identifier comprises obtaining an image of the sample and detecting signals generated by the detectable labels.
In some embodiments, the method further comprises:
removing the signals generated by the detectable labels of the second plurality of SGCs from the sample; contacting the sample with at least a third plurality of SGCs capable of hybridizing to at least one target probe set, wherein each SGC comprises a plurality of spatially distributed label probes comprising detectable labels; and obtaining a third independent identifier corresponding to at least one target nucleic acid, wherein the third independent identifier is based on the number of distinct detectable labels present in each SGC of the third plurality of SGCs.
In some embodiments, each target probe set comprises two or more target probes capable of hybridizing to a target nucleic acid. In some embodiments, each target probe in the target probe set comprises a T section complementary to a region of a target nucleic acid and an L section complementary to a region of an SGC. In some embodiments, each T section is complementary to a non-overlapping region of a target nucleic acid and each L section is complementary to a non-overlapping region of an SGC. In some embodiments, the T section of at least one of the target probes in the target probe set is 3’ of its L section. In some embodiments, the T section of at least one of the target probes in the target probe set is 5 ’ of its L section.
In some embodiments, each SGC comprises one or more of an amplifier, a preamplifier, and a pre-pre-amplifier. In some embodiments, each label probe in the plurality of label probes comprises a binding site for the amplifier. In some embodiments, the amplifier comprises at least one binding site for the pre-amplifier and a plurality of binding sites for the plurality of label probes. In some embodiments, the pre-amplifier comprises at least one binding site for the L section of the target probe and at least one binding site for the amplifier. In some embodiments, the pre-amplifier comprises at least one binding site for the pre-pre- amplifier and at least one binding side for the amplifier. In some embodiments, the at least one binding site for the L section of the target probe in the pre-amplifier or the pre-pre-amplifier is distinct for each SGC corresponding to a target nucleic acid. In some embodiments, the at least one binding site for the L section of the target probe in the pre-amplifier or the pre-pre-amplifier is the same for each SGC corresponding to a target nucleic acid.
In some embodiments, removing the signals generated by the detectable labels comprises removing the SGCs. In some embodiments, removing the SGCs comprises treatment with an acid reagent that disrupts hybridization between the SGCs and the target nucleic acids. In some embodiments, the acid reagent comprises formic acid, acetic acid, propionic acid, butyric acid, valeric acid, oxalic acid, malonic acid, succinic acid, malic acid, tartaric acid,
citric acid. In some embodiments, after the first plurality of SGCs are removed, the second plurality of SGCs are hybridized to the same target probe set. In some embodiments, after the first plurality of SGCs are removed, the second plurality of SGCs are hybridized to a different target probe set.
In some embodiments, removing the signals generated by the detectable labels comprises removing the detectable labels from the plurality of label probes using a cleavage reagent. In some embodiments, after the detectable labels are removed, the second plurality of SGCs are hybridized to a different target probe set.
In some embodiments, after the step of removing signals generated by the detectable labels from the sample, the method further comprises a step of contacting the sample with a second plurality of target probe sets, wherein each target probe set in the second plurality of target probe sets is complementary to a target nucleic acid.
In some embodiments, the sample comprises a cell. In some embodiments, the method further comprises fixing and/or permeabilizing the cell.
In some embodiments, the target nucleic acid is RNA. In some embodiments, the method further comprises detecting at least one non-nucleic acid target in the sample. In some embodiments, the non-nucleic acid target is a protein.
In one aspect, disclosed herein is a method for detecting a plurality of target nucleic acids in a sample, the method comprising: contacting a sample comprising a plurality of target nucleic acids with a plurality of target probe sets, wherein each target probe set is complementary to a target nucleic acid; contacting the sample with a plurality of amplification complexes capable of hybridizing to at least one target probe set, wherein each amplification complex comprises a plurality of spatially distributed binding sites for label probes; contacting the sample with a first plurality of label probes comprising detectable labels, which hybridize to binding sites on the amplification complexes to generate a first plurality of signal generating complexes (SGCs); obtaining a first independent identifier corresponding to at least one target nucleic acid, wherein the first independent identifier is based on the number of distinct detectable labels present in each SGC of the first plurality of SGCs; removing signals generated by the detectable labels from the sample; contacting the sample with at least a second plurality of label probes comprising detectable labels, which hybridize to binding sites on the amplification complexes to generate a second plurality of SGCs; and
obtaining a second independent identifier corresponding to at least one target nucleic acid, wherein the second independent identifier is based on the number of distinct detectable labels present in each SGC of the second plurality of SGCs.
In some embodiments, the method further comprises identifying a target nucleic acid of the plurality of target nucleic acids based on a combination of the first and second independent identifiers. In some embodiments, the target nucleic acid is identified independent of the order by which the first and second independent identifiers are obtained.
In some embodiments, the first and second independent identifiers are based on the same number of distinct detectable labels present in the first and second plurality of SGCs, In some embodiments, the first and second independent identifiers are based on a different number of distinct detectable labels present in the first and second plurality of SGCs, respectively. In some embodiments, the independent identifier is based on the number of distinct detectable labels present in a combination of at least two SGCs.
In some embodiments, the spatially distributed label probes comprise two distinct detectable labels. In some embodiments, the spatially distributed label probes comprise three distinct detectable labels. In some embodiments, the spatially distributed label probes comprise four distinct detectable labels. In some embodiments, the spatially distributed label probes comprise five distinct detectable labels. In some embodiments, the spatially distributed label probes comprise six distinct detectable labels.
In some embodiments, obtaining an independent identifier comprises obtaining an image of the sample and detecting signals generated by the detectable labels.
In some embodiments, the method further comprises: removing the signals generated by the detectable labels of the second plurality of SGCs from the sample; contacting the sample with at least a third plurality of label probes comprising detectable labels, which hybridize to binding sites on the amplification complexes to generate a third plurality of SGCs; and obtaining a third independent identifier corresponding to at least one target nucleic acid, wherein the third independent identifier is based on the number of distinct detectable labels present in each SGC of the third plurality of SGCs.
In some embodiments, each target probe set comprises two or more target probes capable of hybridizing to a target nucleic acid. In some embodiments, each target probe in the target probe set comprises a T section complementary to a region of a target nucleic acid and an L section complementary to a region of an amplification complex. In some embodiments,
each T section is complementary to a non-overlapping region of a target nucleic acid and each L section is complementary to a non-overlapping region of an amplification complex. In some embodiments, the T section of at least one of the target probes in the target probe set is 3 ’ of its L section. In some embodiments, the T section of at least one of the target probes in the target probe set is 5 ’ of its L section.
In some embodiments, each amplification complex comprises one or more of an amplifier, a pre-amplifier, and a pre-pre-amplifier. In some embodiments, each label probe in the plurality of label probes comprises a binding site for the amplifier. In some embodiments, the amplifier comprises at least one binding site for the pre-amplifier and a plurality of binding sites for the plurality of label probes. In some embodiments, the pre-amplifier comprises at least one binding site for the L section of the target probe and at least one binding site for the amplifier. In some embodiments, the pre-amplifier comprises at least one binding site for the pre-pre-amplifier and at least one binding side for the amplifier. In some embodiments, the at least one binding site for the L section of the target probe in the pre-amplifier or the pre-pre- amplifier is distinct for each amplification complex corresponding to a target nucleic acid. In some embodiments, the at least one binding site for the L section of the target probe in the preamplifier or the pre-pre-amplifier is the same for each amplification complex corresponding to a target nucleic acid.
In some embodiments, removing the signals generated by the detectable labels comprises removing the detectable labels from the plurality of label probes using a cleavage reagent.
In some embodiments, after the detectable labels from the first plurality of SGCs are removed, the second plurality of label probes are hybridized to different amplification complexes to generate the second plurality of SGCs. In some embodiments, after the step of removing signals generated by the detectable labels from the first plurality of label probes from the sample, the method further comprises a step of contacting the sample with a second plurality of target probe sets, wherein each target probe set in the second plurality of target probe sets is complementary to a target nucleic acid.
In some embodiments, the sample comprises a cell. In some embodiments, the method further comprises fixing and/or permeabilizing the cell.
In some embodiments, the target nucleic acid is RNA. In some embodiments, the method further comprises detecting at least one non-nucleic acid target in the sample. In some embodiments, the non-nucleic acid target is a protein.
In one aspect, disclosed herein is a kit for carrying out any of the methods disclosed herein.
BRIEF DESCRIPTION OF THE DRAWINGS
FIGS. 1A-1B show exemplary configurations of probes for detecting a target nucleic acid. In the configuration shown in FIG. 1A, each individual target probe has a target (T) segment complementary to the target nucleic acid (i.e. , a segment of the target probe that can hybridize to the target nucleic acid) and a label (L) segment complementary to a component of a Signal Generating Complex (SGC) (i.e., a segment of the target probe that can hybridize to a component of the SGC). Each SGC comprises multiple layers of components, such as amplifiers (AP) and pre-amplifiers (PA) that assemble into a tree-like structure which is capable of carrying many label probes (LP) on its “branches.” As shown in FIG. IB, if the target sequence is sufficiently long, many TP sets and associated SGCs can be assembled on the target nucleic acid to generate a detectable signal that appears in an imaging system as a discrete “dot.”
FIGS. 2A-2B show two exemplary configurations of probes for detecting a target nucleic acid. The configurations utilize SGCs comprising LPs, APs, PAs, TPs, as shown in FIGS. 1 A-1B, except that assembly of the SGC utilizes a Collaboration Amplifier (COM). The COMs bind to two pre-amplifiers (PAs) and to an amplifier (AP) for assembly of the SGC. FIGS. 2A-2B show two different configurations of the target probe binding to the target nucleic acid. The configurations allow more LPs to be incorporated into one SGC. Such configurations are suitable, e.g., for detecting short target sequences, because a detectable signal can be generated with a single SGC.
FIG. 3 shows an exemplary configuration for multiplex detection of target nucleic acids. In this embodiment, each SGC carries the same LPs with a specific label, where different SGCs carry distinct labels. Components of each SGC (such as PAs, APs, LPs, etc.) are uniquely associated with the SGC. The SGCs are designed so that the components of a target-specific SGC hybridize to each other to assemble the SGC but cannot cross-hybridize to any components of any other SGC. In FIG. 3, two target nucleic acids are shown bound to the respective SGCs. For one target nucleic acid (upper target in FIG. 3), SGCs for the target comprise four labels (LI, L2, L3, and L4 that are specific to this SGC). For a second target (lower target in FIG. 3), the code for the target nucleic acid, the SGCs for the target nucleic acid comprise two labels (L2 and L4 that are specific to this SGC).
FIGS. 4A-4G illustrate various exemplary embodiments for multiplex detection of target nucleic acids. FIG. 4A illustrates one embodiment of sub-SGC implementation, where the SGC ID code is implemented on the amplifier (AP) molecule. As shown in FIG. 4A, an AP has one region designed to bind to the amplifier anchor (AA) on the PA molecule (i.e. , binding site for the amplifier on the pre-amplifier) and another region comprising multiple segments of label probe anchors (LAs) (i.e., binding sites for the label probes). In this embodiment, a mixture of different LAs are designed according to the unique identification code of the SGC. For example, if the identifier of the SGC is based on three labels, then equal number of LAs for each of the three labels are made on the AP molecule, which will bind a designed number of desired LPs to generate the identifier in the assay. In FIG. 4A, the code for the SGC shown uses labels LI, L3, and L4 (alphanumeric code C, see Table la).
FIG. 4B illustrates another embodiment of sub-SGC implementation, where the SGC ID code is implemented on the pre-amplifier (PA) molecule. As shown, N “pure” AP molecules are made, each carrying the same type of LP. A PA molecule has one region designed to bind to the SA (i.e., binding site for the PA on the TP; see FIG. 1A) of a TP set and another region comprising multiple segments of AAs (i.e., binding sites for amplifiers on the pre-amplifier). In this embodiment, a mixture of different AAs is designed according to the unique identification code of the SGC. For example, if the ID code of the SGC includes the two labels L2 and L4, then equal number of AAs for APs carrying LP2 and LP4 are made on the PA molecule, which will bind a designed number of desired LPs to generate the identifier in the assay. In FIG. 4B, the identifier of the SGC is shown using LI, L3, and L4 (alphanumeric code C, see Table la).
FIG. 4C illustrates another embodiment of sub-SGC implementation, where the SGC ID code is implemented on the LP molecule. In this embodiment, LP molecules binding to the same SGC can be mixtures of LPs each conjugated to a different label according to a predefined code book. For example, as shown in FIG. 4C, SGC5 LPs are a mixture of LPs conjugated to three different labels (LI, L3, and L4). An advantage of this embodiment is that the “coloring” of the SGC complex by the LPs will be completely randomized, which can further help to reduce coding errors. A partial LP mixing code book is shown on the right of FIG. 4C, with 7 different exemplary SGC identifiers shown using 4 labels.
FIG. 4D shows an embodiment of FIG. 4C in more detail. For each SGC, a specific label anchor (LA, the binding site on the amplifier for the label probe) is assigned so that each SGC for a particular target nucleic acid has a plurality of the same LAs on the amplifier. The level at which combinatorial labeling can be provided is with the label probes (LPs). In this
case, SGC5 is illustrated showing that the amplifiers comprise a plurality of identical LAs, labeled “E.” As shown in FIG. 4D, SGC5 is coded with three distinct labels conjugated to label probes (L 1 , L3 , and L4), all of which have the same binding site for the plurality of “E” LAs on the corresponding amplifier. Therefore, all label probes specific to SGC5 are bound to the amplifiers of SGC5.
FIG. 4E shows an embodiment of FIG. 4C in more detail. The SGC5 of FIG. 4D is shown bound to its respective target nucleic acid, with the label probes having an “E” binding site bound to the respective “E” LAs of the SGC5 amplifiers. Also shown are two additional exemplary SGCs bound to their respective target nucleic acids. SGC 1 , coded as shown in FIG. 4D, comprises a plurality of identical LAs, labeled “A.” SGC1 is coded with one label (LI), which has the binding site for the plurality of “A” LAs of the SGC1 amplifiers. Therefore, LI is bound to the amplifiers of SGC1, thereby identifying the SGC1 target nucleic acid with the independent identifier based on LI (alphanumeric code 1, see Table la). SGC3, coded as shown in FIG. 4D, comprises a plurality of identical LAs, labeled “C.” SGC3 is coded with two distinct label probes (L 1 and L2), both of which have the same binding site for the plurality of “C” LAs on the corresponding SGC3 amplifiers. Therefore, both label probes are bound to the amplifiers of SGC3, thereby labeling the SGC3 target nucleic acid with the independent identifier based on LI and L2 (alphanumeric code A, see Table la).
FIG. 4F shows an embodiment of FIG. 4C in more detail. Once an SGC for a particular target nucleic acid has been designed, the identifier for the target nucleic acid can be readily modified simply by changing the labels on the label probes that bind to the amplifiers of a particular SGC. For example, in FIG. 4D, SGC2 comprises amplifiers with “B” LAs and is coded using label probe L2. The same SGC assembly can be used with respect to the target probes, pre-amplifier, and amplifiers with “B” LAs, but instead of using “B” LA-binding label probes with only L2 as in FIG. 4D, “B” LA-binding label probes can be used that have a mixture of labels L2 and L3, such that SGC2 is now coded with both labels (alphanumeric code 8, see Table la). Thus, L2 and L3 are bound to “B” LAs on the SGC2 amplifiers. Similarly, SGC5 comprising amplifiers with “E” LAs is now coded in FIG. 4F by using label probes with “E” LA-binding label probes that have a mixture of labels L2, L3, and L4 (alphanumeric code E, see Table la), instead of labels LI, L3, and L4 (alphanumeric code C, see Table la) as shown in FIG. 4D.
FIG. 4G shows an embodiment of FIG. 4C in more detail. Two target nucleic acids are shown with two bound SGCs, SGC2 and SGC5. SGC2 is coded with an identifier based on labels L2 and L3 (alphanumeric code 8, see Table la), and SGC5 is coded with an identifier
based on labels L2, L3, and L4 (alphanumeric code E, see Table la). In this case, where all of the label probes bind to the respective LAs, “B” LAs in the case of SGC2 and “E” LAs in the case of SGC5, and assuming that the SGC2 and SGC5 have approximately the same number of LAs in the respective SGCs, the number of respective labels that can bind to SGC2 will be higher than the number of respective labels that bind to SGC5 (i.e., the 2 distinct labels for SGC2 and the 3 distinct labels for SGC5 will be bound to the same number of sites, resulting in a higher number of L2 and L3 being bound to SGC2 than SGC5 since some of the SGC5 sites are occupied by L4). If desired, the number of labels (and therefore intensity of signal) can be normalized by including “blank” label probes, i.e., probes having a binding site for the respective LAs (in this case “B” for SGC2 and “E” for SGC5) but without a label. Lor example, if it is desired to compare SGC2 and SGC5 with equal intensity signals for the respective labels, 1/3 “blank” label probes can be included with the mixture of “B” LA-specific probes so that the intensity of L2 and L3 will be the same on both SGCs (i.e., 1/3 of SGC2 occupied by “blank” label probes and 1/3 of SGC5 occupied by L4). In another example, if a multiplex assay is being performed where some SGCs include 4 labels, then the assay can be performed so that the same proportion of “blank” label probes are included in the label probe sets using less than 4 labels, for example, 1/2 “blank” label probes can be included with the SG2-specific label probes and 1/4 “blank” label probes to be included with the SGC5-specific label probes so that the amount of each distinct label probe bound to the respective SGCs is the same on each SGC.
EIG. 5 shows two configurations of assembly of SGCs on a target nucleic acid. In the lower panel of EIG. 5, the SGCs providing the same label are shown binding in a group next to each other. In the upper panel of FIG. 5, the SGCs providing different labels are shown with binding sites on the target nucleic acid intermingled or intertwined. The intermingling of target probe binding sites on the target nucleic acid for different labels are advantageous because, if different SGC types are positioned apart, in separate groups, a certain section of the target may be blocked or masked, thereby preventing attachment of one specific SGC type, which will result in miscoding.
FIGS. 6A-6B demonstrate configurations for reducing miscoding for multiplex detection of target nucleic acids. As shown in FIG. 6A, the particular SGC is miscoded because the PA is truncated, which could occur during manufacturing of the PA. In this case in FIG. 6A, L4 is not bound due to truncation. As shown in FIG. 6B, the same truncation will not cause miscoding if the labels are intertwined or intermingled on the PA. Arranging different labels into alternating positions reduces the chance of miscoding.
FIGS. 7A-7B show a method to minimize potential miscoding caused by truncation by randomizing the position of different labels on the AP or PA. As illustrated in FIGS. 7A-7B, the multiplexing channel ID is encoded on the AP molecule. In FIG. 7A, different label probes are positioned on each AP in exactly the same way, that is, each amplifier in the SGC is the same. Truncation of some of the APs can cause substantial reduction in certain labels being bound to the target nucleic acid compared to other labels on different positions of the AP. This imbalance increases the chance of miscoding. In the most severe case, truncation could cause the loss of all copies of one certain label, leading to an outright miscode. In FIG. 7B, locations of different labels on the APs are randomized. The APs are provided as a plurality of amplifiers, where a mix of non-identical amplifiers is included, where the position of LAs for specific label probes are distributed differently and can be randomized on the non-identical amplifiers. Truncation therefore does not cause a large bias in the number of labels in the SGC.
FIG. 8 shows an exemplary embodiment of the methods disclosed herein for multiplexed detection of target nucleic acids, by generation of two separate independent identifiers of each target nucleic acid using two separate rounds of labeling and detection. The identifiers are incorporated at the label-probe level, using the same regions of each target analyte in each round. In the first round, each analyte (i.e. “gene” in this figure) is labeled with one type of the unique SGCs (i.e. “trees” in this figure), each of which provides signals that forms a unique combinatorial code. For example, for SGC “E” with alphanumeric code E, the SGC includes three different label probes, but not the fourth. Once these signals are detected and recorded to generate the first independent identifier of each target gene, the signals are removed, and the analyte is labeled with another type of SGC. The second set of signals is then detected and recorded. This procedure can continue with more rounds. The expanded code can then be used to identify the analyte. Note in this figure the individual labels are not distinguished for simplicity. Round 2 can target either the same sequence of the target as in the first round, or a different sequence. If the same sequence is targeted, the T-sections of the target probes can be the same as those used in the previous round, and the entire signal-generating complex is removed between each round. If a different sequence is targeted, the T-sections of the target probes will be different, and only the fluorophore need be cleaved from the SGCs between each round.
FIG. 9 shows a representative illustration of an embodiment using different combinations of labels at the SGC level. In the first round, each analyte (i.e. “gene” in this figure) is labeled with one combination of the unique SGCs (i.e. “trees” in this figure), each of which provides one signal only. For example, SGC “a” contains the first signal only but not
others. Once these signals (i.e., the first independent identifier) are detected and recorded, either the signals or the SGCs are removed, and analyte is labeled with another combination of unique SGCs for the second round. The second set of the combinatorial code is then detected and recorded. This procedure can continue with more rounds. The expanded combinatorial code can then be used to identify the analyte. Round 2 can target either the same sequence of the target as in the first round, or a different sequence. If the same sequence is targeted, the T- sections of the target probes can be either the same as those used in the previous round or different from those used in the previous round, and the entire signal-generating complex is removed between each round. If a different sequence is targeted, the T-sections of the target probes will be different, and only the fluorophore needs be cleaved from the SGCs between each round.
FIGS. 10A-10C show an exemplary assay workflows for the methods disclosed herein.
FIG. 11 shows a representative image demonstrating detection of a single gene (RNA transcript), Ubc, in a two-cycle experiment according to the methods disclosed herein. In each round of labeling, the independent identifier in this experiment was based on a single fluorescent label. The signals detected in the first round are indicated by triangles in the image, and signals detected in the second round are indicated by squares. Locations with overlapping triangles and squares are the locations where the target gene, was detected.
FIGS. 12A-12C show representative images from a single two-cycle experiment to detect three genes (RNA transcripts Gadph, Sdha, and Ubc). Each target was assigned the same first independent identifier based on a single fluorescent label, and each target was assigned a different second independent identifier based on a single fluorescent label. FIG. 12A shows detection of Gadph, FIG. 12B shows detection of Sdha, and FIG. 12C shows detection of Ubc. In each image, the signals detected in the first cycle represent all three genes, and are shown in triangles. In FIG. 12 A, signals detected in the second cycle are indicated by circles, and locations with overlapping triangles and circles are the locations where the combined first and second identifiers show that Gadph was detected. In FIG. 12B, signals detected in the second cycle are indicated by crosses, and locations with overlapping triangles and crosses are the locations where the combined first and second identifiers show that Sdha was detected. In FIG. 12C, signals detected in the second cycle are indicated by squares, and locations with overlapping triangles and squares are the locations where the combined first and second identifiers show that Ubc was detected.
FIGS. 13A-13C show representative images from a first two-cycle experiment to detect one gene (RNA transcript Hs-PPIB), with the first and second identifiers each based on two
fluorescent labels. Further details can be found in Example 3. FIG. 13A shows images from the first round of labeling and detection, with panels showing detection of DAPI (upper left), Dylight 550 (upper middle, no label detected), Dylight 650 (upper right, label detected), Alexa Fluoro 488 (lower left, no label detected), Alexa Fluoro 750 (lower middle, label detected), and an overlay (lower right). FIG. 13B shows images from the second round of labeling and detection, with panels showing detection of DAPI (upper left), Dylight 550 (upper middle, label detected), Dylight 650 (upper right, label detected), Alexa Fluoro 750 (lower left, no label detected), Alexa Fluoro 488 (lower middle, no label detected), and an overlay (lower right). FIG. 13C is an overlay of all signals detected in the first and second rounds, with shaded circles showing “dots” having the complete identifier “code” for the target.
FIGS. 14A-14C show representative images from a second two-cycle experiment to detect one target nucleic acid (RNA transcript Hs-PPIB), with the first and second identifiers each based on two fluorescent labels, where the order of the identifiers was reversed compared to those in the experiments shown in FIGS. 13A-13C. Further details can be found in Example 3. FIG. 14A shows images from the first round of labeling and detection, with panels showing detection of DAPI (upper left), Dylight 550 (upper middle, label detected), Dylight 650 (upper right, label detected), Alexa Fluoro 488 (lower left, no label detected), Alexa Fluoro 750 (lower middle, no label detected), and an overlay (lower right). FIG. 14B shows images from the second round of labeling and detection, with panels showing detection of DAPI (upper left), Dylight 550 (upper middle, no label detected), Dylight 650 (upper right, label detected), Alexa Fluoro 750 (lower left, label detected), Alexa Fluoro 488 (lower middle, no label detected), and an overlay (lower right). FIG. 14C is an overlay of all signals detected in the first and second rounds, with shaded circles showing “dots” having the complete identifier “code” for the target.
FIGS. 15A-15C show representative images from a single ISH labeling experiment to detect one target nucleic acid (RNA transcript Hs-PPIB) in Hela cells, to demonstrate that individual PPIB signals (coded with alphanumeric Code 8) can be detected in any order to generate an independent identifier, as further described in Example 4. FIG. 15A shows images detected using two different Excitation/Emission filter sequences. FIG. 15B shows an overlay of the images from each sequence. FIG. 15C presents overlays showing individual “dots” for detection of PPIB.
FIGS. 16A-16B show exemplary embodiments of the methods disclosed herein for multiplexed detection of target nucleic acids, by generation of two separate independent identifiers of each target nucleic acid using two separate rounds of labeling and detection. In
FIG. 16A, the identifiers are incorporated at the label-probe level, using the same regions of each target analyte in each round. In the first round, each analyte (i.e. , “gene” in this figure) is labeled with one type of the unique SGCs (i.e., “trees” in this figure), each of which provides signals that forms a unique combinatorial code. For example, for SGC “E” with alphanumeric code E, the SGC includes three different label probes (L4, L3, and L2), but not the fourth (LI). Once these signals are detected and recorded to generate the first independent identifier of each target gene, the signals are removed, and the analyte is labeled with another type of SGC. The second set of signals is then detected and recorded. This procedure can continue with more rounds. The expanded code can then be used to identify the analyte. Round 2 can target either the same sequence of the target as in the first round, or a different sequence. If the same sequence is targeted, the T-sections of the target probes can be the same as those used in the previous round, and the entire signal-generating complex is removed between each round. If a different sequence is targeted, the T-sections of the target probes will be different, and only the fluorophore need be cleaved from the SGCs between each round. FIG. 16B shows a representative illustration of an embodiment using different combinations of labels at the SGC level. In the first round, each analyte (i.e., “gene” in this figure) is labeled with one combination of the unique SGCs (i.e., “trees” in this figure), each of which provides one signal only. For example, SGC “a” contains the first signal only but not others. Once these signals (i.e., the first independent identifier) are detected and recorded, either the signals or the SGCs are removed, and analyte is labeled with another combination of unique SGCs for the second round. The second set of the combinatorial code is then detected and recorded. This procedure can continue with more rounds. The expanded combinatorial code can then be used to identify the analyte. Round 2 can target either the same sequence of the target as in the first round, or a different sequence. If the same sequence is targeted, the T-sections of the target probes can be either the same as those used in the previous round or different from those used in the previous round, and the entire signal-generating complex is removed between each round. If a different sequence is targeted, the T-sections of the target probes will be different, and only the fluorescent signals need to be removed from the SGCs between each round.
FIG. 17 shows representative graphs of spatial transcriptomic profiling in a 1 month old C57BL/6 mouse brain (top, motor cortex) and a 1 year old C57BL/6 mouse brain (bottom, lateral cortex). Each of the 100 target genes is differentially expressed by a unique cell type in the mouse brain and thus can be used to mark the particular cell type. The correlation coefficient (R value) ranges from 0.6-0.81, demonstrating that the disclosed method (referred
to as “Ultraplex assay”) can effectively profile the gene expressions of mouse brain cells at single-cell level.
FIG. 18 shows representative graphs of data pertaining to minor gene expression discrepancies between the Ultraplex assay and the control assay tested in FIG. 17. The data indicated that the Ultraplex assay resulted in more accurate measurements of the gene expressions in mouse brain (as suggested by the Hiplex assay performed in the same mouse brain tissues against these disputed genes).
FIG. 19 shows representative graphs correlating gene expression data from the Ultraplex assay with annotated cell types from control assays. Between 10-16 major cell types in the different mouse brain regions were correctly identified using the Ultraplex assay, indicating consistent spatial distribution across the mouse brain datasets.
DETAILED DESCRIPTION
Disclosed herein are methods for multiplex analysis of nucleic acids by in situ hybridization. The disclosed methods allow the detection of multiple target nucleic acids within the same sample and within the same cell. Each target nucleic acid is labeled with one or more detectable labels that generate a first independent identifier of the target nucleic acid. Removal of the signals followed by a second round of labeling generates a second independent identifier of the target nucleic acid. A combination of the first and second independent identifiers allows for identification of a target nucleic acid, independent of the order by which the first and second independent identifiers are obtained.
Definitions
Unless otherwise defined herein, scientific and technical terms used in connection with the present disclosure shall have the meanings that are commonly understood by those of ordinary skill in the art. For example, any nomenclatures used in connection with, and techniques of, cell and tissue culture, molecular biology, immunology, microbiology, genetics, and protein and nucleic acid chemistry and hybridization described herein are those that are well known and commonly used in the art. The meaning and scope of the terms should be clear; in the event, however of any latent ambiguity, definitions provided herein take precedent over any dictionary or extrinsic definition. Further, unless otherwise required by context, singular terms shall include pluralities and plural terms shall include the singular.
The terms “detecting” as used herein generally refer to any form of measurement, and include determining whether an element is present or not. This term includes quantitative and/or qualitative determinations.
As used herein, the term “endogenous” refers to the substances originating from within an organism. As used herein, the term “exogenous” refers to the substances originating from outside an organism.
The terms “nucleic acid” and “polynucleotide” are used interchangeably herein to describe a polymer of any length composed of nucleotides, e.g., deoxyribonucleotides or ribonucleotides, or compounds produced synthetically, which can hybridize with naturally occurring nucleic acids in a sequence specific manner analogous to that of two naturally occurring nucleic acids, e.g. , can participate in Watson-Crick base pairing interactions. As used herein in the context of a polynucleotide sequence, the term “bases” (or “base”) is synonymous with “nucleotides” (or “nucleotide”), i.e., the monomer subunit of a polynucleotide. The terms “nucleoside” and “nucleotide” are intended to include those moieties that contain not only the known purine and pyrimidine bases, but also other heterocyclic bases that have been modified. Such modifications include methylated purines or pyrimidines, acylated purines or pyrimidines, alkylated riboses or other heterocycles. In addition, the terms “nucleoside” and “nucleotide” include those moieties that contain not only conventional ribose and deoxyribose sugars, but other sugars as well. Modified nucleosides or nucleotides also include modifications on the sugar moiety, e.g. , wherein one or more of the hydroxyl groups are replaced with halogen atoms or aliphatic groups, or are functionalized as ethers, amines, or the like. “Analogues” refer to molecules having structural features that are recognized in the literature as being mimetics, derivatives, having analogous structures, or other like terms, and include, for example, polynucleotides incorporating non-natural nucleotides, nucleotide mimetics such as 2’- modified nucleosides, peptide nucleic acids, oligomeric nucleoside phosphonates, and any polynucleotide that has added substituent groups, such as protecting groups or linking moieties.
The term “complementary” refers to specific binding between polynucleotides based on the sequences of the polynucleotides. As used herein, a first polynucleotide and a second polynucleotide are complementary if they bind to each other in a hybridization assay under stringent conditions, e.g. , if they produce a given or detectable level of signal in a hybridization assay. Portions of polynucleotides are complementary to each other if they follow conventional base-pairing rules, e.g., A pairs with T (or U) and G pairs with C, although small regions e.g., fewer than about 3 bases) of mismatch, insertion, or deleted sequence may be present.
The term “sample” as used herein relates to a material or mixture of materials containing one or more components of interest. The term “sample” includes “biological sample” which refers to a sample obtained from a biological subject, including a sample of biological tissue or fluid origin, obtained, reached, or collected in vivo or in situ. A biological sample also includes samples from a region of a biological subject containing precancerous or cancer cells or tissues. Such samples can be, but are not limited to, organs, tissues, cells, and exosomes isolated from a mammal. Exemplary biological samples include but are not limited to cell lysate, a cell culture, a cell line, a tissue, oral tissue, gastrointestinal tissue, an organ, an organelle, a biological fluid, a blood sample, a urine sample, a skin sample, and the like. Preferred biological samples include, but are not limited to, whole blood, partially purified blood, PBMC, tissue biopsies, and the like.
The term “probe” as used herein refers to a capture agent that is directed to a specific target mRNA sequence. Accordingly, each probe of a probe set has a respective target mRNA sequence. In some embodiments, the probe provided herein is a “nucleic acid probe” or “oligonucleotide probe” which refers to a nucleic acid capable of binding to a target nucleic acid of complementary sequence, such as the mRNA biomarkers provided herein, usually through complementary base pairing by forming hydrogen bond. As used herein, a probe may include natural (e.g., A, G, C, or T) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe may be joined by a linkage other than a phosphodiester bond, so long as it does not interfere with hybridization. The probes can be directly or indirectly labeled with tags, for example, chromophores, lumiphores, or chromogens. By assaying for the presence or absence of the probe, one can detect the presence or absence of a target mRNA biomarker of interest.
As used herein, the term “plurality” means two or more. Thus, a plurality can refer to, for example, 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, 10 or more, 11 or more, 12 or more, 13 or more, 14 or more, 15 or more, 16 or more, 17 or more, 18 or more, 19 or more, 20 or more, 21 or more, 22 or more, 23 or more, 24 or more, 25 or more, 26 or more, 27 or more, 28 or more, 29 or more, 30 or more, 31 or more, 32 or more, 33 or more, 34 or more, 35 or more, 36 or more, 37 or more, 38 or more, 39 or more, 40 or more, 41 or more, 42 or more, 43 or more, 44 or more, 45 or more, 46 or more, 47 or more, 48 or more, 49 or more, 50 or more, 55 or more, 60 or more, 65 or more, 70 or more, 75 or more, 80 or more, 85 or more, 90 or more, 95 or more, 100 or more, 110 or more, 120 or more, 130 or more, 140 or more, 150 or more, 160 or more, 170 or more, 180 or more, 190 or more,
200 or more, 300 or more, 400 or more, 500 or more, 600 or more, 700 or more, 800 or more, 900 or more, or 1000 or more, or even a greater number, if desired for a particular use.
As used herein, the term “one or more” refers to, for example, 1 or more, 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, 10 or more, 11 or more, 12 or more, 13 or more, 14 or more, 15 or more, 16 or more, 17 or more, 18 or more, 19 or more, 20 or more, 21 or more, 22 or more, 23 or more, 24 or more, 25 or more, 26 or more, 27 or more, 28 or more, 29 or more, 30 or more, 31 or more, 32 or more, 33 or more, 34 or more, 35 or more, 36 or more, 37 or more, 38 or more, 39 or more, 40 or more, 41 or more, 42 or more, 43 or more, 44 or more, 45 or more, 46 or more, 47 or more, 48 or more, 49 or more, 50 or more, 55 or more, 60 or more, 65 or more, 70 or more, 75 or more, 80 or more, 85 or more, 90 or more, 95 or more, 100 or more, 110 or more, 120 or more, 130 or more, 140 or more, 150 or more, 160 or more, 170 or more, 180 or more, 190 or more, 200 or more, 300 or more, 400 or more, 500 or more, 600 or more, 700 or more, 800 or more, 900 or more, or 1000 or more, or even a greater number, if desired for a particular use.
As used in the present disclosure and claims, the singular forms “a,” “an,” and “the” include plural forms unless the context clearly dictates otherwise.
It is understood that wherever embodiments are described herein with the term “comprising” otherwise analogous embodiments described in terms of “consisting of’ and/or “consisting essentially of’ are also provided. It is also understood that wherever embodiments are described herein with the phrase “consisting essentially of’ otherwise analogous embodiments described in terms of “consisting of’ are also provided.
The term “between” as used in a phrase as such “between A and B” or “between A-B” refers to a range including both A and B.
The term “and/or” as used in a phrase such as “A and/or B” herein is intended to include both A and B; A or B; A (alone); and B (alone). Likewise, the term “and/or” as used in a phrase such as “A, B, and/or C” is intended to encompass each of the following embodiments: A, B, and C; A, B, or C; A or C; A or B; B or C; A and C; A and B; B and C; A (alone); B (alone); and C (alone).
Methods For Multiplexed Nucleic Acid Detection
Disclosed herein are methods for detecting a plurality of target nucleic acids in a sample.
Previously-disclosed methods of detecting target nucleic acids are described in US Patent No. 7,709,198 and European Patent No. 2500439, which are incorporated herein by
reference. These references describe methods of in situ detection of nucleic acid targets in which target probes (TP) are arranged in sets of two or more short probes adjacent to each other when they are hybridized to the target. As shown in FIG. 1A, each individual target probe has a target section (T) complementary to the target and a label section (L) complementary to a component of a Signal Generating Complex (SGC). The T sections of the one or more target probe(s) are complementary to non-overlapping regions of the target nucleic acid, and the L sections of the one or more target probe(s) are complementary to non-overlapping regions of the nucleic acid component of the SGC. Each SGC comprises multiple layers of components, such as Amplifiers (AMP) and Pre-Amplifiers (PA), which assemble into tree-like structures capable of carrying many Label Probes (LP) on their “branches.” As shown in FIG. IB, if the target sequence is sufficiently long, many TP sets and associated SGCs can be assembled on the target to generate a detectable signal that appears in an imaging system as a discrete “dot.” Manual or computerized dot counting can be conducted to quantify the number of targets in a particular cell in a sample. FIG. 2 shows two additional different configurations of the SGC, in which additional layer(s) of amplification molecules, such as a Collaboration Amplifier (COM), are incorporated to carry more LPs in one SGC. Such configurations are more suitable for detecting short target sequences because a detectable signal can be generated with a single SGC (see, for example, WO 2017/066211, which is incorporated herein by reference).
In many applications, it is valuable to detect multiple targets in the same assay. Previously-disclosed methods address this need in various ways. In a “pooling” approach, each target has unique TP sets but all TP sets have the same L sections, and can therefore be bound to the same SGCs. In this way, a signal is detected when any one of the multiple targets is present. The pooling approach is useful when a group of targets has the same clinical utility or biological functionality. Other methods use a multiplexing approach, in which each target has its own unique SGC that does not cross-hybridize, generating a unique identifiable signal for each target when it is present. The multiplexing approach is useful when each target in the group provides a different clinical or biological indication alone or in combination. In previously-described methods, each unique signal is generated by a large number of LPs carrying the same label. A drawback to this approach is that there are usually a limited number of uniquely identifiable labels. In fluorescent detection modalities, for example, four to six fluorophores at different wavelengths are commonly used. More than six fluorophores in an imaging-based multiplexing system is possible, but can become challenging due to bandwidth limits and cross-talk between wavelengths. This limitation can impose a limit on the number of targets that can be multiplexed in an assay.
The methods disclosed herein allow for multiplexed detection of large numbers of targets in a single sample. Each target in the sample is detected by generating at least two separate independent identifiers of each target in the sample (e.g., two, three, or more independent identifiers), and then combining the separate independent identifiers to obtain a unique identifier for each target in the sample. The combination of multiple independent identifiers allows for simultaneous detection of large numbers of different targets in the same sample. Each independent identifier is based on the number of distinct detectable labels associated with each target. Notably, each target can be identified independent of the order by which the independent identifiers are obtained. As discussed further herein, in some embodiments, the targets are nucleic acids, such as RNA.
In some embodiments, the method comprises first contacting the sample with a plurality of target probe sets, wherein each target probe set is complementary to a target nucleic acid. This step is followed by contacting the sample with a first plurality of signal generating complexes (SGCs), each capable of hybridizing to a target probe set, and each comprising a plurality of spatially distributed label probes (LPs) comprising detectable labels. A first independent identifier is then obtained for each target nucleic acid, based on the number of distinct detectable labels present in each SGC of the first plurality of SGCs. In other embodiments, the method comprises first contacting the sample with a plurality of target probe sets, wherein each target probe set is complementary to a target nucleic acid. This step is followed by contacting the sample with a plurality of amplification complexes capable of hybridizing to at least one target probe set, wherein each amplification complex comprises a plurality of spatially distributed binding sites for label probes, and then contacting the sample with a first plurality of label probes comprising detectable labels, which hybridize to binding sites on the amplification complexes to generate a first plurality of signal generating complexes (SGCs). A first independent identifier is then obtained for each target nucleic acid, based on the number of distinct detectable labels present in each SGC of the first plurality of SGCs.
In one embodiment, anywhere from zero to ten different LPs bearing different detectable labels (e.g., from zero to eight, zero to six, or zero to four different detectable labels) can form combinations to generate each independent identifier. Examples are shown in Table la, where four different labels (LI, L2, L3 and L4) can generate 16 unique combinations for the first independent identifier. Further examples are shown in Table lb, where five different labels (LI, L2, L3, L4, and L5) can generate 32 unique combinations for the first independent identifier. Use of six different labels would produce yet additional unique combinations. For example, fluorophores that can be used in combination to generate four, five, or six LPs include
those with excitation maxima at or near at 488 nm, 550 nm, 594 nm, 650 nm, 700 nm, and/or 750 nm. The independent identifiers can each be easily identified, e.g., using an alphanumeric code, to generate a first independent identifier of each target. Specific methods and probe configurations for associating these different combinations of spatially distributed LPs with each target nucleic acid will be further discussed below.
Table la. Combinations of labels based on four distinct labels to generate a first independent identifier of each target (shown using an alphanumeric code).
Table lb. Combinations of labels based on four distinct labels to generate a first independent identifier of each target (shown using an alphanumeric code).
Following generation of the first independent identifier, the signals generated by the detectable labels are removed from the sample, e.g., by removing the labels from the first plurality of SGCs, or by removing the first plurality of SGCs from the sample (as further discussed below). Next, the method comprises contacting the sample with a second plurality of SGCs, each capable of hybridizing to a target probe set, and each comprising a plurality of spatially distributed label probes (LPs) comprising detectable labels. Optionally, this step is preceded by contacting the sample with a second plurality of target probe sets, wherein each target probe set in the second plurality of target probe sets is complementary to a target nucleic acid. After contacting the sample with the second plurality of SGCs, a second independent identifier is obtained for each target nucleic acid, based on the number of distinct detectable labels present in each SGC of the second plurality of SGCs.
In some embodiments, the method further comprises identifying a target nucleic acid of the plurality of target nucleic acids based on a combination of the first and second independent identifiers (e.g., via generation of a “combined” identifier). Examples are shown in Table 2, where 16 target genes all have the same first independent identifier (alphanumeric code 1, in this example), and a second round of labeling provides a second independent identifier using four different labels (LI, L2, L3 and L4) that can generate 16 unique combinations for the second independent identifier.
Table 2. Combinations of labels based on four distinct labels to generate a second independent identifier of each target nucleic acid.
Notably, adding the second independent identifier doubles the total number of labels being used for each target nucleic acid. Accordingly, when 4 total labels are used in each round, and two cycles of labeling and detection are completed, a maximum of 255 targets can be detected (N = 8; 2N - 1 = 255). (Note that the combined identifier 00 corresponds to no signal in either round of detection, so this code is not used.) When 5 total labels are used in each round, and two cycles of labeling and detection are completed, a maximum of 1023 targets can be detected (N = 10; 2N - 1 = 1023).
In some embodiments, rather than identifying a target nucleic acid of the plurality of target nucleic acids based on a combination of the first and second independent identifiers, the method further comprises another round of labeling to generate a third independent identifier. Adding a third independent identifier further increases the number of independent targets that can be identified. When 4 total labels are used in each round, and three cycles of labeling and detection are completed, a total of 4095 targets can be detected (N = 12; 2N - 1 = 4095). When 5 total labels are used in each round, and three cycles of labeling and detection are completed, a total of 32767 targets can be detected (N = 15; 2N - 1 = 32767). Yet additional rounds of labeling can further increase the number of target nucleic acids that can be detected. Of course, fewer than the maximum total number of targets can be detected in any given assay. a. Target Probe Sets
The disclosed methods comprise one or more steps of contacting a sample with a plurality of target probe sets, wherein each target probe set is complementary to a target nucleic acid. In the plurality of target probe sets, each individual set of target probes is specific for (i.e., specifically hybridizes to) each target nucleic acid in the plurality of target nucleic acids being
detected in the method. Each target probe set comprises one or more pairs of individual target probes.
As used herein, a “target probe” refers to a polynucleotide that is capable of hybridizing to a target (e.g., a target nucleic acid) and capturing or binding a signal-generating complex (SGC) component to that target. For example, the target probe can hybridize to an amplifier, a pre-amplifier or a pre-pre-amplifier in an SGC. The target probe thus includes a first polynucleotide sequence that is complementary to a polynucleotide sequence of the target nucleic acid, and a second polynucleotide sequence that is complementary to a polynucleotide sequence of a component of a SGC (amplifier, pre-amplifier, pre-pre-amplifier, or the like). The target probe is generally single stranded so that the complementary sequence is available to hybridize with a corresponding target, amplifier, pre-amplifier, pre-pre-amplifier, or the like.
Each target probe comprises a target (T) section and a label (L) section, wherein the T section is a nucleic acid sequence complementary to a section on the target nucleic acid and the L section is a nucleic acid sequence complementary to a section on the nucleic acid component of the SGC. Within each target probe pair, the T sections are complementary to non-overlapping regions of the target nucleic acid, and the L sections are complementary to non-overlapping regions of the nucleic acid component(s) of the SGC.
In each target probe pair, the two target probes can have the same directionality, or opposite directionalities. For example, in some embodiments, T section of each target probe in the target probe pair is 3 ’ of its L section. In some embodiments, the T section of each target probe in the target probe pair is 5’ of its L section. In some embodiments, the T section of one target probe in the target probe pair is 3’ of its L section, and the T section of the other target probe in the target probe pair is 5 ’ of its L section.
In some embodiments, each target probe comprises a non-binding portion separating its T section from its L section (e.g., a spacer or linker region). In some embodiments, this nonbinding portion does not bind to either the target nucleic acid or the nucleic acid component of the SGC, but is useful in forming the SGC. In some embodiments, this non-binding region can form the unpaired loop section of a hairpin loop. In other embodiments, none of the target probes contains a non-binding portion.
In some embodiments, one pair of target probes is used to detect each target nucleic acid. In other embodiments, two or more pairs of target probes are used to detect each target nucleic acid. In such a case, the pairs of target probes in the target probe set specific for a particular target nucleic acid bind to different and non-overlapping sequences of the target nucleic acid. For example, when two or more pairs of target probes specifically hybridize to
the same target nucleic acid, the molecule that binds to the target probe pairs (i. e. , an amplifier, a pre-amplifier, or a pre-pre-amplifier) generally are the same for target probe pairs in the same target probe set. Thus, the target probe pairs that bind to the same target nucleic acid can be designed to include the same binding site for the molecule in the SGC that binds to the target probe pairs (e.g., a pre-amplifier or pre-pre-amplifier). The use of multiple target probe pairs to detect a target nucleic acid can provide for a higher signal associated with the assembly of multiple SGCs on the same target nucleic acid. In some embodiments, the number of target probe pairs used for binding to the same target nucleic acid are in the range of 1-10, 1-20, 1- 30, 1-40, 1-50, 1-60, 1-70, 1-80, 1-90, 1-100, 1-110, 1-120, 1-130, 1-140, 1-150, 1-160, 1-170, 1-180, 1-190, or 1-200 pairs per target, or larger numbers of pairs, or any integer number of pairs in between, for example, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20,
21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45,
46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70,
71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95,
96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134,
135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153,
154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172,
173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191,
192, 193, 194, 195, 196, 197, 198, 199, 200, and the like.
Each target probe pair can be designed to bind to immediately adjacent segments of the target nucleic acid, or on segments that have one to a number of bases between the target probe binding sites of the target probe pair. Generally, target probe pairs are designed for binding to the target nucleic acid such that there are generally between 0 to 500 bases between the binding sites on the target nucleic acid, for example, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 120, 140, 160, 180, 200, 220, 240, 260, 280, 300, 320, 340, 360, 380, 400, 420, 440, 460, 480, or 500 bases, or any integer length in between. In particular embodiments, the binding sites for the pair of target probes are between 0 to 100, 0 to 200, or 0 to 300 bases, or any integer length in between. In the case where more than one target probe pair is used in a target probe set to bind to the same target, and where there is a gap in the binding sites between a pair of target probes, it is understood that the binding sites of different target probe pairs do not overlap. In the case of detecting double stranded nucleic acids, such as DNA, some overlap between different target probe pairs can occur, so long as the target probe pairs are able to concurrently bind to the respective binding sites of the double stranded target nucleic acid.
b. Signal Generating Complexes and Labels
In some embodiments, the disclosed methods comprise one or more steps of contacting a sample with a plurality of signal generating complexes (SGCs) capable of hybridizing to at least one target probe set, wherein each SGC comprises a plurality of spatially distributed label probes comprising detectable labels. In some embodiments, the disclosed methods comprise one or more steps of contacting a sample with a plurality of amplification complexes capable of hybridizing to at least one target probe set, wherein each amplification complex comprises a plurality of spatially distributed binding sites for label probes; subsequently, the sample is contacted with a plurality of label probes comprising detectable labels, which hybridize to binding sites on the amplification complexes to generate a plurality of SGCs . Various configurations of amplification complexes and SGCs can be employed in the above methods, to allow for binding of 0, 1, 2, 3, or 4 different labels to each target nucleic acid, to generate each independent identifier for the target nucleic acid (e.g., a first independent identifier, a second independent identifier, a third independent identifier, etc.)
As used herein, the term “label probe” refers to an entity that binds to a target molecule, directly or indirectly, generally indirectly, and allows the target to be detected. A label probe (or “LP”) contains a nucleic acid binding portion that is typically a single stranded polynucleotide or oligonucleotide that comprises one or more labels which directly or indirectly provides a detectable signal. The label can be covalently attached to the polynucleotide, or the polynucleotide can be configured to bind to the label. For example, a biotinylated polynucleotide can bind a streptavidin-associated label. Generally, the label probe can hybridize to a nucleic acid that is in turn hybridized to the target nucleic acid or to one or more other nucleic acids that are hybridized to the target nucleic acid. Thus, the label probe can comprise at least one polynucleotide sequence that is complementary to a polynucleotide sequence in an amplifier, pre-amplifier, or pre-pre-amplifier in an amplification complex or SGC.
In some embodiments, the amplification complexes and SGCs provided herein comprise components such an amplifier, a pre-amplifier, and/or a pre-pre-amplifier. As used herein, an “amplifier” is a molecule, typically a polynucleotide, that is capable of hybridizing to multiple label probes. The amplifier hybridizes to multiple identical label probes, or multiple different label probes. The amplifier can also hybridize to a target nucleic acid, to at least one target probe of a pair of target probes, to both target probes of a pair of target probes, or to nucleic acid bound to a target probe such as a pre-amplifier or pre-pre-amplifier. For example, the amplifier can hybridize to at least one target probe and to a plurality of label probes, or to
a pre-amplifier and a plurality of label probes. The amplifier can be, for example, a linear, forked, comb-like, or branched nucleic acid. As described herein for all polynucleotides, the amplifier can include modified nucleotides and/or nonstandard intemucleotide linkages as well as standard deoxyribonucleotides, ribonucleotides, and/or phosphodiester bonds. Suitable amplifiers are described, for example, in U.S. Patent Nos. 5,635,352, 5,124,246, 5,710,264, 5,849,481, and 7,709,198 and U.S. publications 2008/0038725 and 2009/0081688, each of which is incorporated by reference.
As used herein, a “pre-amplifier” is a molecule, typically a polynucleotide, that serves as an intermediate binding component between one or more target probes and one or more amplifiers. Typically, the pre-amplifier hybridizes simultaneously to one or more target probes and to a plurality of amplifiers. Exemplary pre-amplifiers are described, for example, in U.S. Patent Nos. 5,635,352, 5,681,697 and 7,709,198 and U.S. publications 2008/0038725, 2009/0081688 and 2017/0101672, each of which is incorporated by reference.
As used herein, a “pre-pre-amplifier” is a molecule, typically a polynucleotide, that serves as an intermediate binding component between one or more target probes and one or more pre-amplifiers. Typically, the pre-pre-amplifier hybridizes simultaneously to one or more target probes and to a plurality of pre-amplifiers. Exemplary pre-pre-amplifiers are described, for example, in U.S. Patent No. 11,078,528, which is incorporated by reference.
Each SGC comprises a plurality of spatially distributed label probes comprising detectable labels. The labeling scheme can be implemented in different ways. For example, in some embodiments, the labeling scheme can be employed at the SGC level. As shown in FIG. 3, for generation of an independent identifier with N unique labels, N unique label-specific SGCs can be made. Each SGC carries the same LP with a specific label. Components of each SGC (such as PAs, APs, LPs, etc.) are uniquely associated with the SGC. They are specially designed to hybridize to each other to assemble the SGC but cannot cross-hybridize to any components of any other SGC. In the exemplary embodiment shown in FIG. 3, two target nucleic acids are shown bound to the respective SGCs. For one target nucleic acid (upper target in FIG. 3), the SGCs for the target comprise four labels (LI, L2, L3, and L4). For a second target (lower target in FIG. 3), the SGCs for the target nucleic acid comprise two labels (L2 and L4). When the target sequence is sufficiently long, many TP pairs can be designed to bind specifically to the target. Each TP pair can be coupled to a selected SGC through its L sections (see FIG. 1). Many SGCs can be captured to each target in this way. An independent identifier of each target can be generated through the unique combination of SGCs. An advantage of SGC-level implementation is that only a relatively small number of different SGCs need to be
developed. Since it is essential that components of each SGC do not cross-hybridize to other SGCs, the amount of work involved in developing a large number of SGCs is substantial. With this configuration, the length of target sequence has to be sufficiently long to accommodate many SGCs. Some SGCs may not be assembled successfully on the target due to accessibility of the target sequence. This may lead to miscoding, which can be addressed as described below.
Accordingly, in one embodiment, the plurality of SGCs comprise: (a) a set of preamplifiers, wherein the pre-amplifier set comprises one or more subsets of pre-amplifiers, wherein the one or more pre-amplifier subsets comprise a pre-amplifier specific for each of the target probe pairs in the plurality of target probe sets, wherein each pre-amplifier comprises binding sites for the pair of target probes of one of the target probe sets and a plurality of binding sites for an amplifier; (b) a set of amplifiers, wherein the amplifier set comprises one or more subsets of amplifiers specific for each pre-amplifier subset, wherein each amplifier subset comprises a plurality of amplifiers, wherein the amplifiers of one of the amplifier subsets comprise a binding site for the pre-amplifiers of one of the pre-amplifier subsets and a plurality of binding sites for a label probe; and (c) a set of label probes, wherein the label probe set comprises one or more subsets of label probes, wherein each label probe subset is specific for one of the amplifier subsets, wherein each label probe subset comprises a plurality of label probes, wherein the label probes in each of the label probe subsets comprise a label and a binding site for the amplifiers of one of the amplifier subsets, wherein the labels in each label probe subset are distinguishable between the label probe subsets; wherein the one or more label probe subsets in each probe subset specific for a target nucleic acid comprise at least one label or a combination of labels that is different for each probe subset (see FIG. 3).
The labeling scheme can also be implemented at the component level within an SGC (i.e. , sub-SGC level). Such a system comprises N different, label specific LPs and 2N-1 unique, target-specific SGCs. Each LP has a segment designed to hybridize to a label probe anchor (LA) on an AP molecule (i.e., the LA being the binding site on the amplifier for the label probe) in the SGC. A mixture of different LAs are designed and made to bind a set of pre-determined, different LPs, generating a unique combination of detectable labels that are used to identify the target. The sub-SGC level implementation can be advantageous when the target sequence is shorter. In addition, the probability of miscoding is reduced, as discussed below. This approach can be adopted to detect a single base variant or a unique junction in the target sequence (see, for example, WO 2017/066211). However, a comparatively large number of unique SGCs (2N- 1) need to be developed.
FIG. 4A illustrates one embodiment of sub-SGC implementation, where the above mentioned SGC ID code is implemented on the AP molecule. As shown in FIG. 4A, an AP has one region designed to bind to the amplifier anchor (AA) on the PA (i.e., the AA being the binding site on the pre-amplifier for the amplifier) and another region comprising multiple segments of LAs (i.e., the binding sites on the amplifier for the label probes). In previously described methods, the amplifier used repeats of the same LA, that is, the amplifier had a plurality of binding sites for the same label probe. An embodiment using repeats of the same LA for binding a plurality of the same LPs to the amplifier is shown in FIG. 3. In the embodiment shown in FIG. 4A, however, a mixture of different LAs are designed according to the unique identifier of the SGC. For example, if the independent identifier of the SGC includes three separate labels (e.g., LI, L3, and L4, as shown in FIG. 4A), then an equal number of LAs for LPs with LI , L3, and L4 are made on the AP molecule, which will bind a designed number of desired LP to generate the independent identifier.
Accordingly, some embodiments, the plurality of SGCs comprise: (a) a set of preamplifiers, wherein the pre-amplifier set comprises a plurality of pre-amplifiers, wherein the pre-amplifiers comprise binding sites for the pairs of target probes and a plurality of binding sites for an amplifier; (b) a set of amplifiers, wherein the amplifier set comprises a plurality of amplifiers, wherein the amplifiers comprise a binding site for the pre-amplifiers and a plurality of binding sites for a label probe or two or more distinct label probes; and (c) a set of label probes, wherein the label probe set comprises a label probe or two or more distinct label probes, wherein each label probe comprises a label and a binding site for the amplifiers, wherein the labels in each distinct label probe are distinguishable between the distinct label probes; wherein the amplifier in each probe subset specific for a target nucleic acid comprises a binding site for a label or a combination of two or more distinct labels that is different for each probe subset (see FIG. 4A).
In one embodiment of such a method, the label probe set comprises two or more distinct label probes, wherein the amplifier set comprises a plurality of non-identical amplifiers, and wherein the binding sites for the two or more distinct label probes on each non-identical amplifier are in a different order on each non-identical amplifier (see FIG. 7B). This embodiment can be used to reduce miscoding, as described below in more detail.
FIG. 4B illustrates another embodiment of sub-SGC implementation, where the SGC ID code is implemented on the PA molecule. As shown in FIG. 4B, N “pure” AP molecules are made, each carrying the same LA for the same type of LP. A PA molecule has one region designed to bind to the L-section of the target probe (i.e., the segment of the TP that binds to
the pre-amplifier; see FIG. 1A) of a target probe pair, and another region comprising multiple segments of AAs (amplifier anchors, i.e., the segments on the pre-amplifier that bind to the amplifiers). In previously disclosed methods, these are repeats of the same AA. In the embodiment shown in FIG. IB, however, a mixture of different AAs are designed according to the unique identification code of the SGC. For example, if the independent identifier of the SGC is based on two labels, then equal number of AAs for APs carrying the two LPs are made on the PA molecule, which will bind a designed number of desired LPs to generate the identifier in the assay. In the exemplary embodiment shown in FIG. 4B, the code for the SGC shown uses three labels (LI, L3, and L4).
Accordingly, in one embodiment, the plurality of SGCs comprise: (a) a set of preamplifiers, wherein the pre-amplifier set comprises a plurality of pre-amplifiers, wherein the pre-amplifiers comprise binding sites for the pairs of target probes and a plurality of binding sites for amplifiers; (b) a set of amplifiers, wherein the amplifier set comprises a plurality of amplifiers, wherein the plurality of amplifiers comprise an amplifier comprising a binding site for the pre-amplifiers and a plurality of binding sites for a label probe, or wherein the plurality of amplifiers comprise two or more distinct amplifiers, wherein each distinct amplifier comprises a binding site for the pre-amplifiers and a plurality of binding sites for a distinct label probe; and (c) a set of label probes, wherein the label probe set comprises a label probe or two or more distinct label probes, wherein the label probe comprises a label and a binding site for the amplifier, or wherein the two or more distinct label probes comprise a label and a binding site for the two or more distinct amplifiers, wherein the labels on each distinct label probe are distinguishable between the distinct label probes; wherein the pre-amplifier in each probe subset specific for a target nucleic acid comprises a plurality of binding sites for the amplifier comprising a binding site for the label probe or a plurality of binding sites for the two or more distinct amplifiers comprising binding sites for the two or more distinct label probes, and wherein the label of the label probe or combination of two or more distinct labels of the two or more distinct label probes is different for each probe subset (see FIG. 4B).
In one embodiment of such a method, the plurality of amplifiers comprise two or more distinct amplifiers, and wherein the binding sites on the pre-amplifier for the distinct amplifiers are intermingled (see FIG. 6B). As described below in more detail, this embodiment can be used to reduce miscoding.
FIG. 4C illustrates another embodiment of sub-SGC implementation, where the independent identifier is implemented on the LP molecule. In previously described methods, the LP sequences bound to a single SGC all carry the same label. The embodiment of FIG. 3
also utilizes the same labels on a given SGC. In the embodiment depicted in FIG. 4C, LP molecules binding to the same SGC can be a mixture of LPs each conjugated to a different label according to a predefined code book. For example, as shown in FIG. 4C, SGC5 LPs are a mixture of LPs conjugated to three different labels. An advantage of this embodiment is that the “coloring” of the SGC complex by the LPs will be completely randomized, which can further help to reduce coding errors. Since the SGC identifiers are not hard coded in the SGCs, this scheme provides the flexibility to assign different identifiers to different SGCs in different assay configurations simply by devising a different code book on the fly. In addition, the mixing of LPs can be made in unequal amounts to normalize the labeling intensities across the N labels, which again can help in reducing encoding/decoding errors. The disadvantage of this embodiment is that MN- 1 LP and LA sequences will be required to uniquely encode and decode each SGC. In comparison, in the embodiment shown in FIG. 4A, only N unique LPs and LAs are required.
As shown in FIG. 4C, the SGC ID code is implemented on the LP molecule. In this embodiment, LP molecules binding to the same SGC can be mixtures of LPs each conjugated to a different label according to a predefined code book. For example, as shown in FIG. 4C, SGC5 LPs are a mixture of LPs conjugated to three different labels, corresponding to labels LI, L3, and L4. An advantage of this embodiment is that the “coloring” of the SGC complex by the LPs will be completely randomized, which can further help to reduce coding errors. A partial LP mixing code book is shown on the right of FIG. 4C, with 7 different exemplary SGC codes shown using 4 labels.
FIG. 4D shows an embodiment of FIG. 4C in more detail. As depicted in FIG. 4D, for each SGC, a specific label anchor (LA, the binding site on the amplifier for the label probe) is assigned so that each SGC for a particular target nucleic acid has a plurality of the same LAs on the amplifier. The level at which combinatorial labeling can be provided is with the label probes (LPs). In this case, SGC5 is illustrated showing that the amplifiers comprise a plurality of identical LAs, labeled “E.” As shown in FIG. 4D, SGC5 is coded with 3 distinct label probes (LI, L3, and L4), all of which have the same binding site for the plurality of “E” LAs on the corresponding amplifiers. Therefore, all three label probes are bound to the amplifiers of SGC5.
FIG. 4E shows an embodiment of FIG. 4C in more detail. The SGC5 of FIG. 4D is shown bound to its respective target nucleic acid, with the label probes having an “E” binding site bound to the respective “E” LAs of the SGC5 amplifiers (as in FIG. 4D). Also shown are two additional exemplary SGCs bound to their respective target nucleic acids. SGC 1 , coded as shown in FIG. 4D, comprises a plurality of identical LAs, labeled “A.” SGC1 is coded with
one label probe (LI), which has the binding site for the plurality of “A” LAs of the SGC1 amplifiers. Therefore, the label probe “L 1 ” is bound to the amplifiers of SGC 1 , thereby labeling the SGC1 target nucleic acid. SGC3, coded as shown in FIG. 4D, comprises a plurality of identical LAs, labeled “C.” SGC3 is coded with 2 probes corresponding to 2 distinct label probes (LI and L2), both of which have the same binding site for the plurality of “C” LAs on the corresponding SGC3 amplifiers. Therefore, both label probes are bound to the amplifiers of SGC3, thereby labeling the SGC3 target nucleic acid.
FIG. 4F shows an embodiment of FIG. 4C in more detail. FIG. 4F illustrates that, once an SGC for a particular target nucleic acid has been designed, the actual coding for the target nucleic acid can be readily modified simply by changing the labels on the label probes that bind to the amplifiers of a particular SGC. For example, in FIG. 4D, SGC2 comprises amplifiers with “B” LAs and is coded using L2. In FIG. 4F, the same SGC assembly can be used with respect to the target probes, pre-amplifier, and amplifiers with “B” LAs, but instead of using “B” LA-binding label probes with only L2 as in FIG. 4B, “B” LA-binding label probes can be used that have a mixture of L2 and L3 such that SGC2 is now coded with both labels. Thus, labels L2 and L3 are bound to “B” LAs on the SGC2 amplifiers. Similarly, SGC5 comprising amplifiers with “E” LAs is now coded in FIG. 4F by using label probes with “E” LA-binding label probes that have a mixture of labels L2, L3, and L4, instead of labels LI, L3, and L4 as shown in FIG. 4D.
As described herein, in some embodiments, SGCs for different target nucleic acids will have different numbers of labels in the code. In this situation and in the case where the number of LAs on the respective SGCs is the same, if the number of label probes are added to the SGCs, the SGC coded with only one label will have a higher number of bound labels than the number of corresponding probes bound to an SGC coded with four labels; for example an SGC coded with only LI, LI can bind to all of the sites on one of the SGCs, but for an SGC coded with LI, L2, L3, and L4, LI can only bind to % of the sites on the SGC. Accordingly, in some embodiments, it can be desirable to normalize the amount of label bound to different SGCs coded by different numbers of distinct labels.
In an exemplary embodiment as shown in FIG. 4G, two target nucleic acids are shown with two bound SGCs, SGC2 and SGC5. SGC2 is coded with L2 and L3, and SGC5 is coded with L2, L3, and L4. In this case, where all of the label probes bind to the respective LAs, “B” LAs in the case of SGC2 and “E” LAs in the case of SGC5, and assuming that the SGC2 and SGC5 have approximately the same number of LAs in the respective SGCs, the number of respective labels that can bind to SGC2 will be higher than the number of respective labels that
bind to SGC5 (i.e., the 2 distinct labels for SGC2 (L2 and L3) and the 3 distinct labels for SGC5 (L2, L3, and L4) will be bound to the same number of sites, resulting in a higher number of L2 and L3 being bound to SGC2 than SGC5 since some of the SGC5 sites are occupied by L4). If desired, the number of labels (and therefore intensity of signal) can be normalized by including “blank” label probes, i.e., probes having a binding site for the respective LAs (in this case “B” for SGC2 and “E” for SGC5) but without a label. For example, if it is desired to compare SGC2 and SGC5 with equal intensity signals for the respective labels, 1/3 “blank” label probes can be included with the mixture of “B” LA-specific probes so that the intensity of L2 and L3 will be the same on both SGCs (i.e., 1/3 of SGC2 occupied by “blank” label probes and 1/3 of SGC5 occupied by L4). In another example, if a assay is being performed in which some SGCs include 4 labels, then the assay can be performed so that the same proportion of “blank” label probes are included in the label probe sets using less than 4 labels, for example, 1/2 “blank” label probes can be included with the SG2-specific label probes coded by 2 distinct labels, and 1/4 “blank” label probes can be included with the SGC5-specific label probes coded by 3 distinct labels, so that the amount of each distinct label probe (LI, L2, L3, and L4) bound to the respective SGCs is the same on each SGC. Such “blank” label probes can also be used in combination with distinct label probes to provide for a desired proportion of respective labels, such as a desired ratio of label probes on an SGC.
In one embodiment, the plurality of SGCs comprises: (a) a set of pre-amplifiers, wherein the pre-amplifier set comprises a plurality of pre-amplifiers, wherein the preamplifiers comprise binding sites for the pairs of target probes and a plurality of binding sites for an amplifier; (b) a set of amplifiers, wherein the amplifier set comprises a plurality of amplifiers, wherein the amplifiers comprise a binding site for the pre-amplifiers and a plurality of identical binding sites for a label probe; and (c) a set of label probes, wherein the label probe set comprises a label probe or two or more distinct label probes, wherein each label probe comprises a label and a binding site for the amplifiers, wherein the binding site for the amplifier is the same for each label probe, wherein the labels in each distinct label probe are distinguishable between the distinct label probes; wherein the amplifier in each probe subset specific for a target nucleic acid comprises a binding site for a label or a combination of two or more distinct labels that is different for each probe subset (see FIG. 4C).
Similar approaches can be implemented on other components of the SGC and the two embodiments shown in FIG. 4A and FIG. 4B can be used in combination to ensure the predetermined mixture of LPs can be assembled onto the SGC to generate the independent identifier.
In some embodiments, the methods of the disclosure can be modified to reduce miscoding. Miscoding can occur when the signal from a type of LP designed to be present with a particular target is undetectable, or background noise is misinterpreted as a signal from a particular LP. Errant positive signals can be largely eliminated by setting up an appropriate threshold. Signal level from the area surrounding the image dot as well as the global background level can be used to reference the background. Methods for reducing errant undetected signal-type miscoding are described herein.
When the SGC level code implementation method described previously is used, errant “null”-type miscoding can occur when SGCs of a certain LP type are not attached to the target due to limited TP accessibility to the target or target degradation. It is therefore important that each label-specific SGC has many copies in the set designed to bind to the same target so that there are statistically many chances for each needed LPs to be present in the detected signal. This means that the target sequence has to be very long in order to reduce the miscoding rate. For example, assuming four different LPs are used, 20 copies of each LP specific SGCs can be needed to ensure reliable detection and each SGC binds to a TP set occupying 50 nucleotides (50nt), so the total target length in this case has to be 4x20x50=4000 nucleotides. Also, it may be advantageous to intertwine different SGCs along the target, as shown in FIG. 5. If different SGC types are positioned apart in separate groups, as depicted in FIG. 5, lower panel, a certain section of the target may be blocked or masked, thereby preventing attachment of one specific SGC type, which will result in miscoding. Designing the target probes in the SGC of the same type, when more than one type of SGC is used, to bind to the target nucleic acid at sites that are intermingled or intertwined, as shown in FIG. 5, upper panel, reduces the chance of miscoding.
From the point of view of reducing miscoding, there are advantages to implementing the ID code at the sub-SGC level because, once a TP set is successfully hybridized to the target, there are many more chances for different LPs to successfully attach to the SGC without bias. Arranging different labels into alternating positions is still an important strategy to reduce the chance of miscoding. As shown in FIG. 6A, this particular SGC is miscoded; only LI is bound, when the intent was for LI and L4 to be bound. In this case the miscoding occurred because the PA is truncated, which can occur during manufacturing of the PA. The same truncation, however, will not cause miscoding if the labels are intertwined or intermingled on the PA, if the embodiment shown in FIG. 4B is implemented as shown in FIG. 6B. A similar strategy can be used in a configuration with AP level coding, as illustrated in FIG. 4A. An additional method to minimize the potential miscoding caused by truncation is to randomize the position of
different labels on the AP or PA that encodes the target specific code, as illustrated in FIG. 7, in which the multiplexing channel ID is encoded on the AP molecule. In FIG. 7A, different label probes are positioned on each AP in exactly the same way, that is, each amplifier is the same. Truncation of the AP, for example, during manufacturing, may cause substantial reduction in certain label compared to others. This imbalance increases the chance of miscoding. In the most severe case, truncation may cause the loss of all copies of one certain label, leading to an outright miscode. In FIG. 7B, locations of different labels on the AP are purposefully randomized. Truncation therefore does not cause a large bias in the numbers and types of labels in the SGC. The APs are provided as a plurality of amplifiers, where a mix of non-identical amplifiers is included, where the position of LAs for specific label probes are distributed differently and can be randomized on the non-identical amplifiers. Randomization of different labels in the SGCs can be achieved by using one or combinations of the embodiments described herein.
When miscoding occurs because a label that should be present is absent, the target may be mis-identified as another target with fewer labels in its ID code. In most situations, the probability of miscoding is low (e.g., <5%). When the quantities of targets are in a similar level, such miscoding does not significantly impact the results. Miscoding can have a significant impact if one target is present at a significantly higher quantity than the other target that it miscoded into (i.e., one target is miscoded to be misread as another target due to differences in amounts of the two targets). Therefore, one important method to reduce the impact of miscoding is to assign ID codes with fewer labels to higher quantity targets if the relative quantities of the targets are known. For example, in Table la, the independent identifier of gene targets 2-5 are each generated using a single label, and in Table lb, the independent identifier in gene targets 2-6 are each generated using a single label. These “codes” can be reserved for targets with the highest quantities because if any miscoding occurs due to the erroneous absence of labeling, they will not be mis-interpreted into the signals for other targets.
Many error detection schemes used in the digital communication field can be adopted to detect miscoding. For example, a parity check can ensure accurate data transmission during communication. A parity bit is appended to the original data bits to make an even or odd number of total data bits. For example, the signal from one of the N Labels can be used as a “parity check” bit (LI in the example codebook in Table 3). The bit will be made “X” (present) or (absent) to make the total number of labels in the N label system odd (i.e., odd parity check) or even (i.e., even parity check). Depending on the result of the parity check, the detected target may or may not be counted. This parity check scheme can detect single or odd
number of bit errors but cannot detect double or even number bit errors. This can substantially reduce the probability of miscoding. For example, if the chance of single bit error is 5%, the chance of double bit error is theoretically 0.25%. The price for using such a parity check is that the number of multiplexing channels is reduced to 2N 1-1 or 2N 1 if an even or odd parity check scheme is adopted, respectively (e.g., 7 or 8 SGC codes in Table 3 compared to 15 SGC codes in Table la). A code book can be designed to include an odd (or even) number of labels. In the example shown in Table la, for an odd number, the allowed channels from Table la would be those for gene targets 2, 3, 4, 5, 12, 13, 14, and 15. If one of the other channels is detected, then it must be an error.
As described herein, a specific target nucleic acid can be labeled with more than one distinct label. In such cases, a single “dot” will be comprised of two or more distinct labels. A dot comprising more than one label can be deconvoluted to identify the individual labels in the dot using well known methods. Such well known methods include the Richardson-Lucy deconvolution algorithm (see Example I) as described previously (Biggs et al., Applied Optics, Vol. 36, No. 8, (1997); Hanisch et al., “Deconvolutions of Hubble Space Telescope Images and Spectra, Deconvolution of Images and Spectra,” Ed. P.A. Jansson, 2nd ed., Academic Press CA, (1997)). Other methods for deconvolution include, but are not limited to, Wiener deconvolution, regularized filter deconvolution, and the like (Gonzalez et al. , “Digital Image Processing,” Addison-Wesley Publishing Company, Inc. (1992)).
The embodiments described above and depicted in FIG. 3-7 show SGCs with preamplifiers, amplifiers and label probes. It is understood that the same principles can be applied to an SGC where a pre-pre-amplifier component is included in the SGC, as disclosed herein (see, for example, FIGS. 5B, 5C, 6B and 6C for examples of SGCs with a pre-pre-amplifier layer).
In some embodiments, the SGCs that can be used in the disclosed methods comprise: (a) a set of pre-pre-amplifiers, wherein the pre-pre-amplifier set comprises one or more subsets of pre-pre-amplifiers, wherein the one or more pre-pre-amplifier subsets comprise a pre-pre- amplifier specific for each of the target probe pairs in the one or more target probe sets, wherein each pre-pre-amplifier comprises binding sites for the pair of target probes of one of the target probe sets and a plurality of binding sites for a pre-amplifier; (b) a set of pre-amplifiers, wherein the pre-amplifier set comprises one or more subsets of pre-amplifiers, wherein the one or more pre-amplifier subsets comprise a pre-amplifier specific for the pre-pre-amplifiers in the one or more pre-pre-amplifier subsets, wherein each pre-amplifier comprises binding sites for the pre- pre-amplifiers of one of the pre-pre-amplifier subsets and a plurality of binding sites for an amplifier; (d) a set of amplifiers, wherein the amplifier set comprises one or more subsets of amplifiers specific for each pre-amplifier subset, wherein each amplifier subset comprises a plurality of amplifiers, wherein the amplifiers of one of the amplifier subsets comprise a binding site for the pre-amplifiers of one of the pre-amplifier subsets and a plurality of binding sites for a label probe; and (d) a set of label probes, wherein the label probe set comprises one or more subsets of label probes, wherein each label probe subset is specific for one of the amplifier subsets, wherein each label probe subset comprises a plurality of label probes, wherein the label probes in each of the label probe subsets comprise a label and a binding site for the amplifiers of one of the amplifier subsets, wherein the labels in each label probe subset are distinguishable between the label probe subsets; wherein the one or more label probe subsets in each probe subset specific for a target nucleic acid comprise at least one label or a combination of labels that is different for each probe subset. This embodiment is similar to FIG. 3 except that a pre-pre-amplifier is included in the SGC.
In one embodiment, the set of target probes, pre-amplifiers, amplifiers and label probes each comprise two or more subsets. In another embodiment of such a method, the set of target probes, pre-amplifiers, amplifiers and label probes each comprise three or more subsets. In another embodiment, the set of target probes, pre-amplifiers, amplifiers and label probes each comprise four or more subsets.
In one embodiment, the target probe binding sites for the two or more subsets are intermingled on the target nucleic acid (see FIG. 5, top panel, except a pre-pre-amplifier is included in the SGC).
In some embodiments, the SGCs that can be used in the disclosed methods comprise: (a) a set of pre-pre-amplifiers, wherein the pre-pre-amplifier set comprises a plurality of pre- pre-amplifiers, wherein the pre-pre-amplifiers comprise binding sites for the pairs of target probes and a plurality of binding sites for a pre-amplifier; (b) a set of pre-amplifiers, wherein the pre-amplifier set comprises a plurality of pre-amplifiers, wherein the pre-amplifiers comprise binding sites for the pre-pre-amplifiers and a plurality of binding sites for an amplifier; (c) a set of amplifiers, wherein the amplifier set comprises a plurality of amplifiers, wherein the amplifiers comprise a binding site for the pre-amplifiers and a plurality of binding sites for a label probe or two or more distinct label probes; and d) a set of label probes, wherein the label probe set comprises a label probe or two or more distinct label probes, wherein each label probe comprises a label and a binding site for the amplifiers, wherein the labels in each distinct label probe are distinguishable between the distinct label probes; wherein the amplifier in each probe subset specific for a target nucleic acid comprises a binding site for a label or a combination of two or more distinct labels that is different for each probe subset. This embodiment is similar to FIG. 4A except that a pre-pre-amplifier is included in the SGC.
In one embodiment, the label probe set comprises two or more distinct label probes, wherein the amplifier set comprises a plurality of non-identical amplifiers, and wherein the binding sites for the two or more distinct label probes on each non-identical amplifier are in a different order on each non-identical amplifier (similar to FIG. 7B except with a pre-pre- amplifier in the SGC).
In some embodiments, the SGCs that can be used in the disclosed methods comprise: (a) a set of pre-pre-amplifiers, wherein the pre-pre-amplifier set comprises a plurality of pre- pre-amplifiers, wherein each pre-pre-amplifier comprises binding sites for the pairs of target probes and a plurality of binding sites for a pre-amplifier; (b) a set of pre-amplifiers, wherein the pre-amplifier set comprises a plurality of pre-amplifiers, wherein the pre-amplifiers comprise binding sites for the pre-pre-amplifiers and a plurality of binding sites for amplifiers; (c) a set of amplifiers, wherein the amplifier set comprises a plurality of amplifiers, wherein the plurality of amplifiers comprise an amplifier comprising a binding site for the preamplifiers and a plurality of binding sites for a label probe, or wherein the plurality of amplifiers comprise two or more distinct amplifiers, wherein each distinct amplifier comprises a binding site for the pre-amplifiers and a plurality of binding sites for a distinct label probe;
and (d) a set of label probes, wherein the label probe set comprises a label probe or two or more distinct label probes, wherein the label probe comprises a label and a binding site for the amplifier, or wherein the two or more distinct label probes comprise a label and a binding site for the two or more distinct amplifiers, wherein the labels on each distinct label probe are distinguishable between the distinct label probes; wherein the pre-amplifier in each probe subset specific for a target nucleic acid comprises a plurality of binding sites for the amplifier comprising a binding site for the label probe or a plurality of binding sites for the two or more distinct amplifiers comprising binding sites for the two or more distinct label probes, and wherein the label of the label probe or combination of two or more distinct labels of the two or more distinct label probes is different for each probe subset. This embodiment is similar to FIG. 4B except that a pre-pre-amplifier is included in the SGC.
In one embodiment, the plurality of amplifiers comprise two or more distinct amplifiers, and wherein the binding sites on the pre-amplifier for the distinct amplifiers are intermingled (similar to FIG. 6B except with a pre-pre-amplifier layer in the SGC).
In some embodiments, the SGCs that can be used in the disclosed methods comprise: (a) a set of pre-pre-amplifiers, wherein the pre-pre-amplifier set comprises a plurality of pre- pre-amplifiers, wherein the pre-pre-amplifiers comprise binding sites for the pairs of target probes and a plurality of binding sites for a pre-amplifier or for two or more distinct preamplifiers; (b) a set of pre-amplifiers, wherein the pre-amplifier set comprises a plurality of pre-amplifiers, wherein the plurality of pre-amplifiers comprise a pre-amplifier comprising a binding site for the pre-pre-amplifiers and a plurality of binding sites for an amplifier, or wherein the plurality of pre-amplifiers comprise two or more distinct pre-amplifiers, wherein each distinct pre-amplifier comprises a binding site for the pre-pre-amplifiers and a plurality of binding sites for a distinct amplifier; (c) a set of amplifiers, wherein the amplifier set comprises a plurality of amplifiers, wherein the plurality of amplifiers comprise an amplifier comprising a binding site for the pre-amplifiers and a plurality of binding sites for a label probe, or wherein the plurality of amplifiers comprise two or more distinct amplifiers, wherein each distinct amplifier comprises a binding site for one of the distinct pre-amplifiers and a plurality of binding sites for a distinct label probe; and (d) a set of label probes, wherein the label probe set comprises a label probe or two or more distinct label probes, wherein the label probe comprises a label and a binding site for the amplifier, or wherein the two or more distinct label probes comprise a label and a binding site for the two or more distinct amplifiers, wherein the labels on each distinct label probe are distinguishable between the distinct label probes; wherein the pre-pre-amplifier in each probe subset specific for a target nucleic acid comprises
a plurality of binding sites for the pre-amplifier comprising a plurality of binding sites for the amplifier comprising a binding site for the label probe, or a plurality of binding sites for the two or more distinct pre-amplifiers each comprising a plurality of binding sites for one of the two or more distinct amplifiers comprising binding sites for one of the two or more distinct label probes, and wherein the label of the label probe or combination of two or more distinct labels of the two or more distinct label probes is different for each probe subset. This embodiment is similar to FIG. 4B, except that the combinatorial labeling is implemented at the level of one or more distinct pre-amplifiers binding to the pre-pre-amplifier, rather than at the level of one or more distinct amplifiers binding to the pre-amplifier, as shown in FIG. 4B.
In one embodiment, the plurality of pre-amplifiers comprise two or more distinct preamplifiers, and wherein the binding sites on the pre-pre-amplifier for the distinct pre-amplifiers are intermingled. This embodiment is similar to FIG. 6B except that the combinatorial labeling is implemented at the level of one or more distinct pre-amplifiers binding to the pre-pre- amplifier, rather than at the level of one or more distinct amplifiers binding to the pre-amplifier, as shown in FIG. 6B.
In some embodiments, the SGCs that can be used in the disclosed methods comprise: (a) a set of pre-pre-amplifiers, wherein the pre-pre-amplifier set comprises a plurality of pre- pre-amplifiers, wherein the pre-pre-amplifiers comprise binding sites for the pairs of target probes and a plurality of binding sites for a pre-amplifier; (b) a set of pre-amplifiers, wherein the pre-amplifier set comprises a plurality of pre-amplifiers, wherein the pre-amplifiers comprise binding sites for the pairs of target probes and a plurality of binding sites for an amplifier; (c) a set of amplifiers, wherein the amplifier set comprises a plurality of amplifiers, wherein the amplifiers comprise a binding site for the pre-amplifiers and a plurality of identical binding sites for a label probe; and (d) a set of label probes, wherein the label probe set comprises a label probe or two or more distinct label probes, wherein each label probe comprises a label and a binding site for the amplifiers, wherein the binding site for the amplifier is the same for each label probe, wherein the labels in each distinct label probe are distinguishable between the distinct label probes; wherein the amplifier in each probe subset specific for a target nucleic acid comprises a binding site for a label probe or a combination of two or more distinct label probes that is different for each probe subset. This embodiment is similar to FIG. 4C except that a pre-pre-amplifier is included in the SGC.
As disclosed herein, the components of the SGC are generally bound directly to each other. In the case of nucleic acid containing components, the binding reaction is generally by hybridization. In the case of a hybridization reaction, the binding between the components is
direct. If desired, an intermediary component can be included such that the binding of one component to another is indirect, for example, the intermediary component contains complementary binding sites to bridge two other components.
In general, if a binding reaction is desired to be stable, the segments of complementary nucleic acid sequence between the components is generally in the range of 10 to 50 nucleotides, or greater, for example, 16 to 30 nucleotides, such as 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 nucleotides, or greater. If a binding reaction is desired to be relatively unstable, such as when a collaborative hybridization binding reaction is employed, the segments of complementary nucleic acid sequence between the components is generally in the range of 5 to 18 nucleotides, for example, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17 or 18 nucleotides. The nucleotide lengths can be somewhat shorter or longer for a stable or unstable hybridization, depending on the sequence (for example, GC content) and the conditions employed in the assay. As disclosed herein, modified nucleotides such as Locked Nucleic Acid (LNA) or Bridged Nucleic Acid (BNA) can be used to increase the binding strength at the modified base, thereby allowing length of the binding segment to be reduced. Thus, with respect to the length of nucleic acid segments that are complementary to other nucleic acid segments, the lengths described herein can be reduced further, if desired. A person skilled in the art can readily determine appropriate probe designs, including length, the presence of modified nucleotides, and the like, to achieve a desired interaction between nucleic acid components.
In designing binding sites between two nucleic acid sequences comprising complementary sequences, the complementary sequences can optionally be designed to maximize the difference in melting temperature (dTm). This can be done by using melting temperature calculation algorithms known in the art (see, for example, SantaLucia, Proc. Natl. Acad. Sci. U.S.A. 95: 1460-1465 (1998)). In addition, artificial modified bases such as Locked Nucleic Acid (LNA) or bridged nucleic acid (BNA) and naturally occurring 2'-O-methyl RNA are known to enhance the binding strength between complementary pairs (Petersen and Wengel, Trends Biotechnol. 21:74-81 (2003); Majlessi et al., Nucl. Acids Res. 26:2224-2229 (1998)). These modified bases can be strategically introduced into the binding site between components of an SGC, as desired.
One approach is to utilize modified nucleotides (LNA, BNA or 2'-O-methyl RNA). Because each modified base can increase the melting temperature, the length of binding regions between two nucleic acid sequences (i.e., complementary sequences) can be substantially
shortened. The binding strength of a modified base to its complement is stronger, and the difference in melting temperatures (dTm) is increased. Yet another embodiment is to use three modified bases (for example, three LNA, BNA or 2'-O-methyl RNA bases, or a combination of two or three different modified bases) in the complementary sequences of a nucleic acid component or between two nucleic acid components, for example of a signal generating complex (SGC), that are to be hybridized. Such components can be, for example, a pre-pre- amplifier, a pre-amplifier, an amplifier, a label probe, or a pair of target probes.
The modified bases, such as LNA or BNA, can be used in the segments of selected components of SGC, in particular those mediating binding between nucleic acid components, which increases the binding strength of the base to its complementary base, allowing a reduction in the length of the complementary segments (see, for example, Petersen and Wengel, Trends Biotechnol. 21:74-81 (2003); US Patent No. 7,399,845). Artificial bases that expand the natural 4-letter alphabet such as the Artificially Expanded Genetic Information System (AEGIS; Yang et al., Nucl. Acids Res. 34 (21): 6095-6101 (2006)) can be incorporated into the binding sites among the interacting components of the SGC. These artificial bases can increase the specificity of the interacting components, which in turn can allow lower stringency hybridization reactions to yield a higher signal.
In the SGCs, each label probe (LP) comprises a segment that is detectable. As used herein, a “label” is a moiety that facilitates detection of a molecule. Common labels include fluorescent, luminescent, light-scattering, and/or colorimetric labels. Suitable labels include enzymes, and fluorescent and chromogenic moieties, as well as radionuclides, substrates, cofactors, inhibitors, chemiluminescent moieties, magnetic particles, rare earth metals, metal isotopes, and the like. In a particular embodiment, the label is an enzyme. Exemplary enzyme labels include, but are not limited to horseradish peroxidase (HRP), alkaline phosphatase (AP), P-galactosidase, glucose oxidase, and the like, as well as various proteases. Other labels include, but are not limited to, fluorophores, dinitrophenyl (DNP), and the like. Labels are known to those skilled in the art, as described, for example, in Hermanson, Bioconjugate Techniques, Academic Press, San Diego (1996), and U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; and 4,366,241. Many labels are commercially available and can be used in methods and assays of the disclosure, including detectable enzyme/substrate combinations (Pierce, Rockford IL; Santa Cruz Biotechnology, Dallas TX; Life Technologies, Carlsbad CA). In a particular embodiment of the disclosure, the enzyme can utilize a chromogenic or Anorogenic substrate to produce a detectable signal, as described herein. Exemplary labels are described herein.
Any of a number of enzymes or non-enzyme labels can be utilized so long as the enzymatic activity or non-enzyme label, respectively, can be detected. The enzyme thereby produces a detectable signal, which can be utilized to detect a target nucleic acid. Particularly useful detectable signals are chromogenic or Anorogenic signals. Accordingly, particularly useful enzymes for use as a label include those for which a chromogenic or Anorogenic substrate is available. Such chromogenic or Anorogenic substrates can be converted by enzymatic reaction to a readily detectable chromogenic or Auorescent product, which can be readily detected and/or quantified using microscopy or spectroscopy. Such enzymes are known to those skilled in the art, including but not limited to, horseradish peroxidase, alkaline phosphatase, -galactosidase, glucose oxidase, and the like (see Hermanson, Bioconjugate Techniques, Academic Press, San Diego (1996)). Other enzymes that have known chromogenic or Auorogenic substrates include various peptidases, where chromogenic or Auorogenic peptide substrates can be utilized to detect proteolytic cleavage reactions. The use of chromogenic and Auorogenic substrates is also known in bacterial diagnostics, including but not limited to the use of a- and -galactosidase, P-glucuronidase, 6-phospho-P-D-galactoside 6-phosphogalactohydrolase, P-glucosidase, a-glucosidase, amylase, neuraminidase, esterases, lipases, and the like (ManaA eta/., Microbiol. Rev. 55:335-348 (1991)), and such enzymes with known chromogenic or Auorogenic substrates can readily be adapted for use in methods provided herein.
Various chromogenic or Auorogenic substrates to produce detectable signals are known to those skilled in the art and are commercially available. Exemplary substrates that can be utilized to produce a detectable signal include, but are not limited to, 3, 3 '-diaminobenzidine (DAB), 3, 3 ’,5, 5 ’-tetramethylbenzidine (TMB), chloronaphthol (4-CN)(4-chloro-l -naphthol), 2,2'-azino-bis(3-ethylbenzothiazoline-6-sulphonic acid) (ABTS), o-phenylenediamine dihydrochloride (OPD), and 3-amino-9-ethylcarbazole (AEC) for horseradish peroxidase; 5- bromo-4-chloro-3-indolyl-l -phosphate (BCIP), nitroblue tetrazolium (NBT), Fast Red (Fast Red TR/AS-MX), and p-nitrophenyl phosphate (PNPP) for alkaline phosphatase; l-methyl-3- indolyl-P-D-galactopyranoside and 2-methoxy-4-(2-nitrovinyl)phenyl P-D-galactopyranoside for P-galactosidase; 2-methoxy-4-(2-nitrovinyl)phenyl P-D-glucopyranoside for P- glucosidase; and the like. Exemplary Auorogenic substrates include, but are not limited to, 4- (triAuoromethyl)umbelliferyl phosphate for alkaline phosphatase; 4-methylumbelliferyl phosphate bis (2-amino- 2-methyl- 1,3 -propanediol), 4-methylumbelliferyl phosphate bis (cyclohexylammonium) and 4-methylumbelliferyl phosphate for phosphatases; QuantaBlu™ and Quintolet for horseradish peroxidase; 4-methylumbelliferyl P-D-galactopyranoside,
fluorescein di(P-D-galactopyranoside) and naphthofluorescein di-(P-D-galactopyranoside) for P-galactosidase; 3-acetylumbelliferyl P-D-glucopyranoside and 4-methylumbelliferyl-P- D- glucopyranoside for P-glucosidase; and 4-methylumbelliferyl-a-D-galactopyranoside for a- galactosidase. Exemplary enzymes and substrates for producing a detectable signal are also described, for example, in US publication 2012/0100540. Various detectable enzyme substrates, including chromogenic or Anorogenic substrates, are known and commercially available (Pierce, Rockford IL; Santa Cruz Biotechnology, Dallas TX; Invitrogen, Carlsbad CA; 42 Life Science; Biocare). Generally, the substrates are converted to products that form precipitates that are deposited at the site of the target nucleic acid. Other exemplary substrates include, but are not limited to, HRP-Green (42 Life Science), Betazoid DAB, Cardassian DAB, Romulin AEC, Bajoran Purple, Vina Green, Deep Space Black™, Warp Red™, Vulcan Past Red and Ferangi Blue from Biocare (Concord CA; biocare.net/products/detection/chromogens).
Exemplary rare earth metals and metal isotopes suitable as a detectable label include, but are not limited to, lanthanide (III) isotopes such as 141Pr, 142Nd, 143Nd, 144Nd, 145Nd, 146Nd, 147Sm, 148Nd, 149Sm, 150Nd, 151Eu, 152Sm, 153Eu, 154Sm, 155Gd, 156Gd, 158Gd, 159Tb, 160Gd, 161Dy, 162Dy, 163Dy, 164Dy, 165Ho, 166Er, 167Er, 168Er, 169Tm, 170Er, 171Yb, 172Yb, 173Yb, 174Yb, 175Lu, and 176Yb. Metal isotopes can be detected, for example, using time-of-flight mass spectrometry (TOF-MS) (for example, Fluidigm Helios and Hyperion systems, fluidigm.com/systems; South San Francisco, CA).
Biotin-avidin (or biotin-streptavidin) is a well-known signal amplification system based on the fact that the two molecules have extraordinarily high affinity to each other, and that one avidin/streptavidin molecule can bind four biotin molecules. Antibodies are widely used for signal amplification in immunohistochemistry and ISH. Tyramide signal amplification (TSA) is based on the deposition of a large number of haptenized tyramide molecules by peroxidase activity. Tyramine is a phenolic compound. In the presence of small amounts of hydrogen peroxide, immobilized horseradish peroxidase (HRP) converts the labeled substrate into a short-lived, extremely reactive intermediate. The activated substrate molecules then very rapidly react with and covalently bind to electron-rich moieties of proteins, such as tyrosine, at or near the site of the peroxidase binding site. In this way, many hapten molecules conjugated to tyramide can be introduced at the hybridization site in situ. Subsequently, the deposited tyramide-hapten molecules can be visualized directly or indirectly. Such a detection system is described in more detail, for example, in U.S. Patent No. 8,658,361.
Embodiments described herein can utilize enzymes to generate a detectable signal using appropriate chromogenic or fluorogenic substrates. It is understood that, alternatively, a label probe can have a detectable label directly coupled to the nucleic acid portion of the label probe. Exemplary detectable labels are known to those skilled in the art, including but not limited to chromogenic or fluorescent labels (see Hermanson, Bioconjugate Techniques, Academic Press, San Diego (1996)). Exemplary fluorophores useful as labels include, but are not limited to, rhodamine derivatives, for example, tetramethylrhodamine, rhodamine B, rhodamine 6G, sulforhodamine B, Texas Red (sulforhodamine 101), rhodamine 110, and derivatives thereof such as tetramethylrhodamine-5-(or 6), lissamine rhodamine B, and the like; 7-nitrobenz-2- oxa- 1,3 -diazole (NBD); fluorescein and derivatives thereof; naphthalenes such as dansyl (5- dimethylaminonapthalene-1 -sulfonyl); coumarin derivatives such as 7-amino-4- methylcoumarin-3 -acetic acid (AMCA), 7-diethylamino-3-[(4'-(iodoacetyl)amino)phenyl]-4- methylcoumarin (DCIA); Alexa Fluor dyes (Molecular Probes), and the like; 4,4-difluoro-4- bora-3a,4a-diaza-s-indacene (BODIPY™) and derivatives thereof (Molecular Probes); pyrenes and sulfonated pyrenes such as Cascade Blue™ and derivatives thereof, including 8- methoxypyrene-l,3,6-trisulfonic acid, and the like; pyridyloxazole derivatives and dapoxyl derivatives (Molecular Probes); Lucifer Yellow (3,6-disulfonate-4-amino-naphthalimide) and derivatives thereof; CyDye™ fluorescent dyes (Amersham/GE Healthcare Life Sciences); ATTO dyes (ATTO-TEC GmbH); and the like. Particular dyes include ATTO 390, DyLight 395XL, Alexa Fluor 405, ATTO 425, Alexa Fluor 430, DyLight430, ATTO 465, ATTO 488, Alexa Fluor 488, ATTO 490LS, ATTO 495, ATTO 514, ATTO 520, ATTO 532, Alexa Fluor 532, ATTO Rho6G, ATTO 542, Alexa Fluor 546, ATTO 550, DyLight 550, Alexa Fluor 555, ATTO 565, Alexa Fluor 568, ATTO Rho3B, ATTO Rhol l, ATTO Rhol2, ATTO Thiol2, ATTO RholOl, ATTO 590, ATTO 594, Alexa Fluor 594, Dylight 594, ATTO Rhol3, ATTO 610, Alexa Fluor 610, ATTO 620, ATTO Rhol4, ATTO 633, Alexa Fluor 633, Alexa Fluor 635, ATTO 643, ATTO 647, Alexa Fluor 647, ATTO 647N, DyLight650, ATTO 655, ATTO Oxal2, Alexa Fluor 660, ATTO 665, ATTO 680, Alexa Fluor 680, ATTO 700, DyLight700, AlexaFluor700, ATTO 725, ATTO 740, Alexa Fluor 750, DyLight 755, Cyan 500 NHS-Ester, and the like. Exemplary chromophores include, but are not limited to, phenolphthalein, malachite green, nitroaromatics such as nitrophenyl, diazo dyes, dabsyl (4- dimethylaminoazobenzene-4'-sulfonyl), and the like.
As disclosed herein, the methods provided herein can be used for concurrent detection of multiple target nucleic acids. In the case of using fluorophores as labels, the fluorophores to be used for detection of multiple target nucleic acids are selected so that, within each round of
labeling and detection, each of the fluorophores is distinguishable and can be detected concurrently. Such fluorophores are selected to have spectral separation of the emissions so that distinct labeling of the target nucleic acids can be detected concurrently. Methods of selecting suitable distinguishable fluorophores for use in methods of the disclosure are known in the art (see, for example, Johnson and Spence, “Molecular Probes Handbook, a Guide to Fluorescent Probes and Labeling Technologies," 11th ed., Life Technologies (2010)). In some embodiments, 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 distinguishable fluorophores can be used. For example, selection of one to six fluorophores with excitation maxima at or near 488 nm, 550 nm, 594 nm, 650 nm, 700 nm, and 750 nm may be suitable in combination. Particular examples of fluorophores with excitation maxima at or near 488 nm include, e.g., Alexa Fluor 488, DyLight 488, ATTO 488, and Cy2. Particular examples of fluorophores with excitation maxima at or near 550 nm include, e.g., DyLight 550, Alexa Fluor 546, ATTO 550, Cy3, and rhodamine. Particular examples of fluorophores with excitation maxima at or near 594 nm include, e.g., Dylight 594, Alexa Fluor 594, ATTO 594, and Texas Red. Particular examples of fluorophores with excitation maxima at or near 650 nm include, e.g., DyLight 650, ATTO 647, Alexa Fluor 647, ATTO 647N, and ATTO 655. Particular examples of fluorophores with excitation maxima at or near 700 nm include, e.g., ATTO 700, DyLight700, and AlexaFluor700. Particular examples of fluorophores with excitation maxima at or near 750 nm include, e.g., Alexa Fluor 750, Cy7, and DyLight 755, and ATTO 740.
Methods such as microscopy, cytometry e.g., mass cytometry, cytometry by time of flight (CyTOF), flow cytometry), or spectroscopy can be used to visualize chromogenic, fluorescent, or metal detectable signals associated with the respective target nucleic acids. In general, either chromogenic substrates or Anorogenic substrates, or chromogenic or Auorescent labels, or rare earth metal isotopes, will be utilized for a particular assay, if different labels are used in the same assay, so that a single type of instrument can be used for detection of nucleic acid targets in the same sample.
As disclosed herein, the label can be designed such that the labels are optionally cleavable. As used herein, a “cleavable label” refers to a label that is attached or conjugated to a label probe so that the label can be removed, for example, in order to use the same label in a subsequent round of labeling and detecting of target nucleic acids. Generally, the cleavable labels are conjugated to the label probe by a chemical linker that is cleavable. Methods of conjugating a label to a label probe so that the label is cleavable are known to those skilled in the art (see, for example, Hermanson, Bioconjugate Techniques, Academic Press, San Diego (1996); Daniel et al., BioTechniques 24(3):484-489 (1998)). One particular system of labeling
oligonucleotides is the FastTag™ system (Daniel et al., supra, 1998; Vector Laboratories, Burlinghame CA). Various cleavable moieties can be included in the linker so that the label can be cleaved from the label probe. Such cleavable moieties include groups that can be chemically, photochemically or enzymatically cleaved. Cleavable chemical linkers can include a cleavable chemical moiety, such as disulfides, which can be cleaved by reduction, glycols or diols, which can be cleaved by periodate, diazo bonds, which can be cleaved by dithionite, esters, which can be cleaved by hydroxylamine, sulfones, which can be cleaved by base, and the like (see Hermanson, supra, 1996). One particularly useful cleavable linker is a linker containing a disulfide bond, which can be cleaved by reducing the disulfide bond. In other embodiments, the linker can include a site for cleavage by an enzyme. For example, the linker can contain a proteolytic cleavage site. Generally, such a cleavage site is for a sequence-specific protease. Such proteases include, but are not limited to, human rhinovirus 3C protease (cleavage site LEVLFQ/GP), enterokinase (cleavage site DDDDK/), factor Xa (cleavage site IEGR/), tobacco etch virus protease (cleavage site ENLYFQ/G), and thrombin (cleavage site LVPR/GS) (see, for example, Oxford Genetics, Oxford, UK). Another cleavable moiety can be, for example, uracil-DNA (DNA containing uracil), which can be cleaved by uracil-DNA glycosylase (UNG) (see, for example, Sidorenko et al., FEBS Lett. 582(3 ):410—404 (2008)).
The cleavable labels can be removed by applying an agent, such as a chemical agent or light, to cleave the label and release it from the label probe. As discussed above, useful cleaving agents for chemical cleavage include, but are not limited to, reducing agents, periodate, dithionite, hydroxylamine, base, and the like (see Hermanson, supra, 1996). One useful method for cleaving a linker containing a disulfide bond is the use of tris(2-carboxyethyl)phosphine (TCEP) (see Moffitt et al., Proc. Natl. Acad. Sci. USA 113: 11046-11051 (2016)). In one embodiment, TCEP is used as an agent to cleave a label from a label probe. c. Removal of Signals
Methods disclosed herein include a step of removing signals generated by the detectable labels from the sample. In some embodiments, the step of removing signals generated by the detectable labels comprises removing the labels from the label probes, e.g., by cleaving cleavable labels as discussed above. In other embodiments, the step of removing the signals generated by the detectable labels comprises removing the SGCs from the sample, e.g., by disrupting binding of the SGCs bound to the target nucleic acid. In some embodiments, the SGCs are removed from the sample by contacting the sample with an acid reagent that disrupts hybridization between the SGCs and the target nucleic acid.
The acid reagent comprises an acid. Exemplary acids suitable for use in an acid reagent include, but are not limited to, formic acid, acetic acid, propionic acid, butyric acid, valeric acid, oxalic acid, malonic acid, succinic acid, malic acid, tartaric acid, citric acid, and the like. The acid reagent generally includes an acid at a concentration of about 5-40% acid (vol/vol). In some embodiments, the acid reagent comprises an acid at a concentration of about 20-30% (vol/vol). For example, in some embodiments, the acid reagent comprises an acid at a concentration of about 5%, about 6%, about 7%, about 8%, about 9%, about 10%, about 11%, about 12%, about 13%, about 14%, about 15%, about 16%, about 17%, about 18%, about 19%, about 20%, about 21%, about 22,%, about 23%, about 24%, about 25%, about 26%, about 27%, about 28%, about 29%, about 30%, about 31%, about 32%, about 33%, about 34%, about 35%, about 36%, about 37%, about 38%, about 39%, or about 40% acid (vol/vol). In a particular embodiment, the acid reagent comprises acetic acid at a concentration of about 5- 40%, or about 20-30%, e.g., about 5%, about 6%, about 7%, about 8%, about 9%, about 10%, about 11%, about 12%, about 13%, about 14%, about 15%, about 16%, about 17%, about 18%, about 19%, about 20%, about 21%, about 22,%, about 23%, about 24%, about 25%, about 26%, about 27%, about 28%, about 29%, about 30%, about 31%, about 32%, about 33%, about 34%, about 35%, about 36%, about 37%, about 38%, about 39% or 40% acetic acid (% vol/vol), or any concentration therebetween.
In some embodiments, the acid reagent further comprises one or more components selected from salts, chelating agents, buffers, and any combination thereof. In one embodiment, the acid reagent further comprises saline sodium citrate (SSC), where 20X SSC corresponds to 3.0 M NaCl and 0.3 M sodium citrate, at pH 7.0 (see Sambrook et al., Molecular Cloning: A Laboratory Manual, Third Ed., Cold Spring Harbor Laboratory, New York (2001); Ausubel et al., Current Protocols in Molecular Biology, John Wiley and Sons, Baltimore, MD (1999)). In one embodiment, the acid reagent further comprises IX SSC to 13X SSC. For example, in some embodiments, the acid reagent further comprises SSC at IX, 1.1X, 1.2X, 1.3X, 1.4X, 1.5X, 1.6X, 1.7X, 1.8X, 1.9X, 2X, 2.1X, 2.2X, 2.3X, 2.4X, 2.5X, 2.6X, 2.7X, 2.8X, 2.9X, 3X, 3. IX, 3.2X, 3.3X, 3.4X, 3.5X, 3.6X, 3.7X, 3.8X, 3.9X, 4X, 4. IX, 4.2X, 4.3X, 4.4X, 4.5X, 4.6X, 4.7X, 4.8X, 4.9X, 5X, 5. IX, 5.2X, 5.3X, 5.4X, 5.5X, 5.6X, 5.7X, 5.8X, 5.9X, 6X, 6. IX, 6.2X, 6.3X, 6.4X, 6.5X, 6.6X, 6.7X, 6.8X, 6.9X, 7X, 7. IX, 7.2X, 7.3X, 7.4X, 7.5X, 7.6X, 7.7X, 7.8X, 7.9X, 8X, 8. IX, 8.2X, 8.3X, 8.4X, 8.5X, 8.6X, 8.7X, 8.8X, 8.9X, 9X, 9. IX, 9.2X, 9.3X, 9.4X, 9.5X, 9.6X, 9.7X, 9.8X, 9.9X, 10X, 10. IX, 10.2X, 10.3X, 10.4X, 10.5X, 10.6X, 10.7X, 10.8X, 10.9X, 11X, 11. IX, 11.2X, 11.3X, 11.4X, 11.5X, 11.6X, 11.7X, 11.8X, 11.9X,
12X, 12. IX, 12.2X, 12.3X, 12.4X, 12.5X, 12.6X, 12.7X, 12.8X, 12.9X, or 13X, or any concentration therebetween.
In some embodiments, the acid reagent further comprises saline sodium phosphate EDTA (SSPE), which is a mixture of sodium chloride, sodium phosphate, and ethylenediaminetetraacetic acid (EDTA), where 20X SSPE corresponds to 3.0 M sodium chloride, 0.2 M sodium hydrogen phosphate (NaEEPC ), and 0.02 M ethylenediaminetetraacetic acid (EDTA), pH 7.4 (see Sambrook et al., Molecular Cloning: A Laboratory Manual, Third Ed., Cold Spring Harbor Laboratory, New York (2001)). In some embodiments, the acid reagent further comprises IX SSPE to 13X SSPE. For example, the acid reagent comprises SSPE at IX, 1.1X, 1.2X, 1.3X, 1.4X, 1.5X, 1.6X, 1.7X, 1.8X, 1.9X, 2X, 2. IX, 2.2X, 2.3X, 2.4X, 2.5X, 2.6X, 2.7X, 2.8X, 2.9X, 3X, 3. IX, 3.2X, 3.3X, 3.4X, 3.5X, 3.6X, 3.7X, 3.8X, 3.9X, 4X, 4. IX, 4.2X, 4.3X, 4.4X, 4.5X, 4.6X, 4.7X, 4.8X, 4.9X, 5X, 5. IX, 5.2X, 5.3X, 5.4X, 5.5X, 5.6X, 5.7X, 5.8X, 5.9X, 6X, 6. IX, 6.2X, 6.3X, 6.4X, 6.5X, 6.6X, 6.7X, 6.8X, 6.9X, 7X, 7. IX, 7.2X, 7.3X, 7.4X, 7.5X, 7.6X, 7.7X, 7.8X, 7.9X, 8X, 8. IX, 8.2X, 8.3X, 8.4X, 8.5X, 8.6X, 8.7X, 8.8X, 8.9X, 9X, 9. IX, 9.2X, 9.3X, 9.4X, 9.5X, 9.6X, 9.7X, 9.8X, 9.9X, 10X, 10. IX, 10.2X, 10.3X, 10.4X, 10.5X, 10.6X, 10.7X, 10.8X, 10.9X, 11X, 11. IX, 11.2X, 11.3X, 11.4X, 11.5X, 11.6X, 11.7X, 11.8X, 11.9X, 12X, 12.1X, 12.2X, 12.3X, 12.4X, 12.5X, 12.6X, 12.7X, 12.8X, 12.9X, or 13X, or any concentration therebetween.
In some embodiments, the acid reagent further comprises a phosphate buffer, e.g., 10- 500 mM sodium phosphate, with a pH in the range of about 7 to about 8 (e.g., a pH of about 7.8). For example, in some embodiments, the acid reagent further comprises sodium phosphate at about 10 mM, about 20 mM, about 30 mM, about 40 mM, about 50 mM, about 60 mM, about 70 mM, about 80 mM, about 90 mM, about 100 mM, about 110 mM, about 120 mM, about 130 mM, about 140 mM, about 150 mM, about 160 mM, about 170 mM, about 180 mM, about 190 mM, about 200 mM, about 210 mM, about 220 mM, about 230 mM, about 240 mM, about 250 mM, about 260 mM, about 270 mM, about 280 mM, about 290 mM, about 300 mM, about 310 mM, about 320 mM, about 330 mM, about 340 mM, about 350 mM, about 360 mM, about 370 mM, about 380 mM, about 390 mM, about 400 mM, about 410 mM, about 420 mM, about 430 mM, about 440 mM, about 450 mM, about 460 mM, about 470 mM, about 480 mM, about 490 mM, or about 500 mM, or any concentration therebetween.
In another embodiment, the acid reagent further comprises sodium chloride (NaCl), e.g., at a concentration of about 10 mM to about 6 M. For example, in some embodiments, the acid reagent comprises sodium chloride at a concentration of about 10 mM, about 50 mM, about 100 mM, about 150 mM, about 200 mM, about 250 mM, about 300 mM, about 350 mM,
about 400 mM, about 450 mM, about 500 mM, about 550 mM, about 600 mM, about 650 mM, about 700 mM, about 750 mM, about 800 mM, about 850 mM, about 900 mM, about 950 mM, about 1 M, about 1.1 M, about 1.2 M, about 1.3 M 1.4 M, about 1.5 M, about 1.6 M, about 1.7 M, about 1.8 M, about 1.9 M, about 2 M, about 2.1 M, about 2.2 M, about 2.3 M, about 2.4 M, about 2.5 M, about 2.6 M, about 2.7 M, about 2.8 M, about 2.9 M, about 3 M, about 3.1 M, about 3.2 M, about 3.3 M, about 3.4 M, about 3.5 M, about 3.6 M, about 3.7 M, about 3.8 M, about 3.9 M, about 4 M, about 4.1 M, about 4.2 M, about 4.3 M, about 4.4 M, about 4.5 M, about 4.6 M, about 4.7 M, about 4.8 M, about 4.9 M, about 5 M, about 5.1 M, about 5.2 M, about 5.3 M, about 5.4 M, about 5.5 M, about 5.6 M, about 5.7 M, about 5.8 M, about 5.9 M, or about 6 M, and the like, or any concentration therebetween.
In some embodiments, the acid reagent comprises 5-40% acid and 1X-12.8X SSC, wherein the acid and the SSC can each be present independently at any concentration disclosed herein. For example, in some embodiments, the acid reagent comprises 20-30% acid and 3.2X- 12.8X SSC. In some embodiments, the acid reagent comprises 20% acid and 3.2X SSC. In some embodiments, the acid reagent comprises 20% acid and 6.4X SSC. In some embodiments, the acid reagent comprises 20% acid and 12.8X SSC. In some embodiments, the acid reagent comprises 30% acid and 3.2X SSC. In some embodiments, the acid reagent comprises 30% acid and 6.4X SSC. In some embodiments, the acid reagent comprises 30% acid and 12.8X SSC.
In some embodiments, the acid reagent comprises 5-40% acetic acid and 1X-12.8X SSC, wherein the acetic acid and the SSC can each be present independently at any concentration disclosed herein. For example, in some embodiments, the acid reagent comprises 20% acetic acid and 3.2X SSC. In some embodiments, the acid reagent comprises 20% acetic acid and 6.4X SSC. In some embodiments, the acid reagent comprises 20% acetic acid and 12.8X SSC. In some embodiments, the acid reagent comprises 30% acetic acid and 3.2X SSC. In some embodiments, the acid reagent comprises 30% acetic acid and 6.4X SSC. In some embodiments, the acid reagent comprises 30% acetic acid and 12.8X SSC.
In some embodiments, the step of removing SGCs from a sample by adding an acid reagent to effect disruption of hybridization between the SGCs and the target nucleic acids in the sample is carried out at room temperature. In other embodiments, this step can be conducted at a temperature just below or above room temperature. Thus, the acid reagent can be applied to a sample, for example, at temperature of about 4°C to about 40°C. For example, the methods can be carried out at a temperature of about 4°C, about 5°C, about 6°C, about 7°C, about 8°C, about 9°C, about 10°C, about 11°C, about 12°C, about 13°C, about 14°C, about 15°C, about
16°C, about 17°C, about 18°C, about 19°C, about 20°C, about 21°C, about 22°C, about 23 °C, about 24 °C, about 25 °C, about 26 °C, about 27 °C, about 28 °C, about 29 °C, about 30 °C, about 31 °C, about 32 °C, about 33 °C, about 34 °C, about 35 °C, about 36 °C, about 37 °C, about 38 °C, about 39 °C, about 40 °C, or any range therebetween.
In some embodiments, the step of removing SGCs from a sample by adding an acid reagent to effect disruption of hybridization between the SGCs and the target nucleic acids in the sample is carried out for a period of time of about 1 minute to about 30 minutes, or about 3 minutes to about 10 minutes, for example, about 1 minute, about 2 minute, about 3 minute, about 4 minute, about 5 minute, about 6 minute, about 7 minute, about 8 minute, about 9 minute, about 10 minutes, about 11 minutes, about 12 minutes, about 13 minutes, about 14 minutes, about 15 minutes, about 16 minutes, about 17 minutes, about 18 minutes, about 19 minutes, about 20 minutes, about 21 minutes, about 22 minutes, about 23 minutes, about 24 minutes, about 25 minutes, about 26 minutes, about 27 minutes, about 28 minutes, about 29 minutes, about 30 minutes, or any range therebetween. In some embodiments, the acid reagent treatment is repeated 1 to 10 times.
In some embodiments, the step of removing SGCs from a sample by adding an acid reagent to effect disruption of hybridization between the SGCs and the target nucleic acids in the sample is carried out 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or 11 times (i.e., repeated up to 10 times). In some embodiments, this step is carried out 1, 2, 3, 4, 5 or 6 times (i.e., repeated up to 5 times). In some embodiments, this step is carried out 1, 2, or 3 times (i.e., repeated up to 2 times). In some embodiments, the acid reagent is contacted with the sample sequentially without removing the acid reagent (for example, by aspirating the acid reagent from the sample) or washing the sample (for example, washing the sample between applications of the acid reagent). In other embodiments, the acid reagent can be removed from contact with the sample, for example, by aspirating the acid reagent away from the sample or washing the sample with a suitable buffer. Suitable wash buffers include, but are not limited to, a buffer used routinely in in situ hybridization assays. It is understood that the conditions for removal of probes bound to target nucleic acids in a sample can be readily determined depending on the components and concentration of components of the acid reagent, the time of incubation of the acid reagent with the sample, and the number of times the incubation is repeated, as disclosed herein. The effectiveness of the removal of the probes from a sample can be readily determined by analyzing the sample using the same method used to detect a target nucleic acid to see if residual probe can be detected (see Examples I and II). If residual probe is still present, the acid reagent treatment merely needs to be repeated until the previously detected probes are no longer
detected or are detected at a sufficiently low level to permit detectable labeling of a target nucleic acid with the same label in a subsequent round of labeling. Similarly, the number of times that a sample can be treated with the acid reagent while preserving cell morphology and integrity of the nucleic acids in the sample to permit subsequent detection of nucleic acids can be readily determined by performing repeated acid reagent treatments of a sample for a given acid reagent and under a given set of conditions and determining whether or not target nucleic acids can still be detected, for example, by determining the ability to detect a positive control nucleic acid in the sample or determining that a similar cell morphology can be detected in the sample after the sample has been treated one or more times with the acid reagent (see Examples I and II). Once a set of conditions of incubation time and number of repeats of applying a given acid reagent has been determined, the conditions can be applied to other samples. As disclosed herein, a range of acid reagents and conditions were tested and shown to be effective at disrupting hybridization between probes and target nucleic acids that preserved cell morphology and nucleic acid integrity such that a new round of detection of target nucleic acids could be applied. d. Targets and Samples
In an embodiment of the disclosure, the target nucleic acids detected by the methods of the invention can be any nucleic acid present in the cell sample, including but not limited to, RNA, including messenger RNA (mRNA), micro RNA (miRNA), ribosomal RNA (rRNA), mitochondrial RNA, non-coding RNA, and the like, or DNA, and the like. In a particular embodiment, the nucleic acid is RNA. In the disclosed methods for multiplex detection of nucleic acids, it is understood the target nucleic acids can independently be DNA or RNA. In other words, the target nucleic acids to be detected can be, but are not necessarily, the same type of nucleic acid. Thus, the target nucleic acids to be detected in an assay of the invention can be DNA and RNA. In the case where the target nucleic acids are RNA, the target nucleic acids can independently be selected from the group consisting of messenger RNA (mRNA), micro RNA (miRNA), ribosomal RNA (rRNA), mitochondrial RNA, and non-coding RNA. Thus, the target nucleic acids can independently be DNA or any type of RNA.
As described herein, the methods of the disclosure generally relate to in situ detection of target nucleic acids. General embodiments of such methods are well-known to those skilled in the art (see, for example, US 2008/0038725; US 2009/0081688; Hicks et al., J. Mol. Histol. 35:595-601 (2004)). As used herein, “in situ hybridization” or “ISH” refers to a type of hybridization that uses a directly or indirectly labeled complementary DNA or RNA strand, such as a probe, to bind to and localize a specific nucleic acid, such as DNA or RNA, in a
sample, in particular a portion or section of tissue or cells (in situ). The probe types can be double stranded DNA (dsDNA), single stranded DNA (ssDNA), single stranded complimentary RNA (sscRNA), messenger RNA (mRNA), micro RNA (miRNA), ribosomal RNA, mitochondrial RNA, and/or synthetic oligonucleotides. The term “fluorescent in situ hybridization” or “FISH” refers to a type of ISH utilizing a fluorescent label. The term “chromogenic in situ hybridization” or “CISH” refers to a type of ISH with a chromogenic label. ISH, FISH and CISH methods are well known to those skilled in the art (see, for example, Staler, Clinics in Laboratory Medicine 10(l):215-236 (1990); In situ hybridization. A practical approach, Wilkinson, ed., IRL Press, Oxford (1992); Schwarzacher and Heslop- Harrison, Practical in situ hybridization, BIOS Scientific Publishers Ltd, Oxford (2000)).
In the disclosed methods for detection of a plurality of targets in a sample, the sample comprises cells that are optionally fixed and/or permeabilized before hybridization of the target probes. Fixing and permeabilizing cells can facilitate retaining the nucleic acid targets in the cell and permit the target probes, label probes, amplifiers, pre-amplifiers, pre-pre-amplifiers, and so forth, to enter the cell and reach the target nucleic acid molecule. The cell is optionally washed to remove materials not captured to a nucleic acid target. The cell can be washed after any of various steps, for example, after hybridization of the target probes to the nucleic acid targets to remove unbound target probes, after hybridization of the pre-pre-amplifiers, preamplifiers, amplifiers, and/or label probes to the target probes, and the like. Methods for fixing and permeabilizing cells for in situ detection of nucleic acids, as well as methods for hybridizing, washing and detecting target nucleic acids, are also well known in the art (see, for example, US 2008/0038725; US 2009/0081688; Hicks et al., J. Mol. Histol. 35:595-601 (2004); Staler, Clinics in Laboratory Medicine 10(l):215-236 (1990); In situ hybridization. A practical approach, Wilkinson, ed., IRL Press, Oxford (1992); Schwarzacher and Heslop- Harrison, Practical in situ hybridization, BIOS Scientific Publishers Ltd, Oxford (2000); Shapiro, Practical Flow Cytometry 3rd ed., Wiley -Liss, New York (1995); Ormerod, Flow Cytometry, 2nd ed., Springer (1999)). Exemplary fixing agents include, but are not limited to, aldehydes (formaldehyde, glutaraldehyde, and the like), acetone, alcohols (methanol, ethanol, and the like). Exemplary permeabilizing agents include, but are not limited to, alcohols (methanol, ethanol, and the like), acids (glacial acetic acid, and the like), detergents (Triton, NP-40, Tween™ 20, and the like), saponin, digitonin, Leucoperm™ (BioRad, Hercules, CA), and enzymes (for example, lysozyme, lipases, proteases and peptidases). Permeabilization can also occur by mechanical disruption, such as in tissue slices.
For in situ detection of double stranded nucleic acids, generally the sample is treated to denature the double stranded nucleic acids in the sample to provide accessibility for the target probes to bind by hybridization to a strand of the target double stranded nucleic acid. Conditions for denaturing double stranded nucleic acids are well known in the art, and include heat and chemical denaturation, for example, with base (NaOH), formamide, dimethyl sulfoxide, and the like (see Wang et al., Environ. Health Toxicol. 29:e2014007 (doi: 10.5620/eht.2014.29.e2014007) 2014; Sambrook et al., Molecular Cloning: A Laboratory Manual, Third Ed., Cold Spring Harbor Laboratory, New York (2001); Ausubel et al., Current Protocols in Molecular Biology, John Wiley and Sons, Baltimore, MD (1999)). For example, NaOH, LiOH or KOH, or other high pH buffers (pH >11) can be used to denature double stranded nucleic acids such as DNA. In addition, heat and chemical denaturation methods can be used in combination.
Such in situ detection methods can be used on tissue specimens immobilized on a glass slide, on single cells in suspension such as peripheral blood mononucleated cells (PBMCs) isolated from blood samples, and the like. Tissue specimens include, for example, tissue biopsy samples. Blood samples include, for example, blood samples taken for diagnostic purposes. In the case of a blood sample, the blood can be directly analyzed, such as in a blood smear, or the blood can be processed, for example, lysis of red blood cells, isolation of PBMCs or leukocytes, isolation of target cells, and the like, such that the cells in the sample analyzed by the disclosed methods are in a blood sample or are derived from a blood sample. Similarly, a tissue specimen can be processed, for example, the tissue specimen minced and treated physically or enzymatically to disrupt the tissue into individual cells or cell clusters. Additionally, a cytological sample can be processed to isolate cells or disrupt cell clusters, if desired. Thus, the tissue, blood and cytological samples can be obtained and processed using methods well known in the art. The disclosed methods can be used in diagnostic applications to identify the presence or absence of pathological cells based on the presence or absence of a nucleic acid target that is a biomarker indicative of a pathology.
Any of a number of suitable samples can be used for detecting target nucleic acids using the disclosed methods. In some embodiments, the sample is a biological sample or tissue sample. Such a sample can be obtained from a biological subject, including a sample of biological tissue or fluid origin that is collected from an individual or some other source of biological material such as biopsy, autopsy or forensic materials. A biological sample also includes samples from a region of a biological subject containing or suspected of containing precancerous or cancer cells or tissues, for example, a tissue biopsy, including fine needle
aspirates, blood sample or cytological specimen. Such samples can be, but are not limited to, organs, tissues, tissue fractions and/or cells isolated from an organism such as a mammal. Exemplary biological samples include, but are not limited to, a cell culture, including a primary cell culture, a cell line, a tissue, an organ, an organelle, a biological fluid, and the like. Additional biological samples include but are not limited to a skin sample, tissue biopsies, including fine needle aspirates, cytological samples, stool, bodily fluids, including blood and/or serum samples, saliva, semen, and the like. Such samples can be used for medical or veterinary diagnostic purposes. A sample can also be obtained from other sources, for example, food, soil, surfaces of objects, and the like, and other materials for which detection of target nucleic acids is desired. Thus, the disclosed methods can be used for detection of one or more pathogens, such as a virus, a bacterium, a fungus, a single celled organism such as a parasite, and the like, from a biological sample obtained from an individual or other sources.
Collection of cytological samples for analysis are well known in the art (see, for example, Dey, “Cytology Sample Procurement, Fixation and Processing” in Basic and Advanced Laboratory Techniques in Histopathology and Cytology pp. 121-132, Springer, Singapore (2018); “Non-Gynecological Cytology Practice Guideline” American Society of Cytopathology, Adopted by the ASC executive board March 2, 2004). Methods for processing samples for analysis of cervical tissue, including tissue biopsy and cytology samples, are well known in the art (see, for example, Cecil Textbook of Medicine, Bennett and Plum, eds., 20th ed., WB Saunders, Philadelphia (1996); Colposcopy and Treatment of Cervical Intraepithelial Neoplasia: A Beginner’s Manual, Sellers and Sankaranarayanan, eds., International Agency for Research on Cancer, Lyon, France (2003); Kalaf and Cooper, J. Clin. Pathol. 60:449-455 (2007); Brown and Trimble, Best Pract. Res. Clin. Obstet. Gynaecol. 26:233-242 (2012); Waxman et al., Obstet. Gynecol. 120:1465-1471 (2012); Cervical Cytology Practice Guidelines TOC, Approved by the American Society of Cytopathology (ASC) Executive Board, November 10, 2000)). In one embodiment, the cytological sample is a cervical sample, for example, a pap smear. In one embodiment, the sample is a fine needle aspirate.
In particular embodiments, the sample is a tissue specimen or is derived from a tissue specimen. In some embodiments, the tissue specimen is a formalin-fixed paraffin-embedded (FFPE) sample. In some embodiments, the tissue specimen is fresh frozen. In some embodiments, the tissue specimen is prepared with a fixative. In some embodiments, the tissue specimen is prepared with a crosslinking fixative. In other particular embodiments of the invention, the sample is a blood sample or is derived from a blood sample. In still other
particular embodiments of the invention, the sample is a cytological sample or is derived from a cytological sample.
In some embodiments, the method for detecting a target nucleic acid in a cell provided herein comprises a pretreatment step before hybridization of the target probe sets. In some embodiments, the pretreatment step comprises a blocking step where certain blocking agent(s) is/are applied to block certain endogenous components of the cell thus reducing assay background. For example, hydrogen peroxide is a blocking agent when horseradish peroxidase (HRP) is used as detection enzyme in the later steps. Hydrogen peroxide is added to inactivate the endogenous HRP activity in the sample, thus reducing assay background. In a specific embodiment, this blocking step is added as the first step in the pretreatment right after deparaffmization. In some embodiments, the pretreatment step comprises an epitope retrieval step, where certain epitope retrieval buffer(s) can be added to unmask the target nucleic acid. In some embodiments, the epitope retrieval step comprises heating the sample. In some embodiments, the epitope retrieval step comprises heating the sample to 50 °C to 100 °C. In one embodiment, the epitope retrieval step comprises heating the sample to about 88°C. In some embodiments, the pretreatment step comprises a permeabilization step to retain the nucleic acid targets in the cell and to permit the target probe(s), signal-generating complex, etc. to enter the cell. In some embodiments, the permeabilization step comprises a digestion with a protease. Detergents (e.g., Triton X-100 or SDS) and Proteinase K can also be used to increase the permeability of the fixed cells. Detergent treatment, usually with Triton X-100 or SDS, is frequently used to permeate the membranes by extracting the lipids. Proteinase K is a nonspecific protease that is active over a wide pH range and is not easily inactivated. It is used to digest proteins that surround the target mRNA. Optimal concentrations and durations of treatment can be experimentally determined as is known in the art. A cell washing step can follow, to remove the dissolved materials produced in the any steps in the pretreatment step. In some embodiments, the sample is in a formalin-fixed paraffin embedded tissue, a deparaffmization step is needed, when paraffin is removed. e. Codetection Methods
The methods described herein generally relates to detection of multiple target nucleic acids in a sample. It is understood that the disclosed can additionally be applied to detecting multiple target nucleic acids and optionally other molecules in the sample, in particular in the same cell as the target nucleic acid. For example, in addition to detecting multiple target nucleic acids, proteins expressed in a cell can also concurrently be detected using a similar rationale as described herein for detecting target nucleic acids. In this case, in one or more rounds of
detection of multiple target nucleic acids, and optionally one or more proteins expressed in a cell can be detected, for example, by using a detectable label to detect the protein. For example, if the protein is being detected in an earlier round of target nucleic acid detection, the protein can be detected with a cleavable label, similar to that used for detecting target nucleic acids. If the protein is being detected in the last round of detection, the label does not need to be cleavable. Detection of proteins in a cell are well known to those skilled in the art, for example, by detecting the binding of protein-specific antibodies using any of the well-known detection systems, including those described herein for detection of target nucleic acids. Detection of target nucleic acids and protein in the same cell has been described (see also Schulz et al., Cell Syst. 6(l):25-36 (2018)).
In some embodiments, proteins can be co-detected using antibodies (or fragments thereof) that are conjugated to an oligonucleotide, using the same types of SGCs that are disclosed herein. Such methods are disclosed in U.S. Provisional Patent Application No. 63/301,711, which is incorporated herein by reference.
Kits
Embodiments of the present disclosure also include a kit for carrying out the methods disclosed herein. The kit can include the components described herein for performing a multiplexed in situ hybridization reaction, e.g., sets of target probes, SGCs (including label probes and amplifiers, pre-amplifiers, and/or pre-pre-amplifiers). Such components have been discussed extensively herein. The kit further comprises at least one compound or composition for removing signals generated by the detectable labels from the sample, to facilitate additional rounds of labeling and detection as discussed herein.
For example, in some embodiments, the kit further comprises a reagent for cleaving labels from a label probe, such as a reducing agent, periodate, dithionite, hydroxylamine, base, or the like. In one embodiment, the cleaving reagent is tris(2-carboxyethyl)phosphine (TCEP).
In some embodiments, the kit further comprises a reagent that disrupts hybridization between SGCs and a target nucleic acid, such as an acid reagent. Exemplary acids suitable for use in an acid reagent include, but are not limited to, formic acid, acetic acid, propionic acid, butyric acid, valeric acid, oxalic acid, malonic acid, succinic acid, malic acid, tartaric acid, citric acid, and the like. The acid reagent generally includes an acid at a concentration of about 5-40% acid (vol/vol). Further information regarding acid reagents is provided above; any such reagents can be included in the kits.
In some embodiments, the kit further comprises other agents or materials for performing RNA ISH, including fixing agents and agents for treating samples for preparing hybridization, agents for washing samples, and so on.
The kit may further comprise packaging material, which refers to a physical structure housing the components of the kit. The packaging material can maintain the components sterilely, and can be made of material commonly used for such purposes (e.g. , paper, corrugated fiber, glass, plastic, foil, ampules, vials, tubes, etc.).
Kits provided herein can include labels or inserts. Labels or inserts include information on a condition, disorder, disease, or symptom for which the kit component may be used for. Labels or inserts can include instructions for a clinician or for a subject to use one or more of the kit components in a method, treatment protocol, or therapeutic regimen. In some embodiments, the kit can be used for identification of tissues and cell types. In some embodiments, the kit can be used for identification of different stages of development. In some embodiments, the kit can be used for detection of clinical biomarkers for cancers. In some embodiments, the kit can be used for diagnosing a disease or disorder based on the expression of one or more altered small RNAs or the presence of pathogen-associated small RNAs. In some embodiments, the kit can be used for characterization of adult tissue. In some embodiments, the kit can be used for detection of clinical biomarkers for pathogen diagnosis. In some embodiments, the kit can be used for detection and characterization of small RNA- based therapies. In some embodiments, the kit can be used for confirmation of the initial efficiency of small RNA-based therapies. In some embodiments, the kit can be used to continue monitoring the efficiency of small RNA-based therapies. In some embodiments, the kit can be used for determining the efficiency of small RNA-based therapies. In some embodiments, the kit can be used for detecting the presence, localizing, and quantifying siRNAs. In some embodiments, the kit can be used for detecting the presence, localizing, and quantifying ASO molecules. In some embodiments, the kit can be used for detection and identification of pathogen-derived small RNAs.
Image Processing
Embodiments of the present disclosure also include a method for enhancing detection of a target (e.g., a target nucleic acid). In some embodiments, the method includes an image processing method, such as the methods described in International Patent Application PCT/US22/24975, which is herein incorporated by reference. The method is implemented at least in part with a computer having corresponding instructions stored on a memory (i.e., a non-
transitory computer readable medium). The final images, and in some embodiments the intermediate images, from the method are stored in a memory. In some embodiments, the memory is accessible by a network. In some embodiments, user input or instructions are receivable or accessible over the network.
The method includes imaging a sample with a target signal to create a probe image and imaging a sample with no target signal to create a background image (i.e., “blank image”). In some embodiments, a “blank image” is an image that includes RNA blocking molecules of the present disclosure. In some embodiments, a “blank image” is an image that does not include RNA blocking molecules of the present disclosure. In some embodiments, the imaging utilizes a fluorescent microscope coupled to a computer via a network. In some embodiments, the target signal is obtained by subjecting the sample to a fluorescent in situ hybridization assay and/or an immunofluorescence assay. In some embodiments, the background image with no target signal is obtained by removing the target signal from the sample (i.e., by a cleaving process). In other embodiments, the background image with no target signal is obtained before the assay is performed. In some embodiments, the target signal comprises a fluorescent label bound to a target nucleic acid. In other embodiments, the target signal comprises a fluorescent label bound to a target peptide or polypeptide.
The method can also include registering the probe image and the background image (e.g., with or without RNA blocking molecules). Potential background fluorescence discrepancy between the probe image and the background image creates spatial pattern mismatches that occur due to whole sample movement between different rounds of image acquisition. To remove such discrepancies, image registration techniques (e.g., phase correlation) are utilized. Robust image registration utilizes detection and matching of image features to compensate for any global sample movement (i.e., translation and rotation).
The method further includes modifying the background image (e.g., with or without RNA blocking molecules) to create an adjusted background image (e.g., transformed, intensity- adjusted blank image) based on at least one image metric. As explained further herein, the at least one image metric is a ratio factor, a multiplication factor, a local maximum value transform, and any other suitable metric. In some embodiments, the method includes a single image metric. In other embodiments, the method includes a combination of image metrics.
In some embodiments, the method further includes subtracting the adjusted background image (e.g., with or without RNA blocking molecules) from the probe image to create a final image comprising an enhanced target signal. In other words, the modified (i.e., transformed, adjusted, scaled, etc.) blank image is used in the subtracting step instead of the original blank
image. In some embodiments, the enhanced target signal includes enhanced contrast. In some embodiments, the method includes displaying the final image on a display (e.g., a computer display). The final image may be saved to a memory and may be accessible by a user, for example, over a network. As such, the method provides improved signal detection in the presence of a background with tissue autofluorescence.
In some embodiments, the image metric is a ratio factor to account for intensity differences in background between the blank image and the probe image. Intensity differences can occur when image acquisition settings are different or from photobleaching during fluorophore excitation. To compensate for background intensity differences, the method includes determining a ratio factor that compares the overall background intensity of the probe image versus the blank image. First, the pixel locations of the probe are estimated. The probe locations in the probe image are estimated using, for example, the White Top Hat algorithm (Gonzalez & Woods, 2008, Digital Image Processing), bandpass filtering (Shenoi, 2006, Introduction to Digital Signal Processing and Filter Design), or any combination of suitable methods. After determining an estimated location of the target signals in the probe image, the pixels at the estimated probe locations are excluded from both the probe image and the blank image, resulting in background-pixel-only images (i.e., background-only images). In other words, the method includes removing the estimated location from the probe image to create a first background-only image and removing the estimated location from the blank image (background image) to create a second background-only image.
Following removal of the estimated probe locations from both images, the method includes determining a ratio factor. In other words, a statistical metric for both the probe- excluded blank image and the probe-excluded probe image is evaluated and incorporated into a ratio factor. The ratio factor is utilized in some embodiments to modify the background image to create an adjusted background image. In other words, modifying the background image to create an adjusted background image can include, in some embodiments, scaling the background image by the ratio factor.
In some embodiments, the at least one image metric is a ratio factor of the first background-only image and the second background-only image. For example, the ratio factor in some embodiments is a first intensity to a second intensity, with the first intensity is determined from the first background-only image and the second intensity is determined from the second background-only image. In some embodiments, the first and second intensities used in the ratio factor are statistical metrics such as a statistical mean, median, or a combination of both for any portion of (including all) the intensity values in an image.
In some embodiments, the first intensity is the mean of a plurality of pixel intensity values in the first background-only image and the second intensity is the mean of a plurality of pixel intensity values in the second background-only image. In some embodiments, the mean is of all the pixel intensity values in the image. In other embodiments, the first intensity is the median of a plurality of pixel intensity values in the first background-only image, and the second intensity is the median of a plurality of pixel intensity values in the second background- only image. In some embodiments, the median is of all the pixel intensity values in the image. In another embodiment, the first intensity is the mean of a central approximately 80% of all the pixel intensity values (i.e., excluding the approximate top 10% and the approximate bottom 10%) in the first background-only image, and the second intensity is the mean of a central approximately 80% of all the pixel intensity values in the second background-only image.
In some embodiments, the image metric is a multiplication factor to account for potential local intensity differences between the blank image and the probe image. In particular, the method includes determining the multiplication factor. In some embodiments, the multiplication factor is within a range of approximately 1.0 to approximately 1.2. In other embodiments, the multiplication factor is within a range of approximately 1.0 to approximately 1.1. The multiplication factor is utilized in some embodiments to modify the background image to create an adjusted background image. In other words, modifying the background image to create an adjusted background image can include, in some embodiments, scaling the background image by the multiplication factor.
In some embodiments, the image metric is a local maximum value transform. In particular, the method includes transforming the blank image with a local maximum value transform. Even after global image registration, there may remain local background patern mismatches that from, for example, image acquisition at different focal planes, or samples not firmly attached to the supporting material (e.g., glass slides) and partially moving between imaging sessions. To resolve this issue, local mismatches are compensated with a transform. In the illustrated embodiment, for each pixel in the blank image (“pixel of interest”), a neighborhood of a pre-defined radius surrounding the pixel of interest is searched. The search process will find the pixel of maximum intensity, and this maximum intensity is assigned to that pixel of interest. This searching procedure is performed for each pixel of interest, searching its neighborhood in the original blank image, to form a transformed blank image. As explained in greater detail herein, the transformed blank image can be used instead of the original blank image in the later subtracting step. In some embodiments, the pre-defmed radius (“match distance”) is adjustable.
In some embodiments, the pre-defined radius used in the local maximum valve transform is within a range of approximately 0 to approximately 5 pixels. In other words, the local maximum value transform includes a search radius within a range of approximately 0 to 5 pixels. A pre-defined radius of 0 pixels is utilized, for example, when there is no noticeable local background pattern mismatch. In some embodiments, the search area is simplified to reduce computational time by using eight angularly equally spaced lines (i.e., 45 degrees apart), each with a single-pixel width, radiating from the pixel of interest.
In some embodiments, the image metric is a block-matching transform. In particular, the method, in some embodiments, includes a step to transform the blank image with a blockmatching transform. In some embodiments, the block-matching transform is used in place of the local maximum value transform to resolve the issue of local mismatches. In some embodiments, a block (“block of interest”) is used with a pre-defined block size (e.g., a 3-pixel- by-3-pixel block). Each block in the blank image is compared with blocks of the same size in the probe image in nearby locations (i.e., within a pre-defined block search size). The search determines the nearby block that is most similar to the block of interest. A similarity metric is utilized to measure the similarity of the blocks, and the searched nearby block with the highest similarity metric is determined to be the target block. Then, the block of interest is moved to the corresponding location of the target block. In some embodiments, the similarity metric is a mean absolute difference, a sum of absolute difference, a mean squared difference, or a sum of squared difference, wherein the differences are the pixel intensity differences between the two blocks being compared. As such, the block-matching transform is performed for each block of interest, searching its corresponding neighborhood in the probe image and moving its location accordingly, to form a transformed blank image. In some embodiments, this transformed blank image is used instead of the original blank image in later subtracting steps.
In some embodiments, the pre-defined block size and the pre-defined block search size are adjustable. In some embodiments, the pre-defined block size used in the block-matching transform is within a range of approximately 1 to approximately 10 pixels. In other words, the block-matching transform includes a block size within a range of approximately 1 to 10 pixels. In some embodiments, the pre-defined block search size used in the block matching transform is within a range of approximately 1 to approximately 10 pixels. In other words, the blockmatching transform includes a block search size within a range of approximately 1 to 10 pixels.
In some embodiments, the method for enhancing detection of a target includes any combination of the steps described herein, in various orders. In some embodiments, steps may
be omitted. Further, the order of the steps may be reversed, altered, or performed simultaneously.
In at least one embodiment, the electronic-based aspects of the method may be implemented in software (e.g., stored on non-transitory computer-readable medium) executable by a computer with one or more processing units, such as a microprocessor and/or application specific integrated circuits (“ASICs”). Some embodiments may include hardware, software, and electronic components or modules. As such, it should be noted that a plurality of hardware and software-based devices, as well as a plurality of different structural components, may be utilized to implement the embodiments.
Experimental Materials and Methods
Tissue collection. Mouse brain sections (5um thick) were collected from a fresh-frozen BL6 mouse brain from a 1 month old mouse and a 12-month-old mouse. The tissues were embedded in OCT, snap-frozen, and sectioned coronally from the brain block with a cryotome. Brain sections were fixed in 10% NBF and dehydrated in Ethanol before the assay.
Target probe selections, synthesis, and formulation. One hundred mouse genes were selected as the target of transcriptomic profiling. These genes were examined for their spatial expression pattern in the mouse brains, their expression levels, and their specificity for brain cell types using single-cell RNAseq datasets from public sc-RNA seq data deposits. Multiplex and Hiplex assays were also performed on the coronal mouse brain sections to further verify the spatial expression patterns of some high-expression genes. The goal of these screening steps was to avoid any spatial crowding in the final FISH images, which would otherwise defeat the decoding process.
All target probes were designed by ACD probe design team and were checked to ensure no potential cross-reactivity to other RNA species. All DNA oligo probes were formulated at lOuM concentration and were pooled in ACD hybridization buffer A at 1:500 dilution. 100 target probes were designed and synthesized for the first cycle, and another set of 100 target probes for the second cycle of the assay.
Coding Scheme. The assay is built on a codebook that is composed of 10 fundamental codes. Each fundamental code has 5 bits and two fluorescence signals (e.g., ‘ 10100’ has an AF488 signal and a Dy594 signal). A pool of 5 unique fluorophores was used to generate various combinations of two fluorophores, which include: AF488, Dy550, Dy650, Dy594, and AF750. The resultant 5-bit codes are listed below:
Table 4. The list of the 10 fundamental codes used in an exemplary lOOplex Ultraplex assay.
To reach the lOOplex analytic capacity, the 5 -bit codes were expanded to create a new codebook with 100 ‘ 10-bit’ codes. This was accomplished by running two barcoding cycles, where the primary 5-bit code is randomly paired with another 5-bit code from the list above (Table 4). Each 10-bit code is unique in the codebook and can be assigned to a gene target. This book of 100 codes is provided in Table 5. Table 5. Codebook for an exemplary lOOplex Ultraplex assay. Each code has 10 bits and contains four fluorescence signals.
RNA molecule labeling. To encode the RNA molecules with corresponding codes, each RNA transcript was first hybridized a specific double-Z target probe designed by ACD. The target probe was then annealed with 3 tiers of branching DNA oligos (i.e. , AMP1, AMP2, and AMP3). The third AMP oligo contained a unique binding sequence that compliments the corresponding fluorophore-conjugated label oligos (label probe, or LP). The development of signal generating configuration was completed upon incubating the branching DNA structure with its corresponding label probes. For each RNA species, the branching DNA structure it hybridized to was annealed with its complementary label probes that were conjugated with either fluorophore A or fluorophore B (50%:50% LP-A /LP-B in the final solution), as predetermined in the codebook (Table 5). This allows the RNA molecule to emit two fluorescence signals under a fluorescent microscope, and therefore display a unique 5-bit code in the acquired multi-spectral image. All 100 RNA targets included in the panel were encoded twice through two independent barcoding cycles. The first cycle of barcoding provides the first 5 bits of the code, and the second cycle provides the last 5 bits of the code. In each cycle, all 100 genes were simultaneously labeled with their corresponding 5-bit codes, which does not require any specific temporal barcoding order (FIGS. 16A-16B). In addition, the two cycles were independent of each other. Each barcoding cycle can be performed either first or last. In each cycle, any prior branching DNA structures were disrupted and stripped off from the RNA molecules using the HiplexUP reagent, and the RNA molecules were subject to a new signal generation configuration, beginning with a new target probe hybridization.
This assay has been performed on both Leica BondRx system and with a streamlined manual protocol. Both approaches resulted in excellent positive signals and very clean background.
Protease pretreatment for target retrieval. For the Ultraplex assay, the ACD PT3, PT4 or Sigma ready-to-use pepsin solution (cat No. R2283) were used, depending on the tissue types, tissue processing methods, and extent of formalin fixation. For example, in the Ultraplex assay with mouse brain FFPE tissue, the tissue was incubated with 300uL of pepsin at room temperature for lOmins. Pepsin offers some unique benefits over the PT3 or PT4, including reduction of autofluorescent background in the FFPE tissues, removal of >90% red blood cells, and removal of non-specific signals in the Ultraplex assay, which all contribute to a higher signal-to-noise ratio.
Image analysis. The stained tissue slides were imaged by a Zeiss Axio Imager Z2 epifluorescent microscope. For each tissue, an xyz scan was conducted in the entire tissue. Final fluorescent microscopic images underwent a series of pre-processing steps: image
correction for chromatic errors and optical field shading, tile image registration for cycle 1 and cycle2 scans, tile stitching to a large tissue image. The preprocessed whole-slide images were then imported into an advanced decoding pipeline that are developed with deep learning by ACD image analysis team to extract the spatial transcriptomics information from the test tissues. Final assay outputs from the pipeline include: a cell x feature matrix, cell segmentation boundaries, field of view ID, the x and y coordinates of all detected transcripts in the tissue, and x and y coordinates of the centroids of each cell in the tissue.
EXAMPLES Example 1
The experiments below used the “code book” set forth in Table 6, using the indicated labeling schemes and corresponding alphanumeric code in each round of labeling and detection. Table 6. Code book based on four distinct labels to generate independent identifiers of various targets in the experiments disclosed herein.
A single gene (RNA transcript), Ubc, was detected in a two-cycle experiment according to the methods disclosed herein. In each round of labeling, the independent identifier was based on a single fluorescent label, corresponding to Code 3 in Table la.
Samples (fresh-frozen Balb/c mouse brain sections from Acepix) were fixed 10% neutral buffered formalin (NBF) for 1 hour at room temperature, subjected to serial dehydration in 50%, 80%, and 100% ethanol, and then subjected to protease-IV digestion for 30 minutes at room temperature. For the first round of labeling, target probe sets were hybridized for 2 hours at 40 °C, followed by hybridization of amplification reagents, and incubation with label probes (based on Dy650) for 15 minutes at 40 °C. Following DAPI counterstaining, mounting, and imaging (Akoya Polaris microscope, 40x), the fluorophores from the label probes were cleaved, and the amplification trees were removed using the HiplexUP reagent (Advanced Cell Diagnostics, Inc.) (3 incubations at 5 minutes each, room temperature). A second round of target probe sets were then hybridized for 2 hours at 40 °C, followed by hybridization of amplification reagents, incubation with label probes (based on Dy650) for 15 minutes at 40 °C, DAPI counterstaining, mounting, and imaging (Akoya Polaris microscope, 40x). Standard washing steps were used between the main hybridization steps.
A resulting combined image from the two rounds of detection is shown in FIG. 11. The signals detected in the first round are indicated by triangles, and signals detected in the second round are indicated by squares. Locations with overlapping triangles and squares are the locations where the target gene, Ubc, was detected.
Example 2
Three genes (RNA transcripts Gadph, Sdha, and Ubc) were detected in a two-cycle experiment according to the methods disclosed herein. Each target was assigned the same first independent identifier based on a single fluorescent label (Code 3, see Table 6), and each target was assigned a different second independent identifier based on a single fluorescent label (Codes 1, 2, and 3 respectively, see Table 6).
Samples (fresh-frozen Balb/c mouse brain sections from Acepix) were fixed 10% neutral buffered formalin (NBF) for 1 hour at room temperature, subjected to serial dehydration in 50%, 80%, and 100% ethanol, and then subjected to protease-IV digestion for 30 minutes at room temperature. For the first round of labeling, target probe sets were hybridized for 2 hours at 40 °C, followed by hybridization of amplification reagents, and incubation with label probes (based on Dy650) for 15 minutes at 40 °C. Following DAPI counterstaining, mounting, and imaging (Akoya Polaris microscope, 40x), the fluorophores from the label probes were cleaved, and the amplification trees were removed using the HiplexUP reagent (Advanced Cell Diagnostics, Newark, CA) (3 incubations at 5 minutes each, room temperature). A second round of target probe sets were then hybridized for 2 hours at 40 °C, followed by hybridization of amplification reagents, incubation with label probes (based on Dy650, Dy550, and AF488) for 15 minutes at 40 °C, DAPI counterstaining, mounting, and imaging (Akoya Polaris microscope, 40x). Standard washing steps were used between the main hybridization steps.
FIG. 12A shows detection of Gadph, FIG. 12B shows detection of Sdha, and FIG. 12C shows detection of Ubc. In each image, the signals detected in the first cycle (i.e., all three targets), are shown in triangles. In FIG. 12A, signals detected in the second cycle are indicated by circles, and locations with overlapping triangles and circles are the locations where the combined first and second identifiers (i.e., 31) show that Gadph was detected. In FIG. 12B, signals detected in the second cycle are indicated by crosses, and locations with overlapping triangles and crosses are the locations where the combined first and second identifiers (i.e., 32) show that Sdha was detected. In FIG. 12C, signals detected in the second cycle are indicated by squares, and locations with overlapping triangles and squares are the locations where the combined first and second identifiers (i.e., 33) show that Ubc was detected.
Example 3
Experiments were conducted to demonstrate that target nucleic acids can be detected using the disclosed methods independent of the order in which the first and second independent identifiers are obtained. The human peptidylprolyl isomerase B (Hs-PPIB) gene transcript was detected in two separate two-cycle experiments, each using two fluorescent labels in each round of detection. In this experiment, LI = Alexa Fluor 488, L2 = Dylight 550, L3 = Dylight 650, and L4 = Alexa Fluor 750 (see Table 6). In the first experiment, the first independent identifier was assigned Code A (L3, L4) and the second independent identifier was assigned Code 8 (L2, L3). In the second experiment, the identifiers were reversed, with the first
independent identifier being assigned Code 8 (L2, L3) and the second independent identifier being assigned Code A (L3, L4).
In the first experiment, following standard pretreatment of an FFPE HeLa cell pellet (including baking, deparaffmization, heat-induced epitope retrieval, and protease treatment), target probes were hybridized to the target for 2 hours at 40 °C, followed by treatment with amplification reagents for 30 minutes each at 40 °C, and then with the appropriate label probes for 30 minutes at 40 °C. Following DAPI counterstaining, mounting, and imaging (Akoya Polaris microscope, 40x, images shown in FIG. 13 A), the labels were cleaved and the signalgenerating complexes were removed using a cleaving reagent and HiplexUP reagent (Advanced Cell Diagnostics, Newark, CA), respectively. The second round of labeling and detection was conducted in a manner similar to the first round. Standard washing steps were used between the main hybridization steps. Images are shown in FIG. 13B, and an overlay of the signals detected in the first and second rounds of labeling and imaging is shown in FIG. 13C, with shaded circles showing locations in which the full target identifier (A8) was detected.
The second experiment was conducted with a reversed coding approach. Images from each round of labeling are shown in FIGS. 14A and 14B, and an overlay is shown in FIG. 14C.
These results demonstrate that the disclosed methods are not dependent on any particular temporal order of signals, but rather the first and second independent identifiers can be detected in either order with equal efficiency, and result in the same level of target detection.
Example 4
Experiments were conducted to demonstrate that the individual signals from different labels in an SGC used in methods disclosed herein can be detected in any order to generate each independent identifier. Following pretreatment of an FFPE HeLa cell pellet (including baking, deparaffmization, heat-induced epitope retrieval, and protease treatment), human PPIB target probes were hybridized to the targets by incubating the slides with probe solution for 2 hours at 40 °C, followed by treatment with a pre-pre-amplifiers, pre-amplifiers, and amplifiers for 30 minutes each at 40 °C, and then with two labels, corresponding to Code 8 in Table 6 (Dylight 550 and Dylight 650) for 30min at 40 °C. Images were obtained on a Leica DM2000 epifluorescence microscope using two different filter sequences; the first being DAPI-GFP- Cy3-Cy5-Cy7, and the second being GFP-DAPI-Cy7-Cy5-Cy3. Individual images are shown in FIG. 15 A, overlay images are shown in FIG. 15B, and overlays showing individual “dots” for detection of PPIB RNA are shown in FIG. 15C. These demonstrate that using different detection orders, the disclosed methods can achieve the same level of detection efficiency.
Example 5
Using 100 target genes screened from a published control assay mouse brain panel, the spatial transcriptomics in a 1 month old C57BL/6 mouse brain and a 1 year old C57BL/6 mouse brain were evaluated (FIG. 17). Each of the 100 target genes is differentially expressed by a unique cell type in the mouse brain and thus can be used to mark the particular cell type. In comparison with the publicly available control data from the same mouse brain tissue, these single-cell gene expression results were significantly correlated with the corresponding control data sets (FIG. 17). The correlation coefficient (R value) ranges from 0.6-0.81, indicating that the Ultraplex assay can effectively profile the gene expressions of mouse brain cells at singlecell level.
When comparing the minor discrepancies in gene expression between the Ultraplex assay and control assay, it was found that the Ultraplex assay resulted in more accurate measurements of the gene expressions in mouse brain, as suggested by the Hiplex assay performed in the same mouse brain tissues against these disputed genes. In comparison to the gene expression (in copy number/cell) measured by RNAscope Hiplex assay, the control assay tends to overestimate the expressions of certain disputed genes (FIG. 18).
Cell clustering analysis for the Ultraplex “cell x feature” matrix data was also performed, and this analysis successfully identified 10-16 major cell types in different mouse brain regions. Each cell analyzed in the mouse brain slides was assigned with a specific cell type (e.g., L2/3 interneurons, Sst neurons, microglia, etc.), according to the cluster location of the cell in the UMAP. The Ultraplex cell type annotation in the 1 month old and 1 year old mouse brain and their corresponding spatial distribution (laminar locations) were compared with published datasets (Allen brain institute and Xenium public mouse brain datasets), and the results indicated similar cell type annotations and consistent spatial distribution across the mouse brain datasets (FIG. 19).
Claims
1. A method for detecting a plurality of target nucleic acids in a sample, the method comprising: contacting a sample comprising a plurality of target nucleic acids with a plurality of target probe sets, wherein each target probe set is complementary to a target nucleic acid; contacting the sample with a first plurality of signal generating complexes (SGCs) capable of hybridizing to at least one target probe set, wherein each SGC comprises a plurality of spatially distributed label probes comprising detectable labels; obtaining a first independent identifier corresponding to at least one target nucleic acid, wherein the first independent identifier is based on the number of distinct detectable labels present in each SGC of the first plurality of SGCs; removing signals generated by the detectable labels from the sample; contacting the sample with at least a second plurality of SGCs capable of hybridizing to at least one target probe set, wherein each SGC comprises a plurality of spatially distributed label probes comprising detectable labels; and obtaining a second independent identifier corresponding to at least one target nucleic acid, wherein the second independent identifier is based on the number of distinct detectable labels present in each SGC of the second plurality of SGCs.
2. The method of claim 1, wherein the method further comprises identifying a target nucleic acid of the plurality of target nucleic acids based on a combination of the first and second independent identifiers.
3. The method of claim 2, wherein the target nucleic acid is identified independent of the order by which the first and second independent identifiers are obtained.
4. The method of any one of claims 1 to 3, wherein the first and second independent identifiers are based on the same number of distinct detectable labels present in the first and second plurality of SGCs, respectively.
5. The method of any one of claims 1 to 3, wherein the first and second independent identifiers are based on a different number of distinct detectable labels present in the first and second plurality of SGCs, respectively.
6. The method of any one of claims 1 to 3, wherein the independent identifier is based on the number of distinct detectable labels present in a combination of at least two SGCs.
7. The method of any one of claims 1 to 6, wherein the spatially distributed label probes comprise two distinct detectable labels.
8. The method of any one of claims 1 to 6, wherein the spatially distributed label probes comprise three distinct detectable labels.
9. The method of any one of claims 1 to 6, wherein the spatially distributed label probes comprise four distinct detectable labels.
10. The method of any one of claims 1 to 6, wherein the spatially distributed label probes comprise five distinct detectable labels.
11. The method of any one of claims 1 to 10, wherein obtaining an independent identifier comprises obtaining an image of the sample and detecting signals generated by the detectable labels.
12. The method of any one of claims 1 to 11, wherein the method further comprises: removing the signals generated by the detectable labels of the second plurality of
SGCs from the sample; contacting the sample with at least a third plurality of SGCs capable of hybridizing to at least one target probe set, wherein each SGC comprises a plurality of spatially distributed label probes comprising detectable labels; and obtaining a third independent identifier corresponding to at least one target nucleic acid, wherein the third independent identifier is based on the number of distinct detectable labels present in each SGC of the third plurality of SGCs.
13. The method of any one of claims 1 to 12, wherein each target probe set comprises two or more target probes capable of hybridizing to a target nucleic acid.
14. The method of any one of claims 1 to 13, wherein each target probe in the target probe set comprises a T section complementary to a region of a target nucleic acid and an L section complementary to a region of an SGC.
15. The method of claim 14, wherein each T section is complementary to a nonoverlapping region of a target nucleic acid and each L section is complementary to a nonoverlapping region of an SGC.
16. The method of claim 14 or claim 15, wherein the T section of at least one of the target probes in the target probe set is 3 ’ of its L section.
17. The method of claim 14 or claim 15, wherein the T section of at least one of the target probes in the target probe set is 5 ’ of its L section.
18. The method of any one of claims 1 to 17, wherein each SGC comprises one or more of an amplifier, a pre-amplifier, and a pre-pre-amplifier.
19. The method of claim 18, wherein each label probe in the plurality of label probes comprises a binding site for the amplifier.
20. The method of claim 19, wherein the amplifier comprises at least one binding site for the pre-amplifier and a plurality of binding sites for the plurality of label probes.
21. The method of claim 19, wherein the pre-amplifier comprises at least one binding site for the L section of the target probe and at least one binding site for the amplifier.
22. The method of claim 19, wherein the pre-amplifier comprises at least one binding site for the pre-pre-amplifier and at least one binding side for the amplifier.
23. The method of claim 21 or claim 22, wherein the at least one binding site for the L section of the target probe in the pre-amplifier or the pre-pre-amplifier is distinct for each SGC corresponding to a target nucleic acid.
24. The method of claim 21 or claim 22, wherein the at least one binding site for the L section of the target probe in the pre-amplifier or the pre-pre-amplifier is the same for each SGC corresponding to a target nucleic acid.
25. The method of any one of claims 1 to 24, wherein removing the signals generated by the detectable labels comprises removing the SGCs.
26. The method of claim 25, wherein removing the SGCs comprises treatment with an acid reagent that disrupts hybridization between the SGCs and the target nucleic acids.
27. The method of claim 26, wherein the acid reagent comprises formic acid, acetic acid, propionic acid, butyric acid, valeric acid, oxalic acid, malonic acid, succinic acid, malic acid, tartaric acid, or citric acid.
28. The method of claim 25, wherein after the first plurality of SGCs are removed, the second plurality of SGCs are hybridized to the same target probe set.
29. The method of claim 25, wherein after the first plurality of SGCs are removed, the second plurality of SGCs are hybridized to a different target probe set.
30. The method of any one of claims 1 to 13, wherein removing the signals generated by the detectable labels comprises removing the detectable labels from the plurality of label probes using a cleavage reagent.
31. The method of claim 30, wherein after the detectable labels are removed, the second plurality of SGCs are hybridized to a different target probe set.
32. The method of any one of claims 1 to 31 , wherein, after the step of removing signals generated by the detectable labels from the sample, the method further comprises a step of contacting the sample with a second plurality of target probe sets, wherein each target probe set in the second plurality of target probe sets is complementary to a target nucleic acid.
33. The method of any one of claims 1 to 32, wherein the sample comprises a cell.
34. The method of claim 33, wherein the method further comprises fixing and/or permeabilizing the cell.
35. The method of any of claims 1 to 34, wherein the target nucleic acid is RNA.
36. The method of any of claims 1 to 35, further comprising detecting at least one non- nucleic acid target in the sample.
37. The method of claim 36, wherein the non-nucleic acid target is a protein.
38. A method for detecting a plurality of target nucleic acids in a sample, the method comprising: contacting a sample comprising a plurality of target nucleic acids with a plurality of target probe sets, wherein each target probe set is complementary to a target nucleic acid; contacting the sample with a plurality of amplification complexes capable of hybridizing to at least one target probe set, wherein each amplification complex comprises a plurality of spatially distributed binding sites for label probes; contacting the sample with a first plurality of label probes comprising detectable labels, which hybridize to binding sites on the amplification complexes to generate a first plurality of signal generating complexes (SGCs); obtaining a first independent identifier corresponding to at least one target nucleic acid, wherein the first independent identifier is based on the number of distinct detectable labels present in each SGC of the first plurality of SGCs; removing signals generated by the detectable labels from the sample; contacting the sample with at least a second plurality of label probes comprising detectable labels, which hybridize to binding sites on the amplification complexes to generate a second plurality of SGCs; and obtaining a second independent identifier corresponding to at least one target nucleic acid, wherein the second independent identifier is based on the number of distinct detectable labels present in each SGC of the second plurality of SGCs.
39. The method of claim 38, wherein the method further comprises identifying a target nucleic acid of the plurality of target nucleic acids based on a combination of the first and second independent identifiers.
40. The method of claim 39, wherein the target nucleic acid is identified independent of the order by which the first and second independent identifiers are obtained.
41. The method of any one of claims 38 to 40, wherein the first and second independent identifiers are based on the same number of distinct detectable labels present in the first and second plurality of SGCs, respectively.
42. The method of any one of claims 38 to 40, wherein the first and second independent identifiers are based on a different number of distinct detectable labels present in the first and second plurality of SGCs, respectively.
43. The method of any one of claims 38 to 40, wherein the independent identifier is based on the number of distinct detectable labels present in a combination of at least two SGCs.
44. The method of any one of claims 38 to 43, wherein the spatially distributed label probes comprise two distinct detectable labels.
48. The method of any one of claims 38 to 43, wherein the spatially distributed label probes comprise three distinct detectable labels.
46. The method of any one of claims 38 to 43, wherein the spatially distributed label probes comprise four distinct detectable labels.
47. The method of any one of claims 38 to 43, wherein the spatially distributed label probes comprise five distinct detectable labels.
48. The method of any one of claims 38 to 47, wherein obtaining an independent identifier comprises obtaining an image of the sample and detecting signals generated by the detectable labels.
49. The method of any one of claims 38 to 48, wherein the method further comprises: removing the signals generated by the detectable labels of the second plurality of
SGCs from the sample;
contacting the sample with at least a third plurality of label probes comprising detectable labels, which hybridize to binding sites on the amplification complexes to generate a third plurality of SGCs; and obtaining a third independent identifier corresponding to at least one target nucleic acid, wherein the third independent identifier is based on the number of distinct detectable labels present in each SGC of the third plurality of SGCs.
50. The method of any one of claims 38 to 49, wherein each target probe set comprises two or more target probes capable of hybridizing to a target nucleic acid.
51. The method of any one of claims 38 to 50, wherein each target probe in the target probe set comprises a T section complementary to a region of a target nucleic acid and an L section complementary to a region of an amplification complex.
52. The method of claim 51 , wherein each T section is complementary to a nonoverlapping region of a target nucleic acid and each L section is complementary to a nonoverlapping region of an amplification complex.
53. The method of claim 51 or claim 52, wherein the T section of at least one of the target probes in the target probe set is 3 ’ of its L section.
54. The method of claim 51 or claim 52, wherein the T section of at least one of the target probes in the target probe set is 5 ’ of its L section.
55. The method of any one of claims 38 to 54, wherein each amplification complex comprises one or more of an amplifier, a pre-amplifier, and a pre-pre-amplifier.
56. The method of claim 55, wherein each label probe in the plurality of label probes comprises a binding site for the amplifier.
57. The method of claim 56, wherein the amplifier comprises at least one binding site for the pre-amplifier and a plurality of binding sites for the plurality of label probes.
58. The method of claim 56, wherein the pre-amplifier comprises at least one binding site for the L section of the target probe and at least one binding site for the amplifier.
59. The method of claim 56, wherein the pre-amplifier comprises at least one binding site for the pre-pre-amplifier and at least one binding side for the amplifier.
60. The method of claim 58 or claim 59, wherein the at least one binding site for the L section of the target probe in the pre-amplifier or the pre-pre-amplifier is distinct for each amplification complex corresponding to a target nucleic acid.
61. The method of claim 58 or claim 59, wherein the at least one binding site for the L section of the target probe in the pre-amplifier or the pre-pre-amplifier is the same for each amplification complex corresponding to a target nucleic acid.
62. The method of any one of claims 38 to 61 , wherein removing the signals generated by the detectable labels comprises removing the detectable labels from the plurality of label probes using a cleavage reagent.
63. The method of claim 62, wherein after the detectable labels from the first plurality of SGCs are removed, the second plurality of label probes are hybridized to different amplification complexes to generate the second plurality of SGCs.
64. The method of any one of claims 38 to 63, wherein, after the step of removing signals generated by the detectable labels from the first plurality of label probes from the sample, the method further comprises a step of contacting the sample with a second plurality of target probe sets, wherein each target probe set in the second plurality of target probe sets is complementary to a target nucleic acid.
65. The method of any one of claims 38 to 64, wherein the sample comprises a cell.
66. The method of claim 65, wherein the method further comprises fixing and/or permeabilizing the cell.
67. The method of any of claims 38 to 66, wherein the target nucleic acid is RNA.
68. The method of any of claims 38 to 67, further comprising detecting at least one non- nucleic acid target in the sample.
69. The method of claim 68, wherein the non-nucleic acid target is a protein.
70. A kit for carrying out the method of any one of claims 1-69.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202263386425P | 2022-12-07 | 2022-12-07 | |
US63/386,425 | 2022-12-07 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2024124041A1 true WO2024124041A1 (en) | 2024-06-13 |
Family
ID=89663104
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2023/082958 WO2024124041A1 (en) | 2022-12-07 | 2023-12-07 | Multiplexed detection of nucleic acid targets |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2024124041A1 (en) |
Citations (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3817837A (en) | 1971-05-14 | 1974-06-18 | Syva Corp | Enzyme amplification assay |
US3850752A (en) | 1970-11-10 | 1974-11-26 | Akzona Inc | Process for the demonstration and determination of low molecular compounds and of proteins capable of binding these compounds specifically |
US3939350A (en) | 1974-04-29 | 1976-02-17 | Board Of Trustees Of The Leland Stanford Junior University | Fluorescent immunoassay employing total reflection for activation |
US3996345A (en) | 1974-08-12 | 1976-12-07 | Syva Company | Fluorescence quenching with immunological pairs in immunoassays |
US4275149A (en) | 1978-11-24 | 1981-06-23 | Syva Company | Macromolecular environment control in specific receptor assays |
US4277437A (en) | 1978-04-05 | 1981-07-07 | Syva Company | Kit for carrying out chemically induced fluorescence immunoassay |
US4366241A (en) | 1980-08-07 | 1982-12-28 | Syva Company | Concentrating zone method in heterogeneous immunoassays |
US5124246A (en) | 1987-10-15 | 1992-06-23 | Chiron Corporation | Nucleic acid multimers and amplified nucleic acid hybridization assays using same |
US5635352A (en) | 1993-12-08 | 1997-06-03 | Chiron Corporation | Solution phase nucleic acid sandwich assays having reduced background noise |
US5710264A (en) | 1990-07-27 | 1998-01-20 | Chiron Corporation | Large comb type branched polynucleotides |
US20080038725A1 (en) | 2005-06-20 | 2008-02-14 | Yuling Luo | Methods of detecting nucleic acids in individual cells and of identifying rare cells from large heterogeneous cell populations |
US7399845B2 (en) | 2006-01-27 | 2008-07-15 | Isis Pharmaceuticals, Inc. | 6-modified bicyclic nucleic acid analogs |
US20090081688A1 (en) | 2005-06-20 | 2009-03-26 | Advanced Cell Diagnostics | Methods of detecting nucleic acids in individual cells and of identifying rare cells from large heterogeneous cell populations |
US20120100540A1 (en) | 2010-10-21 | 2012-04-26 | Advanced Cell Diagnostics, Inc. | Ultra sensitive method for in situ detection of nucleic acids |
US20170101672A1 (en) | 2015-10-12 | 2017-04-13 | Advanced Cell Diagnostics, Inc. | In situ detection of nucleotide variants in high noise samples, and compositions and methods related thereto |
WO2020014036A1 (en) * | 2018-07-09 | 2020-01-16 | Ultivue, Inc. | Methods for multicolor multiplex imaging |
WO2020092796A1 (en) * | 2018-11-01 | 2020-05-07 | Advanced Cell Diagnostics, Inc. | Methods for digital multiplexing of nucleic acids in situ |
WO2020168162A1 (en) * | 2019-02-15 | 2020-08-20 | Bio-Techne Corporation | Methods for multiplex detection of nucleic acids by in situ hybridization |
WO2021102237A1 (en) * | 2019-11-20 | 2021-05-27 | Advanced Cell Diagnostics, Inc. | Methods for sequential detection of nucleic acids |
-
2023
- 2023-12-07 WO PCT/US2023/082958 patent/WO2024124041A1/en unknown
Patent Citations (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3850752A (en) | 1970-11-10 | 1974-11-26 | Akzona Inc | Process for the demonstration and determination of low molecular compounds and of proteins capable of binding these compounds specifically |
US3817837A (en) | 1971-05-14 | 1974-06-18 | Syva Corp | Enzyme amplification assay |
US3939350A (en) | 1974-04-29 | 1976-02-17 | Board Of Trustees Of The Leland Stanford Junior University | Fluorescent immunoassay employing total reflection for activation |
US3996345A (en) | 1974-08-12 | 1976-12-07 | Syva Company | Fluorescence quenching with immunological pairs in immunoassays |
US4277437A (en) | 1978-04-05 | 1981-07-07 | Syva Company | Kit for carrying out chemically induced fluorescence immunoassay |
US4275149A (en) | 1978-11-24 | 1981-06-23 | Syva Company | Macromolecular environment control in specific receptor assays |
US4366241A (en) | 1980-08-07 | 1982-12-28 | Syva Company | Concentrating zone method in heterogeneous immunoassays |
US4366241B1 (en) | 1980-08-07 | 1988-10-18 | ||
US5124246A (en) | 1987-10-15 | 1992-06-23 | Chiron Corporation | Nucleic acid multimers and amplified nucleic acid hybridization assays using same |
US5849481A (en) | 1990-07-27 | 1998-12-15 | Chiron Corporation | Nucleic acid hybridization assays employing large comb-type branched polynucleotides |
US5710264A (en) | 1990-07-27 | 1998-01-20 | Chiron Corporation | Large comb type branched polynucleotides |
US5635352A (en) | 1993-12-08 | 1997-06-03 | Chiron Corporation | Solution phase nucleic acid sandwich assays having reduced background noise |
US5681697A (en) | 1993-12-08 | 1997-10-28 | Chiron Corporation | Solution phase nucleic acid sandwich assays having reduced background noise and kits therefor |
EP2500439A1 (en) | 2005-06-20 | 2012-09-19 | Advanced Cell Diagnostics, Inc. | Methods of detecting nucleic acids in individual cells and of identifying rare cells from large heterogeneous cell populations |
US20080038725A1 (en) | 2005-06-20 | 2008-02-14 | Yuling Luo | Methods of detecting nucleic acids in individual cells and of identifying rare cells from large heterogeneous cell populations |
US20090081688A1 (en) | 2005-06-20 | 2009-03-26 | Advanced Cell Diagnostics | Methods of detecting nucleic acids in individual cells and of identifying rare cells from large heterogeneous cell populations |
US7709198B2 (en) | 2005-06-20 | 2010-05-04 | Advanced Cell Diagnostics, Inc. | Multiplex detection of nucleic acids |
US7399845B2 (en) | 2006-01-27 | 2008-07-15 | Isis Pharmaceuticals, Inc. | 6-modified bicyclic nucleic acid analogs |
US20120100540A1 (en) | 2010-10-21 | 2012-04-26 | Advanced Cell Diagnostics, Inc. | Ultra sensitive method for in situ detection of nucleic acids |
US8658361B2 (en) | 2010-10-21 | 2014-02-25 | Advanced Cell Diagnostics, Inc. | Ultra sensitive method for in situ detection of nucleic acids |
US20170101672A1 (en) | 2015-10-12 | 2017-04-13 | Advanced Cell Diagnostics, Inc. | In situ detection of nucleotide variants in high noise samples, and compositions and methods related thereto |
WO2017066211A1 (en) | 2015-10-12 | 2017-04-20 | Advanced Cell Diagnostics, Inc. | In situ detection of nucleotide variants in high noise samples, and compositions and methods related thereto |
US11078528B2 (en) | 2015-10-12 | 2021-08-03 | Advanced Cell Diagnostics, Inc. | In situ detection of nucleotide variants in high noise samples, and compositions and methods related thereto |
WO2020014036A1 (en) * | 2018-07-09 | 2020-01-16 | Ultivue, Inc. | Methods for multicolor multiplex imaging |
WO2020092796A1 (en) * | 2018-11-01 | 2020-05-07 | Advanced Cell Diagnostics, Inc. | Methods for digital multiplexing of nucleic acids in situ |
WO2020168162A1 (en) * | 2019-02-15 | 2020-08-20 | Bio-Techne Corporation | Methods for multiplex detection of nucleic acids by in situ hybridization |
WO2021102237A1 (en) * | 2019-11-20 | 2021-05-27 | Advanced Cell Diagnostics, Inc. | Methods for sequential detection of nucleic acids |
Non-Patent Citations (31)
Title |
---|
"Cervical Cytology Practice Guidelines TOC", 10 November 2000, AMERICAN SOCIETY OF CYTOPATHOLOGY |
"Colposcopy and Treatment of Cervical Intraepithelial Neoplasia: A Beginner's Manual", 2003, INTERNATIONAL AGENCY FOR RESEARCH ON CANCER |
"Non-Gynecological Cytology Practice Guideline", 2 March 2004, AMERICAN SOCIETY OF CYTOPATHOLOGY |
AUSUBEL ET AL.: "Current Protocols in Molecular Biology", 1999, JOHN WILEY AND SONS |
BIGGS ET AL., APPLIED OPTICS, vol. 36, no. 8, 1997 |
BROWNTRIMBLE, BEST PRACT. RES. CLIN. OBSTET. GYNAECOL., vol. 26, 2012, pages 233 - 242 |
DANIEL ET AL., BIOTECHNIQUES, vol. 24, no. 3, 1998, pages 484 - 489 |
DEY: "Basic and Advanced Laboratory Techniques in Histopathology and Cytology", 2018, SPRINGER, article "Cytology Sample Procurement, Fixation and Processing", pages: 121 - 132 |
GONZALEZ ET AL.: "In situ hybridization. A practical approach", 1992, ADDISON-WESLEY PUBLISHING COMPANY, INC. |
HANISCH ET AL.: "Deconvolutions of Hubble Space Telescope Images and Spectra, Deconvolution of Images and Spectra", 1997, ACADEMIC PRESS |
HERMANSON: "Cecil Textbook of Medicine", 1996, ACADEMIC PRESS |
HICKS ET AL., J. MOL. HISTOL., vol. 35, 2004, pages 595 - 601 |
HU ET AL., BIOMARK. RES., vol. 2, no. 1, 2014, pages 1 - 13 |
JOHNSONSPENCE: "Molecular Probes Handbook, a Guide to Fluorescent Probes and Labeling Technologies", 2010, LIFE TECHNOLOGIES |
KALAFCOOPER, J. CLIN. PATHOL., vol. 60, 2007, pages 449 - 455 |
MAJLESSI ET AL., NUCL. ACIDS RES., vol. 26, 1998, pages 2224 - 2229 |
MANAFI ET AL., MICROBIOL. REV., vol. 55, 1991, pages 335 - 348 |
MOFFITT ET AL., PROC. NATL. ACAD. SCI. USA, vol. 113, 2016, pages 11046 - 11051 |
PETERSENWENGEL, TRENDS BIOTECHNOL., vol. 21, 2003, pages 74 - 81 |
RATAN ET AL., CUREUS, vol. 9, no. 6, 2017, pages e1325 |
SAMBROOK ET AL.: "Molecular Cloning: A Laboratory Manual", 2001, COLD SPRING HARBOR LABORATORY |
SANTALUCIA, PROC. NATL. ACAD. SCI. U.S.A., vol. 95, 1998, pages 1460 - 1465 |
SCHULZ ET AL., CELL SYST., vol. 6, no. 1, 2018, pages 25 - 36 |
SCHWARZACHERHESLOP-HARRISON: "Practical in situ hybridization", 2000, BIOS SCIENTIFIC PUBLISHERS LTD |
SHAPIRO: "Practical Flow Cytometry", 1995, WILEY-LISS |
SIDORENKO ET AL., FEBS LETT., vol. 582, no. 3, 2008, pages 410 - 404 |
STOLER, CLINICS IN LABORATORY MEDICINE, vol. 10, no. 1, 1990, pages 215 - 236 |
WANG ET AL., ENVIRON. HEALTH TOXICOL., vol. 29, 2014, pages e2014007 |
WAXMAN ET AL., OBSTET. GYNECOL., vol. 120, 2012, pages 1465 - 1471 |
WEIER ET AL., EXPERT REV. MOL. DIAGN., vol. 2, no. 2, 2002, pages 109 - 119 |
YANG ET AL., NUCL. ACIDS RES., vol. 34, no. 21, 2006, pages 6095 - 6101 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20230212658A1 (en) | Multiplex labeling of molecules by sequential hybridization barcoding | |
Nagendran et al. | Automated cell-type classification in intact tissues by single-cell molecular profiling | |
EP3775273B1 (en) | Methods to further enhance signal amplification for the in situ detection of nucleic acids | |
US20220154262A1 (en) | Methods for multiplex detection of nucleic acids by in situ hybridization | |
US20230002815A1 (en) | Methods for sequential detection of nucleic acids | |
US20150045251A1 (en) | Duplex chromogenic assay for in situ detection of nucleic acids | |
US20240035073A1 (en) | Methods for detecting proteins and for co-detecting proteins and nucleic acids, and kits for the same | |
US20220119870A1 (en) | Methods for digital multiplexing of nucleic acids in situ | |
WO2024124041A1 (en) | Multiplexed detection of nucleic acid targets | |
US20230175048A1 (en) | Methods for simultaneously detecting target nucleic acids and proteins and a kit thereof | |
US20240209423A1 (en) | Methods and compositions for reducing autofluorescence | |
US20220177953A1 (en) | Detection of double stranded nucleic acids in situ and methods related thereto | |
US20230151415A1 (en) | Methods for detecting target nucleic acids by in situ hybridization and a kit thereof |