US20170321286A1 - Detection of lung neoplasia by amplification of rna sequences - Google Patents

Detection of lung neoplasia by amplification of rna sequences Download PDF

Info

Publication number
US20170321286A1
US20170321286A1 US15/587,806 US201715587806A US2017321286A1 US 20170321286 A1 US20170321286 A1 US 20170321286A1 US 201715587806 A US201715587806 A US 201715587806A US 2017321286 A1 US2017321286 A1 US 2017321286A1
Authority
US
United States
Prior art keywords
rna
marker
sample
oligonucleotide
nucleic acid
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US15/587,806
Inventor
Hatim Allawi
Graham P. Lidgard
Chateen Krueger
Michael W. Kaiser
Tamara J. Sander
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Exact Sciences Corp
Original Assignee
Exact Sciences Development Co LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Exact Sciences Development Co LLC filed Critical Exact Sciences Development Co LLC
Priority to US15/587,806 priority Critical patent/US20170321286A1/en
Assigned to EXACT SCIENCES CORPORATION reassignment EXACT SCIENCES CORPORATION ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: ALLAWI, HATIM, KAISER, MICHAEL W., KRUEGER, CHATEEN, LIDGARD, GRAHAM P., SANDER, Tamara J.
Assigned to EXACT SCIENCES DEVELOPMENT COMPANY, LLC reassignment EXACT SCIENCES DEVELOPMENT COMPANY, LLC ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: EXACT SCIENCES CORPORATION
Publication of US20170321286A1 publication Critical patent/US20170321286A1/en
Priority to US17/078,924 priority patent/US11479823B2/en
Assigned to EXACT SCIENCES CORPORATION reassignment EXACT SCIENCES CORPORATION MERGER (SEE DOCUMENT FOR DETAILS). Assignors: EXACT SCIENCES DEVELOPMENT COMPANY, LLC
Priority to US17/932,574 priority patent/US20230121671A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • C12Q1/6886Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/118Prognosis of disease development
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/158Expression markers

Definitions

  • neoplasia Provided herein is technology relating to detecting neoplasia and particularly, but not exclusively, to methods, compositions, and related uses for detecting neoplasms such as lung cancer.
  • Lung cancer is the most frequent cause of death among men and women younger than 85 years in the US. It accounts for 27% of all cancer deaths and 221,000 lost lives annually. This mortality rate exceeds that of the next 4 highest ranking cancers combined.
  • Gene expression profiling has confirmed unique mRNA expression in cancers and can be used as an approach for detection of lung malignancies. An mRNA multi-marker approach to detect all subtypes of lung cancer needs to be explored. This study assesses the value of measuring expression levels of multiple mRNA markers in detecting lung cancer of different subtypes.
  • This technology is in the field of nucleic acid detection and quantification. Specifically, the technology addresses the detection and quantification of RNA in samples using single-tube RT-PCR-Invasive cleavage reaction (RT-QuARTS).
  • RT-QuARTS RT-PCR-Invasive cleavage reaction
  • the technology provides methods of screening for a lung neoplasm in a sample obtained from a subject, the methods comprising, e.g., a) assaying a sample from a subject for an amount of at least one RNA marker selected from the group consisting of GAGE12D, FAM83A, LRG1, XAGE-1 d, MAGEA4, SFTPB, AKAP4, and CYP24A1; b) assaying the sample for an amount of a reference marker in the sample; c) comparing the amount of the at least one RNA marker to the amount of the reference marker to determine a level of expression for the at least one marker gene in the sample; and d) generating a record reporting the expression for the at least one marker gene in said sample.
  • the method comprises obtaining a sample comprising RNA from a subject and treating the RNA with a reverse transcriptase, preferably MMLV reverse transcriptase, to form a cDNA copy of at least a portion of the RNA.
  • a reverse transcriptase preferably MMLV reverse transcriptase
  • the cDNA is created and detected in a single vessel, without opening the vessel, e.g., to add additional reagents.
  • the at least one RNA marker is at least two markers.
  • the at least one RNA marker comprises the group consisting of GAGE, FAM83A, LRG1 and MAGEA4 markers, while in some embodiments, the at least one RNA marker comprises the group consisting of GAGE, FAM83A, LRG1, CYP24A1, XAGE1D and MAGEA4 markers.
  • the reference marker is an RNA, preferably an RNA selected from the group consisting of CASC3 mRNA, ⁇ -actin mRNA, U1 snRNA and U6 snRNA.
  • the technology comprises assaying RNA using one or more of a polymerase chain reaction, nucleic acid sequencing, mass spectrometry, mass-based separation, or target capture.
  • the assaying comprises using a flap endonuclease assay, such as a QuARTS assay, as described hereinbelow.
  • assaying the expression of the RNA marker comprises detecting increased or decreased expression of the RNA marker relative to a normal expression of the marker.
  • samples suitable for analysis using the technology are not limited to a particular sample type.
  • the sample is a tissue sample, a blood sample, a serum sample, or a sputum sample.
  • tissue sample comprises lung tissue.
  • kits e.g., for practicing the technology.
  • the technology provides a kit comprising:
  • oligonucleotide a) at least one oligonucleotide, wherein at least a portion of said oligonucleotide specifically hybridizes to a marker RNA selected from the group consisting of GAGE12D, FAM83A, LRG1, XAGE-1 d, MAGEA4, SFTPB, AKAP4, and CYP24A1, and
  • the kit comprises at least two additional oligonucleotides.
  • the kit further comprises one or more components selected from the group consisting of reverse transcriptase, flap endonuclease, DNA polymerase, and a FRET cassette.
  • the at least one RNA marker is selected from the group consisting of GAGE, FAM83A, LRG1, CYP24A1, XAGE1D and MAGEA4, and in some embodiments the RNA marker is selected from the group consisting of GAGE, FAM83A, LRG1 and MAGEA4.
  • the kit comprises at least 4 oligonucleotides, wherein each of the markers in the group consisting of GAGE, FAM83A, LRG1, and MAGEA4 specifically hybridizes to at least one of said 4 oligonucleotides.
  • the kit comprises at least 6 oligonucleotides, wherein each of the markers in the group consisting of GAGE, FAM83A, LRG1, CYP24A1, XAGE1D and MAGEA4 specifically hybridizes to at least one of said 6 oligonucleotides.
  • at least one oligonucleotide is selected from one or more of a capture oligonucleotide, a pair of nucleic acid primers, a nucleic acid probe, and an invasive oligonucleotide.
  • the technology is not limited to which particular reference marker RNA is used, and many are known in the field.
  • the reference marker is an RNA selected from the group consisting of CASC3, ⁇ -actin, U1 and U6 RNA
  • the technology further comprises compositions such as mixtures, e.g., reaction mixtures.
  • the technology provides a mixture comprising a complex of at least one RNA marker selected from the group consisting of GAGE12D, FAM83A, LRG1, XAGE-1 d, MAGEA4, SFTPB, AKAP4, and CYP24A1 and an oligonucleotide that specifically hybridizes to the RNA marker.
  • the composition further comprises a complex of at least one reference marker and an oligonucleotide that specifically hybridizes to the reference RNA marker.
  • the at least one RNA marker is selected from the group consisting of GAGE, FAM83A, LRG1, CYP24A1, XAGE1D and MAGEA4, while in some embodiments the least one RNA marker is selected from the group consisting of GAGE, FAM83A, LRG1 and MAGEA4.
  • the composition comprises a reference marker that is an RNA selected from the group consisting of CASC3, ⁇ -actin, U1 RNA and U6 RNA.
  • the oligonucleotide is selected from one or more of a capture oligonucleotide, a pair of nucleic acid primers, a nucleic acid probe, and an invasive oligonucleotide.
  • the composition comprises a nucleic acid probe oligonucleotide comprising a reporter molecule, e.g., a fluorophore, and/or a flap sequence.
  • the composition further comprises one or more components selected from the group consisting of reverse transcriptase, (e.g., MMLV reverse transcriptase), flap endonuclease, thermostable DNA polymerase, and a FRET cassette.
  • the DNA polymerase is a bacterial DNA polymerase.
  • the term “or” is an inclusive “or” operator and is equivalent to the term “and/or” unless the context clearly dictates otherwise.
  • the term “based on” is not exclusive and allows for being based on additional factors not described, unless the context clearly dictates otherwise.
  • the meaning of “a”, “an”, and “the” include plural references.
  • the meaning of “in” includes “in” and “on.”
  • composition “consisting essentially of” as used in claims in the present application limits the scope of a claim to the specified materials or steps “and those that do not materially affect the basic and novel characteristic(s)” of the claimed invention, as discussed in In re Herz, 537 F.2d 549, 551-52, 190 USPQ 461, 463 (CCPA 1976).
  • a composition “consisting essentially of” recited elements may contain an unrecited contaminant at a level such that, though present, the contaminant does not alter the function of the recited composition as compared to a pure composition, i.e., a composition “consisting of” the recited components.
  • the “sensitivity” of a given marker refers to the percentage of samples that report a marker value (e.g., an expression marker) above a threshold value that distinguishes between neoplastic and non-neoplastic samples.
  • a positive is defined as a histology-confirmed neoplasia that reports a marker value above a threshold value (e.g., the range associated with disease), and a false negative is defined as a histology-confirmed neoplasia that reports a marker value below the threshold value (e.g., the range associated with no disease).
  • the value of sensitivity therefore, reflects the probability that a measurement for a given marker obtained from a known diseased sample will be in the range of disease-associated measurements.
  • the clinical relevance of the calculated sensitivity value represents an estimation of the probability that a given marker would detect the presence of a clinical condition when applied to a subject with that condition.
  • the “specificity” of a given marker refers to the percentage of non-neoplastic samples that report a marker value (e.g., an expression marker) below a threshold value that distinguishes between neoplastic and non-neoplastic samples.
  • a negative is defined as a histology-confirmed non-neoplastic sample that reports a marker value below the threshold value (e.g., the range associated with no disease) and a false positive is defined as a histology-confirmed non-neoplastic sample that reports a marker value above the threshold value (e.g., the range associated with disease).
  • the value of specificity therefore, reflects the probability that a marker measurement for a given marker obtained from a known non-neoplastic sample will be in the range of non-disease associated measurements.
  • the clinical relevance of the calculated specificity value represents an estimation of the probability that a given marker would detect the absence of a clinical condition when applied to a patient without that condition.
  • primer refers to an oligonucleotide, whether occurring naturally as, e.g., a nucleic acid fragment from a restriction digest, or produced synthetically, that is capable of acting as a point of initiation of synthesis when placed under conditions in which synthesis of a primer extension product that is complementary to a nucleic acid template strand is induced, (e.g., in the presence of nucleotides and an inducing agent such as a DNA polymerase, and at a suitable temperature and pH).
  • the primer is preferably single stranded for maximum efficiency in amplification, but may alternatively be double stranded. If double stranded, the primer is first treated to separate its strands before being used to prepare extension products.
  • the primer is an oligodeoxyribonucleotide.
  • the primer must be sufficiently long to prime the synthesis of extension products in the presence of the inducing agent. The exact lengths of the primers will depend on many factors, including temperature, source of primer, and the use of the method.
  • probe refers to an oligonucleotide (e.g., a sequence of nucleotides), whether occurring naturally as in a purified restriction digest or produced synthetically, recombinantly, or by PCR amplification, that is capable of hybridizing to another oligonucleotide of interest.
  • a probe may be single-stranded or double-stranded. Probes are useful in the detection, identification, and isolation of particular gene sequences (e.g., a “capture probe”).
  • any probe used in the present invention may, in some embodiments, be labeled with any “reporter molecule,” so that is detectable in any detection system, including, but not limited to enzyme (e.g., ELISA, as well as enzyme-based histochemical assays), fluorescent, radioactive, and luminescent systems. It is not intended that the present invention be limited to any particular detection system or label.
  • target refers to a nucleic acid sought to be sorted out from other nucleic acids, e.g., by probe binding, amplification, isolation, capture, etc.
  • target refers to the region of nucleic acid bounded by the primers used for polymerase chain reaction
  • a target comprises the site at which a probe and invasive oligonucleotides (e.g., INVADER oligonucleotide) bind to form an invasive cleavage structure, such that the presence of the target nucleic acid can be detected.
  • a “segment” is defined as a region of nucleic acid within the target sequence.
  • marker refers to a substance (e.g., a nucleic acid, or a region of a nucleic acid, or a protein) that may be used to distinguish non-normal cells (e.g., cancer cells) from normal cells, e.g., based on presence, absence, or status (e.g., post-transcriptional processing) of the marker substance.
  • a substance e.g., a nucleic acid, or a region of a nucleic acid, or a protein
  • non-normal cells e.g., cancer cells
  • status e.g., post-transcriptional processing
  • sample is used in its broadest sense. In one sense it can refer to an animal cell or tissue. In another sense, it refers to a specimen or culture obtained from any source, as well as biological and environmental samples. Biological samples may be obtained from plants or animals (including humans) and encompass fluids, solids, tissues, and gases. Environmental samples include environmental material such as surface matter, soil, water, and industrial samples. These examples are not to be construed as limiting the sample types applicable to the present invention.
  • neoplasm refers to any new and abnormal growth of tissue.
  • a neoplasm can be a premalignant neoplasm or a malignant neoplasm.
  • nucleic acid-specific marker refers to any biological material or element that can be used to indicate the presence of a neoplasm.
  • biological materials include, without limitation, nucleic acids (DNA, RNA, miRNA, etc.), polypeptides, carbohydrates, fatty acids, cellular components (e.g., cell membranes and mitochondria), and whole cells.
  • markers are particular nucleic acid regions (e.g., genes, intragenic regions, specific loci, etc.). Regions of nucleic acid that are markers may be referred to, e.g., as “marker genes,” “marker regions,” “marker sequences,” “marker loci,” etc.
  • the terms “patient” or “subject” refer to organisms to be subject to various tests provided by the technology.
  • the term “subject” includes animals, preferably mammals, including humans.
  • the subject is a primate.
  • the subject is a human.
  • a preferred subject is a vertebrate subject.
  • a preferred vertebrate is warm-blooded; a preferred warm-blooded vertebrate is a mammal.
  • a preferred mammal is most preferably a human.
  • the term “subject’ includes both human and animal subjects. Thus, veterinary therapeutic uses are provided herein.
  • the present technology provides for the diagnosis of mammals such as humans, as well as those mammals of importance due to being endangered, such as Siberian tigers; of economic importance, such as animals raised on farms for consumption by humans; and/or animals of social importance to humans, such as animals kept as pets or in zoos.
  • animals include but are not limited to: carnivores such as cats and dogs; swine, including pigs, hogs, and wild boars; ruminants and/or ungulates such as cattle, oxen, sheep, giraffes, deer, goats, bison, and camels; pinnipeds; and horses.
  • the presently-disclosed subject matter further includes a system for diagnosing a lung cancer in a subject.
  • the system can be provided, for example, as a commercial kit that can be used to screen for a risk of lung cancer or diagnose a lung cancer in a subject from whom a biological sample has been collected.
  • An exemplary system provided in accordance with the present technology includes assessing the expression of a marker described herein.
  • amplifying or “amplification” in the context of nucleic acids refers to the production of multiple copies of a polynucleotide, or a portion of the polynucleotide, typically starting from a small amount of the polynucleotide (e.g., a single polynucleotide molecule), where the amplification products or amplicons are generally detectable.
  • Amplification of polynucleotides encompasses a variety of chemical and enzymatic processes. The generation of multiple DNA copies from one or a few copies of a target or template DNA molecule during a polymerase chain reaction (PCR) or a ligase chain reaction (LCR; see, e.g., U.S. Pat. No.
  • miniprimer PCR multiplex ligation-dependent probe amplification (see, e.g., Schouten, et al., (2002) Nucleic Acids Research 30(12): e57; herein incorporated by reference in its entirety), multiplex PCR (see, e.g., Chamberlain, et al., (1988) Nucleic Acids Research 16(23) 11141-11156; Ballabio, et al., (1990) Human Genetics 84(6) 571-573; Hayden, et al., (2008) BMC Genetics 9:80; each of which are herein incorporated by reference in their entireties), nested PCR, overlap-extension PCR (see, e.g., Higuchi, et al., (1988) Nucleic Acids Research 16(15) 7351-7367; herein incorporated by reference in its entirety), real time PCR (see, e.g.,
  • PCR polymerase chain reaction
  • the mixture is denatured and the primers then annealed to their complementary sequences within the target molecule.
  • the primers are extended with a polymerase so as to form a new pair of complementary strands.
  • the steps of denaturation, primer annealing, and polymerase extension can be repeated many times (i.e., denaturation, annealing and extension constitute one “cycle”; there can be numerous “cycles”) to obtain a high concentration of an amplified segment of the desired target sequence.
  • the length of the amplified segment of the desired target sequence is determined by the relative positions of the primers with respect to each other, and therefore, this length is a controllable parameter.
  • PCR polymerase chain reaction
  • nucleic acid detection assay refers to any method of determining the nucleotide composition of a nucleic acid of interest.
  • Nucleic acid detection assay include but are not limited to, DNA sequencing methods, probe hybridization methods, structure specific cleavage assays (e.g., the INVADER assay, (Hologic, Inc.) and are described, e.g., in U.S. Pat. Nos. 5,846,717, 5,985,557, 5,994,069, 6,001,567, 6,090,543, and 6,872,816, WO 2006/050499; Lyamichev et al., Nat.
  • target nucleic acid is amplified (e.g., by PCR) and amplified nucleic acid is detected simultaneously using an invasive cleavage assay.
  • Assays configured for performing a detection assay e.g., invasive cleavage assay
  • an amplification assay are described in U.S. Pat. No. 9,096,893, incorporated herein by reference in its entirety for all purposes.
  • Additional amplification plus invasive cleavage detection configurations, termed the QuARTS method are described in U.S. Pat. Nos. 8,361,720; 8,715,937; 8,916,344; and 9,212,392, incorporated herein by reference in their entireties for all purposes.
  • invasive cleavage structure refers to a cleavage structure comprising i) a target nucleic acid, ii) an upstream nucleic acid (e.g., an invasive or “INVADER” oligonucleotide), and iii) a downstream nucleic acid (e.g., a probe), where the upstream and downstream nucleic acids anneal to contiguous regions of the target nucleic acid, and where an overlap forms between the a 3′ portion of the upstream nucleic acid and duplex formed between the downstream nucleic acid and the target nucleic acid.
  • an upstream nucleic acid e.g., an invasive or “INVADER” oligonucleotide
  • a downstream nucleic acid e.g., a probe
  • an overlap occurs where one or more bases from the upstream and downstream nucleic acids occupy the same position with respect to a target nucleic acid base, whether or not the overlapping base(s) of the upstream nucleic acid are complementary with the target nucleic acid, and whether or not those bases are natural bases or non-natural bases.
  • the 3′ portion of the upstream nucleic acid that overlaps with the downstream duplex is a non-base chemical moiety such as an aromatic ring structure, e.g., as disclosed, for example, in U.S. Pat. No. 6,090,543, incorporated herein by reference in its entirety.
  • one or more of the nucleic acids may be attached to each other, e.g., through a covalent linkage such as nucleic acid stem-loop, or through a non-nucleic acid chemical linkage (e.g., a multi-carbon chain).
  • a covalent linkage such as nucleic acid stem-loop
  • a non-nucleic acid chemical linkage e.g., a multi-carbon chain.
  • the term “flap endonuclease assay” includes “INVADER” invasive cleavage assays and QuARTS assays, as described above.
  • the term “flap oligonucleotide” refers to an oligonucleotide cleavable in a detection assay, such as an invasive cleavage assay, by a flap endonuclease.
  • a flap oligonucleotide forms an invasive cleavage structure with other nucleic acids, e.g.
  • PCR-invasive cleavage assay refers to an assay in which target nucleic acid is amplified and amplified nucleic acid is detected simultaneously using a signal-amplifying invasive cleavage assay employing a FRET cassette, and in which the assay reagents comprise a mixture containing DNA polymerase, FEN-1 endonuclease, a primary probe comprising a portion complementary to a target nucleic acid, and a hairpin FRET cassette.
  • PCR-invasive cleavage assays include the QuARTS assays described in U.S. Pat. Nos.
  • PCR-invasive cleavage assay reagents refers to one or more reagents for detecting target sequences in a PCR-invasive cleavage assay, the reagents comprising nucleic acid molecules capable of participating in amplification of a target nucleic acid and in formation of an invasive cleavage structure in the presence of the target sequence, in a mixture containing DNA polymerase, FEN-1 endonuclease and a FRET cassette, and optionally a reverse transcriptase.
  • FRET cassette refers to a hairpin oligonucleotide that contains a fluorophore moiety and a nearby quencher moiety that quenches the fluorophore.
  • Hybridization of a cleaved flap e.g., from cleavage of a target-specific probe in a PCR-invasive cleavage assay
  • a FRET cassette produces a secondary substrate for the flap endonuclease, e.g., a FEN-1 enzyme. Once this substrate is formed, the 5′ fluorophore-containing base is cleaved from the cassette, thereby generating a fluorescence signal.
  • a FRET cassette comprises an unpaired 3′ portion to which a cleavage product, e.g., a portion of a cleaved flap oligonucleotide, can hybridize to from an invasive cleavage structure cleavable by a FEN-1 endonuclease.
  • a cleavage product e.g., a portion of a cleaved flap oligonucleotide
  • a nucleic acid “hairpin” as used herein refers to a region of a single-stranded nucleic acid that contains a duplex (i.e., base-paired) stem and a loop, formed when the nucleic acid comprises two portions that are sufficiently complementary to each other to form a plurality of consecutive base pairs.
  • FRET refers to fluorescence resonance energy transfer, a process in which moieties (e.g., fluorophores) transfer energy e.g., among themselves, or, from a fluorophore to a non-fluorophore (e.g., a quencher molecule).
  • FRET involves an excited donor fluorophore transferring energy to a lower-energy acceptor fluorophore via a short-range (e.g., about 10 nm or less) dipole-dipole interaction.
  • FRET involves a loss of fluorescence energy from a donor and an increase in fluorescence in an acceptor fluorophore.
  • FRET energy can be exchanged from an excited donor flurophore to a non-fluorescing molecule (e.g., a quenching molecule).
  • a non-fluorescing molecule e.g., a quenching molecule.
  • FRET is known to those of skill in the art and has been described (See, e.g., Stryer et al., 1978, Ann. Rev. Biochem., 47:819; Selvin, 1995, Methods Enzymol., 246:300; Orpana, 2004 Biomol Eng 21, 45-50; Olivier, 2005 Mutant Res 573, 103-110, each of which is incorporated herein by reference in its entirety).
  • FEN-1 in reference to an enzyme refers to a non-polymerase flap endonuclease from a eukaryote or archaeal organism. See, e.g., WO 02/070755, and Kaiser M. W., et al. (1999) J. Biol. Chem., 274:21387, which are incorporated by reference herein in their entireties for all purposes.
  • FEN-1 activity refers to any enzymatic activity of a FEN-1 enzyme, including but not limited to flap endonuclease (FEN), nick exonuclease (EXO), and gap endonuclease (GEN) activities (see, e.g., Shen, et al., BioEssays Volume 27, Issue 7, Pages 717-729, incorporated herein by reference).
  • FEN flap endonuclease
  • EXO nick exonuclease
  • GEN gap endonuclease
  • primer annealing refers to conditions that permit oligonucleotide primers to hybridize to template nucleic acid strands. Conditions for primer annealing vary with the length and sequence of the primer and are generally based upon the T m that is determined or calculated for the primer. For example, an annealing step in an amplification method that involves thermocycling involves reducing the temperature after a heat denaturation step to a temperature based on the T m of the primer sequence, for a time sufficient to permit such annealing.
  • amplifiable nucleic acid is used in reference to nucleic acids that may be amplified by any amplification method. It is contemplated that “amplifiable nucleic acid” will usually comprise “sample template.”
  • sample template refers to nucleic acid originating from a sample that is analyzed for the presence of “target.”
  • background template is used in reference to nucleic acid other than sample template that may or may not be present in a sample. The presence of background template is most often inadvertent. It may be the result of carryover, or it may be due to the presence of nucleic acid contaminants sought to be purified away from the sample. For example, nucleic acids from organisms other than those to be detected may be present as background in a test sample.
  • a sample “suspected of containing” a nucleic acid may contain or not contain the target nucleic acid molecule.
  • real time refers to the detection or measurement of the accumulation of products or signal in the reaction while the reaction is in progress, e.g., during incubation or thermal cycling. Such detection or measurement may occur continuously, or it may occur at a plurality of discrete points during the progress of the amplification reaction, or it may be a combination. For example, in a polymerase chain reaction, detection (e.g., of fluorescence) may occur continuously during all or part of thermal cycling, or it may occur transiently, at one or more points during one or more cycles.
  • detection e.g., of fluorescence
  • real time detection of PCR is accomplished by determining a level of fluorescence at the same point (e.g., a time point in the cycle, or temperature step in the cycle) in each of a plurality of cycles, or in every cycle.
  • Real time detection of amplification may also be referred to as detection “during” the amplification reaction.
  • reverse transcription and “reverse transcribe” refer to the use of a template-dependent polymerase to produce a DNA strand complementary to an RNA template.
  • a polymerase capable of producing a DNA strand complementary to an RNA template is generally referred to as a “reverse transcriptase” or as a polymerase that has “reverse transcriptase activity”.
  • the term “abundance of nucleic acid” refers to the amount of a particular target nucleic acid sequence present in a sample or aliquot. The amount is generally referred to in terms of mass (e.g., ⁇ g), mass per unit of volume (e.g., ⁇ g/ ⁇ 1); copy number (e.g., 1000 copies, 1 attomole), or copy number per unit of volume (e.g., 1000 copies per ml, 1 attomole per ⁇ 1).
  • Abundance of a nucleic acid can also be expressed as an amount relative to the amount of a standard of known concentration or copy number.
  • Measurement of abundance of a nucleic acid may be on any basis understood by those of skill in the art as being a suitable quantitative representation of nucleic acid abundance, including physical density or the sample, optical density, refractive property, staining properties, or on the basis of the intensity of a detectable label, e.g. a fluorescent label.
  • amplicon or “amplified product” refers to a segment of nucleic acid, generally DNA, generated by an amplification process such as the PCR process.
  • RNA segments produced by amplification methods that employ RNA polymerases, such as NASBA, TMA, etc.
  • amplification plot as used in reference to a thermal cycling amplification reaction refers to the plot of signal that is indicative of amplification, e.g., fluorescence signal, versus cycle number.
  • an amplification plot When used in reference to a non-thermal cycling amplification method, an amplification plot generally refers to a plot of the accumulation of signal as a function of time.
  • baseline refers to the detected signal coming from assembled amplification reactions at prior to incubation or, in the case of PCR, in the initial cycles, in which there is little change in signal.
  • no template control and “no target control” as used herein in reference to a control reaction refers to a reaction or sample that does not contain template or target nucleic acid. It is used to verify amplification quality.
  • the term “quantitative amplification data set” refers to the data obtained during quantitative amplification of the target sample, e.g., target DNA.
  • the quantitative amplification data set is a collection of fluorescence values obtained at during amplification, e.g., during a plurality of, or all of the thermal cycles.
  • Data for quantitative amplification is not limited to data collected at any particular point in a reaction, and fluorescence may be measured at a discrete point in each cycle or continuously throughout each cycle.
  • Ct and Cp as used herein in reference to real-time detection during an amplification reaction that is thermal cycled refers to the cycle at which signal (e.g., fluorescent signal) crosses a predetermined threshold value indicative of positive signal.
  • signal e.g., fluorescent signal
  • Various methods have been used to calculate the threshold that is used as a determinant of signal verses concentration, and the value is generally expressed as either the “crossing threshold” (Ct) or the “crossing point” (Cp).
  • Ct crossing threshold
  • Cp crossing point
  • Either Cp values or Ct values may be used in embodiments of the methods presented herein for analysis of real-time signal for the determination of the amount of RNA marker(s) or reference markers in an assay or sample.
  • kits refers to any delivery system for delivering materials.
  • reaction assays such delivery systems include systems that allow for the storage, transport, or delivery of reaction reagents (e.g., oligonucleotides, enzymes, etc. in the appropriate containers) and/or supporting materials (e.g., buffers, written instructions for performing the assay etc.) from one location to another.
  • kits include one or more enclosures (e.g., boxes) containing the relevant reaction reagents and/or supporting materials.
  • fragmentous kit refers to delivery systems comprising two or more separate containers that each contain a subportion of the total kit components. The containers may be delivered to the intended recipient together or separately.
  • a first container may contain an enzyme for use in an assay
  • a second container contains oligonucleotides.
  • system refers to a collection of articles for use for a particular purpose.
  • the articles comprise instructions for use, as information supplied on e.g., an article, on paper, or on recordable media (e.g., DVD, CD, flash drive, etc.).
  • instructions direct a user to an online location, e.g., a website.
  • the term “information” refers to any collection of facts or data. In reference to information stored or processed using a computer system(s), including but not limited to internets, the term refers to any data stored in any format (e.g., analog, digital, optical, etc.).
  • the term “information related to a subject” refers to facts or data pertaining to a subject (e.g., a human, plant, or animal).
  • the term “genomic information” refers to information pertaining to a genome including, but not limited to, nucleic acid sequences, genes, percentage methylation, allele frequencies, RNA expression levels, protein expression, phenotypes correlating to genotypes, etc.
  • Allele frequency information refers to facts or data pertaining to allele frequencies, including, but not limited to, allele identities, statistical correlations between the presence of an allele and a characteristic of a subject (e.g., a human subject), the presence or absence of an allele in an individual or population, the percentage likelihood of an allele being present in an individual having one or more particular characteristics, etc.
  • FIG. 1 shows a schematic diagram of a combined reverse transcription-QuARTS flap endonuclease detection assay for real-time detection of RNA.
  • Use of multiple different probe flap/FRET cassette dye combinations allows multiple different target nucleic acids to be detected together in multiplex reactions.
  • FIG. 2 compares the effects of different amounts of reverse transcriptase and different reverse transcription conditions on the detection of known amounts of target RNA in RT-QuARTS assays.
  • FIG. 3 shows graphs showing standard curves measuring marker LRG1 RNA.
  • Panel A describes the dilution series, the average Cp value at each dilution, and the calculated strands/reaction calculated from the amplification plots shown in panel B.
  • Panel C shows a graph comparing the Cp compared to the log of the amount of RNA present in the sample.
  • FIG. 4 compares the signals measured for markers FAM83A, XAGE1D, CYP24A1, GAGE12D, LRG1, and MAGEA4 in cancer and normal tissue samples, as described below.
  • FIG. 5 shows graphs comparing the sensitivity and specificity when samples are analyzed using the combinations of four or six expression markers, as listed above each panel.
  • RNA expression markers for use in assays for detection and quantification of RNA.
  • the technology relates to use of RNA-based gene expression assays to detect lung cancer.
  • compositions and methods relate to characterizing the expression from marker genes by characterizing RNA molecules (“RNA markers”) in a sample, wherein the RNA presence, absence, or status (e.g., with respect to post-transcription modifications or processing) is indicative of neoplasia. Accordingly, provided here are compositions and method directed toward analysis of RNA markers that correlate with lung neoplasia.
  • the technology provides assays wherein RNA markers are reverse transcribed, amplified, and detected in real time in a single reaction mixture, and in a single vessel.
  • compositions and kits for practicing the methods.
  • reagents e.g., primers, probes
  • sets e.g., sets of primers pairs for amplifying a plurality of RNA markers.
  • Additional reagents for conducting a detection assay may also be provided (e.g., enzymes, buffers, positive and negative controls for conducting QuARTS assays, RT-QuARTS assays, PCR, sequencing, or other assays).
  • the kits containing one or more reagent necessary, sufficient, or useful for conducting a method are provided.
  • reactions mixtures containing the reagents.
  • master mix reagent sets containing a plurality of reagents that may be added to each other and/or to a test sample to complete a reaction mixture.
  • the technology relates to the analysis of any sample associated with lung cancer.
  • the sample comprises a tissue and/or biological fluid obtained from a patient.
  • the sample comprises a secretion.
  • the sample comprises sputum, blood, serum, plasma, lung tissue samples, or lung cells.
  • the subject is human.
  • Such samples can be obtained by any number of means known in the art, such as will be apparent to the skilled person.
  • mRNA markers (GAGE12D, FAM83A, LRG1, XAGE-1 d, MAGEA4, SFTPB, AKAP4, and CYP24A1) were selected based on discrimination reported in the literature. As described below, samples from 246 patients (119 controls, 127 lung cancer cases) were tested. The lung cancer cases comprised adena (65), squamous (34), large cell (13), small cell (4) and others carcinomas (11). The controls were from patients having benign lung nodules (37), normal lung (60), chronic obstructive pulmonary disorder (COPD) (10), and normal lung adjacent to tumor (12). Cases and controls included smokers and non-smokers.
  • RNA expression levels were assayed in a single-tube reverse transcription QuARTS (Quantitative Allele-Specific Real-time Target and Signal amplification) as described herein below, a reaction configuration that simultaneously measures copy numbers of two mRNA markers and a housekeeping reference mRNA (CASC3).
  • a single-tube reverse transcription QuARTS Quantitative Allele-Specific Real-time Target and Signal amplification
  • CASC3 housekeeping reference mRNA
  • Receiver operator characteristic (ROC) curve analyses resulted in an area under the curve (AUC) of 0.976.
  • AUC area under the curve
  • RNA expression assays e.g., qRT-PCR, digital PCR, gene expression arrays, etc.
  • a modified version of a quantitative real-time target and signal amplification (QuARTS) assay is used to evaluate gene expression.
  • QuARTS quantitative real-time target and signal amplification
  • three reactions occur during each QuARTS assay, including amplification (reaction 1) and target probe cleavage (reaction 2) in the primary reaction; and FRET cleavage and fluorescent signal generation (reaction 3) in the secondary reaction. After the first few cycles generate initial amounts of cleaved probe, these reactions occur essentially concurrently.
  • a reverse transcription step is included to produce cDNA for QuARTS flap assay detection.
  • a specific detection probe with a flap sequence loosely binds to the amplicon.
  • the presence of the specific invasive oligonucleotide at the target binding site causes a 5′ nuclease, e.g., a FEN-1 endonuclease, to release the flap sequence by cutting between the detection probe and the flap sequence.
  • the flap sequence is complementary to a non-hairpin portion of a corresponding FRET cassette. Accordingly, the flap sequence functions as an invasive oligonucleotide on the FRET cassette and effects a cleavage between the FRET cassette fluorophore and a quencher, which produces a fluorescent signal.
  • the cleavage reaction can cut multiple probes per target and thus release multiple fluorophore per flap, providing exponential signal amplification.
  • a QuARTS flap endonuclease assay can detect multiple targets in a single reaction vessel, e.g., by using FRET cassettes with different dyes.
  • RNA isolation methods may comprise one or more of organic extraction, ultrafiltration, hybrid capture, etc.
  • cells or lysed samples containing RNA may be added directly to assay reactions without purification.
  • the sample comprises blood, serum, plasma, or saliva.
  • the subject is human.
  • Such samples can be obtained by any number of means known in the art, such as will be apparent to the skilled person.
  • Cell free or substantially cell free samples can be obtained by subjecting the sample to various techniques known to those of skill in the art which include, but are not limited to, centrifugation and filtration. Although it is generally preferred that no invasive techniques are used to obtain the sample, it still may be preferable to obtain samples such as tissue homogenates, tissue sections, and biopsy specimens.
  • the technology is not limited in the methods used to prepare the samples and provide a nucleic acid for testing.
  • a RNA is isolated from blood or from a plasma sample using a hybrid capture method, e.g., using target-specific binding materials (e.g., oligonucleotides) on solid supports.
  • markers can be carried out separately or simultaneously with additional markers within one test sample. For example, several markers can be combined into one test for efficient processing of multiple samples and for potentially providing greater diagnostic and/or prognostic accuracy.
  • markers can be combined into one test for efficient processing of multiple samples and for potentially providing greater diagnostic and/or prognostic accuracy.
  • one skilled in the art would recognize the value of testing multiple samples (for example, at successive time points) from the same subject.
  • Such testing of serial samples can allow the identification of changes in marker expression over time. Changes in expression, as well as the absence of change in expression, can provide useful information about the disease status that includes, but is not limited to, identifying the approximate time from onset of the event, the presence and amount of salvageable tissue, the appropriateness of drug therapies, the effectiveness of various therapies, and identification of the subject's outcome, including risk of future events.
  • biomarkers can be carried out in a variety of physical formats.
  • the use of microtiter plates or automation can be used to facilitate the processing of large numbers of test samples.
  • single sample formats could be developed to facilitate immediate treatment and diagnosis in a timely fashion, for example, in ambulatory transport or emergency room settings.
  • kits comprise embodiments of the compositions, devices, apparatuses, etc. described herein, and instructions for use of the kit.
  • Such instructions describe appropriate methods for preparing an analyte from a sample, e.g., for collecting a sample and preparing a nucleic acid from the sample.
  • Individual components of the kit are packaged in appropriate containers and packaging (e.g., vials, boxes, blister packs, ampules, jars, bottles, tubes, and the like) and the components are packaged together in an appropriate container (e.g., a box or boxes) for convenient storage, shipping, and/or use by the user of the kit.
  • liquid components may be provided in a lyophilized form to be reconstituted by the user.
  • Kits may include a control or reference for assessing, validating, and/or assuring the performance of the kit.
  • a kit for assaying the amount of a nucleic acid present in a sample may include a control comprising a known concentration of the same or another nucleic acid for comparison and, in some embodiments, a detection reagent (e.g., a primer) specific for the control nucleic acid.
  • the kits are appropriate for use in a clinical setting and, in some embodiments, for use in a user's home.
  • the components of a kit in some embodiments, provide the functionalities of a system for preparing a nucleic acid solution from a sample. In some embodiments, certain components of the system are provided by the user.
  • diagnostic assays identify the presence of a disease or condition in an individual.
  • the disease is cancer (e.g., lung cancer).
  • markers whose aberrant expression is associated with a lung cancer e.g., one or more markers selected GAGE12D, FAM83A, LRG1, XAGE-1 d, MAGEA4, SFTPB, AKAP4, and CYP24A1 are used.
  • an assay further comprises detection of a reference nucleic acid (e.g., CASC3 or ⁇ -actin mRNAs; U1 and U6 snRNAs, etc.).
  • the technology finds application in treating a patient (e.g., a patient with lung cancer, with early stage lung cancer, or who may develop lung cancer), the method comprising determining the expression levels of one or more markers as provided herein and administering a treatment to the patient based on the results of determining the expression levels.
  • the treatment may be administration of a pharmaceutical compound, a vaccine, performing a surgery, imaging the patient, performing another test.
  • said use is in a method of clinical screening, a method of prognosis assessment, a method of monitoring the results of therapy, a method to identify patients most likely to respond to a particular therapeutic treatment, a method of imaging a patient or subject, and a method for drug screening and development.
  • the technology finds application in methods for diagnosing lung cancer in a subject.
  • diagnosis and “diagnosis” as used herein refer to methods by which the skilled artisan can estimate and even determine whether or not a subject is suffering from a given disease or condition or may develop a given disease or condition in the future.
  • the skilled artisan often makes a diagnosis on the basis of one or more diagnostic indicators, such as for example a biomarker, the expression of which is indicative of the presence, severity, or absence of the condition.
  • clinical cancer prognosis relates to determining the aggressiveness of the cancer and the likelihood of tumor recurrence to plan the most effective therapy. If a more accurate prognosis can be made or even a potential risk for developing the cancer can be assessed, appropriate therapy, and in some instances less severe therapy for the patient can be chosen.
  • Assessment e.g., analyzing expression
  • cancer biomarkers is useful to separate subjects with good prognosis and/or low risk of developing cancer who will need no therapy or limited therapy from those more likely to develop cancer or suffer a recurrence of cancer who might benefit from more intensive treatments.
  • “making a diagnosis” or “diagnosing”, as used herein, is further inclusive of making determining a risk of developing cancer or determining a prognosis, which can provide for predicting a clinical outcome (with or without medical treatment), selecting an appropriate treatment (or whether treatment would be effective), or monitoring a current treatment and potentially changing the treatment, based on the measure of the diagnostic biomarkers disclosed herein.
  • multiple determinations of the biomarkers over time can be made to facilitate diagnosis and/or prognosis.
  • a temporal change in the biomarker can be used to predict a clinical outcome, monitor the progression of lung cancer, and/or monitor the efficacy of appropriate therapies directed against the cancer.
  • the technology further finds application in methods for determining whether to initiate or continue prophylaxis or treatment of a cancer in a subject.
  • the method comprises providing a series of biological samples over a time period from the subject; analyzing the series of biological samples to determine expression of at least one biomarker disclosed herein in each of the biological samples; and comparing any measurable change in the expression of one or more of the biomarkers in each of the biological samples. Any changes in the expression of biomarkers over the time period can be used to predict risk of developing cancer, predict clinical outcome, determine whether to initiate or continue the prophylaxis or therapy of the cancer, and whether a current therapy is effectively treating the cancer.
  • a first time point can be selected prior to initiation of a treatment and a second time point can be selected at some time after initiation of the treatment.
  • Expression can be measured in each of the samples taken from different time points and qualitative and/or quantitative differences noted.
  • a change in the expression of the biomarkers from the different samples can be correlated with risk for developing lung, prognosis, determining treatment efficacy, and/or progression of the cancer in the subject.
  • the methods and compositions of the invention are for treatment or diagnosis of disease at an early stage, for example, before symptoms of the disease appear. In some embodiments, the methods and compositions of the invention are for treatment or diagnosis of disease at a clinical stage.
  • a diagnostic marker can be determined at an initial time, and again at a second time.
  • an increase in the marker from the initial time to the second time can be diagnostic of a particular type or severity of cancer, or a given prognosis.
  • a decrease in the marker from the initial time to the second time can be indicative of a particular type or severity of cancer, or a given prognosis.
  • the degree of change of one or more markers can be related to the severity of the cancer and future adverse events.
  • comparative measurements can be made of the same biomarker at multiple time points, one can also measure a given biomarker at one time point, and a second biomarker at a second time point, and a comparison of these markers can provide diagnostic information.
  • the phrase “determining the prognosis” refers to methods by which the skilled artisan can predict the course or outcome of a condition in a subject.
  • the term “prognosis” does not refer to the ability to predict the course or outcome of a condition with 100% accuracy, or even that a given course or outcome is predictably more or less likely to occur based on the expression of a biomarker.
  • the term “prognosis” refers to an increased probability that a certain course or outcome will occur; that is, that a course or outcome is more likely to occur in a subject exhibiting a given condition, when compared to those individuals not exhibiting the condition. For example, in individuals not exhibiting the condition, the chance of a given outcome (e.g., suffering from lung cancer) may be very low.
  • a statistical analysis associates a prognostic indicator with a predisposition to an adverse outcome. For example, in some embodiments, an expression level different from that in a normal control sample obtained from a patient who does not have a cancer can signal that a subject is more likely to suffer from a cancer than subjects with a level that is more similar to the expression level in the control sample, as determined by a level of statistical significance. Additionally, a change in expression level from a baseline (e.g., “normal”) level can be reflective of subject prognosis, and the degree of change in expression can be related to the severity of adverse events. Statistical significance is often determined by comparing two or more populations and determining a confidence interval and/or a p value.
  • Exemplary confidence intervals of the present subject matter are 90%, 95%, 97.5%, 98%, 99%, 99.5%, 99.9% and 99.99%, while exemplary p values are 0.1, 0.05, 0.025, 0.02, 0.01, 0.005, 0.001, and 0.0001.
  • a threshold degree of change in expression of a prognostic or diagnostic biomarker disclosed herein can be established, and the degree of change in the expression of the biomarker in a biological sample is simply compared to the threshold degree of change in the expression.
  • a preferred threshold change in the expression level for biomarkers provided herein is about 5%, about 10%, about 15%, about 20%, about 25%, about 30%, about 50%, about 75%, about 100%, and about 150%.
  • a “nomogram” can be established, by which expression of a prognostic or diagnostic indicator (biomarker or combination of biomarkers) is directly related to an associated disposition towards a given outcome. The skilled artisan is acquainted with the use of such nomograms to relate two numeric values with the understanding that the uncertainty in this measurement is the same as the uncertainty in the marker concentration because individual sample measurements are referenced, not population averages.
  • a control sample is analyzed concurrently with the biological sample, such that the results obtained from the biological sample can be compared to the results obtained from the control sample.
  • standard curves can be provided, with which assay results for the biological sample may be compared. Such standard curves present expression levels of a biomarker as a function of assay units, e.g., fluorescent signal intensity, if a fluorescent label is used. Using samples taken from multiple donors, standard curves can be provided for control expression of the one or more biomarkers in normal tissue, as well as for “at-risk” levels of the one or more biomarkers in tissue taken from donors with lung cancer.
  • markers can be carried out separately or simultaneously with additional markers within one test sample. For example, several markers can be combined into one test for efficient processing of a multiple of samples and for potentially providing greater diagnostic and/or prognostic accuracy.
  • one skilled in the art would recognize the value of testing multiple samples (for example, at successive time points) from the same subject.
  • Such testing of serial samples can allow the identification of changes in marker expression over time. Changes in expression, as well as the absence of change in expression, can provide useful information about the disease status that includes, but is not limited to, identifying the approximate time from onset of the event, the presence and amount of salvageable tissue, the appropriateness of drug therapies, the effectiveness of various therapies, and identification of the subject's outcome, including risk of future events.
  • biomarkers can be carried out in a variety of physical formats.
  • the use of microtiter plates or automation can be used to facilitate the processing of large numbers of test samples.
  • single sample formats could be developed to facilitate immediate treatment and diagnosis in a timely fashion, for example, in ambulatory transport or emergency room settings.
  • the subject is diagnosed as having lung cancer if, when compared to a control expression, there is a measurable difference in the expression of at least one biomarker in the sample.
  • the subject can be identified as not having lung cancer, not being at risk for the cancer, or as having a low risk of the cancer.
  • subjects having lung cancer or risk thereof can be differentiated from subjects having low to substantially no cancer or risk thereof.
  • those subjects having a risk of developing lung cancer can be placed on a more intensive and/or regular screening schedule.
  • those subjects having low to substantially no risk may avoid being subjected to screening procedures, until such time as a future screening, for example, a screening conducted in accordance with the present technology, indicates that a risk of lung cancer has appeared in those subjects.
  • detecting a change in expression of the one or more biomarkers can be a qualitative determination or it can be a quantitative determination.
  • the step of diagnosing a subject as having, or at risk of developing, lung cancer indicates that certain threshold measurements are made, e.g., the expression of the one or more biomarkers in the biological sample varies from a predetermined control expression.
  • the control expression is any detectable expression of the biomarker.
  • the predetermined expression is the expression in the control sample.
  • the predetermined expression is based upon and/or identified by a standard curve.
  • the predetermined expression is a specifically state or range of state.
  • the predetermined expression can be chosen, within acceptable limits that will be apparent to those skilled in the art, based in part on the embodiment of the method being practiced and the desired specificity, etc.
  • circulating epithelial cells representing metastatic tumor cells
  • Cristofanilli M et al. (2004) N Engl J Med 351:781-791
  • Hayes D F et al. (2006) Clin Cancer Res 12:4218-4224
  • Budd G T et al., (2006) Clin Cancer Res 12:6403-6409
  • Moreno J G et al.
  • compositions and methods for detecting the presence of metastatic cancer in a subject by identifying the presence of expression markers in plasma or whole blood.
  • Tissue samples were obtained from various commercial and non-commercial sources (Asuragen, BioServe, ConversantBio, Cureline, Mayo Clinic, M D Anderson, and PrecisionMed). Tissue sections were examined by a pathologist, who circled histologically distinct lesions to direct careful micro-dissection. Total nucleic acid extraction was performed using the Promega Maxwell RSC system. FFPE slides were scraped and extracted using the Maxwell® RSC DNA FFPE Kit (#AS1450) using the manufacturer's procedure but skipping the RNase digestion step. The same procedure was used for FFPE bulk curls.
  • Gene expression markers tested comprised AKAP4GAGE12D, FAM83A, SFTPB (Pro-Surfactant B), XAGE-1D, CYP24A1, LRG1, and MAGEA4, and one reference gene expression were tested on lung cancer tissue samples. Expression of CASC3 was used as a reference marker.
  • a QuARTS flap endonuclease assay reaction was modified to add a reverse transcription step.
  • the assay probes were designed to span exon junctions so that the RT-QuARTS assay would specifically detect mRNA targets rather than the corresponding genomic loci.
  • the technique combines a reverse transcription step to convert the RNA target into a cDNA strand, a polymerase based target amplification and a simultaneous invasive cleavage signal amplification reaction ( FIG. 1 ).
  • the format results in a real-time accumulation of fluorescent signal in proportion to the amount of target mRNA. It produces a similar output to quantitative RT-PCR, but with the added sensitivity and specificity resulting from the addition of the invasive cleavage reaction.
  • RT-QuARTS reactions comprising different amounts of Molony Murine Leukemia Virus (MMLV) reverse transcriptase and different dilutions of RNA were conducted using a reverse transcription reactions for 10 to 45 minutes.
  • FIG. 2 provides a table comparing the results of the different reaction conditions.
  • MMLV Molony Murine Leukemia Virus
  • Each triplex RT-QuARTS assay as describe below consists of one mRNA target reporting to FAM, one to HEX, and the reference mRNA to Quasar 670.
  • Standard curves for each assay were generated by serially diluting known quantities of in vitro-produced transcripts for each marker. Standard curves were created by plotting Cp value by Log input strands. The resulting slope and intercept values were used to convert the Cp values of the unknown samples to mRNA strand values.
  • Oligonucleotide sequences for the assays are shown in Table 1.
  • In vitro transcripts for each target were made from templates containing the DNA sequence amplified in the QuARTS reaction with additional flanking 5′ and 3′ sequences coupled to a T7 promoter. In vitro transcription was done using the T7 Ribomax system (Promega) and the resulting transcripts were quantitated with the Quant-iT RNA assay kit (Thermo Fisher Scientific).
  • an exemplary QuARTS reaction typically comprises approximately 400-600 nmol/1 (e.g., 500 nmol/1) of each primer and detection probe, approximately 100 nmol/1 of the invasive oligonucleotide, approximately 600-700 nmol/1 of each FRET cassette (FAM, e.g., as supplied commercially by Hologic, Inc.; HEX, e.g., as supplied commercially by BioSearch Technologies; and Quasar 670, e.g., as supplied commercially by BioSearch Technologies), 6.675 ng/ ⁇ l FEN-1 endonuclease (e.g., Cleavase® 2.0, Hologic, Inc.), 1 unit Taq DNA polymerase in a 30 ⁇ l reaction volume (e.g., GoTaq® DNA polymerase, Promega Corp., Madison, Wis.), 10 mmol/l 3-(n-morpholino) propanesulfonic acid (MOPS), 7.5 mmol/l MgCl 2
  • FAM
  • RT-QuARTS reactions contained 20 U of MMLV reverse transcriptase (MMLV-RT), 219 ng of Cleavase® 2.0, 1.5 U of GoTaq® DNA Polymerase, 200 nM of each primer, 500 nM each of probe and FRET oligonucleotides, 10 mM MOPS buffer, pH7.5, 7.5 mM MgCl 2 , and 250 ⁇ M each dNTP. Reactions were run on a Roche LightCycler 480 system under the following conditions: 42° C. for 30 minutes (RT reaction), 95° C. for 3 min, 10 cycles of 95° C. for 20 seconds, 63° C. for 30 sec, 70° C. for 30 sec, followed by 35 cycles of 95° C. for 20 sec, 53° C. for 1 min, 70° C. for 30 sec, and hold at 40° C. for 30 sec.
  • RT-QuARTS assays may comprise a step of multiplex pre-amplification, e.g., to pre-amplify 10, 12, or more targets in as sample.
  • Multiplex pre-amplification for QuARTS assays is described, e.g., in U.S. Patent Appln. Ser. No. 62/249,097, filed Oct. 30, 2015, and 62/332,295, filed May 5, 2016, each of which is incorporated herein by reference.
  • An RT-pre-amplification is conducted in a reaction mixture containing, e.g., 20 U of MMLV reverse transcriptase, 1.5 U of GoTaq® DNA Polymerase, 10 mM MOPS buffer, pH7.5, 7.5 mM MgCl 2 , 250 ⁇ M each dNTP, and oligonucleotide primers, (e.g., for 12 targets, 12 primer pairs/24 primers, in equimolar amounts (e.g., 200 nM each primer), or with individual primer concentrations adjusted to balance amplification efficiencies of the different targets).
  • Thermal cycling times and temperatures are selected to be appropriate for the volume of the reaction and the amplification vessel.
  • the reactions may be cycled as follows:
  • aliquots of the pre-amplification reaction e.g., 10 ⁇ L
  • aliquots of the pre-amplification reaction are diluted to 500 ⁇ L in 10 mM Tris, 0.1 mM EDTA, with or without fish DNA.
  • Aliquots of the diluted pre-amplified DNA e.g., 10 ⁇ L
  • QuARTS PCR-flap assay e.g., as described above.
  • DNA targets e.g., methylated DNA marker genes, genes corresponding to the RNA marker, etc.
  • DNA targets may be amplified and detected along with the reverse-transcribed cDNAs in a QuARTS assay reaction.
  • DNA and cDNA are co-amplified and detected in a single-tube reaction, i.e., without the need to open the reaction vessel at any point between combining the reagents and collecting the output data.
  • marker DNA from the same sample or from a different sample may be separately isolated, with or without a bisulfite conversion step, and may be combined with sample RNA in an RT-QuARTS assay.
  • RNA and/or DNA samples may be pre-amplified as described above.
  • amplification primers and probes used for reverse transcription, amplification, and the flap endonuclease reactions that occur in the RT-QuARTS assay as described herein are shown in Table 1, below:
  • FIG. 3 shows exemplary standard curves for LRG-1 RNA at dilutions A-E, i.e., 10 to 100,000 copies per reaction of input RNA, in the RT-QuARTS assay as described above.
  • the average number of RNA strands present as calculated from the fluorescence signal during amplification are shown under “Calc. Strands/Rxn” on the right half of panel A.
  • the graph in panel C shows the fluorescence signal accumulation by cycle number for the reactions having the different indicated amounts of input RNA.
  • Strand values for individual markers from the samples were determined by using the standard curves for each marker, as discussed above for the LRG-1 RNA.
  • the strand numbers were divided by the CASC3 reference marker strand numbers determined in the same assay well to normalize for varying input RNA amounts. The resulting ratio was multiplied by 100 to generate the “% MARKER” value for each mRNA as shown in FIG. 4 .
  • Receiver operating characteristic (ROC) curves were generated for different groupings of markers using JMP 11.0 software (SAS).
  • the positive percent agreement (diagnostic sensitivity) was calculated by dividing the detected positives by the known lung cancer samples and multiplying by 100, and the negative percent agreement (diagnostic specificity) by dividing the detected negatives by the known normal controls and multiplying by 100.
  • FIG. 4 shows the signal measured from individual marker RNAs from cancer and normal samples.
  • FIG. 5 shows the aggregate sensitivity and specificity for samples analyzed using the indicated combinations of mRNA markers.
  • Target mRNA sequences (showing T nucleotides in place of U nucleotides) are as follows:
  • AKAP4 (SEQ ID NO: 28) >gi

Abstract

Provided herein is technology for lung neoplasia screening and particularly, but not exclusively, to methods, compositions, and related uses for detecting the presence of lung cancer.

Description

    CROSS REFERENCE TO RELATED APPLICATIONS
  • The present invention claims priority benefit of U.S. Provisional Patent Application No. 62/332,419, filed May 5, 2016, which is incorporated by reference in its entirety.
  • FIELD OF THE INVENTION
  • Provided herein is technology relating to detecting neoplasia and particularly, but not exclusively, to methods, compositions, and related uses for detecting neoplasms such as lung cancer.
  • BACKGROUND OF THE INVENTION
  • Lung cancer is the most frequent cause of death among men and women younger than 85 years in the US. It accounts for 27% of all cancer deaths and 221,000 lost lives annually. This mortality rate exceeds that of the next 4 highest ranking cancers combined. Gene expression profiling has confirmed unique mRNA expression in cancers and can be used as an approach for detection of lung malignancies. An mRNA multi-marker approach to detect all subtypes of lung cancer needs to be explored. This study assesses the value of measuring expression levels of multiple mRNA markers in detecting lung cancer of different subtypes.
  • SUMMARY OF THE INVENTION
  • This technology is in the field of nucleic acid detection and quantification. Specifically, the technology addresses the detection and quantification of RNA in samples using single-tube RT-PCR-Invasive cleavage reaction (RT-QuARTS).
  • In some embodiments the technology provides methods of screening for a lung neoplasm in a sample obtained from a subject, the methods comprising, e.g., a) assaying a sample from a subject for an amount of at least one RNA marker selected from the group consisting of GAGE12D, FAM83A, LRG1, XAGE-1 d, MAGEA4, SFTPB, AKAP4, and CYP24A1; b) assaying the sample for an amount of a reference marker in the sample; c) comparing the amount of the at least one RNA marker to the amount of the reference marker to determine a level of expression for the at least one marker gene in the sample; and d) generating a record reporting the expression for the at least one marker gene in said sample. In some embodiments the method comprises obtaining a sample comprising RNA from a subject and treating the RNA with a reverse transcriptase, preferably MMLV reverse transcriptase, to form a cDNA copy of at least a portion of the RNA. In preferred embodiments, the cDNA is created and detected in a single vessel, without opening the vessel, e.g., to add additional reagents.
  • In some embodiments the at least one RNA marker is at least two markers. In some preferred embodiments the at least one RNA marker comprises the group consisting of GAGE, FAM83A, LRG1 and MAGEA4 markers, while in some embodiments, the at least one RNA marker comprises the group consisting of GAGE, FAM83A, LRG1, CYP24A1, XAGE1D and MAGEA4 markers. In some embodiments, the reference marker is an RNA, preferably an RNA selected from the group consisting of CASC3 mRNA, β-actin mRNA, U1 snRNA and U6 snRNA.
  • In some embodiments the technology comprises assaying RNA using one or more of a polymerase chain reaction, nucleic acid sequencing, mass spectrometry, mass-based separation, or target capture. In particularly preferred embodiments, the assaying comprises using a flap endonuclease assay, such as a QuARTS assay, as described hereinbelow.
  • In some embodiments, assaying the expression of the RNA marker comprises detecting increased or decreased expression of the RNA marker relative to a normal expression of the marker.
  • Samples suitable for analysis using the technology are not limited to a particular sample type. In some embodiments the sample is a tissue sample, a blood sample, a serum sample, or a sputum sample. In certain preferred embodiments the tissue sample comprises lung tissue.
  • The technology further provides kits, e.g., for practicing the technology. For example, in some embodiments the technology provides a kit comprising:
  • a) at least one oligonucleotide, wherein at least a portion of said oligonucleotide specifically hybridizes to a marker RNA selected from the group consisting of GAGE12D, FAM83A, LRG1, XAGE-1 d, MAGEA4, SFTPB, AKAP4, and CYP24A1, and
  • b) at least one additional oligonucleotide, wherein at least a portion of said additional oligonucleotide specifically hybridizes to a reference nucleic acid.
  • In preferred embodiments the kit comprises at least two additional oligonucleotides. In some embodiments, the kit further comprises one or more components selected from the group consisting of reverse transcriptase, flap endonuclease, DNA polymerase, and a FRET cassette.
  • In some embodiments the at least one RNA marker is selected from the group consisting of GAGE, FAM83A, LRG1, CYP24A1, XAGE1D and MAGEA4, and in some embodiments the RNA marker is selected from the group consisting of GAGE, FAM83A, LRG1 and MAGEA4. In certain preferred embodiments the kit comprises at least 4 oligonucleotides, wherein each of the markers in the group consisting of GAGE, FAM83A, LRG1, and MAGEA4 specifically hybridizes to at least one of said 4 oligonucleotides. In other embodiments, the kit comprises at least 6 oligonucleotides, wherein each of the markers in the group consisting of GAGE, FAM83A, LRG1, CYP24A1, XAGE1D and MAGEA4 specifically hybridizes to at least one of said 6 oligonucleotides. In preferred embodiments at least one oligonucleotide is selected from one or more of a capture oligonucleotide, a pair of nucleic acid primers, a nucleic acid probe, and an invasive oligonucleotide.
  • The technology is not limited to which particular reference marker RNA is used, and many are known in the field. In preferred embodiments, the reference marker is an RNA selected from the group consisting of CASC3, β-actin, U1 and U6 RNA
  • The technology further comprises compositions such as mixtures, e.g., reaction mixtures. In some embodiments the technology provides a mixture comprising a complex of at least one RNA marker selected from the group consisting of GAGE12D, FAM83A, LRG1, XAGE-1 d, MAGEA4, SFTPB, AKAP4, and CYP24A1 and an oligonucleotide that specifically hybridizes to the RNA marker. In preferred embodiments, the composition further comprises a complex of at least one reference marker and an oligonucleotide that specifically hybridizes to the reference RNA marker. In some embodiments the at least one RNA marker is selected from the group consisting of GAGE, FAM83A, LRG1, CYP24A1, XAGE1D and MAGEA4, while in some embodiments the least one RNA marker is selected from the group consisting of GAGE, FAM83A, LRG1 and MAGEA4. In preferred embodiments the composition comprises a reference marker that is an RNA selected from the group consisting of CASC3, β-actin, U1 RNA and U6 RNA. In particularly preferred embodiments, the oligonucleotide is selected from one or more of a capture oligonucleotide, a pair of nucleic acid primers, a nucleic acid probe, and an invasive oligonucleotide. Preferably the composition comprises a nucleic acid probe oligonucleotide comprising a reporter molecule, e.g., a fluorophore, and/or a flap sequence.
  • In some embodiments, the composition further comprises one or more components selected from the group consisting of reverse transcriptase, (e.g., MMLV reverse transcriptase), flap endonuclease, thermostable DNA polymerase, and a FRET cassette. In preferred embodiments, the DNA polymerase is a bacterial DNA polymerase.
  • Definitions
  • To facilitate an understanding of the present technology, a number of terms and phrases are defined below. Additional definitions are set forth throughout the detailed description.
  • Throughout the specification and claims, the following terms take the meanings explicitly associated herein, unless the context clearly dictates otherwise. The phrase “in one embodiment” as used herein does not necessarily refer to the same embodiment, though it may. Furthermore, the phrase “in another embodiment” as used herein does not necessarily refer to a different embodiment, although it may. Thus, as described below, various embodiments of the invention may be readily combined, without departing from the scope or spirit of the invention.
  • In addition, as used herein, the term “or” is an inclusive “or” operator and is equivalent to the term “and/or” unless the context clearly dictates otherwise. The term “based on” is not exclusive and allows for being based on additional factors not described, unless the context clearly dictates otherwise. In addition, throughout the specification, the meaning of “a”, “an”, and “the” include plural references. The meaning of “in” includes “in” and “on.”
  • The transitional phrase “consisting essentially of” as used in claims in the present application limits the scope of a claim to the specified materials or steps “and those that do not materially affect the basic and novel characteristic(s)” of the claimed invention, as discussed in In re Herz, 537 F.2d 549, 551-52, 190 USPQ 461, 463 (CCPA 1976). For example, a composition “consisting essentially of” recited elements may contain an unrecited contaminant at a level such that, though present, the contaminant does not alter the function of the recited composition as compared to a pure composition, i.e., a composition “consisting of” the recited components.
  • As used herein, the “sensitivity” of a given marker (or set of markers used together) refers to the percentage of samples that report a marker value (e.g., an expression marker) above a threshold value that distinguishes between neoplastic and non-neoplastic samples. In some embodiments, a positive is defined as a histology-confirmed neoplasia that reports a marker value above a threshold value (e.g., the range associated with disease), and a false negative is defined as a histology-confirmed neoplasia that reports a marker value below the threshold value (e.g., the range associated with no disease). The value of sensitivity, therefore, reflects the probability that a measurement for a given marker obtained from a known diseased sample will be in the range of disease-associated measurements. As defined here, the clinical relevance of the calculated sensitivity value represents an estimation of the probability that a given marker would detect the presence of a clinical condition when applied to a subject with that condition.
  • As used herein, the “specificity” of a given marker (or set of markers used together) refers to the percentage of non-neoplastic samples that report a marker value (e.g., an expression marker) below a threshold value that distinguishes between neoplastic and non-neoplastic samples. In some embodiments, a negative is defined as a histology-confirmed non-neoplastic sample that reports a marker value below the threshold value (e.g., the range associated with no disease) and a false positive is defined as a histology-confirmed non-neoplastic sample that reports a marker value above the threshold value (e.g., the range associated with disease). The value of specificity, therefore, reflects the probability that a marker measurement for a given marker obtained from a known non-neoplastic sample will be in the range of non-disease associated measurements. As defined here, the clinical relevance of the calculated specificity value represents an estimation of the probability that a given marker would detect the absence of a clinical condition when applied to a patient without that condition.
  • The term “primer” refers to an oligonucleotide, whether occurring naturally as, e.g., a nucleic acid fragment from a restriction digest, or produced synthetically, that is capable of acting as a point of initiation of synthesis when placed under conditions in which synthesis of a primer extension product that is complementary to a nucleic acid template strand is induced, (e.g., in the presence of nucleotides and an inducing agent such as a DNA polymerase, and at a suitable temperature and pH). The primer is preferably single stranded for maximum efficiency in amplification, but may alternatively be double stranded. If double stranded, the primer is first treated to separate its strands before being used to prepare extension products. Preferably, the primer is an oligodeoxyribonucleotide. The primer must be sufficiently long to prime the synthesis of extension products in the presence of the inducing agent. The exact lengths of the primers will depend on many factors, including temperature, source of primer, and the use of the method.
  • The term “probe” refers to an oligonucleotide (e.g., a sequence of nucleotides), whether occurring naturally as in a purified restriction digest or produced synthetically, recombinantly, or by PCR amplification, that is capable of hybridizing to another oligonucleotide of interest. A probe may be single-stranded or double-stranded. Probes are useful in the detection, identification, and isolation of particular gene sequences (e.g., a “capture probe”). It is contemplated that any probe used in the present invention may, in some embodiments, be labeled with any “reporter molecule,” so that is detectable in any detection system, including, but not limited to enzyme (e.g., ELISA, as well as enzyme-based histochemical assays), fluorescent, radioactive, and luminescent systems. It is not intended that the present invention be limited to any particular detection system or label.
  • The term “target,” as used herein refers to a nucleic acid sought to be sorted out from other nucleic acids, e.g., by probe binding, amplification, isolation, capture, etc. For example, when used in reference to the polymerase chain reaction, “target” refers to the region of nucleic acid bounded by the primers used for polymerase chain reaction, while when used in an assay in which target nucleic acid is not amplified, e.g., in some embodiments of an invasive cleavage assay, a target comprises the site at which a probe and invasive oligonucleotides (e.g., INVADER oligonucleotide) bind to form an invasive cleavage structure, such that the presence of the target nucleic acid can be detected. A “segment” is defined as a region of nucleic acid within the target sequence.
  • The term “marker”, as used herein, refers to a substance (e.g., a nucleic acid, or a region of a nucleic acid, or a protein) that may be used to distinguish non-normal cells (e.g., cancer cells) from normal cells, e.g., based on presence, absence, or status (e.g., post-transcriptional processing) of the marker substance.
  • The term “sample” is used in its broadest sense. In one sense it can refer to an animal cell or tissue. In another sense, it refers to a specimen or culture obtained from any source, as well as biological and environmental samples. Biological samples may be obtained from plants or animals (including humans) and encompass fluids, solids, tissues, and gases. Environmental samples include environmental material such as surface matter, soil, water, and industrial samples. These examples are not to be construed as limiting the sample types applicable to the present invention.
  • The term “neoplasm” as used herein refers to any new and abnormal growth of tissue. Thus, a neoplasm can be a premalignant neoplasm or a malignant neoplasm.
  • The term “neoplasm-specific marker,” as used herein, refers to any biological material or element that can be used to indicate the presence of a neoplasm. Examples of biological materials include, without limitation, nucleic acids (DNA, RNA, miRNA, etc.), polypeptides, carbohydrates, fatty acids, cellular components (e.g., cell membranes and mitochondria), and whole cells. In some instances, markers are particular nucleic acid regions (e.g., genes, intragenic regions, specific loci, etc.). Regions of nucleic acid that are markers may be referred to, e.g., as “marker genes,” “marker regions,” “marker sequences,” “marker loci,” etc.
  • As used herein, the terms “patient” or “subject” refer to organisms to be subject to various tests provided by the technology. The term “subject” includes animals, preferably mammals, including humans. In a preferred embodiment, the subject is a primate. In an even more preferred embodiment, the subject is a human. Further with respect to diagnostic methods, a preferred subject is a vertebrate subject. A preferred vertebrate is warm-blooded; a preferred warm-blooded vertebrate is a mammal. A preferred mammal is most preferably a human. As used herein, the term “subject’ includes both human and animal subjects. Thus, veterinary therapeutic uses are provided herein. As such, the present technology provides for the diagnosis of mammals such as humans, as well as those mammals of importance due to being endangered, such as Siberian tigers; of economic importance, such as animals raised on farms for consumption by humans; and/or animals of social importance to humans, such as animals kept as pets or in zoos. Examples of such animals include but are not limited to: carnivores such as cats and dogs; swine, including pigs, hogs, and wild boars; ruminants and/or ungulates such as cattle, oxen, sheep, giraffes, deer, goats, bison, and camels; pinnipeds; and horses. Thus, also provided is the diagnosis and treatment of livestock, including, but not limited to, domesticated swine, ruminants, ungulates, horses (including race horses), and the like. The presently-disclosed subject matter further includes a system for diagnosing a lung cancer in a subject. The system can be provided, for example, as a commercial kit that can be used to screen for a risk of lung cancer or diagnose a lung cancer in a subject from whom a biological sample has been collected. An exemplary system provided in accordance with the present technology includes assessing the expression of a marker described herein.
  • The term “amplifying” or “amplification” in the context of nucleic acids refers to the production of multiple copies of a polynucleotide, or a portion of the polynucleotide, typically starting from a small amount of the polynucleotide (e.g., a single polynucleotide molecule), where the amplification products or amplicons are generally detectable. Amplification of polynucleotides encompasses a variety of chemical and enzymatic processes. The generation of multiple DNA copies from one or a few copies of a target or template DNA molecule during a polymerase chain reaction (PCR) or a ligase chain reaction (LCR; see, e.g., U.S. Pat. No. 5,494,810; herein incorporated by reference in its entirety) are forms of amplification. Additional types of amplification include, but are not limited to, allele-specific PCR (see, e.g., U.S. Pat. No. 5,639,611; herein incorporated by reference in its entirety), assembly PCR (see, e.g., U.S. Pat. No. 5,965,408; herein incorporated by reference in its entirety), helicase-dependent amplification (see, e.g., U.S. Pat. No. 7,662,594; herein incorporated by reference in its entirety), hot-start PCR (see, e.g., U.S. Pat. Nos. 5,773,258 and 5,338,671; each herein incorporated by reference in their entireties), intersequence-specific PCR, inverse PCR (see, e.g., Triglia, et al. (1988) Nucleic Acids Res., 16:8186; herein incorporated by reference in its entirety), ligation-mediated PCR (see, e.g., Guilfoyle, R. et al., Nucleic Acids Research, 25:1854-1858 (1997); U.S. Pat. No. 5,508,169; each of which are herein incorporated by reference in their entireties), miniprimer PCR, multiplex ligation-dependent probe amplification (see, e.g., Schouten, et al., (2002) Nucleic Acids Research 30(12): e57; herein incorporated by reference in its entirety), multiplex PCR (see, e.g., Chamberlain, et al., (1988) Nucleic Acids Research 16(23) 11141-11156; Ballabio, et al., (1990) Human Genetics 84(6) 571-573; Hayden, et al., (2008) BMC Genetics 9:80; each of which are herein incorporated by reference in their entireties), nested PCR, overlap-extension PCR (see, e.g., Higuchi, et al., (1988) Nucleic Acids Research 16(15) 7351-7367; herein incorporated by reference in its entirety), real time PCR (see, e.g., Higuchi, et al., (1992) Biotechnology 10:413-417; Higuchi, et al., (1993) Biotechnology 11:1026-1030; each of which are herein incorporated by reference in their entireties), reverse transcription PCR (see, e.g., Bustin, S. A. (2000) J. Molecular Endocrinology 25:169-193; herein incorporated by reference in its entirety), solid phase PCR, thermal asymmetric interlaced PCR, and Touchdown PCR (see, e.g., Don, et al., Nucleic Acids Research (1991) 19(14) 4008; Roux, K. (1994) Biotechniques 16(5) 812-814; Hecker, et al., (1996) Biotechniques 20(3) 478-485; each of which are herein incorporated by reference in their entireties). Polynucleotide amplification also can be accomplished using digital PCR (see, e.g., Kalinina, et al., Nucleic Acids Research. 25; 1999-2004, (1997); Vogelstein and Kinzler, Proc Natl Acad Sci USA. 96; 9236-41, (1999); International Patent Publication No. WO05023091A2; US Patent Application Publication No. 20070202525; each of which are incorporated herein by reference in their entireties).
  • The term “polymerase chain reaction” (“PCR”) refers to the method of K. B. Mullis U.S. Pat. Nos. 4,683,195, 4,683,202, and 4,965,188, that describe a method for increasing the concentration of a segment of a target sequence in a mixture of genomic or other DNA or RNA, without cloning or purification. This process for amplifying the target sequence consists of introducing a large excess of two oligonucleotide primers to the DNA mixture containing the desired target sequence, followed by a precise sequence of thermal cycling in the presence of a DNA polymerase. The two primers are complementary to their respective strands of the double stranded target sequence. To effect amplification, the mixture is denatured and the primers then annealed to their complementary sequences within the target molecule. Following annealing, the primers are extended with a polymerase so as to form a new pair of complementary strands. The steps of denaturation, primer annealing, and polymerase extension can be repeated many times (i.e., denaturation, annealing and extension constitute one “cycle”; there can be numerous “cycles”) to obtain a high concentration of an amplified segment of the desired target sequence. The length of the amplified segment of the desired target sequence is determined by the relative positions of the primers with respect to each other, and therefore, this length is a controllable parameter. By virtue of the repeating aspect of the process, the method is referred to as the “polymerase chain reaction” (“PCR”). Because the desired amplified segments of the target sequence become the predominant sequences (in terms of concentration) in the mixture, they are said to be “PCR amplified” and are “PCR products” or “amplicons.” Those of skill in the art will understand the term “PCR” encompasses many variants of the originally described method using, e.g., real time PCR, nested PCR, reverse transcription PCR (RT-PCR), single primer and arbitrarily primed PCR, etc.
  • As used herein, the term “nucleic acid detection assay” refers to any method of determining the nucleotide composition of a nucleic acid of interest. Nucleic acid detection assay include but are not limited to, DNA sequencing methods, probe hybridization methods, structure specific cleavage assays (e.g., the INVADER assay, (Hologic, Inc.) and are described, e.g., in U.S. Pat. Nos. 5,846,717, 5,985,557, 5,994,069, 6,001,567, 6,090,543, and 6,872,816, WO 2006/050499; Lyamichev et al., Nat. Biotech., 17:292 (1999), Hall et al., PNAS, USA, 97:8272 (2000), and US 2009/0253142, each of which is herein incorporated by reference in its entirety for all purposes); enzyme mismatch cleavage methods (e.g., Variagenics, U.S. Pat. Nos. 6,110,684, 5,958,692, 5,851,770, herein incorporated by reference in their entireties); polymerase chain reaction (PCR), described above; branched hybridization methods (e.g., Chiron, U.S. Pat. Nos. 5,849,481, 5,710,264, 5,124,246, and 5,624,802, herein incorporated by reference in their entireties); rolling circle replication (e.g., U.S. Pat. Nos. 6,210,884, 6,183,960 and 6,235,502, herein incorporated by reference in their entireties); NASBA (e.g., U.S. Pat. No. 5,409,818, herein incorporated by reference in its entirety); molecular beacon technology (e.g., U.S. Pat. No. 6,150,097, herein incorporated by reference in its entirety); E-sensor technology (Motorola, U.S. Pat. Nos. 6,248,229, 6,221,583, 6,013,170, and 6,063,573, herein incorporated by reference in their entireties); cycling probe technology (e.g., U.S. Pat. Nos. 5,403,711, 5,011,769, and 5,660,988, herein incorporated by reference in their entireties); Dade Behring signal amplification methods (e.g., U.S. Pat. Nos. 6,121,001, 6,110,677, 5,914,230, 5,882,867, and 5,792,614, herein incorporated by reference in their entireties); ligase chain reaction (e.g., Baranay Proc. Natl. Acad. Sci USA 88, 189-93 (1991)); and sandwich hybridization methods (e.g., U.S. Pat. No. 5,288,609, herein incorporated by reference in its entirety).
  • In some embodiments, target nucleic acid is amplified (e.g., by PCR) and amplified nucleic acid is detected simultaneously using an invasive cleavage assay. Assays configured for performing a detection assay (e.g., invasive cleavage assay) in combination with an amplification assay are described in U.S. Pat. No. 9,096,893, incorporated herein by reference in its entirety for all purposes. Additional amplification plus invasive cleavage detection configurations, termed the QuARTS method, are described in U.S. Pat. Nos. 8,361,720; 8,715,937; 8,916,344; and 9,212,392, incorporated herein by reference in their entireties for all purposes. The term “invasive cleavage structure” as used herein refers to a cleavage structure comprising i) a target nucleic acid, ii) an upstream nucleic acid (e.g., an invasive or “INVADER” oligonucleotide), and iii) a downstream nucleic acid (e.g., a probe), where the upstream and downstream nucleic acids anneal to contiguous regions of the target nucleic acid, and where an overlap forms between the a 3′ portion of the upstream nucleic acid and duplex formed between the downstream nucleic acid and the target nucleic acid. An overlap occurs where one or more bases from the upstream and downstream nucleic acids occupy the same position with respect to a target nucleic acid base, whether or not the overlapping base(s) of the upstream nucleic acid are complementary with the target nucleic acid, and whether or not those bases are natural bases or non-natural bases. In some embodiments, the 3′ portion of the upstream nucleic acid that overlaps with the downstream duplex is a non-base chemical moiety such as an aromatic ring structure, e.g., as disclosed, for example, in U.S. Pat. No. 6,090,543, incorporated herein by reference in its entirety. In some embodiments, one or more of the nucleic acids may be attached to each other, e.g., through a covalent linkage such as nucleic acid stem-loop, or through a non-nucleic acid chemical linkage (e.g., a multi-carbon chain). As used herein, the term “flap endonuclease assay” includes “INVADER” invasive cleavage assays and QuARTS assays, as described above. The term “flap oligonucleotide” refers to an oligonucleotide cleavable in a detection assay, such as an invasive cleavage assay, by a flap endonuclease. In preferred embodiments, a flap oligonucleotide forms an invasive cleavage structure with other nucleic acids, e.g., a target nucleic acid and an invasive oligonucleotide.
  • As used herein, the term “PCR-invasive cleavage assay” refers to an assay in which target nucleic acid is amplified and amplified nucleic acid is detected simultaneously using a signal-amplifying invasive cleavage assay employing a FRET cassette, and in which the assay reagents comprise a mixture containing DNA polymerase, FEN-1 endonuclease, a primary probe comprising a portion complementary to a target nucleic acid, and a hairpin FRET cassette. PCR-invasive cleavage assays include the QuARTS assays described in U.S. Pat. Nos. 8,361,720; 8,715,937; 8,916,344; and 9,212,392, and the amplification assays of U.S. Pat. No. 9,096,893, as diagrammed in FIG. 1 of that patent, each of which is incorporated herein by reference for all purposes.
  • As used herein, the term “PCR-invasive cleavage assay reagents” refers to one or more reagents for detecting target sequences in a PCR-invasive cleavage assay, the reagents comprising nucleic acid molecules capable of participating in amplification of a target nucleic acid and in formation of an invasive cleavage structure in the presence of the target sequence, in a mixture containing DNA polymerase, FEN-1 endonuclease and a FRET cassette, and optionally a reverse transcriptase.
  • As used herein, the term “FRET cassette” refers to a hairpin oligonucleotide that contains a fluorophore moiety and a nearby quencher moiety that quenches the fluorophore. Hybridization of a cleaved flap (e.g., from cleavage of a target-specific probe in a PCR-invasive cleavage assay) with a FRET cassette produces a secondary substrate for the flap endonuclease, e.g., a FEN-1 enzyme. Once this substrate is formed, the 5′ fluorophore-containing base is cleaved from the cassette, thereby generating a fluorescence signal. In preferred embodiments, a FRET cassette comprises an unpaired 3′ portion to which a cleavage product, e.g., a portion of a cleaved flap oligonucleotide, can hybridize to from an invasive cleavage structure cleavable by a FEN-1 endonuclease.
  • A nucleic acid “hairpin” as used herein refers to a region of a single-stranded nucleic acid that contains a duplex (i.e., base-paired) stem and a loop, formed when the nucleic acid comprises two portions that are sufficiently complementary to each other to form a plurality of consecutive base pairs.
  • As used herein, the term “FRET” refers to fluorescence resonance energy transfer, a process in which moieties (e.g., fluorophores) transfer energy e.g., among themselves, or, from a fluorophore to a non-fluorophore (e.g., a quencher molecule). In some circumstances, FRET involves an excited donor fluorophore transferring energy to a lower-energy acceptor fluorophore via a short-range (e.g., about 10 nm or less) dipole-dipole interaction. In other circumstances, FRET involves a loss of fluorescence energy from a donor and an increase in fluorescence in an acceptor fluorophore. In still other forms of FRET, energy can be exchanged from an excited donor flurophore to a non-fluorescing molecule (e.g., a quenching molecule). FRET is known to those of skill in the art and has been described (See, e.g., Stryer et al., 1978, Ann. Rev. Biochem., 47:819; Selvin, 1995, Methods Enzymol., 246:300; Orpana, 2004 Biomol Eng 21, 45-50; Olivier, 2005 Mutant Res 573, 103-110, each of which is incorporated herein by reference in its entirety).
  • As used herein, the term “FEN-1” in reference to an enzyme refers to a non-polymerase flap endonuclease from a eukaryote or archaeal organism. See, e.g., WO 02/070755, and Kaiser M. W., et al. (1999) J. Biol. Chem., 274:21387, which are incorporated by reference herein in their entireties for all purposes.
  • As used herein, the term “FEN-1 activity” refers to any enzymatic activity of a FEN-1 enzyme, including but not limited to flap endonuclease (FEN), nick exonuclease (EXO), and gap endonuclease (GEN) activities (see, e.g., Shen, et al., BioEssays Volume 27, Issue 7, Pages 717-729, incorporated herein by reference).
  • As used herein, the term “primer annealing” refers to conditions that permit oligonucleotide primers to hybridize to template nucleic acid strands. Conditions for primer annealing vary with the length and sequence of the primer and are generally based upon the Tm that is determined or calculated for the primer. For example, an annealing step in an amplification method that involves thermocycling involves reducing the temperature after a heat denaturation step to a temperature based on the Tm of the primer sequence, for a time sufficient to permit such annealing.
  • As used herein, the term “amplifiable nucleic acid” is used in reference to nucleic acids that may be amplified by any amplification method. It is contemplated that “amplifiable nucleic acid” will usually comprise “sample template.”
  • As used herein, the term “sample template” refers to nucleic acid originating from a sample that is analyzed for the presence of “target.” In contrast, “background template” is used in reference to nucleic acid other than sample template that may or may not be present in a sample. The presence of background template is most often inadvertent. It may be the result of carryover, or it may be due to the presence of nucleic acid contaminants sought to be purified away from the sample. For example, nucleic acids from organisms other than those to be detected may be present as background in a test sample.
  • A sample “suspected of containing” a nucleic acid may contain or not contain the target nucleic acid molecule.
  • The term “real time” as used herein in reference to detection of nucleic acid amplification or signal amplification refers to the detection or measurement of the accumulation of products or signal in the reaction while the reaction is in progress, e.g., during incubation or thermal cycling. Such detection or measurement may occur continuously, or it may occur at a plurality of discrete points during the progress of the amplification reaction, or it may be a combination. For example, in a polymerase chain reaction, detection (e.g., of fluorescence) may occur continuously during all or part of thermal cycling, or it may occur transiently, at one or more points during one or more cycles. In some embodiments, real time detection of PCR is accomplished by determining a level of fluorescence at the same point (e.g., a time point in the cycle, or temperature step in the cycle) in each of a plurality of cycles, or in every cycle. Real time detection of amplification may also be referred to as detection “during” the amplification reaction.
  • As used herein, the terms “reverse transcription” and “reverse transcribe” refer to the use of a template-dependent polymerase to produce a DNA strand complementary to an RNA template. A polymerase capable of producing a DNA strand complementary to an RNA template is generally referred to as a “reverse transcriptase” or as a polymerase that has “reverse transcriptase activity”.
  • As used herein, the term “abundance of nucleic acid” refers to the amount of a particular target nucleic acid sequence present in a sample or aliquot. The amount is generally referred to in terms of mass (e.g., μg), mass per unit of volume (e.g., μg/μ1); copy number (e.g., 1000 copies, 1 attomole), or copy number per unit of volume (e.g., 1000 copies per ml, 1 attomole per μ1). Abundance of a nucleic acid can also be expressed as an amount relative to the amount of a standard of known concentration or copy number. Measurement of abundance of a nucleic acid may be on any basis understood by those of skill in the art as being a suitable quantitative representation of nucleic acid abundance, including physical density or the sample, optical density, refractive property, staining properties, or on the basis of the intensity of a detectable label, e.g. a fluorescent label.
  • The term “amplicon” or “amplified product” refers to a segment of nucleic acid, generally DNA, generated by an amplification process such as the PCR process. The terms are also used in reference to RNA segments produced by amplification methods that employ RNA polymerases, such as NASBA, TMA, etc.
  • The term “amplification plot” as used in reference to a thermal cycling amplification reaction refers to the plot of signal that is indicative of amplification, e.g., fluorescence signal, versus cycle number. When used in reference to a non-thermal cycling amplification method, an amplification plot generally refers to a plot of the accumulation of signal as a function of time.
  • The term “baseline” as used in reference to an amplification plot refers to the detected signal coming from assembled amplification reactions at prior to incubation or, in the case of PCR, in the initial cycles, in which there is little change in signal.
  • The term “no template control” and “no target control” (or “NTC”) as used herein in reference to a control reaction refers to a reaction or sample that does not contain template or target nucleic acid. It is used to verify amplification quality.
  • As used herein, the term “quantitative amplification data set” refers to the data obtained during quantitative amplification of the target sample, e.g., target DNA. In the case of quantitative PCR or QuARTS assays, the quantitative amplification data set is a collection of fluorescence values obtained at during amplification, e.g., during a plurality of, or all of the thermal cycles. Data for quantitative amplification is not limited to data collected at any particular point in a reaction, and fluorescence may be measured at a discrete point in each cycle or continuously throughout each cycle.
  • The abbreviations “Ct” and “Cp” as used herein in reference to real-time detection during an amplification reaction that is thermal cycled refers to the cycle at which signal (e.g., fluorescent signal) crosses a predetermined threshold value indicative of positive signal. Various methods have been used to calculate the threshold that is used as a determinant of signal verses concentration, and the value is generally expressed as either the “crossing threshold” (Ct) or the “crossing point” (Cp). Either Cp values or Ct values may be used in embodiments of the methods presented herein for analysis of real-time signal for the determination of the amount of RNA marker(s) or reference markers in an assay or sample.
  • As used herein, the term “kit” refers to any delivery system for delivering materials. In the context of reaction assays, such delivery systems include systems that allow for the storage, transport, or delivery of reaction reagents (e.g., oligonucleotides, enzymes, etc. in the appropriate containers) and/or supporting materials (e.g., buffers, written instructions for performing the assay etc.) from one location to another. For example, kits include one or more enclosures (e.g., boxes) containing the relevant reaction reagents and/or supporting materials. As used herein, the term “fragmented kit” refers to delivery systems comprising two or more separate containers that each contain a subportion of the total kit components. The containers may be delivered to the intended recipient together or separately. For example, a first container may contain an enzyme for use in an assay, while a second container contains oligonucleotides.
  • The term “system” as used herein refers to a collection of articles for use for a particular purpose. In some embodiments, the articles comprise instructions for use, as information supplied on e.g., an article, on paper, or on recordable media (e.g., DVD, CD, flash drive, etc.). In some embodiments, instructions direct a user to an online location, e.g., a website.
  • As used herein, the term “information” refers to any collection of facts or data. In reference to information stored or processed using a computer system(s), including but not limited to internets, the term refers to any data stored in any format (e.g., analog, digital, optical, etc.). As used herein, the term “information related to a subject” refers to facts or data pertaining to a subject (e.g., a human, plant, or animal). The term “genomic information” refers to information pertaining to a genome including, but not limited to, nucleic acid sequences, genes, percentage methylation, allele frequencies, RNA expression levels, protein expression, phenotypes correlating to genotypes, etc. “Allele frequency information” refers to facts or data pertaining to allele frequencies, including, but not limited to, allele identities, statistical correlations between the presence of an allele and a characteristic of a subject (e.g., a human subject), the presence or absence of an allele in an individual or population, the percentage likelihood of an allele being present in an individual having one or more particular characteristics, etc.
  • DESCRIPTION OF THE DRAWINGS
  • FIG. 1 shows a schematic diagram of a combined reverse transcription-QuARTS flap endonuclease detection assay for real-time detection of RNA. Use of multiple different probe flap/FRET cassette dye combinations allows multiple different target nucleic acids to be detected together in multiplex reactions.
  • FIG. 2 compares the effects of different amounts of reverse transcriptase and different reverse transcription conditions on the detection of known amounts of target RNA in RT-QuARTS assays.
  • FIG. 3 shows graphs showing standard curves measuring marker LRG1 RNA. Panel A describes the dilution series, the average Cp value at each dilution, and the calculated strands/reaction calculated from the amplification plots shown in panel B. Panel C shows a graph comparing the Cp compared to the log of the amount of RNA present in the sample.
  • FIG. 4 compares the signals measured for markers FAM83A, XAGE1D, CYP24A1, GAGE12D, LRG1, and MAGEA4 in cancer and normal tissue samples, as described below.
  • FIG. 5 shows graphs comparing the sensitivity and specificity when samples are analyzed using the combinations of four or six expression markers, as listed above each panel.
  • DETAILED DESCRIPTION OF THE INVENTION
  • Provided herein is technology relating to RNA expression markers for use in assays for detection and quantification of RNA. In particular, the technology relates to use of RNA-based gene expression assays to detect lung cancer.
  • In this detailed description of the various embodiments, for purposes of explanation, numerous specific details are set forth to provide a thorough understanding of the embodiments disclosed. One skilled in the art will appreciate, however, that these various embodiments may be practiced with or without these specific details. In other instances, structures and devices are shown in block diagram form. Furthermore, one skilled in the art can readily appreciate that the specific sequences in which methods are presented and performed are illustrative and it is contemplated that the sequences can be varied and still remain within the spirit and scope of the various embodiments disclosed herein.
  • The methods and compositions provided herein relate to characterizing the expression from marker genes by characterizing RNA molecules (“RNA markers”) in a sample, wherein the RNA presence, absence, or status (e.g., with respect to post-transcription modifications or processing) is indicative of neoplasia. Accordingly, provided here are compositions and method directed toward analysis of RNA markers that correlate with lung neoplasia. In preferred embodiments the technology provides assays wherein RNA markers are reverse transcribed, amplified, and detected in real time in a single reaction mixture, and in a single vessel.
  • Also provided herein are compositions and kits for practicing the methods. For example, in some embodiments, reagents (e.g., primers, probes) specific for one or more RNA expression markers are provided alone or in sets (e.g., sets of primers pairs for amplifying a plurality of RNA markers). Additional reagents for conducting a detection assay may also be provided (e.g., enzymes, buffers, positive and negative controls for conducting QuARTS assays, RT-QuARTS assays, PCR, sequencing, or other assays). In some embodiments, the kits containing one or more reagent necessary, sufficient, or useful for conducting a method are provided. Also provided are reactions mixtures containing the reagents. Further provided are master mix reagent sets containing a plurality of reagents that may be added to each other and/or to a test sample to complete a reaction mixture.
  • The technology relates to the analysis of any sample associated with lung cancer. For example, in some embodiments the sample comprises a tissue and/or biological fluid obtained from a patient. In some embodiments, the sample comprises a secretion. In some embodiments, the sample comprises sputum, blood, serum, plasma, lung tissue samples, or lung cells. In some embodiments, the subject is human. Such samples can be obtained by any number of means known in the art, such as will be apparent to the skilled person.
  • I. RNA Detection Assays to Detect Lung Cancer
  • Eight candidate mRNA markers (GAGE12D, FAM83A, LRG1, XAGE-1 d, MAGEA4, SFTPB, AKAP4, and CYP24A1) were selected based on discrimination reported in the literature. As described below, samples from 246 patients (119 controls, 127 lung cancer cases) were tested. The lung cancer cases comprised adena (65), squamous (34), large cell (13), small cell (4) and others carcinomas (11). The controls were from patients having benign lung nodules (37), normal lung (60), chronic obstructive pulmonary disorder (COPD) (10), and normal lung adjacent to tumor (12). Cases and controls included smokers and non-smokers.
  • Messenger RNA expression levels were assayed in a single-tube reverse transcription QuARTS (Quantitative Allele-Specific Real-time Target and Signal amplification) as described herein below, a reaction configuration that simultaneously measures copy numbers of two mRNA markers and a housekeeping reference mRNA (CASC3). To account for sample-to-sample variability, relative gene expression values of each mRNA marker were calculated by dividing the copy numbers obtained for each of the mRNAs by the CASC3 mRNA copy number.
  • Receiver operator characteristic (ROC) curve analyses resulted in an area under the curve (AUC) of 0.976. At 100% specificity, the mRNA panel of 6 markers achieved a sensitivity of 92.1% for all cancers (117/127) and 93.9% for adenocarcinoma and squamous carcinoma combined (93/99).
  • II. RNA Detection Assays and Kits
  • The markers described herein find use in a variety of RNA expression assays, e.g., qRT-PCR, digital PCR, gene expression arrays, etc. In some embodiments, a modified version of a quantitative real-time target and signal amplification (QuARTS) assay is used to evaluate gene expression. In DNA detection, three reactions occur during each QuARTS assay, including amplification (reaction 1) and target probe cleavage (reaction 2) in the primary reaction; and FRET cleavage and fluorescent signal generation (reaction 3) in the secondary reaction. After the first few cycles generate initial amounts of cleaved probe, these reactions occur essentially concurrently. As modified herein, a reverse transcription step is included to produce cDNA for QuARTS flap assay detection.
  • When target nucleic acid is amplified with specific primers, a specific detection probe with a flap sequence loosely binds to the amplicon. The presence of the specific invasive oligonucleotide at the target binding site causes a 5′ nuclease, e.g., a FEN-1 endonuclease, to release the flap sequence by cutting between the detection probe and the flap sequence. The flap sequence is complementary to a non-hairpin portion of a corresponding FRET cassette. Accordingly, the flap sequence functions as an invasive oligonucleotide on the FRET cassette and effects a cleavage between the FRET cassette fluorophore and a quencher, which produces a fluorescent signal. The cleavage reaction can cut multiple probes per target and thus release multiple fluorophore per flap, providing exponential signal amplification. A QuARTS flap endonuclease assay can detect multiple targets in a single reaction vessel, e.g., by using FRET cassettes with different dyes.
  • Methods of isolating RNA from samples are known in the art. For example, RNA isolation methods may comprise one or more of organic extraction, ultrafiltration, hybrid capture, etc. In some embodiments, cells or lysed samples containing RNA may be added directly to assay reactions without purification.
  • In some embodiments, the sample comprises blood, serum, plasma, or saliva. In some embodiments, the subject is human. Such samples can be obtained by any number of means known in the art, such as will be apparent to the skilled person. Cell free or substantially cell free samples can be obtained by subjecting the sample to various techniques known to those of skill in the art which include, but are not limited to, centrifugation and filtration. Although it is generally preferred that no invasive techniques are used to obtain the sample, it still may be preferable to obtain samples such as tissue homogenates, tissue sections, and biopsy specimens. The technology is not limited in the methods used to prepare the samples and provide a nucleic acid for testing. For example, in some embodiments, a RNA is isolated from blood or from a plasma sample using a hybrid capture method, e.g., using target-specific binding materials (e.g., oligonucleotides) on solid supports.
  • The analysis of markers can be carried out separately or simultaneously with additional markers within one test sample. For example, several markers can be combined into one test for efficient processing of multiple samples and for potentially providing greater diagnostic and/or prognostic accuracy. In addition, one skilled in the art would recognize the value of testing multiple samples (for example, at successive time points) from the same subject. Such testing of serial samples can allow the identification of changes in marker expression over time. Changes in expression, as well as the absence of change in expression, can provide useful information about the disease status that includes, but is not limited to, identifying the approximate time from onset of the event, the presence and amount of salvageable tissue, the appropriateness of drug therapies, the effectiveness of various therapies, and identification of the subject's outcome, including risk of future events.
  • The analysis of biomarkers can be carried out in a variety of physical formats. For example, the use of microtiter plates or automation can be used to facilitate the processing of large numbers of test samples. Alternatively, single sample formats could be developed to facilitate immediate treatment and diagnosis in a timely fashion, for example, in ambulatory transport or emergency room settings.
  • It is contemplated that embodiments of the technology are provided in the form of a kit. The kits comprise embodiments of the compositions, devices, apparatuses, etc. described herein, and instructions for use of the kit. Such instructions describe appropriate methods for preparing an analyte from a sample, e.g., for collecting a sample and preparing a nucleic acid from the sample. Individual components of the kit are packaged in appropriate containers and packaging (e.g., vials, boxes, blister packs, ampules, jars, bottles, tubes, and the like) and the components are packaged together in an appropriate container (e.g., a box or boxes) for convenient storage, shipping, and/or use by the user of the kit. It is understood that liquid components (e.g., a buffer) may be provided in a lyophilized form to be reconstituted by the user. Kits may include a control or reference for assessing, validating, and/or assuring the performance of the kit. For example, a kit for assaying the amount of a nucleic acid present in a sample may include a control comprising a known concentration of the same or another nucleic acid for comparison and, in some embodiments, a detection reagent (e.g., a primer) specific for the control nucleic acid. The kits are appropriate for use in a clinical setting and, in some embodiments, for use in a user's home. The components of a kit, in some embodiments, provide the functionalities of a system for preparing a nucleic acid solution from a sample. In some embodiments, certain components of the system are provided by the user.
  • III. Applications
  • In some embodiments, diagnostic assays identify the presence of a disease or condition in an individual. In some embodiments, the disease is cancer (e.g., lung cancer). In preferred embodiments, markers whose aberrant expression is associated with a lung cancer (e.g., one or more markers selected GAGE12D, FAM83A, LRG1, XAGE-1 d, MAGEA4, SFTPB, AKAP4, and CYP24A1) are used. In some embodiments, an assay further comprises detection of a reference nucleic acid (e.g., CASC3 or β-actin mRNAs; U1 and U6 snRNAs, etc.).
  • In some embodiments, the technology finds application in treating a patient (e.g., a patient with lung cancer, with early stage lung cancer, or who may develop lung cancer), the method comprising determining the expression levels of one or more markers as provided herein and administering a treatment to the patient based on the results of determining the expression levels. The treatment may be administration of a pharmaceutical compound, a vaccine, performing a surgery, imaging the patient, performing another test. Preferably, said use is in a method of clinical screening, a method of prognosis assessment, a method of monitoring the results of therapy, a method to identify patients most likely to respond to a particular therapeutic treatment, a method of imaging a patient or subject, and a method for drug screening and development.
  • In some embodiments, the technology finds application in methods for diagnosing lung cancer in a subject is provided. The terms “diagnosing” and “diagnosis” as used herein refer to methods by which the skilled artisan can estimate and even determine whether or not a subject is suffering from a given disease or condition or may develop a given disease or condition in the future. The skilled artisan often makes a diagnosis on the basis of one or more diagnostic indicators, such as for example a biomarker, the expression of which is indicative of the presence, severity, or absence of the condition.
  • Along with diagnosis, clinical cancer prognosis relates to determining the aggressiveness of the cancer and the likelihood of tumor recurrence to plan the most effective therapy. If a more accurate prognosis can be made or even a potential risk for developing the cancer can be assessed, appropriate therapy, and in some instances less severe therapy for the patient can be chosen. Assessment (e.g., analyzing expression) of cancer biomarkers is useful to separate subjects with good prognosis and/or low risk of developing cancer who will need no therapy or limited therapy from those more likely to develop cancer or suffer a recurrence of cancer who might benefit from more intensive treatments.
  • As such, “making a diagnosis” or “diagnosing”, as used herein, is further inclusive of making determining a risk of developing cancer or determining a prognosis, which can provide for predicting a clinical outcome (with or without medical treatment), selecting an appropriate treatment (or whether treatment would be effective), or monitoring a current treatment and potentially changing the treatment, based on the measure of the diagnostic biomarkers disclosed herein.
  • Further, in some embodiments of the technology, multiple determinations of the biomarkers over time can be made to facilitate diagnosis and/or prognosis. A temporal change in the biomarker can be used to predict a clinical outcome, monitor the progression of lung cancer, and/or monitor the efficacy of appropriate therapies directed against the cancer. In such an embodiment for example, one might expect to see a change in the expression of one or more biomarkers disclosed herein (and potentially one or more additional biomarker(s), if monitored) in a biological sample over time during the course of an effective therapy.
  • The technology further finds application in methods for determining whether to initiate or continue prophylaxis or treatment of a cancer in a subject. In some embodiments, the method comprises providing a series of biological samples over a time period from the subject; analyzing the series of biological samples to determine expression of at least one biomarker disclosed herein in each of the biological samples; and comparing any measurable change in the expression of one or more of the biomarkers in each of the biological samples. Any changes in the expression of biomarkers over the time period can be used to predict risk of developing cancer, predict clinical outcome, determine whether to initiate or continue the prophylaxis or therapy of the cancer, and whether a current therapy is effectively treating the cancer. For example, a first time point can be selected prior to initiation of a treatment and a second time point can be selected at some time after initiation of the treatment. Expression can be measured in each of the samples taken from different time points and qualitative and/or quantitative differences noted. A change in the expression of the biomarkers from the different samples can be correlated with risk for developing lung, prognosis, determining treatment efficacy, and/or progression of the cancer in the subject.
  • In preferred embodiments, the methods and compositions of the invention are for treatment or diagnosis of disease at an early stage, for example, before symptoms of the disease appear. In some embodiments, the methods and compositions of the invention are for treatment or diagnosis of disease at a clinical stage.
  • As noted above, in some embodiments multiple determinations of one or more diagnostic or prognostic biomarkers can be made, and a temporal change in the marker can be used to determine a diagnosis or prognosis. For example, a diagnostic marker can be determined at an initial time, and again at a second time. In such embodiments, an increase in the marker from the initial time to the second time can be diagnostic of a particular type or severity of cancer, or a given prognosis. Likewise, a decrease in the marker from the initial time to the second time can be indicative of a particular type or severity of cancer, or a given prognosis. Furthermore, the degree of change of one or more markers can be related to the severity of the cancer and future adverse events. The skilled artisan will understand that, while in certain embodiments comparative measurements can be made of the same biomarker at multiple time points, one can also measure a given biomarker at one time point, and a second biomarker at a second time point, and a comparison of these markers can provide diagnostic information.
  • As used herein, the phrase “determining the prognosis” refers to methods by which the skilled artisan can predict the course or outcome of a condition in a subject. The term “prognosis” does not refer to the ability to predict the course or outcome of a condition with 100% accuracy, or even that a given course or outcome is predictably more or less likely to occur based on the expression of a biomarker. Instead, the skilled artisan will understand that the term “prognosis” refers to an increased probability that a certain course or outcome will occur; that is, that a course or outcome is more likely to occur in a subject exhibiting a given condition, when compared to those individuals not exhibiting the condition. For example, in individuals not exhibiting the condition, the chance of a given outcome (e.g., suffering from lung cancer) may be very low.
  • In some embodiments, a statistical analysis associates a prognostic indicator with a predisposition to an adverse outcome. For example, in some embodiments, an expression level different from that in a normal control sample obtained from a patient who does not have a cancer can signal that a subject is more likely to suffer from a cancer than subjects with a level that is more similar to the expression level in the control sample, as determined by a level of statistical significance. Additionally, a change in expression level from a baseline (e.g., “normal”) level can be reflective of subject prognosis, and the degree of change in expression can be related to the severity of adverse events. Statistical significance is often determined by comparing two or more populations and determining a confidence interval and/or a p value. See, e.g., Dowdy and Wearden, Statistics for Research, John Wiley & Sons, New York, 1983, incorporated herein by reference in its entirety. Exemplary confidence intervals of the present subject matter are 90%, 95%, 97.5%, 98%, 99%, 99.5%, 99.9% and 99.99%, while exemplary p values are 0.1, 0.05, 0.025, 0.02, 0.01, 0.005, 0.001, and 0.0001.
  • In other embodiments, a threshold degree of change in expression of a prognostic or diagnostic biomarker disclosed herein can be established, and the degree of change in the expression of the biomarker in a biological sample is simply compared to the threshold degree of change in the expression. A preferred threshold change in the expression level for biomarkers provided herein is about 5%, about 10%, about 15%, about 20%, about 25%, about 30%, about 50%, about 75%, about 100%, and about 150%. In yet other embodiments, a “nomogram” can be established, by which expression of a prognostic or diagnostic indicator (biomarker or combination of biomarkers) is directly related to an associated disposition towards a given outcome. The skilled artisan is acquainted with the use of such nomograms to relate two numeric values with the understanding that the uncertainty in this measurement is the same as the uncertainty in the marker concentration because individual sample measurements are referenced, not population averages.
  • In some embodiments, a control sample is analyzed concurrently with the biological sample, such that the results obtained from the biological sample can be compared to the results obtained from the control sample. Additionally, it is contemplated that standard curves can be provided, with which assay results for the biological sample may be compared. Such standard curves present expression levels of a biomarker as a function of assay units, e.g., fluorescent signal intensity, if a fluorescent label is used. Using samples taken from multiple donors, standard curves can be provided for control expression of the one or more biomarkers in normal tissue, as well as for “at-risk” levels of the one or more biomarkers in tissue taken from donors with lung cancer.
  • The analysis of markers can be carried out separately or simultaneously with additional markers within one test sample. For example, several markers can be combined into one test for efficient processing of a multiple of samples and for potentially providing greater diagnostic and/or prognostic accuracy. In addition, one skilled in the art would recognize the value of testing multiple samples (for example, at successive time points) from the same subject. Such testing of serial samples can allow the identification of changes in marker expression over time. Changes in expression, as well as the absence of change in expression, can provide useful information about the disease status that includes, but is not limited to, identifying the approximate time from onset of the event, the presence and amount of salvageable tissue, the appropriateness of drug therapies, the effectiveness of various therapies, and identification of the subject's outcome, including risk of future events.
  • The analysis of biomarkers can be carried out in a variety of physical formats. For example, the use of microtiter plates or automation can be used to facilitate the processing of large numbers of test samples. Alternatively, single sample formats could be developed to facilitate immediate treatment and diagnosis in a timely fashion, for example, in ambulatory transport or emergency room settings.
  • In some embodiments, the subject is diagnosed as having lung cancer if, when compared to a control expression, there is a measurable difference in the expression of at least one biomarker in the sample. Conversely, when no change in expression is identified in the biological sample, the subject can be identified as not having lung cancer, not being at risk for the cancer, or as having a low risk of the cancer. In this regard, subjects having lung cancer or risk thereof can be differentiated from subjects having low to substantially no cancer or risk thereof. Those subjects having a risk of developing lung cancer can be placed on a more intensive and/or regular screening schedule. On the other hand, those subjects having low to substantially no risk may avoid being subjected to screening procedures, until such time as a future screening, for example, a screening conducted in accordance with the present technology, indicates that a risk of lung cancer has appeared in those subjects.
  • As mentioned above, depending on the embodiment of the method of the present technology, detecting a change in expression of the one or more biomarkers can be a qualitative determination or it can be a quantitative determination. As such, the step of diagnosing a subject as having, or at risk of developing, lung cancer indicates that certain threshold measurements are made, e.g., the expression of the one or more biomarkers in the biological sample varies from a predetermined control expression. In some embodiments of the method, the control expression is any detectable expression of the biomarker. In other embodiments of the method where a control sample is tested concurrently with the biological sample, the predetermined expression is the expression in the control sample. In other embodiments of the method, the predetermined expression is based upon and/or identified by a standard curve. In other embodiments of the method, the predetermined expression is a specifically state or range of state. As such, the predetermined expression can be chosen, within acceptable limits that will be apparent to those skilled in the art, based in part on the embodiment of the method being practiced and the desired specificity, etc.
  • Over recent years, it has become apparent that circulating epithelial cells, representing metastatic tumor cells, can be detected in the blood of many patients with cancer. Molecular profiling of rare cells is important in biological and clinical studies. Applications range from characterization of circulating epithelial cells (CEpCs) in the peripheral blood of cancer patients for disease prognosis and personalized treatment (See e.g., Cristofanilli M, et al. (2004) N Engl J Med 351:781-791; Hayes D F, et al. (2006) Clin Cancer Res 12:4218-4224; Budd G T, et al., (2006) Clin Cancer Res 12:6403-6409; Moreno J G, et al. (2005) Urology 65:713-718; Pantel et al., (2008) Nat Rev 8:329-340; and Cohen S J, et al. (2008) J Clin Oncol 26:3213-3221). Accordingly, embodiments of the present disclosure provide compositions and methods for detecting the presence of metastatic cancer in a subject by identifying the presence of expression markers in plasma or whole blood.
  • EXPERIMENTAL EXAMPLES Tissue Extraction.
  • Tissue samples were obtained from various commercial and non-commercial sources (Asuragen, BioServe, ConversantBio, Cureline, Mayo Clinic, M D Anderson, and PrecisionMed). Tissue sections were examined by a pathologist, who circled histologically distinct lesions to direct careful micro-dissection. Total nucleic acid extraction was performed using the Promega Maxwell RSC system. FFPE slides were scraped and extracted using the Maxwell® RSC DNA FFPE Kit (#AS1450) using the manufacturer's procedure but skipping the RNase digestion step. The same procedure was used for FFPE bulk curls. For frozen punch biopsy samples, a modified procedure using the lysis buffer from the RSC DNA FFPE kit with the Maxwell® RSC Blood DNA kit (#AS1400) was utilized omitting the RNase step. Prior to testing, samples were diluted 1:5 in 20 ng/μL tRNA in 10 mM TrisHCl, pH 8.0, 0.1 mM EDTA.
  • Gene Expression Markers
  • Gene expression markers tested comprised AKAP4GAGE12D, FAM83A, SFTPB (Pro-Surfactant B), XAGE-1D, CYP24A1, LRG1, and MAGEA4, and one reference gene expression were tested on lung cancer tissue samples. Expression of CASC3 was used as a reference marker.
  • Lung Tissue Samples
  • 127 cancer tissue samples and 119 normal lung tissue samples were tested. The tissue types tested are summarized in the following tables:
  • Cancer Tissue Subtypes N
    Adenocarcinoma 65
    Bronchioloalveolar 6
    Large cell carcinoma 13
    Neuroendocrine 2
    Small cell carcinoma 4
    Squamous cell carcinoma 34
    Unknown 3
  • Normal Tissue N
    Benign lung nodules 37
    Adjacent normal tissue 72
    COPD tissue 10
  • RT-QuARTS.
  • A QuARTS flap endonuclease assay reaction was modified to add a reverse transcription step. The assay probes were designed to span exon junctions so that the RT-QuARTS assay would specifically detect mRNA targets rather than the corresponding genomic loci. Briefly, the technique combines a reverse transcription step to convert the RNA target into a cDNA strand, a polymerase based target amplification and a simultaneous invasive cleavage signal amplification reaction (FIG. 1). The format results in a real-time accumulation of fluorescent signal in proportion to the amount of target mRNA. It produces a similar output to quantitative RT-PCR, but with the added sensitivity and specificity resulting from the addition of the invasive cleavage reaction. RT-QuARTS reactions comprising different amounts of Molony Murine Leukemia Virus (MMLV) reverse transcriptase and different dilutions of RNA were conducted using a reverse transcription reactions for 10 to 45 minutes. FIG. 2 provides a table comparing the results of the different reaction conditions.
  • Each triplex RT-QuARTS assay as describe below consists of one mRNA target reporting to FAM, one to HEX, and the reference mRNA to Quasar 670. Standard curves for each assay were generated by serially diluting known quantities of in vitro-produced transcripts for each marker. Standard curves were created by plotting Cp value by Log input strands. The resulting slope and intercept values were used to convert the Cp values of the unknown samples to mRNA strand values. Oligonucleotide sequences for the assays are shown in Table 1.
  • In vitro transcripts for each target were made from templates containing the DNA sequence amplified in the QuARTS reaction with additional flanking 5′ and 3′ sequences coupled to a T7 promoter. In vitro transcription was done using the T7 Ribomax system (Promega) and the resulting transcripts were quantitated with the Quant-iT RNA assay kit (Thermo Fisher Scientific).
  • Without reverse transcription, an exemplary QuARTS reaction typically comprises approximately 400-600 nmol/1 (e.g., 500 nmol/1) of each primer and detection probe, approximately 100 nmol/1 of the invasive oligonucleotide, approximately 600-700 nmol/1 of each FRET cassette (FAM, e.g., as supplied commercially by Hologic, Inc.; HEX, e.g., as supplied commercially by BioSearch Technologies; and Quasar 670, e.g., as supplied commercially by BioSearch Technologies), 6.675 ng/μl FEN-1 endonuclease (e.g., Cleavase® 2.0, Hologic, Inc.), 1 unit Taq DNA polymerase in a 30 μl reaction volume (e.g., GoTaq® DNA polymerase, Promega Corp., Madison, Wis.), 10 mmol/l 3-(n-morpholino) propanesulfonic acid (MOPS), 7.5 mmol/l MgCl2, and 250 μmol/l of each dNTP. Exemplary QuARTS cycling conditions are as shown in the table below. In some applications, analysis of the quantification cycle (Cg) provides a measure of the initial number of target DNA strands (e.g., copy number) in the sample.
  • RT-QuARTS reactions contained 20 U of MMLV reverse transcriptase (MMLV-RT), 219 ng of Cleavase® 2.0, 1.5 U of GoTaq® DNA Polymerase, 200 nM of each primer, 500 nM each of probe and FRET oligonucleotides, 10 mM MOPS buffer, pH7.5, 7.5 mM MgCl2, and 250 μM each dNTP. Reactions were run on a Roche LightCycler 480 system under the following conditions: 42° C. for 30 minutes (RT reaction), 95° C. for 3 min, 10 cycles of 95° C. for 20 seconds, 63° C. for 30 sec, 70° C. for 30 sec, followed by 35 cycles of 95° C. for 20 sec, 53° C. for 1 min, 70° C. for 30 sec, and hold at 40° C. for 30 sec.
  • RT-QuARTS with Multiplex Preamplification
  • In some embodiments, RT-QuARTS assays may comprise a step of multiplex pre-amplification, e.g., to pre-amplify 10, 12, or more targets in as sample. Multiplex pre-amplification for QuARTS assays is described, e.g., in U.S. Patent Appln. Ser. No. 62/249,097, filed Oct. 30, 2015, and 62/332,295, filed May 5, 2016, each of which is incorporated herein by reference.
  • An RT-pre-amplification is conducted in a reaction mixture containing, e.g., 20 U of MMLV reverse transcriptase, 1.5 U of GoTaq® DNA Polymerase, 10 mM MOPS buffer, pH7.5, 7.5 mM MgCl2, 250 μM each dNTP, and oligonucleotide primers, (e.g., for 12 targets, 12 primer pairs/24 primers, in equimolar amounts (e.g., 200 nM each primer), or with individual primer concentrations adjusted to balance amplification efficiencies of the different targets). Thermal cycling times and temperatures are selected to be appropriate for the volume of the reaction and the amplification vessel. For example, the reactions may be cycled as follows:
  • Stage Temp/Time # of Cycles
    RT 42° C./30′ 1
    95° C./3′ 1
    Amplification 1 95° C./20″ 10
    63° C./30″
    70° C./30″
    Cooling  4° C./Hold 1
  • After thermal cycling, aliquots of the pre-amplification reaction (e.g., 10 μL) are diluted to 500 μL in 10 mM Tris, 0.1 mM EDTA, with or without fish DNA. Aliquots of the diluted pre-amplified DNA (e.g., 10 μL) are used in QuARTS PCR-flap assay, e.g., as described above.
  • In some embodiments, DNA targets, e.g., methylated DNA marker genes, genes corresponding to the RNA marker, etc., may be amplified and detected along with the reverse-transcribed cDNAs in a QuARTS assay reaction. In some embodiments, DNA and cDNA are co-amplified and detected in a single-tube reaction, i.e., without the need to open the reaction vessel at any point between combining the reagents and collecting the output data. In other embodiments, marker DNA from the same sample or from a different sample, may be separately isolated, with or without a bisulfite conversion step, and may be combined with sample RNA in an RT-QuARTS assay. In yet other embodiments, RNA and/or DNA samples may be pre-amplified as described above.
  • The amplification primers and probes used for reverse transcription, amplification, and the flap endonuclease reactions that occur in the RT-QuARTS assay as described herein are shown in Table 1, below:
  • TABLE 1
    AKAP4 Forward 5′-GGACACTGAGAAGAAAGACCAGTC
    Primer (SEQ ID NO: 1)
    Reverse 5′-GGGAGCTTGTTTGAAAAGGCA
    Primer (SEQ ID NO: 2)
    Probe 5′-CCACGGACGCTAAGACAGAGG/3C6/
    (SEQ ID NO: 3)
    CASC3 Forward 5′-CTGCAACCACGGGAACTT
    Primer (SEQ ID NO: 4)
    Reverse 5′-GAGGTGGAGGTCCTGCTC
    Primer (SEQ ID NO: 5)
    Probe 5′-GACGCGGAGTCGAGGTATGCC/3C6/
    (SEQ ID NO: 6)
    CYP24A1 Forward 5′-CTTCAACTGCATTTGGCTCTTTG
    Primer (SEQ ID NO: 7)
    Reverse 5′-TGTGGCCTGGATGTCGT
    Primer (SEQ ID NO: 8)
    Probe 5′-CCACGGACGGTTGGATTGTCC/3C6/
    (SEQ ID NO: 9)
    FA483A Forward 5′-TGGAGATTTGTCCTGTCTGGATC
    Primer (SEQ ID NO: 10)
    Reverse 5′-CTTGGAGAGGATGTTCCGGT
    Primer (SEQ ID NO: 11)
    Probe 5′-CCACGGACGCTTACAGCTTCA/3C6/
    (SEQ ID NO: 12)
    GAGE12D Forward 5′-AGGGAGCATCTGCAGGTC
    Primer (SEQ ID NO: 13)
    Reverse 5′-CCTGTTCCTGGCTATGAGCTTC
    Primer (SEQ ID NO: 14)
    Probe 5′-CGCCGAGGCAAGGGCCGAAG/3C6/
    (SEQ ID NO: 15)
    LRG1 Forward 5′-GAGCAGACAGCGACCAAA
    Primer (SEQ ID NO: 16)
    Reverse 5′-CAGGAACAGAGTTCTAGAAACATGG
    Primer (SEQ ID NO: 17)
    Probe 5′-CCACGGACGAAAGCCCAGGGG/3C6/
    (SEQ ID NO: 18)
    MAGEA4 Forward 5′-AGAGGAGCACCAAGGAGAAGA
    Primer (SEQ ID NO: 19)
    Reverse 5′-GGCAAAAGCTGGGCAATGG
    Primer (SEQ ID NO: 20)
    Probe 5′-CGCCGAGGATCTGCCTGTGG/3C6/
    (SEQ ID NO: 21)
    SFTPB Forward 5′-GTCATCGACTACTTCCAGAACC
    Primer (SEQ ID NO: 22)
    Reverse 5′-AGGTGCATACAGATGCCG
    Primer (SEQ ID NO: 23)
    Probe 5′-CGCCGAGGCAGACTGACTCA/3C6/
    (SEQ ID NO: 24)
    XAGE1D Forward 5′-CCCAGGTGCTGGGAAGG
    Primer (SEQ ID NO: 25)
    Reverse 5′-ACTGATGCAGCTCTTGCAGA
    Primer (SEQ ID NO: 26)
    Probe 5′-CCACGGACGGGAAATGCGCGA/3C6/
    (SEQ ID NO: 27)
  • FIG. 3 shows exemplary standard curves for LRG-1 RNA at dilutions A-E, i.e., 10 to 100,000 copies per reaction of input RNA, in the RT-QuARTS assay as described above. The average number of RNA strands present as calculated from the fluorescence signal during amplification are shown under “Calc. Strands/Rxn” on the right half of panel A. The graph in panel C shows the fluorescence signal accumulation by cycle number for the reactions having the different indicated amounts of input RNA.
  • RT-QuARTs Quantitative Data Analysis for Marker Detection
  • Strand values for individual markers from the samples were determined by using the standard curves for each marker, as discussed above for the LRG-1 RNA. The strand numbers were divided by the CASC3 reference marker strand numbers determined in the same assay well to normalize for varying input RNA amounts. The resulting ratio was multiplied by 100 to generate the “% MARKER” value for each mRNA as shown in FIG. 4.
  • Receiver operating characteristic (ROC) curves were generated for different groupings of markers using JMP 11.0 software (SAS). The positive percent agreement (diagnostic sensitivity) was calculated by dividing the detected positives by the known lung cancer samples and multiplying by 100, and the negative percent agreement (diagnostic specificity) by dividing the detected negatives by the known normal controls and multiplying by 100.
  • FIG. 4 shows the signal measured from individual marker RNAs from cancer and normal samples. FIG. 5 shows the aggregate sensitivity and specificity for samples analyzed using the indicated combinations of mRNA markers.
  • Target mRNA sequences (showing T nucleotides in place of U nucleotides) are as follows:
  • AKAP4
    (SEQ ID NO: 28)
    >gi|21493038|ref|NM_139289.1| Homo sapiens A kinase (PRKA)
    anchor protein 4 (AKAP4), transcript variant 2, mRNA
    CAGGGGTGGCAGCCAACTGCAGGTGCCCAAGAACTTGGCACTTCTCAGTTCCATCTAAAGGGGC
    ACATCTCCCTTCTGGGTGTCACGTTTTCAGCCAAACATCTAAAAGAACTTCATCATCAAGATGT
    CTGATGATATTGACTGGTTACGCAGCCACAGGGGTGTGTGCAAGGTAGATCTCTACAACCCAGA
    AGGACAGCAAGATCAGGACCGGAAAGTGATATGCTTTGTCGATGTGTCCACCCTGAATGTAGAA
    GATAAAGATTACAAGGATGCTGCTAGTTCCAGCTCAGAAGGCAACTTAAACCTGGGAAGTCTGG
    AAGAAAAAGAGATTATCGTGATCAAGGACACTGAGAAGAAAGACCAGTCTAAGACAGAGGGATC
    TGTATGCCTTTTCAAACAAGCTCCCTCTGATCCTGTAAGTGTCCTCAACTGGCTTCTCAGTGAT
    CTCCAGAAGTATGCCTTGGGTTTCCAACATGCACTGAGCCCCTCAACCTCTACCTGTAAACATA
    AAGTAGGAGACACAGAGGGCGAATATCACAGAGCATCCTCTGAGAACTGCTACAGTGTCTATGC
    CGATCAAGTGAACATAGATTATTTGATGAACAGACCTCAAAACCTACGTCTAGAAATGACAGCA
    GCTAAAAACACCAACAATAATCAAAGTCCTTCAGCTCCTCCAGCCAAACCTCCTAGCACTCAGA
    GAGCAGTCATTTCCCCTGATGGAGAATGTTCTATAGATGACCTTTCCTTCTACGTCAACCGACT
    ATCTTCTCTGGTAATCCAGATGGCCCATAAGGAAATCAAGGAGAAGTTGGAAGGTAAAAGCAAA
    TGCCTTCATCATTCAATCTGTCCATCCCCTGGGAACAAAGAGAGAATCAGTCCCCGAACTCCTG
    CGAGCAAGATTGCTTCTGAAATGGCCTATGAAGCTGTGGAACTGACAGCTGCAGAAATGCGTGG
    CACTGGAGAGGAGTCCAGGGAAGGTGGCCAGAAAAGCTTTCTATATAGCGAATTATCCAACAAG
    AGCAAAAGTGGAGACAAACAGATGTCCCAGAGAGAGAGCAAAGAATTTGCAGATTCCATCAGCA
    AGGGGCTCATGGTTTATGCAAATCAGGTGGCATCTGACATGATGGTCTCTCTCATGAAGACCTT
    GAAAGTGCACAGCTCTGGGAAGCCAATTCCAGCATCTGTGGTCCTGAAGAGGGTGTTGCTAAGG
    CACACCAAGGAGATTGTGTCCGATTTGATTGATTCTTGCATGAAGAACCTGCATAATATTACTG
    GGGTCCTGATGACTGACTCAGACTTTGTCTCAGCTGTCAAGAGAAATCTGTTCAACCAGTGGAA
    ACAAAATGCTACAGACATCATGGAGGCCATGCTGAAGCGCTTGGTCAGTGCCCTTATAGGTGAG
    GAGAAGGAGACTAAGTCTCAGAGTCTGTCATATGCATCTTTAAAAGCTGGGTCCCATGATCCCA
    AATGCAGGAATCAGAGTCTTGAATTCTCCACCATGAAAGCTGAAATGAAAGAGAGGGACAAAGG
    CAAAATGAAATCAGACCCATGCAAGTCACTGACTAGTGCTGAGAAAGTCGGTGAACACATTCTC
    AAAGAGGGCCTAACCATCTGGAACCAAAAGCAAGGAAACTCATGCAAGGTGGCTACCAAAGCAT
    GCAGCAATAAAGATGAGAAAGGAGAAAAGATCAATGCTTCCACAGATTCACTGGCCAAGGACCT
    GATTGTCTCTGCCCTTAAGCTGATCCAGTACCATCTGACCCAGCAGACTAAGGGCAAAGATACA
    TGTGAAGAAGACTGTCCTGGTTCCACCATGGGCTATATGGCTCAGAGTACTCAATATGAAAAGT
    GTGGAGGTGGCCAAAGTGCCAAAGCACTTTCAGTGAAACAACTAGAATCTCACAGAGCCCCTGG
    ACCATCCACCTGTCAAAAGGAGAACCAACACCTGGACTCCCAGAAAATGGATATGTCAAACATC
    GTTCTAATGCTGATTCAGAAACTGCTTAATGAGAACCCCTTCAAATGTGAGGATCCATGCGAAG
    GTGAGAACAAGTGTTCTGAGCCCAGGGCAAGCAAAGCAGCTTCCATGTCCAACAGATCTGACAA
    AGCGGAAGAACAATGCCAGGAGCATCAAGAACTTGACTGTACCAGTGGGATGAAGCAAGCGAAC
    GGGCAATTTATAGATAAACTAGTAGAATCTGTGATGAAGCTCTGCCTTATCATGGCTAAGTATA
    GCAACGATGGGGCAGCCCTTGCTGAGTTGGAAGAACAAGCAGCCTCGGCAAATAAGCCCAATTT
    CAGGGGCACCAGATGCATTCACAGTGGTGCAATGCCACAGAACTATCAAGACTCTCTTGGACAT
    GAAGTAATTGTCAATAATCAGTGCTCTACAAATAGCTTGCAGAAGCAGCTCCAGGCTGTCCTGC
    AGTGGATTGCAGCCTCCCAGTTTAACGTGCCCATGCTCTACTTCATGGGAGATAAGGATGGACA
    ACTGGAAAAGCTTCCTCAGGTTTCAGCTAAAGCAGCAGAGAAGGGGTACAGTGTAGGAGGTCTT
    CTTCAAGAGGTCATGAAGTTTGCCAAGGAACGGCAACCAGATGAAGCTGTGGGAAAGGTGGCCA
    GGAAACAGTTGCTGGACTGGCTGCTCGCTAACCTGTGAGCTGATCCTTGACTCCTCTTCATCTT
    AGCCCCCCTAGCAGCATTCCATCCCAGCCAGAGCACCCCCACCATCAGGCCAGTCAACTGCACA
    ATACACAACTGTATTTCCCAATACACTTGAGCAGTTGCCTGTGAATGTAAGAGGTGTCAACAAA
    CTGGGAAATAAAATAAAAAAAAATAATAAAAAAAAAAAAAAAAAAAAAAAA
    CASC3
    (SEQ ID NO: 29)
    >gi|102468569|ref|NM_007359.4| Homo sapiens cancer
    susceptibility candidate 3 (CASC3), mRNA
    AATCCGGGTCGGCCGCAAACGTGCCGCAGGCCTAGGCCCCGCCCAGTGCCCCGCCCCTCCCCCA
    ACACACACACACACACACACACACACACACACCCCAACACACACACACACACCCCAACACACAC
    ACACACACACACACACACACACACACACACACACACACACACACACACAGCGGGATGGCCGAGC
    GCCGCACGCGTAGCACGCCGGGACTAGCTATCCAGCCTCCCAGCAGCCTCTGCGACGGGCGCGG
    TGCGTAAGTACCTCGCCGGTGGTGGCCGTTCTCCGTAAGATGGCGGACCGGCGGCGGCAGCGCG
    CTTCGCAAGACACCGAGGACGAGGAATCTGGTGCTTCGGGCTCCGACAGCGGCGGCTCCCCGTT
    GCGGGGAGGCGGGAGCTGCAGCGGTAGCGCCGGAGGCGGCGGCAGCGGCTCTCTGCCTTCACAG
    CGCGGAGGCCGAACCGGGGCCCTTCATCTGCGGCGGGTGGAGAGCGGGGGCGCCAAGAGTGCTG
    AGGAGTCGGAGTGTGAGAGTGAAGATGGCATTGAAGGTGATGCTGTTCTCTCGGATTATGAAAG
    TGCAGAAGACTCGGAAGGTGAAGAAGGTGAATACAGTGAAGAGGAAAACTCCAAAGTGGAGCTG
    AAATCAGAAGCTAATGATGCTGTTAATTCTTCAACAAAAGAAGAGAAGGGAGAAGAAAAGCCTG
    ACACCAAAAGCACTGTGACTGGAGAGAGGCAAAGTGGGGACGGACAGGAGAGCACAGAGCCTGT
    GGAGAACAAAGTGGGTAAAAAGGGCCCTAAGCATTTGGATGATGATGAAGATCGGAAGAATCCA
    GCATACATACCTCGGAAAGGGCTCTTCTTTGAGCATGATCTTCGAGGGCAAACTCAGGAGGAGG
    AAGTCAGACCCAAGGGGCGTCAGCGAAAGCTATGGAAGGATGAGGGTCGCTGGGAGCATGACAA
    GTTCCGGGAAGATGAGCAGGCCCCAAAGTCCCGACAGGAGCTCATTGCTCTTTATGGTTATGAC
    ATTCGCTCAGCTCATAATCCTGATGACATCAAACCTCGAAGAATCCGGAAACCCCGATATGGGA
    GTCCTCCACAAAGAGATCCAAACTGGAACGGTGAGCGGCTAAACAAGTCTCATCGCCACCAGGG
    TCTTGGGGGCACCCTACCACCAAGGACATTTATTAACAGGAATGCTGCAGGTACCGGCCGTATG
    TCTGCACCCAGGAATTATTCTCGATCTGGGGGCTTCAAGGAAGGTCGTGCTGGTTTTAGGCCTG
    TGGAAGCTGGTGGGCAGCATGGTGGCCGGTCTGGTGAGACTGTTAAGCATGAGATTAGTTACCG
    GTCACGGCGCCTAGAGCAGACTTCTGTGAGGGATCCATCTCCAGAAGCAGATGCTCCAGTGCTT
    GGCAGTCCTGAGAAGGAAGAGGCAGCCTCAGAGCCACCAGCTGCTGCTCCTGATGCTGCACCAC
    CACCCCCTGATAGGCCCATTGAGAAGAAATCCTATTCCCGGGCAAGAAGAACTCGAACCAAAGT
    TGGAGATGCAGTCAAGCTTGCAGAGGAGGTGCCCCCTCCTCCTGAAGGACTGATTCCAGCACCT
    CCAGTCCCAGAAACCACCCCAACTCCACCTACTAAGACTGGGACCTGGGAAGCTCCGGTGGATT
    CTAGTACAAGTGGACTTGAGCAAGATGTGGCACAACTAAATATAGCAGAACAGAATTGGAGTCC
    GGGGCAGCCTTCTTTCCTGCAACCACGGGAACTTCGAGGTATGCCCAACCATATACACATGGGA
    GCAGGACCTCCACCTCAGTTTAACCGGATGGAAGAAATGGGTGTCCAGGGTGGTCGAGCCAAAC
    GCTATTCATCCCAGCGGCAAAGACCTGTGCCAGAGCCCCCCGCCCCTCCAGTGCATATCAGTAT
    CATGGAGGGACATTACTATGATCCACTGCAGTTCCAGGGACCAATCTATACCCATGGTGACAGC
    CCTGCCCCGCTGCCTCCACAGGGCATGCTTGTGCAGCCAGGAATGAACCTTCCCCACCCAGGTT
    TACATCCCCACCAGACACCAGCTCCTCTGCCCAATCCAGGCCTCTATCCCCCACCAGTGTCCAT
    GTCTCCAGGACAGCCACCACCTCAGCAGTTGCTTGCTCCTACTTACTTTTCTGCTCCAGGCGTC
    ATGAACTTTGGTAATCCCAGTTACCCTTATGCTCCAGGGGCACTGCCTCCCCCACCACCGCCTC
    ATCTGTATCCTAATACACAGGCCCCATCACAGGTATATGGAGGAGTGACCTACTATAACCCCGC
    CCAGCAGCAGGTGCAGCCAAAGCCCTCCCCACCCCGGAGGACTCCCCAGCCAGTCACCATCAAG
    CCCCCTCCACCTGAGGTTGTAAGCAGGGGTTCCAGTTAATACAAGTTTCTGAATATTTTAAATC
    TTAACATCATATAAAAAGCAGCAGAGGTGAGAACTCAGAAGAGAAATACAGCTGGCTATCTACT
    ACCAGAAGGGCTTCAAAGATATAGGGTGTGGCTCCTACCAGCAAACAGCTGAAAGAGGAGGACC
    CCTGCCTTCCTCTGAGGACAGGCTCTAGAGAGAGGGAGAAACAAGTGGACCTCGTCCCATCTTC
    ACTCTTCACTTGAGTTGGCTGTGTTCGGGGGAGCAGAGAGAGCCAGACAGCCCCAAGCTTCTGA
    GTCTAGATACAGAAGCCCATGTCTTCTGCTGTTCTTCACTTCTGGGAAATTGAAGTGTCTTCTG
    TTCCCAAGGAAGCTCCTTCCTGTTTGTTTTGTTTTCTAAGATGTTCATTTTTAAAGCCTGGCTT
    CTTATCCTTAATATTATTTTAATTTTTTCTCTTTGTTTCTGTTTCTTGCTCTCTCTCCCTGCCT
    TTAAATGAAACAAGTCTAGTCTTCTGGTTTTCTAGCCCCTCTGGATTCCCTTTTGACTCTTCCG
    TGCATCCCAGATAATGGAGAATGTATCAGCCAGCCTTCCCCACCAAGTCTAAAAAGACCTGGCC
    TTTCACTTTTAGTTGGCATTTGTTATCCTCTTGTATACTTGTATTCCCTTAACTCTAACCCTGT
    GGAAGCATGGCTGTCTGCACAGAGGGTCCCATTGTGCAGAAAAGCTCAGAGTAGGTGGGTAGGA
    GCCCTTCTCTTTGACTTAGGTTTTTAGGAGTCTGAGCATCCATCAATACCTGTACTATGATGGG
    CTTCTGTTCTCTGCTGAGGGCCAATACCCTACTGTGGGGAGAGATGGCACACCAGATGCTTTTG
    TGAGAAAGGGATGGTGGAGTGAGAGCCTTTGCCTTTAGGGGTGTGTATTCACATAGTCCTCAGG
    GCTCAGTCTTTTGAGGTAAGTGGAATTAGAGGGCCTTGCTTCTCTTCTTTCCATTCTTCTTGCT
    ACACCCCTTTTCCAGTTGCTGTGGACCAATGCATCTCTTTAAAGGCAAATATTATCCAGCAAGC
    AGTCTACCCTGTCCTTTGCAATTGCTCTTCTCCACGTCTTTCCTGCTACAAGTGTTTTAGATGT
    TACTACCTTATTTTCCCCGAATTCTATTTTTGTCCTTGCAGACAGAATATAAAAACTCCTGGGC
    TTAAGGCCTAAGGAAGCCAGTCACCTTCTGGGCAAGGGCTCCTATCTTTCCTCCCTATCCATGG
    CACTAAACCACTTCTCTGCTGCCTCTGTGGAAGAGATTCCTATTACTGCAGTACATACGTCTGC
    CAGGGGTAACCTGGCCACTGTCCCTGTCCTTCTACAGAACCTGAGGGCAAAGATGGTGGCTGTG
    TCTCTCCCCGGTAATGTCACTGTTTTTATTCCTTCCATCTAGCAGCTGGCCTAATCACTCTGAG
    TCACAGGTGTGGGATGGAGAGTGGGGAGAGGCACTTAATCTGTAACCCCCAAGGAGGAAATAAC
    TAAGAGATTCTTCTAGGGGTAGCTGGTGGTTGTGCCTTTTGTAGGCTGTTCCCTTTGCCTTAAA
    CCTGAAGATGTCTCCTCAAGCCTGTGGGCAGCATGCCCAGATTCCCAGACCTTAAGACACTGTG
    AGAGTTGTCTCTGTTGGTCCACTGTGTTTAGTTGCAAGGATTTTTCCATGTGTGGTGGTGTTTT
    TTGTTACTGTTTTAAAGGGTGCCCATTTGTGATCAGCATTGTGACTTGGAGATAATAAAATTTA
    GACTATAACTTGGCTCCCTAAAAAAAAAAAAAAAAAAA
    CYP24A1
    (SEQ ID NO: 30)
    >gi|193083115|ref|NM_000782.4| Homo sapiens cytochrome P450,
    family 24, subfamily A, polypeptide 1 (CYP24A1), transcript
    variant 1, mRNA
    GACAGGAGGAAACGCAGCGCCAGCAGCATCTCATCTACCCTCCTTGACACCTCCCCGTGGCTCC
    AGCCAGACCCTAGAGGTCAGCCTTGCGGACCAACAGGAGGACTCCCAGCTTTCCCTTTTCAAGA
    GGTCCCCAGACACCGGCCACCCTCTTCCAGCCCCTGCGGCCAGTGCAAGGAGGCACCAATGCTC
    TGAGGCTGTCGCGTGGTGCAGCGTCGAGCATCCTCGCCGAGGTCCTTTCTGCTGCCTGTCCCGC
    CTCACCCCGCTCCATCACACCAGCTGGCCCTCTTTGCTTCCTTTTCCCAGAATCGTTAAGCCCC
    GACTCCCACTAGCACCTCGTACCAACCTCGCCCCACCCCATCCTCCTGCCTTCCCGCGCTCCGG
    TGTCCCCCGCTGCCATGAGCTCCCCCATCAGCAAGAGCCGCTCGCTTGCCGCCTTCCTGCAGCA
    GCTGCGCAGTCCGAGGCAGCCCCCGAGACTGGTGACATCTACGGCGTACACGTCCCCTCAGCCG
    CGAGAGGTGCCAGTCTGCCCGCTGACAGCTGGTGGCGAGACTCAGAACGCGGCCGCCCTGCCGG
    GCCCCACCAGCTGGCCACTGCTGGGCAGCCTGCTGCAGATTCTCTGGAAAGGGGGTCTCAAGAA
    ACAGCACGACACCCTGGTGGAGTACCACAAGAAGTATGGCAAGATTTTCCGCATGAAGTTGGGT
    TCCTTTGAGTCGGTGCACCTGGGCTCGCCATGCCTGCTGGAAGCGCTGTACCGCACCGAGAGCG
    CGTACCCGCAGCGGCTGGAGATCAAACCGTGGAAGGCCTATCGCGACTACCGCAAAGAAGGCTA
    CGGGCTGCTGATCCTGGAAGGGGAAGACTGGCAGCGGGTCCGGAGTGCCTTTCAAAAGAAACTA
    ATGAAACCAGGGGAAGTGATGAAGCTGGACAACAAAATCAATGAGGTCTTGGCCGATTTTATGG
    GCAGAATAGATGAGCTCTGTGATGAAAGAGGCCACGTTGAAGACTTGTACAGCGAACTGAACAA
    ATGGTCGTTTGAAAGTATCTGCCTCGTGTTGTATGAGAAGAGATTTGGGCTTCTCCAGAAGAAT
    GCAGGGGATGAAGCTGTGAACTTCATCATGGCCATCAAAACAATGATGAGCACGTTTGGGAGGA
    TGATGGTCACTCCAGTCGAGCTGCACAAGAGCCTCAACACCAAGGTCTGGCAGGACCACACTCT
    GGCCTGGGACACCATTTTCAAATCAGTCAAAGCTTGTATCGACAACCGGTTAGAGAAGTATTCT
    CAGCAGCCTAGTGCAGATTTCCTTTGTGACATTTATCACCAGAATCGGCTTTCAAAGAAAGAAT
    TGTATGCTGCTGTCACAGAGCTCCAGCTGGCTGCGGTGGAAACGACAGCAAACAGTCTAATGTG
    GATTCTCTACAATTTATCCCGTAATCCCCAAGTGCAACAAAAGCTTCTTAAGGAAATTCAAAGT
    GTATTACCTGAGAATCAGGTGCCACGGGCAGAAGATTTGAGGAATATGCCGTATTTAAAAGCCT
    GTCTGAAAGAATCTATGAGGCTTACGCCGAGTGTACCATTTACAACTCGGACTCTTGACAAGGC
    AACAGTTCTGGGTGAATATGCTTTACCCAAAGGAACAGTGCTCATGCTAAATACCCAGGTGTTG
    GGATCCAGTGAAGACAATTTTGAAGATTCAAGTCAGTTTAGACCTGAACGTTGGCTTCAGGAGA
    AGGAAAAAATTAATCCTTTTGCGCATCTTCCATTTGGCGTTGGAAAAAGAATGTGCATTGGTCG
    CCGATTAGCAGAGCTTCAACTGCATTTGGCTCTTTGTTGGATTGTCCGCAAATACGACATCCAG
    GCCACAGACAATGAGCCTGTTGAGATGCTACACTCAGGCACCCTGGTGCCCAGCCGGGAACTCC
    CCATCGCGTTTTGCCAGCGATAATACGCCTCAGATGGTGGTATTTGCTAACATCATATCCAACT
    CAGGGAAGCGGACTGAGTGCTGGGATCCAAGGCATTCTACAGGGTTCACTGCTGGTTTACACTT
    CACCTGTGTCAGCACCATCTTCAGGTGCTTAGAATGGCCTGGGAGCCTGTTCTGTCTTGCATCT
    TCCATGACATGAAAGGGAGGCTGGCACTTGTCAGTCAGGTAGAGGTTACAAACCGTTTCAGGCC
    CTGCCTACCACATTCACTGTTTGAATCTTTAATTCCCAAGAATAAGTTTACATTTCACAATGAA
    TGACCTACAACAGCTAAATTTTCTGGGGCTGGGAGTAATACTGACAATCCATTTACTGTAGCTC
    TGCTTAATGTACTACTTAGGAAAATGTCCCTGCTTAATAATGTAAGCCAAGCTAAATGATGGTT
    AAAGTTATCAGGCCTCCCATGAAATTGCGTTCTTCCTGCATTGAAATAAAAACATTATTGGGAA
    ACTAGAGAACACCTCTATTTTTAAAAGGACTTTAACGAAGTCAAACAACTTATAAGACTAGTGA
    TTCACTGGGGCATTATTTTGTTAGAGGACCTTAAAATTGTTTATTTTTTAAATGTGATTCCTTT
    ATGGCATTAGGGTAAAGATGAAGCAATAATTTTTAAATTGTGTATGTGCATATGAAGCACAGAC
    ATGCATGTGTGTGTGTGTCTGTGTGTGTGTGTCCGTGTATGTGTGTGTGGGTTCTAATGGTAAT
    TTGCCTCAGTCATTTTTTTAATATTTGCAGTACTTGATTTAGGATCTGTGGTGCAGGGCAATGT
    TTCAAAGTTTAGTCACAGCTTAAAAACATTCAGTGTGACTTTAATATTATAAAATGATTTCCCA
    TGCCATAATTTTTCTGTCTATTAAATGGGACAAGTGTAAAGCATGCAAAAGTTAGAGATCTGTT
    ATATAACATTTGTTTTGTGATTTGAACTCCTAGGAAAAATATGATTTCATAAATGTAAAATGCA
    CAGAAATGCATGCAATACTTATAAGACTTAAAAATTGTGTTTACAGATGGTTTATTTGTGCATA
    TTTTTACTACTGCTTTTCCTAAATGCATACTGTATATAATTCTGTGTATTTGATAAATATTTCT
    TCCTACATTATATTTTTAGAATATTTCAGAAATATACATTTATGTCTTTATATTGTAATAAATA
    TGTACATATCTAGGTATATGCTTTCTCTCTGCTGTGAAATTATTTTTAGAATTATAAATTCACG
    TCTTGTCAGATTTCATCTGTATACCTTCAAATTCTCTGAAAGTAAAAATAAAAGTTTTTAAATA
    TTAAAAAAAAAAAAAAAAAAAAA
    FAM83A
    (SEQ ID NO: 31)
    >gi|767953716|ref|XM_005251087.2| PREDICTED: Homo sapiens family
    with sequence similarity 83, member A (FAM83A), transcript
    variant X1, mRNA
    AGGAAATATCCCATGGCTGACTGTGCCAAGGAGGTGTCTGAGCCAGCCCTCCCGGCCCGAGGGC
    AGGGCAGGTGGCCCTGAGAGATAAGCCAATCCCGCAGCTGCAGATGAGGAGTTCTGAGAAGCAT
    TGCTCAGGACAGCGGTAAATCACTTCTTGGAGGTGCCCTGCACGCCGGTCCTGGGAGCAGGCGG
    CCTCCCGGGGGTGCGGGAGCCCCACTCCTCCGTGGTGTGTTCCATTTGCTTCCCACATCTGGAG
    GAGCTGACGTGCCAGCCTCCCCCAGCACCACCCAGGGACGGGAGGCATGAGCCGGTCAAGGCAC
    CTGGGCAAAATCCGGAAGCGTCTGGAAGATGTCAAGAGCCAGTGGGTCCGGCCAGCCAGGGCTG
    ACTTTAGTGACAACGAGAGTGCCCGGCTGGCCACGGACGCCCTCTTGGATGGGGGTTCTGAAGC
    CTACTGGCGGGTGCTCAGCCAGGAAGGCGAGGTGGACTTCTTGTCCTCGGTGGAGGCCCAGTAC
    ATCCAGGCCCAGGCCAGGGAGCCCCCGTGTCCCCCAGACACCCTGGGAGGGGCGGAAGCAGGCC
    CTAAGGGACTGGACTCCAGCTCCCTACAGTCCGGCACCTACTTCCCTGTGGCCTCAGAGGGCAG
    CGAGCCGGCCCTACTGCACAGCTGGGCCTCAGCTGAGAAGCCCTACCTGAAGGAAAAATCCAGC
    GCCACTGTGTACTTCCAGACCGTCAAGCACAACAACATCAGAGACCTCGTCCGCCGCTGCATCA
    CCCGGACTAGCCAGAACATTTCCATCCGGAGTGTGGAAGGAGAGATATACTGTGCCAAGTCAGG
    CAGGAAATTCGCTGGCCAAATCCGGGAGAAGTTCATCATCTCGGACTGGAGATTTGTCCTGTCT
    GGATCTTACAGCTTCACCTGGCTCTGCGGACACGTGCACCGGAACATCCTCTCCAAGTTCACAG
    GCCAGGCGGTGGAGCTGTTTGACGAGGAGTTCCGCCACCTCTACGCCTCCTCCAAGCCTGTGAT
    GGGCCTGAAGTCCCCGCGGCTGGTCGCCCCCGTCCCGCCCGGAGCAGCCCCGGCCAATGGCCGC
    CTTAGCAGCAGCAGTGGCTCCGCCAGTGACCGCACGTCCTCCAACCCCTTCAGCGGCCGCTCGG
    CAGGCAGCCACCCCGGTACCCGAAGTGTGTCCGCGTCTTCAGGGCCCTGTAGCCCCGCGGCCCC
    ACACCCGCCTCCACCGCCCCGGTTCCAGCCCCACCAAGGCCCTTGGGGAGCCCCGAGTCCCCAG
    GCCCACCTCTCCCCGCGGCCCCACGACGGCCCGCCCGCCGCTGTCTACAGCAACCTGGGGGCCT
    ACAGGCCCACGCGGCTGCAGCTGGAGCAGCTGGGCCTGGTGCCGAGGCTGACTCCAACCTGGAG
    GCCCTTCCTGCAGGCCTCCCCTCACTTCTGAAGGTCCCATCCCCTGCTGCCCTCCGCAGGCCCA
    GGGCTGGGCACTCCCTGAGACCCAAAGACCCACCTCAACGACGAGTGGCGTTGAGCCACTTCCC
    TTTGAAAAGACACTCAAAATCACTGCCATGGTTCAATGTTCCCAGGCCCCAGGCCATCCACTTG
    CCGGCCCCCACCAGTTCTTGGGTTCCCCGCTCTAGTTTGACCTGTGCAGCACATTCCAGAAGGT
    TCCAGGGAGGTTGTGGGGCAGCTAGAGGACAAAATCATGAAAACAGAGTCCCTGTCTTCCAGAG
    ATCATCCGGGGCTTTAATATTAATGGCCCCCAAAACTCCGTAAGAAGCAGGAAATGCAGCCCAA
    GTTTTACAAATGGGTAAACAGAGGCACTGAGAGATAGATGGTAGTTTGGTACTTCTGGTTCCCA
    GTGCCCAGGAATGGTCCACTCCCAAGAAATTCAGGAAAGAAAGACTGAGGAGAAGGTGTGGGAA
    CATTCTGGATGTTTCGGGAGAGTTGGGGAAACTCCTCCTCTTAGGAAAGGCTAATACTAGGGTA
    TCCTTGGGCCCAATGAATTAGGGGTGAGGCCCCAGAACCCGTTATCTATGAGTTGTATGGGGGA
    GCCATCTGAAGCTGTAGCCACCAGGGATGCAGCTAGCTGAGGAGTTTGGGGTGTTGGGTTGGAC
    AAGGCAGGTTAGTAGACTCAGATTCTTGCTTCAAAGAGCCTTGGGCTGGCCTGGAGGTCCCTGG
    AGTCTAGACTGGACCTAGGAGCTTGAGTTGTCAGGGGCCAGGACTGGCCCCACTGCAGTGCCCA
    GGCCAGTCTTGAGCAGCAGGGAGGGCTCAGCTGTCCCCAGATCCAGGTGCCTCTGACCAGCCTG
    GTCACCTCCTGAGGAATAAATGCTGAACCTCACAAGCCCCATCATTCATTTCTTCTCAATTCAC
    AGTGCCCCTCTTTGTTTCTGGGGTGGAACTAGGTCCTGAGGGCACAGCCTAGCTGAGTGCAAAG
    AAATATAGGATGCTTAGAAAGCATACAGGAGGGGCCAGGCGTGGTGGCTCATGCCTGTAATCCC
    AGAACTTTGGGATGCCAAGGTGGTTGGATTACCTGAGATCAGGTGGATTACCTGGTCTCGAGAC
    CAGCCTGACCAATATGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGGCTGAGACAGGA
    GAATTGCTTGAACCCAGGAAGCAGAGGTTGCAATGAGCTGAGATTGCATCACTGCACTCCAGCA
    TGGGCAACAAAGCAAGACTCCGTCACAGAAAAAAAAAAAAAAAAGAGAGAGAGCATAGAGGAGG
    GTTGGCCAGCCCTGTGGTGGGTGGGATGTCAGAGACACTTCCCAGATAAAGTAAGAGTTAACCC
    TGCACCTCAGGTGTGATAGTGGGGTCAGTGGTATGTGATCCAGGCTGGGGAGCCAGAGGGGAGC
    AGGTGCCAACTCCACATCCTTCTCCTGTTTCTAGGCCCTCTCCTCCCTTGTCGGTTTTTGGCGG
    GGAAGCTCAGCCTTCGCTGTGGAGGGACGAGAGCACAGAGCTCTTCCTCCTGGTGGCCTCTGAC
    CCCTGACGGCCTGTGGCATCCTCCCTAGTCCCCTCTGCCCATCCATCCCTCTGTTCCAATTCTC
    CACTGCTCCCAGCATGATCTGGGGCATCTTGGCTTCTGGTTTCTTTTATTATTATTATTATTAT
    TAATTATTGTATTCCTGTCCTTCACTTTTTTCCTCCTTAGTTCCTGAAAGTAAACAAAACAAAA
    CAAAAACAAAAAAACAAACAACACTTTGGTTCCTGATGGCTTTCTGAACCCAGCCCTGACCTTG
    TTGTTTCACAGCTGACGGCTGAGATGAGGTTAGAATGACTGGGCCCGGCTGAACATTCCAAATT
    GGATTTCACCATCTGCTGAGAAAGTTTAAGGAAGGCAAAGCTTGCCAGGTCACAGAAGCTCCCA
    AGCCCAGCTTTCCAAAGGCCTCAGCCTGTGCCTGTGTCGAGCTCAGTCCTGGGAGATAGGGGAG
    AACCTGCAGGCAGGAACAAGCCCCCCTACTCCTGACCACCCTCCATCAGCAGTCTCCCCTCCGT
    GGTCGTCTTTGTTGACAAAGGTGCAGTTTCTCCTCTCCTGGGCACCTGTAACATGTGATGCGCT
    GCCTGCTGGGAGGTTAGGTCGGGGCTGCCCCGGCGAGTGGAGCATGAGCAGAACCGCCGAGGGT
    CACTTCTGGGCAGAAGCTTTGAGAGCCTGGGTCCAGGTTGCCACATAGAAGCAGCTCTCCAGTT
    GAAACCCTCCTCTGCCAGCCTGGGGTCCTAAGCGATGAGCAGAATCCCCCACTCCCACCCCACC
    AACCCACAATGGATATGTAGTGAGCAAGAAATAAACCTTTGTTGTTTAAGCCA
    GAGE12D
    (SEQ ID NO: 32)
    gi|187608822|ref|NM_001127199.1| Homo sapiens G antigen 12D
    (GAGE12D), mRNA
    GTTCACTGGGCGTCTTCTGCCCGGCCCCTTCGCCCACGTGAAGAACGCCAGGGAGCTGTGAGGC
    AGTGCTGTGTGGTTCCTGCCGTCCGGACTCTTTTTCCTCTACTGAGATTCATCTGTGTGAAATA
    TGAGTTGGCGAGGAAGATCGACCTATTATTGGCCTAGACCAAGGCGCTATGTACAGCCTCCTGA
    AATGATTGGGCCTATGCGGCCCGAGCAGTTCAGTGATGAAGTGGAACCAGCAACACCTGAAGAA
    GGGGAACCAGCAACTCAATGTCAGGATCCTGCAGCTGCTCAGGAGGGAGAGGATGAGGGAGCAT
    CTGCAGGTCAAGGGCCGAAGCCTGAAGCTCATAGCCAGGAACAGGGTCACCCACAGACTGGGTG
    TGAGTGTGAAGATGGTCCTGATGGGCAGGAGATGGACCCGCCAAATCCAGAGGAGGTGAAAACG
    CCTGAAGAAGGTGAAAAGCAATCACAGTGTTAAAAGAAGACACGTTGAAATGATGCAGGCTGCT
    CCTATGTTGGAAATTTGTTCATTAAAATTCTCCCAATAAAGCTTTACAGCCTTCTGCAAAGAAG
    TCTTGCGCA
    LRG1
    (SEQ ID NO: 33)
    gi|49574519|ref|NM_052972.2| Homo sapiens leucine-rich alpha-2-
    glycoprotein 1 (LRG1), mRNA
    GCAGAGCTACCATGTCCTCTTGGAGCAGACAGCGACCAAAAAGCCCAGGGGGCATTCAACCCCA
    TGTTTCTAGAACTCTGTTCCTGCTGCTGCTGTTGGCAGCCTCAGCCTGGGGGGTCACCCTGAGC
    CCCAAAGACTGCCAGGTGTTCCGCTCAGACCATGGCAGCTCCATCTCCTGTCAACCACCTGCCG
    AAATCCCCGGCTACCTGCCAGCCGACACCGTGCACCTGGCCGTGGAATTCTTCAACCTGACCCA
    CCTGCCAGCCAACCTCCTCCAGGGCGCCTCTAAGCTCCAAGAATTGCACCTCTCCAGCAATGGG
    CTGGAAAGCCTCTCGCCCGAATTCCTGCGGCCAGTGCCGCAGCTGAGGGTGCTGGATCTAACCC
    GAAACGCCCTGACCGGGCTGCCCCCGGGCCTCTTCCAGGCCTCAGCCACCCTGGACACCCTGGT
    ATTGAAAGAAAACCAGCTGGAGGTCCTGGAGGTCTCGTGGCTACACGGCCTGAAAGCTCTGGGG
    CATCTGGACCTGTCTGGGAACCGCCTCCGGAAACTGCCCCCCGGGCTGCTGGCCAACTTCACCC
    TCCTGCGCACCCTTGACCTTGGGGAGAACCAGTTGGAGACCTTGCCACCTGACCTCCTGAGGGG
    TCCGCTGCAATTAGAACGGCTACATCTAGAAGGCAACAAATTGCAAGTACTGGGAAAAGATCTC
    CTCTTGCCGCAGCCGGACCTGCGCTACCTCTTCCTGAACGGCAACAAGCTGGCCAGGGTGGCAG
    CCGGTGCCTTCCAGGGCCTGCGGCAGCTGGACATGCTGGACCTCTCCAATAACTCACTGGCCAG
    CGTGCCCGAGGGGCTCTGGGCATCCCTAGGGCAGCCAAACTGGGACATGCGGGATGGCTTCGAC
    ATCTCCGGCAACCCCTGGATCTGTGACCAGAACCTGAGCGACCTCTATCGTTGGCTTCAGGCCC
    AAAAAGACAAGATGTTTTCCCAGAATGACACGCGCTGTGCTGGGCCTGAAGCCGTGAAGGGCCA
    GACGCTCCTGGCAGTGGCCAAGTCCCAGTGAGACCAGGGGCTTGGGTTGAGGGTGGGGGGTCTG
    GTAGAACACTGCAACCCGCTTAACAAATAATCCTGCCTTTGGCCGGGTGCGGGGGCTCACGCCT
    GTAATCCCAGCACTTTGGGAGGCCCAGGTGGGCGGATCACGAGGTCAGGAGATCGAGACCATCT
    TGGCTAACATGGTGAAACCCTGTCTCTACTAAAAATATAAAAAATTAGCCAGGCGTGGTGGTGG
    GCACCTGTAGTCCCAGCAACTCGGGAGGCTGAGGCAGGAGAATGGCGTGAACTTGGGAGGCGGA
    GCTTGCGGTGAGCCAAGATCGTGCCACTGCACTCTAGCCTGGGCGACAGAGCAAGACTGTCTCA
    AAAAAATTAAAATTAAAATTAAAAACAAATAATCCTGCCTTTTACAGGTGAAACTCGGGGCTGT
    CCATAGCGGCTGGGACCCCGTTTCATCCATCCATGCTTCCTAGAACACACGATGGGCTTTCCTT
    ACCCATGCCCAAGGTGTGCCCTCCGTCTGGAATGCCGTTCCCTGTTTCCCAGATCTCTTGAACT
    CTGGGTTCTCCCAGCCCCTTGTCCTTCCTTCCAGCTGAGCCCTGGCCACACTGGGGCTGCCTTT
    CTCTGACTCTGTCTTCCCCAAGTCAGGGGGCTCTCTGAGTGCAGGGTCTGATGCTGAGTCCCAC
    TTAGCTTGGGGTCAGAACCAAGGGGTTTAATAAATAACCCTTGAAAACTGGA
    MAGEA4
    (SEQ ID NO: 34)
    >gi|58530866|ref|NM_001011548.1| Homo sapiens MAGE family member
    A4 (MAGEA4), transcript variant 1, mRNA
    AGAGACAAGCGAGCTTCTGCGTCTGACTCGCAGCTTGAGACTGGCGGAGGGAAGCCCGCCCAGG
    CTCTATAAGGAGACAAGGTTCTGAGCAGACAGGCCAACCGGAGGACAGGATTCCCTGGAGGCCA
    CAGAGGAGCACCAAGGAGAAGATCTGCCTGTGGGTCCCCATTGCCCAGCTTTTGCCTGCACTCT
    TGCCTGCTGCCCTGACCAGAGTCATCATGTCTTCTGAGCAGAAGAGTCAGCACTGCAAGCCTGA
    GGAAGGCGTTGAGGCCCAAGAAGAGGCCCTGGGCCTGGTGGGTGCACAGGCTCCTACTACTGAG
    GAGCAGGAGGCTGCTGTCTCCTCCTCCTCTCCTCTGGTCCCTGGCACCCTGGAGGAAGTGCCTG
    CTGCTGAGTCAGCAGGTCCTCCCCAGAGTCCTCAGGGAGCCTCTGCCTTACCCACTACCATCAG
    CTTCACTTGCTGGAGGCAACCCAATGAGGGTTCCAGCAGCCAAGAAGAGGAGGGGCCAAGCACC
    TCGCCTGACGCAGAGTCCTTGTTCCGAGAAGCACTCAGTAACAAGGTGGATGAGTTGGCTCATT
    TTCTGCTCCGCAAGTATCGAGCCAAGGAGCTGGTCACAAAGGCAGAAATGCTGGAGAGAGTCAT
    CAAAAATTACAAGCGCTGCTTTCCTGTGATCTTCGGCAAAGCCTCCGAGTCCCTGAAGATGATC
    TTTGGCATTGACGTGAAGGAAGTGGACCCCGCCAGCAACACCTACACCCTTGTCACCTGCCTGG
    GCCTTTCCTATGATGGCCTGCTGGGTAATAATCAGATCTTTCCCAAGACAGGCCTTCTGATAAT
    CGTCCTGGGCACAATTGCAATGGAGGGCGACAGCGCCTCTGAGGAGGAAATCTGGGAGGAGCTG
    GGTGTGATGGGGGTGTATGATGGGAGGGAGCACACTGTCTATGGGGAGCCCAGGAAACTGCTCA
    CCCAAGATTGGGTGCAGGAAAACTACCTGGAGTACCGGCAGGTACCCGGCAGTAATCCTGCGCG
    CTATGAGTTCCTGTGGGGTCCAAGGGCTCTGGCTGAAACCAGCTATGTGAAAGTCCTGGAGCAT
    GTGGTCAGGGTCAATGCAAGAGTTCGCATTGCCTACCCATCCCTGCGTGAAGCAGCTTTGTTAG
    AGGAGGAAGAGGGAGTCTGAGCATGAGTTGCAGCCAGGGCTGTGGGGAAGGGGCAGGGCTGGGC
    CAGTGCATCTAACAGCCCTGTGCAGCAGCTTCCCTTGCCTCGTGTAACATGAGGCCCATTCTTC
    ACTCTGTTTGAAGAAAATAGTCAGTGTTCTTAGTAGTGGGTTTCTATTTTGTTGGATGACTTGG
    AGATTTATCTCTGTTTCCTTTTACAATTGTTGAAATGTTCCTTTTAATGGATGGTTGAATTAAC
    TTCAGCATCCAAGTTTATGAATCGTAGTTAACGTATATTGCTGTTAATATAGTTTAGGAGTAAG
    AGTCTTGTTTTTTATTCAGATTGGGAAATCCGTTCTATTTTGTGAATTTGGGACATAATAACAG
    CAGTGGAGTAAGTATTTAGAAGTGTGAATTCACCGTGAAATAGGTGAGATAAATTAAAAGATAC
    TTAATTCCCGCCTTATGCCTCAGTCTATTCTGTAAAATTTAAAAAATATATATGCATACCTGGA
    TTTCCTTGGCTTCGTGAATGTAAGAGAAATTAAATCTGAATAAATAATTCTTTCTGTTAA
    SFTPB
    (SEQ ID NO: 35)
    >gi|288856298|ref|NM_000542.3| Homo sapiens surfactant protein B
    (SFTPB), transcript variant 1, mRNA
    TGTAAATGCTCTTCTGACTAATGCAAACCATGTGTCCATAGAACCAGAAGATTTTTCCAGGGGA
    AAAGAGCCCCCACGCCCCGCCCAGCTATAAGGGGCCATGCACCAAGCAGGGTACCCAGGCTGCA
    GAGGTGCCATGGCTGAGTCACACCTGCTGCAGTGGCTGCTGCTGCTGCTGCCCACGCTCTGTGG
    CCCAGGCACTGCTGCCTGGACCACCTCATCCTTGGCCTGTGCCCAGGGCCCTGAGTTCTGGTGC
    CAAAGCCTGGAGCAAGCATTGCAGTGCAGAGCCCTAGGGCATTGCCTACAGGAAGTCTGGGGAC
    ATGTGGGAGCCGATGACCTATGCCAAGAGTGTGAGGACATCGTCCACATCCTTAACAAGATGGC
    CAAGGAGGCCATTTTCCAGGACACGATGAGGAAGTTCCTGGAGCAGGAGTGCAACGTCCTCCCC
    TTGAAGCTGCTCATGCCCCAGTGCAACCAAGTGCTTGACGACTACTTCCCCCTGGTCATCGACT
    ACTTCCAGAACCAGACTGACTCAAACGGCATCTGTATGCACCTGGGCCTGTGCAAATCCCGGCA
    GCCAGAGCCAGAGCAGGAGCCAGGGATGTCAGACCCCCTGCCCAAACCTCTGCGGGACCCTCTG
    CCAGACCCTCTGCTGGACAAGCTCGTCCTCCCTGTGCTGCCCGGGGCCCTCCAGGCGAGGCCTG
    GGCCTCACACACAGGATCTCTCCGAGCAGCAATTCCCCATTCCTCTCCCCTATTGCTGGCTCTG
    CAGGGCTCTGATCAAGCGGATCCAAGCCATGATTCCCAAGGGTGCGCTAGCTGTGGCAGTGGCC
    CAGGTGTGCCGCGTGGTACCTCTGGTGGCGGGCGGCATCTGCCAGTGCCTGGCTGAGCGCTACT
    CCGTCATCCTGCTCGACACGCTGCTGGGCCGCATGCTGCCCCAGCTGGTCTGCCGCCTCGTCCT
    CCGGTGCTCCATGGATGACAGCGCTGGCCCAAGGTCGCCGACAGGAGAATGGCTGCCGCGAGAC
    TCTGAGTGCCACCTCTGCATGTCCGTGACCACCCAGGCCGGGAACAGCAGCGAGCAGGCCATAC
    CACAGGCAATGCTCCAGGCCTGTGTTGGCTCCTGGCTGGACAGGGAAAAGTGCAAGCAATTTGT
    GGAGCAGCACACGCCCCAGCTGCTGACCCTGGTGCCCAGGGGCTGGGATGCCCACACCACCTGC
    CAGGCCCTCGGGGTGTGTGGGACCATGTCCAGCCCTCTCCAGTGTATCCACAGCCCCGACCTTT
    GATGAGAACTCAGCTGTCCAGCTGCAAAGGAAAAGCCAAGTGAGACGGGCTCTGGGACCATGGT
    GACCAGGCTCTTCCCCTGCTCCCTGGCCCTCGCCAGCTGCCAGGCTGAAAAGAAGCCTCAGCTC
    CCACACCGCCCTCCTCACCGCCCTTCCTCGGCAGTCACTTCCACTGGTGGACCACGGGCCCCCA
    GCCCTGTGTCGGCCTTGTCTGTCTCAGCTCAACCACAGTCTGACACCAGAGCCCACTTCCATCC
    TCTCTGGTGTGAGGCACAGCGAGGGCAGCATCTGGAGGAGCTCTGCAGCCTCCACACCTACCAC
    GACCTCCCAGGGCTGGGCTCAGGAAAAACCAGCCACTGCTTTACAGGACAGGGGGTTGAAGCTG
    AGCCCCGCCTCACACCCACCCCCATGCACTCAAAGATTGGATTTTACAGCTACTTGCAATTCAA
    AATTCAGAAGAATAAAAAATGGGAACATACAGAACTCTAAAAGATAGACATCAGAAATTGTTAA
    GTTAAGCTTTTTCAAAAAATCAGCAATTCCCCAGCGTAGTCAAGGGTGGACACTGCACGCTCTG
    GCATGATGGGATGGCGACCGGGCAAGCTTTCTTCCTCGAGATGCTCTGCTGCTTGAGAGCTATT
    GCTTTGTTAAGATATAAAAAGGGGTTTCTTTTTGTCTTTCTGTAAGGTGGACTTCCAGCTTTTG
    ATTGAAAGTCCTAGGGTGATTCTATTTCTGCTGTGATTTATCTGCTGAAAGCTCAGCTGGGGTT
    GTGCAAGCTAGGGACCCATTCCTGTGTAATACAATGTCTGCACCAATGCTAATAAAGTCCTATT
    CTCTTTTATGAGAAAGAAAAAGACACCGTCCTTTAAAGTGCTGCAGTATGGCCAGACGTGGTGG
    CTCACACCTGCAATCCCAGCACCTTAGGAGGCCGAGGCAGGAGGATCCTTGAGGTCAGGAGTTC
    GAGACCAGCCTCGCCAACATGGTGAAACCCCATTTCTACTAAAAATACAAAAAATTAGCCAAGT
    GTGGTGGCATATGCCTGTAATCCCAACTACTCAGAAGGCCGAGGCAGGAGAATTACTTGAACGC
    AGGAGAATCACTGCAGCCCAGGAGGCAGAGGTTGCAGTGAGCCGAGATTGCACCACTGCACTCC
    AGCCTGGGTGACAGAGCAAGACTCCATCTCAGTAAATAAATAAATAAATAAAAAGCGCTGCAGT
    AGCTGTGGCCTCACCCTGAAGTCAGCGGGCCCAGGCCTACCTCACTCTCTCCCTTGGCAGAGAA
    GCAGACGTCCATAGCTCCTCTCCCTCACAAGCGCTCCCAGCCTGCCCTCCAGCTGCTGCTCTCC
    CCTCCCAGTCTCTACTCACTGGGATGAGGTTAGGTCATGAGGACACCAAAAACCTAAAAATAAA
    CAAAAAGCCAAACAAGCCTTAGCTTTTCTTAAAGACTGAAATGCCTGGAAGTGTCCCTTTATTT
    ATAAAATAACTTTTGTCATATTTCTTATACATGTTTCTTGTAAGAAATTCAGAAACTACAGACA
    AAGAGAGTGGAAATTACCCACTGTCAGGCCTCTGAGCCCAAGCTAAGCCATCATATCCCCTGTG
    CCCTGCACGTATACACCCAGATGGCCTGAAGCAACTGAAGATCCACAAAAGAAGTGAAAATAGC
    CAGTTCCTGCCTTAACTGATGACATTCCACCATTGTGATTTGTTCCTGCCCCACCCTAACTGAT
    CAATTGACCTTGTGACAATACACCTTCCCCACCCTTGAGAAGGTGCTTTGTAATATTCTCCCCA
    CCCACCCCACGCCCGCACCCCCGCACCCTTAAGAAGGTATTTTGTAATATTCTCTCCGCCATTG
    AGAATGTGCTTTGTAAGATCCACCCCCTGCCCACAAAAAATTGCTCCTAACTCCACCGCCTATC
    CCAAACCTACAAGAACTAATGATAATCCCACCACCCTTTGCTGACTCTTTTTGGACTCAGCCCA
    CCTGCACCCAGGTGATTAAAAAGCTTTATTGTTCACACAAAGCCTGTTTGGTAGTCTCTTCACA
    GGGAAGCATGTGACACCCACAATCCCACCTAGCCCAGGAGAGAGCTACGGCAGGGTGTGTGTTT
    TGACACTGAGCTTGGGGCTTTTTCCATCTTCTCCCCACAGCCTCTGGCTCCACACCTCCACCGT
    TCAAGCGCCAGAAAGAGCTGTCTATGCAGCCTGCTCTTGGGCCTGGGGATGAGACACACAATTC
    ATTGGCTCCTGGATTTTAAGTAGACATTTGTAAATCTATAGCTAACTACTGTCCTTAAAGCCAT
    TGTTTCCATTACAAAATCCAACTCTCTGAGAGAAAAGGGTGTTTTAAATTTAAAAAAATAAAAA
    CAAAAAAGTTTGATTGAGAAAAAAAAAAAAAAA
    XAGE-1d
    (SEQ ID NO: 36)
    >gi|18157207|emb|AJ318879.1| Homo sapiens mRNA for XAGE-1d
    protein
    GGGAACGCGGCGGAGCTGTGAGCCGGCGACTCGGGTCCCTGAGGTCTGGATTCTTTCTCCGCTA
    CTGAGACACGGCGGACACACACAAACACAGAACCACACAGCCAGTCCCAGGAGCCCAGTAATGG
    AGAGCCCCAAAAAGAAGAACCAGCAGCTGAAAGTCGGGATCCTACACCTGGGCAGCAGACAGAA
    GAAGATCAGGATACAGCTGAGATCCCAGGTGCTGGGAAGGGAAATGCGCGACATGGAAGGTGAT
    CTGCAAGAGCTGCATCAGTCAAACACCGGGGATAAATCTGGATTTGGGTTCCGGCGTCAAGGTG
    AAGATAATACCTAAAGAGGAACACTGTAAAATGCCAGAAGCAGGTGAAGAGCAACCACAAGTTT
    AAATGAAGACAAGCTGAAACAACGCAAGCTGGTTTTATATTAGATATTTGACTTAAACTATCTC
    AATAAAGTTTTGCAGCTTTCACCAAAAAAAAAA
  • All literature and similar materials cited in this application, including but not limited to, patents, patent applications, articles, books, treatises, and internet web pages are expressly incorporated by reference in their entirety for any purpose. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as is commonly understood by one of ordinary skill in the art to which the various embodiments described herein belongs. When definitions of terms in incorporated references appear to differ from the definitions provided in the present teachings, the definition provided in the present teachings shall control.
  • Various modifications and variations of the described compositions, methods, and uses of the technology will be apparent to those skilled in the art without departing from the scope and spirit of the technology as described. Although the technology has been described in connection with specific exemplary embodiments, it should be understood that the invention as claimed should not be unduly limited to such specific embodiments. Indeed, various modifications of the described modes for carrying out the invention that are obvious to those skilled in pharmacology, biochemistry, medical science, or related fields are intended to be within the scope of the following claims.

Claims (27)

1. A method of screening for a lung neoplasm in a sample obtained from a subject, the method comprising:
a) assaying a sample from a subject for an amount of at least one RNA marker selected from the group consisting of GAGE12D, FAM83A, LRG1, XAGE-1 d, MAGEA4, SFTPB, AKAP4, and CYP24A1 in a sample obtained from a subject;
b) assaying said sample for an amount of reference marker in said sample;
c) comparing the amount of said at least one RNA marker to the amount of reference marker in said sample to determine a level of expression for said at least one marker gene in said sample; and
d) generating a record reporting the expression for said at least one marker gene in said sample.
2. The method of claim 1, wherein said assaying comprises obtaining a sample comprising RNA from a subject and treating the RNA with a reverse transcriptase to form a cDNA copy of at least a portion of said RNA.
3. The method of claim 1, wherein said at least one RNA marker is at least two of said markers.
4. The method of claim 1, wherein said at least one RNA marker comprises the group consisting of GAGE, FAM83A, LRG1 and MAGEA4.
5. The method of claim 1, wherein said at least one RNA marker comprises the group consisting of GAGE, FAM83A, LRG1, CYP24A1, XAGE1D and MAGEA4.
6. The method of claim 1, wherein said reference marker is an RNA selected from the group consisting of CASC3 mRNA, β-actin mRNA, U1 snRNA and U6 snRNA.
7. The method of claim 1, wherein the assaying comprises using polymerase chain reaction, nucleic acid sequencing, mass spectrometry, mass-based separation, a flap endonuclease assay, and/or target capture.
8-9. (canceled)
10. The method of claim 1, wherein assaying the expression of the RNA marker comprises detecting an increased or decreased expression of the RNA marker relative to a normal expression of the marker.
11. The method of claim 1 wherein the sample is a tissue sample, a blood sample, a serum sample, or a sputum sample.
12-13. (canceled)
14. A kit, comprising:
a) at least one oligonucleotide, wherein at least a portion of said oligonucleotide specifically hybridizes to a marker selected from the group consisting of GAGE12D, FAM83A, LRG1, XAGE-1 d, MAGEA4, SFTPB, AKAP4, and CYP24A1, and
b) at least one additional oligonucleotide, wherein at least a portion of said additional oligonucleotide specifically hybridizes to a reference nucleic acid.
15. The kit of claim 14, wherein said kit comprises at least two additional oligonucleotides.
16. The kit of claim 14, wherein said kit further comprises one or more components selected from the group consisting of reverse transcriptase, flap endonuclease, DNA polymerase, and a FRET cassette.
17-18. (canceled)
19. The kit of claim 16, wherein said kit comprises at least 4 oligonucleotides, wherein each of the markers in the group consisting of GAGE, FAM83A, LRG1, and MAGEA4 specifically hybridizes to at least one of said 4 oligonucleotides.
20. (canceled)
21. The kit of claim 16, wherein said oligonucleotide is selected from one or more of a capture oligonucleotide, a pair of nucleic acid primers, a nucleic acid probe, and an invasive oligonucleotide.
22. The kit of claim 14, wherein said reference marker is an RNA selected from the group consisting of CASC3, β-actin, U1 RNA and U6 RNA.
23. A composition comprising a reaction mixture comprising a complex of at least one RNA marker selected from the group consisting of GAGE12D, FAM83A, LRG1, XAGE-1 d, MAGEA4, SFTPB, AKAP4, and CYP24A1 and an oligonucleotide that specifically hybridizes to said RNA marker.
24. The composition of claim 23, further comprising a complex of at least one reference marker and an oligonucleotide that specifically hybridizes to said reference marker.
25-26. (canceled)
27. The composition of claim 24, wherein said reference marker is an RNA selected from the group consisting of CASC3, β-actin, U1 RNA and U6 RNA.
28. The composition of claim 23, wherein said oligonucleotide is selected from one or more of a capture oligonucleotide, a pair of nucleic acid primers, a nucleic acid probe, and an invasive oligonucleotide.
29. The composition of claim 23, further comprising one or more components selected from the group consisting of reverse transcriptase, flap endonuclease, thermostable DNA polymerase, and a FRET cassette.
30. The composition of claim 23, wherein said composition comprises a nucleic acid probe oligonucleotide comprising a reporter molecule.
31-33. (canceled)
US15/587,806 2016-05-05 2017-05-05 Detection of lung neoplasia by amplification of rna sequences Abandoned US20170321286A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
US15/587,806 US20170321286A1 (en) 2016-05-05 2017-05-05 Detection of lung neoplasia by amplification of rna sequences
US17/078,924 US11479823B2 (en) 2016-05-05 2020-10-23 Detection of lung neoplasia by amplification of RNA sequences
US17/932,574 US20230121671A1 (en) 2016-05-05 2022-09-15 Detection of lung neoplasia by amplification of rna sequences

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201662332419P 2016-05-05 2016-05-05
US15/587,806 US20170321286A1 (en) 2016-05-05 2017-05-05 Detection of lung neoplasia by amplification of rna sequences

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US17/078,924 Continuation US11479823B2 (en) 2016-05-05 2020-10-23 Detection of lung neoplasia by amplification of RNA sequences

Publications (1)

Publication Number Publication Date
US20170321286A1 true US20170321286A1 (en) 2017-11-09

Family

ID=60243468

Family Applications (3)

Application Number Title Priority Date Filing Date
US15/587,806 Abandoned US20170321286A1 (en) 2016-05-05 2017-05-05 Detection of lung neoplasia by amplification of rna sequences
US17/078,924 Active US11479823B2 (en) 2016-05-05 2020-10-23 Detection of lung neoplasia by amplification of RNA sequences
US17/932,574 Pending US20230121671A1 (en) 2016-05-05 2022-09-15 Detection of lung neoplasia by amplification of rna sequences

Family Applications After (2)

Application Number Title Priority Date Filing Date
US17/078,924 Active US11479823B2 (en) 2016-05-05 2020-10-23 Detection of lung neoplasia by amplification of RNA sequences
US17/932,574 Pending US20230121671A1 (en) 2016-05-05 2022-09-15 Detection of lung neoplasia by amplification of rna sequences

Country Status (1)

Country Link
US (3) US20170321286A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11674168B2 (en) 2015-10-30 2023-06-13 Exact Sciences Corporation Isolation and detection of DNA from plasma

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110052488A1 (en) * 2009-09-03 2011-03-03 Genentech, Inc. Methods For Treating, Diagnosing, and Monitoring Rheumatoid Arthritis
US20140087382A1 (en) * 2012-09-25 2014-03-27 Exact Sciences Corporation Normalization of polymerase activity

Family Cites Families (50)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5288609A (en) 1984-04-27 1994-02-22 Enzo Diagnostics, Inc. Capture sandwich hybridization method and composition
US4965188A (en) 1986-08-22 1990-10-23 Cetus Corporation Process for amplifying, detecting, and/or cloning nucleic acid sequences using a thermostable enzyme
US4683195A (en) 1986-01-30 1987-07-28 Cetus Corporation Process for amplifying, detecting, and/or-cloning nucleic acid sequences
US4683202A (en) 1985-03-28 1987-07-28 Cetus Corporation Process for amplifying nucleic acid sequences
US5011769A (en) 1985-12-05 1991-04-30 Meiogenics U.S. Limited Partnership Methods for detecting nucleic acid sequences
US5359100A (en) 1987-10-15 1994-10-25 Chiron Corporation Bifunctional blocked phosphoramidites useful in making nucleic acid mutimers
US5124246A (en) 1987-10-15 1992-06-23 Chiron Corporation Nucleic acid multimers and amplified nucleic acid hybridization assays using same
US5403711A (en) 1987-11-30 1995-04-04 University Of Iowa Research Foundation Nucleic acid hybridization and amplification method for detection of specific sequences in which a complementary labeled nucleic acid probe is cleaved
CA1340807C (en) 1988-02-24 1999-11-02 Lawrence T. Malek Nucleic acid amplification process
US5639611A (en) 1988-12-12 1997-06-17 City Of Hope Allele specific polymerase chain reaction
CA2036946C (en) 1990-04-06 2001-10-16 Kenneth V. Deugau Indexing linkers
US5494810A (en) 1990-05-03 1996-02-27 Cornell Research Foundation, Inc. Thermostable ligase-mediated DNA amplifications system for the detection of genetic disease
US5849481A (en) 1990-07-27 1998-12-15 Chiron Corporation Nucleic acid hybridization assays employing large comb-type branched polynucleotides
US6872816B1 (en) 1996-01-24 2005-03-29 Third Wave Technologies, Inc. Nucleic acid detection kits
US5846717A (en) 1996-01-24 1998-12-08 Third Wave Technologies, Inc. Detection of nucleic acid sequences by invader-directed cleavage
US5994069A (en) 1996-01-24 1999-11-30 Third Wave Technologies, Inc. Detection of nucleic acids by multiple sequential invasive cleavages
US5338671A (en) 1992-10-07 1994-08-16 Eastman Kodak Company DNA amplification with thermostable DNA polymerase and polymerase inhibiting antibody
WO1995014106A2 (en) 1993-11-17 1995-05-26 Id Biomedical Corporation Cycling probe cleavage detection of nucleic acid sequences
CA2188660C (en) 1994-04-25 2005-01-18 Richard G. H. Cotton Detection of mutation by resolvase cleavage
US5851770A (en) 1994-04-25 1998-12-22 Variagenics, Inc. Detection of mismatches by resolvase cleavage using a magnetic bead support
EP0842294B1 (en) 1994-12-23 2002-10-23 Dade Behring Inc. Detection of nucleic acids by nuclease-catalyzed product formation
US5882867A (en) 1995-06-07 1999-03-16 Dade Behring Marburg Gmbh Detection of nucleic acids by formation of template-dependent product
US5773258A (en) 1995-08-25 1998-06-30 Roche Molecular Systems, Inc. Nucleic acid amplification using a reversibly inactivated thermostable enzyme
US5854033A (en) 1995-11-21 1998-12-29 Yale University Rolling circle replication reporter systems
US5965408A (en) 1996-07-09 1999-10-12 Diversa Corporation Method of DNA reassembly by interrupting synthesis
JP2001517925A (en) 1995-12-22 2001-10-09 ベーリングウエルケ、アクティエンゲゼルシャフト Homogeneous amplification and detection of nucleic acids
US5985557A (en) 1996-01-24 1999-11-16 Third Wave Technologies, Inc. Invasive cleavage of nucleic acids
EP0892808B1 (en) 1996-04-12 2008-05-14 PHRI Properties, Inc. Detection probes, kits and assays
US6096273A (en) 1996-11-05 2000-08-01 Clinical Micro Sensors Electrodes linked via conductive oligomers to nucleic acids
US6013459A (en) 1997-06-12 2000-01-11 Clinical Micro Sensors, Inc. Detection of analytes using reorganization energy
US6063573A (en) 1998-01-27 2000-05-16 Clinical Micro Sensors, Inc. Cycling probe technology using electron transfer detection
EP1053352B1 (en) 1998-02-04 2002-09-18 Variagenics, Inc. Mismatch detection techniques
US6235502B1 (en) 1998-09-18 2001-05-22 Molecular Staging Inc. Methods for selectively isolating DNA using rolling circle amplification
US6630333B1 (en) 1999-03-23 2003-10-07 Invitrogen Corporation Substantially pure reverse transriptases and methods of production thereof
US6528254B1 (en) 1999-10-29 2003-03-04 Stratagene Methods for detection of a target nucleic acid sequence
EP1341929A2 (en) 2000-11-15 2003-09-10 Third Wave Technologies, Inc. Fen endonucleases
US7662594B2 (en) 2002-09-20 2010-02-16 New England Biolabs, Inc. Helicase-dependent amplification of RNA
US8206904B2 (en) 2002-12-18 2012-06-26 Third Wave Technologies, Inc. Detection of nucleic acids
EP1664077B1 (en) 2003-09-05 2016-04-13 Trustees of Boston University Method for non-invasive prenatal diagnosis
WO2006050499A2 (en) 2004-11-03 2006-05-11 Third Wave Technologies, Inc. Single step detection assay
US7888017B2 (en) 2006-02-02 2011-02-15 The Board Of Trustees Of The Leland Stanford Junior University Non-invasive fetal genetic screening by digital analysis
KR101419980B1 (en) 2008-03-15 2014-07-15 홀로직, 인크. Compositions and methods for analysis of nucleic acid molecules during amplification reactions
WO2010120554A1 (en) * 2009-04-01 2010-10-21 The Regents Of The University Of California Detecting and treating breast cancer resistance to egfr inhibitors
US8715937B2 (en) 2010-11-15 2014-05-06 Exact Sciences Corporation Mutation detection assay
US8916344B2 (en) 2010-11-15 2014-12-23 Exact Sciences Corporation Methylation assay
US8361720B2 (en) 2010-11-15 2013-01-29 Exact Sciences Corporation Real time cleavage assay
US20140363471A1 (en) * 2013-06-10 2014-12-11 National Tsing Hua University Method of inducing autophagy and activating toll-like receptor
US10822638B2 (en) 2015-10-30 2020-11-03 Exact Sciences Development Company, Llc Isolation and detection of DNA from plasma
US10385406B2 (en) 2016-05-05 2019-08-20 Exact Sciences Development Company, Llc Detection of lung neoplasia by analysis of methylated DNA
US11118228B2 (en) 2017-01-27 2021-09-14 Exact Sciences Development Company, Llc Detection of colon neoplasia by analysis of methylated DNA

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110052488A1 (en) * 2009-09-03 2011-03-03 Genentech, Inc. Methods For Treating, Diagnosing, and Monitoring Rheumatoid Arthritis
US20140087382A1 (en) * 2012-09-25 2014-03-27 Exact Sciences Corporation Normalization of polymerase activity

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11674168B2 (en) 2015-10-30 2023-06-13 Exact Sciences Corporation Isolation and detection of DNA from plasma

Also Published As

Publication number Publication date
US20210155993A1 (en) 2021-05-27
US11479823B2 (en) 2022-10-25
US20230121671A1 (en) 2023-04-20

Similar Documents

Publication Publication Date Title
US20220056526A1 (en) Detection of colon neoplasia by analysis of methylated dna
US11028447B2 (en) Detection of neoplasia by analysis of methylated dna
US20220136058A1 (en) Characterizing methylated dna, rna, and proteins in the detection of lung neoplasia
US20230366034A1 (en) Compositions and methods for diagnosing lung cancers using gene expression profiles
US20150072866A1 (en) Marker panel for detecting cancer
JP2019162102A (en) System and method of detecting rnas altered by cancer in peripheral blood
US20220403471A1 (en) Characterizing methylated dna, rna, and proteins in subjects suspected of having lung neoplasia
US20230121671A1 (en) Detection of lung neoplasia by amplification of rna sequences
US20210079479A1 (en) Compostions and methods for diagnosing lung cancers using gene expression profiles
WO2023212628A1 (en) Low-bias sequential multiplex amplification assays

Legal Events

Date Code Title Description
AS Assignment

Owner name: EXACT SCIENCES CORPORATION, WISCONSIN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:ALLAWI, HATIM;LIDGARD, GRAHAM P.;KRUEGER, CHATEEN;AND OTHERS;REEL/FRAME:043463/0587

Effective date: 20160607

AS Assignment

Owner name: EXACT SCIENCES DEVELOPMENT COMPANY, LLC, WISCONSIN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:EXACT SCIENCES CORPORATION;REEL/FRAME:044119/0001

Effective date: 20171001

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: FINAL REJECTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: FINAL REJECTION MAILED

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION

AS Assignment

Owner name: EXACT SCIENCES CORPORATION, WISCONSIN

Free format text: MERGER;ASSIGNOR:EXACT SCIENCES DEVELOPMENT COMPANY, LLC;REEL/FRAME:058738/0465

Effective date: 20220101