CN114277092B - RNA virus macro transcriptome database building method based on nanopore sequencing platform and application - Google Patents

RNA virus macro transcriptome database building method based on nanopore sequencing platform and application Download PDF

Info

Publication number
CN114277092B
CN114277092B CN202111473132.2A CN202111473132A CN114277092B CN 114277092 B CN114277092 B CN 114277092B CN 202111473132 A CN202111473132 A CN 202111473132A CN 114277092 B CN114277092 B CN 114277092B
Authority
CN
China
Prior art keywords
rna
rna virus
macro
virus
library
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202111473132.2A
Other languages
Chinese (zh)
Other versions
CN114277092A (en
Inventor
周水莲
潘吾思
戴岩
李振
童桢开
李诗濛
任用
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu Xiansheng Diagnostic Medical Instrument Co ltd
Jiangsu Xiansheng Diagnostic Technology Co ltd
Jiangsu Xiansheng Medical Devices Co ltd
Original Assignee
Jiangsu Xiansheng Diagnostic Medical Instrument Co ltd
Jiangsu Xiansheng Diagnostic Technology Co ltd
Jiangsu Xiansheng Medical Devices Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu Xiansheng Diagnostic Medical Instrument Co ltd, Jiangsu Xiansheng Diagnostic Technology Co ltd, Jiangsu Xiansheng Medical Devices Co ltd filed Critical Jiangsu Xiansheng Diagnostic Medical Instrument Co ltd
Priority to CN202111473132.2A priority Critical patent/CN114277092B/en
Publication of CN114277092A publication Critical patent/CN114277092A/en
Application granted granted Critical
Publication of CN114277092B publication Critical patent/CN114277092B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention belongs to the field of gene sequencing, and particularly relates to a RNA virus macro-transcriptome library building method of a nanopore sequencing platform. Aiming at the problems that a nanopore sequencing platform is low in flux, a clinical sample is high in host content, and total RNA contains a large amount of ribosomal RNA, incomplete host cell DNA removal and ribosomal RNA removal are carried out on the clinical sample through reverse enrichment and the like; aiming at the problems of low viral load of clinical samples and high diversity of RNA viral genomes, cDNA is positively enriched by adopting single primer isothermal amplification and the like. According to the invention, through forward and reverse enrichment steps, a nanopore sequencing platform RNA virus macro-transcription set-up library detection process is successfully established, rapid clinical identification of unknown RNA virus pathogens is satisfied, and the method is suitable for popularization and application.

Description

RNA virus macro transcriptome database building method based on nanopore sequencing platform and application
Technical Field
The invention belongs to the field of gene sequencing, and particularly relates to a RNA virus macro-transcriptome database building method of a nanopore sequencing platform and application thereof.
Background
Infectious diseases are a major cause of human illness and death and have been of great clinical interest. In most cases of severe infection, RNA viruses are the major pathogens, such as the global pandemic of seasonal influenza and the worldwide pandemic of the novel coronavirus (SARS-CoV-2), posing serious challenges to the world's economy and human health, making accurate and rapid identification of the pathogen a focus of attention.
The main methods for identifying viruses in the traditional clinical microbiological laboratory are virus separation culture, immunofluorescence detection, PCR molecular detection and other technologies, but the methods have limitations. The virus isolation culture and immunofluorescence detection are low in sensitivity and time-consuming; PCR molecular detection is based only on differential diagnosis by clinicians for specific pathogen types, and the detection capability of rare and novel viral pathogens is limited. And the RNA virus in a sample can be comprehensively detected without bias by the macrotranscriptome sequencing of the RNA virus, rare and novel pathogens which cannot be detected by the traditional means can be found, and more comprehensive and accurate reference is provided for clinical decision. In addition, the macro-transcriptome sequencing of the RNA virus can further research the immune response of the virus and a host, provides possibility for searching disease markers and promotes the development of accurate diagnosis and treatment of infectious diseases.
Compared with the traditional etiology identification method, the metagenome sequencing identification method has the advantages of short identification period, low requirement on the technical level of operation and identification personnel and the like. And the metagenomic sequencing overcomes the defects of the traditional etiological diagnosis, namely a series of differential diagnosis made by a clinician according to the clinical manifestations of patients, and is increasingly applied to the identification of microorganisms, particularly pathogenic microorganisms of unknown causes. However, the existing second-generation sequencing-based technology has long sequencing analysis period, long reading length and large instrument investment, which limits the feasibility of the application of the technology in clinical rapid detection. The third-generation sequencing technology PacBio is greatly improved in the aspect of sequencing reading, but the library building process is complex, the defect of long sequencing period exists like the second-generation sequencing technology, the data downloading is completed within dozens of hours, and the subsequent analysis time is added, so that the clinical rapid identification is difficult to meet.
The nanopore sequencing technology developed in recent years can make up for the disadvantages of other sequencing platforms, has the characteristics of reading of a sequencing super-long sequence, real-time data generation and biological information analysis, and small and portable equipment, has the characteristics of improving the accuracy of pathogenic microorganism detection and shortening the report period, is suitable for hospital and field detection, conforms to the requirement scene of clinical infection diagnosis, and is expected to become the infection diagnosis technology of the next generation.
In the research field of RNA virus macro transcriptome, the amount of host nucleic acid in clinical samples is extremely high, the total RNA contains a large amount of ribosomal RNA (rRNA for short), the viral load in the samples is extremely low, RNA virus Reads cannot be detected or the detected amount is very small, and the establishment of a library of RNA virus macro transcriptome becomes a challenge. In practice, an efficient and convenient macro transcriptome database building method for RNA viruses is urgently needed for accurate diagnosis and treatment of infectious diseases.
In view of the above, the present invention is particularly proposed.
Disclosure of Invention
Aiming at the problem that the RNA virus of a clinical sample is difficult to detect, the core problem or purpose to be solved by the invention is to set up a set of detection flow of the RNA virus macro-transcription component library of the nanopore sequencing platform so as to improve the sensitivity of RNA virus detection of the nanopore sequencing platform, meet the clinical requirement for rapidly detecting the sample suspected of RNA virus infection, and be suitable for popularization and application.
According to the invention, through the operation steps of reverse enrichment and forward enrichment, the detection process of establishing the RNA virus macro-transcriptome library of the nanopore sequencing platform is successfully established, the sensitivity of RNA virus macro-transcriptome detection is improved, the rapid clinical identification of unknown pathogens is met, and the method is suitable for popularization and application.
Specifically, the technical scheme adopted by the invention is as follows:
the invention firstly provides a RNA virus macro transcriptome database building method based on a nanopore sequencing platform, which comprises the following steps:
1) Removing host DNA from the sample;
2) A nucleic acid extraction step;
3) A cDNA synthesis step;
4) Enrichment of cDNA;
5) Eliminating human rRNA;
further, the method can also comprise the following steps:
6) And (5) library construction.
Further, the step of 1) removing the host DNA from the sample is a step of incompletely removing the host DNA.
Furthermore, in the step 1), DNase is adopted to carry out incomplete removal on the host DNA of the sample;
preferably, the adding amount of the DNase is 26-260U, so that incomplete host removal corresponding to clinical samples can be realized effectively.
Further, the 2) nucleic acid extraction step can adopt RNA virus extraction by adopting Zymo _ D7021 or Qiagen _ 57704; preferably, zymo-D7021 is used.
Further, in the step 4) of cDNA enrichment, a single primer isothermal amplification method is adopted to actively enrich cDNA;
furthermore, the single-primer isothermal amplification method is an SPIA isothermal amplification method for actively enriching cDNA.
Preferably, the SPIA Isothermal Amplification uses a NuGEN Trio RNA-Seq (Single Primer Isothermal Amplification) kit.
Further, 5) removing the human rRNA by a probe method.
Further, the probe method comprises a hybridization capture step and a specific nuclease treatment step.
Preferably, the hybrid capture can use NuGEN Trio RNA-Seq (transcript deletion with AnyDeplete) kit.
Further, the library constructing step of 6) adopts a PCR-free library building computer kit; more preferably, the SQK-LSK109 kit is used.
Further, the samples include, but are not limited to: a body fluid sample, an alveolar lavage fluid sample, a sputum sample, a cerebrospinal fluid sample.
Further, the RNA virus macro-transcriptome is a nanopore sequencing platform based RNA virus macro-transcriptome.
The invention also provides the application of any one of the methods in RNA virus macro-transcriptome sequencing.
The invention has the beneficial technical effects that:
(1) The invention has high host content and a large amount of ribosomal RNA (rRNA) in total RNA of a clinical sample, and incomplete host cell DNA removal and rRNA removal are carried out on the clinical sample through reverse enrichment.
(2) The invention actively enriches cDNA by adopting a single primer isothermal amplification method aiming at low viral load of clinical samples and high diversity of RNA viral genomes.
(3) The invention effectively improves the sensitivity of RNA virus macro-transcriptome detection through reverse enrichment and forward enrichment, meets the clinical identification of unknown pathogens, and is suitable for popularization and application.
(4) The invention carries out detailed parameter optimization on the whole process to obtain a set of method process which is suitable for a nanopore sequencing platform and used for RNA virus macro-transcription library building and sequencing analysis.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and other drawings can be obtained by those skilled in the art without creative efforts.
FIG. 1 is a schematic diagram of an experimental procedure according to the present invention;
FIG. 2 is a graph comparing the results of each thousand Reads for different viruses of the library;
FIG. 3 is a comparison graph of the detection results of the average coverage depth of sequencing of different on-machine library kits;
FIG. 4 is a graph of detailed results of virus identification and coverage of virus identification for sample 187;
FIG. 5 sample 725 details of virus identification and a virus identification coverage map;
FIG. 6 sample A1 virus identification details and virus identification coverage map;
FIG. 7 sample B1 virus identification details and a virus identification coverage map;
FIG. 8 sample C1 virus identification details and virus identification coverage map;
FIG. 9 shows detailed results of virus identification and coverage map of virus identification for cell quality control product D1;
FIG. 10 shows detailed results of virus identification and coverage map of virus identification for the cell quality control product E1;
FIG. 11 shows the detailed results of virus identification and the coverage map of virus identification for the cell quality control product F1.
Detailed Description
The technical solutions of the present invention will be described clearly and completely with reference to the accompanying drawings, and it should be understood that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The following terms or definitions are provided only to aid in understanding the present invention. These definitions should not be construed to have a scope less than understood by those skilled in the art.
Unless defined otherwise below, all technical and scientific terms used in the detailed description of the invention are intended to have the same meaning as commonly understood by one of ordinary skill in the art. While the following terms are believed to be well understood by those skilled in the art, the following definitions are set forth to better explain the present invention.
As used herein, the terms "comprising," "including," "having," "containing," or "involving" are inclusive or open-ended and do not exclude additional unrecited elements or method steps. The term "consisting of 8230A" is considered to be a preferred embodiment of the term "comprising". If in the following a certain group is defined to comprise at least a certain number of embodiments, this should also be understood as disclosing a group which preferably only consists of these embodiments.
Where an indefinite or definite article is used when referring to a singular noun e.g. "a" or "an", "the", this includes a plural of that noun.
The terms "about" and "substantially" in the present invention denote the interval of accuracy that a person skilled in the art can understand while still guaranteeing the technical effect of the feature in question. The term generally denotes a deviation of ± 10%, preferably ± 5%, from the indicated value.
Furthermore, the terms first, second, third, (a), (b), (c), and the like in the description and in the claims, are used for distinguishing between similar elements and not necessarily for describing a sequential or chronological order. It is to be understood that the terms so used are interchangeable under appropriate circumstances and that the embodiments of the invention described herein are capable of operation in other sequences than described or illustrated herein.
The invention relates to a RNA virus macro-transcriptome database building method based on a nanopore sequencing platform, which mainly comprises the following steps:
1) Removing host DNA from the sample; 2) A nucleic acid extraction step; 3) A cDNA synthesis step; 4) Enrichment of cDNA; 5) Eliminating human rRNA; 6) And (5) library construction.
In some embodiments, in step 1) removing the host DNA from the sample, the sample contains a large amount of host DNA and the RNA virus is in low abundance and only a very small fraction. On one hand, only a very small part of obtained sequencing data can be used for species identification due to the interference of host DNA, on the other hand, the extracted nucleic acid is high, the input volume of subsequent DNase digestion is also influenced, the input amount of total RNA pathogens is further influenced, the detection sensitivity of macro-transcription component library construction of low-abundance RNA viruses is directly influenced, and the host DNA is required to be incompletely removed through analysis. Incomplete removal of host DNA is carried out, for example, using a combination (1) of reagent DH + reagent SS + reagent XX (host DNA liberation and degradation), or a combination (2) of reagent SS + reagent XX (host DNA degradation), preferably in the case of viruses (2) of reagent SS + reagent XX (host DNA degradation); preferably, the combination (2) is used.
In some embodiments, clinical RNA samples are limited in access and subsequently optimized library-building procedures are performed using cell-cultured virus quality controls simultaneously.
In some embodiments, 2) the nucleic acid extraction step, with low clinical viral load, qPCR evaluates kit efficacy and demonstrates that Zymo-D7021 is superior to Qiagen-57704, and thus Zymo-D7021 may be preferred for RNA virus extraction.
In some embodiments, in the step 4) of cDNA enrichment, cDNA is actively enriched by using a single-primer isothermal amplification method, qPCR and on-machine detection result shows that SPIA > SISIPA, and preferably, cDNA is actively enriched by using an SPIA isothermal amplification method.
In some embodiments, 5) in the step of human rRNA elimination, the throughput of the nanopore sequencing platform is low, the total RNA contains a large amount of conserved ribosomal RNA (rRNA), and the rRNA is removed by a probe method, so that the Reads of RNA viruses is improved.
In some embodiments, 6) library construction, the computer kit is selected, and the PCR FREE kit has better effect in terms of sequencing data amount, unmapped, virus Reads and sequencing average coverage depth detection, such as SQK-LSK109> SQK-PBK004, so the SQK-LSK109 kit can be preferably used for library computer.
Further, the method for identifying the macro transcriptome can be applied to the fields including but not limited to clinical research, scientific research and the like.
It will be appreciated that the method system of the invention is not limited to the type of sample, but is applicable wherever the field relates to macrotranscriptome rna viruses, for example samples may include, but are not limited to: a body fluid sample, an alveolar lavage fluid sample, a sputum sample, or a cerebrospinal fluid sample, among others.
Embodiments of the present invention will be described in detail below with reference to examples, but it will be understood by those skilled in the art that the following examples are only illustrative of the present invention and should not be construed as limiting the scope of the present invention. The examples, in which specific conditions are not specified, were conducted under conventional conditions or conditions recommended by the manufacturer. The reagents or instruments used are not indicated by manufacturers, and are all conventional products available on the market.
The following examples and experimental examples relate to an apparatus comprising: biological safety cabinet, vibration metal bath, pipettor, centrifuge, breaking instrument, super clean bench, PCR instrument, magnetic frame, gridION, qubit 4.0, refrigerator etc..
The related reagent comprises: host removal and microbiome DNA isolation kit, nucleic acid extraction kit, SPIA amplification kit,RNA-Seq transcript depletion with AnyDeplete Kit(Human rRNA)、 Universal End preparation ModuLe、Universal Adapter Ligation ModuLe、
Figure BDA0003387850860000061
GXL DNA Polymerase, ONT library construction kit, AMPure XP purified magnetic bead and Qubit TM A detection kit, an ONT sequencing chip and the like.
Example 1RNA Virus derhosting procedure optimization
Considering that the sample contains a large amount of host DNA, the abundance of RNA virus is low and only a very small part is occupied. On one hand, only a very small part of obtained sequencing data can be used for species identification due to the interference of host DNA, on the other hand, the extracted nucleic acid is high, the input volume of subsequent DNase digestion is also influenced, the input amount of total RNA pathogens is further influenced, the detection sensitivity of macro-transcription component library construction of low-abundance RNA viruses is directly influenced, and the incomplete removal of the host DNA is considered.
In this embodiment, the host DNA is incompletely removed from clinical Influenza B virus samples (IVB) and Rhinovirus samples (RhV) so as to screen a combination that can not only incompletely remove the host DNA but also has a small influence on viruses.
The invention sets two combinations as follows: combining (1) a saponin and DNase combined mode, and carrying out host DNA dissociation and degradation for experimental purposes; combination (2) only performs the DNase degradation host mode.
The two combined operation flows set by the invention are as follows:
combination (1) procedure: taking 500 mu L of sample, centrifuging for 10min at 4 ℃ of 18000 rcf; carefully transferring the supernatant to a new 2mL centrifuge tube; 10 μ L of 5% DH; adding 56 mu L SS, adding 1-10 mu L solution XX, mixing uniformly immediately, incubating at 37 ℃ and 1000rpm for 10min.
The combination (2) procedure: taking 500 mu L of sample, centrifuging for 10min at 4 ℃ of 18000 rcf; carefully transferring the supernatant to a new 2mL centrifuge tube; adding 56 mu L SS, adding 1-10 mu L solution XX, mixing uniformly immediately, incubating at 37 ℃ and 1000rpm for 10min.
The experimental design is shown in table 1:
table 1: optimized different combination forms of RNA virus decoating process
Combination (1) Reagent DH + reagent SS + reagent XX Host DNA liberation and degradation
Combination (2) Reagent SS + reagent XX Host DNA degradation
Remarking: the component of reagent DH is saponin, reagent SS is DNase Buffer, and reagent XX is DNase.
The results of the experiment are shown in table 1:
table 2: qPCR result after host removal of different combinations of clinical RNA virus samples
Figure BDA0003387850860000071
qPCR validated combined test results: in terms of RhV clinical sample Human and virus target quantification, whether the effect of reagent DH removal is negligible; in the aspect of IVB clinical sample Human quantification, the influence of reagent DH removal is negligible, but the difference of the virus target is 4.48CT, the difference is large, and the risk of reagent DH on virus removal hosts is prompted. In summary, the present invention preferably combines the reagents (2) SS + XX (DNase Buffer + DNase) with the reagents (2) for viruses, i.e., incomplete removal of host DNA is carried out only by DNA degradation without involving DH-based DNA isolation step, and the preferred amount of DNase added is 26U to 260U, which enables correspondingly effective incomplete removal of host.
Example 2 virus off-host procedure the necessity for building a nanopore sequencing platform RNA virus macro-transcriptome library
In the process of sample extraction, besides a very small amount of RNA viruses are extracted, the sample also contains a large amount of human genome nucleic acids, DNA of other pathogens and ribosomal RNA (rRNA), and if the parts are not removed, a large amount of sequencing space is occupied, so that the RNA detection is influenced; however, in the process of host removal, part of RNA viruses are also removed, which affects the load and detection of RNA viruses, so how to select host removal and the removal degree in practice affects the final result.
The purpose of this embodiment is: according to the optimized virus off-host process reagent combination (2) in example 1, whether the off-host process is necessary for constructing the ONT platform RNA virus macro-transcriptome library is tested for clinical samples.
Table 3: effect of non-decovered and decovered clinical samples on viral load
Figure BDA0003387850860000072
Table 4: clinical sample uncarved and uncarved nanopore sequencing platform library building and computer result
Figure BDA0003387850860000073
Figure BDA0003387850860000081
The result shows that clinical IVB samples are respectively tested for non-host removal and host removal process library building, and from the sequencing off-line result in 1 hour, the target pathogen is not detected in the non-host removal, the IVB can be well detected in the host removal, and the sequence number is 442.
In summary, in the RNA virus macro-transcript building library based on the nanopore sequencing platform, although the host removing process reduces the virus load, from the point of the computer result, the host removing process is very necessary for building the RNA virus macro-transcript building library of the nanopore sequencing platform.
Example 3 viral nucleic acid extraction reagent screening
The metagenome clinical sample has low virus load, and in practice, the existing RNA virus nucleic acid extraction kit is not suitable for RNA virus nucleic acid extraction of the metagenome, and a high-quality RNA nucleic acid template is required for obtaining a high-quality sequencing result. The RNA virus nucleic acid extraction kit of different manufacturers has great difference in extraction effect, extraction time and the like, so that the effective detection of pathogens in the subsequent RNA virus sequencing stage is influenced. The difference of the effect of extracting the RNA virus nucleic acid pathogeny by tens of times to thousands of times not only affects the sensitivity of the RNA virus macro-transcriptome process, but also can cause the omission of the RNA pathogeny to the social economy and the inestimable and life-threatening loss of patients. Therefore, the present example explores and tests two RNA virus nucleic acid extraction kits, and adopts qPCR to evaluate the extraction effect of the kit.
The experimental scheme is as follows: alveolar lavage fluid sample Spike in RNA virus
1mL of alveolar lavage fluid sample is put into a 1.5mL centrifuge tube, 100 μ L of Enterovirus 71 (Enterovirus 71, EV 71) and human Parainfluenza 3 (Parailnza virus type 3, PIV 3) quality control products (copy number is 1 × 10^5 copies/mL) are added, and nucleic acid extraction of the core Zymo _ D7021 and Qiagen _57704 are carried out by averagely dividing into two parts, wherein the kits are all commercially available.
Table 5: qPCR result for extracting nucleic acid from different kit viruses
Figure BDA0003387850860000082
As shown in Table 5, zymo-D7021 is obviously superior to Qiagen-57704 in the quantitative result of the virus target, and the difference of the extraction effect of the virus is significant, and the difference of the quantitative result evaluation is 3-10 CTs, and the difference of the extraction efficiency of the RNA virus is about 10-1000 times.
In the extraction time, the nucleic acid extraction time of Zymo-D7021 is about 0.5 hour, the nucleic acid extraction time of Qiagen _57704 is about 1.5 hours, and Zymo-D7021 can better meet the requirement of quick detection of clinical infection samples.
In summary, zymo-D7021 is preferred for RNA virus extraction in the present invention, because of the combination of the extraction effect and the extraction time.
Example 4 cDNA enrichment necessity for building a nanopore sequencing platform RNA Virus Macro-transcriptome library
The cDNA has less virus content, and the direct library construction and sequencing may not be effective, and this example contrasts and tests the influence of direct library construction and machine installation of cDNA and machine installation after cDNA enrichment on the virus sequencing result.
Table 6: influence of cDNA enrichment on RNA virus macrotranscriptome sequencing result of construction of nanopore sequencing platform
Figure BDA0003387850860000091
The experimental results are shown in table 6, and compared with the three treatment conditions, the number of sequences detected by the SPIA-enriched Rhinovirus a in the method 3cDNA and the relative abundance are obviously better than those detected by the methods 1 and 2 without the cDNA enrichment operation process. Thus, in RNA virus macrotranscriptome sequencing, cDNA enrichment has a significant gain effect on RNA virus enrichment.
Example 5 rRNA knock-out necessities for building a nanopore sequencing platform RNA viral transcriptome library
The transcriptome comprises a collection of different types of RNA molecules, coding RNA and non-coding RNA. Ribosomal RNA (rRNA) accounts for about 80% of the total RNA amount, and is the most abundant RNA, but usually, such RNA has a large molecular weight and is not metabolically active, and if the RNA is not removed, a large amount of sequencing space may be occupied, which may affect the detection of RNA.
Table 7 influence of rrna knock-out effect on RNA virus macro-transcriptome sequencing results building a nanopore sequencing platform
Figure BDA0003387850860000092
Figure BDA0003387850860000101
The experimental results show in the table, the detected sequence number and relative abundance of Rhinovirus A after rRNA elimination of cDNA in the method 2 are obviously superior to those of the method 1 without rRNA elimination operation flow under two treatment conditions; in addition, it is clear from the above example 4 that the effect of rRNA elimination alone is not ideal, but the "rRNA elimination + SPIA enrichment" effect is optimal, and the principle is not clear at all. In conclusion, after SPIA enrichment, rRNA elimination is performed on cDNA, so that the effective data volume is remarkably improved, and the rRNA elimination is very key for building a nanopore sequencing platform RNA virus macro-transcription component library.
Example 6 on-Board library reagent selection
Considering that the Nanopore kit is divided into PCR and PCR-free, and whether performing PCR again in the selection of the kit in the machine will further amplify the sequencing result, this example compares the effect of constructing library kit on QK-LSK109 (PCR-free) and SQK-PBK004 (LP with PCR).
As can be seen from FIGS. 2 and 3, in the on-machine kit selection, the SQK-LSK109 (PCR-free) > SQK-PBK004 (LP with PCR) is selected from the aspects of sequencing data quantity, detection of virus Reads and sequencing average coverage depth, so that the SQK-LSK109 kit is preferably used for library on-machine in the invention.
Example 7 establishment of the basic Process of the invention
The basic procedure of the present invention was finally determined by the search of examples 1-6 (the following is only an optimal solution and is not intended to limit the scope of the patent claims).
The basic flow of the experiment is shown in FIG. 1, and the concrete sample macro-transcription library construction and detection method is as follows:
1. sample pretreatment (removal of host DNA):
1. taking 500 mu L of body fluid sample, and centrifuging for 10min at 4 ℃ of 18000 rcf;
2. carefully transferring the supernatant to a new 2mL centrifuge tube;
3. adding 56 mu L SS, adding 1-10 mu L solution XX, mixing uniformly immediately, incubating at 37 ℃ and 1000rpm for 10min.
2. And (3) extracting nucleic acid, namely extracting RNA virus by adopting Zymo-D7021 according to the kit instruction.
3. 1/2 strand cDNA Synthesis: the conventional cDNA synthesis process is adopted.
4. cDNA enrichment (SPIA Pre-amplification)
1. In a 0.2mL PCR tube, the following system was prepared: specifically, trio RNA-Seq (Single Primer Isothermal Amplification kit, purchased from NuGEN) was used.
Figure BDA0003387850860000102
/>
Figure BDA0003387850860000111
2. The PCR tube was placed on a PCR instrument for the following reactions:
temperature (. Degree.C.) Time (min)
4 1
47 90
80 20
4 Hold
Purification after SPIA Pre-amplification
(1) Adding 33 mu L of nucleic-free water into the PCR tube after the previous step, adding 60 mu L of AMPure XP beads suspension for purification, uniformly mixing, incubating at room temperature for 5-10 minutes, centrifuging for a short time, placing on a magnetic frame for standing, and removing supernatant;
(2) Adding 200 μ L of freshly prepared 80% ethanol, and repeating for 2 times;
(3) After the short-time centrifugation, resetting the centrifugal tube on a magnetic frame, removing residual liquid by using a 10-mu-L pipettor, and airing for 2min at room temperature;
(4) Adding 12 mu L of DNA Resuspension Buffer, resuspending the magnetic beads, incubating at room temperature for 2min, standing on a magnetic frame, sucking the supernatant, putting into the next step for reaction, and taking the remaining 1 mu L in the previous step for the concentration determination of the Qubit.
5. Human rRNA knock-out treatment
1. And (3) probe binding: specifically, trio RNA-Seq (transcript deletion with AnyDesplex) kit was used and purchased from NuGEN.
(1) In a 0.2mL PCR tube, the following system was prepared:
components Volume (μ L)
Anydeplete Buffer Mix 5
Anydeplete Probe Mix 9
Anydeplete Enzyme Mix 1
DNA Resuspension Sample 10
Total 25
(2) The PCR tube was placed on a PCR instrument for the following reactions:
Figure BDA0003387850860000112
Figure BDA0003387850860000121
/>
2. targeted segment removal (specific nuclease treatment removal)
(1) The following system was added to the PCR tube at the end of the previous step:
components Volume (μ L)
Anydeplete Buffer Mix 5
Anydeplete Enzyme Mix II 4
DNA Resuspension Buffer 16
Samples from the end of the previous step 25
Total 50
(2) The PCR tube was placed on a PCR instrument for the following reactions:
temperature (. Degree.C.) Time (min)
60 30
95 5
4 Hold
3. And (5) purifying.
6. And (4) constructing a library by adopting an SQK-LSK109 kit according to the kit instruction.
7. Sequencing analysis, computer sequencing and letter generation analysis according to a standard nanopore sequencing computer flow.
Example 8 clinical sample testing
Based on the established RNA virus macro-transcription component library building processing flow, 5 clinical RNA virus samples and 3 cell quality control products which are verified by qPCR are subjected to machine test according to the whole RNA virus macro-transcription component library building flow based on the nanopore sequencing platform, and the test results are shown in the following table and figures 4-11.
Figure BDA0003387850860000122
/>
Figure BDA0003387850860000131
According to the invention, 5 parts of clinical samples verified by qPCR and 3 parts of cell culture quality control products are subjected to full-process machine test according to RNA virus macro-transcription component library based on a nanopore sequencing platform, RNA virus infection pathogeny (detected RNA viruses comprise respiratory syncytial virus, rhinovirus, human parainfluenza virus, influenza virus, enterovirus, flavivirus and the like) can be rapidly detected, and the detection process is matched with the qPCR verification result, the RNA virus macro-transcription component library detection process of the nanopore sequencing platform is successfully established, 1 hour of sequencing data shows that the sequence number, relative abundance and coverage of the viruses are good, the clinical requirement on rapid detection of suspected RNA virus infection samples is met, and the method is suitable for popularization and application.
The foregoing descriptions of specific exemplary embodiments of the present invention have been presented for purposes of illustration and description. It is not intended to limit the invention to the precise form disclosed, and obviously many modifications and variations are possible in light of the above teaching. The exemplary embodiments were chosen and described in order to explain certain principles of the invention and its practical application to enable one skilled in the art to make and use various exemplary embodiments of the invention and various alternatives and modifications as are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the claims and their equivalents.

Claims (2)

1. A method for constructing a library of RNA virus macro-transcriptomes, said method comprising the steps of:
1) Removing host DNA from the sample;
2) A nucleic acid extraction step;
3) A cDNA synthesis step;
4) Enrichment of cDNA;
5) Eliminating human rRNA;
6) Constructing a library;
the 1) host DNA removing step of the sample is an incomplete host DNA removing step: adopting DNase to carry out incomplete removal of host DNA on the sample; the adding amount of the DNase is 26U-260U;
in the step 2) of nucleic acid extraction, zymo _ D7021 is adopted for RNA virus extraction;
the step 4) of cDNA enrichment actively enriches cDNA by adopting a single primer isothermal amplification method;
the single primer isothermal amplification method is an SPIA isothermal amplification method;
5) removing human rRNA by a probe method; the probe method comprises a hybridization capture step and a specific nuclease treatment step;
the samples include, but are not limited to: alveolar lavage fluid samples, sputum samples, and cerebrospinal fluid samples;
the RNA virus macro-transcriptome is a nanopore sequencing platform based RNA virus macro-transcriptome;
and 6) in the library construction step, a PCR-free library construction computer kit is adopted.
2. Use of the library construction method of claim 1 in RNA virus macrotranscriptome sequencing, said use being a non-disease diagnostic use.
CN202111473132.2A 2021-12-02 2021-12-02 RNA virus macro transcriptome database building method based on nanopore sequencing platform and application Active CN114277092B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111473132.2A CN114277092B (en) 2021-12-02 2021-12-02 RNA virus macro transcriptome database building method based on nanopore sequencing platform and application

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111473132.2A CN114277092B (en) 2021-12-02 2021-12-02 RNA virus macro transcriptome database building method based on nanopore sequencing platform and application

Publications (2)

Publication Number Publication Date
CN114277092A CN114277092A (en) 2022-04-05
CN114277092B true CN114277092B (en) 2023-03-24

Family

ID=80870996

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111473132.2A Active CN114277092B (en) 2021-12-02 2021-12-02 RNA virus macro transcriptome database building method based on nanopore sequencing platform and application

Country Status (1)

Country Link
CN (1) CN114277092B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117126920B (en) * 2023-07-31 2024-04-12 江苏先声医学诊断有限公司 Central nervous infection pathogen library building and detecting method based on nanopore sequencing platform

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113265452A (en) * 2021-05-14 2021-08-17 北京大学人民医院 Bioinformatics pathogen detection method based on Nanopore metagenome RNA-seq

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106319639B (en) * 2015-06-17 2018-09-04 深圳华大智造科技有限公司 Build the method and apparatus of sequencing library
CA3067251C (en) * 2018-03-19 2024-02-27 Illumina, Inc. Methods and compositions for selective cleavage of nucleic acids with recombinant nucleases
CN110964795B (en) * 2018-12-13 2020-09-18 北京先声医学检验实验室有限公司 Nanopore sequencing platform based alveolar lavage fluid sample library building method, identification method and kit
CN113249436A (en) * 2020-02-12 2021-08-13 广州微远基因科技有限公司 Method and application for reducing host nucleic acid in biological sample
CN112853001A (en) * 2021-02-06 2021-05-28 浙江树人学院(浙江树人大学) Quality control product for detecting RNA virus by metagenome sequencing and application thereof

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113265452A (en) * 2021-05-14 2021-08-17 北京大学人民医院 Bioinformatics pathogen detection method based on Nanopore metagenome RNA-seq

Also Published As

Publication number Publication date
CN114277092A (en) 2022-04-05

Similar Documents

Publication Publication Date Title
CN110349630B (en) Analysis method and device for blood metagenome sequencing data and application thereof
CN111235316B (en) Primer probe for identifying novel coronavirus and application of primer probe in triple fluorescence RPA
CN111254190B (en) Nanopore third-generation sequencing detection method for plasma virology
CN105420403B (en) A kind of real time fluorescence quantifying PCR method for carrying out magnetic bead in a pipe and extracting nucleic acid and amplification
CN111500781B (en) Amplification primer group for detecting SARS-CoV-2 by mNGS and application thereof
CN109486923B (en) Primer system for sequencing multiple amplicons, application thereof and method for constructing sequencing library
CN111440896A (en) Novel β coronavirus variation detection method, probe and kit
CN105861700A (en) High throughput test method for neuromuscular disorders
CN105567681B (en) A kind of method and label connector based on the noninvasive biopsy virus of high-throughput gene sequencing
CN106191311B (en) A kind of multiple liquid phase genetic chip method and reagent of quick detection cavy LCMV, SV, PVM, Reo-3 virus
CN108138244A (en) Virus group capture microarray dataset, design and construction method and application method
CN113337639A (en) Method for detecting COVID-19 based on mNGS and application thereof
CN114277092B (en) RNA virus macro transcriptome database building method based on nanopore sequencing platform and application
CN111662959A (en) High-throughput rapid detection method for fungi
CN103998625B (en) For the method and system of Viral diagnosis
CN113265452A (en) Bioinformatics pathogen detection method based on Nanopore metagenome RNA-seq
CN113215235A (en) Method for rapidly detecting pathogenic microorganisms in high flux
CN102952895B (en) Method for detecting unknown viruses through utilizing sequencing technology
CN111549109A (en) High-throughput pathogen microorganism gene detection screening method
CN115948607B (en) Method and kit for simultaneously detecting multiple pathogen genes
CN109486922B (en) Method for detecting microbial target sequence based on single primer probe capture
CN114351261A (en) Method for detecting respiratory tract sample difficultly-detected pathogenic microorganisms based on nanopore sequencing platform
CN115491431A (en) Detection kit for detecting Japanese encephalitis B virus nucleic acid based on gene editing technology and application
CN113186348A (en) Primer combination and method for detecting common respiratory viruses
CN114107454A (en) Respiratory tract infection pathogen detection method based on macrogene/macrotranscriptome sequencing

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant