CN107944225A - Gene high-flux sequence data mutation detection methods - Google Patents

Gene high-flux sequence data mutation detection methods Download PDF

Info

Publication number
CN107944225A
CN107944225A CN201711214506.2A CN201711214506A CN107944225A CN 107944225 A CN107944225 A CN 107944225A CN 201711214506 A CN201711214506 A CN 201711214506A CN 107944225 A CN107944225 A CN 107944225A
Authority
CN
China
Prior art keywords
gene
sequence
mutation
sam
format file
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201711214506.2A
Other languages
Chinese (zh)
Other versions
CN107944225B (en
Inventor
李超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huisuan Gene Technology Shanghai Co ltd
Original Assignee
Hui - Ying Medical Technology (shanghai) Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hui - Ying Medical Technology (shanghai) Co Ltd filed Critical Hui - Ying Medical Technology (shanghai) Co Ltd
Priority to CN201711214506.2A priority Critical patent/CN107944225B/en
Publication of CN107944225A publication Critical patent/CN107944225A/en
Application granted granted Critical
Publication of CN107944225B publication Critical patent/CN107944225B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression

Landscapes

  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biophysics (AREA)
  • Genetics & Genomics (AREA)
  • Molecular Biology (AREA)
  • Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The present invention provides a kind of gene high-flux sequence data mutation detection methods, including step:S1:Obtain the high-flux sequence data of a cdna sample;S2:Generate the position information tag of each gene order of the high-flux sequence data of the cdna sample;S3:Each gene order is grouped according to the position information tag and calculates the mutation total amount of acquisition one;S4:The mutation total amount is substituted into a background model output mutation testing result.A kind of gene high-flux sequence data mutation detection methods of the present invention, the method combined using virtual molecular label with background database reduces noise, improve the specificity and sensitiveness of detection, can be in the random error during experiment can be effectively reduced on the premise of not increasing experimental cost, with reference to correction of the background database to systematic error, it can achieve the purpose that the low abundance mutation of precise Identification.

Description

Gene high-throughput sequencing data mutation detection method
Technical Field
The invention relates to the technical field of gene detection, in particular to a gene high-throughput sequencing data mutation detection method.
Background
In the past clinical and scientific research application of tumor gene mutation detection, only the condition of high-abundance gene mutation in tumor tissues is generally concerned. Due to low content of mutant nucleic acid, the low-abundance mutation is easy to miss detection or false positive under the condition of low sequencing coverage. However, in some application scenarios, for example, detection of low-abundance tumor mutant nucleic acid in blood by means of liquid biopsy, accurate detection of low-abundance mutation is required. By combining high-throughput sequencing targeted capture or amplification technology with high-depth sequencing, the sequencing coverage of important tumor mutation sites can be improved, and the detection sensitivity can be improved. However, due to the noise naturally existing in high-throughput sequencing, it is still difficult to distinguish the true mutation from the noise point only from the experimental point of view, and the above problems must be solved by establishing a model of noise reduction and mutation detection through an algorithm.
In the existing scheme, sequencing data of healthy people are used as background values, and a background noise threshold value of each site is determined through normal distribution fitting, so that true positive points and noise are distinguished. However, this solution has several problems: 1. batch effect exists in high-throughput sequencing experiments and data generation, a background model established by sequencing data of healthy people can remove system errors existing in a sequencing system, but experimental errors randomly generated in each experiment cannot be effectively removed; 2. the background data of healthy people needs to be established by measuring data of a large sample size of a large number of sites, the required cost is high, and the noise reduction effect cannot be realized on the sites which are not covered temporarily in a background database.
Disclosure of Invention
Aiming at the defects in the prior art, the invention provides a gene high-throughput sequencing data mutation detection method, which adopts a method of combining a virtual molecular tag and a background database to reduce noise, improves the specificity and sensitivity of detection, can effectively reduce random errors in an experiment on the premise of not increasing the experiment cost, and can achieve the aim of accurately identifying low-abundance mutation by combining the correction of the background database on system errors.
In order to achieve the above object, the present invention provides a gene high-throughput sequencing data mutation detection method, comprising the steps of:
s1: obtaining high-throughput sequencing data of a gene sample;
s2: generating a position information label of each gene sequence of the high-throughput sequencing data of the gene sample;
s3: grouping each gene sequence according to the position information label and calculating to obtain a total mutation amount;
s4: and substituting the total mutation amount into a background model to output a mutation detection result.
Preferably, the S2 step further comprises the steps of:
s21: comparing each gene sequence to a reference genome through a sequence comparison algorithm to form comparison information of each gene sequence;
s22: storing the comparison information in a SAM/BAM format file;
s23: judging a template chain Ti of a sequence source of each gene sequence according to the SAM/BAM format file, wherein i is more than or equal to 1 and less than or equal to n, and n is the number of the gene sequences;
and S24, generating a position information label of each gene sequence according to the template strand Ti of the sequence source and the SAM/BAM format file.
Preferably, the S23 step further comprises the steps of:
extracting a first comparison starting position Pi of each gene sequence, a second comparison starting position Qi of a same fragment comparison sequence, positive and negative chain information Si and a sequence number Ri of the gene sequence from the SAM/BAM format file;
the template strand of sequence origin Ti is positive when the sequence number Ri of the gene sequence is equal to the value of the read1 position of the SAM/BAM format file and the positive-negative chain information Si is equal to the value of the foward position of the SAM/BAM format file, or the sequence number Ri of the gene sequence is not equal to the value of the read1 position of the SAM/BAM format file and the positive-negative chain information Si is not equal to the value of the foward position of the SAM/BAM format file;
the template strand Ti from which the sequence originates is negative when the sequence number Ri of the gene sequence is equal to the value of the read1 position of the SAM/BAM format file and the positive-negative chain information Si is not equal to the value of the forward 1 position of the SAM/BAM format file, or the sequence number Ri of the gene sequence is not equal to the value of the read1 position of the SAM/BAM format file and the positive-negative chain information Si is equal to the value of the forward position of the SAM/BAM format file.
Preferably, the position information tag is represented as (Pi, qi, ti).
Preferably, the S3 step further comprises the steps of:
s31: dividing the gene sequences with the consistent position information tags into the same genome;
s32: counting each gene sequence in the genome and a target gene position g of the reference genome i Corresponding to a current genotype location of mutant genotype and base quality q&gt 30 mutation number v of the gene sequence j J is a natural number greater than or equal to 1;
such as v j &gt 0, recording the base quality q of the current gene position&gt 30 number n of the gene sequence j
Such as v j <f*n j Then v is j =0, wherein f is a preset lowest base identity ratio value;
s33: repeating the step S32 to obtain the mutation number v of each target gene position j And calculating a total number of mutations based on the number of mutationsWherein
When in useAt the time, reserveAnd continuing the subsequent steps;
when in useWhen in use, willAnd resetting the numerical value of the step (b) and continuing the subsequent steps.
Preferably, the S4 step further comprises the steps of:
s41: establishing a background model, wherein the formula of the background model is as follows:
wherein, P gi In order to accumulate the distribution frequency, gamma is a first fitting parameter, delta is a second fitting parameter, epsilon is a third fitting parameter, and lambda is a fourth fitting parameter;
obtaining the first fitting parameter, the second fitting parameter, the third fitting parameter and the fourth fitting parameter according to fitting of a plurality of sample data;
s42: substituting the total mutation amount into the background model, and calculating the cumulative distribution frequency;
s43: and when the cumulative distribution frequency value is greater than 0.95, judging that a gene locus corresponding to the current position information label is a positive locus.
Preferably, the number of the sample data is greater than or equal to 1000.
Due to the adoption of the technical scheme, the invention has the following beneficial effects:
1. the random noise in high throughput sequencing is removed without adding experimental steps and cost.
2. A calculation model for distinguishing the positive mutation sites is established by modeling sequencing data of healthy people after random noise is removed.
Finally, the sensitivity and specificity of low-abundance variation detection can be obviously improved on the premise of not changing the existing experimental system.
Drawings
FIG. 1 is a flow chart of a gene high throughput sequencing data mutation detection method according to an embodiment of the present invention.
Detailed Description
The following description of the preferred embodiment of the present invention, in accordance with the accompanying drawings of which 1 is presented to enable a better understanding of the invention as to its functions and features.
Referring to fig. 1, a method for detecting mutation in gene high-throughput sequencing data according to an embodiment of the present invention includes the steps of:
s1: high throughput sequencing data of a gene sample is obtained.
S2: generating a positional information tag for each gene sequence of the high-throughput sequencing data of the gene sample.
Wherein, the step of S2 further comprises the steps of:
s21: comparing each gene sequence to a reference genome through a sequence comparison algorithm to form comparison information of each gene sequence; the sequence comparison algorithm can adopt any existing sequence comparison algorithm, and is not particularly limited; the comparison information comprises first comparison starting position information, second comparison starting position information, base quality information, positive and negative chain information, sequence number information of a gene sequence and the like;
s22: storing the comparison information in a SAM/BAM format file;
s23: judging template chains Ti of sequence sources of all gene sequences according to the SAM/BAM format file, wherein i is more than or equal to 1 and less than or equal to n, and n is the number of the gene sequences;
and S24, generating a position information label of each gene sequence according to the template strand Ti of the sequence source and the SAM/BAM format file.
Wherein, the step of S23 further comprises the steps of:
extracting a first comparison initial position Pi of each gene sequence, a second comparison initial position Qi of a same fragment comparison sequence, positive and negative chain information Si and a sequence number Ri of the gene sequence from the SAM/BAM format file; the logical relationship of the template strand Ti from which the sequence originates can be expressed as:
when the sequence number Ri of the gene sequence is equal to the value of the read1 position of the SAM/BAM format file and the positive and negative chain information Si is equal to the value of the foward position of the SAM/BAM format file, or the sequence number Ri of the gene sequence is not equal to the value of the read1 position of the SAM/BAM format file and the positive and negative chain information Si is not equal to the value of the foward position of the SAM/BAM format file, the template chain Ti of the sequence source is positive;
when the sequence number Ri of the gene sequence is equal to the value of the read1 position of the SAM/BAM format file and the positive-negative chain information Si is not equal to the value of the foward position of the SAM/BAM format file, or the sequence number Ri of the gene sequence is not equal to the value of the read1 position of the SAM/BAM format file and the positive-negative chain information Si is equal to the value of the foward position of the SAM/BAM format file, the template chain Ti from which the sequence originates is negative.
In the present embodiment, the positional information tag is represented by (Pi, qi, ti). The triplet is capable of uniquely identifying all sequences from a uniform template nucleic acid and is capable of distinguishing between the sense and anti-sense strands of the template.
S3: and grouping the gene sequences according to the position information labels and calculating to obtain a total mutation amount.
Wherein, the step S3 further comprises the steps of:
s31: dividing the gene sequences with consistent position information labels into the same genome;
s32: counting the gene sequence in the genome and the target gene position g of the reference genome i Corresponding to a current genotype location of mutant genotype and base quality q&gt 30 number of mutations of the Gene sequence v j J is a natural number greater than or equal to 1;
such as v j &gt 0, recording base quality q of current gene position&Number n of gene sequences of gt, 30 j
Such as v j <f*n j Then v is j =0, wherein f is a preset lowest base identity ratio value;
s33: repeating the step S32 to obtain the mutation number v of each target gene position j And calculating a total number of mutations based on the number of mutationsWherein
When in useAt the time, reserveAnd continuing the subsequent steps;
when in useWhen in use, willAnd resetting the numerical value of the step (b) and continuing the subsequent steps.
S4: and substituting the total mutation amount into a background model to output a mutation detection result.
Wherein, the step S4 further comprises the steps of:
s41: establishing a background model, wherein the formula of the background model is as follows:
wherein, P gi In order to accumulate the distribution frequency, gamma is a first fitting parameter, delta is a second fitting parameter, epsilon is a third fitting parameter, and lambda is a fourth fitting parameter;
obtaining a first fitting parameter, a second fitting parameter, a third fitting parameter and a fourth fitting parameter according to fitting of more than 1000 sample data;
s42: substituting the total mutation amount into a background model, and calculating cumulative distribution frequency;
s43: and when the cumulative distribution frequency value is greater than 0.95, judging that a gene locus corresponding to the current position information label is a positive locus.
The gene high-throughput sequencing data mutation detection method provided by the embodiment of the invention has the following beneficial effects:
1. the random noise in high-throughput sequencing is removed without increasing the experimental steps and cost.
2. A calculation model for distinguishing the positive mutation sites is established by modeling sequencing data of healthy people without random noise.
Finally, the sensitivity and specificity of low-abundance variation detection can be obviously improved on the premise of not changing the existing experimental system.
While the present invention has been described in detail and with reference to the embodiments thereof as illustrated in the accompanying drawings, it will be apparent to one skilled in the art that various changes and modifications can be made therein. Therefore, certain details of the embodiments are not to be interpreted as limiting, and the scope of the invention is to be determined by the appended claims.

Claims (7)

1. A gene high-throughput sequencing data mutation detection method comprises the following steps:
s1: obtaining high-throughput sequencing data of a gene sample;
s2: generating a position information label of each gene sequence of the high-throughput sequencing data of the gene sample;
s3: grouping each gene sequence according to the position information label and calculating to obtain a total mutation amount;
s4: and substituting the total mutation amount into a background model to output a mutation detection result.
2. The method for detecting mutation in gene high-throughput sequencing data according to claim 1, wherein the step S2 further comprises the steps of:
s21: comparing each gene sequence to a reference genome through a sequence comparison algorithm to form comparison information of each gene sequence;
s22: storing the comparison information in a SAM/BAM format file;
s23: judging a template chain Ti of a sequence source of each gene sequence according to the SAM/BAM format file, wherein i is more than or equal to 1 and less than or equal to n, and n is the number of the gene sequences;
and S24, generating a position information label of each gene sequence according to the template strand Ti of the sequence source and the SAM/BAM format file.
3. The method for detecting mutation in gene high-throughput sequencing data according to claim 2, wherein said step of S23 further comprises the steps of:
extracting a first comparison starting position Pi of each gene sequence, a second comparison starting position Qi of a same fragment comparison sequence, positive and negative chain information Si and a sequence number Ri of the gene sequence from the SAM/BAM format file;
the template strand of sequence origin Ti is positive when the sequence number Ri of the gene sequence is equal to the value of the read1 position of the SAM/BAM format file and the positive-negative chain information Si is equal to the value of the foward position of the SAM/BAM format file, or the sequence number Ri of the gene sequence is not equal to the value of the read1 position of the SAM/BAM format file and the positive-negative chain information Si is not equal to the value of the foward position of the SAM/BAM format file;
the template strand Ti from which the sequence originates is negative when the sequence number Ri of the gene sequence is equal to the value of the read1 position of the SAM/BAM format file and the positive-negative chain information Si is not equal to the value of the forward 1 position of the SAM/BAM format file, or the sequence number Ri of the gene sequence is not equal to the value of the read1 position of the SAM/BAM format file and the positive-negative chain information Si is equal to the value of the forward position of the SAM/BAM format file.
4. The method for detecting mutation in gene high throughput sequencing data according to claim 3, wherein said positional information tag is represented by (Pi, qi, ti).
5. The method for detecting mutation in gene high throughput sequencing data according to any one of claims 1 to 4, wherein said S3 step further comprises the steps of:
s31: dividing the gene sequences with the same position information label into the same genome;
s32: counting each gene sequence in the genome and a target gene position g of the reference genome i Corresponding to a current gene position of mutant genotype and base quality q&gt 30 mutation number v of the gene sequence j J is a natural number greater than or equal to 1;
such as v j &gt 0, recording the base quality q of the current gene position&gt 30, the number n of the gene sequences j
Such as v j <f*n j Then v is j =0, wherein f is a preset lowest base identity ratio value;
s33: repeating the step S32 to obtain the mutation number v of each target gene position j And calculating a total number of mutations based on the number of mutationsWherein
When the temperature is higher than the set temperatureAt the time, reserveAnd continuing the subsequent steps;
when the temperature is higher than the set temperatureWhen in use, willAnd resetting the value of (4) and continuing the subsequent steps.
6. The method for detecting mutation in gene high-throughput sequencing data according to claim 5, wherein the step S4 further comprises the steps of:
s41: establishing a background model, wherein the formula of the background model is as follows:
wherein, P gi The cumulative distribution frequency is shown, gamma is a first fitting parameter, delta is a second fitting parameter, epsilon is a third fitting parameter, and lambda is a fourth fitting parameter;
obtaining the first fitting parameter, the second fitting parameter, the third fitting parameter and the fourth fitting parameter according to fitting of a plurality of sample data;
s42: substituting the total mutation amount into the background model, and calculating the cumulative distribution frequency;
s43: and when the cumulative distribution frequency value is greater than 0.95, judging that a gene locus corresponding to the current position information label is a positive locus.
7. The method of detecting mutations in gene high-throughput sequencing data according to claim 6, wherein the number of sample data is 1000 or more.
CN201711214506.2A 2017-11-28 2017-11-28 Gene high-throughput sequencing data mutation detection method Active CN107944225B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711214506.2A CN107944225B (en) 2017-11-28 2017-11-28 Gene high-throughput sequencing data mutation detection method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711214506.2A CN107944225B (en) 2017-11-28 2017-11-28 Gene high-throughput sequencing data mutation detection method

Publications (2)

Publication Number Publication Date
CN107944225A true CN107944225A (en) 2018-04-20
CN107944225B CN107944225B (en) 2020-04-24

Family

ID=61950342

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711214506.2A Active CN107944225B (en) 2017-11-28 2017-11-28 Gene high-throughput sequencing data mutation detection method

Country Status (1)

Country Link
CN (1) CN107944225B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108681661A (en) * 2018-05-16 2018-10-19 杭州迈迪科生物科技有限公司 A method of generating sample ID with experiment
CN109920480A (en) * 2019-03-14 2019-06-21 深圳市海普洛斯生物科技有限公司 A kind of method and apparatus correcting high-flux sequence data
CN111161801A (en) * 2019-12-31 2020-05-15 杨琦 Method for automatically identifying heterozygous mutation in first-generation gene sequencing
CN113628683A (en) * 2021-08-24 2021-11-09 慧算医疗科技(上海)有限公司 High-throughput sequencing mutation detection method, equipment, device and readable storage medium
CN115424664A (en) * 2022-11-07 2022-12-02 北京雅康博生物科技有限公司 Method and device for evaluating degree of artificial mutation

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170058332A1 (en) * 2015-09-02 2017-03-02 Guardant Health, Inc. Identification of somatic mutations versus germline variants for cell-free dna variant calling applications
CN106599616A (en) * 2017-01-03 2017-04-26 上海派森诺医学检验所有限公司 duplex-seq-based ultralow-frequency mutation site detection analysis method
CN106755454A (en) * 2017-01-06 2017-05-31 杭州杰毅麦特医疗器械有限公司 A kind of molecular label nucleic acid detection method
CN106845150A (en) * 2016-12-29 2017-06-13 安诺优达基因科技(北京)有限公司 A kind of device for detecting Circulating tumor DNA sample Gene Fusion

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170058332A1 (en) * 2015-09-02 2017-03-02 Guardant Health, Inc. Identification of somatic mutations versus germline variants for cell-free dna variant calling applications
CN106845150A (en) * 2016-12-29 2017-06-13 安诺优达基因科技(北京)有限公司 A kind of device for detecting Circulating tumor DNA sample Gene Fusion
CN106599616A (en) * 2017-01-03 2017-04-26 上海派森诺医学检验所有限公司 duplex-seq-based ultralow-frequency mutation site detection analysis method
CN106755454A (en) * 2017-01-06 2017-05-31 杭州杰毅麦特医疗器械有限公司 A kind of molecular label nucleic acid detection method

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108681661A (en) * 2018-05-16 2018-10-19 杭州迈迪科生物科技有限公司 A method of generating sample ID with experiment
CN109920480A (en) * 2019-03-14 2019-06-21 深圳市海普洛斯生物科技有限公司 A kind of method and apparatus correcting high-flux sequence data
CN111161801A (en) * 2019-12-31 2020-05-15 杨琦 Method for automatically identifying heterozygous mutation in first-generation gene sequencing
CN111161801B (en) * 2019-12-31 2023-06-06 杨琦 Method for automatically identifying heterozygous mutation in first generation gene sequencing
CN113628683A (en) * 2021-08-24 2021-11-09 慧算医疗科技(上海)有限公司 High-throughput sequencing mutation detection method, equipment, device and readable storage medium
CN113628683B (en) * 2021-08-24 2024-04-09 慧算医疗科技(上海)有限公司 High-throughput sequencing mutation detection method, device and apparatus and readable storage medium
CN115424664A (en) * 2022-11-07 2022-12-02 北京雅康博生物科技有限公司 Method and device for evaluating degree of artificial mutation

Also Published As

Publication number Publication date
CN107944225B (en) 2020-04-24

Similar Documents

Publication Publication Date Title
CN107944225B (en) Gene high-throughput sequencing data mutation detection method
CN107423578B (en) Device for detecting somatic cell mutation
CN103993069B (en) Virus integration site capture sequencing analysis method
WO2018068600A1 (en) Image processing method and system
Wang et al. Accurate recovery of ribosome positions reveals slow translation of wobble-pairing codons in yeast
JP2016539630A5 (en)
WO2007142044A1 (en) Image processing device and image processing program
CN109887546B (en) Single-gene or multi-gene copy number detection system and method based on next-generation sequencing
WO2011145955A1 (en) Method and system for sequence correlation
CN116486910B (en) Deep learning training set establishment method for nanopore sequencing base recognition and application thereof
CN106529171A (en) Detection analysis method for breast cancer susceptibility gene heritable variation point
CN110016497B (en) Method for detecting copy number variation of tumor single cell genome
CN103946396A (en) Method for sequence recombination and apparatus for ngs
CN115083521B (en) Method and system for identifying tumor cell group in single cell transcriptome sequencing data
CN108595915A (en) A kind of three generations&#39;s data correcting method based on DNA variation detections
CN108154007B (en) Copy number variation and deletion type detection method based on single tumor sample
CN109920480B (en) Method and device for correcting high-throughput sequencing data
Alkhateeb et al. Zseq: an approach for preprocessing next-generation sequencing data
CN114530199A (en) Method and device for detecting low-frequency mutation based on double sequencing data and storage medium
CN107967411B (en) Method and device for detecting off-target site and terminal equipment
CN109886151B (en) False identity attribute detection method
CN113628683B (en) High-throughput sequencing mutation detection method, device and apparatus and readable storage medium
CN105320850A (en) High-throughput sequencing data matching method
US20160026756A1 (en) Method and apparatus for separating quality levels in sequence data and sequencing longer reads
CN108268753A (en) A kind of microorganism group recognition methods and device, equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20211208

Address after: 201209 floor 3, building 1, No. 400, Fangchun Road, pilot Free Trade Zone, Pudong New Area, Shanghai

Patentee after: Huisuan gene technology (Shanghai) Co.,Ltd.

Address before: 201209 floor 3, building 1, No. 400, Fangchun Road, pilot Free Trade Zone, Pudong New Area, Shanghai

Patentee before: SMARTQUERIER BIOMEDICINE (SHANGHAI) CO.,LTD.

TR01 Transfer of patent right