CN108875306A - A kind of method and system for searching Sex Determination sequence - Google Patents
A kind of method and system for searching Sex Determination sequence Download PDFInfo
- Publication number
- CN108875306A CN108875306A CN201810551156.7A CN201810551156A CN108875306A CN 108875306 A CN108875306 A CN 108875306A CN 201810551156 A CN201810551156 A CN 201810551156A CN 108875306 A CN108875306 A CN 108875306A
- Authority
- CN
- China
- Prior art keywords
- sample
- window
- sequence
- male
- genome
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Abstract
The invention discloses a kind of method and system for searching Sex Determination sequence, the method for the lookup Sex Determination sequence includes:The sample of multiple females or male is sequenced, data is obtained, and genome will be referred on comparing, obtains comparison result, wherein include with reference to genome:The sequence of sex determinative chromosome Y or W;Comparison result is handled, the sequencing depth of each base is obtained;Genome on comparing is divided at least one window, and multiple females or male are calculated with the base coverage of each window at least one window;To each window, the average base coverage of multiple males or female sample is counted;According to average base coverage, the Sex Determination sequence of the sample of multiple females or male is determined.
Description
Technical field
The present invention relates to technical field of biological information, and in particular to it is a kind of based on search Sex Determination sequence method and be
System.
Background technique
Currently, the method for looking for Sex Determination sequence is CQ (chromosome quotient method) [1], it will be female
Sample and male sample compared on the genome of Sex Determination respectively, find the big region of coverage difference, then it is assumed that
It is Sex Determination sequence.For example the gender of the mankind is that chromosome is determined by Y, male XY, women XX.In order to find gender
Determining sequence (Y), the sequencing data of male and the sequencing data of women are compared upper male gene group by us, count every dyeing
The base coverage of body, it finds that there is item chromosome, in the comparison result of women, base coverage is very low (0.3),
And in the comparison result of male, base coverage is very high (0.98), then this chromosome is male sex chromosome Y.
But the species (such as diamondback moth) of big, the high heterozygosis of individual variation small for sex chromosome's difference, this method
It is just not applicable.
Summary of the invention
The purpose of the present invention is to provide a kind of method and system for searching Sex Determination sequence, to solve the prior art
The problem of.
To achieve the above object, the technical scheme is that a kind of method for searching Sex Determination sequence, feature exist
In, the method for the lookup Sex Determination sequence, including:The sample of multiple females or male is sequenced, data are obtained, and
Genome will be referred on comparing, obtain comparison result, wherein include with reference to genome:Sex determinative chromosome Y's or W
Sequence;Comparison result is handled, the sequencing depth of each base is obtained;Genome on comparing is divided at least one window
Mouthful, and multiple females or male are calculated with the base coverage of each window at least one window;To each window, system
Count the average base coverage of multiple males or female sample;According to average base coverage, determine multiple female or male
The Sex Determination sequence of sample.
Optionally, the sample of multiple females or male is sequenced, obtains data, and base will be referred on comparing
Because of group, comparison result is obtained, including:The sample of multiple females or male is sequenced by Illumina platform, obtains number
According to, and data BWA software is compared and refers to genome, obtain comparison result.
Optionally, comparison result is handled, the sequencing depth of each base is obtained, including:It is handled using SAMtools software
Comparison result obtains the sequencing depth of each base.
Optionally, the base coverage of each window at least one window is calculated, including:It will be sequenced in each window
Base number of the depth greater than 0 is counted, then divided by the length of each window.
Optionally, according to average base coverage, the Sex Determination sequence of the sample of multiple females or male, packet are determined
It includes:When reference genome includes:The sequence of sex determinative chromosome Y, and determine the average base covering of multiple female samples
Degree is greater than 70%, and when the average base coverage of multiple female samples is less than 10%, is determined as female sex's determining sequence;Or
Person, when reference genome includes:The sequence of sex determinative chromosome W, and determine the average base covering of multiple female samples
Degree is greater than 70%, and when the average base coverage of multiple male samples is less than 10%, is determined as female sex's determining sequence.
To achieve the above object, the technical scheme is that a kind of system for searching Sex Determination sequence, the lookup
The system of other determining sequence, including:Unit, comparing unit, processing unit, computing unit, statistic unit and determination unit is sequenced;
Wherein, unit is sequenced, for the sample of multiple females or male to be sequenced, obtains data;Comparing unit is used for data
Genome is referred in comparison, obtains comparison result, wherein include with reference to genome:The sequence of sex determinative chromosome Y or W;Place
It manages unit and obtains the sequencing depth of each base for handling comparison result;Computing unit, for by the base on comparing
Because group is divided at least one window, and the base for calculating each window at least one window to multiple females or male is covered
Cover degree;Statistic unit, for counting the average base coverage of multiple males or female sample to each window;Determination unit,
For determining the Sex Determination sequence of the sample of multiple females or male according to average base coverage.
Optionally, unit is sequenced, is specifically used for:The sample of multiple females or male is surveyed by Illumina platform
Sequence obtains data;Comparing unit is specifically used for:Data BWA software is compared and refers to genome, obtains comparison result.
Optionally, processing unit is specifically used for:Comparison result is handled using SAMtools software, obtains each base
Depth is sequenced.
Optionally, computing unit is specifically used for:Base number of the depth greater than 0 will be sequenced in each window to count,
Again divided by the length of each window.
Optionally, determination unit is specifically used for:Include with reference to genome when described:The sequence of sex determinative chromosome Y,
And determine that the average base coverage of the multiple female sample is greater than 70%, and the average base of the multiple female sample
When coverage is less than 10%, it is determined as female sex's determining sequence;Alternatively, including with reference to genome when described:Sex Determination dye
The sequence of colour solid W, and determine that the average base coverage of the multiple female sample is greater than 70%, and the multiple male sample
When the average base coverage of product is less than 10%, it is determined as female sex's determining sequence.
The invention has the advantages that:
Whole sequence is cut into small window to analyze, compared with the prior art in chromosome difference hour, it is right
Whole sequence is analyzed, and the Sex Determination sequence in whole sequence can be more accurately found.
Detailed description of the invention
Fig. 1 is that the embodiment of the invention provides a kind of flow charts of method for searching Sex Determination sequence.
Fig. 2 is that the embodiment of the invention provides a kind of structural schematic diagrams of system for searching Sex Determination sequence.
Specific embodiment
The following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention..
Embodiment 1
Fig. 1 is that the embodiment of the invention provides a kind of flow charts of method for searching Sex Determination sequence.As shown in Figure 1,
The method of the lookup Sex Determination sequence includes the following steps:
Step S101:The sample of multiple females or male is sequenced, obtains data, and will refer on comparing
Genome obtains comparison result, wherein include with reference to genome:The sequence of sex determinative chromosome Y or W;
Specifically, the sample of multiple females or male is sequenced by Illumina platform, obtains data, and will
Data BWA software, which compares, refers to genome, obtains comparison result.
It should be noted that sequencing approach includes but is not limited to Illumina platform, by data and genome alignment is referred to
On method include but is not limited to BWA.
In embodiments of the present invention, use female diamondback moth as with reference to genome (ZW, female determine gender), survey respectively
It is (each to have found 10,453 windows by searching for the method for Sex Determination sequence for 50 males and 50 female individuals
Window 30bp).
Step S102:Comparison result is handled, the sequencing depth of each base is obtained;
Specifically, comparison result is handled using SAMtools software, obtains the sequencing depth of each base.
Step S103:Genome on comparing is divided at least one window, and multiple females or male are counted
Calculate the base coverage of each window at least one window;
Specifically, base number of the depth greater than 0 will be sequenced in each window to count, then divided by the length of each window
Degree.
For example, each window can be dimensioned to 50bp, each window can also be sized to difference
's.
Step S104:To each window, the average base coverage of multiple males or female sample is counted;
Step S105:According to average base coverage, the Sex Determination sequence of the sample of multiple females or male is determined.
Specifically, when reference genome includes:The sequence of sex determinative chromosome Y, and determine multiple female samples
Average base coverage is greater than 70%, and when the average base coverage of multiple female samples is less than 10%, is determined as female
Other determining sequence;Alternatively, when including with reference to genome:The sequence of sex determinative chromosome W, and determine multiple female samples
Average base coverage is greater than 70%, and when the average base coverage of multiple male samples is less than 10%, is determined as female
Other determining sequence.
Embodiment 2
Fig. 2 is that the embodiment of the invention provides a kind of systems for searching Sex Determination sequence.As shown in Fig. 2, the lookup
The system of other determining sequence, including:Unit 21, comparing unit 22, processing unit 23, computing unit 24, statistic unit 25 is sequenced
With determination unit 26;Wherein,
Unit 21 is sequenced, for the sample of multiple females or male to be sequenced, obtains data;Comparing unit 22 is used
In genome will be referred on comparing, comparison result is obtained, wherein include with reference to genome:Sex determinative chromosome Y or W
Sequence;Processing unit 23 obtains the sequencing depth of each base for handling comparison result;Computing unit 24, for that will count
It is divided at least one window according to the genome in comparison, and multiple females or male is calculated each at least one window
The base coverage of window;Statistic unit 25, for each window, the average base for counting multiple males or female sample to cover
Cover degree;Determination unit 26, for determining the Sex Determination sequence of the sample of multiple females or male according to average base coverage
Column.
Optionally, unit 21 is sequenced, is specifically used for:The sample of multiple females or male is carried out by Illumina platform
Sequencing obtains data.
Optionally, comparing unit 22 are specifically used for:Data BWA software is compared and refers to genome, obtains and compares knot
Fruit.
Optionally, processing unit 23 are specifically used for:Comparison result is handled using SAMtools software, obtains each base
Sequencing depth.
Optionally, computing unit 24 are specifically used for:
Base number of the depth greater than 0 will be sequenced in each window to count, then divided by the length of each window.
Optionally, determination unit 26 are specifically used for:When reference genome includes:The sequence of sex determinative chromosome Y, with
And determine that the average base coverage of multiple female samples is greater than 70%, and the average base coverage of multiple female samples is less than
When 10%, it is determined as female sex's determining sequence;Alternatively, when including with reference to genome:The sequence of sex determinative chromosome W, with
And determine that the average base coverage of multiple female samples is greater than 70%, and the average base coverage of multiple male samples is less than
When 10%, it is determined as female sex's determining sequence.The description of 26 part of determination unit can be detailed in the description of the part step S105.
The embodiment of the present invention is analyzed by the way that whole sequence is cut into small window, compared with the prior art in
Chromosome difference hour, whole sequence is analyzed, can more accurately find the Sex Determination sequence in whole sequence.
Although above having used general explanation and specific embodiment, the present invention is described in detail, at this
On the basis of invention, it can be made some modifications or improvements, this will be apparent to those skilled in the art.Therefore,
These modifications or improvements without departing from theon the basis of the spirit of the present invention are fallen within the scope of the claimed invention.
Claims (10)
1. a kind of method for searching Sex Determination sequence, which is characterized in that the method, including:
The sample of multiple females or male is sequenced, data is obtained, and genome will be referred on the comparing, obtains
Obtain comparison result, wherein described to include with reference to genome:The sequence of sex determinative chromosome Y or W;
The comparison result is handled, the sequencing depth of each base is obtained;
Genome on the comparing is divided at least one window, and to described in the multiple female or male calculating
The base coverage of each window at least one window;
To each window, the average base coverage of the multiple male or female sample is counted;
According to the average base coverage, the Sex Determination sequence of the sample of the multiple female or male is determined.
2. the method according to claim 1, wherein it is described by it is multiple female or male sample be sequenced,
Data are obtained, and genome will be referred on the comparing, obtain comparison result, including:
The sample of multiple females or male is sequenced by Illumina platform, obtains data, and the data are used
BWA software compares and refers to genome, obtains comparison result.
3. the method according to claim 1, wherein the processing comparison result, obtains each base
Depth is sequenced, including:
The comparison result is handled using SAMtools software, obtains the sequencing depth of each base.
4. the method according to claim 1, wherein each window at least one window described in the calculating
Base coverage, including:
Base number of the depth greater than 0 will be sequenced in each window to count, then divided by the length of each window.
5. the method according to claim 1, wherein described according to the average base coverage, determine described in
The Sex Determination sequence of the sample of multiple females or male, including:
Include with reference to genome when described:The sequence of sex determinative chromosome Y, and determine being averaged for the multiple female sample
Base coverage is greater than 70%, and when the average base coverage of the multiple female sample is less than 10%, is determined as female
Other determining sequence;Alternatively,
Include with reference to genome when described:The sequence of sex determinative chromosome W, and determine being averaged for the multiple female sample
Base coverage is greater than 70%, and when the average base coverage of the multiple male sample is less than 10%, is determined as female
Other determining sequence.
6. a kind of system for searching Sex Determination sequence, which is characterized in that the system, including:Be sequenced unit, comparing unit,
Processing unit, computing unit, statistic unit and determination unit;Wherein,
The sequencing unit obtains data for the sample of multiple females or male to be sequenced;
The comparing unit obtains comparison result for will refer to genome on the comparing, wherein described to refer to base
Because group includes:The sequence of sex determinative chromosome Y or W;
The processing unit obtains the sequencing depth of each base for handling the comparison result;
The computing unit, for the genome on the comparing to be divided at least one window, and to the multiple
Female or male calculate the base coverage of each window at least one described window;
The statistic unit, for counting the average base covering of the multiple male or female sample to each window
Degree;
The determination unit, for determining the property of the sample of the multiple female or male according to the average base coverage
Other determining sequence.
7. system according to claim 6, which is characterized in that the sequencing unit is specifically used for:By multiple females or hero
Property sample be sequenced by Illumina platform, obtain data;
The comparing unit, is specifically used for:Data BWA software is compared and refers to genome, obtains comparison result.
8. system according to claim 6, which is characterized in that the processing unit is specifically used for:
The comparison result is handled using SAMtools software, obtains the sequencing depth of each base.
9. system according to claim 6, which is characterized in that the computing unit is specifically used for:
Base number of the depth greater than 0 will be sequenced in each window to count, then divided by the length of each window.
10. the method according to claim 1, wherein the determination unit, is specifically used for:
Include with reference to genome when described:The sequence of sex determinative chromosome Y, and determine being averaged for the multiple female sample
Base coverage is greater than 70%, and when the average base coverage of the multiple female sample is less than 10%, is determined as female
Other determining sequence;Alternatively,
Include with reference to genome when described:The sequence of sex determinative chromosome W, and determine being averaged for the multiple female sample
Base coverage is greater than 70%, and when the average base coverage of the multiple male sample is less than 10%, is determined as female
Other determining sequence.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810551156.7A CN108875306A (en) | 2018-05-31 | 2018-05-31 | A kind of method and system for searching Sex Determination sequence |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810551156.7A CN108875306A (en) | 2018-05-31 | 2018-05-31 | A kind of method and system for searching Sex Determination sequence |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108875306A true CN108875306A (en) | 2018-11-23 |
Family
ID=64336195
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810551156.7A Pending CN108875306A (en) | 2018-05-31 | 2018-05-31 | A kind of method and system for searching Sex Determination sequence |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108875306A (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090081677A1 (en) * | 2007-09-26 | 2009-03-26 | Kaohsiung Medical University | Method for gender identification of eagles with probe-based real-time pcr and the sequences used for gender identification of eagles |
CN103384887A (en) * | 2010-05-25 | 2013-11-06 | 加利福尼亚大学董事会 | BAMBAM: parallel comparative analysis of high-throughput sequencing data |
CN103805689A (en) * | 2012-11-15 | 2014-05-21 | 深圳华大基因科技服务有限公司 | Characteristic kmer based metatypic chromosomal sequence assembly method and application thereof |
CN103955630A (en) * | 2014-03-26 | 2014-07-30 | 田埂 | Method for preparing reference database and performing target area sequence alignment on to-be-tested free nucleic acid samples |
CN105574361A (en) * | 2015-11-05 | 2016-05-11 | 上海序康医疗科技有限公司 | Method for detecting variation of copy numbers of genomes |
-
2018
- 2018-05-31 CN CN201810551156.7A patent/CN108875306A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090081677A1 (en) * | 2007-09-26 | 2009-03-26 | Kaohsiung Medical University | Method for gender identification of eagles with probe-based real-time pcr and the sequences used for gender identification of eagles |
CN103384887A (en) * | 2010-05-25 | 2013-11-06 | 加利福尼亚大学董事会 | BAMBAM: parallel comparative analysis of high-throughput sequencing data |
CN103805689A (en) * | 2012-11-15 | 2014-05-21 | 深圳华大基因科技服务有限公司 | Characteristic kmer based metatypic chromosomal sequence assembly method and application thereof |
CN103955630A (en) * | 2014-03-26 | 2014-07-30 | 田埂 | Method for preparing reference database and performing target area sequence alignment on to-be-tested free nucleic acid samples |
CN105574361A (en) * | 2015-11-05 | 2016-05-11 | 上海序康医疗科技有限公司 | Method for detecting variation of copy numbers of genomes |
Non-Patent Citations (2)
Title |
---|
HALL, ANDREW BRANTLEY 等: "A male-determining factor in the mosquito Aedes aegypti", 《SCIENCE》 * |
杨莹莹: "性别决定相关因子的研究进展", 《现代生物医学进展》 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106096330B (en) | A kind of noninvasive antenatal biological information determination method | |
Chen et al. | Automated flow cytometric analysis across large numbers of samples and cell types | |
CN114242162B (en) | Method for establishing drug synergy prediction model, prediction method and corresponding device | |
CN108268752B (en) | A kind of chromosome abnormality detection device | |
WO2019213811A1 (en) | Method, apparatus, and system for detecting chromosomal aneuploidy | |
Taylor et al. | Identity-by-descent with uncertainty characterises connectivity of Plasmodium falciparum populations on the Colombian-Pacific coast | |
CN111226281B (en) | Method and device for determining chromosome aneuploidy and constructing classification model | |
EP2602734A1 (en) | Robust variant identification and validation | |
CN108595912A (en) | Detect the method, apparatus and system of chromosomal aneuploidy | |
CN102618630A (en) | Application of Y-STR (Y chromosome-short tandem repeat) | |
US20150356238A1 (en) | Scoring the Deviation of an Individual with High Dimensionality from a First Population | |
WO2018060365A1 (en) | Genomic variant ranking system for clinical trial matching | |
Jofre et al. | An Indian lineage of Histoplasma with strong signatures of differentiation and selection | |
CN108875306A (en) | A kind of method and system for searching Sex Determination sequence | |
Arenas et al. | Identifying extreme observations, outliers and noise in clinical and genetic data | |
Munot et al. | Automated karyotyping of metaphase cells with touching chromosomes | |
CN111724911A (en) | Target drug sensitivity prediction method and device, terminal device and storage medium | |
CN108715891A (en) | A kind of the expression quantitative approach and system of transcript profile data | |
WO2019213810A1 (en) | Method, apparatus, and system for detecting chromosome aneuploidy | |
US7480645B2 (en) | Method for estimating the relevance of a document with respect to a concept | |
CN110970089B (en) | Pretreatment method and pretreatment device for fetal concentration calculation and application of pretreatment device | |
US11535896B2 (en) | Method for analysing cell-free nucleic acids | |
CN113658338A (en) | Point cloud tree monomer segmentation method and device, electronic equipment and storage medium | |
JP4307807B2 (en) | Data processing apparatus, method, and computer program | |
Dilber et al. | Robust detection of natural selection using a probabilistic model of tree imbalance |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20181123 |