CN114574595A - Application of human chromosome InDel gene locus, primer group, product thereof and individual identification method of test material - Google Patents

Application of human chromosome InDel gene locus, primer group, product thereof and individual identification method of test material Download PDF

Info

Publication number
CN114574595A
CN114574595A CN202210219948.0A CN202210219948A CN114574595A CN 114574595 A CN114574595 A CN 114574595A CN 202210219948 A CN202210219948 A CN 202210219948A CN 114574595 A CN114574595 A CN 114574595A
Authority
CN
China
Prior art keywords
dna
artificial sequence
indel
loci
individual identification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202210219948.0A
Other languages
Chinese (zh)
Other versions
CN114574595B (en
Inventor
王升启
张庆珍
周喆
王雪倩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Academy of Military Medical Sciences AMMS of PLA
Original Assignee
Academy of Military Medical Sciences AMMS of PLA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Academy of Military Medical Sciences AMMS of PLA filed Critical Academy of Military Medical Sciences AMMS of PLA
Priority to CN202210219948.0A priority Critical patent/CN114574595B/en
Publication of CN114574595A publication Critical patent/CN114574595A/en
Application granted granted Critical
Publication of CN114574595B publication Critical patent/CN114574595B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6888Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/156Polymorphic or mutational markers

Abstract

The invention provides an application of a human chromosome InDel gene locus, a primer group, a product thereof and an individual identification method of a test material, and relates to the technical field of biology. The invention provides application of 66InDel gene loci of human chromosomes in individual identification or paternity identification. The inventor independently screens 66InDel loci from a dbSNP database, evaluates population genetics parameters of 66-plex InDels (66 InDel loci) in 251 Chinese Han population, and analyzes the forensic application value of the parameters. The results show that: the 66InDel gene loci have good genetic polymorphism in Chinese Han population, and can be independently applied to forensic individual identification and paternity identification research.

Description

Application of human chromosome InDel gene locus, primer group, product thereof and individual identification method of test material
Technical Field
The invention relates to the technical field of biology, in particular to an application of a human chromosome InDel gene locus, a primer group, a product thereof and an individual identification method of a test material.
Background
Short Tandem Repeat (STR) has been widely used in the fields of individual identification and paternity testing due to its high sensitivity and strong identification ability. But STR mutation rate phaseFor higher, about 10-3There may be misjudgment in the paternity test; and the fragment is longer (100-400bp), and the detection success rate in the degraded test material is lower. Insertion/deletion polymorphisms (InDel) are a novel genetic marker, widely distributed in the human genome; the mutation rate is significantly lower than STR, about 10-8(ii) a Amplicons are shorter (<200bp) is more suitable for detecting highly degraded samples; the two-state structure is easier to type and detect.
At present, some researches design an InDel genetic marker multiplex amplification system aiming at different countries and different groups, and provide basic data for individual identification and paternity identification by evaluating group genetics related parameters of a gene locus. The InDel commercial typing kit widely applied at home and abroad mainly comprises
Figure BDA0003536579570000011
DIPplex kit (QIAGEN, germany) and AGCU InDel 50kit (german biotechnology limited, wunst).
Figure BDA0003536579570000012
The DIPplex kit takes the gene frequency of the German population as a main screening basis, and has lower identification efficiency on the Chinese population. The 30 InDel accumulated individual recognition rates (TDP) in the Beijing Han nationality population were 0.999999999985, the TDP in the Guizhou Gelao and Miao populations were 0.99999999934 and 0.99999999686, respectively, but the TDP in the Spain population was as high as 0.999999999999. The AGCU InDel 50kit is a commercialized typing kit developed in China, and comprises 47 autosomal InDel gene loci, 2Y chromosome InDel gene loci and 1 individual identification gene locus (Amelogenin). The cumulative individual recognition rate (TDP) and cumulative non-paternity exclusion probability (CPE) for 47 indels in the chinese population were 0.999999999999999 and 0.9997, respectively. Although 47 InDel identified more potent than TDP (0.99999999998) of conventional STR system, able to meet individual identification requirements, CPEtrio<0.9999, the paternity test was less effective and was in need of further improvement.
In view of the above, the present invention is particularly proposed.
Disclosure of Invention
A first object of the present invention is to provide an application of the human chromosomal InDel locus to solve at least one of the above problems.
The second objective of the invention is to provide a primer set for amplifying the InDel locus.
It is a third object of the present invention to provide a reagent.
The fourth object of the present invention is to provide a kit.
The fifth purpose of the invention is to provide the application of the primer group, the reagent or the kit in individual identification or paternity test.
A sixth object of the present invention is to provide a method for identifying an individual of a material to be tested.
In a first aspect, the invention provides the use of a human chromosomal InDel locus in any of the following a) -b):
a) individual identification;
b) performing paternity test;
the InDel locus comprises: rs56113354, rs66493016, rs34947838, rs10562732, rs10555133, rs3028599, rs34105738, rs10541877, rs3030108, rs10535391, rs142380122, rs5849187, rs59715790, rs139940865, rs35052276, rs10562881, rs59027185, rs35908330, rs70954609, rs147779273, rs35829610, rs60570384, rs 146109109109101, rs151142037, rs59258861, rs4053254, rs145041883, rs 12156094, rs 7430767, rs 36363676043, rs34812419, rs144650377, rs 338383838328, rs 914009127, rs 2737701, rs 444593, rs56110100, rs 6656375635799756974, rs 34785635563579975633794, rs 343556355633799756335634798, rs 335634563456345794, rs 3356345794, rs 335634563456345794, rs 33563456345633563356335633563356335633563356339753, rs 34563456345634563456345634975634975634975634975634975634569, rs 335634563457401, rs.
As a further technical solution, the InDel locus comprises: rs56113354, rs66493016, rs34947838, rs10562732, rs10555133, rs3028599, rs34105738, rs10541877, rs3030108, rs10535391, rs142380122, rs5849187, rs59715790, rs139940865, rs35052276, rs10562881, rs59027185, rs35908330, rs70954609, rs147779273, rs35829610, rs60570384, rs 146109109109101, rs151142037, rs59258861, rs4053254, rs145041883, rs 12156094, rs 7430767, rs 363636043, rs34812419, rs144650377, rs 33838383839, rs 9140027, rs 27701, rs 444593, rs56110100, rs 6656357997567997567841401, rs 34799756335633794, rs 3456355635563379975634794, rs 3456355635563356335633563356349794, rs 3456345634563456345634563456345794, rs 335634563456345633563356335633563356349794, rs 34563456345634563456345634563456345634563456345634563456345634563456345633563497569, rs 3356345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 335634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 3356345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 33563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 3356345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 335634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456.
In a second aspect, the invention provides a primer set for amplifying the InDel locus, wherein the sequence of the primer set is shown as SEQ ID No. 1-SEQ ID No. 144.
In a third aspect, the present invention provides a reagent comprising the above primer set.
In a fourth aspect, the present invention provides a kit comprising the primer set or the reagent.
As a further technical scheme, the kit also comprises a reaction buffer solution and a hot start Taq enzyme.
In a fifth aspect, the present invention provides a primer set, reagent or kit for use in individual identification or paternity testing.
In a sixth aspect, the present invention provides a method for identifying individuals of a material to be tested, comprising the steps of:
a. extracting DNA of a material to be detected;
b. obtaining the typing result of the DNA InDel gene locus;
c. performing individual identification according to the genotype of the locus;
the InDel locus comprises: rs56113354, rs66493016, rs34947838, rs10562732, rs10555133, rs3028599, rs34105738, rs10541877, rs3030108, rs10535391, rs142380122, rs5849187, rs59715790, rs139940865, rs35052276, rs10562881, rs59027185, rs35908330, rs70954609, rs147779273, rs35829610, rs60570384, rs 146109109109101, rs151142037, rs59258861, rs4053254, rs145041883, rs 12156094, rs 7430767, rs 363636043, rs34812419, rs144650377, rs 33838383839, rs 9140027, rs 27701, rs 444593, rs56110100, rs 6656357997567997567841401, rs 34799756335633794, rs 3456355635563379975634794, rs 3456355635563356335633563356349794, rs 3456345634563456345634563456345794, rs 335634563456345633563356335633563356349794, rs 34563456345634563456345634563456345634563456345634563456345634563456345633563497569, rs 3356345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 335634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 3356345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 33563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 3356345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 335634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456.
As a further technical scheme, the step b further comprises the step of amplifying the InDel locus by adopting a primer group.
As a further technical scheme, the sequence of the primer group is shown as SEQ ID NO. 1-SEQ ID NO. 144.
Compared with the prior art, the invention has the following beneficial effects:
the inventor independently screens 66InDel loci from a dbSNP database, evaluates population genetics parameters of 66-plex InDels (66 InDel loci) in 251 Chinese Han population, and analyzes the forensic application value of the parameters. The research result shows that: through the correction of the Bonferroni method, 66InDel loci all accord with Hardy-Weinberg equilibrium law (P) in Chinese Han population>0.000758), the loci are in linkage equilibrium. The average observed heterozygosity (Ho) of each locus was 0.482, the average expected heterozygosity (He) was 0.483, the average individual Discrimination (DP) was 0.612, the average Polymorphism Information Content (PIC) was 0.365, and the cumulative individual discrimination (TDP) was 0.99999999999999999999999999942818. Diad cumulative non-paternal exclusion probabilities (CPEs) for 66InDel lociduo) 0.999739, triplets accumulate non-parent exclusion probabilities (CPEs)trio) Is 0.999999999417. KnotThe results show that: the 66InDel gene loci have good genetic polymorphism in Chinese Han population, and can be independently applied to forensic individual identification and paternity identification research.
The primer group provided by the invention has strong specificity and high sensitivity, and can realize amplification of the 66InDel loci.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and other drawings can be obtained by those skilled in the art without creative efforts.
Fig. 1 is a 66InDel locus allele frequency distribution.
Detailed Description
Embodiments of the present invention will be described in detail below with reference to embodiments and examples, but those skilled in the art will understand that the following embodiments and examples are only illustrative of the present invention and should not be construed as limiting the scope of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention. Those who do not specify the conditions are performed according to the conventional conditions or the conditions recommended by the manufacturer. The reagents or instruments used are not indicated by the manufacturer, and are all conventional products available commercially.
In a first aspect, the invention provides the use of a human chromosomal InDel locus in any of the following a) -b):
a) individual identification;
b) performing paternity test;
the InDel locus comprises: rs56113354, rs66493016, rs34947838, rs10562732, rs10555133, rs3028599, rs34105738, rs10541877, rs3030108, rs10535391, rs142380122, rs5849187, rs59715790, rs139940865, rs35052276, rs10562881, rs59027185, rs35908330, rs70954609, rs147779273, rs35829610, rs60570384, rs 146109109109101, rs151142037, rs59258861, rs4053254, rs145041883, rs 12156094, rs 7430767, rs 36363676043, rs34812419, rs144650377, rs 338383838328, rs 914009127, rs 2737701, rs 444593, rs56110100, rs 6656375635799756974, rs 34785635563579975633794, rs 343556355633799756335634798, rs 335634563456345794, rs 3356345794, rs 335634563456345794, rs 33563456345633563356335633563356335633563356339753, rs 34563456345634563456345634975634975634975634975634975634569, rs 335634563457401, rs.
In some preferred embodiments, the InDel locus comprises: rs56113354, rs66493016, rs34947838, rs10562732, rs10555133, rs3028599, rs34105738, rs10541877, rs3030108, rs10535391, rs142380122, rs5849187, rs59715790, rs139940865, rs35052276, rs10562881 881, rs59027185, rs35908330, rs70954609, rs147779273, rs35829610, rs60570384, rs146109101, rs151142037, rs59258861, rs 405353535353535353254, rs 145883, rs 12156094, rs 7430767, rs 36101043, rs34812419, rs144650377, rs 3392838326, rs 9127, rs 152701, rs 444593, rs 1067967110100, rs 6656615661563277567781974, rs 7856341815635563256325632563256327753, rs 33778197563256325632563277401, rs 3377401, rs 335632563256325632563277401, rs 3377401, rs 33563256325632563256325632563277401, rs 3332563256325632563256325632563297401, rs 3332563256325632563256325632563256325632563277401, rs 3332563256325632563256325632563256325632848, rs 3332563256325632563256325632563256325632563256325632563256325632563256325632563284and 34975.
In a second aspect, the invention provides a primer set for amplifying the InDel locus, wherein the sequence of the primer set is shown as SEQ ID No. 1-SEQ ID No. 144.
Primer pairs for amplification of the InDel locus described above are shown in the following table:
Figure BDA0003536579570000071
Figure BDA0003536579570000081
Figure BDA0003536579570000091
it should be noted that two primer pairs are designed for better amplification at the above-mentioned partial loci.
The inventor autonomously screens 66 autosomal InDel loci from a dbSNP database and autonomously constructs a 66InDel multiplex amplification system based on a high-throughput sequencing system. Through the DNA typing detection results of 251 Chinese Han irrelevant individual samples, group genetic parameters of 66InDel in Chinese Han population are analyzed, and basic data are provided for individual identification and parental right identification.
In a third aspect, the present invention provides a reagent comprising the above primer set.
Because the reagent comprises the primer group, the kit can amplify the 66InDel loci.
In a fourth aspect, the present invention provides a kit comprising the above primer set or the above reagent.
Because the kit comprises the primer group, the kit can realize amplification of the 66InDel loci.
In some preferred embodiments, the kit further comprises a reaction buffer and a hot start Taq enzyme.
The reaction buffer and the hot start Taq enzyme are not particularly limited in the present invention, and reagents which are conventional in the art may be used.
In a fifth aspect, the present invention provides a primer set, reagent or kit for use in individual identification or paternity testing.
In a sixth aspect, the present invention provides a method for identifying individuals of a material to be tested, comprising the steps of:
a. extracting DNA of a material to be detected;
b. obtaining the typing result of the DNA InDel gene locus;
c. performing individual identification according to the genotype of the locus;
the InDel locus comprises: rs56113354, rs66493016, rs34947838, rs10562732, rs10555133, rs3028599, rs34105738, rs10541877, rs3030108, rs10535391, rs142380122, rs5849187, rs59715790, rs139940865, rs35052276, rs10562881, rs59027185, rs35908330, rs70954609, rs147779273, rs35829610, rs60570384, rs 146109109109101, rs151142037, rs59258861, rs4053254, rs145041883, rs 12156094, rs 7430767, rs 363636043, rs34812419, rs144650377, rs 33838383839, rs 9140027, rs 27701, rs 444593, rs56110100, rs 6656357997567997567841401, rs 34799756335633794, rs 3456355635563379975634794, rs 3456355635563356335633563356349794, rs 3456345634563456345634563456345794, rs 335634563456345633563356335633563356349794, rs 34563456345634563456345634563456345634563456345634563456345634563456345633563497569, rs 3356345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 335634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 3356345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 33563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 3356345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 335634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456.
The method is simple, and can effectively realize the inference of unknown material sources at the gene level.
In some preferred embodiments, the b step further comprises the step of amplifying the InDel locus using a primer set.
In some preferred embodiments, the sequences of the primer sets are shown in SEQ ID NO.1 to SEQ ID NO. 132.
The present invention is further illustrated by the following specific examples and comparative examples, but it should be understood that these examples are for illustrative purposes only and are not to be construed as limiting the invention in any way.
Example 1
1 materials and methods
1.1InDel screening procedure
The study was based on the dbSNP database (https:// www.ncbi.nlm.nih.gov/snp /) dataset with human GRCh37.hg19 as the reference genome, and the InDel loci with higher identification potency among Chinese populations were screened using VCFtools and plink software. Specific screening criteria are as follows: (1) a binary polymorphic insertion/deletion sequence located in an autosomal intron region with a Global Minor Allele Frequency (GMAF) between 0.4 and 0.5, yielding 17374 loci; (2) satisfies Hardy-Weinberg equilibrium (HWE) law, PHWE>0.05, heterozygosity>0.4, resulting in 1263 loci; (3) coincidence with chain equilibrium test (LE), r2<0.01, yielding 987 loci; (4) inter-population (east Asia, south Asia, Europe, America, Africa) population differentiation index (Fst)<0.06, 549 loci are obtained; (5) segment length of InDel locus>1bp, obtaining 172 gene loci; (6) on the same chromosome, the physical distance between InDel loci is more than or equal to 10MB, and 93 loci are obtained. And (3) carrying out multiplex Primer design on 93 loci obtained by the screening standard by adopting Primer Premier 5.0 and MFEprimer software, considering factors such as Primer specificity, Primer length, annealing temperature, Primer structure design and the like to ensure that each pair of primers can be amplified uniformly, and finally determining 66InDel loci for composite amplification. There are no reports in the literature relating to medically sensitive information for the 66InDel loci.
1.2 sample Collection and DNA extraction
According to the principle of informed consent, FTA blood cards prepared by collecting peripheral blood samples of 100 Chinese Han independent individuals and peripheral blood of 151 Chinese Han independent individuals comprise 68 cases of Beijing Han nationality, 50 cases of Henan Han nationality, 41 cases of Shandong Han nationality, 21 cases of inner Mongolian Han nationality, 18 cases of Shanxi Han nationality, 16 cases of Shenyang Han nationality, 16 cases of Lanzhou Han nationality, 11 cases of Jiangxi Han nationality and 10 cases of Guangdong Han nationality. Peripheral blood samples were all obtained
Figure BDA0003536579570000121
DNA was extracted from the DNA Investigator Kit (Qiagen, Germany). DNA quantification was performed using a Qubit 3.0 fluorescence quantifier (Thermo Fisher, USA). Storing at-20 deg.C for use. 151 independent of each otherAnd (4) preparing the peripheral blood into an FTA blood card, and storing the FTA blood card in a ventilated and dry place for later use.
1.3PCR amplification and library preparation
By using
Figure BDA0003536579570000122
Custom Panel (IGMU258V1) (GJIETAIKANG Biotech Co., Ltd.) (it should be noted that the kit is designed for 66InDel loci of the present invention, and includes the Primer set provided by the present invention) to perform PCR multiplex amplification, the total amplification system is 30 μ L, wherein Enhancer buffer NB (1N)3.5 μ L, Enhance buffer M2.5 μ L, Primer (including 144 primers provided by the present invention, sequence shown in SEQ ID NO. 1-SEQ ID NO. 132) 5 μ L, IGT-EM808 polymerase mix 10 μ L, template DNA 5 μ L (1ng/μ L) or FTA blood card sample DNA with diameter of 1mm, and deionized water to 30 μ L. In the experiment process, 9947A and 9948 are used as positive controls, and deionized water is used as a negative control. Using ProFlexTMAmplification was performed using a PCR amplification apparatus (Applied Biosystems, USA) under the following conditions: 3min at 95 ℃ for 30 s; 20s at 98 ℃, 4min at 60 ℃ and 18 cycles; finally, the mixture is stored at 72 ℃ for 10min and 4 ℃.
The PCR product was purified by AMPure XP magnetic beads (Beckman Coulter, USA), and 13.5. mu.L of the PCR product, Index-i 51. mu.L, Index-i 71. mu.L, Enhance buffer M2.5. mu.L, and deionized water 2. mu.L were added. By ProFlexTMAmplification was performed using a PCR amplification apparatus (Applied Biosystems, USA) under the following conditions: 3min at 95 ℃ for 30 s; 20s at 98 ℃, 1min at 58 ℃ and 30s at 72 ℃ for 9 cycles; finally, the mixture is stored at 72 ℃ for 5min and 4 ℃.
1.4 high throughput sequencing and InDel typing
The sample library was quality controlled and quantified using an Agilent 2100 bioanalyzer (Agilent Inc., USA) and 7500 real-time fluorescent quantitative PCR (Thermo Fisher, USA), followed by high-throughput sequencing of the 10pM DNA mixed library on a MiSeq high-throughput sequencer (Illinia, USA) using the MiSeq v2 kit (2X 150 cycles) (Illinia, USA). Typing analysis was performed on 66InDel loci and Amel sex loci using CLC Genomics Workbench 20.0 (Qiagen, Germany). Firstly, performing quality control on the fastq sequence through QC for sequencing Reads and Trim Reads functions, including quality evaluation and data trimming of original data (removing low-quality Reads and linker sequences with a quality score of less than 30 in original sequencing Reads). Importing a Structural Variant Call module according to the bed file configured in the amplification area; configuring a vcf file according to the InDel position and the variation information, and importing an Identify Known details from maps module. According to the standard GA/T1693-. Typing results were obtained for each locus.
1.5 statistical analysis
Allele Frequencies (AF) of 66InDel loci, observed heterozygosity (Ho), expected heterozygosity (He) were calculated using GenAIEx 6.5 software, Hardy-Weinberg equilibrium and linkage equilibrium tests between loci were performed on each locus, and a Typical Paternity Index (TPI) was calculated. The identification power (DP), the cumulative identification power (TDP), and the Polymorphism Information Content (PIC) were calculated using the Modifie-Powerstates software package. Calculating cumulative diad non-parent exclusion probability (CPE) by using Cervus 3.0 software (http:// www.fieldgenetics.com/pages/download. jsp)duo) And cumulative triplet non-parent exclusion probability (CPE)trio)
2 results
2.166-plex InDels gene locus genetic information and standard typing results
The IGMU kit can effectively amplify 251 samples of Chinese Han nationality irrelevant individuals, the mass fraction Q30 of all the samples is more than 95 percent through the analysis of CLC Genomics Workbench 20.0 software, the average sequencing depth of an amplicon is 1550X, the main distribution range of the multiple PCR product fragments is 150 plus one 180bp, and the Agilent 2100 bioanalyzer has no non-specific amplification under the quality control. Specific InDel locus genetic information and positive standard (9947A, 9948) typing are shown in Table 1.
TABLE 166 InDel locus genetic information and positive standard typing
Figure BDA0003536579570000141
Figure BDA0003536579570000151
Figure BDA0003536579570000161
Note: 1) rs # is the ID number of the InDel locus in the dbSNP database; 2) physical location of the InDel locus on the human genome database (grch37.p19) chromosome, 3) reference allel (Ref) was named 1in the subject line and alternative allel (Alt) was named 2 in the subject line.
2.266-plex InDels Hardy-Weinberg equilibrium test and linkage equilibrium test in Chinese Han population
Hardy-Weinberg equilibrium test is carried out on 66 loci, and the genotype distribution of other 63 loci conforms to Hardy-Weinberg equilibrium (P) except three loci of rs146109101, rs3061475 and rs2307807>0.05), the specific P values are shown in Table 2. Corrected by the Bonferroni method, the P value is 0.000758 (0.05/66), and the P values of 66InDel loci are all more than 0.000758, which accords with Hardy-Weinberg balance. Combining 66 loci in pairs for linkage equilibrium test to yield 2145 (2145 [. sub.N × (N-1))]N is the number of loci studied) to obtain the linkage equilibrium P between loci. No pairwise linkage was found at any of the 66 loci, corrected by Bonferroni, at 0.0000233 (0.05/2145). Therefore, TDP and CPE can be performedduo、CPEtrioThe calculation of (2).
2.366-Plex InDels population genetics parameter investigation in Chinese Han population
According to the statistical analysis of GenAIEx 6.5 software, except that the Minor Allele Frequency (MAF) of the rs79444593 locus is lower than 0.1, the minor allele frequency distribution of the rest 65 InDel loci is between 0.28 and 0.5, the average value is 0.4366, the allele frequency distribution is relatively balanced in Han people in China, and the allele frequency distribution of 66InDel loci is shown in figure 1. Statistics show that the minimum values of Ho, He, DP, PIC, PE and TPI occur at the locus rs 79444593. Except rs79444593, the heterozygosity (Ho) of the other 65 loci is 0.390-0.562, the expected heterozygosity (He) is 0.409-0.500, and the difference between the heterozygosity (Ho) and the expected heterozygosity (He) is small; the individual recognition rate (DP) is 0.562-0.653, and the higher the DP is, the stronger the efficacy of the genetic marker in recognizing different individuals is; the Polymorphism Information Content (PIC) is 0.325-0.375, and the higher the PIC is, the more the information content of the genetic marker is; the cumulative paternity index (TPI) is 0.820-1.141. The cumulative individual identification probability (TDP) is 0.99999999999999999999999999942818. Specific locus parameters are shown in table 2. Except the rs79444593 locus, the polymorphism of other loci in Chinese Han population is high, and 66-plex InDels is suitable for the individual identification research of Chinese Han population.
Table 266 InDel loci forensic genetics parameters (n 251)
Figure BDA0003536579570000171
Figure BDA0003536579570000181
Figure BDA0003536579570000191
Application of 2.466-plex InDels in individual recognition research
Figure BDA0003536579570000192
DIPPlex kit and AGCU InDel 50kit were used as common commercial InDel typing systems in individualsThe application of the field of identification and paternity test is wider. Mixing 66-plex InDels with
Figure BDA0003536579570000193
The efficacy of DIPLEX, AGCU InDel 50 and a commonly used STR commercial kit (DNA type 21. TM.) was compared.
Figure BDA0003536579570000194
DIPplex、AGCU InDel 50、 DNA Typer 21TMThe cumulative individual recognition efficiencies (TDPs) for the Chinese Han population were 0.99999997369, 0.999999999999999999635, and 0.99999999999999999999999885, respectively. The accumulated individual identification rate of the AGCU InDel 50 system is higher than that of a common STR system TDP (0.99999999998) (see: Wang Yali, Song Xiang, Kui Jie, Bai Hui Ru, round, Zhang Jia Yi, Yang Yue, Shi is unique, and 30 genetic polymorphisms of autosomal InDel [ J ] of Mongolian population in the red peak area of inner Mongolia]Life science study, 2019,23(05): 359-. The cumulative individual recognition rate (TDP) for 66-plex InDels of this study was 0.99999999999999999999999999942818, which is much higher than that for
Figure BDA0003536579570000201
DIPplex, AGCU InDel 50 and DNA type 21TM kits, indicating that 66-plex InDels are more potent at recognizing different individuals.
Application of 2.566-plex InDels in paternity test case
Sample information: FTA blood cards prepared by collecting peripheral blood samples of 100 Chinese Han independent individuals and peripheral blood of 151 Chinese Han independent individuals comprise 68 Beijing Han nationality, 50 Henan Han nationality, 41 Shandong Han nationality, 21 inner Mongolian Han nationality, 18 Shanxi Han nationality, 16 Shenyang Han nationality, 16 Lanzhou Han nationality, 11 Jiangxi Han nationality and 10 Guangdong Han nationality.
The experimental steps are as follows: refer to steps 1.2-1.5.
For case analysis of paternity test, use is made of
Figure BDA0003536579570000202
DIPplex system, dual non-father exclusion probability (CPE) of cantonese han nationality of chinaduo) 0.942342, triple non-parent exclusion probability (CPE)trio) Is 0.994748. AGCU InDel 50kit triplet non-parent exclusion probability (CPE) in Guangzhou Han nationalitytrio) Is 0.999732. Triple non-paternal exclusion probability (CPE) for two commercial kitstrio) All are less than 0.9999, the efficiency for identifying the diad case is lower, and the method cannot be independently applied to paternity test cases. As shown in Table 3, 66-plex InDels bigeminal non-parent exclusion Probability (PE)duo) 0.0155-0.125, triple non-father exclusion Probability (PE)trio) 0.1415-0.2812, the higher the probability of excluding non-parents of a locus, the higher the efficacy of excluding non-parents when performing paternity testing analysis. CPE in 66-plex InDels for 251 unrelated individual samples analyzed by Cervus softwareduoIs 0.999739, CPEtrioIs 0.999999999417. CPE for 66-plex InDelstrioMore than 20 STR loci (CPE)trio0.999999996 (see: Yang Xiu, mu Rui, Cao Rad Wei, Wang Wu, Li Chun Xiang, Cai Jun Cheng, Liu Hua Jie, Mawenhua, Xinjiang Stone river region Kazak nationality, Hui nationality, Han 20 STR loci genetic polymorphism research [ J]Laser biology newspaper, 2021,30(04):339-347.)) has stronger identification efficacy, which indicates that 66-plex InDels can be independently applied to the analysis of triplet paternity test cases, but other commercialized kits are needed for the combined use when the system efficacy of the dyad case does not meet the requirement of forensic paternity test. The four systems are compared in the identifying efficacy of Chinese Han population, as shown in Table 4.
Table 366 InDel loci bivalent non-paternal exclusion probabilities and tripartite non-paternal exclusion probabilities (n 251)
Figure BDA0003536579570000211
TABLE 4 comparison of the identifying Performance of different systems
Figure BDA0003536579570000212
Discussion of the preferred embodiments
STR is the most common polymorphic genetic marker in the field of forensic physical evidence identity identification at present, and the detection length of a composite amplification system is generally distributed between 100-500bp, so that the STR is not suitable for detecting highly degraded and low-copy samples. Indels are used as the genetic markers of the two alleles, and the amplified segments of the genetic markers are all smaller than 200bp, so that the method is suitable for the typing of degradation test materials. The study autonomously constructed an Illumina sequencing platform-based multiplex amplification system (66-plex InDels) comprising 66 autosomal Indel loci and one Amel sex locus.
The genetic parameters of 66-plex InDels were evaluated by genetic polymorphism investigation of 251 unrelated individuals in Han nationality of China. The results show that 66InDel loci are distributed on 18 chromosomes, and the average allele frequency of the InDel loci in Chinese Han population is 0.5242; the average observed heterozygosity is 0.482; the average expected heterozygosity is 0.483; the average match probability is 0.388; the average individual recognition rate was 0.612; the average polymorphism information content is 0.365; the average cumulative paternity index was 0.974. All loci, after Bonferroni correction, obey Hardy-Weinberg equilibrium law (P > 0.000758) and no linkage disequilibrium (P > 0.0000233) between each two loci. The rs79444593 locus has an MAF of 0.4385 in the east Asian population of the International thousand-people genome project, but the MAF of the locus is 0.09761 in the Chinese Han population in the research, which shows that the locus has low genetic polymorphism and poor identification efficiency in the Chinese Han population. In subsequent experiments, random sampling errors may be eliminated as much as possible by increasing the sample size, considering that random sampling may also cause errors.
Previous studies have shown that the individual identification efficacy of about 60 InDel loci can be equivalent to the commercial STR kit currently used in forensic identification (e.g., the Sinofiler kit contains 15 STR loci). DP is used as an important index for forensic individual identification, DP is the probability that two random individual phenotypes are different, and the larger the DP is, the stronger the individual identification efficiency is. A calculation was made for 66-plex InDels with a cumulative individual identification probability (TDP) of 0.99999999999999999999999999942818 and a cumulative non-parent exclusion probability (CPE) of 0.999999999417. TDP higher than DNA type 21TMThe identification efficiency of 20 autosomal STRs in the kit is high, and the CPE is high>0.9999, therefore, 66-plex InDels can be independently applied to individual identification and paternity identification, and can also be used as a supplementary gene locus of an existing commercial kit to carry out complex genetic relationship identification.
The research has the defects that the genetic polymorphism of an individual gene locus (rs79444593) in Chinese Han population is poor, the site can be replaced by optimizing a multiple multiplex amplification system, and more InDel gene loci can be added to improve the system efficiency of 66-plex InDels. Meanwhile, the subsequent research can analyze the genetic polymorphism of other ethnic groups of Chinese population, evaluate the forensic application value of 66-plex InDels in different populations in China, carry out the balance research of genetic parameters among the populations and explore the universality of 66-plex InDels in Chinese population.
The research obtains 66 autosomal InDel gene locus multiplex amplification systems, and provides genetic background data for individual identification and paternity identification research of the InDel gene locus in Chinese population through forensic population genetic analysis of 66-plex InDels in Chinese population.
Finally, it should be noted that: the above embodiments are only used to illustrate the technical solution of the present invention, and not to limit the same; while the invention has been described in detail and with reference to the foregoing embodiments, it will be understood by those skilled in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; and the modifications or the substitutions do not make the essence of the corresponding technical solutions depart from the scope of the technical solutions of the embodiments of the present invention.
SEQUENCE LISTING
<110> military medical research institute of military science institute of people's liberation force of China
Application of <120> human chromosome InDel gene locus, primer group and product thereof, and individual identification method of test material
<160> 144
<170> PatentIn version 3.5
<210> 1
<211> 25
<212> DNA
<213> Artificial sequence
<400> 1
acaagagtag gataggtgcc acttt 25
<210> 2
<211> 23
<212> DNA
<213> Artificial sequence
<400> 2
ttgcttgtgg tatccatggt gac 23
<210> 3
<211> 23
<212> DNA
<213> Artificial sequence
<400> 3
ggccccattt gttgcttatt acc 23
<210> 4
<211> 24
<212> DNA
<213> Artificial sequence
<400> 4
tggagaaatt ctcagcacac ctac 24
<210> 5
<211> 23
<212> DNA
<213> Artificial sequence
<400> 5
tggaagcggg gaaaacaaga agt 23
<210> 6
<211> 22
<212> DNA
<213> Artificial sequence
<400> 6
gagcaacttg gtgcttcttg ga 22
<210> 7
<211> 23
<212> DNA
<213> Artificial sequence
<400> 7
aaccttcaca acaaccctgt gaa 23
<210> 8
<211> 24
<212> DNA
<213> Artificial sequence
<400> 8
actccagagc ctacatgttt aacc 24
<210> 9
<211> 26
<212> DNA
<213> Artificial sequence
<400> 9
gtgagacagg atgaacagtc ttcccc 26
<210> 10
<211> 26
<212> DNA
<213> Artificial sequence
<400> 10
acacgtgtct ggagagtcta gagaag 26
<210> 11
<211> 22
<212> DNA
<213> Artificial sequence
<400> 11
ttctaaccat tgccctggtg ag 22
<210> 12
<211> 23
<212> DNA
<213> Artificial sequence
<400> 12
ttccatgcag gtcacacaag atc 23
<210> 13
<211> 22
<212> DNA
<213> Artificial sequence
<400> 13
ttacccagtc ttgcccttca tg 22
<210> 14
<211> 27
<212> DNA
<213> Artificial sequence
<400> 14
atctctaaga tgaactccat ttggctt 27
<210> 15
<211> 25
<212> DNA
<213> Artificial sequence
<400> 15
cagccatctt tatcaacttc ccatg 25
<210> 16
<211> 28
<212> DNA
<213> Artificial sequence
<400> 16
tagatattgt tgctcttaca agaggcaa 28
<210> 17
<211> 23
<212> DNA
<213> Artificial sequence
<400> 17
tttatgctct gtcccctcgt gaa 23
<210> 18
<211> 24
<212> DNA
<213> Artificial sequence
<400> 18
ttgcttcagt tctcagcaag ttcc 24
<210> 19
<211> 25
<212> DNA
<213> Artificial sequence
<400> 19
tgaggtactg tgtcctaaac cctat 25
<210> 20
<211> 24
<212> DNA
<213> Artificial sequence
<400> 20
ttcagaagca agagtctacc aacc 24
<210> 21
<211> 23
<212> DNA
<213> Artificial sequence
<400> 21
ccgtggtgcc atattttgag gta 23
<210> 22
<211> 23
<212> DNA
<213> Artificial sequence
<400> 22
tcagaagcaa gagtctacca acc 23
<210> 23
<211> 23
<212> DNA
<213> Artificial sequence
<400> 23
tcaactgcat cacagcttac ctg 23
<210> 24
<211> 25
<212> DNA
<213> Artificial sequence
<400> 24
aggtcttcct aggatagata gcttg 25
<210> 25
<211> 25
<212> DNA
<213> Artificial sequence
<400> 25
cttcatttca actgcatcac agctt 25
<210> 26
<211> 27
<212> DNA
<213> Artificial sequence
<400> 26
aaaggtcttc ctaggataga tagcttg 27
<210> 27
<211> 24
<212> DNA
<213> Artificial sequence
<400> 27
ctgcctctct ttcccctaat tgac 24
<210> 28
<211> 26
<212> DNA
<213> Artificial sequence
<400> 28
aacctccact ttcctaccag aagttc 26
<210> 29
<211> 25
<212> DNA
<213> Artificial sequence
<400> 29
ttttccaaca aatgatgctg ggaaa 25
<210> 30
<211> 25
<212> DNA
<213> Artificial sequence
<400> 30
actggattta accatggttt cttgg 25
<210> 31
<211> 24
<212> DNA
<213> Artificial sequence
<400> 31
ttcatacccc agtgtacctg gaat 24
<210> 32
<211> 24
<212> DNA
<213> Artificial sequence
<400> 32
aattctcata ccagcacagc agtc 24
<210> 33
<211> 28
<212> DNA
<213> Artificial sequence
<400> 33
agtagatcta cagtttgatc cagcaatc 28
<210> 34
<211> 26
<212> DNA
<213> Artificial sequence
<400> 34
gaaacaacct aagtgcctat tgacca 26
<210> 35
<211> 32
<212> DNA
<213> Artificial sequence
<400> 35
aacatcacgt tgtatacttt aaatatacac aa 32
<210> 36
<211> 27
<212> DNA
<213> Artificial sequence
<400> 36
gccattactt cctcattctt tgcttcc 27
<210> 37
<211> 24
<212> DNA
<213> Artificial sequence
<400> 37
gcaagcaagt cctgaactgg tata 24
<210> 38
<211> 25
<212> DNA
<213> Artificial sequence
<400> 38
ggccctttag gccactttaa gacct 25
<210> 39
<211> 21
<212> DNA
<213> Artificial sequence
<400> 39
cacagcaagc aagtcctgaa c 21
<210> 40
<211> 23
<212> DNA
<213> Artificial sequence
<400> 40
ggccctttag gccactttaa gac 23
<210> 41
<211> 23
<212> DNA
<213> Artificial sequence
<400> 41
gcgagactcc atctcaaaaa cac 23
<210> 42
<211> 26
<212> DNA
<213> Artificial sequence
<400> 42
agtagaaaga gcatgaaacc tccatg 26
<210> 43
<211> 21
<212> DNA
<213> Artificial sequence
<400> 43
aacacaggct tgaggccaac c 21
<210> 44
<211> 22
<212> DNA
<213> Artificial sequence
<400> 44
gggactgcta atgcaaacag tg 22
<210> 45
<211> 23
<212> DNA
<213> Artificial sequence
<400> 45
aggtggcttc tacatggtgt tgg 23
<210> 46
<211> 23
<212> DNA
<213> Artificial sequence
<400> 46
ctctgttaga gcagtgcaga agg 23
<210> 47
<211> 24
<212> DNA
<213> Artificial sequence
<400> 47
attggcacat gggtggtatc tgaa 24
<210> 48
<211> 22
<212> DNA
<213> Artificial sequence
<400> 48
cacaggaatg gctaacagca gt 22
<210> 49
<211> 22
<212> DNA
<213> Artificial sequence
<400> 49
gagaaaaggc tgacttcccc tt 22
<210> 50
<211> 24
<212> DNA
<213> Artificial sequence
<400> 50
ctgcctgtta cacatgtgct agga 24
<210> 51
<211> 21
<212> DNA
<213> Artificial sequence
<400> 51
accatgccta gcctcagcat a 21
<210> 52
<211> 24
<212> DNA
<213> Artificial sequence
<400> 52
taacattggc catatgggtc tctg 24
<210> 53
<211> 23
<212> DNA
<213> Artificial sequence
<400> 53
tttctctctc aactggggct tga 23
<210> 54
<211> 24
<212> DNA
<213> Artificial sequence
<400> 54
tcagctgttc tccgtctact ctga 24
<210> 55
<211> 24
<212> DNA
<213> Artificial sequence
<400> 55
tattccaagc tgtggtggct acag 24
<210> 56
<211> 24
<212> DNA
<213> Artificial sequence
<400> 56
gttccaagcc ttggctattg gatg 24
<210> 57
<211> 25
<212> DNA
<213> Artificial sequence
<400> 57
tgagggaagg tatctgaagg tcaga 25
<210> 58
<211> 36
<212> DNA
<213> Artificial sequence
<400> 58
tacacatata tacagcatat atacacacat aaactg 36
<210> 59
<211> 24
<212> DNA
<213> Artificial sequence
<400> 59
catcccattt ctcaggtccc aaat 24
<210> 60
<211> 25
<212> DNA
<213> Artificial sequence
<400> 60
tcactgttct gccctctgta ttctg 25
<210> 61
<211> 26
<212> DNA
<213> Artificial sequence
<400> 61
gtttatttaa tccctggctc ccatgc 26
<210> 62
<211> 26
<212> DNA
<213> Artificial sequence
<400> 62
tctcactcca tctaaacaac gaaacc 26
<210> 63
<211> 22
<212> DNA
<213> Artificial sequence
<400> 63
ttgtgaggtc agtcctgagg aa 22
<210> 64
<211> 22
<212> DNA
<213> Artificial sequence
<400> 64
aatagggaac acctcctggg tt 22
<210> 65
<211> 22
<212> DNA
<213> Artificial sequence
<400> 65
aggatggaga gtctgtgctg tg 22
<210> 66
<211> 26
<212> DNA
<213> Artificial sequence
<400> 66
accctttcta ggtgtacagt tctgtg 26
<210> 67
<211> 25
<212> DNA
<213> Artificial sequence
<400> 67
ctcacaagcc ctttttatgg tgaca 25
<210> 68
<211> 24
<212> DNA
<213> Artificial sequence
<400> 68
cagtgttcag ggaacaggat atgc 24
<210> 69
<211> 23
<212> DNA
<213> Artificial sequence
<400> 69
gggagaaatt caagcccaat cca 23
<210> 70
<211> 27
<212> DNA
<213> Artificial sequence
<400> 70
ggcctagaag gaaaaatgat tttgttg 27
<210> 71
<211> 22
<212> DNA
<213> Artificial sequence
<400> 71
actcctctct catgcacagt ca 22
<210> 72
<211> 26
<212> DNA
<213> Artificial sequence
<400> 72
ttcctgtctt acttttccct ccacac 26
<210> 73
<211> 30
<212> DNA
<213> Artificial sequence
<400> 73
ccacacctgg ctaatttttc tattagtaga 30
<210> 74
<211> 25
<212> DNA
<213> Artificial sequence
<400> 74
ggcccatgag agcctttaat aatgg 25
<210> 75
<211> 23
<212> DNA
<213> Artificial sequence
<400> 75
accctcagct gcattctagc tac 23
<210> 76
<211> 22
<212> DNA
<213> Artificial sequence
<400> 76
actgggcccg tattactact ca 22
<210> 77
<211> 26
<212> DNA
<213> Artificial sequence
<400> 77
ctctgccttt tcaagaacaa ttctgt 26
<210> 78
<211> 29
<212> DNA
<213> Artificial sequence
<400> 78
actgtgattt atcaagttaa catcactct 29
<210> 79
<211> 23
<212> DNA
<213> Artificial sequence
<400> 79
tagcctggtc tggaaatgtg gtt 23
<210> 80
<211> 22
<212> DNA
<213> Artificial sequence
<400> 80
catgtgccat tacctgtgtg gt 22
<210> 81
<211> 32
<212> DNA
<213> Artificial sequence
<400> 81
ccttaggtag atgtatgtat gtactgagtg cc 32
<210> 82
<211> 32
<212> DNA
<213> Artificial sequence
<400> 82
cagtcattta catcaatgtg tggattaaat cc 32
<210> 83
<211> 27
<212> DNA
<213> Artificial sequence
<400> 83
actgtggtta tgttagagga atgcatg 27
<210> 84
<211> 26
<212> DNA
<213> Artificial sequence
<400> 84
aacaatcgtt gaagataggt gatgga 26
<210> 85
<211> 26
<212> DNA
<213> Artificial sequence
<400> 85
ggctttgatt ctttctacag cgattt 26
<210> 86
<211> 25
<212> DNA
<213> Artificial sequence
<400> 86
tgaaaaattc tctccactgg gagga 25
<210> 87
<211> 27
<212> DNA
<213> Artificial sequence
<400> 87
cagtgatgtg aatagtttca atgggtt 27
<210> 88
<211> 29
<212> DNA
<213> Artificial sequence
<400> 88
gtatgtgtga gcataaatta gtgtcctgt 29
<210> 89
<211> 34
<212> DNA
<213> Artificial sequence
<400> 89
ctaattttgt atttttagta gagatggggt ctca 34
<210> 90
<211> 35
<212> DNA
<213> Artificial sequence
<400> 90
acgttcttaa aacaacaata ttggtttata ggagt 35
<210> 91
<211> 22
<212> DNA
<213> Artificial sequence
<400> 91
ggggtgttat gtgtggtgtg tg 22
<210> 92
<211> 23
<212> DNA
<213> Artificial sequence
<400> 92
tgtgtggggg tatggatgtg gta 23
<210> 93
<211> 23
<212> DNA
<213> Artificial sequence
<400> 93
gaaaagcttg ggtggagtct cat 23
<210> 94
<211> 25
<212> DNA
<213> Artificial sequence
<400> 94
tggttgagca ttacaagcag tgtac 25
<210> 95
<211> 22
<212> DNA
<213> Artificial sequence
<400> 95
aaatctcatc cccctgctgt ac 22
<210> 96
<211> 22
<212> DNA
<213> Artificial sequence
<400> 96
tttccctgct atatccccag ca 22
<210> 97
<211> 22
<212> DNA
<213> Artificial sequence
<400> 97
aaatacagca gccaggggag at 22
<210> 98
<211> 23
<212> DNA
<213> Artificial sequence
<400> 98
aaggtcactg gaactcacac ttc 23
<210> 99
<211> 20
<212> DNA
<213> Artificial sequence
<400> 99
ggaccacagg tgtgcactgc 20
<210> 100
<211> 21
<212> DNA
<213> Artificial sequence
<400> 100
gagccattgt gtctggcctc a 21
<210> 101
<211> 24
<212> DNA
<213> Artificial sequence
<400> 101
ggctgctatg ctgagtagat gtag 24
<210> 102
<211> 25
<212> DNA
<213> Artificial sequence
<400> 102
ttgggaggaa gaactaagga cctag 25
<210> 103
<211> 26
<212> DNA
<213> Artificial sequence
<400> 103
ctggttaaga tgctaagacc cagtct 26
<210> 104
<211> 31
<212> DNA
<213> Artificial sequence
<400> 104
tctgatgatg caataaagtg ttattttcac a 31
<210> 105
<211> 23
<212> DNA
<213> Artificial sequence
<400> 105
taccatgcat caagtggcct tct 23
<210> 106
<211> 23
<212> DNA
<213> Artificial sequence
<400> 106
cagggccatg cagtaagatt tcc 23
<210> 107
<211> 21
<212> DNA
<213> Artificial sequence
<400> 107
caaagtaacg ctgggaccac t 21
<210> 108
<211> 23
<212> DNA
<213> Artificial sequence
<400> 108
caatcctcgc tctacaaccc cat 23
<210> 109
<211> 23
<212> DNA
<213> Artificial sequence
<400> 109
ctggtgaatt tccgtccaga caa 23
<210> 110
<211> 22
<212> DNA
<213> Artificial sequence
<400> 110
atgtcccctg gcttatgtcc ta 22
<210> 111
<211> 25
<212> DNA
<213> Artificial sequence
<400> 111
ccccccattt cttagtgtag atggc 25
<210> 112
<211> 24
<212> DNA
<213> Artificial sequence
<400> 112
gacagtcaca gagcatagag cact 24
<210> 113
<211> 22
<212> DNA
<213> Artificial sequence
<400> 113
accttgccct gtctatgtct tc 22
<210> 114
<211> 21
<212> DNA
<213> Artificial sequence
<400> 114
gcagagtcta agcatgccct t 21
<210> 115
<211> 26
<212> DNA
<213> Artificial sequence
<400> 115
gtcctgaatg gtattgccta ggtttt 26
<210> 116
<211> 25
<212> DNA
<213> Artificial sequence
<400> 116
ccgcttttca agagtgatgt ggtac 25
<210> 117
<211> 24
<212> DNA
<213> Artificial sequence
<400> 117
tggcttaaga gaaagcacac cttt 24
<210> 118
<211> 24
<212> DNA
<213> Artificial sequence
<400> 118
atctgttcct cattctgcac ctca 24
<210> 119
<211> 27
<212> DNA
<213> Artificial sequence
<400> 119
gtcctttgcc atttttctat tgacatg 27
<210> 120
<211> 30
<212> DNA
<213> Artificial sequence
<400> 120
tggcaataac gtaaaaatgt ttggtactag 30
<210> 121
<211> 25
<212> DNA
<213> Artificial sequence
<400> 121
cctcttcctc ataaatctgg ctgtc 25
<210> 122
<211> 25
<212> DNA
<213> Artificial sequence
<400> 122
aggtagcctc tttcttcagg aatcg 25
<210> 123
<211> 23
<212> DNA
<213> Artificial sequence
<400> 123
ttactgagga tggaggcagg acg 23
<210> 124
<211> 21
<212> DNA
<213> Artificial sequence
<400> 124
ctaccagtgt gccatgtcag t 21
<210> 125
<211> 21
<212> DNA
<213> Artificial sequence
<400> 125
agcggcagaa aatgagtgac a 21
<210> 126
<211> 22
<212> DNA
<213> Artificial sequence
<400> 126
tggagaattc ccagcatgga ac 22
<210> 127
<211> 21
<212> DNA
<213> Artificial sequence
<400> 127
aaaagtgcct gctgtgggaa t 21
<210> 128
<211> 22
<212> DNA
<213> Artificial sequence
<400> 128
tcatgctgat agcccatcac ac 22
<210> 129
<211> 23
<212> DNA
<213> Artificial sequence
<400> 129
gaaatgttga cgctagggac tga 23
<210> 130
<211> 24
<212> DNA
<213> Artificial sequence
<400> 130
gagatccaaa agacagaggc atgg 24
<210> 131
<211> 26
<212> DNA
<213> Artificial sequence
<400> 131
ccaacggtgt gtaagtattc ctcttt 26
<210> 132
<211> 25
<212> DNA
<213> Artificial sequence
<400> 132
gacctgacca aactctatga actga 25
<210> 133
<211> 26
<212> DNA
<213> Artificial sequence
<400> 133
gtaataggag cagcagtacg gatttt 26
<210> 134
<211> 21
<212> DNA
<213> Artificial sequence
<400> 134
gggattacag gtgctgccac a 21
<210> 135
<211> 23
<212> DNA
<213> Artificial sequence
<400> 135
caatgcctgt cctctctgca tct 23
<210> 136
<211> 22
<212> DNA
<213> Artificial sequence
<400> 136
tcaacgacgg gaaatacgtc ct 22
<210> 137
<211> 23
<212> DNA
<213> Artificial sequence
<400> 137
tgtccaatgc ctgtcctctc tgc 23
<210> 138
<211> 23
<212> DNA
<213> Artificial sequence
<400> 138
tcactcaacg acgggaaata cgt 23
<210> 139
<211> 24
<212> DNA
<213> Artificial sequence
<400> 139
ggagggaggg atagcattag gaaa 24
<210> 140
<211> 34
<212> DNA
<213> Artificial sequence
<400> 140
aaaacatctg ataattgaat taaaagatga gcaa 34
<210> 141
<211> 23
<212> DNA
<213> Artificial sequence
<400> 141
tcatatggtg cagctgctat gga 23
<210> 142
<211> 23
<212> DNA
<213> Artificial sequence
<400> 142
ccaaatggtg aaaccccgtc tct 23
<210> 143
<211> 23
<212> DNA
<213> Artificial sequence
<400> 143
ctgcctagca aaaggtcaat gct 23
<210> 144
<211> 24
<212> DNA
<213> Artificial sequence
<400> 144
gtgggtttga cagttggatg cttg 24

Claims (10)

1. Use of a human chromosomal InDel locus in any one of the following a) -b):
a) individual identification;
b) performing paternity test;
the InDel locus comprises: rs56113354, rs66493016, rs34947838, rs10562732, rs10555133, rs3028599, rs34105738, rs10541877, rs3030108, rs10535391, rs142380122, rs5849187, rs59715790, rs139940865, rs35052276, rs10562881, rs59027185, rs35908330, rs70954609, rs147779273, rs35829610, rs60570384, rs 146109109109101, rs151142037, rs59258861, rs4053254, rs145041883, rs 12156094, rs 7430767, rs 36363676043, rs34812419, rs144650377, rs 338383838328, rs 914009127, rs 2737701, rs 444593, rs56110100, rs 6656375635799756974, rs 34785635563579975633794, rs 343556355633799756335634798, rs 335634563456345794, rs 3356345794, rs 335634563456345794, rs 33563456345633563356335633563356335633563356339753, rs 34563456345634563456345634975634975634975634975634975634569, rs 335634563457401, rs.
2. The use of claim 1, wherein the InDel locus comprises: rs56113354, rs66493016, rs34947838, rs10562732, rs10555133, rs3028599, rs34105738, rs10541877, rs3030108, rs10535391, rs142380122, rs5849187, rs59715790, rs139940865, rs35052276, rs10562881, rs59027185, rs35908330, rs70954609, rs147779273, rs35829610, rs60570384, rs 146109109109101, rs151142037, rs59258861, rs4053254, rs145041883, rs 12156094, rs 7430767, rs 363636043, rs34812419, rs144650377, rs 33838383839, rs 9140027, rs 27701, rs 444593, rs56110100, rs 6656357997567997567841401, rs 34799756335633794, rs 3456355635563379975634794, rs 3456355635563356335633563356349794, rs 3456345634563456345634563456345794, rs 335634563456345633563356335633563356349794, rs 34563456345634563456345634563456345634563456345634563456345634563456345633563497569, rs 3356345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 335634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 3356345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 33563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 3356345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 335634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456.
3. A primer set for amplifying the InDel locus according to claim 1, wherein the sequence of the primer set is shown as SEQ ID No.1 to SEQ ID No. 144.
4. A reagent comprising the primer set according to claim 3.
5. A kit comprising the primer set according to claim 3 or the reagent according to claim 4.
6. The kit of claim 5, further comprising a reaction buffer and a hot start Taq enzyme.
7. Use of the primer set of claim 3, the reagent of claim 4, or the kit of claim 5 or 6 for individual identification or paternity testing.
8. An individual identification method for a material to be tested is characterized by comprising the following steps:
a. extracting DNA of a material to be detected;
b. obtaining the typing result of the DNA InDel gene locus;
c. performing individual identification according to the genotype of the locus;
the InDel locus comprises: rs56113354, rs66493016, rs34947838, rs10562732, rs10555133, rs3028599, rs34105738, rs10541877, rs3030108, rs10535391, rs142380122, rs5849187, rs59715790, rs139940865, rs35052276, rs10562881, rs59027185, rs35908330, rs70954609, rs147779273, rs35829610, rs60570384, rs 146109109109101, rs151142037, rs59258861, rs4053254, rs145041883, rs 12156094, rs 7430767, rs 363636043, rs34812419, rs144650377, rs 33838383839, rs 9140027, rs 27701, rs 444593, rs56110100, rs 6656357997567997567841401, rs 34799756335633794, rs 3456355635563379975634794, rs 3456355635563356335633563356349794, rs 3456345634563456345634563456345794, rs 335634563456345633563356335633563356349794, rs 34563456345634563456345634563456345634563456345634563456345634563456345633563497569, rs 3356345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 335634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 3356345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 33563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 3356345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456300, rs 335634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456345634563456.
9. The individual identification method according to claim 8, wherein the b-step further comprises the step of amplifying the InDel locus using a primer set.
10. The individual identification method according to claim 9, wherein the sequences of the primer sets are shown in SEQ ID No.1 to SEQ ID No. 144.
CN202210219948.0A 2022-03-08 2022-03-08 Application of human chromosome InDel gene locus, primer group, product thereof and individual identification method of test material Active CN114574595B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210219948.0A CN114574595B (en) 2022-03-08 2022-03-08 Application of human chromosome InDel gene locus, primer group, product thereof and individual identification method of test material

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210219948.0A CN114574595B (en) 2022-03-08 2022-03-08 Application of human chromosome InDel gene locus, primer group, product thereof and individual identification method of test material

Publications (2)

Publication Number Publication Date
CN114574595A true CN114574595A (en) 2022-06-03
CN114574595B CN114574595B (en) 2022-12-02

Family

ID=81773753

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210219948.0A Active CN114574595B (en) 2022-03-08 2022-03-08 Application of human chromosome InDel gene locus, primer group, product thereof and individual identification method of test material

Country Status (1)

Country Link
CN (1) CN114574595B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115851973A (en) * 2022-09-30 2023-03-28 中国人民解放军军事科学院军事医学研究院 Method and kit for rapidly detecting human InDel genetic polymorphism by real-time fluorescent PCR (polymerase chain reaction) and application

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060228721A1 (en) * 2005-04-12 2006-10-12 Leamon John H Methods for determining sequence variants using ultra-deep sequencing
WO2017185758A1 (en) * 2016-04-27 2017-11-02 上海荻硕贝肯生物科技有限公司 Primer, probe, kit, and method for microchimerism assay and individual recognition
CN108517363A (en) * 2018-03-08 2018-09-11 深圳华大法医科技有限公司 A kind of individual identification system, kit and application thereof based on the sequencing of two generations
CN109762909A (en) * 2018-12-09 2019-05-17 朱波峰 A kind of 44 site InDels composite amplification detection kits for sample medical jurisprudence individual appreciation of degrading
CN111748637A (en) * 2020-07-23 2020-10-09 中国人民解放军军事科学院军事医学研究院 SNP molecular marker combination, multiplex composite amplification primer set, kit and method for genetic relationship analysis and identification
CN113584179A (en) * 2021-04-14 2021-11-02 南方医科大学 Six-color fluorescence labeling detection system for degrading 64 loci on human autosome and sex chromosome of test material typing
CN113881760A (en) * 2021-09-17 2022-01-04 朱波峰 Composite amplification system for detecting four types of 129 molecular genetic markers based on NGS technology
CN113981048A (en) * 2021-08-30 2022-01-28 司法鉴定科学研究院 Primer composition, kit and method for detecting micro-haplotype locus based on next-generation sequencing technology and application thereof

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060228721A1 (en) * 2005-04-12 2006-10-12 Leamon John H Methods for determining sequence variants using ultra-deep sequencing
WO2017185758A1 (en) * 2016-04-27 2017-11-02 上海荻硕贝肯生物科技有限公司 Primer, probe, kit, and method for microchimerism assay and individual recognition
CN108517363A (en) * 2018-03-08 2018-09-11 深圳华大法医科技有限公司 A kind of individual identification system, kit and application thereof based on the sequencing of two generations
CN109762909A (en) * 2018-12-09 2019-05-17 朱波峰 A kind of 44 site InDels composite amplification detection kits for sample medical jurisprudence individual appreciation of degrading
CN111748637A (en) * 2020-07-23 2020-10-09 中国人民解放军军事科学院军事医学研究院 SNP molecular marker combination, multiplex composite amplification primer set, kit and method for genetic relationship analysis and identification
CN113584179A (en) * 2021-04-14 2021-11-02 南方医科大学 Six-color fluorescence labeling detection system for degrading 64 loci on human autosome and sex chromosome of test material typing
CN113981048A (en) * 2021-08-30 2022-01-28 司法鉴定科学研究院 Primer composition, kit and method for detecting micro-haplotype locus based on next-generation sequencing technology and application thereof
CN113881760A (en) * 2021-09-17 2022-01-04 朱波峰 Composite amplification system for detecting four types of 129 molecular genetic markers based on NGS technology

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
MENGGEWANG等: "Molecular genetic survey and forensic characterization of Chinese Mongolians via the 47 autosomal insertion/deletion marker", 《GENOMICS》 *
张星茹等: "中国柯尔克孜族35个InDel位点的遗传多态性分析和群体遗传学研究", 《生物医学转化》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115851973A (en) * 2022-09-30 2023-03-28 中国人民解放军军事科学院军事医学研究院 Method and kit for rapidly detecting human InDel genetic polymorphism by real-time fluorescent PCR (polymerase chain reaction) and application
CN115851973B (en) * 2022-09-30 2023-10-20 中国人民解放军军事科学院军事医学研究院 Method, kit and application for rapidly detecting human InDel genetic polymorphism by real-time fluorescence PCR

Also Published As

Publication number Publication date
CN114574595B (en) 2022-12-02

Similar Documents

Publication Publication Date Title
RU2752700C2 (en) Methods and compositions for dna profiling
KR100943525B1 (en) System for Biomarker Screening
WO2011091046A1 (en) Identification of polymorphic sequences in mixtures of genomic dna by whole genome sequencing
US20210262034A1 (en) Methods for identifying and using small rna predictors
KR101667526B1 (en) Method for Extended Autosomal STR Analysing Human Subject of Analytes using a Next Generation Sequencing Technology
KR101157526B1 (en) Snp for diagnosing adhd, microarray and kit comprising the same, and method of diagnosing adhd using thereof
CN109609648B (en) Liver cancer-related lncRNA marker and detection primer and application thereof
CN105936940B (en) Nucleic acid sequence for detecting deafness gene and application thereof
EP3314026A1 (en) Single nucleotide polymorphism inhla-b*15:02
CN114574595B (en) Application of human chromosome InDel gene locus, primer group, product thereof and individual identification method of test material
CA2957267C (en) Method and device for comparative analysis of mirna expression level
EP2721171A1 (en) Discrimination of blood type variants
CN112592981B (en) Primer group, kit and method for DNA archive construction
KR101761801B1 (en) Composition for determining nose phenotype
CN110734982A (en) High-throughput sequencing technology-based linkage autosomal STR typing system and kit
EP3315613B1 (en) Methods and kits for diagnosing or assessing the risk of cervical cancer
EP3225689B1 (en) Method and device for correcting level of expression of small rna
US20100297622A1 (en) Method for high-throughput gene expression profile analysis
CN108315435B (en) SNP molecular marker related to sheep lambing number trait and application thereof
CN110938681A (en) Allele nucleic acid enrichment and detection method
WO2019147764A1 (en) Small rna predictor guided therapeutics
CN114085895B (en) Detection primer for rapidly detecting MSI and kit thereof
CN115725745B (en) SNP molecular marker related to sheep multi-thoracic vertebrae and amplification primer set and application
CN108251531B (en) Application of ENSG00000267549 in judging osteosarcoma metastasis
CN117448450A (en) Marker for colorectal cancer screening, probe composition and application thereof

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant