CN108491691B - Genetic relationship identification method and terminal equipment - Google Patents

Genetic relationship identification method and terminal equipment Download PDF

Info

Publication number
CN108491691B
CN108491691B CN201810247761.5A CN201810247761A CN108491691B CN 108491691 B CN108491691 B CN 108491691B CN 201810247761 A CN201810247761 A CN 201810247761A CN 108491691 B CN108491691 B CN 108491691B
Authority
CN
China
Prior art keywords
allele
genetic relationship
ancestor
controlled
frequency
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810247761.5A
Other languages
Chinese (zh)
Other versions
CN108491691A (en
Inventor
李淑瑾
马冠车
丛斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hebei Medical University
Original Assignee
Hebei Medical University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hebei Medical University filed Critical Hebei Medical University
Priority to CN201810247761.5A priority Critical patent/CN108491691B/en
Publication of CN108491691A publication Critical patent/CN108491691A/en
Application granted granted Critical
Publication of CN108491691B publication Critical patent/CN108491691B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding

Landscapes

  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Medical Informatics (AREA)
  • Biophysics (AREA)
  • Software Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Epidemiology (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Public Health (AREA)
  • Bioethics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention is suitable for the technical field of forensic physical evidence, and provides a genetic relationship identification method and terminal equipment, wherein the method comprises the following steps: acquiring a first allele and a second allele of a person to be identified, and determining a first gene frequency of the first allele and a second gene frequency of the second allele; acquiring a first occurrence number of the first allele in a controlled ancestor and a second occurrence number of the second allele in the controlled ancestor; acquiring the genetic relationship hierarchy of the person to be authenticated and the controlled ancestor; and determining the first genetic relationship between the person to be identified and the controlled ancestor according to the genetic relationship hierarchy, the first gene frequency, the second gene frequency, the first occurrence number and the second occurrence number. The invention can simplify the identification process.

Description

Genetic relationship identification method and terminal equipment
Technical Field
The invention belongs to the technical field of forensic physical evidence, and particularly relates to a genetic relationship identification method and terminal equipment.
Background
Genetic relationship identification (kinship testing) refers to identification of whether a certain specific genetic relationship exists between two individuals according to genetic rules by detecting and typing various genetic markers, and is one of the main tasks of forensic physical evidence.
According to the identification purpose, the genetic relationship identification can be divided into two types, namely direct genetic relationship identification and collateral genetic relationship identification, wherein the former type comprises relationship identification such as father, grandfather, grandgrandgrandfather and the like, and the most common type in the work is paternity identification, especially father-son relationship identification; the latter is related to appraisal such as tertiary nephew, sibling/half-sibling siblings, cousin and the like. At present, genetic relationship identification needs to consider the combination condition of identifying individual genotypes, and the identification process is complex.
Disclosure of Invention
In view of this, embodiments of the present invention provide a genetic relationship identification method and a terminal device, so as to solve the technical problem in the prior art that a genetic relationship identification process is complex.
The embodiment of the invention provides a genetic relationship identification method, which comprises the following steps:
acquiring a first allele and a second allele of a person to be identified, and determining a first gene frequency of the first allele and a second gene frequency of the second allele;
obtaining a first number of occurrences of the first allele in the genotype of the controlled ancestor and a second number of occurrences of the second allele in the genotype of the controlled ancestor;
acquiring the genetic relationship hierarchy of the person to be authenticated and the controlled ancestor;
determining a first genetic relationship between the person to be identified and the controlled ancestor according to the genetic relationship hierarchy, the first gene frequency, the second gene frequency, the first occurrence number and the second occurrence number; or
Obtaining a third number of occurrences of the first allele in the genotype of the reference parent and a fourth number of occurrences of the second allele in the genotype of the reference parent; and judging the second genetic relationship between the person to be identified and the controlled ancestor according to the genetic relationship hierarchy, the first gene frequency, the second gene frequency, the first occurrence number, the second occurrence number, the third occurrence number and the fourth occurrence number.
Optionally, the determining the first genetic relationship between the person to be authenticated and the controlled ancestor according to the genetic relationship hierarchy, the first gene frequency, the second gene frequency, the first occurrence number, and the second occurrence number includes:
according to the expression
Figure BDA0001607022820000021
Determining a first genetic relationship index of the person to be authenticated and the controlled ancestor, and judging the first genetic relationship of the person to be authenticated and the controlled ancestor according to the first genetic relationship index;
wherein KIDIs a first genetic relationship index, piIs the first allele frequency, pjIs the second allele frequency, diFor the first number of occurrences, djAnd G is the genetic relationship hierarchy.
Optionally, the determining a second genetic relationship between the person to be identified and the controlled ancestor according to the genetic relationship hierarchy, the first genetic frequency, the second genetic frequency, the first occurrence number, the second occurrence number, the third occurrence number, and the fourth occurrence number includes:
according to the expression
Figure BDA0001607022820000022
Judging a second genetic relationship index of the person to be authenticated and the controlled ancestor, and judging a second genetic relationship of the person to be authenticated and the controlled ancestor according to the second genetic relationship index;
wherein KITIs a second genetic relationship index, piIs the first allele frequency, pjIs the second allele frequency, diFor the first number of occurrences, djIs the second occurrence number, d'iIs the third occurrence number, d'jAnd G is the genetic relationship hierarchy.
Optionally, the obtaining a first allele and a second allele of a person to be identified, and determining a first gene frequency of the first allele and a second gene frequency of the second allele, includes:
testing the genotype of the preset locus of the person to be identified by using a preset kit to obtain a first genotype;
determining a first gene frequency of the first allele and a second gene frequency of the second allele from the first genotype.
Optionally, the obtaining a first number of occurrences of the first allele in a controlled ancestor and a second number of occurrences of the second allele in the controlled ancestor comprises:
testing the genotype of the preset locus of the controlled ancestor by using a preset kit to obtain a second genotype;
determining a first number of occurrences of the first allele and a second number of occurrences of the second allele based on the second genotype.
Optionally, the obtaining a third number of occurrences of the first allele on the reference parent and a fourth number of occurrences of the second allele on the reference parent comprises:
testing the genotype of the preset site of the reference parent by using a preset kit to obtain a third genotype;
determining a third number of occurrences of the first allele and a fourth number of occurrences of the second allele from the third genotype.
A second aspect of an embodiment of the present invention provides an affinity identification apparatus, including:
the first acquisition unit is used for acquiring a first allele and a second allele of a person to be identified and determining a first gene frequency of the first allele and a second gene frequency of the second allele;
a second obtaining unit for obtaining a first number of occurrences of the first allele in the genotype of the controlled ancestor and a second number of occurrences of the second allele in the genotype of the controlled ancestor;
a third obtaining unit, configured to obtain a genetic relationship hierarchy between the person to be authenticated and the controlled ancestor;
a determining unit, configured to determine a first genetic relationship between the person to be authenticated and the controlled ancestor according to the genetic relationship hierarchy, the first genetic frequency, the second genetic frequency, the first occurrence number, and the second occurrence number; or
Further comprising:
a fourth acquisition unit configured to acquire a third number of occurrences of the first allele in the genotype of the reference parent and a fourth number of occurrences of the second allele in the genotype of the reference parent;
the determining unit is configured to acquire a third number of occurrences of the first allele in the genotype of the reference parent and a fourth number of occurrences of the second allele in the genotype of the reference parent.
Optionally, the determining unit is specifically configured to perform the following operations according to an expression
Figure BDA0001607022820000041
Determining a first genetic relationship index of the person to be authenticated and the controlled ancestor, and judging the first genetic relationship of the person to be authenticated and the controlled ancestor according to the first genetic relationship index;
wherein KIDIs a first genetic relationship index, piIs the first allele frequency, pjIs the second allele frequency, diFor the first number of occurrences, djThe second occurrence number, G is the level of the genetic relationship; or
According to the expression
Figure BDA0001607022820000042
Determining a second genetic relationship index of the person to be authenticated and the controlled ancestor, and judging the second genetic relationship of the person to be authenticated and the controlled ancestor according to the second genetic relationship index;
wherein KITIs a second genetic relationship index, piIs the first allele frequency, pjIs the second allele frequency, diFor the first number of occurrences, djIs the second occurrence number, d'iIs the third occurrence number, d'jAnd G is the genetic relationship hierarchy.
A third aspect of the embodiments of the present invention provides a genetic relationship identification terminal device, including a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor executes the computer program to implement the steps of the method according to the first aspect of the embodiments of the present invention.
A fourth aspect of embodiments of the present invention provides a computer-readable storage medium, in which a computer program is stored, which, when executed by a processor, performs the steps of the method according to the first aspect of embodiments of the present invention.
Compared with the prior art, the embodiment of the invention has the following beneficial effects: according to the embodiment of the invention, the first genetic frequency of the first allele of the person to be identified, the second genetic frequency of the second allele, the first occurrence frequency of the first allele in the genotype of the controlled ancestor, the second occurrence frequency of the second allele in the genotype of the controlled ancestor and the genetic relationship hierarchy of the person to be identified and the controlled ancestor are obtained, and the first genetic relationship between the person to be identified and the controlled ancestor is judged according to the genetic relationship hierarchy, the first genetic frequency, the second genetic frequency, the first occurrence frequency and the second occurrence frequency.
The embodiment of the invention obtains the first gene frequency of the first allele of the person to be identified, the second gene frequency of the second allele, the first appearance frequency of the first allele in the genotype of the controlled ancestor, the second appearance frequency of the second allele in the genotype of the controlled ancestor, the genetic relationship hierarchy of the person to be identified and the controlled ancestor, the third appearance frequency of the first allele in the genotype of the reference parent and the fourth appearance frequency of the second allele in the genotype of the reference parent; and judging the second genetic relationship between the person to be identified and the controlled ancestor according to the genetic relationship hierarchy, the first gene frequency, the second gene frequency, the first occurrence frequency, the second occurrence frequency, the third occurrence frequency and the fourth occurrence frequency.
The embodiment of the invention does not need to consider the combination condition of detecting the individual genotypes, thereby simplifying the identification process.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed to be used in the embodiments or the prior art descriptions will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without inventive exercise.
Fig. 1 is a schematic flow chart illustrating an implementation of a genetic relationship identification method according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of a database according to an embodiment of the present invention;
fig. 3 is a schematic flow chart illustrating an implementation of the genetic relationship identification method according to the second embodiment of the present invention;
FIG. 4 is a schematic diagram of an apparatus for genetic relationship identification according to a third embodiment of the present invention;
FIG. 5 is a schematic diagram of an apparatus for genetic relationship identification according to a third embodiment of the present invention;
fig. 6 is a schematic diagram of an affinity identification terminal device according to a fourth embodiment of the present invention.
Detailed Description
In the following description, for purposes of explanation and not limitation, specific details are set forth, such as particular system structures, techniques, etc. in order to provide a thorough understanding of the embodiments of the invention. It will be apparent, however, to one skilled in the art that the present invention may be practiced in other embodiments that depart from these specific details. In other instances, detailed descriptions of well-known systems, devices, circuits, and methods are omitted so as not to obscure the description of the present invention with unnecessary detail.
In order to explain the technical means of the present invention, the following description will be given by way of specific examples.
In the embodiment of the invention, the genetic relationship between two individuals assumed in the genetic relationship identification is called controlled genetic relationship; of the two individuals whose relationship needs to be determined, the individual at the descendant position of the genetic pedigree under the controlled relationship is called the person to be identified, and the other individual is called the controlled ancestor; if a biological parent (father or mother) of said person to be authenticated participates in the authentication, this parent is called the reference parent. When two individuals needing to determine the relationship are in the same ancestor of the genetic pedigree under the controlled genetic relationship, when a reference parent participates in identification, the filial generation of the reference parent is called as a person to be identified, and the other individual is called as a controlled ancestor; when the parent is not referred to for identification, one individual is randomly determined to be the person to be identified and the other individual is the controlled ancestor.
Example one
Referring to fig. 1, an embodiment of the present invention provides a genetic relationship identification method, which is a genetic relationship identification method involving no reference parent, including the following steps,
step S101, a first allele and a second allele of a person to be identified are obtained, and a first gene frequency of the first allele and a second gene frequency of the second allele are determined.
Optionally, the specific implementation manner of step S101 is:
testing the genotype of the preset locus of the person to be identified by using a preset kit to obtain a first genotype;
determining a first gene frequency of the first allele and a second gene frequency of the second allele from the first genotype.
In the embodiment of the invention, the preset kit is used for testing the genotype of the preset locus of the person to be identified to obtain the first genotype, the first genotype comprises the first allele and the second allele, and then the first gene frequency and the second gene frequency are searched in the database according to the name of the first allele and the name of the second allele. The data in the database includes the type of kit, the locus corresponding to the type of kit, the allele corresponding to the locus, and the gene frequency of each allele. For example, the genotype of the human to be identified at site D19S433 was tested using the kit numbered 20A, and the first genotype measured was (i, j), where i is the first allele and j is the second allele, and then the first gene frequency and the second gene frequency were obtained by searching the database for the kit numbered 20A, site D19S433, the gene frequency of the first allele i and the gene frequency of the second allele j. Preferably, the data in the database is stored in an EXCELL table, and the first gene frequency and the second gene frequency are searched by an EXCELL function. For example, the data in the database is as shown in FIG. 2, it being understood that FIG. 2 lists only a portion of the data of the database.
Step S102, acquiring a first occurrence number of the first allele in the genotype of the controlled ancestor and a second occurrence number of the second allele in the genotype of the controlled ancestor.
Optionally, the specific implementation manner of step S102 is:
testing the genotype of the preset locus of the controlled ancestor by using a preset kit to obtain a second genotype;
determining a first number of occurrences of the first allele and a second number of occurrences of the second allele based on the second genotype.
In the embodiment of the invention, the kit and the test site for testing the controlled ancestor are the same as those for testing the person to be identified, for example, the kit with the number of 20A is also used for testing the genotype of the controlled ancestor site D19S433, the second genotype is obtained, and the first appearance frequency of the first allele and the second appearance frequency of the second allele are obtained from the second genotype. For example, when the first genotype is (i, j) and the second genotype is (i, k), where i, j, and k are different from each other, the first frequency of occurrence is 1 and the second frequency of occurrence is 0.
And step S103, acquiring the genetic relationship hierarchy of the person to be authenticated and the controlled ancestor.
In the embodiment of the present invention, the controlled relationships include direct relationships or collateral relationships satisfying the following conditions: the first is that the nearest common ancestor of the person to be identified and the controlled ancestor is a person or a couple, and there is no relationship between the couples, and the second is that the common ancestor of the person to be identified and the controlled ancestor is in the same ancestor of the person to be identified.
In the genetic relationship identification satisfying the above conditions, the method of calculating the hierarchy of the genetic relationship is as follows:
the genetic algebra from the nearest common ancestor (the controlled ancestor in the direct relationship) to the controlled ancestor is GFirst of all(in direct relationship, GFirst of all0) to the person to be identified is GSecond stepIf the number of common ancestors is a (1 or 2), the relationship hierarchy G is G ═ GFirst of all+GSecond step+1-a。
For example, in an ancestral relationship, the nearest common ancestor of two volumes is the controlled ancestor, GFirst of all=0,G Second step2, a 1, GFirst of all+GSecond step+1-a ═ 2. In the relationship of tertiary nephew, the nearest common ancestor of the two bodies is the grandparent of the person to be authenticated, i.e. the controlled ancestorParents of the ancestor GFirst of all=1,G Second step2, a 2, G ═ GFirst of all+GSecond step+1-a ═ 2. In the half-sibling relationship, the nearest common ancestor of the two bodies is the parent common to both, GFirst of all=1,GSecond step=1,a=1,G=GFirst of all+GSecond step+1-a=2。
And step S104, judging the first genetic relationship between the person to be identified and the controlled ancestor according to the genetic relationship hierarchy, the first gene frequency, the second gene frequency, the first occurrence frequency and the second occurrence frequency.
In the embodiment of the invention, a first genetic relationship index of the person to be authenticated and the controlled ancestor is determined according to the genetic relationship hierarchy, the first gene frequency, the second gene frequency, the first occurrence frequency and the second occurrence frequency, wherein the first genetic relationship index is a genetic relationship index in the genetic relationship authentication without participation of the reference parent, and the reference parent is a biological father or a biological mother of the person to be authenticated. The calculation process is as follows: firstly, determining the sum of the ratio of the first occurrence frequency to the first allele frequency and the ratio of the second occurrence frequency to the second allele frequency, and then determining a first genetic relationship index according to the genetic relationship hierarchy.
Specifically, a first genetic relationship index of the person to be authenticated and the controlled ancestor is determined according to a formula (1), and the first genetic relationship of the person to be authenticated and the controlled ancestor is determined according to the first genetic relationship index. The formula (1) is:
Figure BDA0001607022820000081
wherein KIDIs a first genetic relationship index, piIs the first allele frequency, pjIs the second allele frequency, diFor the first number of occurrences, djAnd G is the genetic relationship hierarchy.
And after the first genetic relationship index is determined, carrying out genetic relationship judgment according to the relevant technical specification.
In the identification of actual relationshipsGenerally, a plurality of kits are used to test a plurality of different sites of individual inheritance, thereby determining a plurality of first genetic relationship indexes, and a first total genetic relationship index CKI is obtained by calculating the product of the plurality of genetic relationship indexes1According to CKI1The genetic relationship determination is performed in conjunction with relevant technical specifications and standards, such as, for example, paternity testing technical specifications and standards.
The method comprises the steps of obtaining a first gene frequency of a first allele of a person to be identified, a second gene frequency of a second allele, a first occurrence frequency of the first allele in a controlled ancestor, a second occurrence frequency of the second allele in the controlled ancestor and an affinity hierarchy of the person to be identified and the controlled ancestor, and judging the first affinity of the person to be identified and the controlled ancestor according to the affinity hierarchy, the first gene frequency, the second gene frequency, the first occurrence frequency and the second occurrence frequency without considering the combination condition of detected individual genotypes, so that the identification process is simplified.
Example two
Referring to fig. 2, a second embodiment of the present invention provides a genetic relationship identification method in which a reference parent participates, the method including the steps of:
step S201, a first allele and a second allele of a person to be identified are obtained, and a first gene frequency of the first allele and a second gene frequency of the second allele are determined.
Step S202, obtaining the first occurrence number of the first allele in the genotype of the controlled ancestor and the second occurrence number of the second allele in the genotype of the controlled ancestor.
And step S203, acquiring the genetic relationship hierarchy of the person to be authenticated and the controlled ancestor.
Step S204, acquiring the third occurrence number of the first allele in the genotype of the reference parent and the fourth occurrence number of the second allele in the genotype of the reference parent.
Step S205, determining the second genetic relationship between the person to be identified and the controlled ancestor according to the genetic relationship hierarchy, the first gene frequency, the second gene frequency, the first occurrence number, the second occurrence number, the third occurrence number and the fourth occurrence number.
In the embodiment of the present invention, the implementation method of step S201 to step S203 is the same as the implementation method of step S101 to step S103 described in the first embodiment of the present invention, and is not described herein again.
In the embodiment of the invention, a second genetic relationship index of the person to be identified and the controlled ancestor is determined according to the genetic relationship hierarchy, the first gene frequency, the second gene frequency, the first occurrence number, the second occurrence number, the third occurrence number and the fourth occurrence number, wherein the second genetic relationship index is a genetic relationship index in the genetic relationship identification with the reference parent. The specific calculation process is as follows: firstly, determining the product of the fourth occurrence frequency and the first occurrence frequency as a first component, determining the product of the third occurrence frequency and the second occurrence frequency as a second component, determining the product of the fourth occurrence frequency and the first gene frequency as a third component, determining the product of the third occurrence frequency and the second gene frequency as a fourth component, then determining the sum of the first component and the second component to be divided by the sum of the third component and the fourth component, and determining a second genetic relationship according to the legacy passages.
Further, the specific implementation method of step S205 is as follows: and (3) determining a second genetic relationship index of the person to be authenticated and the controlled ancestor according to a formula (2), and judging the second genetic relationship of the person to be authenticated and the controlled ancestor according to the second genetic relationship index. The formula (2) is:
Figure BDA0001607022820000101
wherein KITIs a second genetic relationship index, piIs the first allele frequency, pjIs the second allele frequency, diFor the first number of occurrences, djIs the second occurrence number,d'iIs the third occurrence number, d'jAnd G is the genetic relationship hierarchy.
Further, the specific implementation method of step S204 is as follows:
testing the genotype of the preset site of the reference parent by using a preset kit to obtain a third genotype;
determining a third number of occurrences of the first allele and a fourth number of occurrences of the second allele from the third genotype.
In the embodiments of the present invention, the test reference uses the same kit and test site as those used for testing the person to be identified and the controlled ancestor. The method for obtaining the third occurrence number and the fourth occurrence number is the same as the method for obtaining the first occurrence number and the second occurrence number, and is not described herein again.
And after the second genetic relationship index is determined, performing genetic relationship judgment according to the related technical specification.
In the actual genetic relationship identification, a plurality of kits are generally used to test a plurality of different sites of individual inheritance, so as to determine a plurality of second genetic relationship indexes, and a second total genetic relationship index CKI is obtained by calculating the product of the plurality of second genetic relationship indexes2According to CKI2The genetic relationship determination is performed in conjunction with related specifications, such as, for example, paternity testing specifications and standards.
The embodiment of the invention obtains the third appearance frequency of the first allele in the genotype of the reference parent and the fourth appearance frequency of the second allele in the genotype of the reference parent by obtaining the first gene frequency of the first allele of the person to be identified, the second gene frequency of the second allele, the first appearance frequency of the first allele in the controlled ancestor, the second appearance frequency of the second allele in the controlled ancestor and the level of the genetic relationship between the person to be identified and the controlled ancestor; and judging the second genetic relationship between the person to be identified and the controlled ancestor according to the genetic relationship hierarchy, the first gene frequency, the second gene frequency, the first occurrence frequency, the second occurrence frequency, the third occurrence frequency and the fourth occurrence frequency.
In the embodiment of the invention, when the controlled ancestor is a suspicious father or a suspicious mother of the person to be identified, the genetic relationship is the paternity relationship, and the genetic relationship index is the paternity index. Determining a first paternity of the person to be identified and the controlled ancestor according to the first gene frequency, the second gene frequency, the first occurrence number and the second occurrence number, and determining a second paternity of the person to be identified and the controlled ancestor according to the first gene frequency, the second gene frequency, the first occurrence number, the second occurrence number, the third occurrence number and the fourth occurrence number.
Specifically, a first paternity index of the person to be authenticated and the controlled ancestor is determined according to a formula (3), wherein the formula (3) is as follows:
Figure BDA0001607022820000111
wherein p isiIs the first allele frequency, pjIs the second allele frequency, diFor the first number of occurrences, djThe second number of occurrences.
Determining a second affinity index of the person to be authenticated with the controlled ancestor according to formula (4), wherein formula (4) is:
Figure BDA0001607022820000112
wherein p isiIs the first allele frequency, pjIs the second allele frequency, diFor the first number of occurrences, djIs the second occurrence number, d'iIs the third occurrence number, d'jThe fourth number of occurrences.
In embodiments of the invention, the first paternity index is that in a diad paternity test, i.e. no reference parent participates, and the second paternity index is that in a tripled paternity test, i.e. with a reference parent participates.
In the embodiment of the invention, the genetic relationship hierarchy, the first gene frequency, the second gene frequency, the first occurrence frequency, the second occurrence frequency, the third occurrence frequency and the fourth occurrence frequency are all stored in an EXCELL table, and the first genetic relationship index, the second genetic relationship index, the first paternity index and the second paternity index are calculated through an EXCELL function, so that the rapid calculation can be realized, and the method is simple and convenient.
The methods described in examples one and two are only applicable to the exponential identification of relationships that meet the following conditions:
(1) the common ancestor of the two individuals being identified is a person or a couple;
(2) the common ancestor cannot exist in both the paternal and maternal lines of two individuals at the same time, i.e., the Full Sibling Index (FSI) cannot be calculated using the present method.
In the embodiment of the present invention, the derivation process of equations (1) to (4) is as follows:
when the controlled genetic relationship is a direct genetic relationship, the derivation process is as follows:
the nomenclature of the individuals involved in the identification is as follows: individuals downstream of the genetic lineage are called children, i.e. persons to be identified, and are labeled c (child); individuals upstream of the genetic lineage are designated as controlled ancestors and are labeled aa (allegedalstor), the reference parent can be the biological mother or biological father of the child, and the reference parent is labeled M. In the autosomal genetic marker, genetic information of both the paternal line and the maternal line follows the same regular inheritance, so that the calculation method of the corresponding genetic relationship index is the same in the direct genetic relationship identification, regardless of whether AA is in the paternal line or the maternal line of C. For convenience, AA is considered in the paternal line of C in the calculations herein, while M is the biological mother of C.
Suppose that a certain kit is used for detecting n alleles at a certain locus, and the frequency of each allele is p1、p2、…、pnI and j respectively denote two alleles, i and j may be the same or different, and in particular, as described in detail below, piThe gene frequency, p, of the allele ijGene frequency, d, of the respective allele jiAnd djDenotes the number of occurrences of allele i and allele j in both alleles of AA, d'iAnd d'jIndicates the number of occurrences of allele i and allele j in both alleles of M, respectively. For example, if i ≠ j, then d is the two allelic genes of AA are (i, k), and k ≠ iiIs 1, djIs 0.
Let f (Fi) be the probability that the parent of C provides allele i to it when AA is determined to be the parent of G generation of C; f (F' i) is the probability that the parent of C provides allele i to the parent when the parents are all random individuals; f (Mi) is the probability that C's maternal ancestor provides it with allele i. Where allele j is involved, the nomenclature is as above.
All Kinship Indexes (KI) are Likelihood Ratios (LR), i.e. the ratio of two hypothesis probabilities, wherein the hypothesis at the molecular position is called the provenance hypothesis (Hp), i.e. there is a certain genetic relationship between two individuals; the hypothesis at the position of the denominator is called the defendant hypothesis (Hd), i.e. there is no specific relationship between the two individuals, KI is expressed by equation (5):
Figure BDA0001607022820000131
wherein E is an event { C, AA } or { C, AA, M } for detecting the occurrence of a specific genotype combination among individuals, Hp is the G generation ancestor of C, Hd is AA and C are unrelated individuals.
Pr(E|Hp) Is shown in HpAssuming that when set, the probability of occurrence of E, i.e., the probability that the parent of C and the parent given by the condition provide one of its two alleles, respectively, under the condition that AA is determined to be the G generation parent of C. When C is homozygote (i, i), the probability is f (fi) xf (mi); when C is a heterozygote (i, j), the probability is f (Fi) x f (Mj) + f (Fj) x f (Mi).
Pr(E|Hd) Is shown in HdAssuming that it is set, the probability of occurrence of E, i.e. the probability that a completely random ancestor of the parent and the conditional giving ancestor of the parent each provide one of its two alleles, respectively.When C is homozygote (i, i), the probability is F (F' i) x F (Mi); when C is a heterozygote (i, j), the probability is F (F 'i) × F (Mj) + F (F' j) × F (Mi).
Therefore, the calculation of KI can be summarized according to whether the child is homozygous for the genotype as follows:
c is homozygote, taking (i, i) as an example, KI is calculated as follows:
Figure BDA0001607022820000141
c is heterozygote, taking (i, j) and i ≠ j as an example, the calculation formula for KI is as follows:
Figure BDA0001607022820000142
as can be seen from equations (6) and (7), assuming that C has a genotype of (i, j), i and j may or may not be the same, for all cases:
Figure BDA0001607022820000143
in a classical pedigree, of the two alleles of a child, one is from the father and one is from the mother. In the same line, as the genetic relationship hierarchy (G) increases, the number of its ancestors of the G generation in the parent or mother line also increases exponentially, i.e., the number of ancestors of the G generation in the parent or mother line is 2G-1 Total allele 2GAnd (4) respectively. This 2 can be inferred from Mendelian's Law of inheritanceGAll alleles have the same probability of being passed to children.
Thus, the probability that a child's paternal or maternal allele is from a G generation ancestor of that line is
Figure BDA0001607022820000144
The probability from other ancestors of the same ancestor of the family is
Figure BDA0001607022820000145
And a certain body is inherited to the offspringThe probability of allele i is
Figure BDA0001607022820000146
Thus, the probability of an AA inheriting allele i to C is:
Figure BDA0001607022820000147
since all alleles of ancestors of the parent are given equal opportunity to be inherited by children, f (fi) is the sum of the probability of providing allele i to AA and the probability of providing allele i to other ancestors of the parent. When the genotype of an individual is unknown, the probability of inheritance of allele i to the next generation is considered to be pi. Therefore, the method comprises the following steps:
Figure BDA0001607022820000148
in the target relationship, the individual AA is the G generation ancestor of C, and the other individuals in the family are unknown and considered to be random individuals, without reference to the parent. It is understood that, in this case, F (F' i) ═ F (mi) ═ piAssuming that C has a genotype of (i, j), i and j may be the same or different, for all cases:
Figure BDA0001607022820000151
Y=Pr(E|Hd)=f(F'i)×f(Mj)+f(F'j)×f(Mi)=2pipj(11)
then the non-reference parent participation affinity index KI is recorded as the first affinity index KIDThe calculation formula is as follows:
Figure BDA0001607022820000152
the formula (1) is obtained by the above analysis.
And (3) identifying the direct relationship without reference parent participation when the diad parent is identified as G ═ 1, substituting G ═ 1 into the formula (1), and obtaining the paternity index as the first paternity index, namely the formula (3).
In the case of reference parents, where AA is the G ancestor of C, the biological mother or father M of C is established and its genotype is known, and no other individuals in the family are known, except AA, M, C, and considered to be random individuals, in the target relationship. It can be seen that F (F' i) ═ p at this timei
Figure BDA0001607022820000153
Assuming that C has a genotype of (i, j), i and j may be the same or different, for all cases:
Figure BDA0001607022820000154
Figure BDA0001607022820000155
then there is said reference parent participation affinity index as the second affinity index KITThe calculation formula is as follows:
Figure BDA0001607022820000156
the formula (2) is obtained by the above analysis.
And (3) when the triplet parent is identified as G ═ 1, identifying the direct relationship with reference to the parent, and substituting G ═ 1 into the formula (2) to obtain the paternity index as a second paternity index, namely the formula (4).
By observing the KI for more distant direct relationships, the following relationships exist between the G generation direct relationship index and the corresponding affinity index, regardless of whether the parent data is referenced in the identification:
Figure BDA0001607022820000161
wherein, G represents the relationship hierarchy of the assumed ancestor to child under the specific relationship, and different paternity index formulas are respectively applied according to the diads or the triplets.
When the controlled relationship is a collateral relationship, the derivation process is as follows:
it can be seen that, when the sites of linkage equilibrium are used for identification, the genetic relationship indexes of the genetic relationships in the same hierarchy are equal to each other according to the hierarchical distribution of the genetic relationships, regardless of the genotype combination of the individuals involved in the identification.
Therefore, for any direct genetic relationship and collateral genetic relationship meeting the conditions in step S103, the formula (1) can be applied to the calculation of the first genetic relationship index, and the formula (2) can be applied to the calculation of the second genetic relationship index, wherein G in both the formulas represents the genetic relationship hierarchy of the controlled genetic relationship.
It should be understood that, the sequence numbers of the steps in the foregoing embodiments do not imply an execution sequence, and the execution sequence of each process should be determined by its function and inherent logic, and should not constitute any limitation to the implementation process of the embodiments of the present invention.
EXAMPLE III
Referring to fig. 4, in a third embodiment of the present invention, a genetic relationship identification apparatus includes: a first acquisition unit 301, a second acquisition unit 302, a third acquisition unit 303, and a determination unit 304. Wherein,
a first obtaining unit 301, configured to obtain a first allele and a second allele of a person to be identified, and determine a first gene frequency of the first allele and a second gene frequency of the second allele;
a second obtaining unit 302, configured to obtain a first number of occurrences of the first allele in the genotype of the controlled ancestor and a second number of occurrences of the second allele in the genotype of the controlled ancestor;
a third obtaining unit 303, configured to obtain a level of an affinity relationship between the person to be authenticated and the controlled ancestor;
a determining unit 304, configured to determine a first genetic relationship between the person to be authenticated and the controlled ancestor according to the genetic relationship hierarchy, the first genetic frequency, the second genetic frequency, the first occurrence number, and the second occurrence number.
According to the embodiment of the invention, the first gene frequency of the first allele of the person to be identified and the second gene frequency of the second allele are obtained through the first obtaining unit 301, the first appearance frequency of the first allele in the controlled ancestor and the second appearance frequency of the second allele in the controlled ancestor are obtained through the second obtaining unit 302, the genetic relationship hierarchy of the person to be identified and the controlled ancestor is obtained through the third obtaining unit 303, the first genetic relationship of the person to be identified and the controlled ancestor is determined through the determining unit 304 according to the genetic relationship hierarchy, the first gene frequency, the second gene frequency, the first appearance frequency and the second appearance frequency, and the combination condition of detecting the individual genotypes does not need to be considered, so that the identification process is simplified.
Referring to fig. 5, in another implementation manner, the genetic relationship identification apparatus includes: a first acquisition unit 301, a second acquisition unit 302, a third acquisition unit 303, a determination unit 304, and a fourth acquisition unit 305. Wherein,
a first obtaining unit 301, configured to obtain a first allele and a second allele of a person to be identified, and determine a first gene frequency of the first allele and a second gene frequency of the second allele;
a second obtaining unit 302, configured to obtain a first number of occurrences of the first allele in the genotype of the controlled ancestor and a second number of occurrences of the second allele in the genotype of the controlled ancestor;
a third obtaining unit 303, configured to obtain a level of an affinity relationship between the person to be authenticated and the controlled ancestor;
a fourth acquisition unit 305 for acquiring a third number of occurrences of the first allele in the genotype of the reference parent and a fourth number of occurrences of the second allele in the genotype of the reference parent;
the determining unit 304 is configured to determine a second relationship between the person to be identified and the controlled ancestor according to the relationship hierarchy, the first gene frequency, the second gene frequency, the first occurrence number, the second occurrence number, the third occurrence number, and the fourth occurrence number.
The embodiment of the invention obtains a first gene frequency of a first allele of a person to be identified and a second gene frequency of a second allele by a first obtaining unit 301, obtains a first occurrence number of the first allele in a controlled ancestor and a second occurrence number of the second allele in the controlled ancestor by a second obtaining unit 302, obtains an affinity hierarchy of the person to be identified and the controlled ancestor by a third obtaining unit 303, obtains a third occurrence number of the first allele in a genotype of a reference parent and a fourth occurrence number of the second allele in the genotype of the reference parent by a fourth obtaining unit 305, and determines a second affinity of the person to be identified and the controlled ancestor according to the affinity hierarchy, the first gene frequency, the second gene frequency, the first occurrence number, the second occurrence number, the third occurrence number and the fourth occurrence number by a determining unit 304, without the need to consider combinations that detect individual genotypes, thereby simplifying the identification process.
Optionally, the determining unit 304 is specifically configured to:
the determining unit is specifically configured to determine the value of the parameter according to an expression
Figure BDA0001607022820000181
Determining a first genetic relationship index of the person to be authenticated and the controlled ancestor, and judging the first genetic relationship of the person to be authenticated and the controlled ancestor according to the first genetic relationship index;
wherein KIDIs a first genetic relationship index, piIs the first allele frequency, pjIs the second allele frequency, diFor the first number of occurrences, djAnd G is the genetic relationship hierarchy.
Optionally, the determining unit 304 is specifically configured to:
according to the expression
Figure BDA0001607022820000182
Determining a second genetic relationship index of the person to be authenticated and the controlled ancestor, and judging the second genetic relationship of the person to be authenticated and the controlled ancestor according to the second genetic relationship index;
wherein KITIs a second genetic relationship index, piIs the first allele frequency, pjIs the second allele frequency, diFor the first number of occurrences, djIs the second occurrence number, d'iIs the third occurrence number, d'jAnd G is the genetic relationship hierarchy.
Optionally, the first obtaining unit 301 is specifically configured to use a preset kit to test the genotype of the preset locus of the person to be identified, so as to obtain a first genotype;
determining a first gene frequency of the first allele and a second gene frequency of the second allele from the first genotype.
Optionally, the second obtaining unit 302 is specifically configured to use a preset kit to test the genotype of the preset locus of the controlled ancestor, and obtain a second genotype;
determining a first number of occurrences of the first allele and a second number of occurrences of the second allele based on the second genotype.
Optionally, the fourth obtaining unit 305 is specifically configured to enable a preset kit to test the genotype of the preset site of the reference parent, and obtain a third genotype;
determining a third number of occurrences of the first allele and a fourth number of occurrences of the second allele from the third genotype.
Example four
Fig. 6 is a schematic diagram of an affinity identification terminal device according to a fourth embodiment of the present invention. As shown in fig. 6, the genetic relationship authentication terminal device 4 of this embodiment includes: a processor 401, a memory 402 and a computer program 403 stored in said memory 402 and executable on said processor 401. The processor 401, when executing the computer program 403, implements the steps in the embodiment of the affinity identification method described above, such as the steps S101 to S104 shown in fig. 1. Alternatively, the processor 401, when executing the computer program 403, implements the functions of each module/unit in each device embodiment described above, for example, the functions of the units 301 to 304 shown in fig. 4.
Illustratively, the computer program 403 may be partitioned into one or more modules/units that are stored in the memory 402 and executed by the processor 401 to implement the present invention. The one or more modules/units may be a series of computer program instruction segments capable of performing specific functions, which are used to describe the execution process of the computer program 403 in the genetic relationship identification terminal device 4. For example, the computer program 403 may be divided into a first acquisition unit, a second acquisition unit, a third acquisition unit and a determination unit, and each unit has the following specific functions:
the first acquisition unit is used for acquiring a first allele and a second allele of a person to be identified and determining a first gene frequency of the first allele and a second gene frequency of the second allele;
a second obtaining unit for obtaining a first number of occurrences of the first allele in the genotype of the controlled ancestor and a second number of occurrences of the second allele in the genotype of the controlled ancestor;
a third obtaining unit, configured to obtain a genetic relationship hierarchy between the person to be authenticated and the controlled ancestor;
a determining unit, configured to determine a first genetic relationship between the person to be authenticated and the controlled ancestor according to the genetic relationship hierarchy, the first gene frequency, the second gene frequency, the first occurrence number, and the second occurrence number.
The computer program 403 may also be divided into a first acquisition unit, a second acquisition unit, a third acquisition unit, a determination unit and a fourth acquisition unit. Wherein,
the first acquisition unit is used for acquiring a first allele and a second allele of a person to be identified and determining a first gene frequency of the first allele and a second gene frequency of the second allele;
a second obtaining unit for obtaining a first number of occurrences of the first allele in the genotype of the controlled ancestor and a second number of occurrences of the second allele in the genotype of the controlled ancestor;
a third obtaining unit, configured to obtain a genetic relationship hierarchy between the person to be authenticated and the controlled ancestor;
a fourth acquisition unit configured to acquire a third number of occurrences of the first allele in the genotype of the reference parent and a fourth number of occurrences of the second allele in the genotype of the reference parent;
the determining unit is used for determining a second genetic relationship between the person to be identified and the controlled ancestor according to the genetic relationship hierarchy, the first gene frequency, the second gene frequency, the first occurrence number, the second occurrence number, the third occurrence number and the fourth occurrence number.
Optionally, the determining unit 304 is specifically configured to:
according to the expression
Figure BDA0001607022820000201
Determining a first genetic relationship index of the person to be authenticated and the controlled ancestor, and judging the first genetic relationship of the person to be authenticated and the controlled ancestor according to the first genetic relationship index;
wherein KIDIs a first genetic relationship index, piIs the first allele frequency, pjIs the second allele frequency, diFor the first number of occurrences, djAnd G is the genetic relationship hierarchy.
Optionally, the determining unit 304 is specifically configured to:
according to the expression
Figure BDA0001607022820000211
Determining a second genetic relationship index of the person to be authenticated and the controlled ancestor, and judging the second genetic relationship of the person to be authenticated and the controlled ancestor according to the second genetic relationship index;
wherein KITIs a second genetic relationship index, piIs the first allele frequency, pjIs the second allele frequency, diFor the first number of occurrences, djIs the second occurrence number, d'iIs the third occurrence number, d'jAnd G is the genetic relationship hierarchy.
Optionally, the first obtaining unit 301 is specifically configured to use a preset kit to test the genotype of the preset locus of the person to be identified, so as to obtain a first genotype;
determining a first gene frequency of the first allele and a second gene frequency of the second allele from the first genotype.
Optionally, the second obtaining unit 302 is specifically configured to use a preset kit to test the genotype of the preset locus of the controlled ancestor, and obtain a second genotype;
determining a first number of occurrences of the first allele and a second number of occurrences of the second allele based on the second genotype.
Optionally, the fourth obtaining unit 305 is specifically configured to enable a preset kit to test the genotype of the preset site of the reference parent, and obtain a third genotype;
determining a third number of occurrences of the first allele and a fourth number of occurrences of the second allele from the third genotype.
The genetic relationship identification terminal device 4 may be a desktop computer, a notebook, a palm computer, a cloud server, or other computing devices. The genetic relationship identification terminal device may include, but is not limited to, a processor 401 and a memory 402. Those skilled in the art will appreciate that fig. 4 is merely an example of the affinity identification terminal device 4, and does not constitute a limitation of the affinity identification terminal device 4, and may include more or less components than those shown, or some components in combination, or different components, for example, the affinity identification terminal device may further include an input-output device, a network access device, a bus, etc.
The Processor 401 may be a Central Processing Unit (CPU), other general purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), an off-the-shelf Programmable Gate Array (FPGA) or other Programmable logic device, discrete Gate or transistor logic, discrete hardware components, etc. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
The storage 402 may be an internal storage unit of the genetic relationship identification terminal device 4, such as a hard disk or a memory of the genetic relationship identification terminal device 4. The memory 402 may also be an external storage device of the genetic relationship identification terminal device 4, such as a plug-in hard disk, a Smart Memory Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), or the like, provided on the genetic relationship identification terminal device 4. Further, the memory 402 may include both an internal storage unit and an external storage device of the genetic relationship identification terminal device 4. The memory 402 is used for storing the computer program and other programs and data required for the affinity authentication terminal device. The memory 402 may also be used to temporarily store data that has been output or is to be output.
It will be apparent to those skilled in the art that, for convenience and brevity of description, only the above-mentioned division of the functional units and modules is illustrated, and in practical applications, the above-mentioned function distribution may be performed by different functional units and modules according to needs, that is, the internal structure of the apparatus is divided into different functional units or modules to perform all or part of the above-mentioned functions. Each functional unit and module in the embodiments may be integrated in one processing unit, or each unit may exist alone physically, or two or more units are integrated in one unit, and the integrated unit may be implemented in a form of hardware, or in a form of software functional unit. In addition, specific names of the functional units and modules are only for convenience of distinguishing from each other, and are not used for limiting the protection scope of the present application. The specific working processes of the units and modules in the system may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
In the above embodiments, the descriptions of the respective embodiments have respective emphasis, and reference may be made to the related descriptions of other embodiments for parts that are not described or illustrated in a certain embodiment.
Those of ordinary skill in the art will appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.
In the embodiments provided in the present invention, it should be understood that the disclosed apparatus/terminal device and method may be implemented in other ways. For example, the above-described embodiments of the apparatus/terminal device are merely illustrative, and for example, the division of the modules or units is only one logical division, and there may be other divisions when actually implemented, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated modules/units, if implemented in the form of software functional units and sold or used as separate products, may be stored in a computer readable storage medium. Based on such understanding, all or part of the flow of the method according to the embodiments of the present invention may also be implemented by a computer program, which may be stored in a computer-readable storage medium, and when the computer program is executed by a processor, the steps of the method embodiments may be implemented. Wherein the computer program comprises computer program code, which may be in the form of source code, object code, an executable file or some intermediate form, etc. The computer-readable medium may include: any entity or device capable of carrying the computer program code, recording medium, usb disk, removable hard disk, magnetic disk, optical disk, computer Memory, Read-Only Memory (ROM), Random Access Memory (RAM), electrical carrier wave signals, telecommunications signals, software distribution medium, and the like. It should be noted that the computer readable medium may contain content that is subject to appropriate increase or decrease as required by legislation and patent practice in jurisdictions, for example, in some jurisdictions, computer readable media does not include electrical carrier signals and telecommunications signals as is required by legislation and patent practice.
The above-mentioned embodiments are only used for illustrating the technical solutions of the present invention, and not for limiting the same; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; such modifications and substitutions do not substantially depart from the spirit and scope of the embodiments of the present invention, and are intended to be included within the scope of the present invention.

Claims (8)

1. A method for genetic relationship identification, comprising:
acquiring a first allele and a second allele of a person to be identified, and determining a first gene frequency of the first allele and a second gene frequency of the second allele;
obtaining a first number of occurrences of the first allele in the genotype of the controlled ancestor and a second number of occurrences of the second allele in the genotype of the controlled ancestor;
acquiring the genetic relationship hierarchy of the person to be authenticated and the controlled ancestor;
determining a first genetic relationship between the person to be identified and the controlled ancestor according to the genetic relationship hierarchy, the first gene frequency, the second gene frequency, the first occurrence number and the second occurrence number; or
Obtaining a third number of occurrences of the first allele in the genotype of the reference parent and a fourth number of occurrences of the second allele in the genotype of the reference parent;
determining a second genetic relationship between the person to be authenticated and the controlled ancestor according to the genetic relationship hierarchy, the first gene frequency, the second gene frequency, the first occurrence number, the second occurrence number, the third occurrence number and the fourth occurrence number;
wherein the determining a first genetic relationship between the person to be authenticated and the controlled ancestor according to the genetic relationship hierarchy, the first gene frequency, the second gene frequency, the first number of occurrences, and the second number of occurrences comprises:
according to the expression
Figure FDA0002345621070000011
Determining a first genetic relationship index of the person to be authenticated and the controlled ancestor, and judging the first genetic relationship of the person to be authenticated and the controlled ancestor according to the first genetic relationship index;
wherein KIDIs a first genetic relationship index, piIs the first allele frequency, pjIs the second allele frequency, diFor the first number of occurrences, djAnd G is the genetic relationship hierarchy.
2. The method of claim 1, wherein the determining the second relationship between the person to be authenticated and the controlled ancestor based on the genetic hierarchy, the first gene frequency, the second gene frequency, the first number of occurrences, the second number of occurrences, the third number of occurrences, and the fourth number of occurrences comprises:
according to the expression
Figure FDA0002345621070000021
Determining a second genetic relationship index of the person to be authenticated and the controlled ancestor, and judging the second genetic relationship of the person to be authenticated and the controlled ancestor according to the second genetic relationship index;
wherein KITIs a second genetic relationship index, piIs the first allele frequency, pjIs the second allele frequency, diFor the first number of occurrences, djIs the second occurrence number, d'iIs the third occurrence number, d'jAnd G is the genetic relationship hierarchy.
3. The genetic relationship identification method of claim 1, wherein the obtaining a first allele and a second allele of a person to be identified and determining a first gene frequency of the first allele and a second gene frequency of the second allele comprises:
testing the genotype of the preset locus of the person to be identified by using a preset kit to obtain a first genotype;
determining a first gene frequency of the first allele and a second gene frequency of the second allele from the first genotype.
4. The method of claim 1, wherein the obtaining a first number of occurrences of the first allele in a controlled ancestor and a second number of occurrences of the second allele in the controlled ancestor comprises:
testing the genotype of the preset locus of the controlled ancestor by using a preset kit to obtain a second genotype;
determining a first number of occurrences of the first allele and a second number of occurrences of the second allele based on the second genotype.
5. The method of claim 1, wherein obtaining a third number of occurrences of the first allele at a reference parent and a fourth number of occurrences of the second allele at the reference parent comprises:
testing the genotype of the preset site of the reference parent by using a preset kit to obtain a third genotype;
determining a third number of occurrences of the first allele and a fourth number of occurrences of the second allele from the third genotype.
6. An apparatus for genetic relationship identification, comprising:
the first acquisition unit is used for acquiring a first allele and a second allele of a person to be identified and determining a first gene frequency of the first allele and a second gene frequency of the second allele;
a second obtaining unit for obtaining a first number of occurrences of the first allele in the genotype of the controlled ancestor and a second number of occurrences of the second allele in the genotype of the controlled ancestor;
a third obtaining unit, configured to obtain a genetic relationship hierarchy between the person to be authenticated and the controlled ancestor;
a determining unit, configured to determine a first genetic relationship between the person to be authenticated and the controlled ancestor according to the genetic relationship hierarchy, the first genetic frequency, the second genetic frequency, the first occurrence number, and the second occurrence number;
wherein the determining unit is specifically configured to determine the target position according to an expression
Figure FDA0002345621070000031
Determining a first genetic relationship index of the person to be authenticated and the controlled ancestor, and judging the first genetic relationship of the person to be authenticated and the controlled ancestor according to the first genetic relationship index;
wherein KIDIs a first genetic relationship index, piIs the first allele frequency, pjIs the second allele frequency, diFor the first number of occurrences, djThe second occurrence number, G is the level of the genetic relationship; or
According to the expression
Figure FDA0002345621070000032
Determining a second genetic relationship index of the person to be authenticated and the controlled ancestor, and judging the second genetic relationship of the person to be authenticated and the controlled ancestor according to the second genetic relationship index;
wherein KITIs a second genetic relationship index, piIs the first allele frequency, pjIs the second allele frequency, diFor the first number of occurrences, djIs the second occurrence number, d'iIs the third occurrence number, d'jThe fourth occurrence number, G, the level of the genetic relationship;
or
Further comprising:
a fourth acquisition unit configured to acquire a third number of occurrences of the first allele in the genotype of the reference parent and a fourth number of occurrences of the second allele in the genotype of the reference parent;
the determining unit is configured to determine a second genetic relationship between the person to be identified and the controlled ancestor according to the genetic relationship hierarchy, the first genetic frequency, the second genetic frequency, the first occurrence number, the second occurrence number, the third occurrence number, and the fourth occurrence number.
7. An affinity assessment terminal device comprising a memory, a processor and a computer program stored in said memory and executable on said processor, characterized in that said processor implements the steps of the method according to any one of claims 1 to 5 when executing said computer program.
8. A computer-readable storage medium, in which a computer program is stored which, when being executed by a processor, carries out the steps of the method according to any one of claims 1 to 5.
CN201810247761.5A 2018-03-23 2018-03-23 Genetic relationship identification method and terminal equipment Active CN108491691B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810247761.5A CN108491691B (en) 2018-03-23 2018-03-23 Genetic relationship identification method and terminal equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810247761.5A CN108491691B (en) 2018-03-23 2018-03-23 Genetic relationship identification method and terminal equipment

Publications (2)

Publication Number Publication Date
CN108491691A CN108491691A (en) 2018-09-04
CN108491691B true CN108491691B (en) 2020-06-09

Family

ID=63319612

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810247761.5A Active CN108491691B (en) 2018-03-23 2018-03-23 Genetic relationship identification method and terminal equipment

Country Status (1)

Country Link
CN (1) CN108491691B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111091869A (en) * 2020-01-13 2020-05-01 北京奇云诺德信息科技有限公司 Genetic relationship identification method using SNP as genetic marker
CN116209777A (en) * 2020-10-27 2023-06-02 深圳华大基因股份有限公司 Genetic relationship judging method and device based on noninvasive prenatal gene detection data
CN115346594B (en) * 2022-08-24 2023-09-05 温州医科大学 Ancestor relationship identification method, system, equipment and medium without raw mother participation
CN115206424B (en) * 2022-08-24 2023-04-07 温州医科大学 Method, system, equipment and storage medium for identifying full sibling relationship

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102982222A (en) * 2011-09-02 2013-03-20 司法部司法鉴定科学技术研究所 Simple relatedness index acquiring method on condition of no mutation
CN107609343A (en) * 2017-08-14 2018-01-19 广州市圣鑫生物科技有限公司 Relationship iden- tification method, system, computer equipment and readable storage medium storing program for executing

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102982222A (en) * 2011-09-02 2013-03-20 司法部司法鉴定科学技术研究所 Simple relatedness index acquiring method on condition of no mutation
CN107609343A (en) * 2017-08-14 2018-01-19 广州市圣鑫生物科技有限公司 Relationship iden- tification method, system, computer equipment and readable storage medium storing program for executing

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
ISFG: Recommendations on biostatistics in paternity testing;David W. Gjertson 等;《Forensic Science International: Genetics》;20071231;全文 *
亲权指数计算的新方法;吕德坚 等;《法医学杂志》;20090831;全文 *

Also Published As

Publication number Publication date
CN108491691A (en) 2018-09-04

Similar Documents

Publication Publication Date Title
CN108491691B (en) Genetic relationship identification method and terminal equipment
Kling et al. Investigative genetic genealogy: Current methods, knowledge and practice
Nielsen et al. SNP calling, genotype calling, and sample allele frequency estimation from new-generation sequencing data
Luo et al. A multi-task convolutional deep neural network for variant calling in single molecule sequencing
Vezzi et al. Reevaluating assembly evaluations with feature response curves: GAGE and assemblathons
Chen et al. A uniform survey of allele-specific binding and expression over 1000-Genomes-Project individuals
Sibbesen et al. Accurate genotyping across variant classes and lengths using variant graphs
O'Connell et al. A general approach for haplotype phasing across the full spectrum of relatedness
Shah et al. Integrating copy number polymorphisms into array CGH analysis using a robust HMM
US20190139624A1 (en) Identifying ancestral relationships using a continuous stream of input
Huang et al. PCAP: a whole-genome assembly program
Lee et al. DUDE-Seq: fast, flexible, and robust denoising for targeted amplicon sequencing
Kiezun et al. Deleterious alleles in the human genome are on average younger than neutral alleles of the same frequency
Chen et al. Genotype calling and haplotyping in parent-offspring trios
Huang et al. Differential expression analysis for RNA-Seq: an overview of statistical methods and computational software: supplementary issue: sequencing platform modeling and analysis
Marstrand et al. Identifying and mapping cell-type-specific chromatin programming of gene expression
Kim et al. Distance metrics for ranked evolutionary trees
Hemstrom et al. snpR: User friendly population genomics for SNP data sets with categorical metadata
Andreu-Sánchez et al. A benchmark of genetic variant calling pipelines using metagenomic short-read sequencing
Bayat et al. Fast and accurate exhaustive higher-order epistasis search with BitEpi
Li et al. Efficient identification of identical-by-descent status in pedigrees with many untyped individuals
Srivastava et al. NetSeekR: a network analysis pipeline for RNA-Seq time series data
Kamneva et al. Simulation-based evaluation of hybridization network reconstruction methods in the presence of incomplete lineage sorting
Jiang et al. Recent developments in statistical methods for GWAS and high-throughput sequencing association studies of complex traits
US10747899B2 (en) Techniques for determining whether an individual is included in ensemble genomic data

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CB03 Change of inventor or designer information
CB03 Change of inventor or designer information

Inventor after: Ma Guanche

Inventor after: Li Shujin

Inventor after: Cong Bin

Inventor before: Li Shujin

Inventor before: Ma Guanche

Inventor before: Cong Bin