CN110706739A - Protein conformation space sampling method based on multi-mode internal and external intersection - Google Patents

Protein conformation space sampling method based on multi-mode internal and external intersection Download PDF

Info

Publication number
CN110706739A
CN110706739A CN201910788537.1A CN201910788537A CN110706739A CN 110706739 A CN110706739 A CN 110706739A CN 201910788537 A CN201910788537 A CN 201910788537A CN 110706739 A CN110706739 A CN 110706739A
Authority
CN
China
Prior art keywords
individuals
population
class
individual
different
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910788537.1A
Other languages
Chinese (zh)
Other versions
CN110706739B (en
Inventor
张贵军
赵凯龙
夏瑜豪
刘俊
彭春祥
周晓根
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University of Technology ZJUT
Original Assignee
Zhejiang University of Technology ZJUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University of Technology ZJUT filed Critical Zhejiang University of Technology ZJUT
Priority to CN201910788537.1A priority Critical patent/CN110706739B/en
Publication of CN110706739A publication Critical patent/CN110706739A/en
Application granted granted Critical
Publication of CN110706739B publication Critical patent/CN110706739B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/10Sequence alignment; Homology search
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • G16B20/20Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B5/00ICT specially adapted for modelling or simulations in systems biology, e.g. gene-regulatory networks, protein interaction networks or metabolic networks
    • G16B5/20Probabilistic models

Abstract

A multi-mode internal and external crossing-based protein conformation space sampling method introduces a crowding strategy and a greedy search strategy, wherein the intra-mode crossing and greedy search strategy is beneficial to convergence of individuals to local extrema, and the extra-mode crossing and crowding strategy is beneficial to keeping population diversity. After a population is initialized, through continuous fragment assembly, population individuals are divided into a plurality of modal sets, when the execution modes are crossed internally, a squeezing strategy is applied to sample the conformations, when the execution modes are crossed externally, a greedy search strategy is applied to sample the conformations, and the conflict problems of algorithm quick convergence and diversity retention can be better solved through the crossed use of the two strategies.

Description

Protein conformation space sampling method based on multi-mode internal and external intersection
Technical Field
The invention relates to the fields of bioinformatics and computer application, in particular to a protein conformation space sampling method based on multi-mode internal and external crossing.
Background
Protein structure prediction is an important content of genome function research and is a problem to be solved urgently in the field of bioinformatics at present. Proteins are important elements for life, such as enzyme proteins that catalyze biochemical reactions, carrier proteins that transport oxygen and nutrients, antibody proteins that are responsible for recognition of signals or participate in immune reactions, and the like. The function and structure of protein have close relationship, only the amino acid sequence with certain spatial structure can exert its specific biological function, the research of protein structure prediction tries to decipher the second genetic code and find out the relation between the amino acid sequence and the protein structure, which is the important research content of bioinformatics at present. Besides the biological theory significance, the protein structure prediction also has important practical application significance. It is not enough to study the function of protein and find out the pathogenic mechanism only by amino acid sequence, and it is necessary to know its spatial structure, i.e. the drug design is based on the spatial structure of protein, on the basis of knowing its spatial structure, it utilizes molecular docking algorithm and computer technology to design the inhibitory molecules of disease as candidate drugs, achieving the purpose of inhibiting some enzymes or protein activities.
In protein structure prediction, due to the complexity and inaccuracy of a force field model selected by protein structure prediction, a global stable structure predicted by an algorithm may not be well matched with the structure of an actually measured target point, so that a multi-mode optimization algorithm needs to be designed to provide other high-quality local stable structures of the protein.
Many practical optimization Problems belong to multi-modal function optimization Problems (multi-modal function optimization schemes), and usually a plurality of solutions are required to be solved, including a global optimal solution and a local optimal solution.
Therefore, it is difficult for the current protein structure prediction methods to effectively balance conformational diversity and convergence rate, and improvements are needed.
Disclosure of Invention
In order to overcome the defect that the existing protein structure prediction method cannot take account of the balance conformation diversity and the convergence rate, the invention provides an algorithm based on intra-mode crossing and extra-mode crossing, and introduces a crowd-sourcing strategy and a greedy search strategy. Intra-modal crossover and greedy search strategies facilitate convergence of individuals towards local extrema, while extra-modal crossover and crowding strategies facilitate preservation of population diversity. After a population is initialized, the population is divided into a plurality of modal sets through continuous fragment assembly, when intra-modal intersection is executed, a squeezing strategy is applied to sample the conformation, and when extra-modal intersection is executed, a greedy search strategy is applied to sample the conformation. The conflict problems of rapid convergence and diversity maintenance of the algorithm can be better solved by using the two strategies in a crossed manner.
The technical scheme adopted by the invention for solving the technical problems is as follows:
a method for spatial sampling of protein conformation based on multi-modal internal-external crossing, the method comprising the steps of:
1) inputting sequence information of a predicted protein, and reading the sequence length L;
2) setting parameters: population size N, number of iterations G1、G2Cross probability PcMutation operator F and mode number H;
3) initializing a population: iterating the first and second stages of Rosetta to generate an initial population P ═ P with N individuals1,P2,...,PN};
4) Let g1=1,g1∈{1,2,...,G1};
5) Performing cross, variation and selection operations on the population, wherein the process comprises the following steps:
5.1) mutation operation: for theTarget individual PiRandomly selecting two different from the population and different from PiDifferent individuals, denoted Prand1、Prand2(ii) a Variant individuals P were generated as followsi′:
Pi′=Pi+F(Prand1-Prand2)
5.2) cross operation: generating two random numbers, r and jrand,r∈[0,1],jrandE {1, 2.., L }; the crossed individuals P are generated as followsi″:
Figure BDA0002178840320000021
j belongs to {1,2, … L }, and represents an amino acid sequence number;
5.3) iterating the steps 5.1) and 5.2) until all the target individuals are traversed, and generating a filial generation population P';
5.4) selecting operation: scoring the individuals in the population P and P' by using an energy function, sequencing all the individuals from low to high according to energy, and selecting the first N individuals with low energy to replace the original individuals in the population P;
6)g1=g1+ 1; if g is1≤G1Go to step 5);
7) and generating the mode according to the following process:
7.1) calculating the similarity between every two individuals in the population P according to the following formula:
Figure BDA0002178840320000031
Figure BDA0002178840320000032
and
Figure BDA0002178840320000033
respectively represent an individual PiAnd PjMiddle k number CαThree-dimensional coordinates of atoms, wherein L is the sequence length of the structure, and the smaller the RMSE is, the more similar the two individuals are;
7.2) taking the similarity score between the two individuals as the distance between the two individuals, clustering the seeds into H classes by using a K-center clustering algorithm, and marking the class center point as Ch,h∈{1,2,...,H};
8) Let g2=1,g2∈{1,2,...,G2};
9) If g is2If the number is odd, executing step 9.1) to perform intra-modal crossing; otherwise, executing a step 9.2), performing out-of-mode crossing:
9.1) intra-modal intersection based on the crowd-sourcing strategy, the procedure is as follows:
9.1.1) for target individuals PiRandomly selecting two different from each other and PiIndividuals P from the same classrand1、Prand2In [1, L-2 ]]Internally generating two different random integers r1And r2(ii) a Will PiR of1To r1Residue # 2 and r2To r2Replacement of dihedral angles of residue +2 by Prand1And Prand2Dihedral values of the corresponding residues, resulting in crossed individuals Pi *
9.1.2) calculating Individual Pi *With all the class center points ChTo find a distance to the individual Pi *The class corresponding to the nearest class central point is found out and is corresponding to Pi *The most similar individual, the individual and P were calculated using the Rosetta score3 energy functioni *If P is the energy value ofi *If the energy value is lower, replacing the similar individual and updating the center point of the class;
9.2) greedy search strategy based off-modal intersection as follows:
9.2.1) for target individuals PiRandomly selecting two different from each other and PiIndividuals P from different classesrand1、Prand2In [1, L-2 ]]Internally generating two different random integers r1And r2(ii) a Will PiR of1To r1Residue # 2 and r2To r2+2 residueThe dihedral angles of radicals being respectively substituted by Prand1And Prand2Dihedral values of the corresponding residues, resulting in crossed individuals Pi **
9.2.2) calculating the offspring individuals Pi **With all the class center points ChFinding the similarity with the individual Pi **Finding out the individual with the highest energy in the class to replace the class corresponding to the nearest class central point, and updating the central point of the class;
10)g2=g2+ 1; if g is2≤G2Go to step 9);
11) and (4) performing population division on all individuals again according to the mode of the step 7), clustering into H classes, and outputting the class center point of each class as a final prediction result.
The invention has the beneficial effects that: by using the intra-modal intersection, the extra-modal intersection, the greedy search strategy and the crowd-sourcing strategy in an intersecting manner, the global search capability and the local convergence speed of the algorithm are improved, the algorithm convergence can be accelerated, the diversity of population individuals can be kept, and the prediction precision is improved.
Drawings
FIG. 1 is a three-dimensional structure diagram of protein 1C8C obtained by structure prediction based on a multi-modal internal-external crossing protein conformation space sampling method.
FIG. 2 is a flow chart of a protein conformation space sampling method based on multi-modal internal and external crossing.
Detailed Description
The invention is further described below with reference to the accompanying drawings.
Referring to fig. 1 and 2, a method for spatial sampling of protein conformation based on multi-modal internal and external crossing comprises the following steps:
1) inputting sequence information of a predicted protein, and reading the sequence length L;
2) setting parameters: population size N, number of iterations G1、G2Cross probability PcMutation operator F and mode number H;
3) initializing a population: iterative Rosetta first and second orderSegment, generating an initial population P ═ { P) with N individuals1,P2,...,PN};
4) Let g1=1,g1∈{1,2,...,G1};
5) Performing cross, variation and selection operations on the population, wherein the process comprises the following steps:
5.1) mutation operation: for the target individual PiRandomly selecting two different from the population and different from PiDifferent individuals, denoted Prand1、Prand2(ii) a Variant individuals P were generated as followsi′:
Pi′=Pi+F(Prand1-Prand2)
5.2) cross operation: generating two random numbers, r and jrand,r∈[0,1],jrandE {1, 2.., L }; the crossed individuals P are generated as followsi″:
Figure BDA0002178840320000041
j belongs to {1,2, … L }, and represents an amino acid sequence number;
5.3) iterating the steps 5.1) and 5.2) until all the target individuals are traversed, and generating a filial generation population P';
5.4) selecting operation: scoring the individuals in the population P and P' by using an energy function, sequencing all the individuals from low to high according to energy, and selecting the first N individuals with low energy to replace the original individuals in the population P;
6)g1=g1+ 1; if g is1≤G1Go to step 5);
7) and generating the mode according to the following process:
7.1) calculating the similarity between every two individuals in the population P according to the following formula:
Figure BDA0002178840320000051
Figure BDA0002178840320000052
and
Figure BDA0002178840320000053
respectively represent an individual PiAnd PjMiddle k number CαThree-dimensional coordinates of atoms, wherein L is the sequence length of the structure, and the smaller the RMSE is, the more similar the two individuals are;
7.2) taking the similarity score between the two individuals as the distance between the two individuals, clustering the seeds into H classes by using a K-center clustering algorithm, and marking the class center point as Ch,h∈{1,2,...,H};
8) Let g2=1,g2∈{1,2,...,G2};
9) If g is2If the number is odd, executing step 9.1) to perform intra-modal crossing; otherwise, executing a step 9.2), performing out-of-mode crossing:
9.1) intra-modal intersection based on the crowd-sourcing strategy, the procedure is as follows:
9.1.1) for target individuals PiRandomly selecting two different from each other and PiIndividuals P from the same classrand1、Prand2In [1, L-2 ]]Internally generating two different random integers r1And r2(ii) a Will PiR of1To r1Residue # 2 and r2To r2Replacement of dihedral angles of residue +2 by Prand1And Prand2Dihedral values of the corresponding residues, resulting in crossed individuals Pi *
9.1.2) calculating Individual Pi *With all the class center points ChTo find a distance to the individual Pi *The class corresponding to the nearest class central point is found out and is corresponding to Pi *The most similar individual, the individual and P were calculated using the Rosetta score3 energy functioni *If P is the energy value ofi *If the energy value is lower, replacing the similar individual and updating the center point of the class;
9.2) greedy search strategy based off-modal intersection as follows:
9.2.1) for target individuals PiRandomly selecting two different from each other and PiIndividuals P from different classesrand1、Prand2In [1, L-2 ]]Internally generating two different random integers r1And r2(ii) a Will PiR of1To r1Residue # 2 and r2To r2Replacement of dihedral angles of residue +2 by Prand1And Prand2Dihedral values of the corresponding residues, resulting in crossed individuals Pi **
9.2.2) calculating the offspring individuals Pi **With all the class center points ChFinding the similarity with the individual Pi **Finding out the individual with the highest energy in the class to replace the class corresponding to the nearest class central point, and updating the central point of the class;
10)g2=g2+ 1; if g is2≤G2Go to step 9);
11) and (4) performing population division on all individuals again according to the mode of the step 7), clustering into H classes, and outputting the class center point of each class as a final prediction result.
In this embodiment, taking the protein 1C8C with the sequence length of 64 as an example, a method for spatial sampling of protein conformation based on multi-modal inside-outside crossing comprises the following steps:
1) inputting sequence information of the predicted protein 1C8C, and reading the sequence length L of 64;
2) setting parameters: the number of iterations G is 200 when the population N is equal to1=100,G2300, cross probability Pc0.1, 0.5 for mutation operator F, 6 for mode number H;
3) initializing a population: iterating the first and second stages of Rosetta to generate an initial population P ═ P with N individuals1,P2,...,PN};
4) Let g1=1,g1∈{1,2,...,G1};
5) Performing cross, variation and selection operations on the population, wherein the process comprises the following steps:
5.1) changePerforming different operations: for the target individual PiRandomly selecting two different from the population and different from PiDifferent individuals, denoted Prand1、Prand2(ii) a Variant individuals P were generated as followsi′:
Pi′=Pi+F(Prand1-Prand2)
5.2) cross operation: generating two random numbers, r and jrand,r∈[0,1],jrandE {1, 2.., L }; the crossed individuals P are generated as followsi″:
Figure BDA0002178840320000061
j belongs to {1,2, … L }, and represents an amino acid sequence number;
5.3) iterating the steps 5.1) and 5.2) until all the target individuals are traversed, and generating a filial generation population P';
5.4) selecting operation: scoring the individuals in the population P and P' by using an energy function, sequencing all the individuals from low to high according to energy, and selecting the first N individuals with low energy to replace the original individuals in the population P;
6)g1=g1+ 1; if g is1≤G1Go to step 5);
7) and generating the mode according to the following process:
7.1) calculating the similarity between every two individuals in the population P according to the following formula:
Figure BDA0002178840320000071
Figure BDA0002178840320000072
and
Figure BDA0002178840320000073
respectively represent an individual PiAnd PjMiddle k number CαThree-dimensional coordinates of atoms, L being the sequence length of the structure, smaller RMSE representing twoThe more similar the individuals are;
7.2) taking the similarity score between the two individuals as the distance between the two individuals, clustering the seeds into H classes by using a K-center clustering algorithm, and marking the class center point as Ch,h∈{1,2,...,H};
8) Let g2=1,g2∈{1,2,...,G2};
9) If g is2If the number is odd, executing step 9.1) to perform intra-modal crossing; otherwise, executing a step 9.2), performing out-of-mode crossing:
9.1) intra-modal intersection based on the crowd-sourcing strategy, the procedure is as follows:
9.1.1) for target individuals PiRandomly selecting two different from each other and PiIndividuals P from the same classrand1、Prand2In [1, L-2 ]]Internally generating two different random integers r1And r2(ii) a Will PiR of1To r1Residue # 2 and r2To r2Replacement of dihedral angles of residue +2 by Prand1And Prand2Dihedral values of the corresponding residues, resulting in crossed individuals Pi *
9.1.2) calculating Individual Pi *With all the class center points ChTo find a distance to the individual Pi *The class corresponding to the nearest class central point is found out and is corresponding to Pi *The most similar individual, the individual and P were calculated using the Rosetta score3 energy functioni *If P is the energy value ofi *If the energy value is lower, replacing the similar individual and updating the center point of the class;
9.2) greedy search strategy based off-modal intersection as follows:
9.2.1) for target individuals PiRandomly selecting two different from each other and PiIndividuals P from different classesrand1、Prand2In [1, L-2 ]]Internally generating two different random integers r1And r2(ii) a Will PiR of1To r1Residue # 2 and r2To r2Replacement of dihedral angles of residue +2 by Prand1And Prand2Dihedral values of the corresponding residues, resulting in crossed individuals Pi **
9.2.2) calculating the offspring individuals Pi **With all the class center points ChFinding the similarity with the individual Pi **Finding out the individual with the highest energy in the class to replace the class corresponding to the nearest class central point, and updating the central point of the class;
10)g2=g2+ 1; if g is2≤G2Go to step 9);
11) and (4) performing population division on all individuals again according to the mode of the step 7), clustering into H classes, and outputting the class center point of each class as a final prediction result.
Using the protein 1C8C with an amino acid sequence length of 64 as an example, the method is used to obtain the near-natural state individuals of the protein in six modes, and the predicted root mean square deviation of the protein is respectively
Figure BDA0002178840320000081
Figure BDA0002178840320000082
The prediction structure is shown in fig. 1.
While the foregoing has described the preferred embodiments of the present invention, it will be apparent that the invention is not limited to the embodiments described, but can be practiced with modification without departing from the essential spirit of the invention and without departing from the spirit of the invention.

Claims (1)

1. A protein conformation space sampling method based on multi-modal internal-external crossing is characterized by comprising the following steps:
1) inputting sequence information of a predicted protein, and reading the sequence length L;
2) setting parameters: population size N, number of iterations G1、G2Cross probability PcMutation operator F, modal numberA number H;
3) initializing a population: iterating the first and second stages of Rosetta to generate an initial population P ═ P with N individuals1,P2,...,PN};
4) Let g1=1,g1∈{1,2,...,G1};
5) Performing cross, variation and selection operations on the population, wherein the process comprises the following steps:
5.1) mutation operation: for the target individual PiRandomly selecting two different from the population and different from PiDifferent individuals, denoted Prand1、Prand2(ii) a Variant individuals P were generated as followsi′:
Pi′=Pi+F(Prand1-Prand2)
5.2) cross operation: generating two random numbers, r and jrand,r∈[0,1],jrandE {1, 2.., L }; the crossed individuals P are generated as followsi″:
Figure FDA0002178840310000011
j belongs to {1,2, … L }, and represents an amino acid sequence number;
5.3) iterating the steps 5.1) and 5.2) until all the target individuals are traversed, and generating a filial generation population P';
5.4) selecting operation: scoring the individuals in the population P and P' by using an energy function, sequencing all the individuals from low to high according to energy, and selecting the first N individuals with low energy to replace the original individuals in the population P;
6)g1=g1+ 1; if g is1≤G1Go to step 5);
7) and generating the mode according to the following process:
7.1) calculating the similarity between every two individuals in the population P according to the following formula:
Figure FDA0002178840310000012
Figure FDA0002178840310000021
and
Figure FDA0002178840310000022
respectively represent an individual PiAnd PjMiddle k number CαThree-dimensional coordinates of atoms, wherein L is the sequence length of the structure, and the smaller the RMSE is, the more similar the two individuals are;
7.2) taking the similarity score between the two individuals as the distance between the two individuals, clustering the seeds into H classes by using a K-center clustering algorithm, and marking the class center point as Ch,h∈{1,2,...,H};
8) Let g2=1,g2∈{1,2,...,G2};
9) If g is2If the number is odd, executing step 9.1) to perform intra-modal crossing; otherwise, executing a step 9.2), performing out-of-mode crossing:
9.1) intra-modal intersection based on the crowd-sourcing strategy, the procedure is as follows:
9.1.1) for target individuals PiRandomly selecting two different from each other and PiIndividuals P from the same classrand1、Prand2In [1, L-2 ]]Internally generating two different random integers r1And r2(ii) a Will PiR of1To r1Residue # 2 and r2To r2Replacement of dihedral angles of residue +2 by Prand1And Prand2Dihedral values of the corresponding residues, resulting in crossed individuals Pi *
9.1.2) calculating Individual Pi *With all the class center points ChTo find a distance to the individual Pi *The class corresponding to the nearest class central point is found out and is corresponding to Pi *The most similar individual, the individual and P were calculated using the Rosetta score3 energy functioni *If P is the energy value ofi *If the energy value is lower, replacing the similar individual and updating the center point of the class;
9.2) greedy search strategy based off-modal intersection as follows:
9.2.1) for target individuals PiRandomly selecting two different from each other and PiIndividuals P from different classesrand1、Prand2In [1, L-2 ]]Internally generating two different random integers r1And r2(ii) a Will PiR of1To r1Residue # 2 and r2To r2Replacement of dihedral angles of residue +2 by Prand1And Prand2Dihedral values of the corresponding residues, resulting in crossed individuals Pi **
9.2.2) calculating the offspring individuals Pi **With all the class center points ChFinding the similarity with the individual Pi **Finding out the individual with the highest energy in the class to replace the class corresponding to the nearest class central point, and updating the central point of the class;
10)g2=g2+ 1; if g is2≤G2Go to step 9);
11) and (4) performing population division on all individuals again according to the mode of the step 7), clustering into H classes, and outputting the class center point of each class as a final prediction result.
CN201910788537.1A 2019-08-26 2019-08-26 Protein conformation space sampling method based on multi-mode internal and external intersection Active CN110706739B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910788537.1A CN110706739B (en) 2019-08-26 2019-08-26 Protein conformation space sampling method based on multi-mode internal and external intersection

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910788537.1A CN110706739B (en) 2019-08-26 2019-08-26 Protein conformation space sampling method based on multi-mode internal and external intersection

Publications (2)

Publication Number Publication Date
CN110706739A true CN110706739A (en) 2020-01-17
CN110706739B CN110706739B (en) 2021-10-29

Family

ID=69193366

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910788537.1A Active CN110706739B (en) 2019-08-26 2019-08-26 Protein conformation space sampling method based on multi-mode internal and external intersection

Country Status (1)

Country Link
CN (1) CN110706739B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016010961A1 (en) * 2014-07-15 2016-01-21 Abbvie Inc. Enzyme occupancy assay
CN108830042A (en) * 2018-06-13 2018-11-16 深圳大学 A kind of feature extraction based on multi-modal protein sequence and coding method and system
WO2018226666A1 (en) * 2017-06-05 2018-12-13 Maumita Mandal Methods, compositions, and devices involving pseudoknot formation
CN109360596A (en) * 2018-08-30 2019-02-19 浙江工业大学 A kind of protein conformation space optimization method based on differential evolution local dip
CN109360601A (en) * 2018-08-29 2019-02-19 浙江工业大学 A kind of multi-modal Advances in protein structure prediction based on exclusion strategy

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016010961A1 (en) * 2014-07-15 2016-01-21 Abbvie Inc. Enzyme occupancy assay
WO2018226666A1 (en) * 2017-06-05 2018-12-13 Maumita Mandal Methods, compositions, and devices involving pseudoknot formation
CN108830042A (en) * 2018-06-13 2018-11-16 深圳大学 A kind of feature extraction based on multi-modal protein sequence and coding method and system
CN109360601A (en) * 2018-08-29 2019-02-19 浙江工业大学 A kind of multi-modal Advances in protein structure prediction based on exclusion strategy
CN109360596A (en) * 2018-08-30 2019-02-19 浙江工业大学 A kind of protein conformation space optimization method based on differential evolution local dip

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
LI C: ""Cross-Modal Ranking with Soft Consistency and Noisy Labels for Robust RGB-T Tracking"", 《COMPUTER VISION ECCV 2018》 *
彭瑶: ""基于超图的多模态特征选择算法及其应用"", 《计算机科学与探索》 *

Also Published As

Publication number Publication date
CN110706739B (en) 2021-10-29

Similar Documents

Publication Publication Date Title
Karimi et al. De novo protein design for novel folds using guided conditional wasserstein generative adversarial networks
Zrimec et al. Controlling gene expression with deep generative design of regulatory DNA
CN107491664B (en) Protein structure de novo prediction method based on information entropy
CN109785901B (en) Protein function prediction method and device
CN105760710A (en) Method for predicting protein structure on basis of two-stage differential evolution algorithm
CN109360596B (en) Protein conformation space optimization method based on differential evolution local disturbance
Huang et al. Simultaneously identify three different attributes of proteins by fusing their three different modes of Chou's pseudo amino acid compositions
JP2008146538A (en) Microrna detector, detection method and program
CN106503486A (en) A kind of differential evolution protein structure ab initio prediction method based on multistage subgroup coevolution strategy
Caetano-Anollés et al. Tracing protein and proteome history with chronologies and networks: folding recapitulates evolution
CN110706739B (en) Protein conformation space sampling method based on multi-mode internal and external intersection
Peshkin et al. Segmentation of yeast DNA using hidden Markov models
CN109033753B (en) Group protein structure prediction method based on secondary structure fragment assembly
Bi A Monte Carlo EM algorithm for de novo motif discovery in biomolecular sequences
Jiahao et al. Artificial intelligence-assisted protein engineering
Thompson Statistics for bioinformatics: methods for multiple sequence alignment
Chen et al. Non-coding RNA identification with pseudo RNA sequences and feature representation learning
Zhang et al. Predicting TF-DNA binding motifs from ChIP-seq datasets using the bag-based classifier combined with a multi-fold learning scheme
Aparna et al. Performance Analysis of Signal Processing Techniques in Bioinformatics for Medical Applications Using Machine Learning Concepts
Manning et al. The importance of physicochemical characteristics and nonlinear classifiers in determining HIV-1 protease specificity
Zhang et al. Protein Language Model Supervised Precise and Efficient Protein Backbone Design Method
Re et al. Ensemble based data fusion for gene function prediction
CN109390035B (en) Protein conformation space optimization method based on local structure comparison
CN109243524B (en) Method for predicting structure of evolved protein through multistage individual screening
ABU-HASHEM et al. INVESTIGATION STUDY: AN INTENSIVE ANALYSIS FOR MSA LEADING METHODS.

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant