CN111429972A - Protein small molecule docking scoring scheme based on deep learning - Google Patents
Protein small molecule docking scoring scheme based on deep learning Download PDFInfo
- Publication number
- CN111429972A CN111429972A CN201910839110.XA CN201910839110A CN111429972A CN 111429972 A CN111429972 A CN 111429972A CN 201910839110 A CN201910839110 A CN 201910839110A CN 111429972 A CN111429972 A CN 111429972A
- Authority
- CN
- China
- Prior art keywords
- compound
- deep learning
- channel
- small molecule
- atom
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 44
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 44
- 238000003032 molecular docking Methods 0.000 title claims abstract description 28
- 150000003384 small molecules Chemical class 0.000 title claims abstract description 25
- 238000013135 deep learning Methods 0.000 title claims abstract description 20
- 150000001875 compounds Chemical class 0.000 claims abstract description 43
- 239000003446 ligand Substances 0.000 claims abstract description 19
- 238000012549 training Methods 0.000 claims abstract description 15
- 238000013136 deep learning model Methods 0.000 claims abstract description 12
- 238000012360 testing method Methods 0.000 claims abstract description 10
- 238000012545 processing Methods 0.000 claims abstract description 7
- 238000012795 verification Methods 0.000 claims abstract description 6
- 229910052739 hydrogen Inorganic materials 0.000 claims description 31
- 239000001257 hydrogen Substances 0.000 claims description 24
- 238000005381 potential energy Methods 0.000 claims description 16
- 238000005411 Van der Waals force Methods 0.000 claims description 15
- 238000010200 validation analysis Methods 0.000 claims description 5
- 238000003672 processing method Methods 0.000 claims description 2
- 239000002131 composite material Substances 0.000 abstract description 9
- 238000000034 method Methods 0.000 abstract description 5
- 125000004429 atom Chemical group 0.000 description 27
- 238000004364 calculation method Methods 0.000 description 8
- 230000027455 binding Effects 0.000 description 5
- 125000004432 carbon atom Chemical group C* 0.000 description 4
- 238000013459 approach Methods 0.000 description 3
- 238000004422 calculation algorithm Methods 0.000 description 3
- 238000009509 drug development Methods 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 238000012827 research and development Methods 0.000 description 3
- 230000009286 beneficial effect Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- KZMAWJRXKGLWGS-UHFFFAOYSA-N 2-chloro-n-[4-(4-methoxyphenyl)-1,3-thiazol-2-yl]-n-(3-methoxypropyl)acetamide Chemical compound S1C(N(C(=O)CCl)CCCOC)=NC(C=2C=CC(OC)=CC=2)=C1 KZMAWJRXKGLWGS-UHFFFAOYSA-N 0.000 description 1
- 230000001427 coherent effect Effects 0.000 description 1
- 238000005094 computer simulation Methods 0.000 description 1
- 238000013527 convolutional neural network Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 150000002611 lead compounds Chemical class 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 239000002547 new drug Substances 0.000 description 1
- 125000004430 oxygen atom Chemical group O* 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 238000003041 virtual screening Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
- G16B50/30—Data warehousing; Computing architectures
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B15/00—ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
- G16B15/20—Protein or domain folding
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Theoretical Computer Science (AREA)
- Spectroscopy & Molecular Physics (AREA)
- General Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biotechnology (AREA)
- Evolutionary Biology (AREA)
- Biophysics (AREA)
- Medical Informatics (AREA)
- Crystallography & Structural Chemistry (AREA)
- Chemical & Material Sciences (AREA)
- Bioethics (AREA)
- Databases & Information Systems (AREA)
- Investigating Or Analysing Biological Materials (AREA)
Abstract
The invention provides a protein small molecule docking and scoring scheme based on deep learning, which comprises the following steps: step 1, constructing a compound database, and selecting a plurality of protein ligand pairs as a training set, a plurality of protein ligand pairs as a verification set, and a plurality of protein ligand pairs as a test set; step 2, establishing a surrounding grid for the compound to obtain a compound grid, and processing to obtain a multi-channel compound descriptor; step 3, training the multi-channel compound descriptor by using a training set to obtain a deep learning model; step 4, verifying the deep learning model by using a verification set; and 5, obtaining a compound descriptor by using the test set, and inputting the compound descriptor into the deep learning model for prediction to obtain a scoring value of the compound. Therefore, the method has the advantages that the multi-channel grids are adopted, the composite grids are scored through deep learning, and then the docking scores are accurately and quickly predicted.
Description
Technical Field
The invention belongs to the field of protein micromolecules, and particularly relates to a protein micromolecule docking and scoring scheme based on deep learning.
Background
At present, the treatment of diseases is currently an important content of research in various countries, constituting a great risk for the health of humans. The research and development of drugs are the most important steps for disease treatment, and are the key and difficult points of academic, enterprise and even national institutes. The research and development of new drugs face the difficulties of long development period, high cost, uncertainty of side effects and the like, and the development difficulty is huge. With the widespread use of computers, it has become possible to simulate various steps in drug development using computers. At present, the use of computer simulation to assist drug development is indispensable in research and development, and can save time and money greatly.
As drug development, screening of small molecules, i.e. discovery of lead compounds, is a key step. The traditional method is usually carried out in a laboratory, and the docking of the small molecules and target protein is continuously carried out, and finally candidate small molecules are screened out. This approach is time consuming and laborious and those compounds which are not native are not easily found. And the small molecule library is utilized, and the computer is utilized for virtual screening, so that the screening time can be obviously reduced.
The interaction of small molecules and proteins is the essence and theoretical basis of molecular docking, and the interaction force comprises van der waals force, hydrogen bond, hydrophobic interaction, charge force and the like. Currently, most molecular docking software uses an experience-based molecular docking force formula, such as AutoDock, Dock, and the like. The force formulas used by various docking software are different in form, but essentially adopt results based on a priori knowledge. For the calculation of intermolecular acting force, the different algorithms adopt the position parameters including various forms, and most of the algorithms are continuously tested and improved to finally enable the formula to be closer to the actual acting force result.
With the rise of machine learning, scoring of molecular docking by using a deep learning model also becomes a hot point of research. The algorithm is characterized in that molecular descriptor vectors are respectively extracted from proteins and small molecules, and molecular docking activity values are predicted for different protein small molecule vector pairs.
However, this kind of method does not need to know the concept of molecular docking in advance, and uses the respective properties of protein and molecule to predict ligand, because the spatial structure cannot be described, the accuracy is still to be improved. In addition, deep learning models which use the grid values of the compound to predict exist, but the deep learning models often cannot comprehensively consider the compound and still have high limitation. The molecular docking software utilizes an empirical formula to calculate the molecular docking fraction, and although the formula is continuously close to the final result through fitting, the calculation process is too complex and the calculation time is long, so that the docking performance is seriously reduced. Meanwhile, the position parameters in the formula need to be fitted continuously to approach the actual condition, and the final application of the formula usually has a great degree of uncertainty.
Disclosure of Invention
The invention provides a protein small molecule docking scoring scheme based on deep learning, which can adopt a multi-channel grid, fully consider various information of a compound, is more suitable for model input of the deep learning, and accurately and quickly predict a docking score by scoring the multi-channel compound grid.
The technical scheme of the invention is realized as follows: a protein small molecule docking scoring scheme based on deep learning is characterized by comprising the following steps:
step 1, constructing a compound database, and selecting a plurality of protein ligand pairs as a training set, a plurality of protein ligand pairs as a verification set, and a plurality of protein ligand pairs as a test set;
step 2, establishing a surrounding grid for the compound to obtain a compound grid, and processing to obtain a multi-channel compound descriptor;
step 3, training the multichannel compound descriptor in the step 2 by using the training set in the step 1 to obtain a deep learning model;
step 4, verifying the deep learning model in the step 3 by using the verification set in the step 1;
and 5, obtaining a compound descriptor by using the test set in the step 1 and adopting the processing mode in the step 2, inputting the compound descriptor into the deep learning model in the step 3 for prediction, and obtaining a scoring value of the compound.
As a preferred embodiment, the database of complexes in step 1 comprises the PDBb i nd database, wherein 10000 pairs of protein ligand pairs are selected as training set, 2000 pairs of protein ligand pairs are selected as validation set, and the remaining 4151 pairs of protein ligand pairs are selected as test set.
In a preferred embodiment, the processing method in step 2 comprises the steps of performing grid discretization on a composite grid at intervals of 1 angstrom, and simultaneously performing 3 angstrom filling on the whole periphery of the grid.
As a preferred embodiment, the multi-channel complex descriptors in step 2 are divided into four complex descriptors according to the complex atom type, van der waals forces, hydrogen bonding forces, and charge.
As a preferred embodiment, the multi-channel compound descriptor comprises 6 channels divided according to compound atom types, wherein the atom types are C, H, N, O, P and other types which are totally 6, and 6 channels are correspondingly arranged, wherein only the single atom is considered in each channel.
As a preferred embodiment, the multi-channel composite descriptor comprises 1 channel arranged by van der waals forces.
As a preferred embodiment, the multichannel composite descriptor comprises 1 channel arranged according to hydrogen bonding forces.
As a preferred embodiment, the multi-channel complex descriptor comprises 1 channel set by charge.
As a preferred embodiment, the van der waals force of any one grid in a channel is calculated as follows:
where i is all atoms of the compound, is a two-atom potential energy well, σ is the distance between the two-atom potential energy wells at 0, and r is the actual distance of the atom pair.
As a preferred embodiment, the hydrogen bonding forces for any one of the grids in the channel are calculated as follows:
wherein, i is only H, N, O three yards and is two hydrogen bond potential energy traps forming hydrogen bond atom pairs, sigma is the distance of potential energy acting force of 0, and r is the distance of donor and acceptor atoms forming the hydrogen bonds.
After the technical scheme is adopted, the invention has the beneficial effects that:
1. and the compound descriptor of the protein small molecules based on the voxel grid is adopted, and the information such as the space geometric structure of the compound is fully considered.
2. By adopting the multi-channel grid, various information of the compound can be fully compounded, and the method is more suitable for model input of deep learning.
3. And (4) adopting deep learning intelligent scoring to score the multi-channel compound grid, thereby accurately and quickly predicting the docking score.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.
FIG. 1 is an overall flow chart of molecular docking scoring;
FIG. 2 is a diagram of a lattice creation for an atom type channel.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
As shown in fig. 1 and 2, the PDBbind database contains detailed attribute information such as binding conformation, binding site, protein conformation, small molecule conformation, and corresponding binding activity of a protein and a ligand. The binding activity it provides can be used as the output of the final model. The latest version of the PDBbind database is version 2018, which contains 16151 proteins bound to ligands. The present invention uses 10000 pairs of protein-ligand pairs as training set, 2000 pairs as validation set, and the rest 4151 pairs as test.
The composite was first subjected to bounding box mesh creation and mesh discretization at 1 angstrom intervals, with 3 angstrom fill around the entire mesh. The resulting composite lattice is processed differently to obtain a multi-channel composite descriptor, including 6 channels divided by the type of composite atoms, 1 channel divided by van der waals forces, and 1 channel divided by hydrogen bonding forces.
For proteins and small molecules, the atom type is the most basic manifestation. The present invention divides all the types of the atoms into C, H, N, O, P types and other 6 types, and correspondingly sets 6 channels.
In each channel, only this single atom is considered. For example, C channel extracts coordinate information on all C in the compound, maps the coordinate information to the bounding box mesh of the compound according to the coordinate position of C, and fills all the meshes within the van der waals radius centered on the C atom to 1. If different C atoms cover the same grid, the grid values are superposed. This means that in this C-atom channel grid, the larger the grid value, the more C-atom van der waals radii it is within. The same treatment is carried out for other atom types, and finally a 6-channel composite grid is obtained.
In order to ensure smooth and coherent transition among different grids, the obtained 6-channel grid is subjected to Gaussian smoothing processing with sigma being 1, so that the obtained grid is smoother.
After the complex bounding box is obtained, the meaning of the median value in each grid needs to be clarified, traditional van der Waals calculations typically use the 6-12L ennard-Jones potential energy formula:
wherein i and j are atoms of molecule 1 and molecule 2 respectively and are potential energy traps of two atoms, sigma is the distance between the potential energy traps of two atoms and is 0, and r is the actual distance of an atom pair.
However, in the quantification of the actual grid values, two host molecules 1 and 2 calculated by van der waals force need to be defined, while for the grid, it is only one grid, which cannot generate acting force on the complex simply, and at the same time, the range span is large due to the calculation of van der waals force, which is not beneficial to the final training and learning. Thus, the present invention modifies the 6-12 formula of van der Waals forces. The van der waals forces for any one grid in a channel are calculated as follows:
where i is all atoms of the compound, is a two-atom potential energy well, σ is the distance between the two-atom potential energy wells at 0, and r is the actual distance of the atom pair. At EvdwThe atomic pair is not considered any more in the calculation of (2), and only the van der waals contribution of a single compound atom to the lattice is considered here. Mapping the sum of the contribution rates of all atoms of the final compound to the grid by adopting a sigmoid function to ensure that the final range of the grid value is positionedBetween 0 and 1. Calculating the van der waals force contribution of the compound atoms to all the grids to obtain the final van der waals force channel grid.
For the calculation of hydrogen bonds, it is necessary to consider only H, or N or O, which may have a strong lone pair, that forms the hydrogen bond, and therefore, only these three atoms are considered in calculating the hydrogen bond.A conventional hydrogen bond calculation formula is the 10-12L ennard-Jones potential energy formula:
wherein, i and j are respectively an acceptor atom and a donor atom of a hydrogen bond, two hydrogen bond potential energy traps forming a hydrogen bond atom pair, sigma is the distance of potential energy acting force of 0, and r is the distance of the donor atom and the acceptor atom forming the hydrogen bond.
For hydrogen bonding channels, we use a similar approach to van der waals force channels:
except that only H, N, O atoms are taken as i, and finally the grid values of all grids are calculated, so that the hydrogen bonding force channel grids are obtained.
The charge also has important contribution to the binding between molecules and is one of the important indexes of scoring. Here, we map the partial charges of all atoms into a grid according to their coordinate positions on the basis of a complex grid, so as to obtain a corresponding charge channel grid.
A3D convolutional neural network similar to VGG is adopted to construct a model, and the specific model is shown in the following table:
the final output of the model is a predicted value of activity, i.e., a score value.
10000 proteins of PDBbind (v2018) were randomly chosen for training, 2000 proteins for validation, the purpose of the validation set was to prevent overfitting and when the model stopped training.
During testing and prediction, the complex grid creation and 9-channel processing are also adopted for a new complex conformation, and finally the 9-channel grid is obtained and input into a model for prediction to obtain a scoring value of the complex.
The present invention is not limited to the above preferred embodiments, and any modifications, equivalent substitutions, improvements, etc. within the spirit and principle of the present invention should be included in the protection scope of the present invention.
Claims (10)
1. A protein small molecule docking scoring scheme based on deep learning is characterized by comprising the following steps:
step 1, constructing a compound database, and selecting a plurality of protein ligand pairs as a training set, a plurality of protein ligand pairs as a verification set, and a plurality of protein ligand pairs as a test set;
step 2, establishing a surrounding grid for the compound to obtain a compound grid, and processing to obtain a multi-channel compound descriptor;
step 3, training the multichannel compound descriptor in the step 2 by using the training set in the step 1 to obtain a deep learning model;
step 4, verifying the deep learning model in the step 3 by using the verification set in the step 1;
and 5, obtaining a compound descriptor by using the test set in the step 1 and adopting the processing mode in the step 2, inputting the compound descriptor into the deep learning model in the step 3 for prediction, and obtaining a scoring value of the compound.
2. The deep learning-based protein small molecule docking scoring protocol according to claim 1, wherein the database of complexes in step 1 comprises PDBbind database, wherein 10000 pairs of protein ligand pairs are selected as training set, 2000 pairs of protein ligand pairs are selected as validation set, and the rest 4151 pairs of protein ligand pairs are selected as test set.
3. The deep learning-based protein small molecule docking scoring protocol as claimed in claim 1, wherein the processing method in step 2 comprises performing grid discretization on a complex grid at intervals of 1 angstrom, and simultaneously performing 3 angstrom filling on the whole grid.
4. The deep learning-based protein small molecule docking scoring scheme according to claim 1, wherein the multichannel compound descriptors in step 2 are classified into four compound descriptors according to compound atom type, van der waals force, hydrogen bonding force and charge.
5. The deep learning-based protein small molecule docking scoring scheme as claimed in claim 4, wherein the multi-channel compound descriptor comprises 6 channels divided according to compound atom types, wherein the atom types are C, H, N, O, P and other 6 types, and the 6 channels are correspondingly arranged, wherein only the single atom is considered in each channel.
6. The deep learning-based protein small molecule docking scoring protocol according to claim 4, wherein said multichannel complex descriptor comprises 1 channel set by van der Waals forces.
7. The deep learning-based protein small molecule docking scoring protocol according to claim 4, wherein the multichannel complex descriptor comprises 1 channel set according to hydrogen bonding force.
8. The deep learning-based protein small molecule docking scoring scheme according to claim 4, wherein the multichannel complex descriptor comprises 1 channel set by charge.
9. The deep learning-based protein small molecule docking scoring protocol according to claim 6, wherein van der Waals forces of any one mesh in the channel are calculated as follows:
where i is all atoms of the compound, is a two-atom potential energy well, σ is the distance between the two-atom potential energy wells at 0, and r is the actual distance of the atom pair.
10. The deep learning-based protein small molecule docking scoring protocol according to claim 7, wherein the hydrogen bonding force of any one mesh in the channel is calculated as follows:
wherein, i is only H, N, O three yards and is two hydrogen bond potential energy traps forming hydrogen bond atom pairs, sigma is the distance of potential energy acting force of 0, and r is the distance of donor and acceptor atoms forming the hydrogen bonds.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910839110.XA CN111429972A (en) | 2019-09-05 | 2019-09-05 | Protein small molecule docking scoring scheme based on deep learning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910839110.XA CN111429972A (en) | 2019-09-05 | 2019-09-05 | Protein small molecule docking scoring scheme based on deep learning |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111429972A true CN111429972A (en) | 2020-07-17 |
Family
ID=71546793
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910839110.XA Pending CN111429972A (en) | 2019-09-05 | 2019-09-05 | Protein small molecule docking scoring scheme based on deep learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111429972A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112185458A (en) * | 2020-10-23 | 2021-01-05 | 深圳晶泰科技有限公司 | Method for predicting protein and ligand molecule binding free energy based on convolutional neural network |
CN115862768A (en) * | 2022-12-11 | 2023-03-28 | 南昌大学 | Optimization method for large-scale virtual drug screening |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102930181A (en) * | 2012-11-07 | 2013-02-13 | 四川大学 | Protein-ligand affinity predicting method based on molecule descriptors |
CN103500293A (en) * | 2013-09-05 | 2014-01-08 | 北京工业大学 | Screening method of non-ribosomal protein-RNA composite near-nature structure |
CN103886226A (en) * | 2012-12-21 | 2014-06-25 | 中国科学院大连化学物理研究所 | Novel protein coarse graining computing method on basis of reconstruction of electrostatic potential in three-dimensional spaces |
CN106575320A (en) * | 2014-05-05 | 2017-04-19 | 艾腾怀斯股份有限公司 | Binding affinity prediction system and method |
CN109036580A (en) * | 2018-07-06 | 2018-12-18 | 华东师范大学 | The affine force prediction method of protein ligand based on interaction energy item and machine learning |
-
2019
- 2019-09-05 CN CN201910839110.XA patent/CN111429972A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102930181A (en) * | 2012-11-07 | 2013-02-13 | 四川大学 | Protein-ligand affinity predicting method based on molecule descriptors |
CN103886226A (en) * | 2012-12-21 | 2014-06-25 | 中国科学院大连化学物理研究所 | Novel protein coarse graining computing method on basis of reconstruction of electrostatic potential in three-dimensional spaces |
CN103500293A (en) * | 2013-09-05 | 2014-01-08 | 北京工业大学 | Screening method of non-ribosomal protein-RNA composite near-nature structure |
CN106575320A (en) * | 2014-05-05 | 2017-04-19 | 艾腾怀斯股份有限公司 | Binding affinity prediction system and method |
CN109036580A (en) * | 2018-07-06 | 2018-12-18 | 华东师范大学 | The affine force prediction method of protein ligand based on interaction energy item and machine learning |
Non-Patent Citations (1)
Title |
---|
魏冬青,戴昊,贾贵华,张永红,徐沁: "超分子建筑 从分子到材料", 科学技术文献出版社, pages: 39 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112185458A (en) * | 2020-10-23 | 2021-01-05 | 深圳晶泰科技有限公司 | Method for predicting protein and ligand molecule binding free energy based on convolutional neural network |
CN112185458B (en) * | 2020-10-23 | 2024-04-26 | 深圳晶泰科技有限公司 | Method for predicting binding free energy of protein and ligand molecule based on convolutional neural network |
CN115862768A (en) * | 2022-12-11 | 2023-03-28 | 南昌大学 | Optimization method for large-scale virtual drug screening |
CN115862768B (en) * | 2022-12-11 | 2023-09-01 | 南昌大学 | Optimization method for large-scale drug virtual screening |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Li et al. | DeepDSC: a deep learning method to predict drug sensitivity of cancer cell lines | |
Hochuli et al. | Visualizing convolutional neural network protein-ligand scoring | |
CN109671469B (en) | Method for predicting binding relationship and binding affinity between polypeptide and HLA type I molecule based on circulating neural network | |
KR102100985B1 (en) | Paradigm drug response networks | |
CN111435608B (en) | Protein drug binding site prediction method based on deep learning | |
CN114333986A (en) | Method and device for model training, drug screening and affinity prediction | |
WO2022082739A1 (en) | Method for predicting protein and ligand molecule binding free energy on basis of convolutional neural network | |
KR101764615B1 (en) | Spatial knowledge extractor and extraction method | |
Pruyt et al. | From data-poor to data-rich: system dynamics in the era of big data | |
US20240055071A1 (en) | Artificial intelligence-based compound processing method and apparatus, device, storage medium, and computer program product | |
CN112185458B (en) | Method for predicting binding free energy of protein and ligand molecule based on convolutional neural network | |
CN111429972A (en) | Protein small molecule docking scoring scheme based on deep learning | |
Kamal et al. | Self-organizing mapping based swarm intelligence for secondary and tertiary proteins classification | |
CN116245013A (en) | Geological prediction model construction method, modeling method, equipment and storage medium | |
CN105787265A (en) | Atomic spinning top random error modeling method based on comprehensive integration weighting method | |
Hulot et al. | A unified framework for the integration of multiple hierarchical clusterings or networks from multi-source data | |
CN113053457A (en) | Drug target prediction method based on multi-pass graph convolution neural network | |
KR20210026541A (en) | A system of predicting compound activity for target protein using Homology and artificial neural network | |
US20220351808A1 (en) | Systems and methods for reinforcement learning molecular modeling | |
CN115631805A (en) | Medicine and protein affinity prediction method and system based on graph neural network | |
CN102779288B (en) | A kind of ontological analysis method based on field theory | |
Yatskou et al. | Simulation modelling and machine learning platform for processing fluorescence spectroscopy data | |
JP2022068277A (en) | Compound attribute prediction model training method and device, compound attribute prediction method and device, electronic device, computer readable storage medium and computer program | |
Ninic et al. | Meta models for real-time design assessment within an integrated information and numerical modelling framework | |
CN114360637A (en) | Protein-ligand affinity evaluation method based on graph attention network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |