CN113160879B

CN113160879B - Method for predicting drug repositioning through side effect based on network learning

Info

Publication number: CN113160879B
Application number: CN202110448039.XA
Authority: CN
Inventors: 韦嘉
Original assignee: Shanghai Jixukang Biotechnology Co ltd
Current assignee: Shanghai Jixukang Biotechnology Co ltd
Priority date: 2021-04-25
Filing date: 2021-04-25
Publication date: 2023-11-28
Anticipated expiration: 2041-04-25
Also published as: CN113160879A

Abstract

The invention discloses a method for predicting drug repositioning through side effects based on network learning, which comprises the following steps: 1) Constructing unique side effect fingerprints for the side effects of each drug in a 0/1 vector mode; 2) Similarity between drugs was calculated using Jaccard Index; 3) Randomly selecting 1 ten thousand times of side effects with the same number as that of the side effects of the medicine A in the side effect data total set, and calculating whether the similarity of the medicine A and the medicine B is better than the random medicine similarity with the same number of side effects selected randomly, wherein only the side effect similarity results with the similarity of the medicine A and the medicine B being obviously better than that of the side effect selected randomly are reserved; 4) Predicting potential indications of neighboring drugs in the network based on their MeSH information 5) using the EASE score to calculate the degree of enrichment of the adverse effect network, rank-ordering the indication locations of drug a. The scheme selects 61 medicines for prediction, can accurately predict the indication of 41 medicines, and shows good prediction effect.

Description

Method for predicting drug repositioning through side effect based on network learning

Technical Field

The invention relates to the technical field of drug research and development, in particular to a method for predicting drug repositioning through side effects based on network learning.

Background

The development efficiency of drugs has been low, compared with the investment of a lot of money and time for the development of novel drugs, the development time can be remarkably shortened for the research of the potential of the known drugs, the toxicity risk of the drugs is reduced, but the past successful examples of drug repositioning often depend on contingencies, recently, researches have been proposed to predict new indications of drugs according to the gene expression patterns of the drugs or by utilizing the structure functions of compounds/proteins so as to reduce the development cost, and these methods are often focused on researching the molecular action mechanism from the aspect of genotype, but the preclinical results based on MOA are not greatly related to the actual curative effect of the drug development process, and it is estimated that the drugs effective in cell analysis are only thirty percent effective in animals, only five percent effective in human bodies, and the difference between MOA and physiological reactions may limit the practicability of the drug repositioning method.

When the drug is combined with the wrong target, normal metabolism and signal paths can be disturbed to generate side effects, namely the side effects of the drug on the human body can be regarded as valuable parameters of the drug on the human body, new thinking and wide prospects are provided for drug repositioning research, only few researches relate to the aspect, a prediction model 'SIDER' covering 996 drugs and 4192 drug effects is researched and manufactured in 2010, a research group establishes a prediction model 'DRoSEF' based on PharmGKB database and covering 145 diseases in 2011, the DRoSEF model is expected to realize more promising performance by expanding the disease coverage, the drug with similar side effects also has certain similar therapeutic properties, new indications of the drug are predicted by researching complete side effect catalogues, and an overall network is constructed by utilizing the side effect similarities of different drugs, so that the application range of the drug can be predicted by adjacent drug function distribution in the network, and the problem can be solved by the method of predicting the drug repositioning based on network learning.

Disclosure of Invention

(one) solving the technical problems

Aiming at the technical problems in the prior art, the invention provides a method for predicting drug repositioning through side effects based on network learning, which predicts new indications of drugs through a complete side effect catalog, builds an integral network by utilizing the side effect similarity of different drugs, solves the problems that the development efficiency of the drugs is always low, a great deal of money and time are required to be input for developing the novel drugs, and the past successful drug repositioning examples often depend on accidents.

(II) technical scheme

The technical scheme for solving the technical problems is as follows: a method for predicting drug repositioning through side effects based on network learning, comprising the steps of:

1) Constructing unique side effect fingerprint for each side effect of each medicine in a 0/1 vector mode, namely 2183 medicines all have a 6495-dimensional vector for representing the side effect of each medicine;

2) Similarity between every two drugs was calculated using Jaccard Index, with the following formula:

wherein a, B is the number of side effects of the medicines A and B, c is the number of side effects shared by the medicines A and B;

3) Randomly selecting 1 ten thousand times of side effects with the same number as that of the side effects of the medicine A in the side effect data total set, and calculating whether the similarity of the medicine A and the medicine B is better than the random medicine similarity with the same number of side effects selected randomly, wherein only the side effect similarity results with the similarity of the medicine A and the medicine B being obviously better than that of the side effect selected randomly are reserved;

4) Z score was used to measure this significant difference, as follows:

wherein the Zcore threshold is set to be equal to or greater than 2.576;

5) Because the MeSH contains disease information distributed according to the hierarchy, the potential indication of the MeSH can be predicted in the network according to the MeSH information of adjacent medicines;

6) Using the EASE score to calculate the degree of enrichment of the side effect network, ranking the indication locations of drug A;

7) And finally, the model is a large network formed by a similarity sub-network between any two medicines, the normalized discount accumulated income is originally used for evaluating a network search engine algorithm in the information retrieval field, and the usefulness degree of the document in a result list is calculated and is used for calculating the rank ranking accuracy of the medicine prediction result.

(III) beneficial effects

Compared with the prior art, the invention provides a method for predicting drug repositioning through side effects based on network learning, which has the following beneficial effects:

according to the network learning-based drug repositioning method through side effects, the similarity of drug side effects is ranked by using a normalized fold cumulative benefit (NDCG) method for ranking the priority of search results in a network search engine algorithm originally used in the information retrieval field for the first time, in the existing method for predicting the drug side effects, the drug side effect data used by us are the most comprehensive and authoritative information of the 6495 kinds of side effects of 2183 drugs in total, the robustness and universality of a model constructed by the method are guaranteed through the comprehensive coverage, in the test results of randomly selecting 98 kinds of drug data in a SIDER model, 84.69% of drugs in the model are selected through threshold screening, 61 kinds of drugs in the model are predicted, and the indication of 41 kinds of drugs in the model can be accurately predicted, and 50.94% of results in the 106 medicine indication prediction results contained in the 41 kinds of drugs are supported by approval or other clinical experiments and scientific literature, so that good prediction effects are displayed, and new drug indications are effectively predicted through the drug side effects.

Drawings

Fig. 1 is a schematic flow chart of a method for predicting drug repositioning through side effects based on network learning.

Detailed Description

The technical solutions of the embodiments of the present invention will be clearly and completely described below in conjunction with the embodiments of the present invention, and it is apparent that the described embodiments are only some embodiments of the present invention, not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.

As shown in fig. 1, a method for predicting drug repositioning through side effects based on network learning includes the following steps:

4) Z score was used to measure this significant difference, as follows:

wherein the Zcore threshold is set to be equal to or greater than 2.576;

The drug side effect dictionary uses a fifteenth edition of merle drug side effect catalogue, and uses 2007-2012 drug side effect report and FDA approved indication data for side effects of drugs after 2006.

Meanwhile, the MedDRA vocabulary is used as standard words and grades, the catalog data from different resources are integrated, and semantic redundancy is avoided.

The FDA approved drug indication information is converted to a MeSH format header, resulting in 6495 clinical side effects of 2183 drug and 994 MeSH fourth-level information.

Experimental cases: in the test results of 98 kinds of medicine data in a SIDER model, 84.69% of medicines are selected in the model through threshold screening, 61 kinds of medicines are selected for prediction, and the indication of 41 kinds of medicines can be accurately predicted, and 50.94% of the 106 top-ranked five-medicine indication prediction results contained in the 41 kinds of medicines are supported by FDA approval or other clinical experiments and scientific literature, so that good prediction effects are shown, and new indication of medicines is effectively predicted through side effects of medicines, as shown in the following table:

Drug-indication pairs	Number	Percentage
			FDA-approved	37	34.91％
Clinical	10	9.43％
			Preclinical	7	6.6％
Unknown	52	49.06％

the beneficial effects of the invention are as follows: according to the network learning-based drug repositioning method through side effects, the similarity of drug side effects is ranked by using a normalized fold cumulative benefit (NDCG) method for ranking the priority of search results in a network search engine algorithm originally used in the information retrieval field for the first time, in the existing method for predicting the drug side effects, the drug side effect data used by us are the most comprehensive and authoritative information of the 6495 kinds of side effects of 2183 drugs in total, the robustness and universality of a model constructed by the method are guaranteed through the comprehensive coverage, in the test results of randomly selecting 98 kinds of drug data in a SIDER model, 84.69% of drugs in the model are selected through threshold screening, 61 kinds of drugs in the model are predicted, and the indication of 41 kinds of drugs in the model can be accurately predicted, and 50.94% of results in the 106 medicine indication prediction results contained in the 41 kinds of drugs are supported by approval or other clinical experiments and scientific literature, so that good prediction effects are displayed, and new drug indications are effectively predicted through the drug side effects.

Although embodiments of the present invention have been shown and described, it will be understood by those skilled in the art that various changes, modifications, substitutions and alterations can be made therein without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims

1. The method for predicting drug repositioning through side effects based on network learning is characterized by comprising the following steps:

4) Z score was used to measure this significant difference, as follows:

wherein the Zcore threshold is set to be equal to or greater than 2.576;