CN110020671A - The building of drug relationship disaggregated model and classification method based on binary channels CNN-LSTM network - Google Patents

The building of drug relationship disaggregated model and classification method based on binary channels CNN-LSTM network Download PDF

Info

Publication number
CN110020671A
CN110020671A CN201910174269.4A CN201910174269A CN110020671A CN 110020671 A CN110020671 A CN 110020671A CN 201910174269 A CN201910174269 A CN 201910174269A CN 110020671 A CN110020671 A CN 110020671A
Authority
CN
China
Prior art keywords
text
drug
relationship
layer
disaggregated model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910174269.4A
Other languages
Chinese (zh)
Other versions
CN110020671B (en
Inventor
孙霞
马龙
张蕾
冯筠
吴楠楠
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Northwest University
Original Assignee
Northwest University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Northwest University filed Critical Northwest University
Priority to CN201910174269.4A priority Critical patent/CN110020671B/en
Publication of CN110020671A publication Critical patent/CN110020671A/en
Application granted granted Critical
Publication of CN110020671B publication Critical patent/CN110020671B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2413Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Medical Treatment And Welfare Office Work (AREA)

Abstract

The invention discloses a kind of drug relationship disaggregated model construction methods based on binary channels CNN-LSTM network, parent drugs text set is pre-processed, backout is carried out to pretreated drug text each in pretreated drug text set, obtains backward text set;Using pretreated drug text set as positive sequence text set;Training neural network, obtains drug relationship disaggregated model;Neural network includes parallel positive sequence Text character extraction layer and backward Text character extraction layer, Fusion Features layer and classification layer;Positive sequence Text character extraction layer and backward Text character extraction layer include the convolution block set gradually and shot and long term Memory Neural Networks block;The present invention is extracted the local feature of drug text using CNN, is extracted the global characteristics of drug text respectively using LSTM by constructing binary channels CNN-LSTM network, and the drug relationship feature extracted is more abundant, so that classification accuracy rate improves.

Description

The building of drug relationship disaggregated model and classification based on binary channels CNN-LSTM network Method
Technical field
The present invention relates to the building of drug relationship disaggregated model and classification methods, and in particular to one kind is based on binary channels CNN- The drug relationship disaggregated model of LSTM network constructs and classification method.
Background technique
Drug relationship refers to while or taking comprehensive effect caused by two or more drugs whithin a period of time.This Kind effect can be divided into synergistic effect, antagonistic effect and non-interaction.Mutual antagonistic effect between drug can cause patient Serious health risk.Drug relationship extracts the typical relationship that (DDIE) task is natural language processing field and extracts task, It is intended to detect and identify the semantic relation of drug pair, to drug safety accident is reduced, the development of biomedical technology is promoted to have Significance.
In recent years, the expert in terms of biomedical and text mining is made that very big effort in DDIE task, also creates Many methods are made, these methods can be mainly divided into three classes: the method for rule-based mode, the side based on statistical machine learning Method and method based on deep learning.Although the method for rule-based mode can targetedly be gone in identification target text very much Entity relationship, but this method is there are three very serious drawback: (1) needing to expend a large amount of man power and material and remove research mesh Text is marked, the information extraction quality for otherwise making rule cannot be guaranteed;(2) it is needed when laying down a regulation in the field Expert a large amount of priori knowledge is provided, and may to make standard because of the reason of subjective consciousness different by different experts The regular collection of cause;(3) because this method has very strong specific aim to domain knowledge, it is only applicable in the field Information extraction, generalization ability is generally poor, so the method for rule-based mode does not cause researcher widely to pay close attention to. Although these method performances based on statistical machine learning are good, it require that coming by fine and cumbersome Feature Engineering Extract suitable characteristic set.However, the extraction quality of these features depend on existing natural language processing tool, therefore by To the adverse effect of these tool noises and cost, extracts and be characterized in unordered, characteristic mass hardly results in effective guarantor Card causes the accuracy rate of classification not high.
Summary of the invention
The drug relationship disaggregated model building based on binary channels CNN-LSTM network that the purpose of the present invention is to provide a kind of And classification method leads to drug to solve feature randomness that drug relationship classification method in the prior art extracts The not high problem of the accuracy rate of relationship classification.
In order to realize above-mentioned task, the invention adopts the following technical scheme:
A kind of drug relationship disaggregated model construction method based on binary channels CNN-LSTM network, the method according to Lower step executes:
Step 1 obtains parent drugs text set;
Drug relationship in parent drugs text each in parent drugs text set is labeled, drug relationship mark is obtained Label collection;
Step 2 pre-processes the parent drugs text set, obtains pretreated drug text set;
The pretreatment includes Text normalization, text size is fixed and text vector mapping;
Step 3 carries out backward behaviour to pretreated drug text each in the pretreated drug text set Make, obtains backward text set;
Using the pretreated drug text set as positive sequence text set;
Step 4, using the positive sequence text set and backward text set as input, by the drug relationship tally set As output, training neural network obtains drug relationship disaggregated model;
The neural network includes the parallel positive sequence Text character extraction layer and backward text feature set gradually Extract layer, Fusion Features layer and classification layer;
The positive sequence Text character extraction layer and the backward Text character extraction layer include the volume set gradually Block and shot and long term Memory Neural Networks block.
Further, the convolution block is provided with 4.
Further, each convolution block includes the batch regularization sublayer set gradually, convolution sublayer, activation letter Number sublayer, convolution sublayer, activation primitive sublayer and pond beggar layer.
Further, the activation primitive in the activation primitive sublayer is ReLU function.
Further, the Fusion Features layer includes full articulamentum.
Further, the classification layer includes Softmax function layer.
A kind of drug relationship classification method based on binary channels CNN-LSTM network, to drug text to be sorted according to Lower step executes:
Step A, the drug text to be sorted is pre-processed using the method for step 2 in claim 1, is obtained Pretreated drug text;
It step B, will be described in the pretreated drug text input to any one of claim 1-6 claim Drug relationship disaggregated model in, obtain classification results.
The present invention has following technical characterstic compared with prior art:
1, a kind of building of drug relationship disaggregated model and classification based on binary channels CNN-LSTM network provided by the invention Method is extracted the local feature of drug text using CNN, is distinguished using LSTM by constructing binary channels CNN-LSTM network The global characteristics of drug text are extracted, the drug relationship feature extracted is more abundant, so that classification accuracy rate improves;
2, a kind of building of drug relationship disaggregated model and classification based on binary channels CNN-LSTM network provided by the invention Method completes feature extraction by the way that the positive sequence text of drug relationship text and backward text are respectively fed to CNN-LSTM network Journey, compared to single pass LSTM network, the drug relationship feature extracted is more comprehensive, so that classification accuracy rate improves;
3, a kind of building of drug relationship disaggregated model and classification based on binary channels CNN-LSTM network provided by the invention Method simplifies the process of drug characteristic vector extraction, improves drug relationship classification by extracting drug Text eigenvector Accuracy;
4, a kind of building of drug relationship disaggregated model and classification based on binary channels CNN-LSTM network provided by the invention Method is not necessarily to manual intervention and pertinent arts using the parent drugs relational text comprising multiple pharmaceutical entities as input, It does not need manually to extract complicated text feature, generalization ability is strong.
Detailed description of the invention
Fig. 1 is the classification of drug model structure provided in one embodiment of the present of invention;
Fig. 2 is the convolution block internal structure chart provided in one embodiment of the present of invention.
Specific embodiment
It makes explanations first to the term occurred in specific embodiment:
Shot and long term Memory Neural Networks (LSTM): LSTM network is by input gate, forgetting door, out gate and memory unit structure At LSTM passes through the door control mechanism of this complexity, can effectively learn the long-term Dependency Specification of input data, in text data It has a wide range of applications in processing with the serialization informations such as track data.
Convolutional neural networks (CNN): a kind of comprising convolutional calculation and with the feedforward neural network of depth structure.
Embodiment one
As shown in Figure 1, disclosing a kind of drug relationship classification based on binary channels CNN-LSTM network in the present embodiment Model building method, the method execute according to the following steps:
Step 1 obtains parent drugs text set;
Drug relationship in parent drugs text each in parent drugs text set is labeled, drug relationship mark is obtained Label collection;
The biomedical text acquired in the present embodiment can be adopted by modes such as Biomedical literature and papers Collection, the text of acquisition can be for document and paper partly or wholly, but needs to guarantee that text semantic expression is complete.
It is at least needed in the parent drugs text comprising two drug target title words, the two drug target title words are It is related to the drug word of drug relationship classification, remaining is other words, such as parent drugs text in the present embodiment are as follows: “Some quinolones,including ciprofloxacin,have been associated with transient elevations in serum creatinine in patients receiving cyclosporine Concomitantly ", wherein " quinolones ", " ciprofloxacin " and " cyclosporine " is medicine name word, Remaining word is other words.
In the data set used herein, between 0 to 150 words, most text size is distributed in the length of text Between 20 to 60 words, and backward rely on the phenomenon that (such as the grammatical phenomenons such as attributive clause) 46% is accounted in data set.
Drug relationship label includes 5 kinds, is that advice suggests respectively, effect effect, mechanism drug action mechanism, int Positive and irrelevant false.
Step 2 pre-processes the parent drugs text set, obtains pretreated drug text set;
The pretreatment includes Text normalization, text size is fixed and text vector mapping;
In the present embodiment, patent is utilized " based on multilayer convolutional Neural to the pretreatment mode of parent drugs text set The drug relationship classification method of network " in drug text set and processing mode.In each of parent drugs text set The different different length of parent drugs text formatting, and medicine name word is complicated and uncommon, when being classified using neural network, It is readily incorporated error, therefore is pre-processed firstly the need of to the parent drugs text of acquisition, including parent drugs are literary All words in this carry out morphology normalization, i.e., the morphology of all words is unified;By drug target title word using unified Naming method be named and be replaced original drug target title word in the form after naming, concrete operations with Following steps:
All words in the parent drugs text set are normalized step 2.1, are named with Unified Form, then benefit The drug target title word is replaced by the drug target title word after being named with Unified Form, after being normalized Drug text set;
Wherein, normalization includes that morphology normalizes and name normalization;
To make drug text in classification, it can more accurately classify, reduce error and introduce, therefore by parent drugs Each of text word carries out morphology normalization, converts them to unified format.To every in parent drugs text One word carries out morphology normalization, the parent drugs text after being normalized, until each in parent drugs text set Each of a parent drugs text word all have passed through morphology normalization, the parent drugs text set after being normalized.
In order to improve the generalization of neural network, by all drug target title prefixes in drug text first with unified shape Formula name, Unified Form are the form of " X serial number ", and wherein X can be any English word, such as " day ", " interaction " etc., serial number is with the sequence serial number of English form, such as " one, two, three " etc., and by the system Title after one name replaces the title of former drug target word, the drug target title word after replacing be " drugone ", " drugtwo " and " drugthree " etc. obtain pretreated drug text set there is no influencing between drug text.
Step 2.2, by each drug text uniform length in the drug text set after normalization, obtain length it is fixed after Drug text set;
The length of each of pretreated drug text set drug text is fixed as n, for length Text less than n is filled, and the mode of filling can be for by the way of full odd jobs random number, then drug text can indicate Are as follows:
S=w1w2w3...wn
Step 2.3, it is fixed to each length in drug text set after fixed of the length after drug text into Row vector mapping, obtains pretreated drug text set;
Since neural network can not be handled the text of natural language form directly as input, by drug text Originally it is mapped as the text vector of digital form, comprising the following steps:
A, term vector table is constructed, the term vector table is made of word and corresponding digital term vector;
Term vector table is made of word and corresponding digital term vector, each word is corresponding unique in term vector table The term vector of digital form as far as possible inserts more words in the table, and term vector table is enable to cover more word.
For more significant term vectors can be converted out, in the present embodiment, studied using Stanford university NLP small Group provides GloVe (Global Vectors for Word Representation) model term vector table, including 2196016 term vectors, the dimension of each term vector are 300.If inputting the word in urtext not in this term vector table, Then the often one-dimensional of the term vector of the word is initialized to 0.
B, the fixed drug text of each length in drug text set is mapped by way of tabling look-up, is obtained pre- Treated drug text set.
For each of n dimension drug text word all by way of looking into the term vector table, it is mapped to one Each of n dimension drug text word is all mapped to the term vector of d dimension, therefore by the vector of a d dimension in this manner One length is that the parent drugs text S of n is just mapped as the text vector of one (n × d) dimension:
It and include that the parent drugs text S that m length is n is just mapped as a m × (n × d) text for one Vector set includes m (n × d) text vectors tieed up in drug text set.
Step 3 carries out backward behaviour to pretreated drug text each in the pretreated drug text set Make, obtains backward text set;
Using the pretreated drug text set as positive sequence text set;
There are also the language phenomenons such as the text, such as attribute postposition of backward in natural language, in the present solution, to make extraction Feature is more comprehensive, using positive sequence drug text and backward drug text respectively to neural metwork training, obtains disaggregated model.
When carrying out backout to drug text, by the reversed order in text vector, such as the vector of one 1 dimension [0.21 0.35 0.62 0.85 0.96], after backward are as follows: [0.96 0.85 0.62 0.35 0.21].
Step 4, using the positive sequence text set and backward text set as input, by the drug relationship tally set As output, training neural network obtains drug relationship disaggregated model;
The neural network includes parallel positive sequence Text character extraction layer and backward Text character extraction layer, feature Fused layer and classification layer;
The positive sequence Text character extraction layer is identical as the backward Text character extraction layer structure, includes successively The convolution block and shot and long term Memory Neural Networks block of setting.
In the present embodiment, in order to improve the accuracy rate that drug relationship is classified, the structure of neural network has been carried out again Design then these local features are sent to as shown in Figure 1, extracting the local feature of text using convolution block first LSTM model extracts the global characteristics and temporal aspect of text to supplement, but it is the text that can handle positive sequence that this, which is also, if The text modified backward of that attributive clause etc is encountered, processing capacity is still very weak, so using two identical features Extract layer handles the positive sequence and backward of input text respectively, and then the positive sequence and backward feature extracted is combined, and obtains most Whole text feature;Text feature is exported into classification layer again later and is classified.
In the present embodiment, the number of convolution block is not accurate enough lower than 4 local features extracted, the number of convolution block Higher than 4, it may appear that the phenomenon that over-fitting, cause feature extraction to fail, therefore as a preferred embodiment, convolution block It is provided with 4.
Optionally, each convolution block includes batch regularization sublayer, the convolution sublayer, activation primitive set gradually Sublayer, convolution sublayer, activation primitive sublayer and pond beggar layer.
In the present embodiment, as shown in Fig. 2, positive sequence drug text and backward drug text can be sent to convolution block it After be introduced into batch regularization layer, the effect of batch regularization layer is that input data is made more to meet normal distribution, meets normal state The speed of the sample training of distribution can greatly improve, and accuracy rate can also improve.
In the present embodiment, the data after regularization are sent into convolutional layer and carry out convolution operation, the parameter setting of convolutional layer Are as follows: the number of convolution unit filter 128.
Enter activation primitive later, data meaningless after convolution are deleted, as a preferred embodiment, swashing Function living is Relu function.
Obtained data are sent into pond layer, pond layer uses maximum by the operation for repeating the above convolution sum activation Pondization operation, such as, a pond window size is 2*2, by the pond window of this 2*2 after convolution sum activation It is slided in data, number maximum in window is elected as representative during sliding, it is a with regard to how many to slide how many times It represents, these is then used to represent the representative as initial data.Such do is advantageous in that: not losing guaranteeing that text spy demonstrate,proves Under the premise of mistake, data are reduced, accelerate the training of network.
Positive sequence drug text and backward drug text are passing through 4 convolution blocks identical in this way and then are entering length Global characteristics present in drug relationship text and temporal aspect are obtained in phase Memory Neural Networks block.
In the present embodiment, shot and long term Memory Neural Networks block interior joint number is set as 64.
Optionally, the Fusion Features layer includes full articulamentum.
In the present embodiment, positive sequence drug text and backward drug text are respectively fed to above-mentioned CNN-LSTM net After network, positive sequence text feature and backward text feature are respectively obtained, the two features are sent to full articulamentum simultaneously.Such as Say that positive text feature and backward text feature there are 100, then constructing a first layer is 200 nodes, the second layer 100 The full articulamentum of a node, positive text feature is sent into 100 nodes before first layer, after backward text feature is sent into first layer Then this 200 features are fused together by 100 nodes in this way.
Optionally, the classification layer includes Softmax function layer.
In the present embodiment, full articulamentum and Softmax function layer constitute last portion of drug relationship sorting algorithm Point, for exporting the drug relationship label of digital vectors form according to the quantity of classification, so that it is determined that last drug relationship point Each output node of the final result of class, full articulamentum and Softmax function layer represents a drug categories, and classifier is most The drug label exported eventually is given pharmaceutical entities to the probability for belonging to each drug categories, and the probability value is in [0,1].Example Such as, it is now assumed that drug relationship there are 2 kinds, relationship and not related, the then output node setting of Softmax function layer have been respectively represented It is 2, i.e., there are two types of drug relationships, positive and negative are respectively represented, if the number of Softmax function layer output The drug relationship label of vector form is p [positive, negative]=[0.1,0.9], i.e. Softmax function layer exports As a result it is 0.1 there are the probability value of positive in, is 0.9 there are the probability value of negative, is then judged with this.? In the present embodiment, drug relationship includes 5 kinds, is that advice suggests respectively, effect effect, mechanism drug action mechanism, int Positive and irrelevant false.
The stratification convolution loop neural network is trained with output using above-mentioned input, drug is obtained and closes Be taxonomical hierarchy convolution loop neural network, the drug relationship text and each drug relationship label be number to Amount form;Stratification convolution loop neural network described in repetition training n times, with the best stratification volume of this n times training performance Product Recognition with Recurrent Neural Network as the drug relationship taxonomical hierarchy convolution loop neural network, wherein N >=1.
The training set of one taxonomical hierarchy convolution loop neural network comprising two parts, first is that being inputted after pretreatment The drug text set of taxonomical hierarchy convolution loop neural network, second is that each drug text in pretreated drug text set In this corresponding parent drugs text, the drug relationship label between drug target title word is obtained to every in drug text set The corresponding drug relationship tally set of one drug text is exported as the target of multilayer convolutional network.Likewise, taxonomical hierarchy The test set of convolution loop neural network also includes two parts, the difference is that during the test, only by pretreated medicine Object text set is input in trained taxonomical hierarchy convolution loop neural network, taxonomical hierarchy convolution loop nerve Network can obtain the classification of drug result set of model prediction according to the drug text data of input and trained model parameter, so The true tag of classification of drug result set and drug relationship is compared afterwards, with the two comparison result classification of assessment stratification volume The performance of product Recognition with Recurrent Neural Network.
In this example, using 2013 drug relationship data set of DDIExtraction as drug relationship text to classification Stratification convolution loop neural network is trained and tests, by the 80% of entire data set as training set, 20% as survey Examination collection, i.e., training set is made of 27792 drug relationship text samples, and test set is by 6409 drug relationship text sample groups At.Then 10 training are carried out to stratification convolution loop neural network using ready-portioned training set, chosen in 10 training Final mask of the best model of modelling effect as drug relationship stratification convolution loop neural network.
Embodiment two
A kind of drug relationship classification method based on binary channels CNN-LSTM network, to drug text to be sorted according to Lower step executes:
Step A, the drug text to be sorted is pre-processed using the method for step 2 in embodiment 1, is obtained pre- Treated drug text;
Step B, by drug relationship disaggregated model described in the pretreated drug text input embodiment 1 In, obtain classification results.
After training final drug relationship stratification convolution loop neural network, model can predict any drug Drug relationship involved in relational text, by the drug text input drug relationship stratification convolution loop that drug relationship is unknown Neural network, the drug that maximum probability is chosen from the digital vectors that drug relationship stratification convolution loop neural network exports close It is the drug relationship classification results of the drug text as unknown drug relationship.
In the present embodiment, drug text to be sorted is " Some quinolones have been associated with transient elevations in serum creatinine in patients receiving Cyclosporine concomitantly ", first aim medicine name word are quinolones, second target medication name Word is referred to as cyclosporine, carries out drug relationship point by trained drug relationship stratification convolution loop neural network Class, the drug relationship digital vectors label of output are as follows:
P [mechanism, advice, effect, int, false]=[0.02,0.09,0.1,0.67,0.12]
I.e. between two drug targets quinolones, cyclosporine is 2% there are the probability of mechanism, I.e. between two drug targets quinolones, cyclosporine is 9% there are the probability of advice, i.e. two target medicines Between object quinolones, cyclosporine is 10% there are the probability of effect, i.e. two drug targets Between quinolones, cyclosporine is 67% there are the probability of int, i.e. two drug target quinolones, Between cyclosporine is 12% there are the probability of false, wherein being up to 67% there are the probability of int relationship, therefore It will be between two drug targets quinolones, cyclosporine using drug relationship stratification convolution loop neural network Relationship is classified as int positive relationship.
The drug relationship classification method and drug in the prior art based on binary channels CNN-LSTM network that this programme provides Sorting algorithm is compared, performance comparison sheet 1, when evaluating a drug relationship classification method performance quality, accuracy rate, recall rate It is bigger with F value, illustrate that drug relationship disaggregated model performance is better, from table 1 it follows that drug relationship layer proposed by the present invention Secondaryization convolution loop neural network will significantly be better than other methods in three accuracy rate, recall rate and F value indexs, this The drug relationship classification method proposed by the present invention based on the two-way convolution loop neural network of stratification is demonstrated in drug relationship Possess optimal classification performance in classification problem.
The drug relationship classification method provided by the invention of table 1 and the deemed-to-satisfy4 energy of other drugs relationship classification compare

Claims (7)

1. a kind of drug relationship disaggregated model construction method based on binary channels CNN-LSTM network, which is characterized in that described Method executes according to the following steps:
Step 1 obtains parent drugs text set;
Drug relationship in parent drugs text each in parent drugs text set is labeled, drug relationship label is obtained Collection;
Step 2 pre-processes the parent drugs text set, obtains pretreated drug text set;
The pretreatment includes Text normalization, text size is fixed and text vector mapping;
Step 3 carries out backout to pretreated drug text each in the pretreated drug text set, obtains Obtain backward text set;
Using the pretreated drug text set as positive sequence text set;
Step 4, using the positive sequence text set and backward text set as input, using the drug relationship tally set as Output, training neural network, obtains drug relationship disaggregated model;
The neural network includes the parallel positive sequence Text character extraction layer and backward Text character extraction set gradually Layer, Fusion Features layer and classification layer;
The positive sequence Text character extraction layer and the backward Text character extraction layer include the convolution block set gradually And shot and long term Memory Neural Networks block.
2. special as described in claim 1 based on the drug relationship disaggregated model construction method of binary channels CNN-LSTM network Sign is that the convolution block is provided with 4.
3. special as claimed in claim 2 based on the drug relationship disaggregated model construction method of binary channels CNN-LSTM network Sign is that each convolution block includes the batch regularization sublayer set gradually, convolution sublayer, activation primitive sublayer, volume Product sublayer, activation primitive sublayer and pond beggar layer.
4. special as claimed in claim 3 based on the drug relationship disaggregated model construction method of binary channels CNN-LSTM network Sign is that the activation primitive in the activation primitive sublayer is ReLU function.
5. special as described in claim 1 based on the drug relationship disaggregated model construction method of binary channels CNN-LSTM network Sign is that the Fusion Features layer includes full articulamentum.
6. special as described in claim 1 based on the drug relationship disaggregated model construction method of binary channels CNN-LSTM network Sign is that the classification layer includes Softmax function layer.
7. a kind of drug relationship classification method based on binary channels CNN-LSTM network, which is characterized in that drug to be sorted Text executes according to the following steps:
Step A, the drug text to be sorted is pre-processed using the method for step 2 in claim 1, obtains pre- place Drug text after reason;
Step B, by medicine described in the pretreated drug text input to any one of claim 1-6 claim In object relationship disaggregated model, classification results are obtained.
CN201910174269.4A 2019-03-08 2019-03-08 Drug relationship classification model construction and classification method based on dual-channel CNN-LSTM network Active CN110020671B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910174269.4A CN110020671B (en) 2019-03-08 2019-03-08 Drug relationship classification model construction and classification method based on dual-channel CNN-LSTM network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910174269.4A CN110020671B (en) 2019-03-08 2019-03-08 Drug relationship classification model construction and classification method based on dual-channel CNN-LSTM network

Publications (2)

Publication Number Publication Date
CN110020671A true CN110020671A (en) 2019-07-16
CN110020671B CN110020671B (en) 2023-04-18

Family

ID=67189365

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910174269.4A Active CN110020671B (en) 2019-03-08 2019-03-08 Drug relationship classification model construction and classification method based on dual-channel CNN-LSTM network

Country Status (1)

Country Link
CN (1) CN110020671B (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111243682A (en) * 2020-01-10 2020-06-05 京东方科技集团股份有限公司 Method, device, medium and apparatus for predicting toxicity of drug
CN111444961A (en) * 2020-03-26 2020-07-24 国家计算机网络与信息安全管理中心黑龙江分中心 Method for judging internet website affiliation through clustering algorithm
CN111898364A (en) * 2020-07-30 2020-11-06 平安科技(深圳)有限公司 Neural network relation extraction method, computer device and readable storage medium
CN112860816A (en) * 2021-03-01 2021-05-28 三维通信股份有限公司 Construction method and detection method of interaction relation detection model of drug entity pair
WO2021159744A1 (en) * 2020-09-27 2021-08-19 平安科技(深圳)有限公司 Medicine classification method and apparatus, terminal device, and storage medium
CN113806531A (en) * 2021-08-26 2021-12-17 西北大学 Drug relationship classification model construction method, drug relationship classification method and system
CN115376668A (en) * 2022-08-30 2022-11-22 温州城市智慧健康有限公司 Big data business analysis method and system applied to intelligent medical treatment
WO2023173823A1 (en) * 2022-03-17 2023-09-21 中国科学院深圳理工大学(筹) Method for predicting interaction relationship of drug pair, and device and medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170308790A1 (en) * 2016-04-21 2017-10-26 International Business Machines Corporation Text classification by ranking with convolutional neural networks
CN108363774A (en) * 2018-02-09 2018-08-03 西北大学 A kind of drug relationship sorting technique based on multilayer convolutional neural networks
CN108763216A (en) * 2018-06-01 2018-11-06 河南理工大学 A kind of text emotion analysis method based on Chinese data collection
CN109299264A (en) * 2018-10-12 2019-02-01 深圳市牛鼎丰科技有限公司 File classification method, device, computer equipment and storage medium

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170308790A1 (en) * 2016-04-21 2017-10-26 International Business Machines Corporation Text classification by ranking with convolutional neural networks
CN108363774A (en) * 2018-02-09 2018-08-03 西北大学 A kind of drug relationship sorting technique based on multilayer convolutional neural networks
CN108763216A (en) * 2018-06-01 2018-11-06 河南理工大学 A kind of text emotion analysis method based on Chinese data collection
CN109299264A (en) * 2018-10-12 2019-02-01 深圳市牛鼎丰科技有限公司 File classification method, device, computer equipment and storage medium

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
杜晓东: "基于深度网络的药物关系抽取算法研究", 《中国优秀硕士学位论文全文数据库》 *
马龙: "基于深度神经网络的药物关系挖掘方法研究", 《中国优秀硕士学位论文全文数据库》 *

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111243682A (en) * 2020-01-10 2020-06-05 京东方科技集团股份有限公司 Method, device, medium and apparatus for predicting toxicity of drug
CN111444961A (en) * 2020-03-26 2020-07-24 国家计算机网络与信息安全管理中心黑龙江分中心 Method for judging internet website affiliation through clustering algorithm
CN111444961B (en) * 2020-03-26 2023-08-18 国家计算机网络与信息安全管理中心黑龙江分中心 Method for judging attribution of Internet website through clustering algorithm
CN111898364A (en) * 2020-07-30 2020-11-06 平安科技(深圳)有限公司 Neural network relation extraction method, computer device and readable storage medium
CN111898364B (en) * 2020-07-30 2023-09-26 平安科技(深圳)有限公司 Neural network relation extraction method, computer equipment and readable storage medium
JP2022538823A (en) * 2020-09-27 2022-09-06 平安科技(深▲せん▼)有限公司 Drug classification method and device, terminal device and storage medium
JP7227405B2 (en) 2020-09-27 2023-02-21 平安科技(深▲せん▼)有限公司 Drug classification method and device, terminal device and storage medium
WO2021159744A1 (en) * 2020-09-27 2021-08-19 平安科技(深圳)有限公司 Medicine classification method and apparatus, terminal device, and storage medium
CN112860816A (en) * 2021-03-01 2021-05-28 三维通信股份有限公司 Construction method and detection method of interaction relation detection model of drug entity pair
CN113806531A (en) * 2021-08-26 2021-12-17 西北大学 Drug relationship classification model construction method, drug relationship classification method and system
CN113806531B (en) * 2021-08-26 2024-02-27 西北大学 Drug relationship classification model construction method, drug relationship classification method and system
WO2023173823A1 (en) * 2022-03-17 2023-09-21 中国科学院深圳理工大学(筹) Method for predicting interaction relationship of drug pair, and device and medium
CN115376668A (en) * 2022-08-30 2022-11-22 温州城市智慧健康有限公司 Big data business analysis method and system applied to intelligent medical treatment
CN115376668B (en) * 2022-08-30 2024-03-08 温州城市智慧健康有限公司 Big data business analysis method and system applied to intelligent medical treatment

Also Published As

Publication number Publication date
CN110020671B (en) 2023-04-18

Similar Documents

Publication Publication Date Title
CN110020671A (en) The building of drug relationship disaggregated model and classification method based on binary channels CNN-LSTM network
CN111078889B (en) Method for extracting relationship between medicines based on various attentions and improved pre-training
CN104318340B (en) Information visualization methods and intelligent visible analysis system based on text resume information
CN104391942B (en) Short essay eigen extended method based on semantic collection of illustrative plates
US10387784B2 (en) Technical and semantic signal processing in large, unstructured data fields
CN108388651A (en) A kind of file classification method based on the kernel of graph and convolutional neural networks
CN108182295A (en) A kind of Company Knowledge collection of illustrative plates attribute extraction method and system
CN106815369A (en) A kind of file classification method based on Xgboost sorting algorithms
CN108073569A (en) A kind of law cognitive approach, device and medium based on multi-layer various dimensions semantic understanding
CN106815293A (en) System and method for constructing knowledge graph for information analysis
CN104142998A (en) Text classification method
CN112347761B (en) BERT-based drug relation extraction method
CN107292348A (en) A kind of Bagging_BSJ short text classification methods
CN112420148A (en) Medical image report quality control system, method and medium based on artificial intelligence
Xu et al. Chinese event detection based on multi-feature fusion and BiLSTM
Kakade et al. A neural network approach for text document classification and semantic text analytics
CN109871414A (en) Biomedical entity relationship classification method based on the context vector kernel of graph
Wang et al. Intelligent classification of tobacco leaves based on residual network
Dutta et al. Fake news prediction: a survey
CN112215006A (en) Organization named entity normalization method and system
Falissard et al. A deep artificial neural network based model for underlying cause of death prediction from death certificates
CN110413899A (en) The storage resource optimization method and system of server storage news
CN110196976A (en) Sentiment orientation classification method, device and the server of text
CN111797945B (en) Text classification method
Aluna et al. Electronic News Sentiment Analysis Application to New Normal Policy during the Covid-19 Pandemic Using Fasttext and Machine Learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant