CN113763332A

CN113763332A - Pulmonary nodule analysis method and device based on ternary capsule network algorithm and storage medium

Info

Publication number: CN113763332A
Application number: CN202110946389.9A
Authority: CN
Inventors: 郑光远
Original assignee: Shanghai Jian Qiao University
Current assignee: Shanghai Jian Qiao University
Priority date: 2021-08-18
Filing date: 2021-08-18
Publication date: 2021-12-07
Anticipated expiration: 2041-08-18
Also published as: CN113763332B

Abstract

The invention relates to a lung nodule analysis method, a device and a storage medium based on a ternary capsule network algorithm, wherein the method comprises the following steps: constructing a lung nodule analysis model based on a ternary capsule network algorithm; acquiring an image to be analyzed; learning a single capsule network module, and training a capsule network intelligent agent; performing ternary capsule network module learning, and optimizing a capsule network intelligent agent based on the ternary capsule network loss; analyzing the CT image based on the learnt and optimized capsule network intelligent agent to give a classification result of the lung nodules, namely acquiring the signs of the lung nodules suffered by the patient; and determining a data analysis result of the image to be analyzed according to the classification result, and outputting the data analysis result through a result determination module. Compared with the prior art, the method has the advantages of improving the analysis accuracy of the pulmonary nodules and the like.

Description

Pulmonary nodule analysis method and device based on ternary capsule network algorithm and storage medium

Technical Field

The invention relates to the technical field of machine learning, in particular to a lung nodule analysis method and device based on a ternary capsule network algorithm and a storage medium.

Background

With the development of computer-aided technology, the medical field also increasingly uses computer-aided equipment to reduce the work intensity of doctors. The doctor observes and analyzes the body image of the patient artificially, and the lung image examination and diagnosis needs increase year by year along with the enhancement of the health consciousness of people. In the field of medical image analysis, major difficulties currently include the following: firstly, the sample labeling is complex, the symptom expression of the focus in the medical image is complex and various, the difficulty in distinguishing different types of nodules is high, and professional knowledge and rich experience are needed for identifying and labeling; the complicated labeling process causes fatigue to readers, which leads to unstable diagnosis results, and therefore, each focus needs a plurality of experts to perform the comparison in different time periods, which is expensive. And (II) the medical radiology department doctors are insufficient, the daily film reading task of the doctors is heavy, and the doctors cannot extract special time to label the samples, so that the samples in the field of medical image analysis and research are insufficient. In the field of lung nodule classification and its related medical image classification, capsule network reinforcement learning has begun to attract interest. Silva used a set of 3D geometric features in combination with a reinforcement learning method to classify lung nodules as benign/malignant, with an accuracy of 81%. The heuristic of Ali in Alphago system proposes a lung nodule detection algorithm based on deep reinforcement learning, wherein an agent adopts a convolutional neural network. This method was trained and tested on the LIDC-IDRI database, resulting in an overall accuracy of 64.4% (sensitivity 58.9%, specificity 55.3%, PPV 54.2%, and NPV 60%). As for the ternary loss function, no relevant application in the field of lung nodule classification has been found at present. In the field of medical images, Puch proposes a small sample learning model based on a depth ternary network to classify brain MRI images. The result shows that under the condition of limited samples, the model can more accurately identify images of different modes than a traditional convolutional neural network classifier. Medela in one research work demonstrated that the use of triple network techniques can migrate knowledge from well-defined source data (colon tissue images) to a more general domain consisting of colon, lung and breast tissue images by using few training images.

The image analysis technology based on reinforcement learning gradually obtains the reliability and the adoption of doctors, particularly the detection task of lung nodules; image analysis techniques based on reinforcement learning can greatly reduce the workload of the doctor. However, the experts in the image department cannot additionally extract a large amount of time to perform sample labeling work, so that the sample in the field of medical image analysis and research is insufficient and the accuracy is not high.

Disclosure of Invention

The present invention aims to overcome the defects of the prior art, and provides a lung nodule analysis method, device and storage medium based on a ternary capsule network algorithm, which are used for improving the accuracy of lung nodule analysis.

The purpose of the invention can be realized by the following technical scheme:

a lung nodule analysis method based on a ternary capsule network algorithm comprises the following steps:

s1: and constructing a lung nodule analysis model based on a ternary capsule network algorithm, wherein the lung nodule analysis model comprises a memory pool module, a single capsule network module, a ternary capsule network module, an identification module and a result determination module.

S2: and acquiring an image to be analyzed.

S3: and (5) learning a single capsule network module, and training a capsule network intelligent agent. Specifically, the size of a sliding window on a CT image is kept unchanged, the number of image blocks extracted by each CT is determined, the state number M of each round is determined in the reinforcement learning process, the difference between the single capsule network diagnosis result and the calibration result is compared, and the action return on each image block is correspondingly calculated.

The specific method for comparing the dissimilarity of the single-capsule network diagnosis result and the calibration result comprises the following steps:

for an image block s_tIf the single capsule network diagnosis is correct, a reward value, namely a return value r, is given; otherwise, giving the reward value of 0; the method for judging whether the single-capsule network diagnosis result is correct comprises the following steps:

a: if image block s_tIf the region is overlapped with a focus region of the calibration result, further checking whether the classification result of the image block is consistent with the symptom category label of the calibration result, if so, judging that the diagnosis result of the single capsule network is correct, otherwise, judging that the diagnosis result is wrong;

b: if image block s_tAnd if the area is not overlapped with any focus area of the calibration result and the classification result of the image block is negative, judging that the single capsule network diagnosis result is correct, otherwise, judging that the single capsule network diagnosis result is wrong.

S4: and (4) learning a ternary capsule network module, and optimizing the capsule network intelligent agent based on the ternary capsule network loss.

S5: and inputting lung CT images into the learned capsule network, and performing sliding window processing on the CT images by the model to obtain a series of states.

S6: and determining a data analysis result of the image to be analyzed according to the classification result, and outputting the data analysis result through a result determination module.

Further, in S1, the specific content of constructing the lung nodule analysis model based on the ternary capsule network algorithm is as follows:

11) inputting a plurality of pictures, setting initial learning parameters including the memory space N of a memory pool D in a memory pool module and a temporary memory pool D_TCapacity H, return value r, interval factor α, number of small batches B, number of training iterations EPOCHS, and capsule network initial parameter ω_c；

12) For a CT image with 512 x 512 pixels, a sliding window with K x K pixels is adopted to obtain a 32 x 32 pixel image block on the CT image, and the coordinates (x) at the upper left corner of the image block are recorded_i,y_i) (ii) a Setting each image block to a state s_tFurther, all image blocks extracted from the CT image form a set S of all states;

13) state S in set S of capsule network pairs_tSelecting corresponding classification action a from action space A_tOutput the class attribute c of this state_j；

14) All the segments s taken from a CT image in the capsule network_iPerform the corresponding action a_tAnd after the detection is finished, displaying the lung nodule and the symptom classification result thereof detected by the capsule network back to the CT image according to the coordinate position of the image block region, thereby constructing a lung nodule analysis model based on a ternary capsule network algorithm.

Further, in S3, a fully-connected layer is added to the single-capsule network after the feature expression to output the accumulated reward Q corresponding to each category_targetThe value:

Q_target＝M×r

the corresponding single step loss function is:

in the formula, L (omega)_c) For the name of the loss function, R (x) is the return function,

is a capsule network agent and is a capsule network agent,

as anchor image, a_jFor corresponding actions, ω_cIs radix Ginseng;

and learning the parameters of the single-capsule network by adopting the corresponding single-step loss function of the single-capsule network until the loss function is smaller than a preset value.

Further, in S4, the specific content of learning the ternary capsule network module is as follows:

41) inputting three images at a time, respectively being anchor images

Positive sample image

And negative sample image

Respectively extracting features from the three images by utilizing the capsule network, and further calculating the Euclidean distance d between the anchor point image and the positive sample image₊And the Euclidean distance d between the anchor point image and the negative sample image_-And normalizing the data; calculating a ternary loss function on the basis;

42) converting the loss function of the single-layer capsule network into a ternary loss function;

43) the ternary capsule network receives a state from the state set S and calculates rewards Q corresponding to different categories_targetValue, select corresponding Q_targetExecuting the action with the maximum value, correspondingly entering the next state, and simultaneously storing elements such as the current state, the action and the like; the process is iterated until the agent respectively executes corresponding actions on all states acquired from a CT image by a sliding window method; the environment compares the classification result of each image block by the ternary capsule network with the calibrated classification result, and correspondingly calculates and stores a return value;

44) the system performs experience playback, randomly extracts a batch of data from the state, action and return combination stored in the memory pool, calculates the loss of the capsule network and updates the network weight.

The calculation formula of the loss calculation and the network weight update of the capsule network is as follows:

in the formula, L_composeGenerating a composite loss value; b is the number of data selected by experience playback in reinforcement learning, namely the number of previously-placed experience data is randomly selected from an experience pool; λ is two lost weight adjustment factors;

is a capsule network agent; alpha represents the minimum difference of the embedding expression distances of the anchor sample and the negative sample; l is_i(d₊,d_-) Is a ternary loss function.

Further, in S6, the specific process of determining the data analysis result of the image to be analyzed according to the classification result is as follows:

suppose there is a sample s to be measured^AC is the number of all the classes considered, from the existing labeled sample set, N is randomly selected for each class of samples^PA total of N is obtained for each sample^PX C samples; each sample s^PRespectively associated with a sample s to be measured^AAfter combination form N^PX C group of sample pairs

i∈[0,C-1]And after the distances corresponding to each group of sample pairs are calculated, calculating the average distance corresponding to each category, and taking the category with the minimum average distance as a classification result.

A pulmonary nodule analysis apparatus based on a ternary capsule network algorithm, the apparatus comprising:

a data storage module storing one or more programs;

the construction module is used for constructing a lung nodule analysis model based on a ternary capsule network algorithm;

the acquisition module is used for acquiring a lung image of a patient and taking the lung image as an image to be analyzed;

the analysis module divides the image to be analyzed into regions to form image data, inputs the data into a lung nodule analysis model which is trained in advance, and obtains a classification result output by the lung nodule analysis model;

the analysis result determining module is used for determining the analysis result of the image to be analyzed according to the classification result;

the output module is used for outputting data of the analysis result and the lung nodule image;

a central processor executing one or more programs that implement the lung nodule analysis method based on a ternary capsule network algorithm as described above.

A computer-readable storage medium having stored thereon a computer program for execution by a processor for implementing a lung nodule analysis method based on a ternary capsule network algorithm as described above.

Compared with the prior art, the lung nodule analysis method, the device and the storage medium based on the ternary capsule network algorithm provided by the invention at least have the following beneficial effects:

1) according to the invention, the intelligent agent is trained and learned in an early stage by using the loss of the single capsule network, when the performance of the single capsule network is improved and stagnated gradually to the best, the intelligent agent is further optimized based on the loss of the ternary capsule network, so that the characteristic distance between classes is relatively larger, the distance in the classes is smaller, and thus the samples in different classes can be distinguished in a finer granularity, and the final performance is obviously improved compared with the single capsule network;

2) the method is based on the ternary loss measurement method, and can obtain higher accuracy of analyzing the pulmonary nodules than single-point loss and pairwise loss (twin network).

Drawings

Fig. 1 is a diagram of a single capsule network architecture of a lung nodule analysis method based on a ternary capsule network algorithm according to an embodiment;

fig. 2 is a diagram of a ternary capsule network architecture of a lung nodule analysis method based on a ternary capsule network algorithm according to an embodiment;

fig. 3 is a schematic flowchart of a lung nodule analysis method based on a ternary capsule network algorithm according to an embodiment;

FIG. 4 is a graph comparing the lung nodule analysis method based on the ternary capsule network algorithm and the DQN average classification performance in the embodiment;

FIG. 5 is a lung nodule analysis method and DQN classification accuracy difference confusion matrix diagram based on the ternary capsule network algorithm in the embodiment;

fig. 6 is a comparison graph of a symptom map generated by a lung nodule analysis method based on a ternary capsule network algorithm and DQN reconstruction in the embodiment.

Detailed Description

The invention is described in detail below with reference to the figures and specific embodiments. It is to be understood that the embodiments described are only a few embodiments of the present invention, and not all embodiments. All other embodiments, which can be obtained by a person skilled in the art without any inventive step based on the embodiments of the present invention, shall fall within the scope of protection of the present invention.

Examples

The invention relates to a lung nodule analysis method based on a ternary capsule network algorithm, which can be applied to the condition of analyzing lung nodules according to lung images. The method may be performed by a lung nodule analyzing apparatus, which may be implemented in software and/or hardware, for example, which may be configured in a computer device. As shown in fig. 3, the lung nodule analysis method based on the ternary capsule network algorithm (hereinafter referred to as TriCaps-RL) specifically includes the following steps:

the lung nodule analysis model based on the ternary capsule network algorithm is constructed and comprises a memory pool module, a single capsule network module, a ternary capsule network module, an identification module and a result determination module. The concrete contents are as follows:

firstly, inputting a large number of pictures, performing multi-stage convolution on the input pictures by a capsule network, then encapsulating and learning image characteristics by using capsules, and generating an algorithm scheme; in this embodiment, as a preferred scheme, after two layers of convolution operations are performed on an adopted capsule network, the features extracted by the convolution layers are sent to initial capsule layers (PrimaryCaps) for encapsulation, so as to generate 8-dimensional feature combinations, and then 8-dimensional input space is mapped to 16 dimensions through 8 × 16 weight matrices, so as to obtain final feature expression of an image.

Setting initial learning parameters: memory space N of memory pool D in memory pool module and memory space D of temporary memory pool_TCapacity H, return value r, interval factor α, number of small batches B, number of training iterations EPOCHS, and capsule network initial parameter ω_c。

For a 512 x 512 pixel CT image, the algorithm first uses K x K (K)<512) The sliding window of the pixel acquires different image blocks on the CT image one by one, and simultaneously records the coordinates (x) of the upper left corner of the image blocks_i,y_i) (ii) a Each image block being regarded as a state s_tAll image blocks extracted from the CT image then constitute a set S of all states.

S in capsule network pair S_tSelecting corresponding classification action a from action space A_tOutput the class attribute c of this state_j(ii) a When the capsule network is used for all image blocks s obtained from a CT image_iThe corresponding action a is executed_tThen, the lung nodule and the symptom classification result obtained by the capsule network detection are displayed back to the CT image according to the coordinate position of the image block area; therefore, a lung nodule analysis model based on a ternary capsule network algorithm is constructed.

And step two, acquiring an image to be analyzed.

Step three, learning a single-capsule network module, wherein the single-capsule network architecture diagram is shown in figure 1:

the size of a sliding window on one CT image is unchanged, so that the number of image blocks extracted from each CT image is determined, the state number M of each round is determined in the reinforcement learning process, the difference between the single-capsule network diagnosis result and the calibration result is compared, and the action return on each image block is correspondingly calculated, wherein the specific method comprises the following steps: for an image block s_tIf the single capsule network diagnosis is correct, a reward value r is given (which is determined experimentally); otherwise, giving the reward value of 0; the method for judging whether the single-capsule network diagnosis result is correct is as follows:

1) if image block s_tIf the region is overlapped with a certain focus region of the calibration result, further checking whether the classification result of the image block is consistent with the symptom category label of the calibration result, if so, determining that the single-capsule network diagnosis result is correct, otherwise, determining that the diagnosis result is wrong;

2) if image block s_tThe area is not overlapped with any focus area of the calibration result, and the classification result of the image block is negative, the single capsule network diagnosis result is considered to be correct; otherwise, the single capsule network diagnosis result is considered to be wrong.

Adding a fully-connected layer to output the accumulated reward Q corresponding to each category after the characteristics are expressed by the single-capsule network_targetThe value:

Q_target＝M×r (1)

the corresponding single step loss function is:

wherein L is_j(ω_c) Is the name of the loss function and,

for the capsule network agent, R (x) is a reporting function,

representing one of the states in a round, a, for the anchor image_jRepresenting the corresponding movement, ω_cThe super reference is shown.

And (3) learning the parameters of the single-capsule network by adopting the loss function formula (2) of the single-capsule network until the performance cannot be improved.

Step four, learning a ternary capsule network module, wherein the ternary capsule network architecture diagram is shown in fig. 2:

inputting three images at a time, respectively being anchor images

Positive sample image

And negative sample image

Respectively extracting features from the three images by utilizing the capsule network, and further calculating the Euclidean distance d between the anchor point image and the positive sample image₊And the Euclidean distance d between the anchor point image and the negative sample image_-And normalizing it to obtain:

in the formula (I), the compound is shown in the specification,

a capsule-embedded representation of the anchor point sample,

representing an embedded representation of the positive sample,

representing the embedded expression of the negative examples.

On this basis, the ternary loss function is calculated as:

in the formula: alpha represents the minimum difference between the anchor sample and the negative sample, and the anchor sample and the positive sample embed the expression distance.

The loss function of the single-layer capsule network is transited into a ternary loss function (5);

the ternary capsule network receives a state from state set S and computes Q corresponding to different classes_targetValue, select corresponding Q_targetExecuting the action with the maximum value, correspondingly entering the next state, and simultaneously storing elements such as the current state, the action and the like; the process is iterated until the agent respectively executes corresponding actions on all states acquired from a CT image by a sliding window method; and the environment compares the classification result of each image block by the ternary capsule network with the calibrated classification result, and automatically calculates and stores a return value (reward value r).

The system executes experience playback, randomly extracts a batch from the state, action and return combination stored in the memory pool, calculates the loss of the capsule network according to the following formula (6) and updates the network weight:

wherein L is_composeRepresents the integrated loss value generated by equation (6); b is the number of data selected by experience playback in reinforcement learning, namely the number of previously-placed experience data is randomly selected from an experience pool; λ is two lost weight adjustment factors, which gradually increase with the increase of the training iteration number n, and the calculation formula is:

in equation (7), EPOCHS is a preset iteration value.

The use method of the formula (7) is as follows: when the iteration number n of the algorithm is less than EPOCHS, the algorithm mainly adopts the formula (8):

when the iteration number n is increased to exceed EPOCHS, the ternary loss shown in the formula (5) is mainly adopted, so that the classification accuracy of the intelligent agent is further finely adjusted, and the class distinction with finer granularity is realized for the sample. The effect of the unit loss stated above is to make the network converge faster.

And step five, inputting lung CT images into the learned capsule network, and performing sliding window processing on the CT images by the model to obtain a series of states.

And step six, selecting corresponding actions with maximum return for the states by the intelligent agents in the model according to the learned strategies, and outputting an analysis result, wherein the specific contents are as follows:

suppose there is a sample s to be measured^AC is the number of all the classes considered, from the existing labeled sample set, N is randomly selected for each class of samples^PA total of N is obtained for each sample^PX C samples; each sample s^PRespectively associated with a sample s to be measured^AAfter combination to form N^PX C group of sample pairs

i∈[0,C-1]Calculating the distance corresponding to each group of sample pairs, then calculating the average distance corresponding to each category, and taking the category with the minimum average distance as a classification result, namely:

wherein, y^*Is expressed as the mostClass of final output, s^AFor the sample to be tested, yⁱIn the case of the (i) th category,

to test the group yⁱJ ∈ [1, N) of the jth sample of (c)^P]。

The invention determines the analysis result of the image data to be analyzed according to the classification result and outputs the analysis result, and the lung nodule analysis result of the patient is output as the sign of the lung nodule suffered by the patient.

To demonstrate the effectiveness of the method of the invention, this example was verified experimentally. The experiments were performed on a sample set selected from the LIDC-IDRI library. The edge part of each nodule in the LIDC-IDRI library is sketched in detail by a plurality of doctors, and the nodules are evaluated according to the quality and the malignancy, and the labeling information is a record of the diagnosis operation of the real film reading process of the doctors. Thus, the label data in the LIDC-IDRI is used in this embodiment as a basis for the environment (simulating the doctor) to evaluate and feedback rewards for the actions selected and performed by the agent in the method of the invention.

For the convenience of experimental comparison, the node features provided by the radiologist during labeling are graded, the CT images of the positive non-central calcium feature, the lobular feature, the burr and the GGO feature 4-type node features are subjected to sample size increase through rotation and overturning operations, 12000 CT images are generated, and the experimental scheme is a 5-fold cross validation experiment.

For each CT image, a 32 × 32 pixel sliding window block acquisition is performed, with a sliding interval of 11 pixels, and accordingly 46 × 46 — 2116 blocks of pictures can be taken from each CT. In this experiment, for each agent identification, the correct return is given as r ═ 1, and Q corresponds to each image_targetIs 2116. The maximum capacity N of the playback memory pool D is set to 3 k. Each time 50 small batches of samples were taken from D for training of the agent. For the epsilon greedy strategy, epsilon is the greedy weight, and the experiment sets epsilon to be linearly reduced from 1 to 0.1 in 3000 iteration cycles. Further, according to the experiment, for EPOCHS in equations (5-8), it is set to 22000.

In the classification stage of the symptoms in the experiment, the categoriesThe number C is 5. Randomly selecting P from each type of sample at a time_nTotal 10 individuals, get P_nAnd x C is 50 samples, and the x C and the samples to be classified form 50 sample pairs, and are classified according to the formula (9).

The accuracy rate of the Tricaps-RL changes dramatically before 2600 steps of the learning process, the process should be that the data in the experience memory pool is not completely updated, the value of the loss function at this stage is unstable and does not become smaller gradually with the increase of the iteration number, and the accuracy rate gradually increases steadily with the accumulation of the experience after 2600 steps. When training is carried out to 14000 iterations, the accuracy of the algorithm gradually reaches a stable value, and small-amplitude improvement is carried out after 22000 steps. This is because EPOCHS in equation (7) was set to 22000 in the experiment, after which the loss of the algorithm gradually transitions to be calculated by the ternary loss function shown in (5).

With the continuous increase of the training iteration number, the feature embedded expressions of the same feature class sample are gradually aggregated, and the feature embedded expressions of different feature class samples are gradually separated.

A DQN (classical deep reinforcement learning algorithm) network is constructed according to the structure shown in Table 1, and comprises three convolutional layers, then the data obtained from LIDC-IDRI is used for training the convolutional layers, and the final test result is calculated and compared with the classification performance of Tricaps-RL.

Table 1 CNN network architecture for comparison

The difference between DQN and the Tricaps-RL method of the invention is that: a) a CNN network is employed instead of a capsule network; 2) only the Q-learning objective is employed and no triplet learning objective is considered.

The average classification performance of both methods is shown in fig. 4. As can be seen from fig. 4, the classification effect of tricas-RL is significantly better than that of DQN classification method in terms of sensitivity, specificity and accuracy, which indicates that the capsule network and triplet learning objective employed herein are reasonably effective.

In order to perform finer-grained analysis and check the performance difference of the two methods in different categories, a difference confusion matrix of the classification accuracy of the TriCaps-RL and the DQN is constructed in the embodiment, as shown in fig. 5. The diagonal floating point numbers represent the difference in classification accuracy of the corresponding classes for TriCaps-RL and DQN, while the non-diagonal floating point numbers represent their difference in classification error rate, showing the rate of error classification to each of the other respective classes.

The value on the diagonal line is the difference of the classification accuracy of the Tricaps-RL and the DQN; the elements that are not on the diagonal are the difference in the classification error rate. As shown in fig. 5, the values of the elements on the diagonal of the difference confusion matrix are all positive, and the average value of the number of the diagonals is 0.1047, which indicates that the tricas-RL algorithm has higher accuracy on each class. The overall classification performance is better than DQN. No positive values appear in the off-diagonal elements, which means that the misclassification rate of Tricaps-RL is lower than DQN on all classes. Table 2 is a record of the average time of 5 trains and the average predicted time of 20 samples for both the TriCaps-RL method and the DQN method.

TABLE 2 comparison of training time and sort time for the Tricaps-RL and DQN methods

From table 2, it can be seen that the training time of the tricas-RL method is much longer than that of the DQN method, mainly because when the classification performance of the method cannot be improved any more in training the tricas-RL method, the strategy is gradually switched to the strategy optimization using the ternary loss metric. The ternary loss metric strategy increases the number of computations and also makes the gradient change small at each iteration, so it takes longer when the classification performance reaches a new balance. Meanwhile, compared with DQN, the time of the Tricaps-RL method is longer in the classification stage, because the Tricaps-RL method needs to be subjected to averaging after multiple sample similarity comparisons are carried out. However, the classification time is still less than 0.1 second, and the real-time interaction requirement can be fully met in the interaction process with the doctor.

And respectively reconstructing images on the two network output characteristics so as to further compare and analyze the quality of the characteristics acquired by the two networks. For the DQN network, a symmetrical deconvolution network is added on the basis of the CNN network shown in Table 1 to realize reconstruction; for the algorithm, reconstruction is realized by adding a reconstruction network behind a capsule network according to the scheme in the document Dynamic Routing Between Capsules. Both were re-generated after 30000 training sessions with the single capsule network of TriCaps-RL and CNN in DQN, respectively, for the incoming symptom samples. Fig. 6 shows a comparison of the two results, the first row being the original image and the second and third rows being the image blocks generated by the capsule network and CNN, respectively. It can be seen from the figure that the image blocks of the signs generated by the capsule network are obviously finer and more remarkable than those generated by the CNN; the image generated by the CNN obviously loses many details, especially for the No. 1 and No. 9 symbolic graphs with unobvious texture contrast or the No. 14 and No. 18 symbolic graphs with large lesion areas, the CNN can not successfully express the characteristics, and only a blank graph is generated. The reconstruction of the capsule network is also not very good for the 9 th image, but we can find that it is also difficult for the human eye to discern the texture of its symptoms for this sample.

Correspondingly, the invention also provides a lung nodule analysis device based on the ternary capsule network algorithm, which comprises:

a data storage module for storing one or more programs;

the acquisition module is used for acquiring a lung image of a patient as an image to be analyzed;

the analysis module is used for dividing the image to be analyzed into regions to form image data, inputting the data into a pre-trained lung nodule analysis model and obtaining a classification result output by the analysis model;

a central processor for executing the one or more programs that implement the method for lung nodule analysis provided herein above.

Further, the present invention also provides a computer readable storage medium having stored thereon a computer program which, when executed by a processor, implements the lung nodule analyzing method as provided above.

While the invention has been described with reference to specific embodiments, the invention is not limited thereto, and those skilled in the art can easily conceive of various equivalent modifications or substitutions within the technical scope of the invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims

1. A lung nodule analysis method based on a ternary capsule network algorithm is characterized by comprising the following steps:

1) constructing a lung nodule analysis model based on a ternary capsule network algorithm, wherein the lung nodule analysis model comprises a memory pool module, a single capsule network module, a ternary capsule network module, an identification module and a result determination module;

2) acquiring an image to be analyzed;

3) learning a single capsule network module, and training a capsule network intelligent agent;

4) performing ternary capsule network module learning, and optimizing a capsule network intelligent agent based on the ternary capsule network loss;

5) inputting lung CT images into the learned capsule network, and performing sliding window processing on the CT images by the model to obtain a series of states;

6) and determining a data analysis result of the image to be analyzed according to the classification result, and outputting the data analysis result through a result determination module.

2. The lung nodule analysis method based on the ternary capsule network algorithm as claimed in claim 1, wherein in the step 1), the specific contents for constructing the lung nodule analysis model based on the ternary capsule network algorithm are as follows:

12) For a CT image with 512 x 512 pixels, a sliding window with K x K pixels is adopted to obtain a 32 x 32 pixel image block on the CT image, and the coordinates (x) at the upper left corner of the image block are recorded_i，y_i) (ii) a Setting each image block to a state s_tFurther, all image blocks extracted from the CT image form a set S of all states;

3. The lung nodule analysis method based on the ternary capsule network algorithm as claimed in claim 2, wherein in the step 3), the specific contents for learning the single capsule network module are as follows:

keeping the size of a sliding window on the CT image unchanged, determining the number of image blocks extracted by each CT, determining the state number M of each round in the reinforcement learning process, comparing the difference between the single-capsule network diagnosis result and the calibration result, and correspondingly calculating the action return on each image block.

4. The lung nodule analysis method based on the ternary capsule network algorithm as claimed in claim 3, wherein the specific method for comparing the dissimilarity of the diagnosis result and the calibration result of the single capsule network is as follows:

5. The lung nodule analysis method based on the ternary capsule network algorithm as claimed in claim 4, wherein in the step 3), a fully-connected layer is added to the single capsule network after the feature expression to output the accumulated reward Q corresponding to each category_targetThe value:

Q_target＝M×r

the corresponding single step loss function is:

is a capsule network agent and is a capsule network agent,

as anchor image, a_jFor corresponding actions, ω_cIs radix Ginseng;

6. The lung nodule analysis method based on the ternary capsule network algorithm according to claim 5, wherein in the step 4), the specific contents for learning the ternary capsule network module are as follows:

41) inputting three images at a time, respectively being anchor images

Positive sample image

And negative sample image

43) the ternary capsule network receives a state from the state set S and calculates rewards Q corresponding to different categories_targetValue, select corresponding Q_targetExecuting the action with the maximum value, correspondingly entering the next state, and simultaneously storing elements such as the current state, the action and the like; the process is iterated until the agent respectively executes corresponding actions on all states acquired from a CT image by a sliding window method; the environment carries out classification results of the ternary capsule network on each image block and the calibrated classification resultsComparing, correspondingly calculating and storing the return value;

7. The lung nodule analysis method based on the ternary capsule network algorithm as claimed in claim 6, wherein the calculation formula of the loss calculation and the network weight update of the capsule network is as follows:

is a capsule network agent; alpha represents the minimum difference of the embedding expression distances of the anchor sample and the negative sample; l is_i(d₊，d_-) Is a ternary loss function.

8. The lung nodule analysis method based on the ternary capsule network algorithm as claimed in claim 5, wherein in the step 6), the specific process of determining the data analysis result of the image to be analyzed according to the classification result is as follows:

And after the distance corresponding to each group of sample pairs is calculated, calculating the average distance corresponding to each category, and taking the category with the minimum average distance as a classification result.

9. A pulmonary nodule analysis apparatus based on a ternary capsule network algorithm, comprising:

a data storage module storing one or more programs;

a central processor executing one or more programs implementing the lung nodule analysis method based on the ternary capsule network algorithm of any one of claims 1 to 8.

10. A computer-readable storage medium having stored thereon a computer program for execution by a processor to implement the lung nodule analysis method based on a ternary capsule network algorithm according to any one of claims 1 to 8.