CN113255889A

CN113255889A - Occupational pneumoconiosis multi-modal analysis method based on deep learning

Info

Publication number: CN113255889A
Application number: CN202110579022.8A
Authority: CN
Inventors: 周孟然; 杨先军; 胡锋; 陈焱焱; 卞凯; 闫鹏程
Original assignee: Hefei Broshare Electronic Technology Co ltd; Anhui University of Science and Technology
Current assignee: Hefei Broshare Electronic Technology Co ltd; Anhui University of Science and Technology
Priority date: 2021-05-26
Filing date: 2021-05-26
Publication date: 2021-08-13
Anticipated expiration: 2041-05-26
Also published as: CN113255889B

Abstract

The invention provides a deep learning-based occupational pneumoconiosis multi-modal analysis method, which belongs to the field of pneumoconiosis analysis and comprises the following steps: collecting chest X-ray image information and personal basic information of a person; carrying out word vectorization processing on the personal basic information; constructing a one-dimensional convolutional neural network and a two-dimensional convolutional neural network, and establishing a multi-mode convolutional neural network MM-CNN model on the basis; the two kinds of information are used as the input of a multi-modal convolutional neural network MM-CNN model, a multi-classification MM-CNN pneumoconiosis analysis model is established, and an objective function is formed under the condition that corresponding constraints are met; optimizing the hyper-parameters of the multi-classification MM-CNN pneumoconiosis analysis model by adopting a mixed frog-jumping algorithm SFLA; and analyzing the chest X-ray image information of the person and the personal information after word vectorization by adopting the optimized multi-classification MM-CNN pneumoconiosis analysis model, and outputting an analysis result. The method can realize accurate and real-time detection and analysis of the lung health of the personnel, and complete early warning of part of occupational pneumoconiosis.

Description

Occupational pneumoconiosis multi-modal analysis method based on deep learning

Technical Field

The invention belongs to the field of pneumoconiosis analysis, and particularly relates to a deep learning-based occupational pneumoconiosis multi-modal analysis method.

Background

In recent years, the continuous stable improvement of the safety production in China is realized, the rapid reduction of the number of production safety accidents and death people for years is realized, but the situation of occupational health work is still very severe. Occupational pneumoconiosis is one of occupational diseases and is mainly distributed in coal, colored, mechanical, building material, light industry and other industrial industries. For example, in the coal mine production process, many dust (mainly including coal dust and silicon dust) is generated in many links such as rock tunnel blasting, rock tunnel loading, rock tunnel tunneling, coal tunnel blasting, coal tunnel reinforcement, coal preparation transportation, and the like, and the excessive dust is a fierce cause of pneumoconiosis. Pneumoconiosis is a disease in which lung tissue fibrosis, which occurs as a result of workers inhaling large quantities of free silica and other dust for a long period of time during industrial activities, predominates. Most of the dust is discharged, but a part of the dust is retained in bronchioles and alveoli for a long time and is continuously phagocytosed by alveolar macrophages, and the dust-swallowed macrophages are the main pathogenic factors. A series of studies have shown that after a pneumoconiosis lesion is formed, the residual dust in the lung continues to react with alveolar macrophages, which is the main reason why the lesion continues to develop even though the pneumoconiosis patient is out of dust operation. The common symptoms of patients with pneumoconiosis are chest distress, chest pain, short breath, cough, general weakness, serious patients losing labor capacity even can not lie flat, and finally lung function failure, kneeling and death are caused, and the symptoms of the patients are not witnessed.

Pneumoconiosis is an incurable disease, no specific medicine for curing pneumoconiosis exists in the world at present, and lung washing can only relieve the pain of patients to a certain extent and slow down the development of the disease, but cannot reverse the disease fundamentally. And the early detection aiming at the lung health condition of the personnel can realize the early recognition of the occupational pneumoconiosis, improve the early discovery rate of the occupational pneumoconiosis and reduce the serious burden of the personnel caused by the occupational pneumoconiosis. Therefore, it is necessary and important to develop a diagnosis apparatus and an analysis method for occupational pneumoconiosis of this special group, which can realize accurate and real-time measurement and analysis of the lung health status of the personnel, and have important significance for early warning of some occupational pneumoconiosis and ensuring the life health of the personnel.

At present, the diagnosis of occupational pneumoconiosis is mainly determined by 'diagnosis of occupational pneumoconiosis' (GBZ70-2015), and doctors rely on personal experience to perform diagnosis and analysis of pneumoconiosis based on relevant diagnosis standards and principles by comparing and analyzing chest slices of patients and X-ray diagnosis standard slices of pneumoconiosis. With the continuous development and application of the artificial intelligence algorithm, the application of the artificial intelligence algorithm to the auxiliary analysis of the chest radiography image data becomes a research hotspot and a series of successful applications are achieved. However, the chest radiography acquisition process is influenced by various factors such as the position of the irradiated object, exposure conditions, operation errors, films and the like, so that the chest radiography has uneven image quality levels and high difficulty in feature extraction and analysis, and the defects of low identification precision, long time consumption and the like exist when the traditional artificial intelligence algorithm is directly used for analyzing the chest radiography. With the proposal of deep learning theory and the improvement of numerical computing equipment, Convolutional Neural Network (CNN) has been rapidly developed, and CNN is used as one of the best modes for feature extraction in image processing and text analysis and is applied to the fields of computer vision, natural language processing and the like. Although the CNN model has excellent performance, how to design a suitable network structure for a specific application problem and find the optimal model parameters is a difficult problem in the CNN application process.

In view of the above, the invention provides a deep learning-based multi-modal analysis method for occupational pneumoconiosis.

Disclosure of Invention

In order to overcome the defects of the prior art, the invention provides a deep learning-based occupational pneumoconiosis multimodal analysis method.

In order to achieve the above purpose, the invention provides the following technical scheme:

a multi-modal analysis method for occupational pneumoconiosis based on deep learning comprises the following steps:

collecting chest X-ray image information and personal basic information of a person;

performing word vectorization processing on the personal basic information;

constructing a One-dimensional Convolutional Neural Network (1D-CNN) and a Two-dimensional Convolutional Neural Network (2D-CNN), and establishing a Multi-modal Convolutional Neural Network (MM-CNN) model on the basis;

taking chest X-ray image information of personnel and personal information after word vectorization as input of a multi-mode convolutional neural network MM-CNN model, establishing a multi-classification MM-CNN pneumoconiosis analysis model for occupational pneumoconiosis analysis, and forming an objective function under the condition of meeting corresponding constraints;

adopting a mixed Frog-jumping algorithm (SHuffled Frog leapingAlgorithm, SFLA) to optimize the hyper-parameters of the multi-classification MM-CNN pneumoconiosis analysis model;

and analyzing the chest X-ray image information and the personal information after word vectorization processing of the personnel by adopting the optimized multi-classification MM-CNN pneumoconiosis analysis model, and outputting the result of the occupational pneumoconiosis analysis.

Preferably, in the word vectorization processing of the personal basic information, a skip-gram model in word2vec is adopted to perform word vectorization conversion of the personal basic information of the person, the size of a context window is set to be 10, the size of a word vector dimension is set to be 50, and the sampling size is set to be 1 e-3.

Preferably, the building of the one-dimensional convolutional neural network 1D-CNN is specifically that the number of layers of the one-dimensional convolutional neural network is deepened by connecting a plurality of one-dimensional convolutional units, the sizes of convolutional kernels are all set to 3, when one convolutional layer is the lth convolutional layer, the number of convolutional kernels is 8 × L, the one-dimensional pooling layer adopts a maximum pooling mode, and the pooling size is 2;

the method for constructing the two-dimensional convolutional neural network specifically comprises the steps of deepening the layer number of the two-dimensional convolutional neural network by connecting a plurality of two-dimensional convolutional units, setting the sizes of convolutional kernels to be 3 multiplied by 3, and when one convolutional layer is the L-th convolutional layer, the number of the convolutional kernels is 8 multiplied by L, wherein the two-dimensional pooling layer adopts a maximum pooling mode, and the pooling size is 2 multiplied by 2.

Preferably, the establishing of the multi-classification MM-CNN model for the analysis and diagnosis of the occupational pneumoconiosis takes the prediction precision of the MM-CNN analysis model as an objective function.

Preferably, the hybrid frog-leaping algorithm SFLA is adopted to optimize the hyper-parameters of the multi-classification MM-CNN pneumoconiosis analysis model, and the hyper-parameters comprise a 1D-CNN network layer number M, a 1D-CNN activation function, a 2D-CNN network layer number N, a 2D-CNN activation function, an optimizer and a learning rate, and the method specifically comprises the following steps:

initializing a frog population;

and (3) frog classification: sequencing the frogs in the population S according to the increasing sequence of the fitness, and recording the frog position P with the best fitness in the population S_xIs F (1);

group division: partitioning the cultural genres according to the following formula;

M^k＝[F^k(j),f^k(j)|F^k(j)＝F(k+m(j-1)),f^k(j)＝f(k+m(j-1)),j＝1,2,…,n；k＝1,2,…,m]；

cultural gene inheritance evolution: each cultural genome M^k(k ═ 1,2, …, m) evolved independently from local search steps;

mixing culture gene bodies: after each cultural gene body is subjected to a round of local search, the population S is recombined, the population S is sorted in an increasing way according to fitness again, the optimal frogs in the population are updated, and the position P of the globally optimal frogs is recorded_x；

And (3) checking a stopping condition: if the algorithm convergence condition is met, stopping the algorithm execution process; otherwise, returning to execute the group division.

Preferably, the local search process is implemented by the following specific steps:

step A: defining a calculator: setting mi to be 0, wherein mi is a counter of a culture gene body, and marking a serial number of a current evolutionary culture gene body; setting ni as 0, wherein ni is a counter of the independent evolution times, and marking and comparing whether the independent evolution times of the current culture gene body is smaller than the maximum independent evolution times or not;

and B: initializing a calculator: mi is mi + 1;

and C: initializing a calculator: ni + 1;

step D: according to p_j2(n +1-j)/n (n +1), j 1,2, …, n to construct the subformulated gene:

step E: and (3) frog position updating: according to

Update the worst frog position and use F (q) ═ P_W+ L to calculate a new location F (q), and if F (q) is in the feasible region, calculating a new fitness f (q); otherwise, entering the step F for execution; if the new fitness is better than the old fitness, i.e. a better result is generated, replacing the old F (q) with the new F (q), and proceeding to step H; otherwise, entering the step F for execution;

wherein L is the jumping step length of the frog with the worst fitness in the subformization gene body, and r is [0, 1 ]]Random number of intervals, P_BFor the best location of the frog in the subformulation genome, P_WThe worst frog position in the subformized genome, L_maxThe maximum jumping step length after the frog is infected;

step F: if step D can not produce better result, updating the worst frog position again according to

To calculate a jump step; if F (q) is in the feasible region, calculating new fitness f (q), otherwise, turning to the step G; if the new fitness is better than the old fitness, i.e. a better result is produced, the old F (q) is replaced by the new F (q), and step H is repeatedExecuting; otherwise, executing step G;

wherein, P_XIs the global best position of the frog;

step G: randomly generating a new position of the frog: if the new position is not feasible and is not better than the old position, a new frog F (r) is randomly generated in the feasible region to replace the original frog, so as to stop the propagation of the defective cultural gene and calculate the fitness f (r);

step H: upgrading culture gene bodies: the worst frog in the sub-culture gene body is transferred and evolved to replace the worst frog in the culture gene body M^miAnd arranges M in descending order of fitness^mi；

Step I: checking the evolution times: if ni is less than n, skipping to the step C, and carrying out next inheritance evolution;

step J: checking culture gene number: if mi is less than m, skipping to the step B, and carrying out inheritance evolution of the next culture gene; otherwise, returning to the global search to mix cultural gene bodies.

Preferably, the basic information of the personal information includes basic information such as sex, age, height, weight, occupation history (including work units, departments (workshops), work types, harmful factors, protective measures and the like), tobacco and wine history, past history (including whether hypertension, diabetes, tuberculosis and the like exist, and time of the same) and the like of the person, and is used for assisting chest radiography analysis.

The occupational pneumoconiosis multi-modal analysis method based on deep learning provided by the invention has the following beneficial effects:

1. the individual difference of the operating personnel can be comprehensively considered by collecting the chest X-ray image information and the personal basic information of the personnel;

2. the adopted convolutional neural network algorithm has good classification performance, the information after considering the personal basic information word vector is one-dimensional data, the chest X-ray image information is two-dimensional data, corresponding one-dimensional convolutional networks and two-dimensional convolutional neural networks are respectively designed to extract the characteristics of multi-modal data, and an MM-CNN model suitable for analysis and diagnosis of occupational pneumoconiosis is constructed on the basis;

3. in the process of selecting the hyper-parameters of the MM-CNN model, a mixed frog-jumping algorithm is adopted to optimize the parameters of the 1D-CNN network layer number M, the 1D-CNN activation function, the 2D-CNN network layer number N, the 2D-CNN activation function, the optimizer and the learning rate of the MM-CNN occupational pneumoconiosis analysis and diagnosis model, so that the defect of manually selecting DF parameters is overcome;

4. the method can be used for rapidly detecting and analyzing the lung health condition and the occupational pneumoconiosis of special group personnel, so that accurate and real-time detection and analysis of the lung health of the personnel are realized, early warning of part of occupational pneumoconiosis is completed, and the life health of the personnel is guaranteed.

Drawings

In order to more clearly illustrate the embodiments of the present invention and the design thereof, the drawings required for the embodiments will be briefly described below. The drawings in the following description are only some embodiments of the invention and it will be clear to a person skilled in the art that other drawings can be derived from them without inventive effort.

FIG. 1 is a flowchart of a deep learning-based multimodal analysis method for occupational pneumoconiosis in embodiment 1 of the present invention;

FIG. 2 is a schematic diagram of a one-dimensional convolutional neural network structure;

FIG. 3 is a schematic diagram of a two-dimensional convolutional neural network structure;

FIG. 4 is a mixed frog leaping algorithm MM-CNN model hyper-parameter flow chart;

fig. 5 is a partial search flowchart.

Detailed Description

In order that those skilled in the art will better understand the technical solutions of the present invention and can practice the same, the present invention will be described in detail with reference to the accompanying drawings and specific examples. The following examples are only for illustrating the technical solutions of the present invention more clearly, and the protection scope of the present invention is not limited thereby.

Example 1

The invention provides a deep learning-based multi-modal analysis method for occupational pneumoconiosis, which is specifically shown in figure 1 and comprises the following steps: the method comprises the following steps:

s1, collecting chest X-ray image information and personal basic information of a person; the chest X-ray image acquisition system mainly comprises a chest X-ray image acquisition unit, a personal information acquisition unit and a chest X-ray image acquisition unit, wherein the chest X-ray image acquisition unit is mainly used for acquiring basic information of sex, age, height, weight, professional history (including work units, departments (workshops), work seeds, harmful factors, protective measures and the like), tobacco and wine history, past history (including whether hypertension, diabetes, tuberculosis and the like exist and time of the same) and the like of personnel by means of a direct Digital Radiography (DR) system;

during the acquisition process of the chest X-ray image information of a person, the following points need to be ensured:

firstly, the method comprises the following steps: requirements for camera position: when the chest is in a back-front standing position, the examinee should tightly attach the chest wall to the photographic frame, naturally separate the two feet, and rotate the two arms, so that the scapula does not overlap the lung field as much as possible;

secondly, the method comprises the following steps: requirements for source image distance: the source image distance is kept at 180 cm;

thirdly, the method comprises the following steps: requirements for focus: using a small focus;

fourthly: requirements for bulb position: adjusting the position of the bulb tube, wherein the center line is horizontal to the sixth thoracic vertebra;

fifth, the method comprises the following steps: requirements for exposure control: automatic exposure control is adopted, the exposure time is less than 100ms, and the exposure is carried out in a breath holding state after full air suction;

sixth: requirements for imaging voltage: the imaging voltage is controlled at 100-125 kV.

S2, carrying out word vectorization processing on the personal basic information;

specifically, in the word vectorization processing of the personal basic information, a skip-gram model in word2vec is adopted to perform word vectorization conversion of the personal basic information of the personnel, the size of a context window is set to be 10, the size of a word vector dimension is set to be 50, and the sampling size is set to be 1 e-3.

S3, respectively constructing a one-dimensional convolutional neural network 1D-CNN and a two-dimensional convolutional neural network 2D-CNN based on the personal information after word vectorization processing and the chest X-ray image information of the personnel, and establishing a multi-mode convolutional neural network MM-CNN model on the basis;

further, as shown in fig. 2, constructing a one-dimensional convolutional neural network 1D-CNN specifically includes: deepening the number of layers of the one-dimensional convolutional neural network is realized by connecting a plurality of one-dimensional convolutional units, the sizes of convolutional kernels are all set to be 3, when one convolutional layer is the L-th convolutional layer, the number of convolutional kernels is 8 multiplied by L, the one-dimensional pooling layer adopts a maximum pooling mode, and the pooling size is 2;

as shown in fig. 3, the two-dimensional convolutional neural network is constructed specifically as follows: the deepening of the number of layers of the two-dimensional convolutional neural network is realized by connecting a plurality of two-dimensional convolutional units, the sizes of convolutional kernels are all set to be 3 multiplied by 3, when one convolutional layer is the L-th convolutional layer, the number of the convolutional kernels is 8 multiplied by L, the two-dimensional pooling layer adopts a maximum pooling mode, and the pooling size is 2 multiplied by 2.

S4, taking the chest X-ray image information of the person and the personal information after word vectorization as the input of a multi-mode convolutional neural network MM-CNN model, establishing a multi-classification MM-CNN pneumoconiosis analysis model for occupational pneumoconiosis analysis, and forming an objective function under the condition of meeting corresponding constraints; specifically, the prediction accuracy of the MM-CNN analysis model is taken as an objective function.

S5, optimizing the hyper-parameters of the multi-classification MM-CNN pneumoconiosis analysis model by adopting a mixed frog-jumping algorithm SFLA;

in this embodiment, a mixed frog-leaping algorithm SFLA is used to optimize the hyper-parameters of the multi-classification MM-CNN pneumoconiosis analysis model, including the 1D-CNN network layer number M, the 1D-CNN activation function, the 2D-CNN network layer number N, the 2D-CNN activation function, the optimizer, and the learning rate, as shown in fig. 4, the specific steps are as follows:

s51: initializing a frog population;

s52: and (3) frog classification: sequencing the frogs in the population S according to the increasing sequence of the fitness, and recording the frog position P with the best fitness in the population S_xIs F (1);

s53: group division: division of cultural genres according to the formula

S54: cultural gene inheritance evolution: each cultural genome M^k(k-1, 2, …, m) evolved independently from the local search step.

Further, in this embodiment, as shown in fig. 5, the specific steps implemented in the local search process are as follows:

s541: defining a calculator: setting mi to be 0, wherein mi is a counter of the culture gene body, and marking the serial number of the current evolutionary culture gene body; and (5) setting ni as 0, wherein ni is a counter of the independent evolution times, and marking and comparing whether the independent evolution times of the current culture gene body is smaller than the maximum independent evolution times.

S542: initializing a calculator: mi is mi + 1;

s543: initializing a calculator: ni + 1;

s544: according to p_j2(n +1-j)/n (n +1), j 1,2, …, n to construct the subformulated gene:

s545: and (3) frog position updating: according to

Update the worst frog position and use F (q) ═ P_W+ L to calculate a new location F (q), and if F (q) is in the feasible region, calculating a new fitness f (q); otherwise, step S546 is executed. If the new fitness is better than the old fitness, i.e. a better result is generated, replacing the old f (q) with the new f (q), and proceeding to step S548; otherwise, the process proceeds to step S546.

s546: if step S544 fails to produce a good result, the worst frog position is updated again. According to

To calculate the jumping stepLong. If F (q) is in the feasible region, calculating new fitness f (q), otherwise, turning to the step S547; if the new fitness is better than the old fitness, i.e. a better result is produced, replacing the old f (q) with the new f (q), and performing step S548; otherwise, the step S547 is executed.

Wherein, P_XIs the global best position of the frog;

s547: randomly generating a new position of the frog: if the new position is not feasible and is not better than the old position, a new frog F (r) is randomly generated in the feasible region to replace the original frog, so as to stop the propagation of the defective cultural gene and calculate the fitness f (r);

s548: upgrading culture gene bodies: the worst frog in the sub-culture gene body is transferred and evolved to replace the worst frog in the culture gene body M^miAnd arranges M in descending order of fitness^mi；

S549: checking the evolution times: if ni is less than n, jumping to step S543 to perform next inheritance evolution;

s5410: checking culture gene number: if mi is less than m, jumping to step S542, and carrying out inheritance evolution of the next culture gene; otherwise, returning to the global search to mix cultural gene bodies.

S55: mixing culture gene bodies: after each cultural gene body is subjected to a round of local search, the population S is recombined, the population S is sorted in an increasing way according to fitness again, the optimal frogs in the population are updated, and the position P of the globally optimal frogs is recorded_x；

S56: and (3) checking a stopping condition: if the algorithm convergence condition is met, stopping the algorithm execution process; otherwise, return to step S53 is performed.

S6, analyzing chest X-ray image information and personal information after word vectorization processing of the personnel by adopting the optimized multi-classification MM-CNN pneumoconiosis analysis model, and outputting results of professional pneumoconiosis analysis.

The above-mentioned embodiments are only preferred embodiments of the present invention, and the scope of the present invention is not limited thereto, and any simple modifications or equivalent substitutions of the technical solutions that can be obviously obtained by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention.

Claims

1. A multi-modal analysis method for occupational pneumoconiosis based on deep learning is characterized by comprising the following steps:

performing word vectorization processing on the personal basic information;

constructing a one-dimensional convolutional neural network 1D-CNN and a two-dimensional convolutional neural network 2D-CNN, and establishing a multi-mode convolutional neural network MM-CNN model on the basis;

optimizing the hyper-parameters of the multi-classification MM-CNN pneumoconiosis analysis model by adopting a mixed frog-jumping algorithm SFLA;

2. The profound learning-based occupational pneumoconiosis multi-modal analysis method according to claim 1, wherein in the word vectorization processing of the personal basic information, a skip-gram model in word2vec is used for word vectorization conversion of the personal basic information, the size of a context window is set to 10, the size of a word vector dimension is set to 50, and the sampling size is set to 1 e-3.

3. The occupational pneumoconiosis multimodal analysis method based on deep learning of claim 1, wherein the constructing of the one-dimensional convolutional neural network 1D-CNN is specifically that the number of layers of the one-dimensional convolutional neural network is deepened by connecting a plurality of one-dimensional convolutional units, the sizes of convolutional kernels are all set to be 3, when one convolutional layer is the L-th convolutional layer, the number of convolutional kernels is 8 xL, the one-dimensional pooling layer adopts a maximum pooling mode, and the pooling size is 2;

4. The method of claim 1, wherein the MM-CNN model is established for multiple classifications of diagnosis and analysis of occupational pneumoconiosis, and the prediction accuracy of the MM-CNN model is used as an objective function.

5. The occupational pneumoconiosis multimodal analysis method based on deep learning of claim 1, wherein the mixed frogging SFLA (multiple classification of MM-CNN pneumoconiosis) algorithm is adopted to optimize the hyper-parameters of the multi-classification MM-CNN pneumoconiosis analysis model, and comprises a 1D-CNN network layer number M, a 1D-CNN activation function, a 2D-CNN network layer number N, a 2D-CNN activation function, an optimizer and a learning rate, and the method comprises the following specific steps:

initializing a frog population;

mixing culture gene bodies: in each cultural genomeAfter a round of local search, the population S is recombined, the population S is sorted in an increasing way according to the fitness again, the optimal frogs in the population are updated, and the position P of the globally optimal frogs is recorded_x；

6. The deep learning-based multimodal analysis method for occupational pneumoconiosis according to claim 5, wherein the local search process is implemented by the following specific steps:

and B: initializing a calculator: mi is mi + 1;

and C: initializing a calculator: ni + 1;

step E: and (3) frog position updating: according to

To calculate a jump step; if F (q) is in the feasible region, calculating new fitness f (q), otherwise, turning to the step G; if the new fitness is better than the old fitness, i.e. a better result is produced, replacing the old F (q) with the new F (q), and executing step H; otherwise, executing step G;

wherein, P_XIs the global best position of the frog;

7. The deep learning-based multimodal analysis method for occupational pneumoconiosis according to claim 1, wherein the personal information basic information comprises sex, age, height, weight, occupational history, smoking and drinking history, and past history information of the person.