CN111899191A - Text image restoration method and device and storage medium

Info

Publication number: CN111899191A
Application number: CN202010703681.3A
Authority: CN
Inventors: 侯青; 刘军
Original assignee: Wuhan Institute of Technology
Current assignee: Wuhan Institute of Technology
Priority date: 2020-07-21
Filing date: 2020-07-21
Publication date: 2020-11-06
Anticipated expiration: 2040-07-21
Also published as: CN111899191B

Abstract

The invention provides a text image restoration method, a text image restoration device and a storage medium. The method comprises the following steps: obtaining a plurality of original medical text images from image acquisition equipment, and randomly dividing them into an original training set image and an original test set image; performing image defect processing (removing part of each image) on the original training set image and the original test set image respectively to obtain a defect training set image and a defect test set image; constructing a training model, and training it on the defect training set image to obtain a final repair model; and repairing the defect test set image with the final repair model to obtain a repaired image result. The method overcomes the heavy workload, low efficiency and insufficient accuracy of traditional manual text image restoration, can repair text whose structure is partly missing from the image, and can restore incomplete text to a certain degree of readability, which is of considerable value for reading incomplete text.

Description

Text image restoration method and device and storage medium

Technical Field

The invention mainly relates to the technical field of image restoration, in particular to a text image restoration method, a text image restoration device and a storage medium.

Background

Image restoration originated in European art restoration, a process by which a skilled craftsman recovers damaged artwork. Because artwork is easily affected by human, environmental and material factors during preservation, damage such as cracks, losses, mildew, flaking and scratches is difficult to avoid. During the European Renaissance, medieval artworks were repaired in order to recover their damaged or lost parts and maintain the overall effect of the works. Such restoration was performed by professional artists, who repaired the damaged artwork according to the nature and degree of the damage, in the hope of restoring its original appearance. However, this approach carries great risk, because the artwork is extremely precious and the loss cannot be made good once an accident occurs.

Image inpainting is an important task in computer vision and has numerous applications in image-based rendering and computational photography. Its main difficulty is how to synthesize visually realistic and semantically reasonable pixels in the missing parts of an image, such that there is no visible incongruity between these pixels and the existing pixels.

Disclosure of Invention

The invention provides a text image restoration method, a text image restoration device and a storage medium, aiming at the defects of the prior art.

The technical scheme for solving the technical problems is as follows: a text image restoration method comprises the following steps:

obtaining a plurality of original medical text images from image acquisition equipment, and randomly dividing the original medical text images into an original training set image and an original test set image;

performing image defect processing (removing part of each image) on the original training set image and the original test set image respectively to obtain a defect training set image and a defect test set image;

constructing a training model, and training the training model according to the defect training set image to obtain a final repair model;

and repairing the defect test set image according to the final repair model to obtain a repaired image result.

Another technical solution of the present invention for solving the above technical problems is as follows: a text image restoration apparatus comprising:

the random classification processing module is used for obtaining a plurality of original medical text images from image acquisition equipment and randomly dividing the original medical text images into an original training set image and an original test set image;

the defect image processing module is used for performing image defect processing (removing part of each image) on the original training set image and the original test set image respectively to obtain a defect training set image and a defect test set image;

the training model processing module is used for constructing a training model and training the training model according to the defect training set image to obtain a final repairing model;

and the repair image result obtaining module is used for repairing the defect test set image according to the final repair model to obtain a repair image result.

The invention has the following beneficial effects: the plurality of original medical text images are randomly divided into an original training set image and an original test set image; image defect processing is performed on each to obtain a defect training set image and a defect test set image; a training model is constructed and trained on the defect training set image to obtain a final repair model, which further improves the image repair effect; and the defect test set image is repaired with the final repair model to obtain a repaired image result. This overcomes the heavy workload, low efficiency and insufficient accuracy of traditional manual text image restoration, makes it possible to repair text whose structure is partly missing from the image, and restores defective text to a certain degree of readability, which is of considerable value for reading defective text.

Drawings

Fig. 1 is a schematic flowchart of a text image restoration method according to an embodiment of the present invention;

Fig. 2 is a block diagram of a text image restoration apparatus according to an embodiment of the present invention.

Detailed Description

The principles and features of this invention are described below in conjunction with the following drawings, which are set forth by way of illustration only and are not intended to limit the scope of the invention.

Fig. 1 is a schematic flow chart of a text image restoration method according to an embodiment of the present invention.

As shown in fig. 1, a text image restoration method includes the following steps:

Specifically, the original medical text images are Chinese character images cropped from medical text images. Most images contain 1 to 4 characters, and there are 56867 images in total. The image corresponding to a given Chinese character is not unique: the same character may correspond to several images. Because Chinese characters are not used equally often in medical text, more frequently used characters yield correspondingly more images, so the classes do not all reach the same scale. All images are placed in the same folder and randomly selected from it to form two parts, one used as the original training images and the other as the original test images. The number of original test images is 20% of the total number of images, and shuffling the images beforehand strengthens the randomness.

In this embodiment, the plurality of original medical text images are randomly divided into an original training set image and an original test set image; image defect processing is performed on each to obtain a defect training set image and a defect test set image; a training model is constructed and trained on the defect training set image to obtain a final repair model, which further improves the image repair effect; and the defect test set image is repaired with the final repair model to obtain a repaired image result. This overcomes the heavy workload, low efficiency and insufficient accuracy of traditional manual text image restoration, makes it possible to repair text whose structure is partly missing from the image, and restores defective text to a certain degree of readability, which is of considerable value for reading defective text.
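By way of illustration only, the random division described above can be sketched as follows. This is a minimal sketch, not the implementation of the embodiment: the function name `split_images`, the seed and the file names are hypothetical, and only the total of 56867 images and the 20% test share come from the description.

```python
import random

def split_images(image_paths, test_fraction=0.2, seed=None):
    """Shuffle the image paths and split them into (train, test) lists."""
    rng = random.Random(seed)
    shuffled = list(image_paths)   # copy so the caller's list is untouched
    rng.shuffle(shuffled)          # shuffle before splitting
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]

# Hypothetical file names standing in for the 56867 character images.
paths = [f"img_{i:05d}.png" for i in range(56867)]
train_paths, test_paths = split_images(paths, test_fraction=0.2, seed=0)
print(len(train_paths), len(test_paths))
```

With 56867 images this yields 11373 test images and 45494 training images. Fixing the seed is an added convenience for reproducibility and is not stated in the description.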

Optionally, as an embodiment of the present invention, the process of performing image defect processing on the original training set image and the original test set image respectively to obtain a defect training set image and a defect test set image includes:

adding a preset binary patch at a random position on each image in the original training set image to obtain a defective training set image;

and adding a preset binary patch at a random position on each image in the original test set image to obtain a defect test set image.

In the above embodiment, a preset binary patch is added at a random position on each image in the original training set image to obtain the defect training set image, and likewise on each image in the original test set image to obtain the defect test set image. Choosing the positions at random keeps the data objective and reduces human factors, which effectively improves the accuracy of the subsequent repair model. This overcomes the heavy workload, low efficiency and insufficient accuracy of traditional manual text image restoration, makes it possible to repair text whose structure is partly missing from the image, and restores defective text to a certain degree of readability, which is of considerable value for reading defective text.
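The patching step can be illustrated with a short sketch. The representation is an assumption made for the example: a grayscale image is held as a list of rows, the patch is a rectangle of preset size, missing pixels are filled with 0, and the function name `add_binary_patch` is hypothetical.

```python
import random

def add_binary_patch(image, patch_h, patch_w, rng, fill=0):
    """Return (damaged_image, mask) for a 2D grayscale image given as a list of rows.

    mask[r][c] is 1 inside the patch (missing pixels) and 0 elsewhere.
    """
    height, width = len(image), len(image[0])
    top = rng.randint(0, height - patch_h)    # randint is inclusive at both ends
    left = rng.randint(0, width - patch_w)
    damaged = [row[:] for row in image]       # leave the original image untouched
    mask = [[0] * width for _ in range(height)]
    for r in range(top, top + patch_h):
        for c in range(left, left + patch_w):
            damaged[r][c] = fill
            mask[r][c] = 1
    return damaged, mask

rng = random.Random(0)
image = [[255] * 8 for _ in range(8)]         # toy 8 x 8 image
damaged, mask = add_binary_patch(image, patch_h=3, patch_w=4, rng=rng)
```

Returning the mask alongside the damaged image keeps a record of which pixels are missing, so that the same routine can serve both the training set and the test set.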

Optionally, as an embodiment of the present invention, the constructing a training model, and training the training model according to the defect training set image to obtain a final repair model includes:

constructing a generative adversarial network, and training the generative adversarial network with the defect training set image to obtain a basic repair model;

and optimizing the basic repair model by using an attention mechanism optimization algorithm to obtain a final repair model.

Specifically, an attention mechanism is added to the network to repair Chinese characters whose structure is partly missing from the image. Because Chinese characters are composed of strokes, the attention mechanism is introduced in the repair process so that the network attends to the missing information in a stroke and to the remaining information of the strokes originally connected to it, which improves the repair effect of the repair model. The generative adversarial network model is divided into a repair network and a discrimination network, both defined as convolutional neural networks, corresponding to two parts: a generator (G) and a discriminator (D). The generator receives random noise (an initial distribution) as input, generates an image from the noise, and passes it to the discriminator. The discriminator is in essence a binary classifier that judges whether the input data comes from real data or was generated by the G network; the final result it returns is the probability that the image comes from the training data. If the image comes from the training data, D should return a value close to 1; otherwise it should return a value close to 0, indicating that the image is fake, that is, generated by the G network. Therefore, the basic repair model obtained by the text image restoration method based on the attention mechanism and the generative adversarial network can effectively improve the accuracy and effect of image repair, and the final repair model obtained by optimizing the basic repair model with the attention mechanism further improves the repair effect and can restore incomplete text to a certain degree of readability.

In the above embodiment, the basic repair model is obtained by training the generative adversarial network with the defect training set image, and the final repair model is obtained by optimizing the basic repair model with an attention mechanism optimization algorithm. This effectively improves the accuracy and effect of image repair, overcomes the heavy workload, low efficiency and insufficient accuracy of traditional manual text image restoration, makes it possible to repair text whose structure is partly missing from the image, and restores incomplete text to a certain degree of readability, which is of considerable value for reading incomplete text.
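The behaviour expected of the discriminator (a value close to 1 for training data, close to 0 for generated images) can be expressed as a binary cross-entropy objective. The description does not state which loss function is used, so the following is a sketch under the assumption of the standard binary cross-entropy loss; the function name and the sample probabilities are hypothetical.

```python
import math

def discriminator_loss(d_real, d_fake, eps=1e-12):
    """Mean binary cross-entropy with target 1 for real images and 0 for generated ones.

    d_real and d_fake are lists of probabilities returned by D.
    """
    real_term = [-math.log(max(p, eps)) for p in d_real]
    fake_term = [-math.log(max(1.0 - p, eps)) for p in d_fake]
    return (sum(real_term) + sum(fake_term)) / (len(d_real) + len(d_fake))

# A discriminator that separates real from generated images well ...
good = discriminator_loss(d_real=[0.9, 0.95], d_fake=[0.1, 0.05])
# ... versus one that cannot tell them apart and always answers 0.5.
undecided = discriminator_loss(d_real=[0.5, 0.5], d_fake=[0.5, 0.5])
```

Under this assumed loss, the well-separating discriminator scores lower than the undecided one, whose loss equals log 2.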

Optionally, as an embodiment of the present invention, the training of the generative adversarial network with the defect training set image to obtain a basic repair model includes:

S1: inputting the defect training set image into the generative adversarial network and training for a preset number of training iterations to obtain a first repair model, wherein the first repair model comprises a first repair network and a first discrimination network;

S2: repairing the defect training set image through the first repair network to obtain a first intermediate repaired image;

S3: performing discrimination processing on the first intermediate repaired image through the first discrimination network to obtain a first predicted value;

S4: judging whether the first predicted value is greater than or equal to a preset judgment value; if so, taking the first repair model as the basic repair model; otherwise, going to step S5;

S5: inputting the defect training set image into the generative adversarial network and training for the preset number of training iterations to obtain a second repair model, wherein the second repair model comprises a second repair network and a second discrimination network;

S6: repairing the defect training set image through the second repair network to obtain a second intermediate repaired image;

S7: performing discrimination processing on the second intermediate repaired image through the second discrimination network to obtain a second predicted value;

S8: judging whether the second predicted value is greater than or equal to the preset judgment value; if so, taking the second repair model as the basic repair model; otherwise, returning to step S1.

It should be understood that the generative adversarial network architecture is divided into a repair network and a discrimination network, and both networks are defined as convolutional neural networks.

In the above embodiment, the first repair model and the second repair model are each obtained from the defect training set image and the generative adversarial network model, and the basic repair model is selected according to whether the predicted value reaches the preset judgment value. The expected repair model is thus obtained while high repair efficiency is maintained, which makes text image repair stable and reliable.
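Steps S1 to S8 amount to repeating a train, repair and discriminate cycle until the predicted value reaches the preset judgment value. The control flow can be sketched as below. The three callables, the toy stand-ins and the `max_rounds` safety limit are assumptions introduced for the example and are not part of the described method.

```python
def train_until_accepted(train_round, repair, discriminate, defect_images,
                         threshold, max_rounds=100):
    """Repeat train -> repair -> discriminate until the prediction reaches the threshold."""
    for round_index in range(1, max_rounds + 1):
        model = train_round(defect_images)            # S1 / S5: train for the preset iterations
        repaired = repair(model, defect_images)       # S2 / S6: intermediate repaired images
        predicted = discriminate(model, repaired)     # S3 / S7: predicted value
        if predicted >= threshold:                    # S4 / S8: compare with the judgment value
            return model, round_index
    raise RuntimeError("threshold not reached within max_rounds")

# Toy stand-ins: each training round raises a fictitious quality score by 0.3.
state = {"quality": 0.0}

def toy_train(images):
    state["quality"] += 0.3
    return dict(state)

def toy_repair(model, images):
    return images

def toy_discriminate(model, repaired):
    return min(model["quality"], 1.0)

model, rounds = train_until_accepted(toy_train, toy_repair, toy_discriminate,
                                     ["a.png", "b.png"], threshold=0.8)
```

In this toy run the threshold of 0.8 is first reached in the third round. A real training routine would continue from the previous weights in each round, which the stateful `toy_train` merely imitates.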

Optionally, as an embodiment of the present invention, the optimizing the basic repair model by using an attention mechanism optimization algorithm to obtain a final repair model includes:

the basic repair model comprises an original first-stage repair network, an original second-stage repair network and a discrimination network, wherein the original first-stage repair network comprises a 3 × 3 convolution layer, a down-sampling layer, a dilated convolution layer and an up-sampling layer;

adding a preset 1 × 1 convolution layer in front of the 3 × 3 convolution layer to obtain a first-stage repair network;

performing network branch modification processing on the original second-stage repair network by using an attention mechanism algorithm to obtain a second-stage repair network;

and combining the first-stage repair network, the second-stage repair network and the discrimination network in sequence to obtain a final repair model.

It should be appreciated that the first-stage repair network changes the convolution layer structure that precedes the dilated convolution layer in the original first-stage repair network by adding several 1 × 1 convolution layers, which increases the non-linearity and reduces the training time.

Specifically, the original first-stage repair network includes a 3 × 3 convolution layer, a down-sampling layer, a dilated convolution layer and an up-sampling layer, and the first-stage repair network obtained by adding the 1 × 1 convolution layer includes the 1 × 1 convolution layer, the 3 × 3 convolution layer, the down-sampling layer, the dilated convolution layer for expanding the receptive field, and the up-sampling layer for restoring and outputting the picture.
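Why a dilated convolution layer expands the receptive field can be shown with a small calculation. A k × k kernel with dilation d spans k + (k - 1)(d - 1) pixels per side, and stride-1 layers applied in sequence each add that span minus one to the receptive field. The dilation rates used below are assumptions chosen for illustration; the description does not specify them.

```python
def effective_kernel(kernel_size, dilation):
    """Side length covered by a dilated kernel: k + (k - 1) * (d - 1)."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)

def stacked_receptive_field(kernel_size, dilations):
    """Receptive field of stride-1 convolutions applied one after another."""
    field = 1
    for dilation in dilations:
        field += effective_kernel(kernel_size, dilation) - 1
    return field

plain = stacked_receptive_field(3, [1, 1, 1, 1])      # four ordinary 3 x 3 layers
dilated = stacked_receptive_field(3, [2, 4, 8, 16])   # four dilated 3 x 3 layers (assumed rates)
print(plain, dilated)
```

Four ordinary 3 × 3 layers see a 9 × 9 neighbourhood, whereas the four dilated layers in this example see 61 × 61 with the same number of weights per layer.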

In the above embodiment, the first-stage repair network is obtained by adding a preset 1 × 1 convolution layer in front of the 3 × 3 convolution layer; the second-stage repair network is obtained by modifying the network branches of the original second-stage repair network with an attention mechanism algorithm; and the final repair model is obtained by combining the first-stage repair network, the second-stage repair network and the discrimination network in sequence. This increases the non-linearity, reduces the training time and effectively improves the accuracy and effect of image repair. It also overcomes the heavy workload, low efficiency and insufficient accuracy of traditional manual text image restoration, makes it possible to repair text whose structure is partly missing from the image, and restores incomplete text to a certain degree of readability, which is of considerable value for reading incomplete text.

Optionally, as an embodiment of the present invention, the process of performing network branch modification processing on the original second-stage repair network by using an attention mechanism algorithm to obtain a second-stage repair network includes:

the original second-stage repair network comprises two convolution branches;

performing convolution branch modification on either convolution branch in the original second-stage repair network by using an attention mechanism algorithm to obtain an attention branch;

and combining the attention branch and the other convolution branch to obtain a second-stage repair network.

Specifically, an attention mechanism is introduced to modify one branch. One convolution branch follows the network structure that precedes the dilated convolution, using a combination of convolution and down-sampling layers, with a dilated convolution layer added at the end. In the other branch, an attention layer replaces the dilated convolution and two convolution layers are added. The output results of the two branches are merged, and operations such as convolution and up-sampling layers restore the picture to the input size, giving the final repair result image.

In the above embodiment, either convolution branch in the original second-stage repair network is modified with an attention mechanism algorithm to obtain an attention branch, and the attention branch is combined with the other convolution branch to obtain the second-stage repair network. This effectively improves the accuracy and effect of image repair, overcomes the heavy workload, low efficiency and insufficient accuracy of traditional manual text image restoration, makes it possible to repair text whose structure is partly missing from the image, and restores incomplete text to a certain degree of readability, which is of considerable value for reading incomplete text.
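The description does not give the internal form of the attention layer. Purely as an assumed illustration, one common form lets a feature at a missing position borrow from features at known positions, weighted by a softmax over cosine similarity. The function name, the `scale` factor and the toy feature vectors below are all hypothetical.

```python
import math

def attend(query, keys, values, scale=10.0):
    """Softmax-weighted average of `values`, weighted by cosine similarity of `query` to each key."""
    def cosine(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
        return dot / norm if norm else 0.0

    scores = [scale * cosine(query, key) for key in keys]
    peak = max(scores)                                # subtract the max for numerical stability
    weights = [math.exp(score - peak) for score in scores]
    total = sum(weights)
    weights = [w / total for w in weights]
    dim = len(values[0])
    output = [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]
    return output, weights

# Toy features: two known positions resemble the query (same stroke), one does not.
known = [[1.0, 0.0], [0.0, 1.0], [0.9, 0.1]]
output, weights = attend(query=[1.0, 0.05], keys=known, values=known)
```

In this toy case the dissimilar second position receives far less weight than the other two, which mirrors the stated aim of attending to the remaining information of connected strokes.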

Optionally, as another embodiment of the present invention, a batch of training samples is randomly selected from the training set, and a binary patch is added at a random position on each selected sample image to obtain a damaged image. In the network, the image size is set to 128 × 128, the maximum patch size to 64 × 64, and the batch size to 16. For repair, the damaged image and the binary patch together serve as the input data, which is fed into the repair model. The model comprises a generator and a discriminator, and the generator contains two repair stages. The input data first enters the first-stage repair network, where the required features are obtained through steps such as convolution and deconvolution, and a preliminarily repaired image is produced from these features. This image is then input into the second-stage repair network to complete the subsequent repair task. To improve repair efficiency, an attention mechanism is applied, combining a convolution branch and an attention branch: one branch follows the network structure that precedes the dilated convolution, using a combination of convolution and down-sampling layers with a dilated convolution layer added at the end; in the other branch, an attention layer replaces the dilated convolution and two convolution layers are added. After the output results of the two branches are merged, operations such as convolution and up-sampling layers restore the image to the input size, giving the final repair result image. This output is the repair result of the whole network, and a clear repaired image can be obtained.
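The batch preparation in this embodiment can be sketched with the stated sizes (128 × 128 images, patches up to 64 × 64, batch size 16). Drawing the patch height and width at random up to the maximum, filling missing pixels with 0 and the function name `make_batch` are assumptions made for the example.

```python
import random

IMAGE_SIZE = 128   # image size from the description
MAX_PATCH = 64     # maximum patch size from the description
BATCH_SIZE = 16    # batch size from the description

def make_batch(images, rng):
    """Pick BATCH_SIZE images and pair each damaged image with its binary mask."""
    batch = []
    for image in rng.sample(images, BATCH_SIZE):
        patch_h = rng.randint(1, MAX_PATCH)            # random patch size (assumption)
        patch_w = rng.randint(1, MAX_PATCH)
        top = rng.randint(0, IMAGE_SIZE - patch_h)
        left = rng.randint(0, IMAGE_SIZE - patch_w)
        mask = [[1 if top <= r < top + patch_h and left <= c < left + patch_w else 0
                 for c in range(IMAGE_SIZE)] for r in range(IMAGE_SIZE)]
        damaged = [[0 if mask[r][c] else image[r][c] for c in range(IMAGE_SIZE)]
                   for r in range(IMAGE_SIZE)]
        batch.append((damaged, mask))                  # two input channels per sample
    return batch

rng = random.Random(0)
images = [[[200] * IMAGE_SIZE for _ in range(IMAGE_SIZE)] for _ in range(20)]  # toy data
batch = make_batch(images, rng)
```

Each element of `batch` holds the damaged image and its mask, corresponding to the damaged image and binary patch that the description feeds to the repair model together.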

Alternatively, as another embodiment of the present invention, as shown in fig. 2, a text image restoration apparatus includes:

Optionally, as an embodiment of the present invention, the defect image processing module is specifically configured to:

adding a preset binary patch at a random position on each image of the original training set image and of the original test set image, respectively, to obtain a defect training set image and a defect test set image, wherein the defect training set image and the defect test set image both carry the preset binary patch.

Alternatively, another embodiment of the present invention provides a text image restoration apparatus, including a memory, a processor, and a computer program stored in the memory and executable on the processor, and when the processor executes the computer program, the text image restoration method as described above is implemented. The device may be a computer or the like.

Alternatively, another embodiment of the present invention provides a computer-readable storage medium storing a computer program which, when executed by a processor, implements the text image inpainting method as described above.

It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.

In the several embodiments provided in the present application, it should be understood that the disclosed apparatus and method may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative: the division into units is only a logical division, and an actual implementation may divide them differently; for instance, a plurality of units or components may be combined or integrated into another system, or some features may be omitted or not executed.

Units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment of the present invention.

In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.

The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer-readable storage medium. On this understanding, the part of the technical solution of the present invention that in essence contributes over the prior art, or all or part of the technical solution, can be embodied in the form of a software product that is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. The aforementioned storage medium includes a USB flash drive, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, an optical disk, and various other media capable of storing program code.

While the invention has been described with reference to specific embodiments, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims

1. A text image restoration method is characterized by comprising the following steps:

2. The method according to claim 1, wherein the process of performing image defect processing on the original training set image and the original test set image respectively to obtain a defect training set image and a defect test set image comprises:

3. The method according to claim 1, wherein the process of constructing a training model and training the training model according to the defect training set image to obtain a final repair model comprises:

4. The method according to claim 3, wherein the training of the generative adversarial network with the defect training set image to obtain a basic repair model comprises:

5. The method according to claim 3, wherein the optimizing the basic repairing model by using the attention mechanism optimizing algorithm to obtain the final repairing model comprises:

6. The method according to claim 5, wherein the performing network branch modification processing on the original second-stage repair network by using the attention mechanism algorithm to obtain a second-stage repair network comprises:

the original second-stage repair network comprises two convolution branches;

7. A text image restoration apparatus, comprising:

8. The text image restoration apparatus according to claim 7, wherein the defect image processing module is specifically configured to:

9. A text image restoration apparatus comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, characterized in that when the computer program is executed by the processor, the text image restoration method according to any one of claims 1 to 6 is implemented.

10. A computer-readable storage medium, in which a computer program is stored, which, when being executed by a processor, carries out a text image inpainting method according to any one of claims 1 to 6.