WO2021139175A1 - Electric power operation ticket character recognition method based on convolutional neural network - Google Patents

Electric power operation ticket character recognition method based on convolutional neural network Download PDF

Info

Publication number
WO2021139175A1
Authority
WO
WIPO (PCT)
Prior art keywords
neural network
convolutional neural
feature
handwriting
power operation
Prior art date
Application number
PCT/CN2020/111550
Other languages
French (fr)
Chinese (zh)
Inventor
罗麟
位一鸣
苗晓君
张引贤
熊安
Original Assignee
国网浙江省电力有限公司舟山供电公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 国网浙江省电力有限公司舟山供电公司
Publication of WO2021139175A1


Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/22Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • G06F18/253Fusion techniques of extracted features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Definitions

  • The invention relates to a character recognition method, and in particular to a power operation ticket character recognition method based on a convolutional neural network.
  • Traditional text recognition methods based on convolutional neural networks directly use a CNN to train on a sample set of handwritten Chinese character images and thereby obtain a text classification model.
  • Such methods use only the CNN to learn spatial image features; training is efficient and the design is simple.
  • However, the samples available to the CNN during training are limited, so it cannot learn every possible handwriting style; its network structure is simple, its image representation capacity is limited, and training is prone to overfitting. The recognition performance of such methods needs further improvement.
  • Text recognition methods based on improved convolutional neural networks have also appeared. They enrich the handwritten font sample set with text deformation methods such as the image deformation network GTN, affine deformation (AD) and elastic deformation (ED) to overcome the sample limitation.
  • They also use fractional pooling (FMP), the DropSample training method, relaxed convolutional neural networks (R-CNN) and similar methods to optimize CNN performance and to address training overfitting and weak spatial feature expression.
  • However, such methods only consider sample set augmentation and network structure optimization, and ignore deeper, fine-grained handwriting features of handwritten fonts, such as changes in stroke direction, so the accuracy of text recognition in power operation ticket images remains low.
  • The technical problem to be solved and the technical task proposed by the present invention are to perfect and improve the existing technical scheme by providing a convolutional-neural-network-based method for recognizing power operation ticket text, so as to improve text recognition accuracy.
  • To this end, the present invention adopts the following technical solutions.
  • A method for recognizing power operation ticket characters based on a convolutional neural network, characterized in that it comprises the following steps:
  • In step 1), for each sample image p i (p i ∈ c, 1 ≤ i ≤ N) in the training set, where N is the total number of sample images contained in the training set c, the M×M numerical matrix A i of p i and the corresponding clear-image numerical matrix B i are obtained.
  • In step 2), a three-layer convolutional neural network model C 0 is constructed that contains only convolutional layers; the ReLU activation function is selected, the stride is set to 1, and no zero padding is applied to the convolution operation. The network structure is:
  • In step 3), the loss function is defined.
  • The purpose of the loss function is to obtain the minimum F norm (Frobenius norm).
  • The calculation formula is as follows:
  • λ = {W j , b j };
  • W j = {W j k : 1 ≤ k ≤ n j }, where W j k is the convolution matrix of the j-th layer of the convolutional network, b j is the bias value, and n j is the number of convolution kernels in the j-th layer of the convolutional network.
  • In step 5), the nonlinear mapping function F λ (p) is used to compute, for each sample image p i in the training set c, the corresponding enhanced image P i , yielding a new training set C (P i ∈ C, 1 ≤ i ≤ N).
  • Step 6) includes the following steps:
  • The imaginary stroke feature extracts the degree of change in handwriting direction. For each pair of adjacent pixels, the calculation formula is as follows:
  • The path signature feature extracts the handwriting curvature value. Given a handwriting stroke over the interval [s, t], its curvature feature is defined as follows:
  • The 8-direction feature fits the horizontal, vertical, left-falling and right-falling strokes of Chinese characters well and can be used to extract handwriting direction information. Given a two-dimensional coordinate system, the 8-direction features compute the handwriting gradient magnitude along 0°, 45°, 90°, 135°, 180°, 225°, 270° and 315°. Given the start and end coordinates (x 1 , y 1 ) and (x 2 , y 2 ) of a handwriting segment, the gradient calculation formula is as follows:
  • In the integrated convolutional neural network model C 1 , the convolution kernel size of the first convolutional layer is set to 3×3 with 80 kernels, and the number of kernels increases by 80 in each subsequent layer;
  • The convolution kernel size of the second to sixth convolutional layers is set to 2×2; the ReLU activation function is selected, zero-padded convolution is used, and the stride is 1; each pooling layer uses a 2×2 matrix.
  • The structure of the integrated convolutional neural network model C 1 is:
  • n is the number of handwriting feature dimensions.
  • The integrated convolutional neural network model is used to train the power operation ticket text recognition model, which effectively improves the accuracy of text recognition.
  • FIG. 1 is a flowchart of the present invention.
  • The present invention includes the following steps:
  • Step 1: For each sample image p i (p i ∈ c, 1 ≤ i ≤ N), where N is the total number of sample images contained in the training set c, obtain the M×M numerical matrix A i of p i and the corresponding clear-image numerical matrix B i .
  • Step 2: Construct a three-layer convolutional neural network model C 0 containing only convolutional layers; select the ReLU activation function, set the stride to 1, and apply no zero padding to the convolution operation.
  • The network structure is shown in Table 1.
  • Step 3: Define the loss function. Compared against B i , the purpose of the loss function is to obtain the minimum F norm.
  • The calculation formula is as follows:
  • Step 4: Traverse all the sample images in the training set c and train the output image enhancement function, that is, the nonlinear mapping function F λ (p).
  • Step 1 (recognition stage): Using the nonlinear mapping function F λ (p), compute for each sample image p i in the training set c the corresponding enhanced image P i , yielding a new training set C (P i ∈ C, 1 ≤ i ≤ N).
  • The imaginary stroke feature extracts the degree of change in handwriting direction. For each pair of adjacent pixels, the calculation formula is as follows:
  • The dcd values of different pixels are compared to compute the imaginary stroke matrix of P i .
  • The path signature feature extracts the handwriting curvature value.
  • Given a handwriting stroke over the interval [s, t], its curvature feature is defined as follows:
  • The value of k may be arbitrary, but it should normally not be too large; otherwise the computational complexity grows exponentially without yielding more effective handwriting features.
  • For each pair of adjacent pixels, the path signature matrix of P i is computed.
  • The 8-direction feature fits the horizontal, vertical, left-falling and right-falling strokes of Chinese characters well and can be used to extract handwriting direction information. Given a two-dimensional coordinate system, the 8-direction features compute the handwriting gradient magnitude along 0°, 45°, 90°, 135°, 180°, 225°, 270° and 315°. Given the start and end coordinates (x 1 , y 1 ) and (x 2 , y 2 ) of a handwriting segment, the gradient calculation formula is as follows:
  • Step 3 (recognition stage): Construct an integrated convolutional network model C 1 containing 6 convolutional layers.
  • Each of the first 5 convolutional layers is followed by a pooling layer, and the 6th convolutional layer is followed by a fully connected layer.
  • The convolution kernel size of the first convolutional layer is set to 3×3 with 80 kernels, and the number of kernels increases by 80 in each subsequent layer; the convolution kernel size of the second to sixth convolutional layers is set to 2×2; the ReLU activation function is selected, zero-padded convolution is used, and the stride is 1.
  • Each pooling layer uses a 2×2 matrix.
  • The model structure is shown in Table 2, where n represents the number of handwriting feature dimensions.
  • Step 4 (recognition stage): Traverse all the sample images in the training set C and, combining the imaginary stroke features, path signature features and 8-direction features, train and output a text recognition model.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Multimedia (AREA)
  • Character Discrimination (AREA)

Abstract

Disclosed is an electric power operation ticket character recognition method based on a convolutional neural network, relating to the field of character recognition. At present, character recognition of electric power operation tickets is hampered by unclear ticket images. The method in the present invention comprises the steps of: constructing a convolutional neural network model with only three convolutional layers, no pooling layer and no fully connected layer, training it to obtain a nonlinear mapping function, and improving the peak signal-to-noise ratio of an image; using a handwriting feature calculation method to respectively obtain imaginary stroke features, path signature features and 8-direction features of the image characters of an electric power operation ticket; and constructing an integrated convolutional neural network model with six convolutional layers, five pooling layers and one fully connected layer and, in combination with the imaginary stroke features, the path signature features and the 8-direction features, training it to obtain a character recognition model. The present technical solution exploits the excellent spatial feature learning ability of convolutional neural networks and combines a convolutional-neural-network-based image enhancement method with a convolutional-neural-network-based character recognition method, so that the accuracy of image character recognition of electric power operation tickets is improved.

Description

A Method for Recognizing Power Operation Ticket Characters Based on a Convolutional Neural Network

Technical Field

The invention relates to a character recognition method, and in particular to a power operation ticket character recognition method based on a convolutional neural network.

Background Art

Traditional text recognition methods based on convolutional neural networks directly use a CNN to train on a sample set of handwritten Chinese character images and thereby obtain a text classification model. Such methods use only the CNN to learn spatial image features; training is efficient and the design is simple. However, the samples available during training are limited, so the CNN cannot learn every possible handwriting style; its network structure is simple, its image representation capacity is limited, and training is prone to overfitting. The recognition performance of such methods needs further improvement.

To further improve recognition accuracy, text recognition methods based on improved convolutional neural networks have appeared. They enrich the handwritten font sample set with text deformation methods such as the image deformation network GTN, affine deformation (AD) and elastic deformation (ED) to overcome the sample limitation, and they optimize CNN performance with methods such as fractional pooling (FMP), the DropSample training method and relaxed convolutional neural networks (R-CNN) to address training overfitting and weak spatial feature expression. However, these methods only consider sample set augmentation and network structure optimization, and ignore deeper, fine-grained handwriting features of handwritten fonts, such as changes in stroke direction, so the accuracy of text recognition in power operation ticket images remains low.

Summary of the Invention

The technical problem to be solved and the technical task proposed by the present invention are to perfect and improve the existing technical scheme by providing a convolutional-neural-network-based method for recognizing power operation ticket text, so as to improve text recognition accuracy. To this end, the present invention adopts the following technical solutions.
A method for recognizing power operation ticket characters based on a convolutional neural network, characterized in that it comprises the following steps:
1) obtaining sample images to form a training set;
2) constructing a convolutional neural network model C 0 having only 3 convolutional layers, no pooling layer and no fully connected layer;
3) defining the loss function of C 0 ;
4) training to obtain the nonlinear mapping function F λ (p): traversing all sample images in the training set c and training the output image enhancement function, that is, the nonlinear mapping function F λ (p);
5) based on the nonlinear mapping function F λ (p), computing a high peak signal-to-noise ratio image of an output image p;
6) using the handwriting feature calculation method, computing the imaginary stroke features, path signature features and 8-direction features of the high peak signal-to-noise ratio image;
7) constructing an integrated convolutional neural network model C 1 having 6 convolutional layers, 5 pooling layers and 1 fully connected layer;
8) traversing all sample images in the training set and, combining the imaginary stroke features, path signature features and 8-direction features, training to obtain a power operation ticket text recognition model;
9) obtaining the power operation ticket to be recognized and performing text recognition with the power operation ticket text recognition model.
As a preferred technical means: in step 1), for each sample image p i (p i ∈ c, 1 ≤ i ≤ N) in the training set, where N is the total number of sample images contained in the training set c, the M×M numerical matrix A i of p i and the corresponding clear-image numerical matrix B i are obtained.
As a preferred technical means: in step 2), a three-layer convolutional neural network model C 0 is constructed that contains only convolutional layers; the ReLU activation function is selected, the stride is set to 1, and no zero padding is applied to the convolution operation. The network structure is:
[The structure of C 0 is given in the published application as drawing PCTCN2020111550-appb-000001.]
As a preferred technical means: in step 3), the loss function is defined. The purpose of the loss function is to obtain the minimum F norm. The calculation formula is as follows:
[The loss function formula is given in the published application as drawing PCTCN2020111550-appb-000002.]
where λ = {W j , b j }; W j = {W j k : 1 ≤ k ≤ n j }, W j k is the convolution matrix of the j-th layer of the convolutional network, b j is the bias value, and n j is the number of convolution kernels in the j-th layer of the convolutional network.
As a preferred technical means: in step 5), the nonlinear mapping function F λ (p) is used to compute, for each sample image p i in the training set c, the corresponding enhanced image P i , yielding a new training set C (P i ∈ C, 1 ≤ i ≤ N).
As a preferred technical means, step 6) includes the steps:
a) The imaginary stroke feature extracts the degree of change in handwriting direction. For each pair of adjacent pixels, the calculation formula is as follows:
[The dcd formula is given in the published application as drawing PCTCN2020111550-appb-000003.]
where θ is the angle formed between connected strokes (-180 ≤ θ ≤ 180), l is the stroke length, [a further definition is given as drawing PCTCN2020111550-appb-000004] and w = 1/8; the dcd values of different pixels are compared to compute the imaginary stroke matrix of P i ;
b) The path signature feature extracts the handwriting curvature value. Given a handwriting stroke over the interval [s, t], its curvature feature is defined as follows:
[The path signature formula is given in the published application as drawing PCTCN2020111550-appb-000005.]
If k = 0, the 0-fold iterated integral evaluates to 1 and represents the binary image feature of the handwriting; if k = 1, the 1-fold iterated integral represents the displacement feature of the handwriting; if k = 2, the 2-fold iterated integral represents the curvature feature of the handwriting. The value of k may be arbitrary, but it should normally not be too large; otherwise the computational complexity grows exponentially without yielding more effective handwriting features. For each pair of adjacent pixels, the path signature matrix of P i is computed;
c) The 8-direction feature fits the horizontal, vertical, left-falling and right-falling strokes of Chinese characters well and can be used to extract handwriting direction information. Given a two-dimensional coordinate system, the 8-direction features compute the handwriting gradient magnitude along 0°, 45°, 90°, 135°, 180°, 225°, 270° and 315°. Given the start and end coordinates (x 1 , y 1 ) and (x 2 , y 2 ) of a handwriting segment, the gradient calculation formula is as follows:
[The gradient formulas are given in the published application as drawings PCTCN2020111550-appb-000006 and PCTCN2020111550-appb-000007.]
where d x = |x 2 - x 1 |, d y = |y 2 - y 1 |, [a further definition is given as drawing PCTCN2020111550-appb-000008]; for each pair of adjacent pixels, the 8-direction matrix of P i is computed.
As a preferred technical means: in step 7), the convolution kernel size of the first convolutional layer of the integrated convolutional neural network model C 1 is set to 3×3 with 80 kernels, and the number of kernels increases by 80 in each subsequent layer; the convolution kernel size of the second to sixth convolutional layers is set to 2×2; the ReLU activation function is selected, zero-padded convolution is used, and the stride is 1; each pooling layer uses a 2×2 matrix. The structure of the integrated convolutional neural network model C 1 is:
Table 2. Structure of the integrated convolutional neural network model C 1
[Table 2 is given in the published application as drawings PCTCN2020111550-appb-000009 and PCTCN2020111550-appb-000010.]
where n represents the number of handwriting feature dimensions.
Beneficial effects:
1) A custom three-layer convolutional neural network is trained to obtain a nonlinear mapping function that outputs images with a high peak signal-to-noise ratio, solving the low-definition problem of the input images and facilitating accurate recognition of the subsequent text;
2) Based on the handwriting feature method, the imaginary stroke features, path signature features and 8-direction features of the handwritten characters are computed separately, overcoming the limitations of spatial feature learning in convolutional neural networks and accurately expressing the handwriting information of handwritten characters;
3) By fusing the handwriting features, the integrated convolutional neural network model is trained to obtain the power operation ticket text recognition model, which effectively improves the accuracy of text recognition.
Description of the Drawings

FIG. 1 is a flowchart of the present invention.

Detailed Description

The technical solution of the present invention is described in further detail below in conjunction with the accompanying drawings.
As shown in FIG. 1, the present invention includes the following steps:
1) obtaining sample images to form a training set;
2) constructing a convolutional neural network model C 0 having only 3 convolutional layers, no pooling layer and no fully connected layer;
3) defining the loss function of C 0 ;
4) training to obtain the nonlinear mapping function F λ (p): traversing all sample images in the training set c and training the output image enhancement function, that is, the nonlinear mapping function F λ (p);
5) based on the nonlinear mapping function F λ (p), computing a high peak signal-to-noise ratio image of an output image p;
6) using the handwriting feature calculation method, computing the imaginary stroke features, path signature features and 8-direction features of the high peak signal-to-noise ratio image;
7) constructing an integrated convolutional neural network model C 1 having 6 convolutional layers, 5 pooling layers and 1 fully connected layer;
8) traversing all sample images in the training set and, combining the imaginary stroke features, path signature features and 8-direction features, training to obtain a power operation ticket text recognition model;
9) obtaining the power operation ticket to be recognized and performing text recognition with the power operation ticket text recognition model.
The above steps fall into two stages: image enhancement processing and text recognition processing.
Each stage is described in detail below.
Image enhancement processing stage

The image enhancement processing comprises the following steps.

Step 1: For each sample image p i (p i ∈ c, 1 ≤ i ≤ N), where N is the total number of sample images contained in the training set c, obtain the M×M numerical matrix A i of p i and the corresponding clear-image numerical matrix B i .

Step 2: Construct a three-layer convolutional neural network model C 0 containing only convolutional layers; select the ReLU activation function, set the stride to 1, and apply no zero padding to the convolution operation. The network structure is shown in Table 1.

Table 1. Structure of the custom convolutional neural network model C 0
[Table 1 is given in the published application as drawings PCTCN2020111550-appb-000011 and PCTCN2020111550-appb-000012.]
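The text fixes only part of C 0 (three convolutional layers and nothing else, ReLU activations, stride 1, no zero padding); the kernel sizes and channel counts live in Table 1, which is available only as a drawing. The PyTorch sketch below is therefore an illustration under assumed kernel sizes (3×3) and channel counts (64 and 32), not the patented configuration.

```python
import torch
import torch.nn as nn

class EnhancementCNN(nn.Module):
    """Three-layer, convolution-only enhancement network (model C0).

    Kernel sizes and channel counts are illustrative assumptions; the patent
    specifies them only in Table 1, which is published as a drawing.
    """
    def __init__(self, in_channels: int = 1):
        super().__init__()
        self.net = nn.Sequential(
            # stride 1, no zero padding, ReLU activation, as stated in Step 2
            nn.Conv2d(in_channels, 64, kernel_size=3, stride=1, padding=0),
            nn.ReLU(inplace=True),
            nn.Conv2d(64, 32, kernel_size=3, stride=1, padding=0),
            nn.ReLU(inplace=True),
            # last layer maps back to an image-valued output (no activation)
            nn.Conv2d(32, in_channels, kernel_size=3, stride=1, padding=0),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # without padding, each 3x3 convolution trims one pixel from every border
        return self.net(x)
```

Because no padding is used, the enhanced output is slightly smaller than the M×M input, which any loss computation against B i has to account for.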
Step 3: Define the loss function. Compared against B i , the purpose of the loss function is to obtain the minimum F norm (Frobenius norm). The calculation formula is as follows:

[The loss function formula is given in the published application as drawing PCTCN2020111550-appb-000013.]

where λ = {W j , b j }; W j = {W j k : 1 ≤ k ≤ n j }, W j k is the convolution matrix of the j-th layer of the convolutional network, b j is the bias value, and n j is the number of convolution kernels in the j-th layer of the convolutional network.
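The loss formula itself survives only as a drawing, but minimizing the F norm of the difference between F λ (A i ) and B i over the training set is equivalent to minimizing the sum of squared element-wise errors, so one plausible reading is the squared-Frobenius (mean squared error) loss sketched below; the centre-cropping of B i to the unpadded output size is likewise an assumption.

```python
import torch
import torch.nn as nn

def frobenius_loss(model: nn.Module,
                   noisy: torch.Tensor,   # batch of A_i matrices
                   clear: torch.Tensor    # batch of B_i matrices
                   ) -> torch.Tensor:
    """Squared-F-norm loss between F_lambda(A_i) and B_i, averaged over the batch.

    Centre-cropping B_i to match the smaller, unpadded output of C0 is an
    assumption; the patent gives the loss only as a drawing.
    """
    out = model(noisy)
    dh = (clear.shape[-2] - out.shape[-2]) // 2
    dw = (clear.shape[-1] - out.shape[-1]) // 2
    target = clear[..., dh:dh + out.shape[-2], dw:dw + out.shape[-1]]
    # sum of squared differences over each matrix = squared Frobenius norm
    return ((out - target) ** 2).sum(dim=(-2, -1)).mean()
```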
Step 4: Traverse all the sample images in the training set c and train the output image enhancement function, that is, the nonlinear mapping function F λ (p).
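A minimal training loop for this step, reusing the EnhancementCNN and frobenius_loss sketches above, could look as follows; the optimizer, learning rate and epoch count are assumptions, since the patent states only that every sample image in c is traversed and the loss is minimized.

```python
import torch

def train_enhancement(model: torch.nn.Module, pairs, epochs: int = 50, lr: float = 1e-4):
    """Traverse the (A_i, B_i) pairs and fit C0 so that F_lambda(A_i) approximates B_i.

    `pairs` is any iterable of (noisy, clear) tensor batches. Adam, the learning
    rate and the epoch count are illustrative choices, not values from the patent.
    """
    optimizer = torch.optim.Adam(model.parameters(), lr=lr)
    for _ in range(epochs):
        for noisy, clear in pairs:            # every sample image is traversed
            optimizer.zero_grad()
            loss = frobenius_loss(model, noisy, clear)
            loss.backward()
            optimizer.step()
    return model                              # the trained model realizes F_lambda(p)
```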
Text recognition processing stage

The text recognition processing comprises the following steps.

Step 1: Using the nonlinear mapping function F λ (p), compute for each sample image p i in the training set c the corresponding enhanced image P i , yielding a new training set C (P i ∈ C, 1 ≤ i ≤ N).

Step 2: Based on the handwriting feature calculation method, compute the imaginary stroke features, path signature features and 8-direction features of P i . The specific calculation methods are as follows:
d) The imaginary stroke feature extracts the degree of change in handwriting direction. For each pair of adjacent pixels, the calculation formula is as follows:

[The dcd formula is given in the published application as drawing PCTCN2020111550-appb-000014.]

where θ is the angle formed between connected strokes (-180 ≤ θ ≤ 180), l is the stroke length, [a further definition is given as drawing PCTCN2020111550-appb-000015] and w = 1/8. The dcd values of different pixels are compared to compute the imaginary stroke matrix of P i .
e) The path signature feature extracts the handwriting curvature value. Given a handwriting stroke over the interval [s, t], its curvature feature is defined as follows:

[The path signature formula is given in the published application as drawing PCTCN2020111550-appb-000016.]

If k = 0, the 0-fold iterated integral evaluates to 1 and represents the binary image feature of the handwriting; if k = 1, the 1-fold iterated integral represents the displacement feature of the handwriting; if k = 2, the 2-fold iterated integral represents the curvature feature of the handwriting. The value of k may be arbitrary, but it should normally not be too large; otherwise the computational complexity grows exponentially without yielding more effective handwriting features. For each pair of adjacent pixels, the path signature matrix of P i is computed.
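The iterated-integral definition referred to here is published only as a drawing, but the description of the k = 0, 1, 2 terms matches the standard truncated path signature of a stroke; the NumPy sketch below computes those three levels for a stroke given as a polyline, and reading the missing formula this way is an assumption.

```python
import numpy as np

def path_signature_k2(points: np.ndarray) -> dict:
    """Truncated path signature (levels 0-2) of a 2-D stroke polyline.

    `points` is an (m, 2) array of pen coordinates over the interval [s, t].
    Level 0 is the constant 1 (binary-image feature), level 1 the displacement,
    level 2 the second iterated integral related to curvature, matching the
    k = 0, 1, 2 description in the text. Treating the missing formula as the
    standard path signature is an assumption.
    """
    sig1 = np.zeros(2)          # level-1 term: total displacement
    sig2 = np.zeros((2, 2))     # level-2 term: second iterated integral
    for a, b in zip(points[:-1], points[1:]):
        delta = b - a
        # Chen's identity for appending one straight segment to the path
        sig2 += np.outer(sig1, delta) + 0.5 * np.outer(delta, delta)
        sig1 += delta
    return {"k0": 1.0, "k1": sig1, "k2": sig2}

# example: a short L-shaped stroke
stroke = np.array([[0.0, 0.0], [1.0, 0.0], [1.0, 1.0]])
print(path_signature_k2(stroke))
```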
f) The 8-direction feature fits the horizontal, vertical, left-falling and right-falling strokes of Chinese characters well and can be used to extract handwriting direction information. Given a two-dimensional coordinate system, the 8-direction features compute the handwriting gradient magnitude along 0°, 45°, 90°, 135°, 180°, 225°, 270° and 315°. Given the start and end coordinates (x 1 , y 1 ) and (x 2 , y 2 ) of a handwriting segment, the gradient calculation formula is as follows:

[The gradient formulas are given in the published application as drawings PCTCN2020111550-appb-000017 and PCTCN2020111550-appb-000018.]

where d x = |x 2 - x 1 |, d y = |y 2 - y 1 |, [a further definition is given as drawing PCTCN2020111550-appb-000019]. For each pair of adjacent pixels, the 8-direction matrix of P i is computed.
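Since the gradient formulas survive only as drawings, the sketch below follows the usual 8-direction decomposition for handwriting: the displacement of a stroke segment gives d x , d y and a gradient magnitude, and that magnitude is distributed to the two neighbouring 45°-spaced directions. Using signed displacements for the angle and a linear split between the two neighbouring directions are assumptions.

```python
import numpy as np

DIRECTIONS = np.deg2rad(np.arange(0, 360, 45))    # 0, 45, ..., 315 degrees

def eight_direction_feature(x1: float, y1: float, x2: float, y2: float) -> np.ndarray:
    """Distribute one stroke segment's gradient magnitude over 8 directions.

    dx, dy and the magnitude follow the quantities named in the text; how the
    magnitude is split between the two adjacent 45-degree directions is an
    assumption, since the exact formulas are published only as drawings.
    """
    dx, dy = x2 - x1, y2 - y1
    magnitude = np.hypot(dx, dy)
    feat = np.zeros(8)
    if magnitude == 0.0:
        return feat
    angle = np.arctan2(dy, dx) % (2 * np.pi)
    lower = int(angle // (np.pi / 4)) % 8          # nearest direction at or below the angle
    upper = (lower + 1) % 8                        # next direction counter-clockwise
    frac = (angle - DIRECTIONS[lower]) / (np.pi / 4)
    feat[lower] = magnitude * (1.0 - frac)         # linear split of the magnitude
    feat[upper] = magnitude * frac
    return feat

print(eight_direction_feature(0, 0, 1, 1))         # a pure 45-degree stroke
```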
Step 3: Construct an integrated convolutional network model C 1 containing 6 convolutional layers. Each of the first 5 convolutional layers is followed by a pooling layer, and the 6th convolutional layer is followed by a fully connected layer. The convolution kernel size of the first convolutional layer is set to 3×3 with 80 kernels, and the number of kernels increases by 80 in each subsequent layer; the convolution kernel size of the second to sixth convolutional layers is set to 2×2; the ReLU activation function is selected, zero-padded convolution is used, and the stride is 1. Each pooling layer uses a 2×2 matrix. The model structure is shown in Table 2, where n represents the number of handwriting feature dimensions.
Table 2. Structure of the integrated convolutional neural network model C 1

[Table 2 is given in the published application as drawings PCTCN2020111550-appb-000020 and PCTCN2020111550-appb-000021.]
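The parameters stated above (a 3×3 first-layer kernel with 80 filters, 80 more filters per layer, 2×2 kernels from the second layer on, zero-padded stride-1 convolutions, a 2×2 pooling layer after each of the first five convolutional layers, and one fully connected layer after the sixth) are enough to sketch C 1 ; the PyTorch version below assumes max pooling, a 64×64 input and a freely chosen number of output classes, none of which are fixed by the text.

```python
import torch
import torch.nn as nn

class IntegratedCNN(nn.Module):
    """Integrated convolutional model C1: 6 conv layers, 5 pooling layers, 1 FC layer.

    `n_features` is the number of handwriting feature dimensions (input channels);
    `n_classes`, the 64x64 input size and the use of max pooling are assumptions.
    """
    def __init__(self, n_features: int, n_classes: int, input_size: int = 64):
        super().__init__()
        widths = [80 * (i + 1) for i in range(6)]      # 80, 160, ..., 480 kernels
        layers, in_ch = [], n_features
        for i, out_ch in enumerate(widths):
            k = 3 if i == 0 else 2                     # 3x3 first layer, 2x2 afterwards
            layers += [nn.Conv2d(in_ch, out_ch, kernel_size=k, stride=1,
                                 padding=k // 2),      # zero-padded, stride-1 convolution
                       nn.ReLU(inplace=True)]
            if i < 5:                                  # pooling after the first 5 conv layers
                layers.append(nn.MaxPool2d(kernel_size=2))
            in_ch = out_ch
        self.features = nn.Sequential(*layers)
        with torch.no_grad():                          # infer the flattened feature size
            flat = self.features(torch.zeros(1, n_features, input_size, input_size))
        self.classifier = nn.Linear(flat.numel(), n_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.classifier(self.features(x).flatten(1))
```

The fully connected layer's input size is inferred with a dummy forward pass, so the sketch adapts automatically if a different input size is assumed.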
Step 4: Traverse all the sample images in the training set C and, combining the imaginary stroke features, path signature features and 8-direction features, train and output a text recognition model.

The convolutional-neural-network-based power operation ticket character recognition method shown in FIG. 1 is a specific embodiment of the present invention; it embodies the substantive features and progress of the present invention, and equivalent modifications of its shape, structure and the like, made according to actual needs under the teaching of the present invention, all fall within the scope of protection of this solution.

Claims (7)

  1. A method for recognizing power operation ticket characters based on a convolutional neural network, characterized in that it comprises the following steps:
    1) obtaining sample images to form a training set;
    2) constructing a convolutional neural network model C 0 having only 3 convolutional layers, no pooling layer and no fully connected layer;
    3) defining the loss function of C 0 ;
    4) training to obtain the nonlinear mapping function F λ (p): traversing all sample images in the training set c and training the output image enhancement function, that is, the nonlinear mapping function F λ (p);
    5) based on the nonlinear mapping function F λ (p), computing a high peak signal-to-noise ratio image of an output image p;
    6) using the handwriting feature calculation method, computing the imaginary stroke features, path signature features and 8-direction features of the high peak signal-to-noise ratio image;
    7) constructing an integrated convolutional neural network model C 1 having 6 convolutional layers, 5 pooling layers and 1 fully connected layer;
    8) traversing all sample images in the training set and, combining the imaginary stroke features, path signature features and 8-direction features, training to obtain a power operation ticket text recognition model;
    9) obtaining the power operation ticket to be recognized and performing text recognition with the power operation ticket text recognition model.
  2. The method for recognizing power operation ticket characters based on a convolutional neural network according to claim 1, characterized in that: in step 1), for each sample image p i (p i ∈ c, 1 ≤ i ≤ N) in the training set, where N is the total number of sample images contained in the training set c, the M×M numerical matrix A i of p i and the corresponding clear-image numerical matrix B i are obtained.
  3. The method for recognizing power operation ticket characters based on a convolutional neural network according to claim 2, characterized in that: in step 2), a three-layer convolutional neural network model C 0 is constructed that contains only convolutional layers; the ReLU activation function is selected, the stride is set to 1, and no zero padding is applied to the convolution operation. The network structure is:
    [The structure of C 0 is given in the published application as drawing PCTCN2020111550-appb-100001.]
  4. The method for recognizing power operation ticket characters based on a convolutional neural network according to claim 3, characterized in that: in step 3), the loss function is defined. The purpose of the loss function is to obtain the minimum F norm. The calculation formula is as follows:
    [The loss function formula is given in the published application as drawing PCTCN2020111550-appb-100002.]
    where λ = {W j , b j }; W j = {W j k : 1 ≤ k ≤ n j }, W j k is the convolution matrix of the j-th layer of the convolutional network, b j is the bias value, and n j is the number of convolution kernels in the j-th layer of the convolutional network.
  5. The method for recognizing power operation ticket characters based on a convolutional neural network according to claim 4, characterized in that: in step 5), the nonlinear mapping function F λ (p) is used to compute, for each sample image p i in the training set c, the corresponding enhanced image P i , yielding a new training set C (P i ∈ C, 1 ≤ i ≤ N).
  6. The method for recognizing power operation ticket characters based on a convolutional neural network according to claim 5, characterized in that step 6) comprises the steps:
    a) the imaginary stroke feature extracts the degree of change in handwriting direction; for each pair of adjacent pixels, the calculation formula is as follows:
    [The dcd formula is given in the published application as drawing PCTCN2020111550-appb-100004.]
    where θ is the angle formed between connected strokes (-180 ≤ θ ≤ 180), l is the stroke length, [a further definition is given as drawing PCTCN2020111550-appb-100005] and w = 1/8; the dcd values of different pixels are compared to compute the imaginary stroke matrix of P i ;
    b) the path signature feature extracts the handwriting curvature value; given a handwriting stroke over the interval [s, t], its curvature feature is defined as follows:
    [The path signature formula is given in the published application as drawing PCTCN2020111550-appb-100006.]
    if k = 0, the 0-fold iterated integral evaluates to 1 and represents the binary image feature of the handwriting; if k = 1, the 1-fold iterated integral represents the displacement feature of the handwriting; if k = 2, the 2-fold iterated integral represents the curvature feature of the handwriting; the value of k may be arbitrary, but it should normally not be too large, otherwise the computational complexity grows exponentially without yielding more effective handwriting features; for each pair of adjacent pixels, the path signature matrix of P i is computed;
    c) the 8-direction feature fits the horizontal, vertical, left-falling and right-falling strokes of Chinese characters well and can be used to extract handwriting direction information; given a two-dimensional coordinate system, the 8-direction features compute the handwriting gradient magnitude along 0°, 45°, 90°, 135°, 180°, 225°, 270° and 315°; given the start and end coordinates (x 1 , y 1 ) and (x 2 , y 2 ) of a handwriting segment, the gradient calculation formula is as follows:
    [The gradient formulas are given in the published application as drawings PCTCN2020111550-appb-100007 and PCTCN2020111550-appb-100008.]
    where d x = |x 2 - x 1 |, d y = |y 2 - y 1 |, [a further definition is given as drawing PCTCN2020111550-appb-100009]; for each pair of adjacent pixels, the 8-direction matrix of P i is computed.
  7. The method for recognizing power operation ticket characters based on a convolutional neural network according to claim 6, characterized in that: in step 7), the convolution kernel size of the first convolutional layer of the integrated convolutional neural network model C 1 is set to 3×3 with 80 kernels, and the number of kernels increases by 80 in each subsequent layer; the convolution kernel size of the second to sixth convolutional layers is set to 2×2; the ReLU activation function is selected, zero-padded convolution is used, and the stride is 1; each pooling layer uses a 2×2 matrix; the structure of the integrated convolutional neural network model C 1 is:
    Table 2. Structure of the integrated convolutional neural network model C 1
    [Table 2 is given in the published application as drawing PCTCN2020111550-appb-100010.]
    where n represents the number of handwriting feature dimensions.
PCT/CN2020/111550 2020-01-09 2020-08-27 Electric power operation ticket character recognition method based on convolutional neural network WO2021139175A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010022127.9A CN111259880B (en) 2020-01-09 2020-01-09 Electric power operation ticket character recognition method based on convolutional neural network
CN202010022127.9 2020-01-09

Publications (1)

Publication Number Publication Date
WO2021139175A1 true WO2021139175A1 (en) 2021-07-15

Family

ID=70954083

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/111550 WO2021139175A1 (en) 2020-01-09 2020-08-27 Electric power operation ticket character recognition method based on convolutional neural network

Country Status (2)

Country Link
CN (1) CN111259880B (en)
WO (1) WO2021139175A1 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113780306A (en) * 2021-08-11 2021-12-10 北京工业大学 Waste mobile phone color identification method based on deep convolutional neural network
CN113959071A (en) * 2021-07-21 2022-01-21 北京金茂绿建科技有限公司 Machine learning assistance-based operation control optimization method for centralized water chilling unit air conditioning system
CN114221334A (en) * 2021-12-13 2022-03-22 国网江苏省电力有限公司 Rapid state estimation method based on graph neural network
CN114218984A (en) * 2021-12-07 2022-03-22 桂林电子科技大学 Radio frequency fingerprint identification method based on sample multi-view learning
CN116187313A (en) * 2023-04-27 2023-05-30 南方电网数字电网研究院有限公司 Power operation ticket equipment identification and error investigation method based on natural language processing technology
CN116824306A (en) * 2023-08-28 2023-09-29 天津大学 Training method of pen stone fossil image recognition model based on multi-mode metadata
CN117272244A (en) * 2023-11-21 2023-12-22 中国石油大学(华东) Soft measurement modeling method integrating feature extraction and self-adaptive composition

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111259880B (en) * 2020-01-09 2022-11-18 国网浙江省电力有限公司舟山供电公司 Electric power operation ticket character recognition method based on convolutional neural network
CN113159604B (en) * 2021-04-30 2023-05-09 贵州电网有限责任公司 Power system dispatching operation visualization method based on augmented reality

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105335754A (en) * 2015-10-29 2016-02-17 小米科技有限责任公司 Character recognition method and device
CN110210468A (en) * 2019-05-29 2019-09-06 电子科技大学 A kind of character recognition method based on the migration of convolutional neural networks Fusion Features
CN111259880A (en) * 2020-01-09 2020-06-09 国网浙江省电力有限公司舟山供电公司 Electric power operation ticket character recognition method based on convolutional neural network

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140363082A1 (en) * 2013-06-09 2014-12-11 Apple Inc. Integrating stroke-distribution information into spatial feature extraction for automatic handwriting recognition
CN104751422B (en) * 2015-03-12 2016-04-06 中南大学 A kind of photocopy file and picture strengthens and binarization method
RU2661750C1 (en) * 2017-05-30 2018-07-19 Общество с ограниченной ответственностью "Аби Продакшн" Symbols recognition with the use of artificial intelligence
CN109583449A (en) * 2018-10-29 2019-04-05 深圳市华尊科技股份有限公司 Character identifying method and Related product
CN110008917B (en) * 2019-04-11 2023-02-17 国网河南省电力公司电力科学研究院 Fault detection method of fixed value single data of relay protection device based on table understanding

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105335754A (en) * 2015-10-29 2016-02-17 小米科技有限责任公司 Character recognition method and device
CN110210468A (en) * 2019-05-29 2019-09-06 电子科技大学 A kind of character recognition method based on the migration of convolutional neural networks Fusion Features
CN111259880A (en) * 2020-01-09 2020-06-09 国网浙江省电力有限公司舟山供电公司 Electric power operation ticket character recognition method based on convolutional neural network

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113959071A (en) * 2021-07-21 2022-01-21 北京金茂绿建科技有限公司 Machine learning assistance-based operation control optimization method for centralized water chilling unit air conditioning system
CN113959071B (en) * 2021-07-21 2023-05-26 北京金茂绿建科技有限公司 Centralized water chilling unit air conditioning system operation control optimization method based on machine learning assistance
CN113780306B (en) * 2021-08-11 2024-04-09 北京工业大学 Deep convolutional neural network-based waste mobile phone color recognition method
CN113780306A (en) * 2021-08-11 2021-12-10 北京工业大学 Waste mobile phone color identification method based on deep convolutional neural network
CN114218984B (en) * 2021-12-07 2024-03-22 桂林电子科技大学 Radio frequency fingerprint identification method based on sample multi-view learning
CN114218984A (en) * 2021-12-07 2022-03-22 桂林电子科技大学 Radio frequency fingerprint identification method based on sample multi-view learning
CN114221334A (en) * 2021-12-13 2022-03-22 国网江苏省电力有限公司 Rapid state estimation method based on graph neural network
CN116187313B (en) * 2023-04-27 2023-06-27 南方电网数字电网研究院有限公司 Power operation ticket equipment identification and error investigation method based on natural language processing technology
CN116187313A (en) * 2023-04-27 2023-05-30 南方电网数字电网研究院有限公司 Power operation ticket equipment identification and error investigation method based on natural language processing technology
CN116824306B (en) * 2023-08-28 2023-11-17 天津大学 Training method of pen stone fossil image recognition model based on multi-mode metadata
CN116824306A (en) * 2023-08-28 2023-09-29 天津大学 Training method of pen stone fossil image recognition model based on multi-mode metadata
CN117272244A (en) * 2023-11-21 2023-12-22 中国石油大学(华东) Soft measurement modeling method integrating feature extraction and self-adaptive composition
CN117272244B (en) * 2023-11-21 2024-03-15 中国石油大学(华东) Soft measurement modeling method integrating feature extraction and self-adaptive composition

Also Published As

Publication number Publication date
CN111259880B (en) 2022-11-18
CN111259880A (en) 2020-06-09

Similar Documents

Publication Publication Date Title
WO2021139175A1 (en) Electric power operation ticket character recognition method based on convolutional neural network
CN109993160B (en) Image correction and text and position identification method and system
Tang et al. Text-independent writer identification via CNN features and joint Bayesian
CN111832546B (en) Lightweight natural scene text recognition method
CN105426919B (en) The image classification method of non-supervisory feature learning is instructed based on conspicuousness
CN107122712B (en) Palm print image identification method based on CNN and bidirectional VLAD
CN106570456A (en) Handwritten Chinese character recognition method based on full-convolution recursive network
CN111401384A (en) Transformer equipment defect image matching method
CN106408038A (en) Rotary Chinese character identifying method based on convolution neural network model
CN108664975B (en) Uyghur handwritten letter recognition method and system and electronic equipment
CN106250811B (en) Unconstrained face identification method based on HOG feature rarefaction representation
CN109800746A (en) A kind of hand-written English document recognition methods based on CNN
Tan et al. A new handwritten character segmentation method based on nonlinear clustering
CN109886391A (en) A kind of neural network compression method based on the positive and negative diagonal convolution in space
CN111666937A (en) Method and system for recognizing text in image
Wu et al. Joint spatial and radical analysis network for distorted Chinese character recognition
CN106203448A (en) A kind of scene classification method based on Nonlinear Scale Space Theory
Gao et al. Progressive rectification network for irregular text recognition
CN113160291B (en) Change detection method based on image registration
Chen Research and application of chinese calligraphy character recognition algorithm based on image analysis
Qin et al. Summary of scene text detection and recognition
Zheng et al. Segmentation-free multi-font printed Manchu word recognition using deep convolutional features and data augmentation
CN110929652B (en) Handwriting Chinese character recognition method based on LeNet-5 network model
Wei et al. A holistic recognition approach for woodblock-print Mongolian words based on convolutional neural network
CN108537855A (en) A kind of ceramic marble paper method for generating pattern and device that sketch is consistent

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20912335

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20912335

Country of ref document: EP

Kind code of ref document: A1