CN105893968B

CN105893968B - The unrelated person's handwriting recognition methods end to end of text based on deep learning

Info

Publication number: CN105893968B
Application number: CN201610202734.7A
Authority: CN
Inventors: 金连文; 杨维信; 刘曼飞
Original assignee: South China University of Technology SCUT
Current assignee: Chongqing Sign Digital Technology Co ltd
Priority date: 2016-03-31
Filing date: 2016-03-31
Publication date: 2019-06-14
Anticipated expiration: 2036-03-31
Also published as: CN105893968A

Abstract

The present invention provides the unrelated person's handwriting recognition methods end to end of the text based on deep learning, steps are as follows: A, pre-processes to online handwritten text, generates pseudo- character sample；B, the path integral characteristic image of pseudo- character sample is calculated；C, the deep neural network model of writer's sample known to training；D, using the deep neural network model of step C, automatic identification is carried out to the sample of uncertain writer.This method can automatically process online line of text, not need artificially to extract character feature, efficiently realize the unrelated online writer identification of text.

Description

The unrelated person's handwriting recognition methods end to end of text based on deep learning

Technical field

The invention belongs to deep learnings and artificial intelligence field, more particularly to computer user's clipboard computer The unrelated hand script Chinese input equipment document of text carry out feature learning to distinguish the technology of writer.

Background technique

Person's handwriting is an important evidence of authentication, law evidence obtaining etc..In recent years, mobile terminal and clipboard are touched It is universal to improve the attention rate of hand script Chinese input equipment writer verification.But since insufficient, the no discovery handwriting samples of data are especially good Feature, writer identification accuracy need to be improved.Previous recognition methods is all based on the relevant meter such as pattern-recognition Calculation machine image processing techniques artificially determines some feature statements and apart from calculating, artificially extracts character feature, the accuracy rate of identification It is not high.

Summary of the invention

In order to solve technical problem present in the prior art, the present invention provides the text based on deep learning is unrelated Person's handwriting recognition methods end to end, this method can automatically process online line of text, not need artificially to extract character feature, efficiently Realize the unrelated online writer identification of text.

The present invention adopts the following technical scheme that realize: the unrelated identification of person's handwriting end to end of the text based on deep learning Method, which comprises the steps of: A, online handwritten text is pre-processed, generates pseudo- character sample；B, it calculates The path integral characteristic image of pseudo- character sample；C, the deep neural network model of writer's sample known to training；D, step is utilized The deep neural network model of rapid C carries out automatic identification to the sample of uncertain writer.

Preferably, the step A specifically: resampling A1, is carried out to each stroke of online handwritten text, is sampled The uniform hand script Chinese input equipment text chunk of dot density；

A2, stroke segmentation is carried out to the hand script Chinese input equipment text after resampling, obtains the path of more broken small stroke section composition Set；

A3, stroke is divided after text carry out Character segmentation, generate pseudo- character；

A4, the pseudo- character progress stroke section after each segmentation is removed at random；

A5, the size for normalizing pseudo- character；

A6, affine transformation generate pseudo- character sample.

Preferably, in the step A1, resampling is calculated according to set tracing point density parameter and original stroke number The quantity of the tracing point of new samples；

In the step A2, stroke segmentation is first to carry out Corner Detection, and then the stroke is cut off from corner point, is generated new Shorter stroke section；

In the step A3, Character segmentation is successively to take out stroke section in order, when these stroke sections being extracted into are combined into The width of character when be just above character average height, the last one current stroke section is taken as a fresh character Start；

In the step A4, the quantity for removing obtained pseudo- character at random of each character stroke section is total stroke number of segment s_i Function；

In the step A5, normalization is that the figure that each pseudo- character is mapped to two-dimensional surface zooms in and out, wider With high length, longest edge is transformed to fixed required length value, and in the case where keeping the ratio of width to height constant, to short side Multiplied by corresponding scaling multiple；

In the step A6, affine transformation includes rotation, stretching and the inclination to entire character path.

Preferably, the step B specifically: one group of path integral feature B1, is calculated to each pseudo- character sample；

B2, every group of path integral feature is reassembled into different path integral characteristic patterns according to the feature of identical dimensional；

B3, path integral characteristic pattern progress margin pixel is filled up.

Preferably, when the step B1 calculates path integral feature, it is assumed that a finite length stroke section P is two-dimensional spacePath, the track mobile time meets0 < τ₁< ... < τ_k< T, τ_iAmong indicating I-th of time point, and positive integer i meets 1≤i≤k, then calculates the k rank path integral feature in time [0, T] interior P When P is straight line, Δ is used_0,TIndicate path displacement,It can be acquired by being segmented to calculate；N rank path integral feature is calculated, just It is path integral feature to be done the truncation of k rank, obtained feature set isObtain 2ⁿ⁺¹The path product of dimension Dtex sign；

In the step B2, each dimension of path integral feature is individually become into a path integral characteristic pattern, institute Have 2 with each puppet character sampleⁿ⁺¹Path integral characteristic pattern is opened, including the two-dimensional image in path itself；

In the step B3, path integral characteristic pattern is first set as the size that pixel value is z × z, is placed on a pixel value For the center of the figure of Z × Z, then these path integral characteristic patterns are input in deep neural network model described in step C It is trained, z < Z≤3z.

Preferably, the step C specifically:

The neuron number of C1, the projected depth neural network number of plies, the template number of convolutional layer and full articulamentum；

C2, whole sample extraction characteristic images of training set are trained as the input of deep neural network model；

C3, when network converges to accuracy rate on training set and no longer rises, deconditioning, save deep neural network mould Shape parameter.

Preferably, in the step C1, deep neural network includes five convolutional layers, have behind each convolutional layer one most Great Chiization layer；

In the step C2, the training of deep neural network includes the successive ignition of two steps of forward and backward, is first used After feedforward network obtains network error, network parameter is updated using back-propagation algorithm, continuous iteration optimization network ginseng Number.

Preferably, the step D specifically: depth nerve D1, will be inputted by pretreated path integral characteristic image Each random corresponding candidate item probability tables of pseudo- character sample removed after stroke are calculated in network model；

D2, it asks probability average the corresponding candidate item probability tables addition of multiple puppet character samples of each character, is somebody's turn to do The candidate item probability tables addition of all characters of text is asked probability averagely to obtain the text by the candidate item probability tables of pseudo- character Candidate item probability tables；

D3, it selects the candidate item of highest scoring according to the candidate item probability tables of text and is determined as writer.

Preferably, in the step D1, each text dividing is multiple characters, and each puppet character generates multiple pseudo- characters Sample, these pseudo- character samples have identical label, calculate candidate item probability tables respectively as an independent sample；

In the step D2, probability averagely includes that the more character probabilities of text are average and multiple puppet character sample probability of character It is average；The candidate item for being added to obtain the character to the pseudo- each dimension of character sample candidate item probability tables of each of character is average Probability tables are added to obtain the time of the text by each dimension of candidate item average probability table of each character to a text chunk Option probability tables；

In the step D3, the candidate item probability tables that are averaged of the character according to text are selected the candidate item of highest scoring and are determined as The writer of the text.

From the above technical scheme, the end to end person's handwriting identification side unrelated the present invention is based on the text of deep learning Method, main includes preprocessing process, deep neural network model training process and the automatic identification process of hand script Chinese input equipment text. It wherein pre-processes the stroke section dividing method used, random removal stroke phase method and is used to write by path integral feature for the first time Person's identification is innovation emphasis of the invention.Compared with prior art, the invention has the following advantages and beneficial effects:

1, pretreated method include text dividing, sample augmentation and text it is unrelated generalization ability enhancing；Pretreatment behaviour Make so that text of the present invention suitable for various lengths, can be long text or short text, it might even be possible to be individual character.

2, it removes stroke section and generates training sample abundant, over-fitting is prevented when for training deep neural network, It is also used for generating multiple pseudo- characters when test and be identified to improve discrimination.

3, the present invention proposes a path integral feature for writer's identification mission for the first time, and uses depth for the first time Convolutional neural networks realize writer's identification.Path integral feature can extract the validity feature that can be used for writer's identification, Learnt by the supplemental characteristic of deep neural network, discrimination is up to 95.72% (Chinese), 98.51% (English).Based on depth Neural network can identify the handwriting samples of the different length of writer, accuracy with higher and robustness.

Detailed description of the invention

Fig. 1 is flow chart of the invention；

Fig. 2 is the schematic diagram of random drop stroke section in the present invention, has chosen 30 random drop strokes of a character Pseudo- character after section is as example；

Fig. 3 indicates the visualized graphs of path integral feature in the present invention.

Specific embodiment

Present invention will now be described in further detail with reference to the embodiments and the accompanying drawings, but a specific embodiment of the invention It is not limited to this.

Embodiment

Present invention mainly solves the identifications and its specific implementation of online text written person, cut using to online text Point and the preprocess method that removes at random of stroke section, establish completely unrelated end-to-end of the text based on deep learning Person's handwriting recognition methods.There is no limit to text, also there is no limit being capable of maximum journey to the character types that user inputs by the present invention User is allowed to carry out free text written on degree, overall flow is as shown in Figure 1.

Referring to Fig. 1, the present invention includes following four process: the A, preprocessing process of hand script Chinese input equipment text；B, known writing The deep neural network model training process of person's sample；C, path integral feature is calculated；D, the sample of uncertain writer is automatic Identification process.Specifically, it first has to carry out resampling to the line of text of hand script Chinese input equipment long text, it is equal to become sampled point spacing Online line of text, then the line of text after resampling is divided into smaller stroke section set, these stroke sections is based on width The height character more single than being divided into.Then the stroke section of each character is removed at random, generates multiple pseudo- characters.It calculates later every A puppet character generates one group of path integral characteristic pattern after carrying out affine transformation, and zero point is filled around.By the puppet of training set The path integral characteristic pattern of character sample, which is input in deep neural network, carries out the training of depth network model to close to saturation, protects Deposit depth network training parameter.In test, training set hand script Chinese input equipment long text is subjected to data prediction described above simultaneously It is input in the depth network model of preservation and is calculated, export the candidate probability table of each pseudo- character sample, then calculate every The probability tables of a character.The probability tables from the same text fragment are corresponded to candidate item later to sum, are obtained final Probability tables, and the maximum candidate item of probability value is selected according to the probability tables, it is determined as writer.The label of the test item of this system It needs to occur in training set.

Each key step of the invention is described in detail individually below:

Step A data prediction

The purpose of step A data prediction is split to the hand script Chinese input equipment line of text data of user's input, and formation can With the format utilized, and some features are extracted, help deep neural network preferably learns and processing feature, in efficiency and identification There is good auxiliaring effect in accuracy.Sample passes through the method resampling of linear interpolation, is calculated and is examined by local buckling degree Angle measurement point.Stroke section after segmentation is combined into character, then the stroke section inside each character is removed at random, is obtained big The pseudo- character of amount, these pseudo- characters obtain more diversified pseudo- character sample by size normalization and affine transformation.

A1, sample resampling

Resampling is that the quantity of the tracing point of new samples is calculated according to set tracing point density parameter and original stroke number； The total length that a stroke is calculated according to original tracing point obtains dot density divided by the quantity of the tracing point of new samples, into And determine former tracing point whether retain and need on line two-by-two the number of interpolation so that it is determined that new samples tracing point coordinate.

If a stroke has the sampled point { (x of p constant duration₁,x₂),...,(x_p,k_p)}.Due to writing speed The Euclidean distance of difference, these points is also different.When integer i meets 1≤i≤p, it is assumed that (x_i,y_i) and (x_i+1,y_i+1) two o'clock Between Euclidean distance be d_i, (x₀,y₀) arrive (x_i,y_i) inter-two-point path is a length ofIf the puppet after the interpolation to be obtained Specimen sample point sum is l, and l is the integral multiple of p.First point of each stroke remains unchanged after interpolation, from second point Start, i-th point of position coordinates are:

(x_i×α+x_i+1×(1-α),y_i×α+y_i+1×(1-α)), (1)

Wherein

The set of point after each resampling still falls within the stroke.

A2, the segmentation of stroke section

Stroke segmentation is first to carry out Corner Detection, and the stroke is cut off from corner point then, generates new shorter pen Draw section；Judge that a point is angle point, need to be calculated its curvature by the coordinate of the point before and after the point, local buckling degree is maximum Point be considered as angle point；Assuming that (x_i,y_i) it is i-th of trajectory coordinates point after interpolation, respectively with the of the front and back K point (x_i-k,y_i-k) and (x_i+k,y_i+k) coordinate value calculate curvature.

The segmentation of stroke section is first to carry out the identification of stroke endpoint according to storing data.Terminate point identification or a connection running through The first coordinate of the hand-written long article this document of machine, is just defaulted as the starting point of a stroke, i.e. endpoint.Angle point is judged in Corner Detection Principle be that local buckling degree is maximum.Its curvature is calculated by the coordinate of the front and back point of each point: assuming that (x_i,y_i) be interpolation after Trajectory coordinates point, k-th point of the front and back is (x_i-k,y_i-k) and (x_i+k,y_i+k), curvature is defined as:

β=max (| x_i+k+x_i-k-2x_i|,|y_i+k+y_i-k-2y_i|)/2k, (3)

Then the stroke is cut off from corner point, generates new shorter stroke section；For training data, if each word The height of symbol is y_max-y_min, then estimate the average height y of each character an of document_averFor Character segmentation.For Test data traverses each character of text, obtains maximum, the minimum value y of local vertical coordinate_maxAnd y_min, and then estimate this The average character height y of text_aver。

A3, Character segmentation, pseudo- character generate

Character segmentation is the character that length-width ratio is fixed in order to obtain.Character segmentation be successively take out stroke section in order, and Record the maximum value x of the abscissa of its appearance_maxWith minimum value x_min.When the width for the character that these stroke sections being extracted into are combined into Just above character average height y_averWhen, the last one stroke section is taken as the beginning of a fresh character；Character Average height is calculated when cutting stroke section.

A4, stroke section remove at random

Assuming that a total stroke number of character is m, the stroke number of segment s of i-th of stroke_i, what the random removal of stroke section obtained The sample size of character is s_iWith the function of m.If each character is removed d at random_i(0≤d_i< s_i) pen, these are remaining Stroke section is reassembled into pseudo- character sample according to original sequencing, then obtained pseudo- character sample sum is exactly:

Schematic diagram is shown in Fig. 2.

A5, size normalization

Normalization is that the figure that each pseudo- character is mapped to two-dimensional surface zooms in and out, and wider and high length will Longest edge transforms to fixed required length value, and in the case where keeping the ratio of width to height constant, to short side multiplied by corresponding contracting Put multiple.

Size normalization is the coordinate (x for first traversing the path point of a character_i,y_i), find out width w=x_max-x_minAnd height Spend h=y_max-y_min, long side max (w, h) is then stretched to fixed value Q, short side expands corresponding multiple, after obtaining normalization Path point coordinate

A6, affine transformation

Affine transformation includes rotation, stretching, the inclination etc. to entire character path；The angle of rotation depends on certain section The twiddle factor w of interior random size, the coordinate of postrotational point is:

(x_i×cos(w)+y_i×sin(w),-x_i×sin(w)+y_i×cos(x)), (6)

Stretching is that linear transformation is carried out to the abscissa or ordinate of path coordinate points, drawing coefficient be set to α and β, ((α, β) ∈ [- 1,1]), coordinate (x_i,y_i) coordinate after stretching conversion is:

(x_i×(1+α),y_i×(1+β)), (7)

Tilt variation includes the tilt variation to horizontal direction and vertical direction.Coordinate (x_i,y_i) inclining in the horizontal direction The tiltedly coordinate after variation are as follows:

(x_i×(1+α_x),y_i), (8)

Coordinate (x_i,y_i) coordinate after tilt variation in the vertical direction:

(x_i,y_i×(1+α_y)), (9)

Step B calculates path integral characteristic pattern

B1, path integral feature is calculated

Calculate the method that path integral is characterized in using path integral feature.Assuming that a finite length stroke section P is two dimension SpacePath, i-th intermediate of time point be τ_i, positive integer i satisfaction 1≤i≤k, the track mobile timeAnd 0 < τ₁< ... < τ_k< T, then the k rank path integral feature of P is exactly:

When P is straight line, Δ is used_0,TIndicate that path displacement, segmentation calculate:

N rank path integral feature is calculated, obtained feature set is expressed as

The dimension of the obtained path integral feature including path itself is 2ⁿ⁺¹。

B2, path integral characteristic pattern is generated

The multidimensional path integral feature of image is generated in step bl is determined, each dimension can correspond into width path product Divide characteristic pattern.Path image itself is removed, the quantity for the characteristic pattern that each puppet character sample obtains is 2ⁿ⁺¹-1.Generate path product Divide schematic diagram as shown in Figure 3.

B3, margin pixel are filled up

In order to keep image do not lost because of convolution operation image rim path integrate feature, to input tomographic image into Line blank pixel is filled up.The specific embodiment filled up is that path integral characteristic pattern is first converted into the big of 54 × 54 pixels Small, then surrounding fills the blank pixel point of 21 layers of pixel, becomes 96 × 96 figure.

Step C is trained deep neural network

C1, projected depth neural network model

In the present invention, the deep neural network of setting includes convolutional layer and maximum pond layer；Its structure is five convolutional layers, There is a maximum pond layer (MP) behind each convolutional layer；The size of first layer convolution kernel is 3 × 3 (being expressed as C3), behind four The size of layer convolution kernel is 2 × 2 (being expressed as C2)；Step-length is 2；Last there are two full articulamentums, are 480 and 512 respectively Neuron.Whole network structure unified representation are as follows:

M×96×96Input-80C3-MP2-160C2-MP2-240C2-MP2-320C2-MP2-400C2-MP2- 480FC-512FC-Output,

Wherein M indicates the port number of input layer, equal with each pseudo- integral quantity of characteristic pattern of character sample.

C2, training deep neural network

The data of training set are used to train deep neural network.Classification problem is done when training.Deep neural network Training include two steps of forward and backward successive ignition.After first obtaining network error with feedforward network, passed using reversed It broadcasts algorithm to be updated network parameter, continuous iteration optimization network parameter, point for training data is tested after every suboptimization Class accuracy rate.

C3, deep neural network model parameter is saved

The trend that concussion rises is presented in the accuracy rate of training data.When the accuracy rate of training data almost no longer rises, The close saturation of training is thought, then preservation model Parameter File, for testing.

Step D automatic identification writer

D1, candidate probability calculate

Tens to thousands of pseudo- character samples can be generated for each long text, the writer of text is exactly each The label of a puppet character sample.Each puppet character sample can be generated 2ⁿ⁺¹Path integral characteristic pattern, 2ⁿ⁺¹It is simultaneously input layer Port number.Input tomographic image is input in the deep neural network model of step C3 preservation and carries out forward calculation, obtains depth The output of neural network；I-th of character generates in long text i-th_jThe probability of a puppet character sample are as follows:

D2, probability are average

Probability averagely includes the more character probabilities of text averagely and multiple pseudo- character sample probability of character are average.Assuming that shared R character can be generated in η class hand writer's long text, each long text, and N number of pseudo- character sample can be generated in each character, each The character puppet sample probability of i-th of character of the pseudo- available candidate item probability column of character sample is average are as follows:

The character candidates item average probability table of each text chunk are as follows:

The most probable value found in formula (14) is λ

D3, writer determine

The candidate item that candidate item probability tables according to pseudo- character select highest scoring is determined as writer；From step D2 The classification results of the long text are the λ class of candidate item.

Embodiment of the present invention are not limited by the above embodiments, other are any without departing from Spirit Essence of the invention With changes, modifications, substitutions, combinations, simplifications made under principle, equivalent substitute mode should be, be included in of the invention Within protection scope.

Claims

1. the unrelated person's handwriting recognition methods end to end of the text based on deep learning, which comprises the steps of:

A, online handwritten text is pre-processed, generates pseudo- character sample；

B, the path integral characteristic image of pseudo- character sample is calculated；

C, the deep neural network model of writer's sample known to training；

D, using the deep neural network model of step C, automatic identification is carried out to the sample of uncertain writer；

The step A specifically:

A1, resampling is carried out to each stroke of online handwritten text, obtains the uniform hand script Chinese input equipment text chunk of sampling point density；

A2, stroke segmentation is carried out to the hand script Chinese input equipment text after resampling, obtains the set of paths of more broken small stroke section composition；

A5, the size for normalizing pseudo- character；

A6, affine transformation generate pseudo- character sample.

2. person's handwriting recognition methods according to claim 1, which is characterized in that

In the step A1, resampling is the track that new samples are calculated according to set tracing point density parameter and original stroke number The quantity of point；

In the step A2, stroke segmentation is first to carry out Corner Detection, and the stroke is cut off from corner point then, generates new ratio Shorter stroke section；

In the step A3, Character segmentation is successively to take out stroke section in order, when the word that these stroke sections being extracted into are combined into When the width of symbol is just above character average height, the last one current stroke section is taken as opening for a fresh character Begin；

In the step A4, the quantity for removing obtained pseudo- character at random of each character stroke section is total stroke number of segment s_iLetter Number；

In the step A5, normalization is that the figure that each pseudo- character is mapped to two-dimensional surface zooms in and out, wider and high Length, longest edge transforms to fixed required length value, and in the case where keeping the ratio of width to height constant, to short side multiplied by Corresponding scaling multiple；

3. person's handwriting recognition methods according to claim 1, which is characterized in that the step B specifically:

B1, one group of path integral feature is calculated to each pseudo- character sample；

B3, path integral characteristic pattern progress margin pixel is filled up.

4. person's handwriting recognition methods according to claim 3, it is characterised in that:

When the step B1 calculates path integral feature, it is assumed that a finite length stroke section P is two-dimensional spacePath, rail The mark mobile time meetsτ_iIndicate i-th intermediate of time point, and Positive integer i meets 1≤i≤k, then calculates the k rank path integral feature in time [0, T] interior PWhen P is straight line, use Δ_0,TIndicate path displacement,It can be acquired by being segmented to calculate；N rank path integral feature is calculated, is exactly path integral spy Sign does the truncation of k rank, and obtained feature set isObtain 2ⁿ⁺¹The path integral feature of dimension；

In the step B2, each dimension of path integral feature is individually become into a path integral characteristic pattern, so often One pseudo- character sample has 2ⁿ⁺¹Path integral characteristic pattern is opened, including the two-dimensional image in path itself；

In the step B3, first by path integral characteristic pattern be set as pixel value be z × z size, be placed on a pixel value be Z × Then these path integral characteristic patterns are input in deep neural network model described in step C and instruct by the center of the figure of Z Practice, z < Z≤3z.

5. person's handwriting recognition methods according to claim 1, which is characterized in that the step C specifically:

C3, when network converges to accuracy rate on training set and no longer rises, deconditioning, save deep neural network model ginseng Number.

6. person's handwriting recognition methods according to claim 5, it is characterised in that:

In the step C1, deep neural network includes five convolutional layers, there is a maximum pond layer behind each convolutional layer；

In the step C2, the training of deep neural network includes the successive ignition of two steps of forward and backward, first with it is preceding to After network obtains network error, network parameter is updated using back-propagation algorithm, continuous iteration optimization network parameter.

7. person's handwriting recognition methods according to claim 1, which is characterized in that the step D specifically:

D1, deep neural network model will be inputted by pretreated path integral characteristic image, each random shifting is calculated Except the corresponding candidate item probability tables of pseudo- character sample after stroke；

D2, it asks probability average the corresponding candidate item probability tables addition of multiple puppet character samples of each character, obtains the puppet word The candidate item probability tables of all characters of text are added the candidate for asking probability averagely to obtain the text by the candidate item probability tables of symbol Item probability tables；

8. person's handwriting recognition methods according to claim 7, which is characterized in that

In the step D1, each text dividing is multiple characters, and each puppet character generates multiple pseudo- character samples, these are pseudo- Character sample has identical label, calculates candidate item probability tables respectively as an independent sample；

In the step D2, probability averagely includes the more character probabilities of text averagely and multiple pseudo- character sample probability of character are flat ?；The candidate item for being added to obtain the character to the pseudo- each dimension of character sample candidate item probability tables of each of character is average general Rate table is added to obtain the candidate of the text by each dimension of candidate item average probability table of each character to a text chunk Item probability tables；

In the step D3, the candidate item probability tables that are averaged of the character according to text select the candidate item of highest scoring and are determined as this article This writer.

9. person's handwriting recognition methods according to claim 2, which is characterized in that when the step A2 Corner Detection, judge one A point is angle point, needs to be calculated its curvature by the coordinate of the point before and after the point, and the maximum point of local buckling degree is considered It is angle point；Assuming that (x_i,y_i) it is trajectory coordinates point after interpolation, respectively with k-th point of (x of the front and back_i-k,y_i-k) With (x_i+k,y_i+k) coordinate value calculate curvature.