CN101477426B - Method and system for recognizing hand-written character input - Google Patents

Method and system for recognizing hand-written character input Download PDF

Info

Publication number
CN101477426B
CN101477426B CN2009100364948A CN200910036494A CN101477426B CN 101477426 B CN101477426 B CN 101477426B CN 2009100364948 A CN2009100364948 A CN 2009100364948A CN 200910036494 A CN200910036494 A CN 200910036494A CN 101477426 B CN101477426 B CN 101477426B
Authority
CN
China
Prior art keywords
character
sample
center
hand
coordinate sequence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN2009100364948A
Other languages
Chinese (zh)
Other versions
CN101477426A (en
Inventor
高精鍊
黄新春
陈炳辉
胡安进
蔡沐宇
陆华兴
刘志玭
王志爱
郭方
李景平
王红辉
谭春桃
吴政维
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Guobi Technology Co Ltd
Original Assignee
Guangdong Guobi Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Guobi Technology Co Ltd filed Critical Guangdong Guobi Technology Co Ltd
Priority to CN2009100364948A priority Critical patent/CN101477426B/en
Publication of CN101477426A publication Critical patent/CN101477426A/en
Application granted granted Critical
Publication of CN101477426B publication Critical patent/CN101477426B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Character Discrimination (AREA)

Abstract

The invention discloses a method for recognizing the handwritten input. The method comprises the following steps: establishing a coarse classification module and a fine classification module; receiving signals about the input of user handwritten characters, collecting the discrete ordinate sequence of the signals, and conducting the pretreatment; conducting the feature extraction and calculation on the multidimensional characteristic vectors of handwritten input characters; matching the handwritten input characters with the coarse classification module respectively, and choosing a plurality of candidate character classes with strong similarities; matching the handwritten input characters subjected to the feature transformation with the sample center of the candidate character classes chosen from the fine classification module, and determining the most similar character classes. The invention further discloses a system for recognizing the input of handwritten characters and has the advantages of higher speed of recognizing the input of handwritten characters, and higher recognition accuracy.

Description

A kind of method and system of discerning handwriting input
Technical field
The present invention relates to the Handwritten Digits Recognition field, relate in particular to a kind of method and system of discerning handwriting input.
Background technology
The recognition technology of handwriting input at present is applied to the various communication terminals and the information processing terminal; Such end product has a hand-written video screen of the touch that is used to write usually, the user can write with pen or finger in the above, handle through the identification of terminal then, generating corresponding character is presented on the terminal, and carry out follow-up associative operation, the recognition technology of handwriting input has improved the speed and the dirigibility of input, is therefore used at large.
The identifying of present handwriting input is divided into steps such as signals collecting, pre-service, feature extraction, characteristic matching mostly, prior art is big because of calculated amount, the processing time is long, make recognition speed still slower, and accuracy of identification is not high, is still waiting further improvement.
Summary of the invention
Based on the deficiencies in the prior art, the technical problem to be solved in the present invention is to provide a kind of method and system of discerning handwriting input, makes that the feature extraction speed of hand-written character input identification is faster, and more accurate.
Purpose of the present invention is achieved through the following technical solutions: a kind of method of discerning handwriting input may further comprise the steps:
A, set up the coarse classification that first center of a sample by each character type constitutes, and the template in fine classification that constitutes by second center of a sample of each character type, described coarse classification is to choose from the multidimensional eigenvector of the sample of each character type according to the Fisher criterion to make Fisher than maximum several features value, calculates that first center of a sample of each character type obtains again; Described template in fine classification is according to Fisher criterion calculated characteristics transformation matrix, utilizes this transformation matrix that the multidimensional eigenvector of the sample of each character type is carried out eigentransformation, recomputates that the center of a sample of each character type obtains again;
B, the signal that receives the input of user's hand-written character and the discrete coordinate sequence of gathering the input character tracing point are carried out the regular coordinate sequence that pre-service obtains this character afterwards;
C, according to described regular coordinate sequence, carry out feature extraction, calculate the multidimensional eigenvector of handwriting characters;
D, according to the Fisher criterion, selected part eigenwert from the multidimensional eigenvector of described handwriting characters, described handwriting characters is mated with described coarse classification respectively, from described coarse classification, choose first center of a sample of several character types of similarity maximum, and with this several first center of a sample's corresponding characters class as candidate character classes;
E, utilize the eigentransformation matrix described in the steps A, multidimensional eigenvector to described handwriting characters carries out eigentransformation, from described template in fine classification, choose second center of a sample of the described candidate character classes correspondence that obtains among the step D, respectively with eigentransformation after handwriting characters mate, from described candidate character classes, determine the most similar character type, select for the user.
The present invention also provides a kind of system that discerns handwriting input, and it comprises:
Memory module is used to store each character type, and the coarse classification of described each character type correspondence and template in fine classification, also has the eigentransformation matrix; Described coarse classification is to choose from the multidimensional eigenvector of the sample of each character type according to the Fisher criterion to make Fisher than maximum several features value, calculates that the center of a sample of each character type obtains again; Described template in fine classification is according to Fisher criterion calculated characteristics transformation matrix, utilizes this transformation matrix that the multidimensional eigenvector of the sample of each character type is carried out eigentransformation, recomputates that the center of a sample of each character type obtains again;
First signal acquisition module is used to receive the signal of user's hand-written character input and the discrete coordinate sequence of gathering the hand-written character tracing point;
The first normalization module, the discrete coordinate sequence that is used for hand-written character tracing point that described first signal acquisition module is collected is for conversion into another discrete coordinate sequence, with the size of adjustment hand-written character and the coordinate figure of shape and center of gravity, obtain the regular coordinate sequence of this character
First characteristic extracting module is used for according to described regular coordinate sequence, and the vector line segment that all adjacent track points of hand-written character are formed decomposes eight reference directions, calculates the multidimensional eigenvector of hand-written character;
The rough sort module, be used for criterion according to Fisher, selected part eigenwert from the multidimensional eigenvector of user's handwriting characters, with described handwriting characters respectively with described memory module in coarse classification mate, therefrom choose several center of a sample of similarity maximum, and with these several center of a sample's corresponding characters classes as candidate character classes;
The disaggregated classification module, be used for utilizing the eigentransformation matrix of described memory module, multidimensional eigenvector to described handwriting characters carries out eigentransformation, from described template in fine classification, choose the center of a sample of described candidate character classes, respectively with eigentransformation after handwriting characters mate, from described candidate character classes, determine the most similar character type, select for the user.
Display module is used to show the most similar character type of described disaggregated classification module output, selects for the user.
Compared with prior art, the present invention carries out Feature Selection according to the Fisher criterion, and the aspect ratio artificial selection feature of selecting with the Fisher criterion has the better recognition precision, and calculated amount is little, thereby makes that the accuracy of character recognition is higher.
A kind of a kind of preferred implementation of discerning the method for handwriting input of the present invention is that described steps A specifically may further comprise the steps:
A1, gather the hand-written character sample of each character type, and calculate the discrete coordinate sequence of the tracing point of described hand-written character sample;
A2, the discrete coordinate sequence of the tracing point of hand-written character sample is carried out pre-service, obtain the regular coordinate sequence of this hand-written character sample;
A3, according to described regular coordinate sequence, carry out feature extraction, the vector line segment that all adjacent track points are formed decomposes eight reference directions, obtains the multidimensional eigenvector of this hand-written character sample;
A4, according to the Fisher criterion, choosing from the multidimensional eigenvector of the sample of each character type makes Fisher than the best part eigenwert, calculate first center of a sample of each character type, obtain the coarse classification that first center of a sample by each character type constitutes;
A5, according to Fisher criterion calculated characteristics transformation matrix, utilize this transformation matrix that the multidimensional eigenvector of the sample of each character type is carried out eigentransformation, the center of a sample that recomputates each character type is as second center of a sample, obtains the template in fine classification that second center of a sample by each character type constitutes.
A kind of another kind of preferred implementation of discerning the method for handwriting input of the present invention is that described steps A 3 and C specifically may further comprise the steps:
According to described regular coordinate sequence, the vector line segment that all adjacent track points are formed decomposes eight reference directions, obtains the length value of each vector line segment on each reference direction;
Length value to described each vector line segment that obtains is handled, and calculates large-scale characteristics value and small scale features value, obtains the multidimensional eigenvector that is made of large-scale characteristics value and small scale features value.
A kind of another kind of preferred implementation of discerning the method for handwriting input of the present invention is that described steps A 4 specifically may further comprise the steps:
According to the Fisher criterion, from the multidimensional eigenvector of the sample of each character type of prestoring, choose and make Fisher than maximum several features value;
According to the eigenvector of the sample that constitutes by the selected characteristic value, calculate the eigenvector of the center of a sample of this character type, obtain the coarse classification that the center of a sample by each character type constitutes.
A kind of another kind of preferred implementation of discerning the method for handwriting input of the present invention is that described steps A 5 specifically may further comprise the steps:
The eigentransformation matrix that utilization obtains according to the Fisher criterion carries out eigentransformation with the multidimensional eigenvector of the sample of all character types, reduces the dimension of its multidimensional eigenvector;
The center of a sample of all character types after the calculated characteristics conversion;
Center of a sample to described eigentransformation matrix and all character types carries out the iteration adjustment, the center of a sample that recomputates eigentransformation matrix and all character types is as second center of a sample, obtains the template in fine classification that second center of a sample by each character type constitutes.
A kind of another kind of preferred implementation of discerning the method for handwriting input of the present invention is that described step D specifically may further comprise the steps:
According to the Fisher criterion, selected part eigenwert from the multidimensional eigenvector of described handwriting characters, the eigenvector that constitutes by the partial feature value of choosing have with coarse classification in the identical dimension of first center of a sample of character type;
Described handwriting characters is mated with described coarse classification respectively, from prestored character classes, chooses first center of a sample of several character types of similarity maximum, and with this several first center of a sample's corresponding characters class as candidate character classes.
A kind of another kind of preferred implementation of discerning the method for handwriting input of the present invention is that described steps A 1 and B are specifically further comprising the steps of:
Judge whether a hand-written character input finishes, and surpasses preset threshold when the time that does not receive the hand-written character input signal, then finishes the collection of this character discrete coordinate sequence.
Whether the tracing point of the hand-written character that inspection collects has only one, gathers again if then delete this tracing point;
Coordinate distance in the tracing point of the hand-written character that detection collects between the consecutive point if this distance less than preset threshold, is then deleted wherein a bit, makes to keep certain distance between the consecutive point.
A kind of another kind of preferred implementation of discerning the method for handwriting input of the present invention is, pre-service among described steps A 2, the B is that the discrete coordinate sequence of the hand-written character tracing point that will collect is for conversion into another discrete coordinate sequence, with the size of adjustment handwriting characters and the coordinate figure of shape and center of gravity, specifically may further comprise the steps:
The abscissa value and the ordinate value of all tracing points are transformed between 0 to 100;
Calculate the barycentric coordinates value of all tracing point horizontal ordinates and ordinate respectively;
With all tracing point coordinate figures and barycentric coordinates value divided by 100, all tracing point coordinate figures and barycentric coordinates value are become between 0 to 1, utilization makes the horizontal ordinate of barycentric coordinates value and ordinate become a level and smooth continuous function of 0.5, and the discrete coordinate sequence of the hand-written character tracing point that collects is for conversion into another discrete coordinate sequence;
Again all tracing point coordinate figures be multiply by 64, obtain the regular coordinate sequence of input character.
A kind of a kind of preferred implementation of discerning the system of handwriting input of the present invention is, this system comprises that also a matching stencil obtains subsystem, and this subsystem comprises:
The secondary signal acquisition module, the discrete coordinate sequence that is used to receive the signal of hand-written character sample and gathers the hand-written character tracing point;
The second normalization module, the discrete coordinate sequence that is used for hand-written character tracing point that described secondary signal acquisition module is collected is for conversion into another discrete coordinate sequence, with the size of adjustment hand-written character and the coordinate figure of shape and center of gravity, obtain the regular coordinate sequence of this character;
Second characteristic extracting module is used for according to described regular coordinate sequence, and the vector line segment that all adjacent track points of hand-written character are formed decomposes eight reference directions, calculates the multidimensional eigenvector of hand-written character;
The coarse classification computing module, the multidimensional eigenvector that is used for the sample of each character type of drawing from described second characteristic extracting module, choose and make Fisher than the best part eigenwert, calculate the center of a sample of each character type, obtain the coarse classification that the center of a sample by each character type constitutes, and store this coarse classification into described memory module;
The template in fine classification computing module, be used for according to Fisher criterion calculated characteristics transformation matrix, utilize this transformation matrix that the multidimensional eigenvector of the sample of each character type is carried out eigentransformation, recomputate the center of a sample of each character type, obtain the template in fine classification that the center of a sample by each character type constitutes, and with this template in fine classification and described eigentransformation matrix stores to described memory module.
A kind of another kind of preferred implementation of discerning the system of handwriting input of the present invention is that described first signal acquisition module and secondary signal acquisition module specifically comprise:
Collecting unit, the discrete coordinate sequence that is used to receive the signal of hand-written character and gathers the hand-written character tracing point;
Judging unit is used to judge whether a hand-written character input finishes, and surpasses preset threshold when the time that does not receive the hand-written character input signal, then finishes the collection of this character discrete coordinate sequence;
Detecting unit is used to check whether the tracing point of the hand-written character that collects has only one, gathers again if then delete this tracing point; And detect the coordinate distance between the consecutive point in the tracing point of the hand-written character collect, if this distance less than preset threshold, is then deleted wherein a bit, make to keep certain distance between the consecutive point.
Description of drawings
Fig. 1 is a kind of process flow diagram of discerning hand-written inputting method of the embodiment of the invention.
Fig. 2 is a kind of detail flowchart of discerning hand-written inputting method of the embodiment of the invention.
Fig. 3 is a kind of structural representation of discerning hand-written input system of the embodiment of the invention.
Fig. 4 is a kind of detailed structure synoptic diagram of discerning hand-written input system of the embodiment of the invention.
Fig. 5 a is the synoptic diagram before the character boundary shape adjustments among the step S02 among Fig. 1.
Fig. 5 b is the synoptic diagram after the character boundary shape adjustments among the step S02 among Fig. 1.
Fig. 6 is the synoptic diagram that step S02 puts adjusted character into a grid among Fig. 1.
Fig. 7 is 8 reference direction figure described in the step S03 among Fig. 1.
Fig. 8 is the synoptic diagram that a vector line segment is decomposed 2 reference directions described in the step S03 among Fig. 1.
Embodiment
For making the present invention easier to understand, the present invention is further elaborated in conjunction with the accompanying drawings, but the embodiment in the accompanying drawing does not constitute any limitation of the invention.
The present invention is the character process coordinate sequence collection to handwriting input, treatment schemees such as pre-service, eigenvector extraction, rough sort, disaggregated classification, thus finally discern this hand-written character.
Fig. 1 shows a kind of process flow diagram of discerning hand-written inputting method of the embodiment of the invention, and the method that wherein obtains character matching stencil may further comprise the steps:
Step S01 gathers the hand-written character sample of all each character types, and calculates the discrete coordinate sequence of the tracing point of described hand-written character sample; Write on touch-screen with pen particularly, (x, y) coordinate sequence is noted the position of nib when starting writing.The complete handwriting trace of an input character with one (x, y) sequence is represented: (x1, y1), (x2, y2) ..., (xn, yn) }.
Step S02 is for conversion into another discrete coordinate sequence with the discrete coordinate sequence of the tracing point of hand-written character sample, carries out the regular coordinate sequence that pre-service obtains this hand-written character sample; Particularly, utilize a level and smooth continuous function, the discrete coordinate sequence of the hand-written character tracing point that collects is for conversion into another discrete coordinate sequence,, obtains the regular coordinate sequence of this hand-written character with the size of adjusting hand-written character and the coordinate figure of shape and center of gravity; The normalization of character track has two purposes: size criteriaization and shape correction.Shown in Fig. 5 a-5b, character among Fig. 5 a is through having become the shape among Fig. 5 b after the normalization, not only the border has become the size (all characters border after normalization becomes identical size) of regulation, and variation has also taken place shape, the more standard that becomes, thereby easier identification.
Normalization is by two coordinate conversion function x '=x aAnd y '=y bRealize, the coordinate of every bit in the character track (x, y) replace with (x ', y ') after, just obtain normalized character track.
The method of estimation of parameter a and b is as follows:
At first, find out the minimum value of x and y in the coordinate sequence, x that is had a few and y coordinate are deducted the minimum value of x and y respectively, thereby make the minimum value of x and y all become 0.Then, all x and y are on duty with 100/u, wherein u be the maximal value of an x and y is arranged, thereby make x and y value between 0 to 100.
Second the step, ask stroke track in the horizontal direction with vertical direction on projection.The stroke track is put into the grid of a 100x100, (being the grid of 10x10 in the synoptic diagram) as shown in Figure 6.The stroke length addition in each file grid, just obtain the projection fx (i) of horizontal direction, i=1,2 ..., 100.Equally,, just obtain the projection fy (i) of vertical direction the stroke length addition in each row grid, i=1,2 ..., 100. centers of gravity by fx (i) calculated level direction:
x c = Σ i = 1 100 i × f x ( i ) Σ i = 1 100 f x ( i ) - - - ( 1 )
Equally, calculate the center of gravity yc of vertical direction by fy (i).
The 3rd step is the coordinate of being had a few and (xc yc) divided by 100, becomes between 0 to 1.Function x '=x aAnd y '=y bRespectively xc and yc are become 0.5, i.e. x c a=0.5,
Figure GSB00000329839400092
Same y c b=0.5,
Figure GSB00000329839400093
Through conversion, the center of gravity of character track is moved on to (0.5,0.5) and the border remains unchanged.
In the 4th step, (x ', y ') be multiply by a given multiple, thereby make the housing of character become the size of regulation.We are decided to be 64 this multiple.At last, the coordinate of being had a few in the normalization character track is between 0 to 64.
Step S03 according to described regular coordinate sequence, carries out feature extraction, and the vector line segment that all adjacent track points are formed decomposes eight reference directions, obtains the multidimensional eigenvector of this hand-written character sample; Basic thought is as follows: as shown in Figure 7, stroke line segment (being linked to be a vector line segment between every adjacent 2) is decomposed 8 reference directions of D1 to D8, the line segment length value of all directions in each grid in the record 64x64 grid is calculated the direction character value of two yardsticks then.
The first step decomposes 8 reference directions to the stroke line segment.Being linked to be a line segment between every adjacent 2 in the coordinate point sequence, is a directive vector f iThis vector f iDirection between two reference direction D2 and D3, vector f iResolve into the component (as shown in Figure 8) on two reference direction D2 and the D3, the component length on each reference direction counts the line segment length value of this direction in the grid of place.Like this, on each direction of 8 directions, obtain 64x64 line segment length value.
In second step, calculate large-scale characteristics.64x64 grid on each direction evenly is divided into 4x4 grid, calculate all directions in each grid reach the standard grade length value and, obtain 8x4x4=128 eigenwert.
In the 3rd step, calculate small scale features.64x64 grid on each direction evenly is divided into 8x8 grid, calculate all directions in each grid reach the standard grade length value and, obtain 8x8x8=512 eigenwert.
Total number of large-scale characteristics and small scale features is 128+512=640.
Step S04, selected part eigenwert from the multidimensional eigenvector of the sample of all each character types is calculated the center of a sample of each character type, obtains the coarse classification that the center of a sample by each character type constitutes; Particularly, according to the Fisher criterion, from the multidimensional eigenvector of the sample of each character type, choose and make Fisher than maximum several features value, calculate the center of a sample of each character type, obtain the coarse classification that the center of a sample by each character type constitutes.
The purpose of this step is to obtain coarse classification from the eigenvector of the sample of prestored character classes, in order to improve the speed of rough sort, select a part of feature calculation matching distance, and feature selecting and masterplate design are closed at a training sample set and carried out.Training sample set comprises the handwriting samples of each character class, each sample through feature extraction with 640 eigenwerts (640 the dimension eigenvector x=[x 1..., x 640] T) expression.Be provided with the sample of N altogether of C classification, wherein classification i has Ni sample.Selecting the criterion of feature is Fisher criterion (on the pattern-recognition teaching material detailed description being arranged): the basic thought of Fisher criterion function is, the structure evaluation function, make that the distance between the classification that is classified is big as far as possible when evaluation function is optimum, distance is as far as possible little between all kinds of internal specimens simultaneously.
J schedule of samples of i class is shown eigenvector
Figure GSB00000329839400101
(being made up of the part candidate feature), then the center of a sample of each classification (average) is
μ i = 1 N i Σ j = 1 N i x j i , i = 1 , . . . , C - - - ( 2 )
Total center is
Figure GSB00000329839400111
Covariance matrix is calculated as respectively between interior covariance matrix of class and class:
S w = 1 N Σ i = 1 C Σ j = 1 N i ( x j i - μ i ) ( x j i - μ i ) T - - - ( 3 )
S b = 1 N Σ i = 1 C N i ( μ i - μ 0 ) ( μ i - μ 0 ) T - - - ( 4 )
The target of feature selecting is on the basis of selecting Partial Feature, matrix
Figure GSB00000329839400114
Mark
Figure GSB00000329839400115
(Fisher ratio) reaches maximal value.Here Candidate feature changes in feature selection process.Seeking Fisher is a combinatorial optimization problem than maximum characteristics combination, available order sweep forward method approximate solution:
Calculate the Fisher ratio of each feature earlier, select Fisher than maximum feature.Then in the remaining feature each is calculated the Fisher ratio with selecting feature composition characteristic vector successively, select Fisher to add and has selected feature than maximum feature.So repeatedly, till selecting feature to reach specified number or amount (being decided to be below 100).
The detailed process of feature selecting is as follows: at first with in 640 features each successively as the candidate, calculate the Fisher ratio, with Fisher than a maximum feature as first feature of electing.Then each feature of electing with the first time successively in 639 remaining features (this moment, candidate feature had two) is estimated, selected Fisher than the maximum combination that contains 2 eigenwerts.Then in 638 remaining features each is estimated with combine (this moment, candidate feature had 3) of containing 2 eigenwerts elected previously successively, selected Fisher than the maximum combination that contains 3 eigenwerts.So repeatedly, till the characteristic number of electing reaches the number of appointment.After feature selecting was finished, feature set was also just fixing.
Through after the feature selecting, the coarse classification of each classification is the center (average) of a class sample, calculates with formula (2).
Step S05, according to Fisher criterion calculated characteristics transformation matrix, utilize this transformation matrix that the multidimensional eigenvector of the sample of each character type is carried out eigentransformation, recomputate the center of a sample of each character type, obtain the template in fine classification that the center of a sample by each character type constitutes; In order to obtain higher accuracy of identification, the disaggregated classification feature is taked eigentransformation, rather than feature selecting, (vector of d<D), the eigenvector dimension after the eigentransformation generally is decided to be between 100 to 150 promptly the eigenvector of original D=640 dimension to be obtained low-dimensional through linear conversion.Utilize formula: y=Wx carries out eigentransformation, and wherein W is the transformation matrix of dxD.Finding the solution transformation matrix makes Fisher compare tr[(WS wW T) -1W TS bW T] maximum, its result, each row of W is a matrix
Figure GSB00000329839400121
D the latent vector (this is the mathematical method of standard, needn't give unnecessary details) of corresponding eigenvalue maximum.Through behind the dimensionality reduction, the masterplate of each classification is the center (formula (2)) of a class sample.
The eigentransformation matrix and the classification masterplate that obtain as top can't obtain very high accuracy of identification.For this reason, transformation matrix and classification masterplate are carried out the iteration adjustment, the classification error (each sample is assigned to nearest classification) of closing at training sample set is gradually reduced.At first, the weight of all training samples is made as 1, transformation matrix that obtains with the Fisher criterion and classification center masterplate are to all training sample classification, and each wrong sample weights of dividing adds 1.If sample
Figure GSB00000329839400122
The weight table of (j sample of i class) is shown
Figure GSB00000329839400123
Recomputate in class center, the class and covariance matrix between class by following formula:
μ i = 1 Σ j = 1 N i v j i Σ j = 1 N i v j i x j i - - - ( 5 )
μ 0 = 1 Σ i = 1 C Σ j = 1 N i v j i Σ i = 1 C Σ j = 1 N i v j i x j i - - - ( 6 )
S w = 1 Σ i = 1 C Σ j = 1 N i v j i Σ i = 1 C Σ j = 1 N i v j i ( x j i - μ i ) ( x j i - μ i ) T - - - ( 7 )
S b = 1 Σ i = 1 C V i Σ i = 1 C V i ( μ i - μ 0 ) ( μ i - μ 0 ) T , Wherein V i = Σ j = 1 N i v j i - - - ( 8 )
On this basis by making tr[(WS wW T) -1W TS bW T] maximization recomputates the class center after transformation matrix and the eigentransformation, again to the training sample classification, wrong sample weights of dividing adds 1; So repeatedly, till the classification error of training sample no longer further reduces.
By above-mentioned method step, coarse classification and the template in fine classification that to use in the handwriting recognition process have been obtained; The flow process of handwriting characters identification may further comprise the steps:
Step S06 receives the signal of user's hand-written character input and the discrete coordinate sequence of gathering the input character tracing point, carries out the regular coordinate sequence that pre-service obtains this character afterwards;
The processing procedure of this step and above-mentioned step S01 and S02 are similar, at first with the pen on touch-screen, write, when starting writing the position of nib (x, y) coordinate sequence is noted; The complete handwriting trace of an input character with one (x, y) sequence is represented: (x1, y1), (x2, y2) ..., (xn, yn) }.
Utilize a level and smooth continuous function then, the discrete coordinate sequence of the hand-written character tracing point that collects is for conversion into another discrete coordinate sequence, with the size of adjustment hand-written character and the coordinate figure of shape and center of gravity, obtain the regular coordinate sequence of this hand-written character; The normalization of character track has two purposes: size criteriaization and shape correction.Shown in Fig. 5 a-5b, character among Fig. 5 a is through having become the shape among Fig. 5 b after the normalization, not only the border has become the size (all characters border after normalization becomes identical size) of regulation, and variation has also taken place shape, the more standard that becomes, thereby easier identification.Normalization is by two coordinate conversion function x '=x aAnd y '=y bRealize, the coordinate of every bit in the character track (x, y) replace with (x ', y ') after, just obtain normalized character track.The method of estimation of parameter a and b is identical with the method described in the step S02.
Step S07 according to described regular coordinate sequence, carries out the multidimensional eigenvector of feature extraction and calculation handwriting characters; Concrete characteristic extraction procedure is identical with the process described in the step S03, as shown in Figure 7, stroke line segment (being linked to be a vector line segment between every adjacent 2) is decomposed 8 reference directions of D1 to D8, the line segment length value of all directions in each grid in the record 64x64 grid is calculated the direction character value of two yardsticks then.
The first step decomposes 8 reference directions to the stroke line segment.Being linked to be a line segment between every adjacent 2 in the coordinate point sequence, is a directive vector f iThis vector f iDirection between two reference direction D2 and D3, vector f iResolve into the component (as shown in Figure 8) on two reference direction D2 and the D3, the component length on each reference direction counts the line segment length value of this direction in the grid of place.Like this, on each direction of 8 directions, obtain 64x64 line segment length value.
In second step, calculate large-scale characteristics.64x64 grid on each direction evenly is divided into 4x4 grid, calculate all directions in each grid reach the standard grade length value and, obtain 8x4x4=128 eigenwert.
In the 3rd step, calculate small scale features.64x64 grid on each direction evenly is divided into 8x8 grid, calculate all directions in each grid reach the standard grade length value and, obtain 8x8x8=512 eigenwert.
Total number of large-scale characteristics and small scale features is 128+512=640.
Step S08, according to the Fisher criterion, selected part eigenwert from the multidimensional eigenvector of described handwriting characters, described handwriting characters is mated with coarse classification respectively, from coarse classification, choose several center of a sample of similarity maximum, and with these several center of a sample's corresponding characters classes as candidate character classes; Particularly, according to the Fisher criterion, from the multidimensional eigenvector of handwriting characters, choose and make Fisher than maximum several features value, the eigenwert number of choosing among the eigenwert number of choosing and the step S04 is identical.
The distance calculation of masterplate coupling is as follows: the multidimensional eigenvector of establishing handwriting characters is expressed as vector x=[x 1..., x n] T, the center of a sample of a classification is expressed as eigenvector y=[y in the coarse classification 1..., y n] T, then calculate matching distance by following formula:
d ( x , y ) = Σ i = 1 n | x i - y i | - - - ( 9 )
Step S09, utilize the eigentransformation matrix that calculates according to the Fisher criterion among the step S05, multidimensional eigenvector to described handwriting characters carries out eigentransformation, from described template in fine classification, choose the center of a sample of described candidate character classes, respectively with eigentransformation after handwriting characters mate, from described candidate character classes, determine the most similar character type, select for the user; Return step S06 then, carry out the input identification of next hand-written character.
The purpose of this step is to carry out disaggregated classification, to an input character, after step S08 rough sort is found out M candidate's classification, disaggregated classification adopts than the more feature of rough sort, utilize formula (9) to recomputate the distance of input character, get nearest classification as final recognition result to M candidate's classification masterplate.
Disaggregated classification provides a plurality of (being generally 10) classification of matching distance minimum as final candidate.These candidate's classifications can directly show for the user and select, or utilize language rule based on context to select automatically.
The rough sort of step S08 is that the masterplate of each character class of storing in the eigenvector of input character (character to be identified) and the masterplate database is compared (coupling), the individual classification of M (such as M=100) of finding out distance minimum (similarity maximum just) is found out the minimum candidate's classification of distance as final recognition result again as the candidate in the disaggregated classification of step S09.
Rough sort different with the masterplate that disaggregated classification is compared (feature is also different): coarse classification simple (feature is few), computing velocity is fast, template in fine classification complexity (feature is more), computing velocity is slower.
The purpose of rough sort is after finding out M candidate's classification fast, needn't calculate the distance (only calculating the distance of M candidate's classification) of all categories in the disaggregated classification, thereby improve overall recognition speed.
Give an example and illustrate: have character type to be expressed as: R, S, T, U, V ..., its corresponding coarse classification is expressed as: R1, S1, T1, U1, V1 ..., its corresponding template in fine classification is expressed as: R2, S2, T2, U2, V2 This moment, the user imported a hand-written character Z, mated with coarse classification through after the Feature Selection, obtained several center of a sample of similarity maximum: R1, S1, T1, therefore choose R, S, T as candidate character classes; From template in fine classification, choose candidate character classes R, S, T corresponding sample center: R2, S2, T2, and then will mate with R2, S2, T2 after the hand-written character Z eigentransformation, that obtain the matching distance minimum is S2, this moment just with character type S as last identifying object.
In sum, Fig. 2 shows a kind of detail flowchart of discerning hand-written inputting method of the embodiment of the invention;
Step S05 specifically may further comprise the steps:
Step S051 utilizes the eigentransformation matrix that obtains according to the Fisher criterion, and the sample of all character types is carried out eigentransformation, reduces the dimension of its multidimensional eigenvector;
Step S052 recomputates the center of a sample of all character types after the eigentransformation;
Step S053 carries out the iteration adjustment to the center of a sample of described eigentransformation matrix and all character types, recomputates the center of a sample of eigentransformation matrix and all character types, obtains the template in fine classification that the center of a sample by each character type constitutes.
Described step S06 specifically may further comprise the steps:
Step S061 receives the signal of hand-written character input and the discrete coordinate sequence of gathering the input character tracing point;
Step S062 judges whether a character input finishes, and surpasses preset threshold when the time that does not receive the hand-written character input signal, then finishes the collection of this character discrete coordinate sequence; When pen lifting time surpasses a threshold value (as 0.5 second), be considered as a wordbook and write end; The complete handwriting trace of an input character with one (x, y) sequence is represented: (x1, y1), (x2, y2) ..., (xn, yn) }, wherein, start writing and represent with a special coordinate (1,0).
Step S063 checks whether the tracing point of the hand-written character that collects has only one, gathers again if then delete this tracing point;
Step S064, coordinate distance in the tracing point of the hand-written character that detection collects between the consecutive point is if this distance is less than preset threshold, if i.e. two consecutive point position coincidences or very tight from getting, then wherein any of deletion makes to keep certain distance between the consecutive point;
Step S065, utilize a level and smooth continuous function, the discrete coordinate sequence of the hand-written character tracing point that collects is for conversion into another discrete coordinate sequence,, obtains the regular coordinate sequence of this character with the size of adjusting handwriting characters and the coordinate figure of shape and center of gravity.
Described step S07 specifically may further comprise the steps:
Step S071, according to the regular coordinate sequence of hand-written character, the vector line segment that all adjacent track points are formed decomposes 8 reference directions (shown in Fig. 7 and 8), obtains the line of vector length value on each reference direction;
Step S072 handles the described line of vector length value that obtains, and obtains the multidimensional eigenvector that large-scale characteristics value and small scale features value constitute.
Described step S08 specifically may further comprise the steps:
Step S081, according to the Fisher criterion, selected part eigenwert from the multidimensional eigenvector of described handwriting characters, described handwriting characters has the eigenvector that is made of the selected characteristic value with the sample same dimension of character type;
Step S082 mates described handwriting characters respectively with described coarse classification, choose the plurality of candidate character classes of similarity maximum from prestored character classes.
Described step S09 specifically may further comprise the steps:
Step S091 carries out eigentransformation with the adjusted eigentransformation matrix of iteration to handwriting characters, obtains its low-dimensional eigenvector;
Step S092, the low-dimensional eigenvector of this handwriting characters mates with the center of a sample of the candidate character classes of choosing from described template in fine classification respectively, determines the character type of similarity maximum from candidate character classes, selects for the user.
Fig. 3 shows a kind of structural representation of discerning the system of handwriting input of the embodiment of the invention.This system comprises:
First signal acquisition module 1 is used to receive the signal of hand-written character input and the discrete coordinate sequence of gathering this hand-written character tracing point;
The first normalization module 2, be used to utilize a level and smooth continuous function, the discrete coordinate sequence of the hand-written character tracing point that collects is for conversion into another discrete coordinate sequence,, obtains the regular coordinate sequence of this hand-written character with the size of adjusting hand-written character and the coordinate figure of shape and center of gravity;
First characteristic extracting module 3 is used for the regular coordinate sequence according to hand-written character, and the vector line segment that all adjacent track points are formed decomposes eight reference directions, obtains the multidimensional eigenvector of hand-written character;
Memory module 4 is used to store each character type, and the coarse classification of described each character type correspondence and template in fine classification, the eigentransformation matrix that also with good grounds Fisher criterion calculates;
Rough sort module 5, be used for criterion according to Fisher, selected part eigenwert from the multidimensional eigenvector of handwriting characters, with described handwriting characters respectively with described memory module 4 in coarse classification mate, from prestored character classes, choose the plurality of candidate character classes of similarity maximum;
Disaggregated classification module 6, be used for the multidimensional eigenvector of handwriting characters is carried out eigentransformation, the center of a sample of the candidate character classes chosen in the handwriting characters after the eigentransformation and the template in fine classification from described memory module 4 is mated, therefrom determine the most similar character type, select for the user.
Fig. 4 shows a kind of detailed structure synoptic diagram of discerning the system of handwriting input of the embodiment of the invention; Wherein matching stencil acquisition subsystem X1 comprises:
Secondary signal acquisition module 11, the discrete coordinate sequence that is used to receive the signal of hand-written character sample and gathers the hand-written character tracing point;
The second normalization module 21, the discrete coordinate sequence that is used for hand-written character tracing point that described secondary signal acquisition module 11 is collected is for conversion into another discrete coordinate sequence, with the size of adjustment hand-written character and the coordinate figure of shape and center of gravity, obtain the regular coordinate sequence of this character;
Second characteristic extracting module 31 is used for according to described regular coordinate sequence, and the vector line segment that all adjacent track points of hand-written character are formed decomposes eight reference directions, calculates the multidimensional eigenvector of hand-written character;
Coarse classification computing module 51, the multidimensional eigenvector that is used for the sample of each character type of drawing from described characteristic extracting module, the selected part eigenwert, calculate the center of a sample of each character type, obtain the coarse classification that the center of a sample by each character type constitutes, and store this coarse classification into described memory module 4;
Template in fine classification computing module 61, be used for according to Fisher criterion calculated characteristics transformation matrix, utilize this transformation matrix that the multidimensional eigenvector of the sample of each character type is carried out eigentransformation, recomputate the center of a sample of each character type, obtain the template in fine classification that the center of a sample by each character type constitutes, and with this template in fine classification and described eigentransformation matrix stores to described memory module 4.
Wherein first and second signal acquisition module 1 and 11 specifically comprise:
Collecting unit 101, the discrete coordinate sequence that is used to gather the hand-written character tracing point;
Judging unit 102 is used to judge whether a hand-written character input finishes, and surpasses preset threshold when the time that does not receive the hand-written character input signal, then finishes the collection of this hand-written character discrete coordinate sequence;
Detecting unit 103 is used to check whether the tracing point of the hand-written character that collects has only one, gathers again if then delete this tracing point; And detect the coordinate distance between the consecutive point in the tracing point of the hand-written character collect, if this distance less than preset threshold, is then deleted wherein a bit, make to keep certain distance between the consecutive point.
System also comprises a display module 7, is used to show the most similar character type of described disaggregated classification module 6 outputs, selects for the user.
Compare with existing method, the technology in normalization, feature extraction, rough sort, disaggregated classification has some differences:
Normalization: coordinate transform function x '=x aAnd y '=y bBe new the proposition, the benefit of doing like this is that the function of coordinate transform is level and smooth continuous function, thereby makes the character shape after the conversion more natural, guarantees the center (0.5) of the gravity-center-change of character track to housing simultaneously.Also handlebar character center of gravity was mapped to the way at housing center in the past, but usefulness is piecewise linear function, and the character shape after the conversion is nature, also influences the identification of back.
Feature extraction: the stroke track line segment directly decomposes 8 directions.Existing method is earlier track to be become piece image, and the pixel travel direction in the image is decomposed, and the calculated amount of doing like this is bigger, and the image that generates has distortion.Our method has been avoided generating the calculated amount that image increased, and the direction character that obtains is more accurate.
Rough sort: existing method generally is manually to choose a part of feature (as direct use large-scale characteristics) or adopt eigentransformation.And the aspect ratio artificial selection feature that we select with the Fisher criterion has the better recognition precision, compares with eigentransformation and has reduced calculated amount (because not having linear transformation).
Disaggregated classification: the iteration adjustment of eigentransformation matrix and classification center masterplate can obviously improve accuracy of identification.Existing method is the transformation matrix that directly adopts the Fisher criterion to obtain, and (Learning Vector Quantization, LVQ) algorithm is adjusted classification center masterplate to adopt the study vector quantization then on transform characteristics.Our method can obtain higher accuracy of identification by adjusting transformation matrix and classification center masterplate simultaneously.
This method is applicable to the identification of Chinese, English, numeral, symbol.
One, input character track:
Figure GSB00000329839400201
Two, the track after the normalization:
Figure GSB00000329839400202
Three, 10 candidate's classifications and the matching distance (selecting computed range on the features) thereof selected of rough sort at 60
Instigate Sound of sighing What Pyridine Call out Furan Smell Cry Larynx Mark
597 621 643 676 689 698 715 732 764 771
Four, disaggregated classification is to the rearrangement and the distance (computed range on 120 transform characteristics) thereof of 10 candidate's classifications
Sound of sighing Pyridine Call out Instigate What Cry Furan Larynx Smell Mark
1079 1121 1157 1186 1233 1298 1374 1419 1462 1503
Five, last recognition result is " sound of sighing ".
The above is a preferred implementation of the present invention; certainly can not limit the present invention's interest field with this; should be understood that; for those skilled in the art; under the prerequisite that does not break away from the principle of the invention; can also make some improvement and change, these improvement and change also are considered as protection scope of the present invention.

Claims (11)

1. a method of discerning handwriting input is characterized in that, may further comprise the steps:
A, set up the coarse classification that first center of a sample by each character type constitutes, and the template in fine classification that constitutes by second center of a sample of each character type, described coarse classification is to choose from the multidimensional eigenvector of the sample of each character type according to the Fisher criterion to make Fisher than maximum several features value, calculates that first center of a sample of each character type obtains again; Described template in fine classification is according to Fisher criterion calculated characteristics transformation matrix, utilizes this transformation matrix that the multidimensional eigenvector of the sample of each character type is carried out eigentransformation, recomputates that second center of a sample of each character type obtains again;
B, the signal that receives the input of user's hand-written character and the discrete coordinate sequence of gathering the input character tracing point are carried out the regular coordinate sequence that pre-service obtains this character afterwards;
C, according to described regular coordinate sequence, carry out feature extraction, calculate the multidimensional eigenvector of handwriting characters;
D, according to the Fisher criterion, selected part eigenwert from the multidimensional eigenvector of described handwriting characters, described handwriting characters is mated with described coarse classification respectively, from described coarse classification, choose the center of a sample of several character types of similarity maximum, and with this several first center of a sample's corresponding characters class as candidate character classes;
E, utilize the eigentransformation matrix described in the steps A, multidimensional eigenvector to described handwriting characters carries out eigentransformation, from described template in fine classification, choose second center of a sample of the described candidate character classes correspondence that obtains among the step D, respectively with eigentransformation after handwriting characters mate, from described candidate character classes, determine the most similar character type, select for the user.
2. a kind of method of discerning handwriting input according to claim 1 is characterized in that described steps A specifically may further comprise the steps:
A1, gather the hand-written character sample of each character type, and calculate the discrete coordinate sequence of the tracing point of described hand-written character sample;
A2, the discrete coordinate sequence of the tracing point of hand-written character sample is carried out pre-service, obtain the regular coordinate sequence of this hand-written character sample;
A3, according to described regular coordinate sequence, carry out feature extraction, the vector line segment that all adjacent track points are formed decomposes eight reference directions, obtains the multidimensional eigenvector of this hand-written character sample;
A4, according to the Fisher criterion, choosing from the multidimensional eigenvector of the sample of each character type makes Fisher than the best part eigenwert, calculate first center of a sample of each character type, obtain the coarse classification that first center of a sample by each character type constitutes;
A5, according to Fisher criterion calculated characteristics transformation matrix, utilize this transformation matrix that the multidimensional eigenvector of the sample of each character type is carried out eigentransformation, the center of a sample that recomputates each character type is as second center of a sample, obtains the template in fine classification that second center of a sample by each character type constitutes.
3. a kind of method of discerning handwriting input according to claim 2 is characterized in that, described steps A 3 and C specifically may further comprise the steps:
According to described regular coordinate sequence, the vector line segment that all adjacent track points are formed decomposes eight reference directions, obtains the length value of each vector line segment on each reference direction;
Length value to described each vector line segment that obtains is handled, and calculates large-scale characteristics value and small scale features value, obtains the multidimensional eigenvector that is made of large-scale characteristics value and small scale features value.
4. a kind of method of discerning handwriting input according to claim 2 is characterized in that, described steps A 4 specifically may further comprise the steps:
According to the Fisher criterion, from the multidimensional eigenvector of the sample of each character type of prestoring, choose and make Fisher than maximum several features value;
According to the eigenvector of the sample that constitutes by the selected characteristic value, calculate the eigenvector of the center of a sample of this character type, obtain the coarse classification that the center of a sample by each character type constitutes.
5. a kind of method of discerning handwriting input according to claim 2 is characterized in that, described steps A 5 specifically may further comprise the steps:
The eigentransformation matrix that utilization obtains according to the Fisher criterion carries out eigentransformation with the multidimensional eigenvector of the sample of all character types, reduces the dimension of its multidimensional eigenvector;
The center of a sample of all character types after the calculated characteristics conversion;
Center of a sample to described eigentransformation matrix and all character types carries out the iteration adjustment, the center of a sample that recomputates eigentransformation matrix and all character types is as second center of a sample, obtains the template in fine classification that second center of a sample by each character type constitutes.
6. a kind of method of discerning handwriting input according to claim 1 is characterized in that described step D specifically may further comprise the steps:
According to the Fisher criterion, selected part eigenwert from the multidimensional eigenvector of described handwriting characters, the eigenvector that constitutes by the partial feature value of choosing have with coarse classification in the identical dimension of first center of a sample of character type;
Described handwriting characters is mated with described coarse classification respectively, from prestored character classes, chooses first center of a sample of several character types of similarity maximum, and with this several first center of a sample's corresponding characters class as candidate character classes.
7. a kind of method of discerning handwriting input according to claim 2 is characterized in that, described steps A 1 and B are specifically further comprising the steps of:
Judge whether a hand-written character input finishes, and surpasses preset threshold when the time that does not receive the hand-written character input signal, then finishes the collection of this character discrete coordinate sequence.
Whether the tracing point of the hand-written character that inspection collects has only one, gathers again if then delete this tracing point;
Coordinate distance in the tracing point of the hand-written character that detection collects between the consecutive point if this distance less than preset threshold, is then deleted wherein a bit, makes to keep certain distance between the consecutive point.
8. a kind of method of discerning handwriting input according to claim 2, it is characterized in that, pre-service among described steps A 2, the B is that the discrete coordinate sequence of the hand-written character tracing point that will collect is for conversion into another discrete coordinate sequence, with the size of adjustment handwriting characters and the coordinate figure of shape and center of gravity, specifically may further comprise the steps:
The abscissa value and the ordinate value of all tracing points are transformed between 0 to 100;
Calculate the barycentric coordinates value of all tracing point horizontal ordinates and ordinate respectively;
With all tracing point coordinate figures and barycentric coordinates value divided by 100, all tracing point coordinate figures and barycentric coordinates value are become between 0 to 1, utilization makes the horizontal ordinate of barycentric coordinates value and ordinate become a level and smooth continuous function of 0.5, and the discrete coordinate sequence of the hand-written character tracing point that collects is for conversion into another discrete coordinate sequence;
Again all tracing point coordinate figures be multiply by 64, obtain the regular coordinate sequence of input character.
9. system that discerns handwriting input is characterized in that it comprises:
Memory module is used to store each character type, and the coarse classification of described each character type correspondence and template in fine classification, also has the eigentransformation matrix; Described coarse classification is to choose from the multidimensional eigenvector of the sample of each character type according to the Fisher criterion to make Fisher than maximum several features value, calculates that the center of a sample of each character type obtains again; Described template in fine classification is according to Fisher criterion calculated characteristics transformation matrix, utilizes this transformation matrix that the multidimensional eigenvector of the sample of each character type is carried out eigentransformation, recomputates that the center of a sample of each character type obtains again;
First signal acquisition module is used to receive the signal of user's hand-written character input and the discrete coordinate sequence of gathering the hand-written character tracing point;
The first normalization module, the discrete coordinate sequence that is used for hand-written character tracing point that described first signal acquisition module is collected is for conversion into another discrete coordinate sequence, with the size of adjustment hand-written character and the coordinate figure of shape and center of gravity, obtain the regular coordinate sequence of this character;
First characteristic extracting module is used for according to described regular coordinate sequence, and the vector line segment that all adjacent track points of hand-written character are formed decomposes eight reference directions, calculates the multidimensional eigenvector of hand-written character;
The rough sort module, be used for criterion according to Fisher, selected part eigenwert from the multidimensional eigenvector of user's handwriting characters, with described handwriting characters respectively with described memory module in coarse classification mate, therefrom choose several center of a sample of similarity maximum, and with these several center of a sample's corresponding characters classes as candidate character classes;
The disaggregated classification module, be used for utilizing the eigentransformation matrix of described memory module, multidimensional eigenvector to described handwriting characters carries out eigentransformation, from described template in fine classification, choose the center of a sample of described candidate character classes, respectively with eigentransformation after handwriting characters mate, from described candidate character classes, determine the most similar character type, select for the user;
Display module is used to show the most similar character type of described disaggregated classification module output, selects for the user.
10. a kind of system that discerns handwriting input according to claim 9, this system comprise that also a matching stencil obtains subsystem, and this subsystem comprises:
The secondary signal acquisition module, the discrete coordinate sequence that is used to receive the signal of hand-written character sample and gathers the hand-written character tracing point;
The second normalization module, the discrete coordinate sequence that is used for hand-written character tracing point that described secondary signal acquisition module is collected is for conversion into another discrete coordinate sequence, with the size of adjustment hand-written character and the coordinate figure of shape and center of gravity, obtain the regular coordinate sequence of this character;
Second characteristic extracting module is used for according to described regular coordinate sequence, and the vector line segment that all adjacent track points of hand-written character are formed decomposes eight reference directions, calculates the multidimensional eigenvector of hand-written character;
The coarse classification computing module, be used for criterion according to Fisher, the multidimensional eigenvector of the sample of each character type that draws from described second characteristic extracting module, choose and make Fisher than the best part eigenwert, calculate the center of a sample of each character type, obtain the coarse classification that the center of a sample by each character type constitutes, and store this coarse classification into described memory module;
The template in fine classification computing module, be used for according to Fisher criterion calculated characteristics transformation matrix, utilize this transformation matrix that the multidimensional eigenvector of the sample of each character type is carried out eigentransformation, recomputate the center of a sample of each character type, obtain the template in fine classification that the center of a sample by each character type constitutes, and with this template in fine classification and described eigentransformation matrix stores to described memory module.
11. a kind of system that discerns handwriting input according to claim 10, described first signal acquisition module and secondary signal acquisition module specifically comprise:
Collecting unit, the discrete coordinate sequence that is used to receive the signal of hand-written character and gathers the hand-written character tracing point;
Judging unit is used to judge whether a hand-written character input finishes, and surpasses preset threshold when the time that does not receive the hand-written character input signal, then finishes the collection of this character discrete coordinate sequence;
Detecting unit is used to check whether the tracing point of the hand-written character that collects has only one, gathers again if then delete this tracing point; And detect the coordinate distance between the consecutive point in the tracing point of the hand-written character collect, if this distance less than preset threshold, is then deleted wherein a bit, make to keep certain distance between the consecutive point.
CN2009100364948A 2009-01-07 2009-01-07 Method and system for recognizing hand-written character input Expired - Fee Related CN101477426B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2009100364948A CN101477426B (en) 2009-01-07 2009-01-07 Method and system for recognizing hand-written character input

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2009100364948A CN101477426B (en) 2009-01-07 2009-01-07 Method and system for recognizing hand-written character input

Publications (2)

Publication Number Publication Date
CN101477426A CN101477426A (en) 2009-07-08
CN101477426B true CN101477426B (en) 2011-02-16

Family

ID=40838151

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2009100364948A Expired - Fee Related CN101477426B (en) 2009-01-07 2009-01-07 Method and system for recognizing hand-written character input

Country Status (1)

Country Link
CN (1) CN101477426B (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102298485A (en) * 2010-06-22 2011-12-28 广东国笔科技股份有限公司 Real-time calling system based on touch screen
CN102298486A (en) * 2010-06-22 2011-12-28 广东国笔科技股份有限公司 Fast calling system and method based on touch screen
CN102314300A (en) * 2010-07-08 2012-01-11 广东国笔科技股份有限公司 A kind of intelligent switching system and method
JP2012190215A (en) * 2011-03-10 2012-10-04 Sony Corp Input processor, input processing method, and program
CN103324415B (en) * 2012-03-22 2016-11-23 百度在线网络技术(北京)有限公司 input method switching method in mobile terminal and mobile terminal
CN102854982B (en) * 2012-08-01 2015-06-24 华平信息技术(南昌)有限公司 Method for recognizing customized gesture tracks
CN103927532B (en) * 2014-04-08 2017-11-03 武汉汉德瑞庭科技有限公司 Person's handwriting method for registering based on stroke feature
CN104102450A (en) * 2014-06-18 2014-10-15 深圳贝特莱电子科技有限公司 Touch screen based gesture recognition method and system
CN108920077B (en) * 2018-06-27 2021-07-23 青岛清原精准农业科技有限公司 Chemical structural formula drawing method based on dynamic gesture library recognition
CN110737364B (en) * 2019-11-06 2023-12-12 广州炫视智能科技有限公司 Control method for touch writing acceleration under android system

Also Published As

Publication number Publication date
CN101477426A (en) 2009-07-08

Similar Documents

Publication Publication Date Title
CN101482920B (en) Hand-written character recognition method and system
CN101477426B (en) Method and system for recognizing hand-written character input
CN101447017B (en) Method and system for quickly identifying and counting votes on the basis of layout analysis
CN101452357B (en) Hand-written character input method and system
CN101968847B (en) Statistical online character recognition
CN101477425A (en) Method and system for recognizing hand-written character input
Cilia et al. An end-to-end deep learning system for medieval writer identification
CN103810484B (en) The mimeograph documents discrimination method analyzed based on printing character library
CN108664975B (en) Uyghur handwritten letter recognition method and system and electronic equipment
CN110738247A (en) fine-grained image classification method based on selective sparse sampling
CN102663454B (en) Method and device for evaluating character writing standard degree
Bhattacharya et al. Neural combination of ANN and HMM for handwritten Devanagari numeral recognition
CN102982343B (en) Handwritten number recognition and incremental type obscure support vector machine method
CN105930873A (en) Self-paced cross-modal matching method based on subspace
Fu et al. User adaptive handwriting recognition by self-growing probabilistic decision-based neural networks
Almazán et al. A non-rigid appearance model for shape description and recognition
CN101436249A (en) Method and system for obtaining character matching stencil
Jubair et al. A simplified method for handwritten character recognition from document image
CN105844299A (en) Image classification method based on bag of words
Ozdil et al. Optical character recognition without segmentation
Tan et al. Multiscale region projection method to discriminate between printed and handwritten text on registration forms
CN107067046B (en) Hand-written digit recognition method based on mixed feature extraction
Kang et al. Application of Deep Convolution Neural Network Algorithm in Detecting Traditional Calligraphy Characters
Srinivasan et al. Signature-based retrieval of scanned documents using conditional random fields
CN114519787B (en) Depth feature visualization interpretation method of intelligent handwriting evaluation system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
PP01 Preservation of patent right

Effective date of registration: 20150921

Granted publication date: 20110216

RINS Preservation of patent right or utility model and its discharge
PD01 Discharge of preservation of patent

Date of cancellation: 20160921

Granted publication date: 20110216

PP01 Preservation of patent right

Effective date of registration: 20160913

Granted publication date: 20110216

RINS Preservation of patent right or utility model and its discharge
PD01 Discharge of preservation of patent
PD01 Discharge of preservation of patent

Date of cancellation: 20170913

Granted publication date: 20110216

PP01 Preservation of patent right

Effective date of registration: 20170913

Granted publication date: 20110216

PP01 Preservation of patent right
PD01 Discharge of preservation of patent

Date of cancellation: 20180313

Granted publication date: 20110216

PD01 Discharge of preservation of patent
DD01 Delivery of document by public notice

Addressee: Guangdong Guobi Technology Co., Ltd.

Document name: Notification of Termination of Procedure

DD01 Delivery of document by public notice
DD01 Delivery of document by public notice

Addressee: Guangdong Guobi Technology Co., Ltd.

Document name: Notification to Go Through Formalities Rectification of Restoration of Right

Addressee: Guangdong Guobi Technology Co., Ltd.

Document name: Notification of Termination of Patent Right

DD01 Delivery of document by public notice
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20110216

Termination date: 20160107

CF01 Termination of patent right due to non-payment of annual fee
DD01 Delivery of document by public notice
DD01 Delivery of document by public notice

Addressee: Gao Jingjian

Document name: Notification of Approving Refund

Addressee: Guangdong Guobi Technology Co., Ltd.

Document name: Notification of Decision on Request for Restoration of Right