CN112580650A - Font classification prediction method and system

Info

Publication number: CN112580650A
Application number: CN202011529220.5A
Authority: CN
Inventors: 田辉; 刘其开
Original assignee: Hefei High Dimensional Data Technology Co ltd
Current assignee: Hefei High Dimensional Data Technology Co ltd
Priority date: 2020-12-22
Filing date: 2020-12-22
Publication date: 2021-03-30
Anticipated expiration: 2040-12-22
Also published as: CN112580650B

Abstract

The invention discloses a font classification prediction method and system. The method comprises the following steps: S1, data set preparation: collecting and converting samples of multiple fonts, obtaining single-character images through a character segmentation network, and taking the minimum bounding rectangle of each character as one standard sample; S2, data sampling: selecting one of the N sample classes as the reference sample set, computing the similarity between the remaining N-1 sample sets and the reference sample set by template matching, selecting the top M samples as one batch, and training with cyclic sampling; S3, determining the network structure, network parameters, and loss function, and feeding the samples into the network for training; and S4, validating and testing the network model to obtain the predicted classification result. The method has broad application prospects and gives the network stronger generalization ability.

Description

Font classification prediction method and system

Technical Field

The invention relates to the technical field of font classification, in particular to a font classification prediction method and system.

Background

At present, Chinese font classification lacks a systematic standard and has few application scenarios. Existing font classification techniques mainly use deep learning algorithms to classify ancient scripts, but the font samples they rely on are narrow in coverage, difficult to obtain, and lack a practical application background. There is therefore room for improvement.

Disclosure of Invention

The present invention is directed to solving at least one of the problems of the prior art. Therefore, one objective of the present invention is to provide a font classification prediction method, which has the advantages of wide application prospect and strong network generalization capability.

The invention also provides a system that uses the font classification prediction method.

The font classification prediction method provided by an embodiment of the invention comprises the following steps:

S1, data set preparation: collecting and converting samples of multiple fonts, obtaining single-character images through a character segmentation network, and taking the minimum bounding rectangle of each character as one standard sample;

S2, data sampling over the N classes of samples: selecting one of the N classes as the reference sample set, computing the similarity between the remaining N-1 sample sets and the reference sample set by template matching, selecting the top M samples as one batch, and training with cyclic sampling;

S3, determining the network structure, network parameters, and loss function, and feeding in the samples to train the network;

and S4, validating and testing the network to obtain the predicted classification result.

The font classification prediction method has the advantages of wide application prospect and stronger network generalization capability.

According to the font classification prediction method of an embodiment of the present invention, in step S1, the multiple fonts include: Song (Songti), imitation Song (Fangsong), regular script (Kaiti), and Hei (Heiti).

According to the font classification prediction method of an embodiment of the present invention, in step S2, the N classes of samples are sampled online.

According to the font classification prediction method of one embodiment of the present invention, in step S3, the network structure is a convolutional neural network composed of convolutional layers, pooling layers, and fully connected layers.

According to the font classification prediction method provided by the embodiment of the invention, a deep learning algorithm is adopted to train the convolutional neural network.

According to the font classification prediction method of one embodiment of the present invention, in step S3, the loss function is the L2-softmax loss function, whose formula is as follows:

where f(x_i) is the feature vector output by the network, which satisfies the following constraint:

The parameter α can be set in two ways: it is either held at a fixed value during training, or learned during training.

According to a second aspect of the present invention, there is provided a font classification predicting system using the font classification predicting method according to any one of the first aspects, the font classification predicting system comprising:

the data collection module is used for collecting and converting samples of various fonts and obtaining a single character sample through a character segmentation network;

the data sampling module, which samples the N classes of samples, selects one of the N classes as the reference, computes the similarity between the remaining N-1 classes and the reference samples by template matching, selects the top M samples as one batch, and trains with cyclic sampling;

a network structure module, which implements the constraints of the network parameters and loss function and into which the sample parameters are input;

and the network verification and test module, which validates and tests the network on the sample parameters to obtain a prediction result.

The system for predicting font classification according to the second aspect of the present invention has the same advantages as the foregoing method for predicting font classification compared with the prior art, and is not described herein again.

Additional aspects and advantages of the invention will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the invention.

Drawings

The above and/or additional aspects and advantages of the present invention will become apparent and readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:

FIG. 1 is a flow diagram of a font classification prediction method according to an embodiment of the invention;

FIG. 2 is a schematic structural diagram of a font classification prediction system according to an embodiment of the present invention.

Reference numerals:

100-a font classification prediction system, 1-a data collection module, 2-a data sampling module, 3-a network structure module and 4-a network verification and test module.

Detailed Description

Reference will now be made in detail to embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to the same or similar elements or elements having the same or similar function throughout. The embodiments described below with reference to the accompanying drawings are illustrative only for the purpose of explaining the present invention, and are not to be construed as limiting the present invention.

In the description of the present invention, it is to be understood that the terms "central," "longitudinal," "lateral," "length," "width," "thickness," "upper," "lower," "front," "rear," "left," "right," "vertical," "horizontal," "top," "bottom," "inner," "outer," "clockwise," "counterclockwise," "axial," "radial," "circumferential," and the like are used in the orientations and positional relationships indicated in the drawings for convenience in describing the invention and to simplify the description, and are not intended to indicate or imply that the referenced device or element must have a particular orientation, be constructed and operated in a particular orientation, and are not to be considered limiting of the invention.

A font classification prediction method according to an embodiment of the present invention is described below with reference to FIG. 1 and FIG. 2. As shown in FIG. 1, a font classification prediction method according to an embodiment of the present invention may include the following steps:

S1, data set preparation: collecting and converting samples of multiple fonts, obtaining single-character images through a character segmentation network, and determining the minimum bounding rectangle of each character;

S2, data sampling over the N classes of samples: selecting one of the N classes as the reference, computing the similarity between the remaining N-1 classes and the reference samples by template matching, and matching samples to obtain the sample parameters. For example, in one specific embodiment with four font classes, the batch size is set to 200, with 50 samples per class. One class is randomly selected as the reference, and the similarity between its samples and those of the other three classes is computed by template matching. The top M samples are then selected as one batch and fed to the network for training. Compared with conventional random sampling, the sampling in step S2 improves classification accuracy for characters whose font styles differ only slightly;

S3, determining the network structure, network parameters, and loss function, and inputting the sample parameters;

and S4, validating and testing the network on the sample parameters to obtain a prediction result.
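The similarity ranking in step S2 can be sketched as follows. The text does not say which template matching score is used, so this sketch assumes zero-mean normalized cross-correlation between character images that have already been resized to the same shape; the function names `ncc_similarity` and `top_m_similar` are illustrative and do not come from the source.

```python
import numpy as np

def ncc_similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Zero-mean normalized cross-correlation of two equally sized grayscale images.

    Assumed similarity measure: 1.0 for identical shapes, -1.0 for inverted ones.
    """
    a = a.astype(np.float64) - a.mean()
    b = b.astype(np.float64) - b.mean()
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    if denom == 0.0:
        return 0.0  # a flat image carries no shape information
    return float((a * b).sum() / denom)

def top_m_similar(reference: np.ndarray, candidates: list, m: int) -> list:
    """Return the indices of the m candidates most similar to the reference, best first."""
    scores = np.array([ncc_similarity(reference, c) for c in candidates])
    order = np.argsort(-scores, kind="stable")  # descending similarity
    return order[:m].tolist()
```

Ranking by descending similarity surfaces the cross-font samples that look most like the reference, which are the cases a classifier is most likely to confuse.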

The font classification prediction method has wide application prospect; and auxiliary sampling is performed through a template matching algorithm, so that the generalization capability of the network is favorably improved.

According to the font classification prediction method of an embodiment of the present invention, in step S1, the multiple fonts may include: Song (Songti), imitation Song (Fangsong), regular script (Kaiti), and Hei (Heiti). Further, data set preparation proceeds as follows: first, the number of samples in each class is kept substantially the same; second, document screenshots are taken or printed documents are photographed to produce images; third, single-character images are obtained through a character segmentation network; and finally, the minimum bounding rectangle of each character is obtained.

Note that the types of fonts are not limited to the above four types.
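The final preparation step, cropping each character to its minimum bounding rectangle, can be sketched as below. This is a minimal illustration that assumes an axis-aligned rectangle and dark ink on a light background; the function name and the threshold of 128 are assumptions, not values from the source.

```python
import numpy as np

def crop_to_bounding_rect(img: np.ndarray, ink_threshold: int = 128) -> np.ndarray:
    """Crop a grayscale character image to the smallest axis-aligned rectangle containing ink.

    Assumes dark ink on a light background: pixels below ink_threshold count as ink.
    """
    ink = img < ink_threshold
    if not ink.any():
        return img  # blank image: nothing to crop
    rows = np.flatnonzero(ink.any(axis=1))  # row indices that contain ink
    cols = np.flatnonzero(ink.any(axis=0))  # column indices that contain ink
    return img[rows[0]:rows[-1] + 1, cols[0]:cols[-1] + 1]
```

Cropping removes the variable margins left by segmentation, so that later comparisons depend on the glyph shape rather than on where the glyph sits in its image.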

According to the font classification prediction method of an embodiment of the present invention, in step S2, the N classes of samples are sampled online. It should be noted that online sampling means sampling while training.

Further, in a specific embodiment, online sampling proceeds as follows. First, the batch size for network training is determined; for example, it may be set to 200. Second, 20 samples are randomly selected from one font class and, for each of them, a template matching algorithm finds the 10 closest samples among the remaining font classes. Finally, sampling stops when the total number of collected samples reaches 300, and 200 of them are randomly selected as one batch and fed into the network for training.
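The online sampling procedure above can be sketched as follows. The defaults mirror the numbers in the text (20 anchors, 10 nearest matches, a pool of 300, a batch of 200). Everything else is an assumption made for illustration: the similarity measure (zero-mean normalized cross-correlation), the choice of a new random reference class in each round, pooling the 10 matches across all remaining classes rather than per class, and the `max_rounds` guard against small data sets.

```python
import numpy as np

def similarity(a, b):
    """Zero-mean normalized cross-correlation (assumed template matching score)."""
    a = np.asarray(a, dtype=np.float64)
    b = np.asarray(b, dtype=np.float64)
    a = a - a.mean()
    b = b - b.mean()
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float((a * b).sum() / denom) if denom > 0 else 0.0

def sample_batch(images, labels, rng, batch_size=200, n_anchors=20,
                 top_k=10, pool_size=300, max_rounds=100):
    """Collect a pool of mutually similar cross-font samples and draw one batch from it.

    Returns a list of indices into images. The batch is smaller than batch_size
    only if the pool could not be filled within max_rounds.
    """
    labels = np.asarray(labels)
    classes = np.unique(labels)
    pool = set()
    for _ in range(max_rounds):
        if len(pool) >= pool_size:
            break
        ref_class = rng.choice(classes)  # reference font class for this round
        ref_idx = np.flatnonzero(labels == ref_class)
        other_idx = np.flatnonzero(labels != ref_class)
        anchors = rng.choice(ref_idx, size=min(n_anchors, ref_idx.size), replace=False)
        for a in anchors:
            scores = np.array([similarity(images[a], images[o]) for o in other_idx])
            nearest = other_idx[np.argsort(-scores)[:top_k]]  # closest other-font samples
            pool.add(int(a))
            pool.update(int(i) for i in nearest)
    chosen = rng.choice(sorted(pool), size=min(batch_size, len(pool)), replace=False)
    return chosen.tolist()
```

Calling `sample_batch` once per training step gives the sampling-while-training behaviour: each batch is rebuilt from freshly chosen anchors instead of being fixed in advance.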

According to the font classification prediction method of one embodiment of the present invention, in step S3, the network structure is a convolutional neural network. Specifically, the convolutional neural network is composed of convolutional layers, pooling layers, and fully connected layers, and there may be multiple convolutional layers.
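A forward pass through such a network can be sketched in plain NumPy. The source gives no layer sizes, so the two 3×3 convolutional layers with 8 and 16 channels, the 2×2 max pooling, the 32×32 single-channel input, and the four output classes are all assumptions chosen only to keep the example small.

```python
import numpy as np

def conv2d(x, kernels, bias):
    """Valid 2-D cross-correlation. x: (C_in, H, W); kernels: (C_out, C_in, k, k)."""
    c_out, _, k, _ = kernels.shape
    h, w = x.shape[1] - k + 1, x.shape[2] - k + 1
    out = np.empty((c_out, h, w))
    for i in range(h):
        for j in range(w):
            patch = x[:, i:i + k, j:j + k]
            out[:, i, j] = np.tensordot(kernels, patch, axes=3) + bias
    return out

def max_pool2d(x, size=2):
    """Non-overlapping max pooling; rows and columns that do not fill a window are dropped."""
    c, h, w = x.shape
    h2, w2 = h // size, w // size
    x = x[:, :h2 * size, :w2 * size]
    return x.reshape(c, h2, size, w2, size).max(axis=(2, 4))

def init_params(rng, n_classes=4, in_size=32):
    """Randomly initialise two conv layers and one fully connected layer (assumed sizes)."""
    conv = [(rng.normal(0, 0.1, (8, 1, 3, 3)), np.zeros(8)),
            (rng.normal(0, 0.1, (16, 8, 3, 3)), np.zeros(16))]
    size = in_size
    for kernels, _ in conv:
        size = (size - kernels.shape[-1] + 1) // 2  # conv shrinks, pooling halves
    fc_in = conv[-1][0].shape[0] * size * size
    return {"conv": conv, "fc": (rng.normal(0, 0.1, (n_classes, fc_in)), np.zeros(n_classes))}

def forward(x, params):
    """(Conv -> ReLU -> max pool) per conv layer, then a fully connected layer; returns logits."""
    for kernels, bias in params["conv"]:
        x = np.maximum(conv2d(x, kernels, bias), 0.0)
        x = max_pool2d(x)
    w_fc, b_fc = params["fc"]
    return w_fc @ x.ravel() + b_fc
```

The loop-based convolution is written for readability, not speed; a deep learning framework would replace it in practice.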

According to the font classification prediction method of one embodiment of the present invention, in step S3, the loss function is the L2-softmax loss function. Specifically, the formula of the loss function is as follows:

The minimum value of α is:

where C is the number of classes.
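The equations themselves are not reproduced in this text. For orientation, the L2-constrained softmax loss is commonly written in the literature as shown below. This is the standard published form and is assumed, not confirmed, to match the formulas of the original filing. The symbols M (number of samples in a batch), y_i (class label of sample x_i), W_j and b_j (classifier weights and bias for class j), and p (the desired average probability of correct classification) are not defined in the source.

```latex
\min \; -\frac{1}{M}\sum_{i=1}^{M}
  \log\frac{e^{W_{y_i}^{\top} f(x_i) + b_{y_i}}}
           {\sum_{j=1}^{C} e^{W_j^{\top} f(x_i) + b_j}}
\quad \text{subject to} \quad
\lVert f(x_i) \rVert_2 = \alpha, \quad i = 1, \dots, M

\alpha_{\mathrm{low}} = \log\frac{p\,(C-2)}{1-p}
```

Under this form, four font classes (C = 4) and p = 0.9 give a lower bound of log 18 ≈ 2.89 for α.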

Further, in the prior art, most visual classification tasks use softmax loss. Softmax loss only ensures that the learned font features are separable; it does not ensure that features of characters from fonts with only slight style differences are far enough apart, nor that features of different characters in the same font are close enough. Because Chinese characters are composed of many different radicals combined under different rules, the discriminability of features varies widely from character to character across fonts. The font classification prediction method provided by the embodiment of the invention uses L2-softmax as the loss function, so that the learned features are normalized before classification and the constraint strengthens their discriminability, which improves classification accuracy.
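The normalization before classification can be sketched numerically as below. This sketch assumes the constraint is enforced by rescaling every feature vector to L2 norm α before a linear classifier and cross-entropy; the function name, argument names, and array shapes are illustrative and are not taken from the source.

```python
import numpy as np

def l2_softmax_loss(features, weights, bias, labels, alpha):
    """Mean cross-entropy after rescaling each feature vector to L2 norm alpha.

    features: (M, D); weights: (C, D); bias: (C,); labels: (M,) integer class ids.
    """
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    scaled = alpha * features / np.maximum(norms, 1e-12)  # every row now has norm alpha
    logits = scaled @ weights.T + bias
    logits = logits - logits.max(axis=1, keepdims=True)  # stabilise the exponentials
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return float(-log_probs[np.arange(len(labels)), labels].mean())
```

Because every feature vector is rescaled to the same norm, the loss no longer depends on feature magnitude: multiplying `features` by any positive constant leaves the result unchanged, so the classifier must separate classes by direction alone.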

In conclusion, the font classification prediction method has wide application prospect; and auxiliary sampling is performed through a template matching algorithm, so that the generalization capability of the network is favorably improved.

The font classification prediction system 100 according to the second aspect of the present invention employs the font classification prediction method of any embodiment of the first aspect. As shown in FIG. 2, the font classification prediction system 100 may include:

a data collection module 1, used for collecting and converting samples of multiple fonts, obtaining single-character images through a character segmentation network, and determining the minimum bounding rectangle of each character (in the description of the present invention, "a plurality" means two or more);

a data sampling module 2, used for sampling the N classes of samples, selecting one of the N classes as the reference, computing the similarity between the remaining N-1 classes and the reference samples by template matching, selecting the top M samples as one batch, and training with cyclic sampling;

a network structure module 3, which implements the constraints of the network parameters and loss function and into which the sample parameters are input;

and a network verification and test module 4, which validates and tests the network on the sample parameters to obtain a prediction result.
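One way the four modules could be wired together is sketched below. The class name, the field names, and the callable signatures are hypothetical, and the split into training and held-out data is an assumption added so that module 4 evaluates on samples the network was not trained on.

```python
from dataclasses import dataclass
from typing import Callable, Sequence

@dataclass
class FontClassificationSystem:
    # Module 1: returns (train_images, train_labels, test_images, test_labels).
    collect: Callable[[], tuple]
    # Module 2: returns the indices of one training batch.
    sample: Callable[[Sequence, Sequence], list]
    # Module 3: runs one training step on a batch and returns the loss.
    train_step: Callable[[Sequence, Sequence], float]
    # Module 4: returns an accuracy score on held-out data.
    evaluate: Callable[[Sequence, Sequence], float]

    def run(self, n_steps: int) -> float:
        """Collect data once, resample a batch for every training step, then evaluate."""
        train_x, train_y, test_x, test_y = self.collect()
        for _ in range(n_steps):
            idx = self.sample(train_x, train_y)
            self.train_step([train_x[i] for i in idx], [train_y[i] for i in idx])
        return self.evaluate(test_x, test_y)
```

Passing the modules in as callables keeps the sampler, the network, and the evaluation independently replaceable.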

In summary, the font classification prediction system 100 according to the second aspect of the present invention has the advantages of wider application prospect and stronger network generalization capability.

In the description herein, references to the description of the term "one embodiment," "some embodiments," "an illustrative embodiment," "an example," "a specific example," or "some examples" or the like mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.

While embodiments of the invention have been shown and described, it will be understood by those of ordinary skill in the art that: various changes, modifications, substitutions and alterations can be made to the embodiments without departing from the principles and spirit of the invention, the scope of which is defined by the claims and their equivalents.

Claims

1. A font classification prediction method is characterized by comprising the following steps:

S2, data sampling over the N classes of samples: selecting one of the N classes as the reference sample set, computing the similarity between the remaining N-1 sample sets and the reference sample set by template matching, selecting the top M samples as one batch, and training with cyclic sampling;

2. The font classification prediction method according to claim 1, wherein in step S1, the multiple fonts include: Song (Songti), imitation Song (Fangsong), regular script (Kaiti), and Hei (Heiti).

3. The font classification prediction method according to claim 1, wherein in step S2, the N classes of samples are sampled online.

4. The font classification prediction method according to claim 1, wherein in step S3, the network structure is a convolutional neural network composed of convolutional layers, pooling layers, and fully connected layers.

5. The font classification prediction method according to claim 4, characterized in that a deep learning algorithm is used to train the convolutional neural network.

6. The font classification predicting method according to claim 1, wherein in step S3, the loss function is an L2-softmax loss function, and the formula of the loss function is as follows:

7. A font classification prediction system, characterized in that the font classification prediction method according to any one of claims 1 to 6 is used, the system comprising:

the data acquisition module is used for collecting and converting samples of various fonts, obtaining a single character picture through a character segmentation network, and determining the minimum circumscribed rectangle of each character as a single standard sample;

the network structure module, which consists of a plurality of convolutional layers, a pooling layer, and a fully connected layer, implements the constraints of the network parameters and loss function, and into which the sample parameters are input;