CN111027345A - Font identification method and apparatus - Google Patents

Font identification method and apparatus Download PDF

Info

Publication number
CN111027345A
CN111027345A CN201811172388.8A CN201811172388A CN111027345A CN 111027345 A CN111027345 A CN 111027345A CN 201811172388 A CN201811172388 A CN 201811172388A CN 111027345 A CN111027345 A CN 111027345A
Authority
CN
China
Prior art keywords
picture
network model
font
recognized
neural network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811172388.8A
Other languages
Chinese (zh)
Inventor
邓斌
章庆元
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Kingsoft Office Software Inc
Zhuhai Kingsoft Office Software Co Ltd
Guangzhou Kingsoft Mobile Technology Co Ltd
Original Assignee
Beijing Kingsoft Office Software Inc
Zhuhai Kingsoft Office Software Co Ltd
Guangzhou Kingsoft Mobile Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Kingsoft Office Software Inc, Zhuhai Kingsoft Office Software Co Ltd, Guangzhou Kingsoft Mobile Technology Co Ltd filed Critical Beijing Kingsoft Office Software Inc
Priority to CN201811172388.8A priority Critical patent/CN111027345A/en
Publication of CN111027345A publication Critical patent/CN111027345A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/32Digital ink
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/22Character recognition characterised by the type of writing
    • G06V30/226Character recognition characterised by the type of writing of cursive writing
    • G06V30/2264Character recognition characterised by the type of writing of cursive writing using word shape

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Multimedia (AREA)
  • Biophysics (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Character Discrimination (AREA)

Abstract

The embodiment of the invention provides a font identification method and a font identification device, wherein the method comprises the following steps: determining a picture to be identified, wherein the picture to be identified is a picture containing characters to be identified. And inputting the picture to be recognized into the font recognition network model to obtain the font of the character to be recognized in the picture to be recognized. The font recognition network model is a model obtained by training according to a training set, and the training set comprises: a plurality of sample pictures including a text, and a real font of the text in each sample picture. Because the scanned part can be converted into the format of the picture, the font of the characters in the scanned part can be identified by using the font identification method provided by the embodiment of the invention.

Description

Font identification method and apparatus
Technical Field
The present invention relates to the field of image processing technologies, and in particular, to a font identification method and apparatus.
Background
The scanning piece is an electronic file obtained by scanning a paper file through a scanner. The text in the scan piece is not editable. In order to realize the editing of characters in the scanned part, the scanned part can be converted into an editable document.
In order to convert the scanned part into an editable document, information such as the font size, font style and position of characters in the scanned part needs to be acquired. The information such as the character size and the position of the characters in the scanned piece can be acquired by the existing image processing method. In the prior art, the font of the characters in the scanned part cannot be identified.
Disclosure of Invention
The embodiment of the invention aims to provide a font identification method and a font identification device so as to realize identification of the font of characters in a scanned piece. The specific technical scheme is as follows:
in order to solve the above problem, an embodiment of the present invention provides a font identification method, where the method includes:
determining a picture to be identified, wherein the picture to be identified is a picture containing characters to be identified;
inputting the picture to be recognized into a font recognition network model to obtain the font of the character to be recognized in the picture to be recognized;
the font recognition network model is a model obtained by training a preset neural network model according to a training set, wherein the training set comprises: a plurality of sample pictures including a text, and a real font of the text in each sample picture.
Optionally, the step of determining the picture to be recognized includes:
acquiring a scanned document to be identified;
and intercepting the character area in the scanned document to be identified as the picture to be identified.
Optionally, if the picture to be recognized includes a plurality of characters to be recognized, the step of inputting the picture to be recognized into the font recognition network model includes:
cutting the picture to be recognized into a plurality of target pictures, wherein each target picture comprises a character to be recognized;
and respectively inputting each target picture into the font identification network model.
Optionally, the font recognition network model is obtained by training through the following steps:
acquiring a preset neural network model and the training set;
inputting the sample pictures contained in the training set into the neural network model to obtain the predicted fonts of characters in the sample pictures;
determining a loss value according to the obtained predicted font and the real font of the characters in the sample pictures contained in the training set;
determining whether the neural network model converges according to the loss value;
if not, adjusting parameter values in the neural network model, and returning to the step of inputting the sample pictures contained in the training set into the updated neural network model to obtain the predicted fonts of the characters in the sample pictures;
and if so, determining the current neural network model as the font recognition network model.
Optionally, the preset neural network model is a sensor Flow neural network model.
In order to solve the above problem, an embodiment of the present invention further provides a font identification apparatus, where the apparatus includes:
the device comprises a determining module, a judging module and a judging module, wherein the determining module is used for determining a picture to be identified, and the picture to be identified is a picture containing characters to be identified;
the recognition module is used for inputting the picture to be recognized into a font recognition network model to obtain the font of the character to be recognized in the picture to be recognized;
the font recognition network model is a model obtained by training a preset neural network model according to a training set, wherein the training set comprises: a plurality of sample pictures including a text, and a real font of the text in each sample picture.
Optionally, the determining module is specifically configured to:
acquiring a scanned document to be identified;
and intercepting the character area in the scanned document to be identified as the picture to be identified.
Optionally, if the picture to be recognized includes a plurality of characters to be recognized, the recognition module is specifically configured to:
cutting the picture to be recognized into a plurality of target pictures, wherein each target picture comprises a character to be recognized;
and respectively inputting each target picture into the font identification network model.
Optionally, the apparatus further comprises: a training module for training the training of the device,
the training module is specifically configured to:
acquiring a preset neural network model and the training set;
inputting the sample pictures contained in the training set into the neural network model to obtain the predicted fonts of characters in the sample pictures;
determining a loss value according to the obtained predicted font and the real font of the characters in the sample pictures contained in the training set;
determining whether the neural network model converges according to the loss value;
if not, adjusting parameter values in the neural network model, and returning to the step of inputting the sample pictures contained in the training set into the updated neural network model to obtain the predicted fonts of the characters in the sample pictures;
and if so, determining the current neural network model as the font recognition network model.
Optionally, the preset neural network model is a sensor Flow neural network model.
The embodiment of the invention also provides electronic equipment which comprises a processor, a communication interface, a memory and a communication bus, wherein the processor, the communication interface and the memory complete mutual communication through the communication bus;
a memory for storing a computer program;
and the processor is used for realizing any method step when executing the program stored in the memory.
An embodiment of the present invention further provides a computer-readable storage medium, in which a computer program is stored, and the computer program, when executed by a processor, implements any of the above method steps.
The embodiment of the invention provides a font identification method and a font identification device, wherein the method comprises the following steps: determining a picture to be identified, wherein the picture to be identified is a picture containing characters to be identified. And inputting the picture to be recognized into the font recognition network model to obtain the font of the character to be recognized in the picture to be recognized. The font recognition network model is a model obtained by training according to a training set, and the training set comprises: a plurality of sample pictures including a text, and a real font of the text in each sample picture. Because the scanned part can be converted into the format of the picture, the font of the characters in the scanned part can be identified by using the font identification method provided by the embodiment of the invention.
Of course, not all of the advantages described above need to be achieved at the same time in the practice of any one product or method of the invention.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a flowchart of a font identification method according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of capturing a scanned document to obtain a picture to be recognized according to an embodiment of the present invention;
fig. 3 is a schematic diagram of a picture to be recognized according to an embodiment of the present invention;
FIG. 4 is a flowchart of training a font recognition network model according to an embodiment of the present invention;
fig. 5 is a schematic structural diagram of a font identification apparatus according to an embodiment of the present invention;
fig. 6 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Aiming at the technical problem that the font of characters in a scanned part cannot be identified in the prior art, the embodiment of the invention provides a font identification method, which comprises the following steps: determining a picture to be identified, wherein the picture to be identified is a picture containing characters to be identified. And inputting the picture to be recognized into the font recognition network model to obtain the font of the character to be recognized in the picture to be recognized. The font recognition network model is a model obtained by training according to a training set, and the training set comprises: a plurality of sample pictures including a text, and a real font of the text in each sample picture. Because the scanned part can be converted into the format of the picture, the font of the characters in the scanned part can be identified by using the font identification method provided by the embodiment of the invention.
The present invention will be described in detail below with reference to specific examples.
Referring to fig. 1, fig. 1 is a flowchart of a font identification method according to an embodiment of the present invention, where the method may include the following steps:
s101: determining a picture to be identified, wherein the picture to be identified is a picture containing characters to be identified.
In one embodiment, the picture to be identified may be a picture taken from a scanned document. Specifically, the step S101 of determining the picture to be recognized may include the following steps:
step 11: and determining a scanned document to be identified.
The scanned document to be identified can be a scanned piece in a PDF format, and can also be a scanned piece in a picture format.
Step 12: and intercepting a text area in the scanned document as a picture to be identified.
To identify the font of the text in the scanned document, the text region in the scanned document may be truncated. The intercepted text area is in a picture format and can be used as a picture to be identified. Referring to fig. 2, fig. 2 is a schematic diagram of capturing a scanned document to obtain a picture to be recognized.
S102: and inputting the picture to be recognized into the font recognition network model to obtain the font of the character to be recognized in the picture to be recognized. The font recognition network model is obtained by training a preset neural network model according to a training set. The training set comprises: a plurality of sample pictures including a text, and a real font of the text in each sample picture.
In an embodiment of the present invention, the preset neural network model may be a sensor Flow neural network model.
In the embodiment of the invention, the font identification network model is trained according to the sample picture containing one character and the real font of the character in the sample picture, and after the training is finished, the picture to be identified is input into the font identification network model, so that the font of the character in the picture to be identified can be obtained.
In an embodiment of the present invention, if the picture to be recognized includes a plurality of characters to be recognized, the step of inputting the picture to be recognized into the font recognition network model may include the following refining steps:
step 21: and cutting the picture to be recognized into a plurality of target pictures, wherein each target picture comprises a character to be recognized.
If the picture to be recognized comprises a plurality of characters, the picture to be recognized can be cut to obtain a plurality of target pictures. Wherein each target picture contains a character.
For example, referring to fig. 3, fig. 3 is another schematic diagram of a picture to be recognized. Since the picture to be recognized shown in fig. 3 includes 7 characters, the picture to be recognized can be divided into 7 target pictures, and each target picture includes one character.
Step 22: and respectively inputting each target picture into the font recognition network model.
After the cutting is finished, the target pictures can be input into the font identification network model respectively to obtain the fonts of the characters in the target pictures.
In an embodiment provided by the present invention, considering that fonts of characters in individual segments in a scanned document are usually consistent, when performing font recognition on a plurality of target pictures corresponding to the individual segments in the scanned document, if a few recognition results are different from other recognition results, the few recognition results may be considered as being incorrect, and may be corrected.
Referring to fig. 4, in an embodiment of the present invention, the font identification network model may be obtained by training through the following steps:
s401: and acquiring a preset neural network model and a training set.
In an embodiment of the present invention, the preset neural network model may be a sensor Flow neural network model.
Wherein, the training set may be: a plurality of sample pictures including a text, and a real font of the text in each sample picture.
S402: and inputting the sample pictures contained in the training set into the neural network model to obtain the predicted fonts of the characters in the sample pictures.
The neural network model can be a cyclic neural network model, a convolutional neural network model, a cyclic convolutional neural network model, a deep neural network model, and the like. The embodiment of the present invention is not limited thereto.
In an embodiment of the invention, after the sample picture is input into the neural network model, the neural network model can realize the feature extraction of the sample picture, carry out convolution processing on the extracted features, and then carry out maximum pooling on the convolution result to obtain a maximum pooling result. And inputting the maximum pooling result into a full-connection layer in the neural network model to obtain the predicted font of the characters in the sample picture.
S403: and determining a loss value according to the obtained predicted font and the real font of the characters in the sample picture contained in the training set.
After the predicted font of the characters in the sample picture is obtained, the loss value can be calculated by combining the real font of the characters in the sample picture.
In one embodiment of the present invention, the fonts of the text can be marked with numbers. For example, a song style is labeled "0001", a regular style is labeled "0002", a bold style is labeled "0003", and so on.
The loss value may be calculated from the numerical identification of the true font of the text in the sample picture and the numerical identification of the predicted font of the text in the sample picture. When calculating the loss value, a Mean Squared Error (MSE) may be used as the loss function, or other loss functions may be selected, which is not limited to this.
S404: and determining whether the neural network model converges according to the loss value. If so, step S405 is performed. If not, step S406 is performed.
In an embodiment of the present invention, a loss threshold may be preset, and when the calculated loss value is greater than the preset loss threshold, it indicates that the neural network model is not converged, and the training may be continued. And when the calculated loss value is not greater than the preset loss threshold value, the neural network model is converged.
In another embodiment of the present invention, a maximum iteration number may be preset, and when the iteration number reaches the maximum iteration number in the training process, it may be considered that the neural network model has converged. And when the iteration times are not reached, if the calculated loss value is not greater than a preset loss threshold value, the convergence of the neural network model is also indicated.
S405: and determining the current neural network model as a font recognition network model.
S406: and adjusting parameter values in the neural network model, and returning to execute the step S402.
Based on the same inventive concept, according to the above font identification method embodiment, an embodiment of the present invention further provides a font identification apparatus, referring to fig. 5, which may include the following modules:
the determining module 501 is configured to determine a picture to be recognized, where the picture to be recognized is a picture including characters to be recognized.
The recognition module 502 is configured to input the picture to be recognized into the font recognition network model, so as to obtain a font of the character to be recognized in the picture to be recognized;
the font recognition network model is obtained by training a preset neural network model according to a training set, wherein the training set comprises: a plurality of sample pictures including a text, and a real font of the text in each sample picture.
In an embodiment of the present invention, the determining module 501 may be specifically configured to:
acquiring a scanned document to be identified;
and intercepting a character area in the scanned document to be identified as an image to be identified.
In an embodiment of the present invention, if the picture to be recognized includes a plurality of characters to be recognized, the recognition module 502 may be specifically configured to:
cutting a picture to be recognized into a plurality of target pictures, wherein each target picture comprises a character to be recognized;
and respectively inputting each target picture into the font recognition network model.
In an embodiment of the present invention, on the basis of the apparatus shown in fig. 5, the apparatus may further include a training module, and the training module may be specifically configured to: acquiring a preset neural network model and a training set;
inputting the sample pictures contained in the training set into a neural network model to obtain the predicted fonts of characters in the sample pictures;
determining a loss value according to the obtained predicted font and the real font of the characters in the sample picture contained in the training set;
determining whether the neural network model converges according to the loss value;
if not, adjusting parameter values in the neural network model, and returning to the step of inputting the sample pictures contained in the training set into the updated neural network model to obtain the predicted fonts of the characters in the sample pictures;
and if so, determining the current neural network model as the font recognition network model.
In an embodiment of the present invention, the preset neural network model may be a sensor Flow neural network model.
Therefore, the font identification device provided by the embodiment of the invention can determine the picture to be identified, wherein the picture to be identified is the picture containing the characters to be identified. And inputting the picture to be recognized into the font recognition network model to obtain the font of the character to be recognized in the picture to be recognized. The font recognition network model is a model obtained by training according to a training set, and the training set comprises: a plurality of sample pictures including a text, and a real font of the text in each sample picture. Because the scanning piece can be converted into the format of the picture, the font identification device provided by the embodiment of the invention can identify the font of the characters in the scanning piece.
Based on the same inventive concept, according to the above font identification method embodiment, an embodiment of the present invention further provides an electronic device, as shown in fig. 6, including a processor 601, a communication interface 602, a memory 603, and a communication bus 604, where the processor 601, the communication interface 602, and the memory 603 complete mutual communication through the communication bus 604,
a memory 603 for storing a computer program;
the processor 601 is configured to implement the following steps when executing the program stored in the memory 603:
determining a picture to be identified, wherein the picture to be identified is a picture containing characters to be identified;
inputting the picture to be recognized into a font recognition network model to obtain the font of the character to be recognized in the picture to be recognized;
the font recognition network model is a model obtained by training a preset neural network model according to a training set, wherein the training set comprises: a plurality of sample pictures including a text, and a real font of the text in each sample picture.
Therefore, the electronic equipment provided by the embodiment of the invention can determine the picture to be recognized, wherein the picture to be recognized is a picture containing characters to be recognized. And inputting the picture to be recognized into the font recognition network model to obtain the font of the character to be recognized in the picture to be recognized. The font recognition network model is a model obtained by training according to a training set, and the training set comprises: a plurality of sample pictures including a text, and a real font of the text in each sample picture. Because the scanning piece can be converted into the format of the picture, the electronic equipment provided by the embodiment of the invention can identify the font of the characters in the scanning piece.
The communication bus mentioned in the electronic device may be a Peripheral Component Interconnect (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like. The communication bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, only one thick line is shown, but not only one bus or class of buses.
The communication interface is used for communication between the electronic equipment and other equipment.
The Memory may include a Random Access Memory (RAM) or a Non-Volatile Memory (NVM), such as at least one disk Memory. Optionally, the memory may also be at least one memory device located remotely from the processor.
The Processor may be a general-purpose Processor, including a Central Processing Unit (CPU), a Network Processor (NP), and the like; but may also be a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other Programmable logic device, discrete Gate or transistor logic device, discrete hardware component.
Based on the same inventive concept, according to the above-mentioned font identification method embodiment, an embodiment of the present invention further provides a computer-readable storage medium, in which a computer program is stored, and the computer program, when executed by a processor, implements any of the above-mentioned method steps.
It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.
All the embodiments in the present specification are described in a related manner, and the same and similar parts among the embodiments may be referred to each other, and each embodiment focuses on the differences from the other embodiments. In particular, as for the embodiments of the font identification apparatus, the electronic device and the computer readable storage medium, since they are substantially similar to the embodiments of the font identification method, the description is simple, and the relevant points can be referred to the partial description of the embodiments of the method.
The above description is only for the preferred embodiment of the present invention, and is not intended to limit the scope of the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention shall fall within the protection scope of the present invention.

Claims (12)

1. A method for font recognition, the method comprising:
determining a picture to be identified, wherein the picture to be identified is a picture containing characters to be identified;
inputting the picture to be recognized into a font recognition network model to obtain the font of the character to be recognized in the picture to be recognized;
the font recognition network model is a model obtained by training a preset neural network model according to a training set, wherein the training set comprises: a plurality of sample pictures including a text, and a real font of the text in each sample picture.
2. The method according to claim 1, wherein the step of determining the picture to be recognized comprises:
acquiring a scanned document to be identified;
and intercepting the character area in the scanned document to be identified as the picture to be identified.
3. The method of claim 1, wherein if the image to be recognized includes a plurality of words to be recognized, the step of inputting the image to be recognized into a font recognition network model comprises:
cutting the picture to be recognized into a plurality of target pictures, wherein each target picture comprises a character to be recognized;
and respectively inputting each target picture into the font identification network model.
4. The method of claim 1, wherein the font recognition network model is obtained by training using the following steps:
acquiring a preset neural network model and the training set;
inputting the sample pictures contained in the training set into the neural network model to obtain the predicted fonts of characters in the sample pictures;
determining a loss value according to the obtained predicted font and the real font of the characters in the sample pictures contained in the training set;
determining whether the neural network model converges according to the loss value;
if not, adjusting parameter values in the neural network model, and returning to the step of inputting the sample pictures contained in the training set into the updated neural network model to obtain the predicted fonts of the characters in the sample pictures;
and if so, determining the current neural network model as the font recognition network model.
5. The method according to claim 1, wherein the preset neural network model is a Tensor Flow neural network model.
6. A font recognition apparatus, characterized in that the apparatus comprises:
the device comprises a determining module, a judging module and a judging module, wherein the determining module is used for determining a picture to be identified, and the picture to be identified is a picture containing characters to be identified;
the recognition module is used for inputting the picture to be recognized into a font recognition network model to obtain the font of the character to be recognized in the picture to be recognized;
the font recognition network model is a model obtained by training a preset neural network model according to a training set, wherein the training set comprises: a plurality of sample pictures including a text, and a real font of the text in each sample picture.
7. The apparatus of claim 6, wherein the determining module is specifically configured to:
acquiring a scanned document to be identified;
and intercepting the character area in the scanned document to be identified as the picture to be identified.
8. The apparatus of claim 6, wherein if the picture to be recognized includes a plurality of words to be recognized, the recognition module is specifically configured to:
cutting the picture to be recognized into a plurality of target pictures, wherein each target picture comprises a character to be recognized;
and respectively inputting each target picture into the font identification network model.
9. The apparatus of claim 6, further comprising: a training module for training the training of the device,
the training module is specifically configured to:
acquiring a preset neural network model and the training set;
inputting the sample pictures contained in the training set into the neural network model to obtain the predicted fonts of characters in the sample pictures;
determining a loss value according to the obtained predicted font and the real font of the characters in the sample pictures contained in the training set;
determining whether the neural network model converges according to the loss value;
if not, adjusting parameter values in the neural network model, and returning to the step of inputting the sample pictures contained in the training set into the updated neural network model to obtain the predicted fonts of the characters in the sample pictures;
and if so, determining the current neural network model as the font recognition network model.
10. The apparatus according to claim 6, wherein the preset neural network model is a Tensor Flow neural network model.
11. An electronic device is characterized by comprising a processor, a communication interface, a memory and a communication bus, wherein the processor and the communication interface are used for realizing mutual communication by the memory through the communication bus;
a memory for storing a computer program;
a processor for implementing the method steps of any one of claims 1 to 5 when executing a program stored in the memory.
12. A computer-readable storage medium, characterized in that a computer program is stored in the computer-readable storage medium, which computer program, when being executed by a processor, carries out the method steps of any one of the claims 1-5.
CN201811172388.8A 2018-10-09 2018-10-09 Font identification method and apparatus Pending CN111027345A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811172388.8A CN111027345A (en) 2018-10-09 2018-10-09 Font identification method and apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811172388.8A CN111027345A (en) 2018-10-09 2018-10-09 Font identification method and apparatus

Publications (1)

Publication Number Publication Date
CN111027345A true CN111027345A (en) 2020-04-17

Family

ID=70190559

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811172388.8A Pending CN111027345A (en) 2018-10-09 2018-10-09 Font identification method and apparatus

Country Status (1)

Country Link
CN (1) CN111027345A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111539424A (en) * 2020-04-21 2020-08-14 北京云从科技有限公司 Image processing method, system, device and medium based on OCR
CN111626383A (en) * 2020-05-29 2020-09-04 Oppo广东移动通信有限公司 Font identification method and device, electronic equipment and storage medium
CN111667066A (en) * 2020-04-23 2020-09-15 北京旷视科技有限公司 Network model training and character recognition method and device and electronic equipment

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1438604A (en) * 2002-12-23 2003-08-27 北京邮电大学 Character written-form judgement apparatus and method based on Bayes classification device
CN103810484A (en) * 2013-10-29 2014-05-21 西安电子科技大学 Print file identification method based on print font library analysis
CN104899571A (en) * 2015-06-12 2015-09-09 成都数联铭品科技有限公司 Random sample generation method for recognition of complex character
CN104966097A (en) * 2015-06-12 2015-10-07 成都数联铭品科技有限公司 Complex character recognition method based on deep learning
CN106067019A (en) * 2016-05-27 2016-11-02 北京旷视科技有限公司 The method and device of Text region is carried out for image
CN107292280A (en) * 2017-07-04 2017-10-24 盛世贞观(北京)科技有限公司 A kind of seal automatic font identification method and identifying device
CN107346629A (en) * 2017-08-22 2017-11-14 贵州大学 A kind of intelligent blind reading method and intelligent blind reader system
CN108427953A (en) * 2018-02-26 2018-08-21 北京易达图灵科技有限公司 A kind of character recognition method and device
US20180260621A1 (en) * 2017-03-10 2018-09-13 Baidu Online Network Technology (Beijing) Co., Ltd. Picture recognition method and apparatus, computer device and computer- readable medium

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1438604A (en) * 2002-12-23 2003-08-27 北京邮电大学 Character written-form judgement apparatus and method based on Bayes classification device
CN103810484A (en) * 2013-10-29 2014-05-21 西安电子科技大学 Print file identification method based on print font library analysis
CN104899571A (en) * 2015-06-12 2015-09-09 成都数联铭品科技有限公司 Random sample generation method for recognition of complex character
CN104966097A (en) * 2015-06-12 2015-10-07 成都数联铭品科技有限公司 Complex character recognition method based on deep learning
CN106067019A (en) * 2016-05-27 2016-11-02 北京旷视科技有限公司 The method and device of Text region is carried out for image
US20180260621A1 (en) * 2017-03-10 2018-09-13 Baidu Online Network Technology (Beijing) Co., Ltd. Picture recognition method and apparatus, computer device and computer- readable medium
CN107292280A (en) * 2017-07-04 2017-10-24 盛世贞观(北京)科技有限公司 A kind of seal automatic font identification method and identifying device
CN107346629A (en) * 2017-08-22 2017-11-14 贵州大学 A kind of intelligent blind reading method and intelligent blind reader system
CN108427953A (en) * 2018-02-26 2018-08-21 北京易达图灵科技有限公司 A kind of character recognition method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
刘志伟: "《基于CNN网络的汉字图像字体识别》" *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111539424A (en) * 2020-04-21 2020-08-14 北京云从科技有限公司 Image processing method, system, device and medium based on OCR
CN111667066A (en) * 2020-04-23 2020-09-15 北京旷视科技有限公司 Network model training and character recognition method and device and electronic equipment
CN111667066B (en) * 2020-04-23 2024-06-11 北京旷视科技有限公司 Training method and device of network model, character recognition method and device and electronic equipment
CN111626383A (en) * 2020-05-29 2020-09-04 Oppo广东移动通信有限公司 Font identification method and device, electronic equipment and storage medium
CN111626383B (en) * 2020-05-29 2023-11-07 Oppo广东移动通信有限公司 Font identification method and device, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
CN109993112B (en) Method and device for identifying table in picture
CN109726643B (en) Method and device for identifying table information in image, electronic equipment and storage medium
CN109657221B (en) Document paragraph sorting method, sorting device, electronic equipment and storage medium
CN111222368B (en) Method and device for identifying document paragraphs and electronic equipment
CN110569341B (en) Method and device for configuring chat robot, computer equipment and storage medium
CN112001406B (en) Text region detection method and device
CN111027345A (en) Font identification method and apparatus
CN109697414B (en) Text positioning method and device
JP4661921B2 (en) Document processing apparatus and program
CN110942074A (en) Character segmentation recognition method and device, electronic equipment and storage medium
CN112789623A (en) Text detection method, device and storage medium
CN114529837A (en) Building outline extraction method, system, computer equipment and storage medium
CN111178147A (en) Screen crushing and grading method, device, equipment and computer readable storage medium
US10750036B1 (en) Rapid workflow design using machine learning
CN115546809A (en) Table structure identification method based on cell constraint and application thereof
CN113420848A (en) Neural network model training method and device and gesture recognition method and device
CN111145202A (en) Model generation method, image processing method, device, equipment and storage medium
CN112560545A (en) Method and device for identifying form direction and electronic equipment
CN111243593A (en) Speech recognition error correction method, mobile terminal and computer-readable storage medium
CN110633457B (en) Content replacement method and device, electronic equipment and readable storage medium
CN112396057A (en) Character recognition method and device and electronic equipment
CN111340140A (en) Image data set acquisition method and device, electronic equipment and storage medium
CN110363189B (en) Document content restoration method and device, electronic equipment and readable storage medium
CN113269153B (en) Form identification method and device
CN116631003A (en) Equipment identification method and device based on P & ID drawing, storage medium and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination