CN112801016B - Ballot data statistics method, device, equipment and medium - Google Patents

Ballot data statistics method, device, equipment and medium Download PDF

Info

Publication number
CN112801016B
CN112801016B CN202110172512.6A CN202110172512A CN112801016B CN 112801016 B CN112801016 B CN 112801016B CN 202110172512 A CN202110172512 A CN 202110172512A CN 112801016 B CN112801016 B CN 112801016B
Authority
CN
China
Prior art keywords
counted
ballot
image
items
vote
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110172512.6A
Other languages
Chinese (zh)
Other versions
CN112801016A (en
Inventor
莫国龙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lianren Healthcare Big Data Technology Co Ltd
Original Assignee
Lianren Healthcare Big Data Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lianren Healthcare Big Data Technology Co Ltd filed Critical Lianren Healthcare Big Data Technology Co Ltd
Priority to CN202110172512.6A priority Critical patent/CN112801016B/en
Publication of CN112801016A publication Critical patent/CN112801016A/en
Application granted granted Critical
Publication of CN112801016B publication Critical patent/CN112801016B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/414Extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics or text
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/60Type of objects
    • G06V20/62Text, e.g. of license plates, overlay texts or captions on TV images
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Molecular Biology (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Multimedia (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Geometry (AREA)
  • Computer Graphics (AREA)
  • Character Input (AREA)

Abstract

The embodiment of the invention discloses a method, a device, equipment and a medium for counting ballot data, wherein the method comprises the following steps: acquiring a vote image to be counted, and extracting the contents in a table in the vote image to be counted, wherein the contents in the table comprise a mark selection item and a numerical score item, the mark selection item is a vote item which needs to take a preset character as a vote selection result in the vote, and the numerical score item is a vote item which needs to take a numerical score as a result in the vote; respectively identifying characters corresponding to the mark selection items and numbers corresponding to the numerical scoring items in the table contents; and generating a statistics result of the votes to be counted based on the characters and the numbers. The technical scheme of the embodiment solves the problem of low efficiency of manual ballot making and scoring statistics in the prior art, realizes the automatic statistics of ballot results by respectively identifying voting selection and scoring scores in ballot images, improves the efficiency of ballot statistics and avoids artificial statistics errors.

Description

Ballot data statistics method, device, equipment and medium
Technical Field
The embodiment of the invention relates to the technical field of computers, in particular to a ballot data statistics method, a ballot data statistics device, ballot data statistics equipment and ballot data statistics media.
Background
In some review activities, it is often desirable for a voter or a qualified person to vote or score for the object being reviewed, and then to make statistics and obtain statistics by a statistics person.
However, when the obtained evaluation data volume is large, the statistics staff has limited manpower, so that the ticket number and/or score of the evaluation result can not be counted in a short time, the statistics efficiency is low, and meanwhile, the statistics error may exist.
Disclosure of Invention
The embodiment of the invention provides a ballot data statistics method, a ballot data statistics device, ballot data statistics equipment and ballot data statistics media, so that the statistics efficiency of voting and scoring conditions in ballots is improved, and the statistics result errors influenced by people are reduced.
In a first aspect, an embodiment of the present invention provides a method for counting ballot data, where the method includes:
Acquiring a vote image to be counted, and extracting contents in a table in the vote image to be counted, wherein the contents in the table comprise mark selection items and numerical scoring items, the mark selection items are voting items which need to take preset characters as voting selection results in the votes, and the numerical scoring items are voting items which need to take numerical scores as results in the votes;
Respectively identifying characters corresponding to the mark selection items and numbers corresponding to the numerical scoring items in the table contents;
And generating a statistics result of the votes to be counted based on the characters and the numbers.
Optionally, the extracting contents in a table in the vote image to be counted includes:
Preprocessing the vote image to be counted;
identifying and extracting lines with cross points in the preprocessed ballot image to be counted, and obtaining a table frame in the ballot image to be counted;
and determining the contents in the table based on the to-be-counted vote image and the table frame.
Optionally, the preprocessing the vote image to be counted includes:
Gray processing is carried out on the vote image to be counted, and binarization processing is carried out on the vote image to be counted, so that a binarization vote image is obtained;
and corroding and expanding the binarized ballot image.
Optionally, the identifying the characters corresponding to the mark selection items and the numbers corresponding to the numerical score items in the table respectively includes:
Identifying the characters corresponding to the mark selection items in the contents of the table by an optical character identification method;
and identifying the corresponding number in the numerical scoring item through a preset handwriting number identification model.
Optionally, the identifying, by an optical character identifying method, the character corresponding to the tag selection item in the content in the table includes:
recognizing handwriting symbols in a preset pixel coordinate area in the vote image to be counted by an optical character recognition method, wherein the preset pixel coordinate area corresponds to the area corresponding to the mark selection item;
Determining whether the identified handwritten symbol belongs to a symbol in a preset symbol library;
And when the identified handwritten symbol belongs to the symbol in the preset symbol library, taking the identified handwritten symbol as the preset character corresponding to the mark selection item.
Optionally, the preset symbol library includes symbols associated with the shapes of the specified handwriting symbols in the ballot image to be counted.
Optionally, the generating the statistics result of the votes to be counted based on the characters and the numbers includes:
counting the selection results corresponding to the mark selection items through the characters;
and counting the scores corresponding to the numerical scoring items through the numbers.
In a second aspect, an embodiment of the present invention further provides a ballot data statistics apparatus, where the apparatus includes:
The content extraction module is used for acquiring the to-be-counted ballot image and extracting the content in a table in the to-be-counted ballot image, wherein the content in the table comprises a mark selection item and a numerical score item, the mark selection item is a voting item which needs to take a preset character as a voting selection result in the ballot, and the numerical score item is a voting item which needs to take a numerical score as a result in the ballot;
The content identification module is used for respectively identifying characters corresponding to the mark selection items and numbers corresponding to the numerical value scoring items in the contents in the table;
And the ballot statistics module is used for generating a statistics result of the ballot to be counted based on the characters and the numbers.
Optionally, the content extraction module is specifically configured to:
Preprocessing the vote image to be counted;
identifying and extracting lines with cross points in the preprocessed ballot image to be counted, and obtaining a table frame in the ballot image to be counted;
and determining the contents in the table based on the to-be-counted vote image and the table frame.
Optionally, the content extraction module is further configured to:
Gray processing is carried out on the vote image to be counted, and binarization processing is carried out on the vote image to be counted, so that a binarization vote image is obtained;
and corroding and expanding the binarized ballot image.
Optionally, the content identification module includes:
a character recognition sub-module for recognizing characters corresponding to the mark selection items in the contents of the table by an optical character recognition method;
and the number recognition sub-module is used for recognizing the corresponding number in the numerical score through a preset handwriting number recognition model.
Optionally, the character recognition submodule is specifically configured to:
recognizing handwriting symbols in a preset pixel coordinate area in the vote image to be counted by an optical character recognition method, wherein the preset pixel coordinate area corresponds to the area corresponding to the mark selection item;
Determining whether the identified handwritten symbol belongs to a symbol in a preset symbol library;
And when the identified handwritten symbol belongs to the symbol in the preset symbol library, taking the identified handwritten symbol as the preset character corresponding to the mark selection item.
Optionally, the preset symbol library includes symbols associated with the shapes of the specified handwriting symbols in the ballot image to be counted.
Optionally, the vote counting module is specifically configured to:
counting the selection results corresponding to the mark selection items through the characters;
and counting the scores corresponding to the numerical scoring items through the numbers.
In a third aspect, an embodiment of the present invention further provides a computer apparatus, including:
one or more processors;
a memory for storing one or more programs;
The one or more programs, when executed by the one or more processors, cause the one or more processors to implement the ballot data statistics method as provided by any embodiment of the present invention.
In a fourth aspect, embodiments of the present invention also provide a computer readable storage medium having stored thereon a computer program which when executed by a processor implements a ballot data statistics method as provided by any of the embodiments of the present invention.
The embodiments of the above invention have the following advantages or benefits:
According to the embodiment of the invention, the contents in the table in the vote image to be counted are obtained and extracted, wherein the contents in the table comprise the mark selection items and the numerical scoring items, and further the characters corresponding to the mark selection items and the numbers corresponding to the numerical scoring items in the contents in the table are respectively identified; finally, generating a statistical result of the ballot to be counted based on the characters and the numbers; the problem of manually carrying out ballot and scoring statistics in the prior art is solved, the voting selection and scoring score in the ballot image are respectively identified, the ballot result is automatically counted, the ballot statistics efficiency is improved, and artificial statistics errors are avoided.
Drawings
FIG. 1 is a flowchart of a method for statistics of ballot data according to a first embodiment of the present invention;
FIG. 2 is a schematic diagram of a vote image to be counted according to a first embodiment of the present invention;
FIG. 3 is a schematic diagram of a preprocessed ballot image to be counted according to a first embodiment of the present invention;
FIG. 4 is a schematic diagram of a table extracted from preprocessed ballot images to be counted according to a first embodiment of the present invention;
FIG. 5 is a schematic diagram of extracting contents of a table from preprocessed vote images to be counted according to a first embodiment of the present invention;
FIG. 6 is a schematic diagram of a character corresponding to a tag selection item in a recognized form according to a first embodiment of the present invention;
FIG. 7 is a schematic diagram of a number corresponding to a numerical score in content in a recognized form according to an embodiment of the present invention;
FIG. 8 is a diagram of statistics according to the identification result of a ballot image according to the first embodiment of the present invention;
fig. 9 is a schematic structural diagram of a ballot data statistics device according to a second embodiment of the present invention;
Fig. 10 is a schematic structural diagram of a computer device according to a third embodiment of the present invention.
Detailed Description
The invention is described in further detail below with reference to the drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the invention and are not limiting thereof. It should be further noted that, for convenience of description, only some, but not all of the structures related to the present invention are shown in the drawings.
Example 1
Fig. 1 is a flowchart of a method for statistics of ballot data according to a first embodiment of the present invention, where the present embodiment is applicable to statistics of handwritten ballots. The method can be executed by a ballot data statistics device which can be realized by software and/or hardware and is integrated in an electronic device with application development function.
As shown in fig. 1, the ballot data statistics method includes the steps of:
s110, acquiring a vote image to be counted, and extracting the contents in a table in the vote image to be counted.
The to-be-counted vote can be a pre-designed table, and contents in the table comprise the selected objects and the selected items of the selected objects. Specifically, the comment items in the table include a flag selection item and a numerical score item. The tag selection item is a voting item in the ballot requiring a predetermined character as a voting selection result, for example, by circling, hooking, or other predetermined character, as an indication of the selection result. The numerical scoring items are voting items in the ballot that require a numeric score as a result, for example, items such as scoring the performance of the subject being evaluated.
In the ballot, the names of the selected objects, the names of the selected items of the selected objects and other contents can be characters of printing bodies, the ballot is printed in advance, and then the ballot is filled out by the person participating in the selecting event, so that the ballot is filled out. The ballot image to be counted can be obtained through scanning or photographing and the like. Illustratively, the votes to be counted may be as shown in fig. 2. The vote in fig. 2 contains the basic information of the name, age, department, post, job level, etc. of the subject being selected, as well as the voting content. The voting content comprises mark selection items, satisfaction degree of selected objects and numerical scoring items, wherein the mark selection items are used for selecting the satisfaction degree of the selected objects, and the numerical scoring items are used for respectively selecting the items such as responsibility center, team consciousness, professional ability, result output and the like of the selected objects. Here, the content of the vote may be set according to specific evaluation content and rules, which are only illustrated here.
Further, after the images of the multiple votes to be counted are acquired, the contents in the vote table need to be extracted to count the data.
Specifically, the statistical vote image is preprocessed firstly, so that the form lines in the image are more prominent, and the extraction effect is better in the subsequent form extraction. The preprocessing process of the ballot image to be counted can comprise gray processing and binarization processing of the ballot image to be counted to obtain a binarized ballot image; then, the binarized ballot image is eroded and expanded. The results of the image preprocessing can be referred to as a schematic diagram of fig. 3, and it can be seen that the table in fig. 3 is enhanced and displayed.
To obtain the content of the table, the table in the image is separated from the text, and the table frame in the image of the vote to be counted can be obtained by identifying and extracting the lines with the cross points in the preprocessed image of the vote to be counted. The frame of the form, including the horizontal lines and the vertical lines, can form a cell every four crossing points, and the form can be extracted by only retaining the frame of the crossing points and positioning the cells based on the crossing points, as shown in fig. 4. Finally, the contents of the table may be determined based on the ballot image to be counted and the table frame. For example, the extracted form and the to-be-counted ballot image subjected to the gradation processing are subjected to difference so as to obtain text and symbols without form, and reference may be made to the character extracted through the contents shown in fig. 5, wherein the character includes a print character and a handwritten character.
S120, respectively identifying characters corresponding to the mark selection items and numbers corresponding to the numerical value scoring items in the table.
Specifically, for the character corresponding to the marker selection item in the content in the table extracted in the previous step, the character corresponding to the marker selection item in the content in the table may be identified by the optical character recognition method. For example, in fig. 6, the options of "satisfied, basically satisfied, unsatisfied and unknown" are marked item items that need to represent the result of the selection by using preset characters, and the printed text and handwritten symbol can be extracted by combining the optical character recognition technology. Considering that the recognition effect of the optical character recognition technology on the handwritten character may not meet the requirement, in the step, only the handwritten character in the preset pixel coordinate area is recognized, the recognized cell is occupied, the condition that the format is not corresponding due to unrecognized errors does not occur, and then all recognition results are stored. The preset pixel coordinate area is the area corresponding to the corresponding mark selection item.
In fig. 6, the default characters are circles, and the corresponding handwriting characters in the options "satisfied, substantially satisfied, unsatisfied, and unknown" are recognized as characters D, O, 0, c, d, b, and o, which are similar or have similar points in shape to the circles. The characters may be stored in a pre-set symbol library, and when the recognized characters belong to the characters in the pre-set symbol library, it is indicated that the person participating in the evaluation selects the tag selection item corresponding to the character. If other preset characters are appointed in the vote, a corresponding preset symbol library can be set according to the corresponding preset characters.
For the numbers corresponding to the numerical score items in the contents of the table extracted in the previous step, the numbers corresponding to the numerical score items may be identified by a preset handwriting number identification model. The preset handwriting digital recognition model may be a model which is built and trained in advance based on a Python language and keras neural network, and the extracted handwriting digital content part is input into the preset handwriting digital recognition model, so that a result output by the model, namely the recognized number, can be obtained.
Recognition results as shown in fig. 7 can be finally obtained by recognizing the handwritten character symbol and the handwritten numeral, respectively. In each of the review items, the review comments and scores that the reviewer gives to the reviewed object are presented.
S130, generating a statistical result of the votes to be counted based on the characters and the numbers.
After identifying the content in the ballot image to be counted, the statistics of the voting data can be performed. Specifically, the selection result corresponding to the selection item can be marked through characters and statistics; by number, the score corresponding to the score item is scored by a statistics value, such as a total score, and the score is ranked. Objects with a single high score may also be selected based on the scores of any of the scoring items. Or the comprehensive results of the statistics mark selection items and the numerical scoring items can be integrated for evaluation.
According to the technical scheme, contents in a table in the to-be-counted ballot image are obtained and extracted, wherein the contents in the table comprise mark selection items and numerical scoring items, and further characters corresponding to the mark selection items and numbers corresponding to the numerical scoring items in the contents in the table are respectively identified; finally, generating a statistical result of the ballot to be counted based on the characters and the numbers; the problem of manually carrying out ballot and scoring statistics in the prior art is solved, the voting selection and scoring score in the ballot image are respectively identified, the ballot result is automatically counted, the ballot statistics efficiency is improved, and artificial statistics errors are avoided.
The following is an embodiment of the ballot data statistics device provided by the embodiment of the present invention, which belongs to the same inventive concept as the ballot data statistics method of each embodiment, and can implement the ballot data statistics method of each embodiment. Details which are not described in detail in the embodiments of the ballot data statistics apparatus may be referred to the embodiments of the ballot data statistics method described above.
Example two
Fig. 9 is a schematic structural diagram of a vote data statistics device according to a third embodiment of the present invention, where the present embodiment is applicable to statistics of handwritten votes.
As shown in fig. 9, the vote data statistics apparatus includes a content extraction module 210, a content identification module 220, and a vote statistics module 230.
The content extraction module 210 is configured to obtain a vote image to be counted, and extract contents in a table in the vote image to be counted, where the contents in the table include a marker selection item and a numerical score item, the marker selection item is a voting item in the vote, where a preset character is required to be used as a voting selection result, and the numerical score item is a voting item in the vote, where a numerical score is required to be used as a voting result; a content identifying module 220, configured to identify the characters corresponding to the tag selection items and the numbers corresponding to the numerical score items in the contents in the table respectively; the vote counting module 230 is configured to generate a counting result of the votes to be counted based on the characters and the numbers.
According to the technical scheme, contents in a table in the to-be-counted ballot image are obtained and extracted, wherein the contents in the table comprise mark selection items and numerical scoring items, and further characters corresponding to the mark selection items and numbers corresponding to the numerical scoring items in the contents in the table are respectively identified; finally, generating a statistical result of the ballot to be counted based on the characters and the numbers; the problem of manually carrying out ballot and scoring statistics in the prior art is solved, the voting selection and scoring score in the ballot image are respectively identified, the ballot result is automatically counted, the ballot statistics efficiency is improved, and artificial statistics errors are avoided.
Optionally, the content extraction module 210 is specifically configured to:
Preprocessing the vote image to be counted;
identifying and extracting lines with cross points in the preprocessed ballot image to be counted, and obtaining a table frame in the ballot image to be counted;
and determining the contents in the table based on the to-be-counted vote image and the table frame.
Optionally, the content extraction module 210 is further configured to:
Gray processing is carried out on the vote image to be counted, and binarization processing is carried out on the vote image to be counted, so that a binarization vote image is obtained;
and corroding and expanding the binarized ballot image.
Optionally, the content identification module 220 includes:
a character recognition sub-module for recognizing characters corresponding to the mark selection items in the contents of the table by an optical character recognition method;
and the number recognition sub-module is used for recognizing the corresponding number in the numerical score through a preset handwriting number recognition model.
Optionally, the character recognition submodule is specifically configured to:
recognizing handwriting symbols in a preset pixel coordinate area in the vote image to be counted by an optical character recognition method, wherein the preset pixel coordinate area corresponds to the area corresponding to the mark selection item;
Determining whether the identified handwritten symbol belongs to a symbol in a preset symbol library;
And when the identified handwritten symbol belongs to the symbol in the preset symbol library, taking the identified handwritten symbol as the preset character corresponding to the mark selection item.
Optionally, the preset symbol library includes symbols associated with the shapes of the specified handwriting symbols in the ballot image to be counted.
Optionally, the vote counting module 230 is specifically configured to:
counting the selection results corresponding to the mark selection items through the characters;
and counting the scores corresponding to the numerical scoring items through the numbers.
The ballot data statistics device provided by the embodiment of the invention can execute the ballot data statistics method provided by any embodiment of the invention, and has the corresponding functional modules and beneficial effects of the execution method.
Example III
Fig. 10 is a schematic structural diagram of a server according to a third embodiment of the present invention. FIG. 10 illustrates a block diagram of an exemplary computer device 12 suitable for use in implementing embodiments of the present invention. The computer device 12 shown in fig. 10 is merely an example and should not be construed as limiting the functionality and scope of use of embodiments of the present invention. The computer device 12 may be any terminal device with computing power, such as an intelligent controller, a server, a mobile phone, and the like.
As shown in FIG. 10, the computer device 12 is in the form of a general purpose computing device. Components of computer device 12 may include, but are not limited to: one or more processors or processing units 16, a system memory 28, a bus 18 that connects the various system components, including the system memory 28 and the processing units 16.
Bus 18 represents one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, a processor, and a local bus using any of a variety of bus architectures. By way of example, and not limitation, such architectures include Industry Standard Architecture (ISA) bus, micro channel architecture (MAC) bus, enhanced ISA bus, video Electronics Standards Association (VESA) local bus, and Peripheral Component Interconnect (PCI) bus.
Computer device 12 typically includes a variety of computer system readable media. Such media can be any available media that is accessible by computer device 12 and includes both volatile and nonvolatile media, removable and non-removable media.
The system memory 28 may include computer system readable media in the form of volatile memory, such as Random Access Memory (RAM) 30 and/or cache memory 32. The computer device 12 may further include other removable/non-removable, volatile/nonvolatile computer system storage media. By way of example only, storage system 34 may be used to read from or write to non-removable, nonvolatile magnetic media (not shown in FIG. 10, commonly referred to as a "hard disk drive"). Although not shown in fig. 10, a magnetic disk drive for reading from and writing to a removable non-volatile magnetic disk (e.g., a "floppy disk"), and an optical disk drive for reading from or writing to a removable non-volatile optical disk (e.g., a CD-ROM, DVD-ROM, or other optical media) may be provided. In such cases, each drive may be coupled to bus 18 through one or more data medium interfaces. The system memory 28 may include at least one program product having a set (e.g., at least one) of program modules configured to carry out the functions of the embodiments of the invention.
A program/utility 40 having a set (at least one) of program modules 42 may be stored in, for example, system memory 28, such program modules 42 including, but not limited to, an operating system, one or more application programs, other program modules, and program data, each or some combination of which may include an implementation of a network environment. Program modules 42 generally perform the functions and/or methods of the embodiments described herein.
The computer device 12 may also communicate with one or more external devices 14 (e.g., keyboard, pointing device, display 24, etc.), one or more devices that enable a user to interact with the computer device 12, and/or any devices (e.g., network card, modem, etc.) that enable the computer device 12 to communicate with one or more other computing devices. Such communication may occur through an input/output (I/O) interface 22. Moreover, computer device 12 may also communicate with one or more networks such as a Local Area Network (LAN), a Wide Area Network (WAN) and/or a public network, such as the Internet, through network adapter 20. As shown, network adapter 20 communicates with other modules of computer device 12 via bus 18. It should be appreciated that although not shown in fig. 10, other hardware and/or software modules may be used in connection with computer device 12, including, but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives, data backup storage systems, and the like.
The processing unit 16 executes various functional applications and data processing by running programs stored in the system memory 28, for example, implementing a ballot data statistics method step provided by the present embodiment, the method comprising:
Acquiring a vote image to be counted, and extracting contents in a table in the vote image to be counted, wherein the contents in the table comprise mark selection items and numerical scoring items, the mark selection items are voting items which need to take preset characters as voting selection results in the votes, and the numerical scoring items are voting items which need to take numerical scores as results in the votes;
Respectively identifying characters corresponding to the mark selection items and numbers corresponding to the numerical scoring items in the table contents;
And generating a statistics result of the votes to be counted based on the characters and the numbers.
Example IV
The fourth embodiment provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, implements the ballot data statistics method as provided in any embodiment of the present invention, including:
Acquiring a vote image to be counted, and extracting contents in a table in the vote image to be counted, wherein the contents in the table comprise mark selection items and numerical scoring items, the mark selection items are voting items which need to take preset characters as voting selection results in the votes, and the numerical scoring items are voting items which need to take numerical scores as results in the votes;
Respectively identifying characters corresponding to the mark selection items and numbers corresponding to the numerical scoring items in the table contents;
And generating a statistics result of the votes to be counted based on the characters and the numbers.
The computer storage media of embodiments of the invention may take the form of any combination of one or more computer-readable media. The computer readable medium may be a computer readable signal medium or a computer readable storage medium. The computer readable storage medium may be, for example, but not limited to: an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or a combination of any of the foregoing. More specific examples (a non-exhaustive list) of the computer-readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In this document, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
The computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, either in baseband or as part of a carrier wave. Such a propagated data signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination of the foregoing. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.
Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to: wireless, wire, fiber optic cable, RF, etc., or any suitable combination of the foregoing.
Computer program code for carrying out operations of the present invention may be written in one or more programming languages, including an object oriented programming language such as Java, smalltalk, C ++, python and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computer (for example, through the Internet using an Internet service provider).
It will be appreciated by those of ordinary skill in the art that the modules or steps of the invention described above may be implemented in a general purpose computing device, they may be centralized on a single computing device, or distributed over a network of computing devices, or they may alternatively be implemented in program code executable by a computer device, such that they are stored in a memory device and executed by the computing device, or they may be separately fabricated as individual integrated circuit modules, or multiple modules or steps within them may be fabricated as a single integrated circuit module. Thus, the present invention is not limited to any specific combination of hardware and software.
Note that the above is only a preferred embodiment of the present invention and the technical principle applied. It will be understood by those skilled in the art that the present invention is not limited to the particular embodiments described herein, but is capable of various obvious changes, rearrangements and substitutions as will now become apparent to those skilled in the art without departing from the scope of the invention. Therefore, while the invention has been described in connection with the above embodiments, the invention is not limited to the embodiments, but may be embodied in many other equivalent forms without departing from the spirit or scope of the invention, which is set forth in the following claims.

Claims (6)

1. A method of ballot data statistics, comprising:
Acquiring a vote image to be counted, and extracting contents in a table in the vote image to be counted, wherein the contents in the table comprise mark selection items and numerical scoring items, the mark selection items are voting items which need to take preset characters as voting selection results in the votes, and the numerical scoring items are voting items which need to take numerical scores as results in the votes;
Respectively identifying characters corresponding to the mark selection items and numbers corresponding to the numerical scoring items in the table contents;
Generating a statistics result of the votes to be counted based on the characters and the numbers;
the extracting the contents in the table in the vote image to be counted comprises the following steps:
Preprocessing the vote image to be counted;
identifying and extracting lines with cross points in the preprocessed ballot image to be counted, and obtaining a table frame in the ballot image to be counted;
Determining contents in the table based on the to-be-counted vote image and the table frame;
wherein the identifying the characters corresponding to the mark selection items and the numbers corresponding to the numerical score items in the table respectively comprises:
Identifying the characters corresponding to the mark selection items in the contents of the table by an optical character identification method;
identifying corresponding numbers in the numerical scoring items through a preset handwriting number identification model;
The identifying the character corresponding to the mark selection item in the table content by the optical character identification method comprises the following steps:
recognizing handwriting symbols in a preset pixel coordinate area in the vote image to be counted by an optical character recognition method, wherein the preset pixel coordinate area corresponds to the area corresponding to the mark selection item;
Determining whether the identified handwritten symbol belongs to a symbol in a preset symbol library;
when the identified handwritten symbol belongs to a symbol in the preset symbol library, taking the identified handwritten symbol as a preset character corresponding to the mark selection item;
The preset symbol library comprises symbols associated with the shapes of the specified handwriting symbols in the ballot images to be counted.
2. The method according to claim 1, wherein the preprocessing of the ballot image to be counted comprises:
Gray processing is carried out on the vote image to be counted, and binarization processing is carried out on the vote image to be counted, so that a binarization vote image is obtained;
and corroding and expanding the binarized ballot image.
3. The method of claim 1, wherein the generating statistics of votes to be counted based on the characters and the numbers comprises:
counting the selection results corresponding to the mark selection items through the characters;
and counting the scores corresponding to the numerical scoring items through the numbers.
4. A ballot data statistics apparatus, comprising:
The content extraction module is used for acquiring the to-be-counted ballot image and extracting the content in a table in the to-be-counted ballot image, wherein the content in the table comprises a mark selection item and a numerical score item, the mark selection item is a voting item which needs to take a preset character as a voting selection result in the ballot, and the numerical score item is a voting item which needs to take a numerical score as a result in the ballot;
The content identification module is used for respectively identifying characters corresponding to the mark selection items and numbers corresponding to the numerical value scoring items in the contents in the table;
The ballot counting module is used for generating a counting result of a ballot to be counted based on the characters and the numbers;
the content extraction module is specifically configured to: preprocessing the vote image to be counted; identifying and extracting lines with cross points in the preprocessed ballot image to be counted, and obtaining a table frame in the ballot image to be counted; determining contents in the table based on the to-be-counted vote image and the table frame;
wherein the content identification module comprises:
a character recognition sub-module for recognizing characters corresponding to the mark selection items in the contents of the table by an optical character recognition method;
the digital identification sub-module is used for identifying the corresponding number in the numerical score through a preset handwriting digital identification model;
Wherein, the character recognition submodule is specifically used for:
recognizing handwriting symbols in a preset pixel coordinate area in the vote image to be counted by an optical character recognition method, wherein the preset pixel coordinate area corresponds to the area corresponding to the mark selection item;
Determining whether the identified handwritten symbol belongs to a symbol in a preset symbol library;
when the identified handwritten symbol belongs to a symbol in the preset symbol library, taking the identified handwritten symbol as a preset character corresponding to the mark selection item;
The preset symbol library comprises symbols associated with the shapes of the specified handwriting symbols in the ballot images to be counted.
5. A computer device, the computer device comprising:
one or more processors;
a memory for storing one or more programs;
The one or more programs, when executed by the one or more processors, cause the one or more processors to implement the ballot data statistics method of any of claims 1-3.
6. A computer readable storage medium having stored thereon a computer program, which when executed by a processor implements a ballot data statistics method as claimed in any of claims 1-3.
CN202110172512.6A 2021-02-08 2021-02-08 Ballot data statistics method, device, equipment and medium Active CN112801016B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110172512.6A CN112801016B (en) 2021-02-08 2021-02-08 Ballot data statistics method, device, equipment and medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110172512.6A CN112801016B (en) 2021-02-08 2021-02-08 Ballot data statistics method, device, equipment and medium

Publications (2)

Publication Number Publication Date
CN112801016A CN112801016A (en) 2021-05-14
CN112801016B true CN112801016B (en) 2024-06-21

Family

ID=75814804

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110172512.6A Active CN112801016B (en) 2021-02-08 2021-02-08 Ballot data statistics method, device, equipment and medium

Country Status (1)

Country Link
CN (1) CN112801016B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117195846B (en) * 2023-11-07 2024-03-01 江西五十铃汽车有限公司 Expert table management method, system, storage medium and equipment

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101447017A (en) * 2008-11-27 2009-06-03 浙江工业大学 Method and system for quickly identifying and counting votes on the basis of layout analysis

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7077313B2 (en) * 2001-10-01 2006-07-18 Avante International Technology, Inc. Electronic voting method for optically scanned ballot
CN104766043A (en) * 2014-10-15 2015-07-08 南通北城科技创业管理有限公司 Method of fast identification of ballot image
CN105740857A (en) * 2016-01-31 2016-07-06 华南理工大学 OCR based automatic acquisition and recognition system for fast pencil-and-paper voting result

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101447017A (en) * 2008-11-27 2009-06-03 浙江工业大学 Method and system for quickly identifying and counting votes on the basis of layout analysis

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
一种基于OCR识别技术的通用计票***设计;罗维;;企业科技与发展(第09期);全文 *

Also Published As

Publication number Publication date
CN112801016A (en) 2021-05-14

Similar Documents

Publication Publication Date Title
CN112597312A (en) Text classification method and device, electronic equipment and readable storage medium
CN110619568A (en) Risk assessment report generation method, device, equipment and storage medium
US11055327B2 (en) Unstructured data parsing for structured information
CN109902285B (en) Corpus classification method, corpus classification device, computer equipment and storage medium
CN112507936A (en) Image information auditing method and device, electronic equipment and readable storage medium
CN112380870A (en) User intention analysis method and device, electronic equipment and computer storage medium
CN112115130A (en) Method, device, equipment and medium for acquiring data corresponding relation
US11341319B2 (en) Visual data mapping
CN109165316A (en) A kind of method for processing video frequency, video index method, device and terminal device
CN111428480A (en) Resume identification method, device, equipment and storage medium
CN113064994A (en) Conference quality evaluation method, device, equipment and storage medium
CN113360768A (en) Product recommendation method, device and equipment based on user portrait and storage medium
CN112650867A (en) Picture matching method and device, electronic equipment and storage medium
CN112801016B (en) Ballot data statistics method, device, equipment and medium
CN113821602A (en) Automatic answering method, device, equipment and medium based on image-text chatting record
CN113254814A (en) Network course video labeling method and device, electronic equipment and medium
CN112613367A (en) Bill information text box acquisition method, system, equipment and storage medium
CN112528984A (en) Image information extraction method, device, electronic equipment and storage medium
CN108170838B (en) Topic evolution visualization display method, application server and computer readable storage medium
CN115934928A (en) Information extraction method, device, equipment and storage medium
CN114049686A (en) Signature recognition model training method and device and electronic equipment
CN114943306A (en) Intention classification method, device, equipment and storage medium
CN115203364A (en) Software fault feedback processing method, device, equipment and readable storage medium
CN114067343A (en) Data set construction method, model training method and corresponding device
CN114548825B (en) Complaint work order distortion detection method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant