CN115471858A - Data processing method and device for bill management - Google Patents

Data processing method and device for bill management Download PDF

Info

Publication number
CN115471858A
CN115471858A CN202211109286.8A CN202211109286A CN115471858A CN 115471858 A CN115471858 A CN 115471858A CN 202211109286 A CN202211109286 A CN 202211109286A CN 115471858 A CN115471858 A CN 115471858A
Authority
CN
China
Prior art keywords
bill
image
result information
information
processed
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211109286.8A
Other languages
Chinese (zh)
Inventor
王忠军
魏铃昌
陈达
谢朝晖
周国才
杨轶俊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Southern Power Grid Digital Platform Technology Guangdong Co ltd
Original Assignee
China Southern Power Grid Digital Platform Technology Guangdong Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Southern Power Grid Digital Platform Technology Guangdong Co ltd filed Critical China Southern Power Grid Digital Platform Technology Guangdong Co ltd
Priority to CN202211109286.8A priority Critical patent/CN115471858A/en
Publication of CN115471858A publication Critical patent/CN115471858A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/42Document-oriented image-based pattern recognition based on the type of document
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/18Extraction of features or characteristics of the image
    • GPHYSICS
    • G07CHECKING-DEVICES
    • G07DHANDLING OF COINS OR VALUABLE PAPERS, e.g. TESTING, SORTING BY DENOMINATIONS, COUNTING, DISPENSING, CHANGING OR DEPOSITING
    • G07D11/00Devices accepting coins; Devices accepting, dispensing, sorting or counting valuable papers
    • G07D11/20Controlling or monitoring the operation of devices; Data handling
    • G07D11/22Means for sensing or detection

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Databases & Information Systems (AREA)
  • Medical Informatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Character Input (AREA)

Abstract

The invention discloses a data processing method and a data processing device for bill management, wherein the method comprises the following steps: acquiring image information of a bill to be processed; the bill image information to be processed comprises a plurality of bill images to be processed; detecting and processing image information of a bill to be processed to obtain an image detection result information set; the image detection result information set comprises a plurality of pieces of image detection result information; carrying out identification and classification processing on the image detection result information set to obtain bill identification information; the bill identification information is used for indicating intelligent management of the bill. Therefore, the invention can obtain the bill identification information for indicating the intelligent management of the bill by carrying out comprehensive processing such as detection processing, identification and classification processing and the like on the image information of the bill to be processed, is favorable for improving the intelligent acquisition, intelligent storage and intelligent inspection of the bill information, further improves the invoice registration efficiency and reduces the invoice audit risk.

Description

Data processing method and device for bill management
Technical Field
The present invention relates to the field of image processing technologies, and in particular, to a data processing method and apparatus for bill management.
Background
Invoice management is one of the most important jobs in financial management services, and invoices must be registered by the contractual payment service. In the face of a large number of invoices generated by contract payment services, in the technical field of invoice information management, the work form of the method still manually registers invoice information by one invoice, and the method not only costs a large amount of manpower, material resources and time, but also has the risk of input errors, and has high error rate and low efficiency. Therefore, it is very important to provide a data processing method and device for bill management to improve intelligent collection, intelligent storage and intelligent inspection of bill information, further improve invoice registration efficiency, and reduce invoice audit risk.
Disclosure of Invention
The technical problem to be solved by the invention is to provide a data processing method and device for bill management, which can obtain bill identification information for indicating intelligent management of bills by carrying out comprehensive processing such as detection processing, identification and classification processing and the like on image information of bills to be processed, and are beneficial to improving intelligent acquisition, intelligent storage and intelligent inspection of bill information, thereby improving the invoice registration efficiency and reducing the invoice examination risk.
In order to solve the above technical problem, a first aspect of an embodiment of the present invention discloses a data processing method for ticket management, where the method includes:
acquiring image information of a bill to be processed; the bill image information to be processed comprises a plurality of bill images to be processed;
detecting and processing the image information of the bill to be processed to obtain an image detection result information set; the image detection result information set comprises a plurality of pieces of image detection result information;
carrying out identification and classification processing on the image detection result information set to obtain bill identification information; the bill identification information is used for indicating intelligent management of the bill.
As an optional implementation manner, in the first aspect of the embodiment of the present invention, the processing, to detect image information of the to-be-processed bill to obtain an image detection result information set, includes:
for any bill image to be processed, carrying out angle detection processing on the bill image to be processed to obtain first processed image information corresponding to the bill image to be processed; the angle detection processing is used for carrying out angle correction on the bill image to be processed;
performing character detection processing on the first processed image information to obtain second processed image information corresponding to the bill image to be processed; the character detection processing is used for identifying a character line area of the first processing image information;
and carrying out straight line and seal detection on the second processed image information to obtain image detection result information corresponding to the bill image to be processed.
As an optional implementation manner, in the first aspect of the embodiment of the present invention, the performing identification and classification processing on the image detection result information set to obtain the bill identification information includes:
identifying the image detection result information set to obtain an image identification result information set; the image recognition result information set comprises a plurality of pieces of image recognition result information;
and post-processing the image recognition result information set to obtain bill recognition information.
As an optional implementation manner, in the first aspect of the embodiment of the present invention, the performing identification processing on the image detection result information set to obtain an image identification result information set includes:
carrying out whole-line character recognition on the image detection result information set to obtain a first recognition result information set; the first identification result information set comprises a plurality of pieces of first identification result information;
calling a special engine to perform region identification on the image detection result information set to obtain a second identification result information set; the first identification result information set comprises a plurality of pieces of second identification result information;
and integrating the first recognition result information set and the second recognition result information set to obtain an image recognition result information set.
As an optional implementation manner, in the first aspect of the embodiment of the present invention, the performing post-processing on the image recognition result information set to obtain the ticket recognition information includes:
filtering the image recognition result information set to obtain a filtered image result information set; the filtered image result information set comprises a plurality of pieces of filtered image result information; the filtering processing is used for eliminating text lines with low credibility;
correcting the filtered image result information set to obtain bill identification information; the correction process is used to correct for routine errors in the filtered image result information.
As an optional implementation manner, in the first aspect of the embodiment of the present invention, after performing recognition and classification processing on the image detection result information set to obtain the bill recognition information, the method further includes:
performing type identification on the bill identification information to obtain bill type information;
and executing a preset bill management strategy according to the bill type information.
As an optional implementation manner, in the first aspect of the embodiment of the present invention, the executing a preset ticket management policy according to the ticket type information includes:
when the bill type information is a non-value-added tax invoice, storing and updating the bill identification information;
when the bill type information is a value-added tax invoice, performing authenticity check on the bill identification information to obtain check result information;
and when the checking result information is yes, performing repeated aggregation checking processing on the bill identification information, and performing storage updating processing.
The second aspect of the embodiments of the present invention discloses a data processing apparatus for bill management, the apparatus comprising:
the acquisition module is used for acquiring image information of the bill to be processed; the bill image information to be processed comprises a plurality of bill images to be processed;
the first processing module is used for detecting and processing the image information of the bill to be processed to obtain an image detection result information set; the image detection result information set comprises a plurality of pieces of image detection result information;
the second processing module is used for identifying and classifying the image detection result information set to obtain bill identification information; the bill identification information is used for indicating intelligent management of the bill.
As an optional implementation manner, in the second aspect of the embodiment of the present invention, the specific manner of detecting and processing the image information of the to-be-processed bill by the first processing module to obtain the image detection result information set is as follows:
for any bill image to be processed, carrying out angle detection processing on the bill image to be processed to obtain first processing image information corresponding to the bill image to be processed; the angle detection processing is used for carrying out angle correction on the bill image to be processed;
performing character detection processing on the first processed image information to obtain second processed image information corresponding to the bill image to be processed; the character detection processing is used for identifying a character line area of the first processing image information;
and carrying out straight line and seal detection on the second processed image information to obtain image detection result information corresponding to the bill image to be processed.
As an optional implementation manner, in the second aspect of the embodiment of the present invention, the second processing module performs recognition and classification processing on the image detection result information set, and a specific manner of obtaining the bill recognition information is as follows:
identifying the image detection result information set to obtain an image identification result information set; the image recognition result information set comprises a plurality of pieces of image recognition result information;
and post-processing the image recognition result information set to obtain bill recognition information.
As an optional implementation manner, in the second aspect of the embodiment of the present invention, the second processing module performs recognition processing on the image detection result information set, and a specific manner of obtaining the image recognition result information set is as follows:
carrying out whole-line character recognition on the image detection result information set to obtain a first recognition result information set; the first identification result information set comprises a plurality of pieces of first identification result information;
calling a special engine to perform region identification on the image detection result information set to obtain a second identification result information set; the first identification result information set comprises a plurality of pieces of second identification result information;
and integrating the first recognition result information set and the second recognition result information set to obtain an image recognition result information set.
As an optional implementation manner, in the second aspect of the embodiment of the present invention, the second processing module performs post-processing on the image recognition result information set to obtain the bill recognition information in a specific manner:
filtering the image recognition result information set to obtain a filtered image result information set; the filtered image result information set comprises a plurality of pieces of filtered image result information; the filtering processing is used for eliminating text lines with low credibility;
correcting the filtered image result information set to obtain bill identification information; the correction process is used to correct for routine errors in the filtered image result information.
As an optional implementation manner, in the second aspect of the embodiment of the present invention, after the second processing module performs recognition and classification processing on the image detection result information set to obtain the bill recognition information, the apparatus further includes:
the identification module is used for identifying the types of the bill identification information to obtain bill type information;
and the execution module is used for executing a preset bill management strategy according to the bill type information.
As an optional implementation manner, in a second aspect of the embodiment of the present invention, a specific manner in which the execution module executes a preset ticket management policy according to the ticket type information is as follows:
when the bill type information is a non-value-added tax invoice, storing and updating the bill identification information;
when the bill type information is a value-added tax invoice, performing authenticity check on the bill identification information to obtain check result information;
and when the checking result information is yes, performing repeated aggregation checking processing on the bill identification information, and performing storage updating processing.
A third aspect of the present invention discloses another data processing apparatus for ticket management, the apparatus comprising:
a memory storing executable program code;
a processor coupled with the memory;
the processor calls the executable program code stored in the memory to execute part or all of the steps of the data processing method for bill management disclosed in the first aspect of the embodiment of the present invention.
A fourth aspect of the present invention discloses a computer storage medium, where the computer storage medium stores computer instructions, and when the computer instructions are called, the computer instructions are used to execute some or all of the steps in the data processing method for ticket management disclosed in the first aspect of the present invention.
Compared with the prior art, the embodiment of the invention has the following beneficial effects:
in the embodiment of the invention, the image information of a bill to be processed is acquired; the bill image information to be processed comprises a plurality of bill images to be processed; detecting and processing image information of a bill to be processed to obtain an image detection result information set; the image detection result information set comprises a plurality of pieces of image detection result information; identifying and classifying the image detection result information set to obtain bill identification information; the bill identification information is used for indicating intelligent management of the bill. Therefore, the invention can obtain the bill identification information for indicating the intelligent management of the bill by carrying out comprehensive processing such as detection processing, identification and classification processing and the like on the image information of the bill to be processed, is favorable for improving the intelligent acquisition, intelligent storage and intelligent inspection of the bill information, further improves the invoice registration efficiency and reduces the invoice examination risk.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings required to be used in the description of the embodiments are briefly introduced below, and it is obvious that the drawings in the description below are only some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained according to the drawings without creative efforts.
FIG. 1 is a flow chart illustrating a data processing method for bill management according to an embodiment of the present invention;
FIG. 2 is a flow chart of another data processing method for bill management according to the embodiment of the invention;
FIG. 3 is a schematic structural diagram of a data processing apparatus for bill management according to an embodiment of the present disclosure;
FIG. 4 is a schematic structural diagram of another data processing apparatus for bill management according to an embodiment of the present disclosure;
FIG. 5 is a schematic structural diagram of another data processing apparatus for bill management according to an embodiment of the present invention.
Detailed Description
In order to make those skilled in the art better understand the technical solutions of the present invention, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be obtained by a person skilled in the art without inventive step based on the embodiments of the present invention, are within the scope of protection of the present invention.
The terms "first," "second," and the like in the description and claims of the present invention and in the above-described drawings are used for distinguishing between different objects and not for describing a particular order. Furthermore, the terms "include" and "have," as well as any variations thereof, are intended to cover non-exclusive inclusions. For example, a process, method, apparatus, product, or apparatus that comprises a list of steps or elements is not limited to those listed but may alternatively include other steps or elements not listed or inherent to such process, method, product, or apparatus.
Reference herein to "an embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the invention. The appearances of the phrase in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments. It is explicitly and implicitly understood by one skilled in the art that the embodiments described herein can be combined with other embodiments.
The invention discloses a data processing method and a data processing device for bill management, which can obtain bill identification information for indicating intelligent management of bills by carrying out comprehensive processing such as detection processing, identification and classification processing and the like on image information of bills to be processed, are favorable for improving intelligent acquisition, intelligent storage and intelligent inspection of bill information, further improve the invoice registration efficiency and reduce the invoice audit risk. The following are detailed descriptions.
Example one
Referring to fig. 1, fig. 1 is a schematic flowchart of a data processing method for bill management according to an embodiment of the present invention. The data processing method for ticket management described in fig. 1 is applied to a ticket management system, such as a local server or a cloud server for data processing management of ticket management, and the embodiment of the present invention is not limited thereto. As shown in fig. 1, the data processing method for ticket management may include the operations of:
101. and acquiring image information of the bill to be processed.
In the embodiment of the invention, the information of the bill images to be processed comprises a plurality of bill images to be processed.
102. And detecting and processing the image information of the bill to be processed to obtain an image detection result information set.
In an embodiment of the present invention, the image detection result information set includes a plurality of pieces of image detection result information.
103. And identifying and classifying the image detection result information set to obtain bill identification information.
In the embodiment of the invention, the bill identification information is used for indicating the intelligent management of the bill.
Optionally, the image information of the bill to be processed is obtained by scanning the bill or uploading electronic invoice files in batch.
Optionally, the above-mentioned processing of detecting the image information of the bill to be processed is performed by using an image recognition model.
Optionally, the output layer of the image recognition model is 10, so as to match the number of types of the bills, and improve the data processing efficiency of the bills.
Optionally, the processing of detecting the information of the to-be-processed bill image includes inputting the to-be-processed bill image into a U-type network, performing feature extraction on the image through a 5-layer convolution layer, then performing up-sampling and merging convolution results of a previous layer to obtain 128 feature maps with the size of 1/2 of the image, then outputting different predicted scoresmaps according to different targets, and finally analyzing corresponding image detection result information according to the scoresmaps.
Optionally, the problem that the network scale is huge and the storage model is huge can be avoided by using the U-shaped network to detect and process the bill image information to be processed, so that the processing efficiency of the bill image to be processed is improved.
Optionally, the data processing method for bill management can be used for completing registration of multiple invoices in batches, the bill information registration process is controlled within 3 to 5 seconds, manual entry time is greatly shortened, bill information entry efficiency is improved, and accuracy and normalization of bill information are guaranteed.
Therefore, the data processing method for bill management, which is described in the embodiment of the invention, can obtain the bill identification information for indicating the intelligent management of the bill by performing comprehensive processing such as detection processing, identification and classification processing and the like on the image information of the bill to be processed, is favorable for improving the intelligent acquisition, intelligent storage and intelligent inspection of the bill information, further improving the invoice registration efficiency and reducing the invoice examination risk.
In an optional embodiment, in the step 102, detecting and processing the image information of the to-be-processed bill to obtain an image detection result information set includes:
for any bill image to be processed, carrying out angle detection processing on the bill image to be processed to obtain first processing image information corresponding to the bill image to be processed; the angle detection processing is used for carrying out angle correction on the bill image to be processed;
performing character detection processing on the first processed image information to obtain second processed image information corresponding to the bill image to be processed; the character detection processing is used for identifying a character line area of the first processing image information;
and performing straight line and seal detection on the second processed image information to obtain image detection result information corresponding to the bill image to be processed.
In this optional embodiment, as an optional implementation, the above-mentioned performing angle detection processing on the to-be-processed document image to obtain the first processed image information corresponding to the to-be-processed document image specifically includes:
identifying the main direction angle of the bill image to be processed;
and correcting the bill image to be processed by utilizing the main direction angle to obtain first processing image information corresponding to the bill image to be processed.
Optionally, the position information of the bill data of the bill image to be processed can be accurately positioned in a disordered and strange complex scene by utilizing the angle detection processing and the character detection processing.
Optionally, the position information of the bill data includes a main direction angle, and/or line information, and/or a stamp region, and/or a text region, which is not limited in the embodiment of the present invention.
Optionally, the performing of the straight line detection on the second processed image information is to identify and locate a table line in the second processed image.
Therefore, the data processing method for bill management, which is described in the embodiment of the invention, can obtain image detection result information by performing angle detection, character detection, linear detection and stamp detection on the bill image to be processed, and is beneficial to improving intelligent acquisition, intelligent storage and intelligent inspection of the bill information, thereby improving the invoice registration efficiency and reducing the invoice examination risk.
In another optional embodiment, the identifying and classifying the image detection result information set to obtain the bill identification information includes:
carrying out identification processing on the image detection result information set to obtain an image identification result information set; the image recognition result information set comprises a plurality of pieces of image recognition result information;
and post-processing the image recognition result information set to obtain bill recognition information.
Therefore, the data processing method for bill management can perform comprehensive processing such as identification processing and post-processing on the image detection result information set to obtain the bill identification information, and is beneficial to improving intelligent acquisition, intelligent storage and intelligent inspection of the bill information, thereby improving the invoice registration efficiency and reducing the invoice audit risk.
In yet another optional embodiment, the identifying the image detection result information set to obtain an image identification result information set includes:
carrying out whole-line character recognition on the image detection result information set to obtain a first recognition result information set; the first identification result information set comprises a plurality of pieces of first identification result information;
calling a special engine to perform region identification on the image detection result information set to obtain a second identification result information set; the first identification result information set comprises a plurality of pieces of second identification result information;
and integrating the first recognition result information set and the second recognition result information set to obtain an image recognition result information set.
Optionally, the whole line of text recognition performs recognition processing on the text image through sequence recognition, so as to avoid the extremely difficult segmentation and recognition of single characters in the traditional algorithm, thereby improving recognition accuracy and efficiency.
Optionally, the recognizing the text image by the sequence recognition is to recognize the text region according to the writing rule and the sequence of the text.
Optionally, the average end-to-end accuracy of the whole line of character recognition may be up to 95% or more.
Optionally, the general recognition engine for performing whole-line character recognition on the image detection result information set includes a CNN network module, and/or an LSTM network module, and/or an Attention network module, and/or a CTC network module, which is not limited in the embodiment of the present invention.
In this optional embodiment, as an optional implementation manner, the specific manner of calling the special engine to perform region identification on the image detection result information set to obtain the second identification result information set is as follows:
for any one of the image detection result information, obtaining form template information; the form template information comprises a plurality of form templates;
for any form template, analyzing a form area in the image detection result information by using the straight line information in the image detection result information to obtain first form lattice information corresponding to the form template;
performing structural analysis on the first table lattice information, and performing fusion processing on character information in the image detection result information to obtain second table lattice information corresponding to the table template;
extracting the features of the second form lattice information to obtain form feature information corresponding to the form template;
performing matching score calculation processing on the form characteristic information to obtain a form matching score corresponding to the form template;
selecting the table matching score with the largest value from all the table matching scores as a target table matching score;
determining a form template corresponding to the target form matching score as a target form template;
calling an identification engine corresponding to the target form template as a special engine;
and performing small character set character recognition on the image detection result information by using the special engine to obtain second recognition result information corresponding to the image detection result information.
Optionally, the special engine is used for identifying the small character set characters in the image detection result information set, so that the identification of special items such as handwritten numbers, handwritten English, handwritten capital money and the like can be realized, and the small character set characters can be identified more accurately and quickly.
Optionally, the table characteristic information includes a number of rows and columns, and/or a relative size of the table, and/or a number of rows and columns occupied by the table, and/or a key word inside the table, which is not limited in the embodiment of the present invention.
Therefore, the data processing method for bill management, which is described in the embodiment of the invention, can obtain the image identification result information set by performing whole-line character identification, area identification and integration processing on the image detection result information set, so that the intelligent acquisition, intelligent storage and intelligent inspection of the bill information can be improved, the invoice registration efficiency can be improved, and the invoice audit risk can be reduced.
In another optional embodiment, the post-processing the image recognition result information set to obtain the bill recognition information includes:
filtering the image recognition result information set to obtain a filtered image result information set; the filtered image result information set comprises a plurality of pieces of filtered image result information; the filtering processing is used for eliminating text lines with low credibility;
correcting the filtered image result information set to obtain bill identification information; the correction process is used to correct for routine errors in the filtered image result information.
Therefore, the data processing method for bill management, which is described in the embodiment of the invention, can obtain the bill identification information by filtering and correcting the image identification result information set, and is more beneficial to improving intelligent acquisition, intelligent storage and intelligent inspection of the bill information, thereby improving the invoice registration efficiency and reducing the invoice audit risk.
Example two
Referring to fig. 2, fig. 2 is a schematic flowchart illustrating another data processing method for bill management according to an embodiment of the present invention. The data processing method for ticket management depicted in fig. 2 is applied to a ticket management system, such as a local server or a cloud server for data processing management of ticket management, and the embodiment of the present invention is not limited thereto. As shown in fig. 2, the data processing method for ticket management may include the operations of:
201. and acquiring image information of the bill to be processed.
202. And detecting and processing the image information of the bill to be processed to obtain an image detection result information set.
203. And identifying and classifying the image detection result information set to obtain bill identification information.
204. And performing type identification on the bill identification information to obtain bill type information.
205. And executing a preset bill management strategy according to the bill type information.
In the embodiment of the present invention, for specific technical details and technical noun explanations of step 201 to step 202, reference may be made to the detailed description of step 101 to step 102 in the first embodiment, and details are not repeated in the embodiment of the present invention.
In this optional embodiment, as an optional implementation manner, after the executing a preset ticket management policy according to the ticket type information, the method further includes:
acquiring bill collection information in a database;
and carrying out classified display processing on the bill collecting information to dynamically display the bill information.
Optionally, the bill aggregation information includes an invoice code, and/or an invoice number date, and/or, an amount, and/or a check code, which is not limited in the embodiment of the present invention.
Therefore, the data processing method for bill management described in the embodiment of the invention can obtain the bill identification information for indicating the intelligent management of the bill by carrying out comprehensive processing such as detection processing, identification classification processing and the like on the image information of the bill to be processed, and carry out the intelligent management on the bill by carrying out the type identification processing and the like on the bill identification information, thereby being beneficial to improving the intelligent acquisition, intelligent storage and intelligent inspection of the bill information, further improving the invoice registration efficiency and reducing the invoice examination risk.
In an optional embodiment, in step 205, executing a preset ticket management policy according to the ticket type information includes:
when the bill type information is a non-value-added tax invoice, storing and updating the bill identification information;
when the bill type information is a value-added tax invoice, carrying out authenticity check on the bill identification information to obtain check result information;
and when the checking result information is yes, performing repeated aggregation checking processing on the bill identification information, and performing storage updating processing.
Optionally, the ticket identification information includes a plurality of ticket identification results.
In this optional embodiment, as an optional implementation manner, a specific manner of performing repeated aggregation checking processing on the bill identification information and performing storage updating processing is as follows:
sequentially selecting bill identification results in the bill identification information as bills to be collected;
judging whether all the filed value-added tax invoices in the filed value-added tax invoice set in the database have target value-added tax invoices matched with the target value-added tax invoices or not to obtain an invoice judgment result;
when the invoice judgment result is yes, triggering an execution sequence to select a bill identification result in the bill identification information as a bill to be collected;
and when the invoice judgment result is negative, storing and updating the archived value-added tax invoice set by using the to-be-collected bill.
Therefore, the data processing method for bill management, which is described in the embodiment of the invention, can be used for repeatedly performing the grouping, checking and storage updating processing on the bill identification information according to the checking result information, and is more favorable for improving the intelligent acquisition, intelligent storage and intelligent checking of the bill information, thereby improving the invoice registration efficiency and reducing the invoice audit risk.
EXAMPLE III
Referring to fig. 3, fig. 3 is a schematic structural diagram of a data processing apparatus for bill management according to an embodiment of the present invention. The apparatus described in fig. 3 can be applied to a ticket management system, such as a local server or a cloud server for data processing management of ticket management, and the embodiment of the present invention is not limited thereto. As shown in fig. 3, the apparatus may include:
the acquisition module 301 is used for acquiring image information of a bill to be processed; the bill image information to be processed comprises a plurality of bill images to be processed;
the first processing module 302 is configured to detect and process the to-be-processed bill image information to obtain an image detection result information set; the image detection result information set comprises a plurality of pieces of image detection result information;
the second processing module 303 is configured to perform recognition and classification processing on the image detection result information set to obtain bill recognition information; the bill identification information is used for indicating intelligent management of the bill.
Therefore, by implementing the data processing device for bill management described in fig. 3, the bill identification information for indicating the intelligent management of the bill can be obtained by performing comprehensive processing such as detection processing, identification and classification processing and the like on the image information of the bill to be processed, so that the intelligent acquisition, intelligent storage and intelligent inspection of the bill information can be improved, the invoice registration efficiency can be improved, and the invoice audit risk can be reduced.
In another alternative embodiment, as shown in fig. 4, the specific way for the first processing module 302 to detect and process the to-be-processed bill image information and obtain the image detection result information set is as follows:
for any bill image to be processed, carrying out angle detection processing on the bill image to be processed to obtain first processing image information corresponding to the bill image to be processed; the angle detection processing is used for carrying out angle correction on the bill image to be processed;
carrying out character detection processing on the first processed image information to obtain second processed image information corresponding to the bill image to be processed; the character detection processing is used for identifying a character line area of the first processing image information;
and carrying out straight line and seal detection on the second processed image information to obtain image detection result information corresponding to the bill image to be processed.
Therefore, the data processing device for bill management depicted in fig. 4 can obtain image detection result information by performing angle detection, character detection, line detection and stamp detection on the bill image to be processed, and is beneficial to improving intelligent acquisition, intelligent storage and intelligent inspection of the bill information, thereby improving the invoice registration efficiency and reducing the invoice audit risk.
In yet another alternative embodiment, as shown in fig. 4, the second processing module 303 performs recognition and classification processing on the image detection result information set to obtain the bill recognition information in a specific manner:
carrying out identification processing on the image detection result information set to obtain an image identification result information set; the image recognition result information set comprises a plurality of pieces of image recognition result information;
and post-processing the image recognition result information set to obtain bill recognition information.
Therefore, by implementing the data processing device for bill management described in fig. 4, the image detection result information set can be subjected to comprehensive processing such as identification processing and post-processing to obtain the bill identification information, which is beneficial to improving intelligent acquisition, intelligent storage and intelligent inspection of the bill information, further improving the invoice registration efficiency and reducing the invoice audit risk.
In yet another alternative embodiment, as shown in fig. 4, the second processing module 303 performs recognition processing on the image detection result information set, and the specific manner of obtaining the image recognition result information set is as follows:
carrying out whole-line character recognition on the image detection result information set to obtain a first recognition result information set; the first identification result information set comprises a plurality of pieces of first identification result information;
calling a special engine to perform region identification on the image detection result information set to obtain a second identification result information set; the first identification result information set comprises a plurality of pieces of second identification result information;
and integrating the first recognition result information set and the second recognition result information set to obtain an image recognition result information set.
Therefore, by implementing the data processing device for bill management described in fig. 4, the image recognition result information set can be obtained by performing whole-line character recognition, area recognition and integration processing on the image detection result information set, which is more beneficial to improving intelligent acquisition, intelligent storage and intelligent inspection of bill information, further improving the invoice registration efficiency and reducing the invoice audit risk.
In still another alternative embodiment, as shown in fig. 4, the second processing module 303 performs post-processing on the image recognition result information set to obtain the bill recognition information in a specific manner:
filtering the image recognition result information set to obtain a filtered image result information set; the filtered image result information set comprises a plurality of pieces of filtered image result information; the filtering processing is used for eliminating text lines with low credibility;
correcting the filtered image result information set to obtain bill identification information; the correction process is used to correct for conventional errors in the filtered image result information.
Therefore, by implementing the data processing device for bill management depicted in fig. 4, the bill identification information can be obtained by filtering and correcting the image identification result information set, which is more beneficial to improving intelligent acquisition, intelligent storage and intelligent inspection of the bill information, and further improving the invoice registration efficiency and reducing the invoice audit risk.
In yet another alternative embodiment, as shown in fig. 4, after the second processing module 303 performs recognition and classification processing on the image detection result information set to obtain the bill recognition information, the apparatus further includes:
the identification module 304 is used for identifying the types of the bill identification information to obtain bill type information;
and the executing module 305 is configured to execute a preset ticket management policy according to the ticket type information.
It can be seen that, with the data processing apparatus for bill management described in fig. 4, through comprehensive processing such as detection processing, identification classification processing and the like performed on image information of a bill to be processed, bill identification information for indicating intelligent management of the bill is obtained, and through processing such as type identification and the like performed on the bill identification information, intelligent management of the bill is performed, which is beneficial to improving intelligent acquisition, intelligent storage and intelligent inspection of the bill information, thereby improving the invoice registration efficiency and reducing the invoice audit risk.
In yet another alternative embodiment, as shown in fig. 4, the specific way for the executing module 305 to execute the preset ticket management policy according to the ticket type information is as follows:
when the bill type information is a non-value-added tax invoice, storing and updating the bill identification information;
when the bill type information is a value-added tax invoice, carrying out authenticity check on the bill identification information to obtain check result information;
and when the checking result information is yes, performing repeated aggregation checking processing on the bill identification information, and performing storage updating processing.
Therefore, by implementing the data processing device for bill management described in fig. 4, the repeated collection, checking, storage and updating of the bill identification information can be performed according to the checking result information, which is more beneficial to improving the intelligent collection, intelligent storage and intelligent checking of the bill information, further improving the invoice registration efficiency and reducing the invoice audit risk.
Example four
Referring to fig. 5, fig. 5 is a schematic structural diagram of another data processing apparatus for bill management according to an embodiment of the present invention. The apparatus described in fig. 5 can be applied to a ticket management system, such as a local server or a cloud server for data processing management of ticket management, and the embodiment of the present invention is not limited thereto. As shown in fig. 5, the apparatus may include:
a memory 401 storing executable program code;
a processor 402 coupled with the memory 401;
the processor 402 calls the executable program code stored in the memory 401 for performing the steps in the data processing method for ticket management described in embodiment one or embodiment two.
EXAMPLE five
The embodiment of the invention discloses a computer-readable storage medium which stores a computer program for electronic data exchange, wherein the computer program enables a computer to execute the steps of the data processing method for bill management described in the first embodiment or the second embodiment.
EXAMPLE six
The embodiment of the invention discloses a computer program product, which comprises a non-transitory computer readable storage medium storing a computer program, wherein the computer program is operable to make a computer execute the steps in the data processing method for bill management described in the first embodiment or the second embodiment.
The above-described embodiments of the apparatus are merely illustrative, and the modules described as separate parts may or may not be physically separate, and the parts displayed as modules may or may not be physical modules, may be located in one place, or may be distributed on a plurality of network modules. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment. One of ordinary skill in the art can understand and implement it without inventive effort.
Through the above detailed description of the embodiments, those skilled in the art will clearly understand that the embodiments may be implemented by software plus a necessary general hardware platform, and may also be implemented by hardware. With this understanding in mind, the above technical solutions may essentially or in part contribute to the prior art, be embodied in the form of a software product, which may be stored in a computer-readable storage medium, including a Read-Only Memory (ROM), a Random Access Memory (RAM), a Programmable Read-Only Memory (PROM), an Erasable Programmable Read-Only Memory (EPROM), a One-time Programmable Read-Only Memory (OTPROM), an electronically Erasable Programmable Read-Only Memory (EEPROM), an optical Disc-Read (CD-ROM) or other storage medium capable of storing data, a magnetic tape, or any other computer-readable medium capable of storing data.
Finally, it should be noted that: the data processing method and apparatus for bill management disclosed in the embodiments of the present invention are only the preferred embodiments of the present invention, and are only used for illustrating the technical solutions of the present invention, not for limiting the same; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art; the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and the modifications or the substitutions do not make the essence of the corresponding technical solutions depart from the spirit and scope of the technical solutions of the embodiments of the present invention.

Claims (10)

1. A data processing method for ticket management, the method comprising:
acquiring image information of a bill to be processed; the bill image information to be processed comprises a plurality of bill images to be processed;
detecting and processing the image information of the bill to be processed to obtain an image detection result information set; the image detection result information set comprises a plurality of pieces of image detection result information;
carrying out identification and classification processing on the image detection result information set to obtain bill identification information; the bill identification information is used for indicating intelligent management of the bill.
2. The data processing method for bill management according to claim 1, wherein said detecting and processing image information of said bill to be processed to obtain an image detection result information set comprises:
for any bill image to be processed, carrying out angle detection processing on the bill image to be processed to obtain first processing image information corresponding to the bill image to be processed; the angle detection processing is used for carrying out angle correction on the bill image to be processed;
performing character detection processing on the first processed image information to obtain second processed image information corresponding to the bill image to be processed; the character detection processing is used for identifying a character line area of the first processing image information;
and carrying out straight line and seal detection on the second processed image information to obtain image detection result information corresponding to the bill image to be processed.
3. The data processing method for bill management according to claim 1, wherein said identifying and classifying said image detection result information set to obtain bill identification information comprises:
carrying out identification processing on the image detection result information set to obtain an image identification result information set; the image recognition result information set comprises a plurality of pieces of image recognition result information;
and post-processing the image recognition result information set to obtain bill recognition information.
4. The data processing method for bill management according to claim 3, wherein said performing recognition processing on said image detection result information set to obtain an image recognition result information set comprises:
carrying out whole-line character recognition on the image detection result information set to obtain a first recognition result information set; the first identification result information set comprises a plurality of pieces of first identification result information;
calling a special engine to perform region identification on the image detection result information set to obtain a second identification result information set; the first recognition result information set comprises a plurality of pieces of second recognition result information;
and integrating the first recognition result information set and the second recognition result information set to obtain an image recognition result information set.
5. The data processing method for bill management according to claim 3, wherein said post-processing the image recognition result information set to obtain bill recognition information comprises:
filtering the image recognition result information set to obtain a filtered image result information set; the filtered image result information set comprises a plurality of pieces of filtered image result information; the filtering processing is used for eliminating text lines with low credibility;
correcting the filtered image result information set to obtain bill identification information; the correction process is used to correct regular errors in the filtered image result information.
6. The data processing method for bill management according to claim 1, wherein after said identifying and classifying said image detection result information set to obtain bill identification information, said method further comprises:
performing type identification on the bill identification information to obtain bill type information;
and executing a preset bill management strategy according to the bill type information.
7. The data processing method for bill management according to claim 6, wherein said executing a preset bill management policy according to said bill type information comprises:
when the bill type information is a non-value-added tax invoice, storing and updating the bill identification information;
when the bill type information is a value-added tax invoice, performing authenticity check on the bill identification information to obtain check result information;
and when the checking result information is yes, performing repeated aggregation checking processing on the bill identification information, and performing storage updating processing.
8. A data processing apparatus for ticket management, the apparatus comprising:
the acquisition module is used for acquiring image information of the bill to be processed; the bill image information to be processed comprises a plurality of bill images to be processed;
the first processing module is used for detecting and processing the image information of the bill to be processed to obtain an image detection result information set; the image detection result information set comprises a plurality of pieces of image detection result information;
the second processing module is used for identifying and classifying the image detection result information set to obtain bill identification information; the bill identification information is used for indicating intelligent management of the bill.
9. A data processing apparatus for ticket management, the apparatus comprising:
a memory storing executable program code;
a processor coupled with the memory;
the processor calls the executable program code stored in the memory to execute the data processing method for ticket management according to any one of claims 1-7.
10. A computer storage medium storing computer instructions for performing, when invoked, the data processing method for ticket management according to any one of claims 1-7.
CN202211109286.8A 2022-09-13 2022-09-13 Data processing method and device for bill management Pending CN115471858A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211109286.8A CN115471858A (en) 2022-09-13 2022-09-13 Data processing method and device for bill management

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211109286.8A CN115471858A (en) 2022-09-13 2022-09-13 Data processing method and device for bill management

Publications (1)

Publication Number Publication Date
CN115471858A true CN115471858A (en) 2022-12-13

Family

ID=84333958

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211109286.8A Pending CN115471858A (en) 2022-09-13 2022-09-13 Data processing method and device for bill management

Country Status (1)

Country Link
CN (1) CN115471858A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115994743A (en) * 2023-03-24 2023-04-21 广东电网有限责任公司 Document abstract specification management method and system
CN117809325A (en) * 2024-02-29 2024-04-02 深圳市中兴新云服务有限公司 Full invoice checking authentication management method and system

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115994743A (en) * 2023-03-24 2023-04-21 广东电网有限责任公司 Document abstract specification management method and system
CN115994743B (en) * 2023-03-24 2023-06-09 广东电网有限责任公司 Document abstract specification management method and system
CN117809325A (en) * 2024-02-29 2024-04-02 深圳市中兴新云服务有限公司 Full invoice checking authentication management method and system
CN117809325B (en) * 2024-02-29 2024-05-17 深圳市中兴新云服务有限公司 Full invoice checking authentication management method and system

Similar Documents

Publication Publication Date Title
CN115471858A (en) Data processing method and device for bill management
CN111079755B (en) Financial reimbursement data processing method, device and system
WO2017214073A1 (en) Document field detection and parsing
US11132576B2 (en) Text recognition method and apparatus, electronic device, and storage medium
CN112395996A (en) Financial bill OCR recognition and image processing method, system and readable storage medium
Caldeira et al. Industrial optical character recognition system in printing quality control of hot-rolled coils identification
CN111444795A (en) Bill data identification method, electronic device, storage medium and device
CN111046879A (en) Certificate image classification method and device, computer equipment and readable storage medium
CN112464845B (en) Bill recognition method, equipment and computer storage medium
CN112541443B (en) Invoice information extraction method, invoice information extraction device, computer equipment and storage medium
CN111858977B (en) Bill information acquisition method, device, computer equipment and storage medium
CN113963147A (en) Key information extraction method and system based on semantic segmentation
CN112949455A (en) Value-added tax invoice identification system and method
CN112330469A (en) Pre-examination method and device for medical insurance claim settlement materials
CN114511866A (en) Data auditing method, device, system, processor and machine-readable storage medium
CN112232336A (en) Certificate identification method, device, equipment and storage medium
CN111881923A (en) Bill element extraction method based on feature matching
CN111462388A (en) Bill inspection method and device, terminal equipment and storage medium
CN116343237A (en) Bill identification method based on deep learning and knowledge graph
CN111104853A (en) Image information input method and device, electronic equipment and storage medium
CN116311299A (en) Method, device and system for identifying structured data of table
CN112149523B (en) Method and device for identifying and extracting pictures based on deep learning and parallel-searching algorithm
CN111931687B (en) Bill identification method and device
CN113807256A (en) Bill data processing method and device, electronic equipment and storage medium
CN114049686A (en) Signature recognition model training method and device and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination