WO2019012570A1 - Document classification system and method, and accounting system and method - Google Patents
Document classification system and method, and accounting system and method Download PDFInfo
- Publication number
- WO2019012570A1 WO2019012570A1 PCT/JP2017/025058 JP2017025058W WO2019012570A1 WO 2019012570 A1 WO2019012570 A1 WO 2019012570A1 JP 2017025058 W JP2017025058 W JP 2017025058W WO 2019012570 A1 WO2019012570 A1 WO 2019012570A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- document
- accounting
- image data
- classification
- documents
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes
Definitions
- the present invention relates to a document classification system for classifying documents into types, a document classification method using the document classification system, an accounting system for accounting based on classified documents, and an accounting system using the accounting system It relates to the processing method.
- books general ledger, journal, cash register, accounts receivable and payable ledger fixed asset ledger, sales and purchase book
- settlement related documents inventory table, balance sheet, profit and loss
- Other documents contracts, receipts, quotations, invoices, orders, contract applications, invoices, inspection documents
- business cards Other documents not necessarily related to accounting, such as company profile and company profile, are stored.
- the present invention relates to a document classification system for classifying documents, image classification method using the document classification system, an accounting processing system for accounting based on classified documents, and an accounting system for image data of documents. It is an issue to provide an accounting method that uses
- Documents stored in the above companies (1) to (4) are not limited to printed paper media, but are often stored as image data optically read by a scanner or the like. With regard to the documents in (1) to (3) above, they are national tax related book documents, and in particular (3) can be scanned and stored electronically. On the other hand, accounting software and accounting services need only input invoices, receipts, accounting personnel and tax accountants and accountants from the above-mentioned vast scanned documents, invoices and receipts It is in a situation where we have to work to take out the book.
- Receipts are often smaller horizontal documents, while other documents such as bills are A4 size vertical documents. While other documents are printed on white paper, the restaurant's handwritten receipts are often entirely colored documents.
- receipts of JR and other railways and subways are colored such as blue and orange, and often have a rectangular shape. Revenue stamps, which are not affixed to other documents, are often affixed to receipts. It is possible to classify documents according to the feature on an image without text information.
- the present invention performs document processing based on a document classification system which classifies documents into the types without necessarily performing OCR processing, a document classification method using the document classification system, and the classified documents.
- An accounting system and an accounting method using the accounting system are provided.
- the document classification method of the present invention is A document classification method for classifying documents for data of documents, It is characterized in that each document is input to the AI function to perform classification.
- the “AI function” refers to a function of making a determination regarding each data based on a large number of data.
- the document classification system of the present invention is A document classification system that classifies documents for document data, and It is characterized in that it has an AI function that classifies each document.
- the document classification system of the present invention is The AI function is characterized by performing classification on each document based on the result of prior learning.
- the document classification system of the present invention is The prior learning is supervised learning based on data given document types,
- the AI function is characterized by performing classification to specify the type of document.
- the document classification system of the present invention is The prior learning is unsupervised learning based on data not given document types,
- the AI function is characterized by performing classification without specifying the type of document.
- the type of document is not specified at the time of learning, it can be classified by similar document group (cluster). Since the user can determine, for example, which category the invoice is based on the classified result, it is sufficiently usable even if the type of the document is not specified by the document classification system.
- the document classification system of the present invention is The AI function is to input and classify a plurality of documents, and perform classification without specifying the type of the document based on the plurality of documents without performing prior learning.
- the document classification system of the present invention is The type of the document is characterized by including a receipt and a bill.
- the document classification system of the present invention is The AI function is characterized by determining whether or not the revenue stamp is attached.
- the document classification system of the present invention is The AI function is characterized by determining the presence or absence of a seal and the shape of the sealed seal.
- the document classification system of the present invention is The data of the document is image data
- the AI function includes at least one of a color of the image data, a shape of the image data, a color of a section of the image data having a color different from the background, and a shape of the image data having a color different from the background It is characterized in that classification is performed on the basis of.
- the document classification system of the present invention is The data of the document contains text data,
- the AI function is characterized by performing classification based on characters and contents described in the character data.
- classification using features included in character data is possible. For example, in the case of a receipt, a date, an amount, a company name, and a product name are often described. In the case of a bill, date, amount, supplier name, company name of sending source, telephone number, fax number, product name, number of products, etc. are often described. With regard to contracts, there is a unique wording used in contracts, which is considered to be a feature.
- the document classification method of the present invention is It is characterized by using a document classification system including a receipt and a bill in the type of the document.
- a document classification method for classifying receipts and invoices is provided.
- the accounting method of the present invention is Separating the documents according to a document classification method using a document classification system including receipts and invoices in the document type; Performing OCR processing on image data whose type of document is a receipt and a bill, and inputting the result into accounting software.
- an accounting method that performs OCR on receipts and invoices, and causes accounting software to process the results based on the results.
- the accounting system of the present invention is A document classification system including receipts and invoices in the types of documents; An OCR processing unit that performs OCR processing on the image data with respect to a document whose type of the document classified by the document classification system is a receipt and a bill; And accounting processing software for performing accounting processing based on the character string output from the OCR processing unit.
- an accounting system that performs OCR on receipts and invoices, and processes the accounting software based on the results.
- the accounting system of the present invention is The accounting software may output an error if the string is nonconforming.
- an accounting system that outputs an error (does not process the document) if the classification of the document is not suitable. It is a robust accounting system even if there are errors in document classification.
- the accounting method of the present invention is Use an accounting system that outputs an error if the string is nonconforming, It is characterized in that supervised learning is performed on the image data for which the accounting method software outputs an error.
- an accounting method which re-learns the document in which the error is output and improves the accuracy of the document classification.
- a document classification system for classifying documents into the types, a document classification method using the document classification system, an accounting system for accounting based on the classified documents, and the accounting system Accounting methods are provided.
- FIG. 1 is a diagram showing the configuration of a document classification system and an accounting system.
- FIG. 2 is a diagram showing a document.
- FIG. 1 is a diagram showing the configuration of a document classification system and an accounting system.
- the document classification system 1 includes an AI function 11 and a learning unit 12, and holds learning data 12a and learning results 12b.
- the document classification system 1 reads the image data 13 and classifies it into invoice image data 13a, receipt image data 13b, and other image data 13c by the AI function 11.
- the bill image data 13a, the receipt image data 13b, and the other image data 13c are an example of classification, and may be classified into other types of image data.
- the AI function 11 classifies the image data 13 with reference to the learning result 12 b.
- the format of the learning result 12 b may be determined based on the specification of the AI function 11. Typically, it is a boundary value for each classification related to the feature value calculated from each image data.
- the learning unit 12 calculates, for example, a feature amount based on the learning data 12 a and outputs a learning result 12 b.
- the learning data 12 a it is assumed that the type of document is added to the image data 13. That is, the learning unit 12 performs supervised learning by knowing the classified types of the image data. Learning other than supervised learning is described in the second embodiment and the following.
- the accounting system 2 includes a document classification system 1, an OCR processor 21, and accounting software 22.
- the OCR processing unit 21 reads characters such as a document name, a summary, and an amount from the image data 13 by OCR processing. Although any image data can be processed, it is assumed that the OCR process is performed on the bill image data 13a and the receipt image data 13b.
- the accounting software 22 prepares financial statements and performs other accounting processes based on the characters read by the OCR processing unit 21.
- the read character is not suitable for accounting (for example, if the bill does not have the name of the bill source), an error is output.
- the learning unit 12 Prior to the operation of the document classification system 1, the learning unit 12 generates a learning result 12 b.
- the learning data 12a past documents (processed correctly) of the user company can be used.
- the types of documents are "bill”, “receipt” and “other”.
- the learning unit 12 inputs the learning data 12 a to which “bill”, “receipt”, and “other” are added, and outputs the learning result 12 b.
- the document is optically read by a scanner or the like to generate image data 13.
- the image data 13 is classified by the AI function 11 into invoice image data 13a, receipt image data 13b, and other image data 13c.
- the AI function 11 classifies the image data 13 with reference to the learning result 12b, and moves the bill image data 13a, the receipt image data 13b, and the other image data 13c to the bill folder, the receipt folder, and the other folder, respectively. It shall be.
- the type of document is indicated by the folder in which the file is stored.
- the OCR processing unit 21 uses OCR processing to process characters such as a document name, a payment, an amount of money, and the like for unprocessed items among the invoice image data 13a and the receipt image data 13b (files in the invoice folder and the receipt folder). read.
- the accounting software 22 prepares financial statements and performs other accounting processes based on the characters read by the OCR processing unit 21.
- the read character is not suitable for accounting (for example, if the bill does not have the name of the bill source), an error is output.
- the image data 13 (invoice image data 13a or receipt image data 13b) for which an error has been output is highly likely to be a classification error of the AI function 11. Therefore, the correct type is added to the image data 13 (manually), and supervised learning is performed as the learning data 12a. Although unsupervised learning is possible, supervised learning is preferable in order to reduce classification errors.
- FIG. 2 is a diagram showing a document. 2 (A) shows a bill, FIG. 2 (B) shows a receipt, and FIG. 2 (C) shows a contract.
- FIGS. 2A to 2C colored portions are shaded, and sections having a color different from the background are hatched.
- Invoices are often printed on A4 paper, as shown in FIG. 2A, and square marks are sealed.
- FIG. 2 (B) the receipt is often a colored, horizontally long sheet, and a square mark is stamped and a revenue stamp is attached.
- FIG. 2C the contract is often printed on A4 paper, and two circles are sealed.
- the presence or absence of the color (white, color other than black) of the image data 13, the shape (values of H, W, H / W) of the image data 13, and the sections of the image data 13 having a color different from the background The color of the image data and the shape (h, w, h / w values) of the sections of the image data that have a color different from the background differ greatly depending on the type of document. Documents can be classified into their types without performing OCR processing using only these feature quantities.
- the size of the document (values of H and W) can not be determined in the case of performing optical reading using a scanner on a background white document.
- this is not a problem because many scanners have a document size detection function.
- the document size detection function is not provided, for example, a black paper larger than the document can be solved by placing it on the back of the document.
- the presence or absence of a color (a color other than white or black) of the image data 13 and the shape of the image data 13 H , W, H / W
- the color of the section of the image data 13 having a color different from the background the shape of the section of the image data having a color different from the background (h, w, h / w values)
- Documents can be classified into the types only by the feature amount.
- documents can be classified to extract bills and receipts, and accounting can be performed using these.
- the learning unit 12 performs unsupervised learning.
- the other points are the same as in the first embodiment, and the detailed description is omitted.
- the learning unit 12 can still classify the similar document groups (clusters).
- classification clustering based on the distribution of features in the document in the learning data 12a is possible.
- Example 1 a clear classification is possible since there are obvious features depending on the type of document.
- the type of document Since the type of document is not given to the learning data 12a, it is only classified, and it is not determined which type is which type (bill, receipt, etc.). However, the user can determine, for example, which category the bill is based on the classified result.
- the type of document can be determined as a classification of the determination result even in the unsupervised learning.
- Accounting processing by the OCR processing unit 21 and the accounting software 22 is possible by the image data 13 according to the classification determined that the user is a bill or a receipt.
- the learning unit 12 can obtain the same effect as that of the first embodiment even by performing unsupervised learning.
- classification can be performed without prior learning. That is, if a large number of image data 13 that can only function as learning data 12a are classified at the same time, classification (clustering) of those image data 13 is possible without prior learning.
- the data of the document includes text data.
- the other points are the same as in the first and second embodiments, and the detailed description will be omitted.
- Document data includes text data.
- OCR processing can be performed to generate character data.
- character data is included therein, from which the character data can be generated.
- Character data contains information that is valid for document classification. For example, in the case of a receipt, a date, an amount, a company name, and a product name are often described. In the case of an invoice, the date, the amount, the supplier name, the company name of the sending source, the telephone number, the fax number, the item name, the number of items, etc. are often described. With regard to contracts, in place of company names, there are often used unique phrases characteristic of contracts such as “A” or “ ⁇ ”.
- the AI function 11 can also classify the character string of the name of the document as a keyword, for example, the receipt includes the character string of “receipt”, and may be classified more clearly based on a large number of data. it can.
- the AI function 11 also corresponds to the case where “estimate sheet number” is described as supplementary information on a bill.
- the AI function 11 can obtain the reliability of each of classification based on character data and classification based on image data (probability of classification).
- the classification based on the character data and the classification based on the image data may be used together to determine the final classification based on the reliability.
- the classification based on character data and the classification based on image data are used in combination, it may be considered that the classification based on character data is only classified by the keyword of the character string of the name of the document.
- the clearness of classification can be improved using character data.
- This embodiment is intended to classify various documents which are not limited to accounting documents.
- the other points are the same as in the first to third embodiments, and the detailed description will be omitted.
- accounting documents such as invoices and receipts have been described as the image data 13 in the first and second embodiments
- documents existing as image data in a company may be considered.
- the image data 13 often has characteristics specific to the type of the document. For example, business cards are characterized by their size. In addition, primary materials such as red and blue are often used as explanatory materials.
- classification by the document classification system 1 is possible as in the first and second embodiments.
- documents other than accounting documents can be classified.
- a document classification system that classifies documents a document classification method that uses the document classification system, an accounting system that performs accounting based on classified documents, and an accounting system that uses the accounting system It is a processing method. It can be used by many companies.
Landscapes
- Business, Economics & Management (AREA)
- Engineering & Computer Science (AREA)
- Technology Law (AREA)
- Physics & Mathematics (AREA)
- Economics (AREA)
- Finance (AREA)
- Marketing (AREA)
- Strategic Management (AREA)
- Accounting & Taxation (AREA)
- Development Economics (AREA)
- General Business, Economics & Management (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Character Input (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Character Discrimination (AREA)
- Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)
- Image Analysis (AREA)
Abstract
Description
書類のデータについて、その書類を分類する書類分類方法であって、
AI機能に各々の書類を入力して分類を行わせることを特徴とする。 The document classification method of the present invention is
A document classification method for classifying documents for data of documents,
It is characterized in that each document is input to the AI function to perform classification.
書類のデータについて、その書類を分類する書類分類システムであって、
各々の書類について分類を行うAI機能を備えることを特徴とする。 The document classification system of the present invention is
A document classification system that classifies documents for document data, and
It is characterized in that it has an AI function that classifies each document.
前記AI機能は各々の書類について事前学習の結果に基づいて分類を行うことを特徴とする。 The document classification system of the present invention is
The AI function is characterized by performing classification on each document based on the result of prior learning.
前記事前学習は、書類の種類を付与されたデータに基づく教師付き学習であり、
前記AI機能は、書類の種類を特定する分類を行うことを特徴とする。 The document classification system of the present invention is
The prior learning is supervised learning based on data given document types,
The AI function is characterized by performing classification to specify the type of document.
前記事前学習は、書類の種類を付与されないデータに基づく教師なし学習であり、
前記AI機能は、書類の種類を特定しない分類を行うことを特徴とする。 The document classification system of the present invention is
The prior learning is unsupervised learning based on data not given document types,
The AI function is characterized by performing classification without specifying the type of document.
前記AI機能は、複数の書類を入力して分類するものであり、事前学習を行わずに該複数の書類に基づいて書類の種類を特定しない分類を行うことを特徴とする。 The document classification system of the present invention is
The AI function is to input and classify a plurality of documents, and perform classification without specifying the type of the document based on the plurality of documents without performing prior learning.
前記書類の種類は、領収書及び請求書を含むことを特徴とする。 The document classification system of the present invention is
The type of the document is characterized by including a receipt and a bill.
前記AI機能は、収入印紙の貼付の有無を判定することを特徴とする。 The document classification system of the present invention is
The AI function is characterized by determining whether or not the revenue stamp is attached.
前記AI機能は、押印の有無及び押印された印の形状を判定することを特徴とする。 The document classification system of the present invention is
The AI function is characterized by determining the presence or absence of a seal and the shape of the sealed seal.
前記書類のデータは画像データであり、
前記AI機能は、前記画像データの色彩、前記画像データの形状、前記画像データのうち背景と異なる色彩を有する区画の色彩及び前記画像データのうち背景と異なる色彩を有する区画の形状のうち少なくとも1に基づいて分類を行うことを特徴とする。 The document classification system of the present invention is
The data of the document is image data,
The AI function includes at least one of a color of the image data, a shape of the image data, a color of a section of the image data having a color different from the background, and a shape of the image data having a color different from the background It is characterized in that classification is performed on the basis of.
前記書類のデータは文字データを含み、
前記AI機能は、前記文字データに記載された文字や内容に基づいて分類を行うことを特徴とする。 The document classification system of the present invention is
The data of the document contains text data,
The AI function is characterized by performing classification based on characters and contents described in the character data.
前記書類の種類に領収書及び請求書を含む書類分類システムを用いることを特徴とする。 The document classification method of the present invention is
It is characterized by using a document classification system including a receipt and a bill in the type of the document.
前記書類の種類に領収書及び請求書を含む書類分類システムを用いる書類分類方法により書類を分離するステップと、
前記書類の種類が領収書及び請求書である画像データについて、OCR処理を行い、その結果を会計処理ソフトウエアに入力するステップとを含むことを特徴とする。 The accounting method of the present invention is
Separating the documents according to a document classification method using a document classification system including receipts and invoices in the document type;
Performing OCR processing on image data whose type of document is a receipt and a bill, and inputting the result into accounting software.
前記書類の種類に領収書及び請求書を含む書類分類システムと、
前記書類分類システムの分類する前記書類の種類が領収書及び請求書である書類に対して前記画像データに対してOCR処理を行うOCR処理部と、
前記OCR処理部の出力する文字列に基づいて会計処理を行う会計処理ソフトウエアとを含むことを特徴とする。 The accounting system of the present invention is
A document classification system including receipts and invoices in the types of documents;
An OCR processing unit that performs OCR processing on the image data with respect to a document whose type of the document classified by the document classification system is a receipt and a bill;
And accounting processing software for performing accounting processing based on the character string output from the OCR processing unit.
前記会計処理ソフトウエアは、前記文字列が不適合である場合にエラーを出力することを特徴とする。 The accounting system of the present invention is
The accounting software may output an error if the string is nonconforming.
文字列が不適合である場合にエラーを出力する会計処理システムを使用し、
前記会計処理方法ソフトウエアがエラーを出力した前記画像データについて、教師付き学習を行うとこと特徴とする。 The accounting method of the present invention is
Use an accounting system that outputs an error if the string is nonconforming,
It is characterized in that supervised learning is performed on the image data for which the accounting method software outputs an error.
図1は、書類分類システム及び会計処理システムの構成を示す図である。書類分類システム1は、AI機能11及び学習部12を備え、学習データ12a、学習結果12bを保持している。 (System configuration)
FIG. 1 is a diagram showing the configuration of a document classification system and an accounting system. The
以下、書類分類システム1及び会計処理システム2を用いる処理の手順について、説明する。 (Processing procedure)
Hereinafter, the procedure of processing using the
以下、書類の種類毎の画像データ13を例示し、AI機能11による明確な分類が可能となることを説明する。 (Type of documents)
Hereinafter,
11 AI機能
12 学習部
12a 学習データ
12b 学習結果
13 画像データ
13a 請求書画像データ
13b 領収書画像データ
13c その他画像データ
2 会計処理システム
21 OCR処理部
22 会計処理ソフトウエア DESCRIPTION OF
Claims (16)
- 書類のデータについて、その書類を分類する書類分類方法であって、
AI機能によって分類を行わせることを特徴とする、書類分類方法。 A document classification method for classifying documents for data of documents,
A document classification method characterized in that classification is performed by an AI function. - 書類のデータについて、その書類を分類する書類分類システムであって、
各々の書類について分類を行うAI機能を備えることを特徴とする、書類分類システム。 A document classification system that classifies documents for document data, and
A document classification system, comprising: an AI function that classifies each document. - 前記AI機能は各々の書類について事前学習の結果に基づいて分類を行うことを特徴とする、請求項2に記載の書類分類システム。 The document classification system according to claim 2, wherein said AI function performs classification based on the result of prior learning about each document.
- 前記事前学習は、書類の種類を付与されたデータに基づく教師付き学習であり、
前記AI機能は、書類の種類を特定する分類を行うことを特徴とする、請求項3に記載の書類分類システム。 The prior learning is supervised learning based on data given document types,
The document classification system according to claim 3, wherein said AI function performs classification which specifies the type of document. - 前記事前学習は、書類の種類を付与されないデータに基づく教師なし学習であり、
前記AI機能は、書類の種類を特定しない分類を行うことを特徴とする、請求項3に記載の書類分類システム。 The prior learning is unsupervised learning based on data not given document types,
The document classification system according to claim 3, wherein the AI function performs classification without specifying the type of the document. - 前記AI機能は、複数の書類を入力して分類するものであり、事前学習を行わずに該複数の書類に基づいて書類の種類を特定しない分類を行うことを特徴とする、請求項2に記載の書類分類システム。 The AI function is to input and classify a plurality of documents, and perform classification without specifying the type of the document based on the plurality of documents without performing prior learning. Document classification system described.
- 前記書類の種類は、領収書及び請求書を含むことを特徴とする、請求項2~6のいずれか1項に記載の書類分類システム。 The document classification system according to any one of claims 2 to 6, wherein the type of the document includes a receipt and a bill.
- 前記AI機能は、収入印紙の貼付の有無を判定することを特徴とする、請求項7に記載の書類分類システム。 8. The document classification system according to claim 7, wherein said AI function determines presence or absence of revenue stamp sticking.
- 前記AI機能は、押印の有無及び押印された印の形状を判定することを特徴とする、請求項7又は8に記載の書類分類システム。 9. The document classification system according to claim 7, wherein said AI function determines the presence or absence of a seal and the shape of the sealed seal.
- 前記書類のデータは画像データであり、
前記AI機能は、前記画像データの色彩、前記画像データの形状、前記画像データの大きさ、前記画像データのうち背景と異なる色彩を有する区画の色彩及び前記画像データのうち背景と異なる色彩を有する区画の形状のうち少なくとも1に基づいて分類を行うことを特徴とする、請求項2~9のいずれか1項に記載の書類分類システム。 The data of the document is image data,
The AI function has a color of the image data, a shape of the image data, a size of the image data, a color of a section of the image data having a color different from the background, and a color of the image data different from the background The document classification system according to any one of claims 2 to 9, wherein classification is performed based on at least one of the shapes of the sections. - 前記書類のデータは文字データを含み、
前記AI機能は、前記文字データに記載された文字や内容に基づいて分類を行うことを特徴とする、請求項2~10のいずれか1項に記載の書類分類システム。 The data of the document contains text data,
The document classification system according to any one of claims 2 to 10, wherein the AI function performs classification based on characters and contents described in the character data. - 請求項7~11のいずれか1項に記載の書類分類システムを用いることを特徴とする、請求項1に記載の書類分類方法。 The document classification method according to claim 1, wherein the document classification system according to any one of claims 7 to 11 is used.
- 請求項12に記載の書類分類方法により書類を分離するステップと、
前記書類の種類が領収書及び請求書である画像データについて、OCR処理を行い、その結果を会計処理ソフトウエアに入力するステップとを含むことを特徴とする、会計処理方法。 Separating the documents according to the document classification method according to claim 12;
And d) performing OCR processing on image data of which the type of the document is a receipt and a bill, and inputting the result into accounting software. - 請求項7~11のいずれか1項に記載の書類分類システムと、
前記書類分類システムの分類する前記書類の種類が領収書及び請求書である書類に対して前記画像データに対してOCR処理を行うOCR処理部と、
前記OCR処理部の出力する文字列に基づいて会計処理を行う会計処理ソフトウエアとを含むことを特徴とする、会計処理システム。 A document classification system according to any one of claims 7 to 11;
An OCR processing unit that performs OCR processing on the image data with respect to a document whose type of the document classified by the document classification system is a receipt and a bill;
An accounting processing system comprising: accounting software that performs accounting based on a character string output from the OCR processing unit. - 前記会計処理ソフトウエアは、前記文字列が不適合である場合にエラーを出力することを特徴とする、請求項14に記載の会計処理システム。 The accounting system according to claim 14, wherein the accounting software outputs an error when the character string is nonconforming.
- 請求項15に記載の会計処理システムを使用し、
前記会計処理方法ソフトウエアがエラーを出力した前記画像データについて、教師付き学習を行うとこと特徴とする、会計処理方法。 Using the accounting system according to claim 15,
A method of accounting processing, wherein supervised learning is performed on the image data for which the accounting method software outputs an error.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/JP2017/025058 WO2019012570A1 (en) | 2017-07-08 | 2017-07-08 | Document classification system and method, and accounting system and method |
JP2017536900A JP6504514B1 (en) | 2017-07-08 | 2017-07-08 | Document classification system and method and accounting system and method. |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/JP2017/025058 WO2019012570A1 (en) | 2017-07-08 | 2017-07-08 | Document classification system and method, and accounting system and method |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2019012570A1 true WO2019012570A1 (en) | 2019-01-17 |
Family
ID=65001895
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2017/025058 WO2019012570A1 (en) | 2017-07-08 | 2017-07-08 | Document classification system and method, and accounting system and method |
Country Status (2)
Country | Link |
---|---|
JP (1) | JP6504514B1 (en) |
WO (1) | WO2019012570A1 (en) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2020187426A (en) * | 2019-05-10 | 2020-11-19 | 京セラドキュメントソリューションズ株式会社 | Image processing device and image processing method |
CN112101367A (en) * | 2020-09-15 | 2020-12-18 | 杭州睿琪软件有限公司 | Text recognition method, image recognition and classification method and document recognition processing method |
JP2021005226A (en) * | 2019-06-26 | 2021-01-14 | 京セラドキュメントソリューションズ株式会社 | Document classification system and document classification program |
JP6856916B1 (en) * | 2020-01-08 | 2021-04-14 | ジーニアルテクノロジー,インク. | Information processing equipment, information processing methods and information processing programs |
JP2021072110A (en) * | 2020-04-30 | 2021-05-06 | 株式会社日本デジタル研究所 | Voucher determination device, accounting processing device, voucher determination program, voucher determination system, and voucher determination method |
JP2021072088A (en) * | 2020-04-30 | 2021-05-06 | 株式会社日本デジタル研究所 | Voucher determination device, accounting processing device, voucher determination program, voucher determination system, and voucher determination method |
WO2021140682A1 (en) * | 2020-01-08 | 2021-07-15 | ジーニアルテクノロジー,インク. | Information processing device, information processing method, and information processing program |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0855157A (en) * | 1994-08-09 | 1996-02-27 | Supetsuku:Kk | Taxation document issuing device |
JP2007088609A (en) * | 2005-09-20 | 2007-04-05 | Fuji Xerox Co Ltd | Electronic signature providing apparatus, method, and program |
US9053350B1 (en) * | 2009-01-21 | 2015-06-09 | Google Inc. | Efficient identification and correction of optical character recognition errors through learning in a multi-engine environment |
JP2015170045A (en) * | 2014-03-05 | 2015-09-28 | グローリー株式会社 | Sales management system and method |
JP2016071412A (en) * | 2014-09-26 | 2016-05-09 | キヤノン株式会社 | Image classification apparatus, image classification system, image classification method, and program |
JP2016173822A (en) * | 2015-03-17 | 2016-09-29 | 株式会社リコー | Information processing apparatus, information processing system and program |
JP2017069599A (en) * | 2015-09-28 | 2017-04-06 | 富士ゼロックス株式会社 | Image processing apparatus and program |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH04127287A (en) * | 1990-09-18 | 1992-04-28 | Fujitsu Ltd | Optical character reader |
US8408544B2 (en) * | 2011-06-08 | 2013-04-02 | Eastman Kodak Company | Sorting by controlling scanned document velocity |
JP2016085538A (en) * | 2014-10-23 | 2016-05-19 | キヤノン株式会社 | Information processing equipment, control method of information processing equipment, and program |
-
2017
- 2017-07-08 JP JP2017536900A patent/JP6504514B1/en active Active
- 2017-07-08 WO PCT/JP2017/025058 patent/WO2019012570A1/en active Application Filing
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0855157A (en) * | 1994-08-09 | 1996-02-27 | Supetsuku:Kk | Taxation document issuing device |
JP2007088609A (en) * | 2005-09-20 | 2007-04-05 | Fuji Xerox Co Ltd | Electronic signature providing apparatus, method, and program |
US9053350B1 (en) * | 2009-01-21 | 2015-06-09 | Google Inc. | Efficient identification and correction of optical character recognition errors through learning in a multi-engine environment |
JP2015170045A (en) * | 2014-03-05 | 2015-09-28 | グローリー株式会社 | Sales management system and method |
JP2016071412A (en) * | 2014-09-26 | 2016-05-09 | キヤノン株式会社 | Image classification apparatus, image classification system, image classification method, and program |
JP2016173822A (en) * | 2015-03-17 | 2016-09-29 | 株式会社リコー | Information processing apparatus, information processing system and program |
JP2017069599A (en) * | 2015-09-28 | 2017-04-06 | 富士ゼロックス株式会社 | Image processing apparatus and program |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2020187426A (en) * | 2019-05-10 | 2020-11-19 | 京セラドキュメントソリューションズ株式会社 | Image processing device and image processing method |
JP7419668B2 (en) | 2019-05-10 | 2024-01-23 | 京セラドキュメントソリューションズ株式会社 | Image processing device and image processing method |
JP2021005226A (en) * | 2019-06-26 | 2021-01-14 | 京セラドキュメントソリューションズ株式会社 | Document classification system and document classification program |
JP7364998B2 (en) | 2019-06-26 | 2023-10-19 | 京セラドキュメントソリューションズ株式会社 | Document classification system and document classification program |
JP6856916B1 (en) * | 2020-01-08 | 2021-04-14 | ジーニアルテクノロジー,インク. | Information processing equipment, information processing methods and information processing programs |
WO2021140682A1 (en) * | 2020-01-08 | 2021-07-15 | ジーニアルテクノロジー,インク. | Information processing device, information processing method, and information processing program |
US11315351B2 (en) | 2020-01-08 | 2022-04-26 | Kabushiki Kaisha Genial Technology | Information processing device, information processing method, and information processing program |
JP2021072110A (en) * | 2020-04-30 | 2021-05-06 | 株式会社日本デジタル研究所 | Voucher determination device, accounting processing device, voucher determination program, voucher determination system, and voucher determination method |
JP2021072088A (en) * | 2020-04-30 | 2021-05-06 | 株式会社日本デジタル研究所 | Voucher determination device, accounting processing device, voucher determination program, voucher determination system, and voucher determination method |
CN112101367A (en) * | 2020-09-15 | 2020-12-18 | 杭州睿琪软件有限公司 | Text recognition method, image recognition and classification method and document recognition processing method |
WO2022057707A1 (en) * | 2020-09-15 | 2022-03-24 | 杭州睿琪软件有限公司 | Text recognition method, image recognition classification method, and document recognition processing method |
Also Published As
Publication number | Publication date |
---|---|
JPWO2019012570A1 (en) | 2019-07-11 |
JP6504514B1 (en) | 2019-04-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6504514B1 (en) | Document classification system and method and accounting system and method. | |
US9552516B2 (en) | Document information extraction using geometric models | |
US8167196B2 (en) | Expanded mass data sets for electronic check processing | |
US11455784B2 (en) | System and method for classifying images of an evidence | |
JP5202677B2 (en) | Receipt data recognition device and program thereof | |
US9384393B2 (en) | Check data lift for error detection | |
US11501344B2 (en) | Partial perceptual image hashing for invoice deconstruction | |
JP2012226402A (en) | Receipt data recognition device and program therefor | |
US20140268250A1 (en) | Systems and methods for receipt-based mobile image capture | |
KR101942468B1 (en) | Structured data and unstructured data extraction system and method | |
US20180053045A1 (en) | Automated Processing of Receipts and Invoices | |
WO2020012539A1 (en) | Journalization element analysis device, accounting system, journalization element analysis method, and journalization element analysis program | |
TWI716761B (en) | Intelligent accounting system and identification method for accounting documents | |
US11030450B2 (en) | System and method for determining originality of computer-generated images | |
CN111428725A (en) | Data structuring processing method and device and electronic equipment | |
TWM575887U (en) | Intelligent accounting system | |
US20160148283A1 (en) | System and Method for Accessing Comic Book Grading Notes via a Quick Scan Code | |
JP6507459B2 (en) | Accounting system | |
JP2011227787A (en) | Accounting transaction information reading device | |
US10417488B2 (en) | Re-application of filters for processing receipts and invoices | |
JP7429365B1 (en) | Data processing device, data processing method and program | |
JP6835382B1 (en) | Electronic data judgment system, electronic data judgment device, electronic data judgment method, electronic data judgment program | |
WO2022054136A1 (en) | Data processing device, data processing method, and program | |
EP3557480B1 (en) | Deriving data from documents and from transmission protocols using machine-learning techniques | |
US20230410214A1 (en) | System and method for supervising expense or income |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
ENP | Entry into the national phase |
Ref document number: 2017536900 Country of ref document: JP Kind code of ref document: A |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 17917573 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 27/03/2020) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 17917573 Country of ref document: EP Kind code of ref document: A1 |