WO2022097189A1 - Dispositif de traitement de données, procédé de traitement de données et programme - Google Patents

Dispositif de traitement de données, procédé de traitement de données et programme Download PDF

Info

Publication number
WO2022097189A1
WO2022097189A1 PCT/JP2020/041162 JP2020041162W WO2022097189A1 WO 2022097189 A1 WO2022097189 A1 WO 2022097189A1 JP 2020041162 W JP2020041162 W JP 2020041162W WO 2022097189 A1 WO2022097189 A1 WO 2022097189A1
Authority
WO
WIPO (PCT)
Prior art keywords
character string
character
master data
recognition
similar
Prior art date
Application number
PCT/JP2020/041162
Other languages
English (en)
Japanese (ja)
Inventor
鴻鵬 葛
顕 松田
貴亮 佐藤
智 小俣
啓太郎 森
Original Assignee
ファーストアカウンティング株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ファーストアカウンティング株式会社 filed Critical ファーストアカウンティング株式会社
Priority to PCT/JP2020/041162 priority Critical patent/WO2022097189A1/fr
Priority to JP2020561940A priority patent/JP6870159B1/ja
Priority to JP2021068170A priority patent/JP2022075467A/ja
Publication of WO2022097189A1 publication Critical patent/WO2022097189A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/12Detection or correction of errors, e.g. by rescanning the pattern
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/26Techniques for post-processing, e.g. correcting the recognition result

Definitions

  • the present invention relates to a data processing apparatus, a data processing method and a program.
  • the error is corrected by comparing the extracted characters with the contents of a predetermined database.
  • the character string described in the voucher is not registered in the database and does not match the contents of the database even though the character recognition result is correct.
  • you correct it so that it matches the contents of other character strings contained in the database, a character string different from the character string described in the voucher will be displayed even though the character is correctly recognized. There was a problem that it was output.
  • the present invention has been made in view of these points, and an object thereof is to improve the probability that the character string included in the image data of the voucher is correctly output.
  • the data processing device has a data acquisition unit for acquiring voucher image data and a character recognition unit for outputting a plurality of recognition character strings by recognizing character strings included in the voucher image data. And, the first character string among the plurality of recognition character strings is not included in the master data associated with the plurality of registered character strings, and is different from the first character string among the plurality of recognition character strings.
  • a similar character string most similar to the first character string among one or more registered character strings associated with the second character string in the master data. It has a correction unit for correcting the first character string, and an output unit for outputting the corrected first character string after the correction of the first character string and the second character string in association with each other.
  • the correction unit corrects the first character string to the similar character string on condition that two or more of the second character strings among the plurality of recognition character strings are included in the master data. May be good.
  • the correction unit determines the similar character string before correcting the first character string to the similar character string.
  • a plurality of candidates may be output to the output unit, and the first character string may be corrected to the similar character string corresponding to the candidate selected from the plurality of candidates.
  • the master data includes the company name and account information as the plurality of registered character strings
  • the correction unit includes the recognition company name which is the first character string recognized by the character recognition unit in the master data. If the master data includes the recognition account information which is the second character string recognized by the character recognition unit, the recognition is given to the company name associated with the recognition account information in the master data. You may correct the company name.
  • the master data includes a company name and an item name as the plurality of registered character strings
  • the correction unit includes a recognition company name which is the first character string recognized by the character recognition unit in the master data.
  • the master data includes the recognized item name which is the second character string recognized by the character recognition unit, a plurality of company names associated with the recognized item name in the master data.
  • the recognized company name may be corrected to the company name most similar to the recognized company name.
  • the master data includes the item name and the product unit price as the plurality of registered character strings
  • the correction unit includes the recognized item name which is the first character string recognized by the character recognition unit in the master data.
  • the recognized product unit price which is the second character string recognized by the character recognition unit
  • a plurality of item names associated with the recognized product unit price in the master data may be corrected to the item name most similar to the recognized item name.
  • the correction unit executes a search on the Internet using at least one of the first character string or the similar character string as a keyword, and performs a search on the Internet, and the similar character. If it is determined that the column is more likely to be correct than the first character string, the first character string may be corrected to the similar character string.
  • the correction unit executes a search on the Internet using at least one of the first character string or the similar character string as a keyword, and performs the first search.
  • the similar character string in the master data may be corrected to the first character string without correcting the first character string. ..
  • the output unit can distinguish between the character string determined by the correction unit to be necessary for correction and the character string determined by the correction unit that correction is not necessary among the plurality of recognition character strings. You may output it.
  • the output unit When the master data does not include a character string having a similarity with the first character string equal to or higher than the threshold value, the output unit outputs the first character string as a character string to be registered in the master data. You may.
  • the output unit is said to have many character strings not included in the master data when the ratio of the recognition character strings not included in the master data is equal to or more than a predetermined value among the plurality of recognition character strings.
  • the data processing method of the second aspect of the present invention is a step of acquiring voucher image data executed by a computer and a step of outputting a plurality of recognition character strings by recognizing a character string included in the voucher image data.
  • the first character string among the plurality of recognition character strings is not included in the master data associated with the plurality of registered character strings, and is different from the first character string among the plurality of recognition character strings.
  • a similar character string most similar to the first character string among one or more registered character strings associated with the second character string in the master data. It has a step of correcting the first character string and a step of associating the corrected first character string with the corrected first character string after the correction of the first character string and outputting the second character string.
  • the program of the third aspect of the present invention includes a step of acquiring voucher image data, a step of outputting a plurality of recognition character strings by recognizing a character string included in the voucher image data, and the plurality of steps.
  • the first character string among the recognition character strings of is not included in the master data associated with a plurality of registered character strings, and the second character string different from the first character string among the plurality of recognition character strings is When included in the master data, the first of the registered character strings associated with the second character string in the master data is the most similar to the first character string.
  • the step of correcting the character string and the step of associating the corrected first character string with the corrected first character string after the correction of the first character string and outputting the second character string are executed.
  • invoice image data which is a type of voucher image data.
  • invoice data This is an example of invoice data.
  • FIG. 1 is a diagram for explaining an outline of the data processing device 1.
  • the data processing device 1 is a device for specifying a character string included in the voucher image data by acquiring voucher image data and performing character recognition processing on the voucher image data, for example, a computer.
  • the data processing device 1 creates voucher data including the specified character string, and outputs the created voucher data to the external device 3.
  • Voucher image data is image data of voucher documents such as invoices, purchase orders, invoices, quotations or inspection slips. If the voucher image data is the image data of the invoice, the voucher image data is an image of the voucher containing the company name, contact information, the name (item name) of the product or service to be billed, the billing amount, the tax amount, etc. It is the converted data.
  • the voucher image data is, for example, image data generated by the image reading device 2 (for example, a scanner or a digital camera) reading the voucher, but may be image data or text data created by a computer.
  • the external device 3 is, for example, a computer used by a user (for example, an accounting person) who uses the data processing device 1 in a company that has received a voucher, or an enterprise resource planning (ERP).
  • the data processing device 1 displays, for example, the specified character string on the user's computer, or transmits the specified character string to the core system.
  • the core system is a system that stores various data used for accounting, for example.
  • FIG. 2 is an example of invoice image data, which is a kind of voucher image data.
  • FIG. 3 is an example of invoice data which is voucher data including a character string specified based on the invoice image data shown in FIG. In the invoice data shown in FIG. 3, the character string included in the invoice image data shown in FIG. 2 is correctly described.
  • the data processing device 1 it is not always possible for the data processing device 1 to correctly identify all the character strings included in the invoice image data by performing character recognition.
  • the data processing device 1 erroneously recognizes a character string
  • the invoice data contains an erroneous character string
  • invoice data including the erroneous character string is created, and the erroneous character string is registered in the external device 3. Will be done.
  • the data processing device 1 has the company name, branch name, telephone number, account information, contact information, department in charge, person in charge name, item name, and product unit price, which are character strings described in the voucher.
  • the master data associated with a plurality of character strings is referred to, and it is determined whether or not the character-recognized character string is correct based on the plurality of character strings associated with the master data.
  • the misrecognized character string is used as the above-mentioned other character in the master data. Correct to the most similar string among the multiple strings associated with the column. Since the data processing device 1 is configured in this way, the probability that the data processing device 1 outputs a correct character string is increased.
  • the voucher is an invoice
  • the voucher may be an order form, a delivery note, a quotation, an acceptance slip, etc. other than the invoice.
  • FIG. 4 is a diagram showing the configuration of the data processing device 1.
  • the data processing device 1 includes a communication unit 11, a storage unit 12, and a control unit 13.
  • the control unit 13 includes a data acquisition unit 131, a character recognition unit 132, a correction unit 133, and an output unit 134.
  • the communication unit 11 includes a communication interface for transmitting and receiving various data to and from the image reading device 2 or the external device 3 via a network such as the Internet or an intranet.
  • the communication unit 11 inputs the invoice image data received from the image reading device 2 to the data acquisition unit 131. Further, the communication unit 11 outputs the invoice data output by the output unit 134 to the external device 3.
  • the storage unit 12 has a storage medium such as a ROM (ReadOnlyMemory), a RAM (RandomAccessMemory), and an SSD (SolidStateDrive).
  • the storage unit 12 stores a program executed by the control unit 13.
  • the storage unit 12 has a company name (registered company name), a branch name, a telephone number, account information (registered account information), contact information, a department in charge, a person in charge, and an item name, which are character strings written on the voucher.
  • the master data in which a plurality of character strings among the product unit prices are associated as a plurality of registered character strings is stored.
  • the storage unit 12 stores, for example, company master data 121 including data on the company and product master data 122 including data on the product. These master data are used when the user creates a voucher, but are also used by the control unit 13 for character recognition and correction of the specified character string.
  • the company master data 121 and the product master data 122 may be stored in a storage medium external to the storage unit 12.
  • FIG. 5 is a diagram showing an example of company master data 121.
  • a company name, a branch name, a company address, a telephone number, a department in charge, a person in charge, and a transfer account number are associated with each other.
  • FIG. 6 is a diagram showing an example of product master data 122.
  • the product name, the price, the manufacturer name, and the business partner name are associated with each other.
  • the product name includes the product name and the model name or model number, but may include only the product name or may include only the model name or model number.
  • the control unit 13 shown in FIG. 4 has, for example, a CPU (Central Processing Unit).
  • the control unit 13 functions as a data acquisition unit 131, a character recognition unit 132, a correction unit 133, and an output unit 134 by executing the program stored in the storage unit 12.
  • the data acquisition unit 131 acquires voucher image data via the communication unit 11.
  • the data acquisition unit 131 acquires, for example, the voucher image data output from the image reading device 2 that has read the voucher, and inputs the acquired voucher image data to the character recognition unit 132.
  • the character recognition unit 132 outputs a plurality of recognition character strings by recognizing the character strings included in the voucher image data.
  • the character recognition unit 132 recognizes the characters included in the voucher image data by executing OCR (Optical Character Recognition) processing executed by, for example, AI (Artificial Intelligence), and sets a plurality of consecutive characters as character strings. recognize.
  • OCR Optical Character Recognition
  • AI Artificial Intelligence
  • the character recognition unit 132 inputs a plurality of recognized character strings to the correction unit 133.
  • the first character string among the plurality of recognition character strings recognized by the character recognition unit 132 in one voucher image data is not included in the master data associated with the plurality of registered character strings, and a plurality of them.
  • the master data contains a second character string different from the first character string among the recognition character strings of, the first character of one or more registered character strings associated with the second character string in the master data. Correct the first string to a similar string that most closely resembles the column.
  • the correction unit 133 does not correct the first character string, and the first character string and the second character string are included in the master data. Notify the output unit 134 that it has not been done.
  • the master data may be company master data 121 or product master data 122.
  • the correction unit 133 may use the company master data 121 and the product master data 122 in combination.
  • the correction unit 133 inputs the corrected first character string to the output unit 134.
  • the output unit 134 outputs the corrected first character string and the second character string in association with each other.
  • the output unit 134 outputs voucher data in a state in which a plurality of character strings corresponding to one voucher image data are associated, for example, like the invoice data shown in FIG.
  • the output unit 134 outputs the voucher data to the external device 3 via the communication unit 11, or outputs the voucher data to the display, for example.
  • the output unit 134 outputs the first character string in association with the similar character string before the correction unit 133 corrects the first character string to a similar character string, and the correction unit 133 outputs the first character string by the output unit 134. After associating the column with the similar character string and outputting it, the first character string may be corrected to the similar character string on condition that the instruction for permitting the correction is received. Further, when the output unit 134 receives a notification from the correction unit 133 that the first character string and the second character string are not included in the master data, the first character string and the second character string are included in the master data. You may output the information indicating that it has not been done.
  • the details of the correction process by the correction unit 133 will be described.
  • the recognition company name which is the first character string recognized by the character recognition unit 132 is not included in the master data, and the recognition account information which is the second character string recognized by the character recognition unit is included. If it is included in the master data, the recognized company name is corrected to the company name associated with the recognized account information in the master data.
  • a plurality of recognition character strings are "Taguchi Shoji” and "AAA Bank Tokyo Branch Ordinary 12233334".
  • the company master data 121 shown in FIG. 5 does not include the first character string.
  • the second character string "AAA Bank Tokyo Branch Ordinary 12233334" is included in the company master data 121 shown in FIG.
  • the correction unit 133 may use a plurality of character strings such as "Tanaka Shoji”, “Tokyo Branch”, and "001-12, Chiyoda-ku, Tokyo” associated with "AAA Bank Tokyo Branch Ordinary 12233334".
  • "Tanaka Shoji” is judged to be the most similar to "Taguchi Shoji”
  • "Tanaka Shoji” in the first character string is corrected to "Tanaka Shoji”.
  • the correction unit 133 identifies the most similar similar character string, for example, based on the similarity calculated based on the number of matching characters among a plurality of characters included in each of the plurality of character strings.
  • the correction unit 133 specifies the character string having the largest number of character strings that match the first character string as a similar character string.
  • the correction unit 133 may specify the character string registered as the company name in the master data as a similar character string.
  • a plurality of recognition character strings are "ink (AK-123)" and " ⁇ 1,800".
  • the first character string is " ⁇ 1,800”
  • the product master data 122 shown in FIG. 6 does not include the first character string.
  • the second character string "ink (AK-123)” is included in the product master data 122 shown in FIG.
  • the correction unit 133 uses the character strings of " ⁇ 1,000", “ABC”, “Tanaka Shoji Co., Ltd.”, etc. associated with “ink (AK-123)” to be “ ⁇ 1,000”. It is determined that "1,000” is the most similar to " ⁇ 1,800", and " ⁇ 1,800" in the first character string is corrected to " ⁇ 1,000".
  • the correction unit 133 does not include the recognition item name, which is the first character string recognized by the character recognition unit 132, in the master data, and the recognition product unit price, which is the second character string recognized by the character recognition unit 132, is the master data.
  • the recognized item name may be corrected to the item name most similar to the recognized item name among the plurality of item names associated with the recognized product unit price in the master data. For example, when the character recognition unit 132 recognizes the ink (AK-723) as the first character string and the character recognition unit 132 recognizes " ⁇ 1,000" as the second character string, the correction unit 133 is shown in FIG. In the indicated product master data 122, the "ink (AK-0123)" and the "copy paper (A1)" associated with " ⁇ 1,000" are specified.
  • the correction unit 133 puts the first character string “ink (AK-723)" in “ink (AK-0123)” and “copy paper (A1)” rather than “copy paper (A1)”.
  • the first character string is corrected to a similar "ink (AK-0123)”.
  • the recognition company name which is the first character string recognized by the character recognition unit 132
  • the recognition item name which is the second character string recognized by the character recognition unit
  • the recognized company name may be corrected to the company name most similar to the recognized company name among the plurality of company names associated with the recognized item name in the master data. For example, when the character recognition unit 132 recognizes "Taguchi Shoji Co., Ltd.” as the first character string and the character recognition unit 132 recognizes "ink (AK-0123)" as the second character string, the correction unit 133 is shown in the figure.
  • the correction unit 133 is a "Tanaka Shoji Co., Ltd.” that is more similar to the first character string "Taguchi Shoji Co., Ltd.” than "MM Electric Co., Ltd.” among "Tanaka Shoji Co., Ltd.” and "MM Electric Co., Ltd.” Correct the first character string to "company”.
  • the correction unit 133 corrects the first character string to the character string associated with the second character string on the premise that the second character string is correct, the first character is added to the wrong character string.
  • the columns may be corrected. Therefore, the correction unit 133 corrects the first character string to a similar character string on condition that the correction unit 133 includes two or more second character strings among the plurality of recognition character strings in the master data. You may.
  • the correction unit 133 is associated with, for example, "AAA Bank Tokyo Branch Ordinary 12233334" and "03-1234-5678" by being included in the master data. Correct "Taguchi Shoji” in the first character string to "Tanaka Shoji". When the probability that two or more character strings are erroneously recognized is sufficiently low, the correction unit 133 operates in this way to reduce the probability that the first character string is corrected to the wrong character string. Therefore, it is possible to improve the probability that the character string described in the voucher is output correctly.
  • the correction unit 133 selects a plurality of candidates for the similar character string before correcting the first character string to the similar character string.
  • the first character string may be corrected to a similar character string corresponding to a candidate selected from a plurality of candidates by causing the output unit 134 to output. For example, if the company master data 121 includes the telephone number "03-1234-5678" and the fax number "03-1234-5679" and the first character string is "03-1234-5670", "03-" Both "1234-5678" and "03-1234-5679" are similar to the first string.
  • the correction unit 133 uses "03-1234-5678" and "03-1234-5679" as correction candidates. , Display on the display of the computer used by the user.
  • the correction unit 133 corrects the first character string to the character string selected by the user. By operating the correction unit 133 in this way, even when a plurality of similar character strings are registered in the master data, the probability that the first character string is corrected to the wrong character string is reduced. be able to.
  • the correction unit 133 executes a search on the Internet using at least one of the first character string or the similar character string as a keyword before correcting the first character string to the similar character string, and the similar character string is the first. If it is determined that the probability of being correct is higher than that of one character string, the first character string may be corrected to a similar character string.
  • the first character string recognized by the character recognition unit 132 is " ⁇ 2-15, Chiyoda-ku, Tokyo"
  • the correction unit 133 is "001, Chiyoda-ku, Tokyo” in the company master data shown in FIG.
  • the correction unit 133 searches on the Internet using the company name or telephone number associated with the similar character string as a keyword. The correction unit 133 does not correct the first character string when the address described in the searched and displayed website matches the first character string, and the address described in the website is similar.
  • the correction unit 133 By operating the correction unit 133 in this way, it is possible to prevent erroneous correction when the character string registered in the master data is not the latest.
  • the correction unit 133 executes a search on the Internet using at least one of the first character string or the similar character string as a keyword before correcting the first character string to a similar character string, and the first character string is used.
  • the similar character string in the master data may be corrected to the first character string without correcting the first character string.
  • the correction unit 133 has a high probability that the address described in the website searched and displayed matches the first character string, and the first character string is more correct than the similar character string.
  • the correction unit 133 By operating the correction unit 133 in this way, the master data is updated. As a result, in the future, the accuracy of correction when the character recognition unit 132 misrecognizes and the correction unit 133 corrects the character string is improved.
  • the output unit 134 outputs the corrected character string to the core system in order to improve the probability that an appropriate character string is registered in the core system based on the character string described in the voucher. It may be possible for the user to confirm. As an example, the output unit 134 can distinguish between a character string determined by the correction unit 133 that correction is necessary and a character string determined by the correction unit 133 that correction is not necessary among the plurality of recognition character strings. Output in the mode.
  • FIG. 7 is a diagram showing an example of a voucher data display screen output by the output unit 134.
  • "Taguchi Shoji” and “ink (AK-723)” are not included in the company master data or the product master data, so that they are different from other character strings (italicized characters in a thick frame). ) Is displayed.
  • the output unit 134 By outputting such data by the output unit 134, for example, the user can easily grasp which character string needs to be corrected.
  • the output unit 134 may display a character string that is a candidate for correction when a predetermined operation is performed on the screen of FIG. 7A.
  • the predetermined operation is, for example, an operation (for example, a click operation or a touch operation) in which the user selects a character string that needs to be corrected.
  • FIG. 7B is a diagram showing an example in which a character string that is a candidate for correction is displayed.
  • the correction unit 133 corrects the character string displayed in FIG. 7A to the displayed candidate character string. As a result, the voucher data is corrected to the state shown in FIG.
  • the output unit 134 determines that the ratio of the recognition character strings not included in the master data among the plurality of recognition character strings is equal to or more than a predetermined value. Warning information indicating that there are many character strings not included in the master data may be output.
  • the output unit 134 may output warning information together with a plurality of recognition character strings. As shown in FIG. 7, the output unit 134 may output warning information together with a plurality of recognition character strings in a state where the character string requiring correction can be identified.
  • the output unit 134 may display a screen for inputting a process to be executed by the user together with the warning information or as the warning information.
  • the output unit 134 displays, for example, a screen for performing an operation for associating a plurality of recognition character strings and registering them in master data.
  • the output unit 134 registers a plurality of recognition character strings in the master data when the operation for registration is performed.
  • the output unit 134 sets the first character string as a character string to be registered in the master data. You may output it.
  • the output unit 134 for example, when the first character string is "Sato Shoji" and the master data does not include "Sato Shoji", such as "Do you want to register Sato Shoji in the master data?" , Display a message containing the first character string on the user's computer.
  • the output unit 134 registers in the master data among a plurality of other character strings included in the voucher image data including the character string determined to be registered (for example, the above-mentioned "Sato Shoji").
  • a plurality of character strings corresponding to the items to be registered may be displayed as character strings of candidates for registration.
  • the output unit 134 displays the operation image that accepts the operation for registration together with the character string of the registration target candidate, and when the operation image is operated, the first character string and the character string of the registration target candidate are displayed. It may be registered in the master data.
  • FIG. 8 is a flowchart showing a processing flow of the data processing device 1. The flowchart shown in FIG. 8 starts from the time when the image reading device 2 outputs the voucher image data.
  • the character recognition unit 132 executes an OCR process for recognizing the characters included in the voucher image data (S12).
  • the character recognition unit 132 recognizes a plurality of character strings based on the recognized characters.
  • the correction unit 133 first selects one recognition character string from the plurality of recognition character strings in order to determine whether or not the plurality of character strings recognized by the character recognition unit 132 are correctly recognized (S13). The correction unit 133 determines whether or not one of the selected recognition character strings matches any of the plurality of character strings included in the master data (S14). When the correction unit 133 determines that one selected recognition character string matches any of a plurality of character strings included in the master data (YES in S14), the correction unit 133 selects another recognition character string (S15). ), The process of S14 is executed again.
  • the correction unit 133 determines that the first character string, which is one selected recognition character string, does not match all of the plurality of character strings included in the master data (NO in S14)
  • the character recognition unit 132 It is determined whether or not the other recognition character string among the plurality of recognition character strings recognized by the user matches any of the plurality of character strings included in the master data (S16).
  • the correction unit 133 determines that the other recognition character string matches any of the plurality of character strings included in the master data (YES in S16), the master data that matches the other recognition character string. Among the plurality of character strings associated with the character string in the above, the recognized first character string is corrected to the character string most similar to the first character string (S17). When the correction unit 133 determines that the other recognition character string does not match any of the plurality of character strings included in the master data (NO in S16), the correction unit 133 further refers to the other recognition character string in S16. Executes the processing of.
  • the correction unit 133 determines whether or not the processing from S14 to S17 has been completed for all the recognition character strings, and if not, returns to S14. When the processing for all the recognized character strings is completed (YES in S18), the correction unit 133 creates voucher data composed of the corrected character strings, and the output unit 134 outputs the voucher data (S19).
  • the data processing device 1 includes the company name, branch name, telephone number, account information, contact information, department in charge, person in charge name, item name, and product unit price, which are character strings written on the voucher.
  • the correction unit 133 does not include the first character string in the master data among the plurality of recognition character strings specified by recognizing the character strings included in the voucher image data, and the correction unit 133 has a plurality of recognition character strings.
  • the master data contains a second character string different from the first character string, it is most similar to the first character string among one or more registered character strings associated with the second character string in the master data.
  • the first character string is corrected to the similar character string.
  • the output unit 134 can output a correct character string even if an error occurs in character recognition, so that the character string included in the image data of the voucher is correct. Improve the probability of being output. As a result, it is possible to improve the work efficiency and work quality of the user who performs the work using the data described in the voucher.
  • the data processing device 1 has the company master data 121 and the product master. Only one of the data 122 may be used. Further, the data processing device 1 does not have to be configured by one computer, and a plurality of computers may operate in cooperation with each other, or the computer and the storage medium in which the master data is stored may be physically separated. You may be doing it.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Character Discrimination (AREA)

Abstract

Un dispositif de traitement de données 1 comprend : une unité de stockage 12 qui stocke des données maîtresses dans lesquelles une pluralité de chaînes de caractères à écrire sur des bons sont associées sous la forme d'une pluralité de chaînes de caractères enregistrées ; une unité de reconnaissance de caractères 132 qui reconnaît des chaînes de caractères incluses dans des données d'image de bon et délivre ainsi une pluralité de chaînes de caractères reconnues ; et une unité de correction 133 qui, si une première chaîne de caractères de la pluralité de chaînes de caractères reconnues n'est pas incluse dans les données maîtresses, mais une seconde chaîne de caractères de la pluralité de chaînes de caractères reconnues qui est différente de la première chaîne de caractères est incluse dans les données maîtresses, corrige la première chaîne de caractères en une chaîne de caractères similaire qui est la plus similaire à la première chaîne de caractères parmi la ou les chaînes de caractères enregistrées qui sont associées, dans les données maîtresses, avec la seconde chaîne de caractères.
PCT/JP2020/041162 2020-11-04 2020-11-04 Dispositif de traitement de données, procédé de traitement de données et programme WO2022097189A1 (fr)

Priority Applications (3)

Application Number Priority Date Filing Date Title
PCT/JP2020/041162 WO2022097189A1 (fr) 2020-11-04 2020-11-04 Dispositif de traitement de données, procédé de traitement de données et programme
JP2020561940A JP6870159B1 (ja) 2020-11-04 2020-11-04 データ処理装置、データ処理方法及びプログラム
JP2021068170A JP2022075467A (ja) 2020-11-04 2021-04-14 データ処理装置、データ処理方法及びプログラム

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2020/041162 WO2022097189A1 (fr) 2020-11-04 2020-11-04 Dispositif de traitement de données, procédé de traitement de données et programme

Publications (1)

Publication Number Publication Date
WO2022097189A1 true WO2022097189A1 (fr) 2022-05-12

Family

ID=75801856

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2020/041162 WO2022097189A1 (fr) 2020-11-04 2020-11-04 Dispositif de traitement de données, procédé de traitement de données et programme

Country Status (2)

Country Link
JP (2) JP6870159B1 (fr)
WO (1) WO2022097189A1 (fr)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2024050375A (ja) 2022-09-29 2024-04-10 株式会社トランザック プログラム、事業者情報確認方法及び事業者情報確認システム

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004133565A (ja) * 2002-10-09 2004-04-30 Fujitsu Ltd インターネットを利用した文字認識の後処理装置
JP2012517637A (ja) * 2009-02-10 2012-08-02 コファックス, インコーポレイテッド 文書の有効性を決定するためのシステム、方法およびコンピュータプログラム製品
JP2014078203A (ja) * 2012-10-12 2014-05-01 Fuji Xerox Co Ltd 画像処理装置及び画像処理プログラム
JP2014137791A (ja) * 2013-01-18 2014-07-28 Fujitsu Ltd 表示プログラム、表示装置及び表示方法
JP2016159245A (ja) * 2015-03-03 2016-09-05 株式会社東芝 配達物処理装置、および配達物処理プログラム

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004133565A (ja) * 2002-10-09 2004-04-30 Fujitsu Ltd インターネットを利用した文字認識の後処理装置
JP2012517637A (ja) * 2009-02-10 2012-08-02 コファックス, インコーポレイテッド 文書の有効性を決定するためのシステム、方法およびコンピュータプログラム製品
JP2014078203A (ja) * 2012-10-12 2014-05-01 Fuji Xerox Co Ltd 画像処理装置及び画像処理プログラム
JP2014137791A (ja) * 2013-01-18 2014-07-28 Fujitsu Ltd 表示プログラム、表示装置及び表示方法
JP2016159245A (ja) * 2015-03-03 2016-09-05 株式会社東芝 配達物処理装置、および配達物処理プログラム

Also Published As

Publication number Publication date
JP6870159B1 (ja) 2021-05-12
JP2022075467A (ja) 2022-05-18
JPWO2022097189A1 (fr) 2022-05-12

Similar Documents

Publication Publication Date Title
US6801658B2 (en) Business form handling method and system for carrying out the same
US8468167B2 (en) Automatic data validation and correction
EP1483729B1 (fr) Extraction de texte ecrit sur un cheque
JP6357621B1 (ja) 会計処理装置、会計処理システム、会計処理方法及びプログラム
US20140169665A1 (en) Automated Processing of Documents
US8049921B2 (en) System and method for transferring invoice data output of a print job source to an automated data processing system
WO2022097189A1 (fr) Dispositif de traitement de données, procédé de traitement de données et programme
US20220044012A1 (en) Information processing apparatus, information processing method, and computer program product
JP2019023793A (ja) 仕訳情報処理装置、仕訳情報処理方法、およびプログラム
US20100023517A1 (en) Method and system for extracting data-points from a data file
JP2004013813A (ja) 情報管理システムおよび情報管理方法
JP7122896B2 (ja) 帳票情報処理装置、帳票情報構造化処理方法及び帳票情報構造化処理プログラム
JP6993032B2 (ja) 会計処理装置、会計処理システム、会計処理方法及びプログラム
JP3766854B2 (ja) データ処理装置
WO2022029874A1 (fr) Dispositif de traitement de données, procédé de traitement de données et programme de traitement de données
JPH10105654A (ja) 帳票用文字認識装置
JP2022133739A (ja) プログラム及び情報処理装置
JP6946222B2 (ja) 給与情報処理装置、給与情報処理方法、およびプログラム
TWM584476U (zh) 轉帳伺服系統
JP2021018520A (ja) 情報処理装置、情報処理方法及びプログラム
JP2021064122A (ja) 画像処理装置、画像処理方法、及びプログラム
US20230140357A1 (en) Image processing apparatus, image processing method, and non-transitory storage medium
TWI768744B (zh) 參考單據產生方法及系統
JP7484176B2 (ja) 情報処理装置、情報処理システムおよびプログラム
JP2806340B2 (ja) 帳票管理装置

Legal Events

Date Code Title Description
ENP Entry into the national phase

Ref document number: 2020561940

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20960743

Country of ref document: EP

Kind code of ref document: A1