CN115640952B - Method and system for importing and uploading data - Google Patents

Info

Publication number
CN115640952B
Authority
CN
China
Prior art keywords
processing platform
field
added
unified processing
electronic form
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202211042463.5A
Other languages
Chinese (zh)
Other versions
CN115640952A (en)
Inventor
刘忠
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Greyjet Shenzhen Technology Co ltd
Original Assignee
Greyjet Shenzhen Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Greyjet Shenzhen Technology Co ltd
Priority to CN202211042463.5A
Publication of CN115640952A
Application granted
Publication of CN115640952B
Active legal status
Anticipated expiration

Landscapes

  • Character Input (AREA)

Abstract

The application provides a method and a system for importing and uploading data. The method comprises: determining a target type of target data and constructing a unified processing platform; when the target type is the electronic form type, adding a user-defined first field to the electronic form, identifying and judging whether the form needs to be added to the unified processing platform and, if so, adding the electronic form with the first field to the unified processing platform, acquiring supplementary information, and displaying the first field at a designated position of the detail page corresponding to the electronic form; and when the target type is the paper bill type, identifying and judging whether the bill needs to be added to the unified processing platform and, if so, displaying a custom setting block to the user based on the unified processing platform and adding the related second fields. By identifying and judging data of different types, the integrity of the different data is ensured, the requirement of standard importing is largely met, the user's operation time is effectively reduced, and working efficiency is improved.

Description

Method and system for importing and uploading data
Technical Field
The application relates to a data importing and uploading method and system, and belongs to the technical field of data uploading.
Background
When a system is in use, new data such as orders, contracts, or other documents is routinely created in it. Before adopting the system, however, most individuals and companies already hold offline paper documents or other electronic data, so being able to import that data quickly saves a great deal of time and greatly improves efficiency.
The prior art offers roughly two schemes. In the first, a bar code such as a waybill bar code is scanned, and each scan opens the record or uploads it to the system, provided that the system can identify the bar code and the data formats are the same. In the second, the system provides a template: the user downloads it, fills in the data according to its format, and uploads it, provided that the fields given by the system are fields the system can identify. For example, for a bank payroll transfer, all the information to be transferred is entered in the specified format and then uploaded, so that the payroll entries can be added in one operation.
The defect of the prior art is that it requires data the system can identify; if the data includes many items that cannot be identified, the import fails. This problem is common because form and document content differs between companies. For example, the order form of company A may contain picture content that the order form of company B does not. If companies A and B are both to import their desired content into the same system, custom development is required. Customization is costly in the software industry, and ordinary companies cannot bear that cost. A method that meets the import and upload requirements of both A and B on the same platform would therefore greatly reduce users' operation time and greatly improve working efficiency.
Therefore, the invention provides a method and a system for importing and uploading data.
Disclosure of Invention
In view of the defects of the prior art, the invention aims to provide a method and system for importing and uploading data. The data type is determined, a unified processing platform is constructed, data of different types is identified and judged and then received by the unified processing platform, and the integrity of the data is ensured by displaying different fields. This largely meets the requirement of standard importing, effectively reduces the user's operation time, and improves working efficiency.
According to a first aspect of an embodiment of the present invention, there is provided a method for importing and uploading data, comprising:
Step 1: determining a target type of target data and constructing a unified processing platform;
Step 2: when the target type is the electronic form type, adding a user-defined first field to the electronic form, identifying and judging whether the form needs to be added to the unified processing platform and, if so, adding the electronic form with the first field to the unified processing platform, acquiring supplementary information, and displaying the first field at a designated position of the detail page corresponding to the electronic form;
Step 3: when the target type is the paper bill type, identifying and judging whether the bill needs to be added to the unified processing platform and, if so, displaying a custom setting block to the user based on the unified processing platform and adding the related second fields.
Further, adding a user-defined first field to the electronic form and identifying and judging whether the form needs to be added to the unified processing platform comprises:
scanning the electronic form to obtain the form arrangement of the electronic form;
identifying the form arrangement and, according to the identification result, performing a first judgment on the recognition clarity of each form character in the form arrangement and a second judgment on the recognition integrity of each form character;
determining the character recognition eligibility of the corresponding form character according to the first judgment result and the second judgment result;
locking the form characters whose character recognition eligibility is greater than the preset eligibility, obtaining a first sub-layout of the form arrangement from the locking result, and obtaining a second sub-layout of the form arrangement from the locking result of the remaining unqualified form characters;
performing a first analysis on the layout position of the first sub-layout based on the second sub-layout, and performing a second analysis on the first recognition result of the first sub-layout based on the second recognition result of the second sub-layout;
judging whether the second sub-layout can be rejected according to the first analysis result and the second analysis result;
if yes, rejecting the second sub-layout, and retaining and storing the first recognition result of the first sub-layout;
if not, importing and transmitting the electronic form to the unified processing platform.
Further, determining the character recognition eligibility of the corresponding form character according to the first judgment result and the second judgment result comprises:
judging whether the corresponding form character is an electronic character and, if so, determining the eligibility according to the following formula:
wherein $s_{i1,0}$ represents the standard line of the $i1$-th stroke of the corresponding electronic character before the electronic form is recognized; $s_{i1,1}$ represents the recognition line of the $i1$-th stroke of the corresponding electronic character after the electronic form is recognized; oc 2 represents the line conversion coefficient for the electronic form; $e$ represents the exponential function; the clarity coefficient of the corresponding electronic character takes the value 1 when the ratio of the recognition clarity $S1$ to the preset clarity $S0$, determined based on the second judgment result, lies within $(0.4, 1]$, and takes the value 0 when that ratio lies within $(0, 0.4]$; the recognition integrity of the $i1$-th stroke is determined by the second judgment result; $Y$ represents the recognition eligibility of the corresponding electronic character; and $n1$ represents the total number of strokes of the corresponding electronic character.
Further, judging whether the corresponding form character is an electronic character and, if it is not and is instead a handwritten character, determining the eligibility according to the following formula:
wherein $\mathrm{sim}(r_0, r_1)$ represents the outline similarity between the corresponding handwritten character $r_0$ before recognition and the character $r_1$ after the electronic form is recognized; $\mathrm{sim}(r_{i2,0}, r_{i2,1})$ represents the similarity between the handwriting line $r_{i2,0}$ of the $i2$-th stroke of the corresponding handwritten character before the electronic form is recognized and the recognition line $r_{i2,1}$ of the same stroke after recognition; $n2$ represents the total number of strokes of the corresponding handwritten character before the electronic form is recognized; and $Y1$ represents the recognition eligibility of the corresponding handwritten character.
Further, adding the electronic form with the first field to the unified processing platform, acquiring the supplementary information, and displaying the first field at the designated position of the detail page corresponding to the electronic form comprises:
acquiring supplementary information according to each first field;
acquiring keywords of the supplementary information, and retrieving the designated position matched with the keywords based on the unified processing platform;
placing the first field matched with the keywords at the designated position for display.
Further, displaying the custom setting block to the user based on the unified processing platform and adding the related second fields comprises:
when the unified processing platform receives a data bill, identifying and acquiring the bill line layout of the data bill, and matching against a layout database according to the line layout to obtain a corresponding set of setting blocks;
sequentially displaying the custom setting blocks based on the set of setting blocks, and setting an addition guidance in each custom setting block;
when the user adds a second field in a custom setting block, sending an addition request to the unified processing platform and, when the request passes, automatically enlarging and displaying the addition guidance of that custom setting block;
meanwhile, checking the second field added by the user against the block attributes of the corresponding custom setting block;
if the check passes, retaining the added second field;
if the check fails, acquiring the failure information, pulling out a display block from the corresponding custom setting block, and displaying the failure information;
establishing an association between the failure information and the addition guidance to obtain a scheme to be passed, marking the re-guidance steps in the addition guidance, and numbering and displaying the re-guidance steps in order.
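The check of a user-added second field against the block attributes, and the numbering of re-guidance steps on failure, can be sketched as follows. This is a minimal illustration under stated assumptions: the source does not say which block attributes exist or how a failure maps to a guidance step, so `SettingBlock`, its `expected_type` and `max_length` attributes, and the failure-kind-to-step mapping in `guidance` are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class SettingBlock:
    """Hypothetical custom setting block with two illustrative attributes."""
    name: str
    expected_type: type
    max_length: int
    guidance: dict  # hypothetical mapping: failure kind -> guidance step text


def check_second_field(block, value):
    """Return the list of failure kinds; an empty list means the check passed."""
    failures = []
    if not isinstance(value, block.expected_type):
        failures.append("type")
    elif isinstance(value, str) and len(value) > block.max_length:
        failures.append("length")
    return failures


def re_guidance(block, failures):
    """Pair each failure's guidance step with a display number, in order."""
    steps = [block.guidance[kind] for kind in failures if kind in block.guidance]
    return list(enumerate(steps, start=1))


block = SettingBlock(
    name="amount",
    expected_type=str,
    max_length=5,
    guidance={"type": "Enter text", "length": "Shorten to 5 characters"},
)
failures = check_second_field(block, "123456")
numbered_steps = re_guidance(block, failures)
```

A field that passes the check yields an empty failure list and is retained; otherwise the numbered steps are what the display block would show next to the failure information.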
Further, establishing the association between the failure information and the addition guidance to obtain the scheme to be passed comprises:
acquiring a first position of the failure information within the second field, performing sequence standard conversion on the second field to obtain a first information sequence, and labeling the sequence corresponding to the first position in the first information sequence;
determining the cutting segment of each labeled sequence based on the first information sequence, and determining a first association weight of the corresponding labeled sequence based on the determined cutting segment;
acquiring the total association weight of each cutting segment, and screening out the first segments whose total association weight is greater than a preset weight;
acquiring the segment meaning of each first segment, determining the ambiguity information between the segment meaning and the guidance meaning of the addition guidance, and establishing the association according to the degree of ambiguity;
rearranging the steps based on all the ambiguity information and the associations to obtain the adjusted steps, and thereby the scheme to be passed.
Further, performing the second analysis on the first recognition result of the first sub-layout based on the second recognition result of the second sub-layout comprises:
performing layout analysis on the second recognition result to obtain the first recognition content of each line block;
extracting the second recognition content corresponding to the same line block from the first recognition result;
analyzing the first recognition content on its own to obtain first semantics;
combining the first recognition content and the second recognition content according to position to obtain third recognition content, and analyzing the third recognition content as a whole to obtain second semantics;
when the similarity between the first semantics and the second semantics is greater than a preset similarity, retaining the second recognition content corresponding to the same line block and applying a first qualified calibration to the second recognition content in that line block;
otherwise, retaining the second recognition content corresponding to the same line block and applying a first unqualified calibration to the second recognition content in that line block;
determining a first number of all first unqualified calibration results and a second number of all first qualified calibration results;
calculating the influence value of the unqualified calibration results on the electronic form;
wherein $m1$ represents the first number; $m2$ represents the second number; $d_{j1}$ represents the position weight, based on the electronic form, of the $j1$-th first unqualified calibration result; $d_{j2}$ represents the position weight, based on the electronic form, of the $j2$-th first qualified calibration result; and $\ln$ represents the logarithmic function;
when the influence value is greater than a preset value, importing and transmitting the electronic form to the unified processing platform;
otherwise, rejecting the second sub-layout.
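The calibration and decision steps above can be sketched as follows. This is a hedged illustration only: the influence-value formula is not reproduced in this text, so the sketch takes the influence function as a caller-supplied parameter, and the `weight_share` function used in the example is a stand-in invented here, not the patent's formula. Semantic similarity is likewise approximated with the standard-library `difflib.SequenceMatcher` ratio, which compares character sequences rather than meaning.

```python
from difflib import SequenceMatcher


def text_similarity(a, b):
    """Stand-in for semantic similarity: character-sequence ratio in [0, 1]."""
    return SequenceMatcher(None, a, b).ratio()


def calibrate_blocks(blocks, similarity, preset_similarity):
    """Split line blocks into unqualified and qualified position weights.

    blocks: iterable of (first_semantics, second_semantics, position_weight).
    """
    unqualified, qualified = [], []
    for first_sem, second_sem, weight in blocks:
        if similarity(first_sem, second_sem) > preset_similarity:
            qualified.append(weight)
        else:
            unqualified.append(weight)
    return unqualified, qualified


def should_send_to_platform(unqualified, qualified, influence, preset_value):
    """True when the influence value exceeds the preset value."""
    return influence(unqualified, qualified) > preset_value


def weight_share(unqualified, qualified):
    """Illustrative stand-in only: share of position weight that is unqualified."""
    total = sum(unqualified) + sum(qualified)
    return sum(unqualified) / total if total else 0.0


blocks = [("total 100", "total 100", 1.5), ("abc", "xyz", 0.5)]
unqualified, qualified = calibrate_blocks(blocks, text_similarity, 0.8)
send = should_send_to_platform(unqualified, qualified, weight_share, 0.2)
```

Here the lengths of `unqualified` and `qualified` correspond to the first number $m1$ and the second number $m2$, and their entries to the position weights $d_{j1}$ and $d_{j2}$.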
Further, the process of adding the electronic form with the first field to the unified processing platform further comprises:
acquiring the source of the first field, and performing a first format conversion on the first field according to the data structure of the source;
calling the processing interface between the first field and the unified processing platform, and performing a second format conversion on the converted first field according to the interface structure of the processing interface, to obtain a field that matches the format of the unified processing platform.
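The two-stage conversion can be sketched as follows, assuming two hypothetical source structures (a spreadsheet cell pair and a JSON-like record) and a hypothetical interface structure expressed as a key-renaming map. None of these names or structures come from the source; they only illustrate converting first by source and then by interface.

```python
def normalize_by_source(raw_field, source):
    """First conversion: map a source-specific structure to a neutral one."""
    if source == "spreadsheet":      # hypothetical: (name, value) tuple
        name, value = raw_field
    elif source == "json":           # hypothetical: {"key": ..., "val": ...}
        name, value = raw_field["key"], raw_field["val"]
    else:
        raise ValueError(f"unknown source: {source}")
    return {"name": str(name).strip(), "value": value}


def adapt_to_interface(field, interface_schema):
    """Second conversion: rename keys to those the processing interface expects."""
    return {interface_schema[key]: value for key, value in field.items()}


# Hypothetical interface structure of the unified processing platform.
PLATFORM_SCHEMA = {"name": "fieldName", "value": "fieldValue"}

converted = adapt_to_interface(
    normalize_by_source((" remark ", "fragile"), "spreadsheet"),
    PLATFORM_SCHEMA,
)
```

Keeping the two conversions separate means a new source only needs a new branch in the first function, while the platform-facing format stays in one place.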
The invention further provides a system for importing and uploading data, comprising:
a type determining module, configured to determine the target type of the target data and construct a unified processing platform;
an electronic form processing module, configured to, when the target type is the electronic form type, add a user-defined first field to the electronic form, identify and judge whether the form needs to be added to the unified processing platform and, if so, add the electronic form with the first field to the unified processing platform, acquire supplementary information, and display the first field at a designated position of the detail page corresponding to the electronic form;
a paper bill processing module, configured to, when the target type is the paper bill type, identify and judge whether the bill needs to be added to the unified processing platform and, if so, display a custom setting block to the user based on the unified processing platform and add the related second fields.
Compared with the prior art, the application has the following beneficial effects:
by determining the data type, constructing a unified processing platform, identifying and judging data of different types, receiving that data on the unified processing platform, and displaying different fields, the integrity of the data is ensured, the requirement of standard importing is largely met, the user's operation time is effectively reduced, and working efficiency is improved.
Drawings
FIG. 1 is a flowchart of a method for importing and uploading data according to an embodiment of the present application;
FIG. 2 is a block diagram of a system for importing and uploading data according to an embodiment of the present application.
Detailed Description
In order that those skilled in the art will better understand the technical solutions of the present application, the technical solutions of the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is apparent that the described embodiments are only some embodiments of the present application, not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to be within the scope of the application.
It is noted that when an element is referred to as being "fixed" or "disposed on" another element, it can be directly on the other element or be indirectly disposed on the other element; when an element is referred to as being "connected to" another element, it can be directly connected to the other element or be indirectly connected to the other element.
It is to be understood that the terms "length," "width," "upper," "lower," "front," "rear," "left," "right," "vertical," "horizontal," "top," "bottom," "inner," "outer," and the like are merely used for convenience in describing and simplifying the description of the present application based on the orientation or positional relationship shown in the drawings, and do not indicate or imply that the devices or components referred to must have a particular orientation, be constructed and operated in a particular orientation, and therefore should not be construed as limiting the present application.
Furthermore, the terms "first," "second," and the like, are used for descriptive purposes only and are not to be construed as indicating or implying a relative importance or number of technical features indicated. Thus, a feature defining "a first" or "a second" may explicitly or implicitly include one or more such feature. In the description of the present application, the meaning of "a plurality" or "a number" means two or more, unless specifically defined otherwise.
It should be understood that the structures, proportions, sizes, and the like shown in the drawings serve only to aid understanding and reading of the disclosure and are not intended to limit the scope of the application, which is defined by the claims. Unless otherwise indicated, any structural modification, proportional change, or dimensional adjustment that would be apparent to those skilled in the art remains within the spirit and scope of the application.
Example 1:
According to a first aspect of an embodiment of the present application, there is provided a method for importing and uploading data which, as shown in FIG. 1, comprises:
Step 1: determining a target type of target data and constructing a unified processing platform;
Step 2: when the target type is the electronic form type, adding a user-defined first field to the electronic form, identifying and judging whether the form needs to be added to the unified processing platform and, if so, adding the electronic form with the first field to the unified processing platform, acquiring supplementary information, and displaying the first field at a designated position of the detail page corresponding to the electronic form;
Step 3: when the target type is the paper bill type, identifying and judging whether the bill needs to be added to the unified processing platform and, if so, displaying a custom setting block to the user based on the unified processing platform and adding the related second fields.
In this embodiment, the target types are the electronic form type and the paper bill type.
In this embodiment, the first field and the second field may describe what processing the form requires, or may supplement the part of the form that was not clearly identified, so as to ensure the efficiency of identifying the imported data.
In this embodiment, when recognition shows that the form or document does not need to be added to the unified processing platform, the recognition result is clear, and it can be imported directly into the related system to complete the upload.
In this embodiment, if recognition shows that the form or document needs to be added to the unified processing platform, the form or document is identified and supplemented there to ensure the integrity of its data.
In this embodiment, the unified processing platform is a platform compatible with different data types, including both form and document types, and is mainly used to supplement data.
In this embodiment, displaying at the designated position and adding setting blocks both serve to reserve a corresponding supplementary position, so that the unified processing platform can acquire the corresponding position information efficiently and the data can be supplemented completely.
In this embodiment, after the supplementary information is uploaded, the fields added by the user are displayed at a specific position on the detail page of the document (the system presets this layout/page position and reserves it for storing the user's newly added or customized content), so that the original page of the system is not disturbed.
The beneficial effects of the technical scheme are as follows: by determining the data type, constructing a unified processing platform, identifying and judging data of different types, receiving that data on the unified processing platform, and displaying different fields, the integrity of the data is ensured, the requirement of standard importing is largely met, the user's operation time is effectively reduced, and working efficiency is improved.
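The routing described in steps 1 to 3 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: `TargetType`, `Document`, `UnifiedPlatform`, `import_document`, and the `recognition_is_clear` flag are hypothetical names introduced here, and the recognition step is assumed to have already run and produced that flag.

```python
from dataclasses import dataclass, field
from enum import Enum


class TargetType(Enum):
    ELECTRONIC_FORM = "electronic_form"
    PAPER_BILL = "paper_bill"


@dataclass
class Document:
    target_type: TargetType
    recognition_is_clear: bool  # hypothetical result of the recognition step
    fields: dict = field(default_factory=dict)


@dataclass
class UnifiedPlatform:
    """Hypothetical stand-in for the unified processing platform."""
    received: list = field(default_factory=list)

    def add(self, doc):
        self.received.append(doc)


def import_document(doc, platform, custom_fields):
    """Route a document as in steps 1 to 3 and return the route taken."""
    if doc.target_type is TargetType.ELECTRONIC_FORM:
        # Step 2: the user-defined first fields are attached to the form first.
        doc.fields.update(custom_fields)
    if doc.recognition_is_clear:
        # A clear recognition result is imported directly, without the platform.
        return "direct_import"
    platform.add(doc)
    if doc.target_type is TargetType.PAPER_BILL:
        # Step 3: second fields are added through the platform's setting blocks.
        doc.fields.update(custom_fields)
    return "unified_platform"


platform = UnifiedPlatform()
form = Document(TargetType.ELECTRONIC_FORM, recognition_is_clear=False)
form_route = import_document(form, platform, {"remark": "fragile"})
bill = Document(TargetType.PAPER_BILL, recognition_is_clear=True)
bill_route = import_document(bill, platform, {"remark": "urgent"})
```

In this sketch only the unclear electronic form reaches the platform; the clearly recognized paper bill is imported directly and receives no second fields.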
Example 2:
On the basis of embodiment 1, adding a user-defined first field to the electronic form and identifying and judging whether the form needs to be added to the unified processing platform comprises:
scanning the electronic form to obtain the form arrangement of the electronic form;
identifying the form arrangement and, according to the identification result, performing a first judgment on the recognition clarity of each form character in the form arrangement and a second judgment on the recognition integrity of each form character;
determining the character recognition eligibility of the corresponding form character according to the first judgment result and the second judgment result;
locking the form characters whose character recognition eligibility is greater than the preset eligibility, obtaining a first sub-layout of the form arrangement from the locking result, and obtaining a second sub-layout of the form arrangement from the locking result of the remaining unqualified form characters;
performing a first analysis on the layout position of the first sub-layout based on the second sub-layout, and performing a second analysis on the first recognition result of the first sub-layout based on the second recognition result of the second sub-layout;
judging whether the second sub-layout can be rejected according to the first analysis result and the second analysis result;
if yes, rejecting the second sub-layout, and retaining and storing the first recognition result of the first sub-layout;
if not, importing and transmitting the electronic form to the unified processing platform.
In this embodiment, the form arrangement refers to the form format of the acquired electronic form.
In this embodiment, recognition clarity refers to whether the corresponding character is blurred, and recognition integrity refers to whether the strokes of the corresponding character are complete.
In this embodiment, the preset eligibility is set in advance and differs with the kind of character being recognized, for example an electronic character or a handwritten character.
In this embodiment, character locking is used first to determine the layout of the qualified characters and second to determine the layout of the unqualified characters; each layout is determined from the locked positions relative to the overall arrangement of the electronic form.
In this embodiment, the first analysis of the first sub-layout based on the second sub-layout mainly determines the position of each character in the first and second sub-layouts. The second analysis determines how the recognition result of the second sub-layout affects the recognition result of the first sub-layout, and thus whether the second sub-layout can be rejected; that is, it determines the influence of the unqualified characters on the whole electronic form. If there is almost no influence, the content of the second sub-layout may be deleted and the recognition result of the first sub-layout is imported directly into the corresponding system. If there is an influence, the data is supplemented through the unified processing platform and then imported into the corresponding system.
The beneficial effects of the technical scheme are as follows: the electronic form is scanned and each character in the arrangement is recognized to obtain the first sub-layout and the second sub-layout, and position analysis and result analysis determine whether the second sub-layout can be rejected, which ensures the importing efficiency of the form.
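The locking step, which splits the form characters into a first and a second sub-layout by comparing each character's eligibility with a preset eligibility, can be sketched as follows. The sketch assumes the eligibility of each character has already been computed; `FormCharacter`, `split_layout`, and the two preset parameters are hypothetical names, with separate presets for electronic and handwritten characters reflecting the statement that the preset differs by character kind.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FormCharacter:
    text: str
    position: tuple          # (row, column) in the form arrangement
    eligibility: float       # recognition eligibility, assumed precomputed
    handwritten: bool = False


def split_layout(chars, preset_electronic, preset_handwritten):
    """Return (first_sub_layout, second_sub_layout) as lists of characters."""
    first, second = [], []
    for ch in chars:
        preset = preset_handwritten if ch.handwritten else preset_electronic
        # Characters above the preset are locked into the first sub-layout.
        (first if ch.eligibility > preset else second).append(ch)
    return first, second


chars = [
    FormCharacter("A", (0, 0), 0.95),
    FormCharacter("7", (0, 1), 0.40),
    FormCharacter("x", (1, 0), 0.70, handwritten=True),
]
first_sub, second_sub = split_layout(chars, preset_electronic=0.8, preset_handwritten=0.6)
```

Each character keeps its position, so the two returned lists still describe where the qualified and unqualified characters sit in the overall arrangement.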
Example 3:
On the basis of embodiment 2, determining the character recognition eligibility of the corresponding form character according to the first judgment result and the second judgment result comprises:
judging whether the corresponding form character is an electronic character and, if so, determining the eligibility according to the following formula:
wherein $s_{i1,0}$ represents the standard line of the $i1$-th stroke of the corresponding electronic character before the electronic form is recognized; $s_{i1,1}$ represents the recognition line of the $i1$-th stroke of the corresponding electronic character after the electronic form is recognized; oc 2 represents the line conversion coefficient for the electronic form; $e$ represents the exponential function; the clarity coefficient of the corresponding electronic character takes the value 1 when the ratio of the recognition clarity $S1$ to the preset clarity $S0$, determined based on the second judgment result, lies within $(0.4, 1]$, and takes the value 0 when that ratio lies within $(0, 0.4]$; the recognition integrity of the $i1$-th stroke is determined by the second judgment result; $Y$ represents the recognition eligibility of the corresponding electronic character; and $n1$ represents the total number of strokes of the corresponding electronic character.
In this embodiment, an electronic character refers to a character entered on a computer or a mobile phone.
A handwritten character refers to a manually written character that appears in the form as a picture or in a similar form.
The beneficial effects of the technical scheme are as follows: the clarity coefficient and the recognition integrity of the electronic character determine whether the character is qualified, which provides an effective basis for deciding whether the form is subsequently transmitted to the unified processing platform.
Example 4:
On the basis of embodiment 3, whether the corresponding form character is an electronic character is judged and, if it is not and is instead a handwritten character, the eligibility is determined according to the following formula:
wherein $\mathrm{sim}(r_0, r_1)$ represents the outline similarity between the corresponding handwritten character $r_0$ before recognition and the character $r_1$ after the electronic form is recognized; $\mathrm{sim}(r_{i2,0}, r_{i2,1})$ represents the similarity between the handwriting line $r_{i2,0}$ of the $i2$-th stroke of the corresponding handwritten character before the electronic form is recognized and the recognition line $r_{i2,1}$ of the same stroke after recognition; $n2$ represents the total number of strokes of the corresponding handwritten character before the electronic form is recognized; and $Y1$ represents the recognition eligibility of the corresponding handwritten character.
The beneficial effects of the technical scheme are as follows: the outline similarity and the stroke-line similarity of the handwritten character determine whether the character is qualified, which provides an effective basis for deciding whether the form is transmitted to the unified processing platform.
Example 5:
based on embodiment 1, adding the electronic form added with the first field to the unified processing platform, acquiring the supplementing information, and displaying the first field at the designated position of the detail page corresponding to the electronic form, wherein the method comprises the following steps:
acquiring supplementary information according to each first field;
acquiring keywords of the supplemental information, and retrieving, based on the unified processing platform, the designated position matched with the keywords;
and placing a first field matched with the keyword at the designated position for display.
In this embodiment, once the first field is determined, the supplementary information can be acquired. The first field may itself be supplementary information, that is, information that was not clearly recognized and is entered manually, or a representative symbol standing for certain information, so that this part of the information is guaranteed to exist. Keywords of the supplementary information are then extracted, for example 00 and 11; the designated position matching the keywords 00 and 11, such as position A1 of the page, is retrieved, and the first field is displayed at position A1.
The beneficial effects of the technical scheme are as follows: and the supplemental information is acquired according to the first field, and the keyword is extracted from the supplemental information, so that the display of the designated position is facilitated, the integrity of the supplement is ensured, and the efficiency of the follow-up input is indirectly improved.
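The keyword lookup of this embodiment can be illustrated with a minimal Python sketch. The index contents, the comma-separated keyword format, and all function names are assumptions made for illustration only; the source does not specify how keywords are extracted or how positions are stored.

```python
from typing import Dict, List, Optional

# Hypothetical index held by the unified processing platform:
# keyword -> designated position on the detail page.
POSITION_INDEX: Dict[str, str] = {"00": "A1", "11": "A1", "22": "B3"}


def extract_keywords(supplementary_info: str) -> List[str]:
    """Split supplementary information into keywords (assumed comma-separated)."""
    return [token.strip() for token in supplementary_info.split(",") if token.strip()]


def find_designated_position(keywords: List[str], index: Dict[str, str]) -> Optional[str]:
    """Return the single position all known keywords point to, else None."""
    positions = {index[k] for k in keywords if k in index}
    return positions.pop() if len(positions) == 1 else None


def place_first_field(first_field: str, supplementary_info: str) -> Dict[str, str]:
    """Map the first field to its designated position for display."""
    keywords = extract_keywords(supplementary_info)
    position = find_designated_position(keywords, POSITION_INDEX)
    if position is None:
        raise LookupError("no single designated position matches the keywords")
    return {position: first_field}
```

With the keywords 00 and 11 from the example above, `place_first_field("remark", "00, 11")` places the field at position A1.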
Example 6:
based on the embodiment 1, the displaying the custom setting block to the user based on the unified processing platform, and adding the relevant second field, includes:
when the unified processing platform receives a data bill, identifying and acquiring the bill line layout of the data bill, and matching from a layout database according to the line layout to obtain a corresponding set block set;
sequentially displaying custom setting blocks based on the setting block set, and setting an adding guide in the custom setting blocks;
when a user adds a second field in the corresponding custom setting block, an addition request is sent to the unified processing platform, and when the request passes, the addition guidance of the corresponding custom setting block is automatically amplified and displayed;
meanwhile, checking the second field added by the user in a self-definition mode according to the block attribute of the corresponding self-definition setting block;
if the check passes, reserving the added second field;
If the check fails, acquiring failed information, pulling out a display block from the corresponding custom setting block, and displaying the failed information;
establishing an association relation between the failed information and the added guidance to obtain a scheme to be passed, marking a re-guidance step in the added guidance, and numbering and displaying the sequence of each re-guidance step.
In this embodiment, for example, lines 1, 2 and 3 exist. The positions where supplementary content can be placed are determined from the layout of lines 1, 2 and 3; because these positions are fixed by the positions of the bill itself, the different positions together form the set of setting blocks.
In this embodiment, because the setting blocks in the set are displayed at different positions, the content to be entered in each setting block is prompted through the addition guidance, which ensures that additions are reasonable and indirectly improves subsequent import efficiency.
In this embodiment, the second field refers to a field added by a user, and after the addition request passes, the field can be added in a corresponding block, and in the addition process, the corresponding addition guide is displayed in an enlarged manner, so that the reliability of the guide is ensured.
In this embodiment, the block attribute determines the inspection mode used to check the corresponding setting block. The inspection mode is retrieved from an attribute-mode database, which contains different attributes and the inspection modes matched with them.
In this embodiment, the failed information refers to a portion of the added information that does not meet the addition criterion.
In this embodiment, for example, the added information is "the transaction amount is 10000". Here 10000 does not meet the addition standard, because the amount should be written in capital (word-form) numerals rather than digits; the user can then be guided to correct it through the re-guidance steps.
The beneficial effects of the technical scheme are as follows: the method has the advantages that the set of the set blocks is determined according to the line layout, the effective addition of the second field is realized through setting the addition guide, the added content is checked in the addition process, the existing addition problem is effectively determined, the reasonability of addition is guaranteed through guiding again, an effective basis is provided for the follow-up acquisition of complete data, and the high efficiency of data import is guaranteed.
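The check of a second field against the block attribute can be sketched as follows. This is only an illustration under stated assumptions: the attribute names, the check functions, and the guidance texts are hypothetical stand-ins for the attribute-mode database and the addition guidance, which the source does not define in detail.

```python
import re
from typing import Callable, Dict, List, Optional


def check_capitalized_amount(value: str) -> Optional[str]:
    """Amount blocks must not contain Arabic digits; return the failing part."""
    match = re.search(r"[0-9]+", value)
    return match.group(0) if match else None


def check_non_empty(value: str) -> Optional[str]:
    """Text blocks must contain at least one non-blank character."""
    return None if value.strip() else value


# Hypothetical attribute-mode database: block attribute -> inspection mode.
ATTRIBUTE_MODE_DB: Dict[str, Callable[[str], Optional[str]]] = {
    "amount": check_capitalized_amount,
    "text": check_non_empty,
}

# Hypothetical addition guidance per block attribute.
ADDITION_GUIDANCE: Dict[str, List[str]] = {
    "amount": ["locate the digits", "rewrite them in capital numerals"],
    "text": ["enter a non-empty value"],
}


def check_second_field(block_attribute: str, second_field: str) -> Dict[str, object]:
    """Keep the field if the check passes, else report failed info and numbered steps."""
    failed = ATTRIBUTE_MODE_DB[block_attribute](second_field)
    if failed is None:
        return {"kept": True, "field": second_field}
    steps = [f"{n}. {step}" for n, step in enumerate(ADDITION_GUIDANCE[block_attribute], start=1)]
    return {"kept": False, "failed_info": failed, "re_guidance": steps}
```

For the field "transaction amount is 10000" in an amount block, the sketch reports 10000 as the failed information and returns the numbered re-guidance steps.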
Example 7:
based on embodiment 6, establishing an association relationship between the failed information and the added guidance to obtain a scheme to be passed, including:
Acquiring a first position of the failed information based on a second field, performing sequence standard conversion on the second field to obtain a first information sequence, and marking a sequence corresponding to the first position in the first information sequence;
determining a cutting segment of each labeling sequence based on the first information sequence, and determining a first association weight of the corresponding labeling sequence based on the determined cutting segment;
acquiring the total association weight of each cutting segment, and screening a first segment with the total association weight greater than a preset weight;
acquiring segment meanings of each first segment respectively, determining ambiguity information of the segment meanings and guiding meanings corresponding to the added guiding, and establishing an association relation according to ambiguity degree;
and rearranging and obtaining an adjustment step based on all the ambiguity information and the association relation, and obtaining a scheme to be passed.
In this embodiment, if the second field is "transaction amount is 10000", then the failed information is 10000, and the first position of 10000 within the second field must be determined. At the same time, each character in the second field is converted, according to the standard and in the reverse direction, so that the field can be processed uniformly, which improves processing efficiency. For example, if the obtained sequence is 1234589999, then 89999 is the sequence corresponding to the first position, and the first information sequence is cut into the segment 12345 and the segment 89999.
In this embodiment, the first association weight refers to the proportion of the corresponding cutting segment that the first position occupies; the larger the proportion, the larger the weight. When the first position occupies the whole segment, the corresponding total association weight is 1; for example, the first association weight of 8 is 0.2.
In this embodiment, the preset weight is typically 0.6.
In this embodiment, a segment meaning, such as 10000, refers to a transaction amount in digital form.
In this embodiment, for example, the transaction amount is normally written in capital numerals, so a transaction amount in digit form is ambiguous with respect to the guidance, and the association relation is that the two forms of the transaction amount differ. An adjustment step is therefore needed, for example converting the transaction amount from digit form to capital-numeral form; this is the scheme to be passed.
The beneficial effects of the technical scheme are as follows: and carrying out standard conversion on the second field, and standardizing the sequence of the first position, further determining the association weights of different labeling sequences based on the corresponding cutting segments, further obtaining the total association weights of the corresponding cutting segments, finally obtaining a related scheme to be passed, realizing correction on the field, and ensuring the rationality of subsequent data import.
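The weight computation of this embodiment can be reproduced with the numbers from the example (sequence 1234589999, first position 89999, preset weight 0.6). The sketch below assumes that the cutting segments are given as index ranges and that a segment's total association weight is the share of the segment covered by the first position; both are one reading of the description, not a confirmed implementation, and all function names are hypothetical.

```python
from typing import List, Tuple

PRESET_WEIGHT = 0.6  # "typically 0.6" according to the description

Segment = Tuple[int, int]  # half-open index range [start, end) in the sequence


def first_association_weight(segment: Segment) -> float:
    """Weight of one labeled element: its share of the cutting segment."""
    start, end = segment
    return 1 / (end - start)


def total_association_weight(segment: Segment, first_pos: Segment) -> float:
    """Share of the cutting segment that is covered by the first position."""
    start, end = segment
    overlap = max(0, min(end, first_pos[1]) - max(start, first_pos[0]))
    return overlap / (end - start)


def screen_first_segments(sequence: str, segments: List[Segment],
                          first_pos: Segment) -> List[str]:
    """Keep the segments whose total association weight exceeds the preset weight."""
    return [sequence[a:b] for a, b in segments
            if total_association_weight((a, b), first_pos) > PRESET_WEIGHT]


# Worked example from the description.
sequence = "1234589999"
first_pos = (5, 10)             # "89999"
segments = [(0, 5), (5, 10)]    # "12345" and "89999"
first_segments = screen_first_segments(sequence, segments, first_pos)
```

In this example each of the five elements of 89999 carries a first association weight of 0.2, the segment 89999 has a total association weight of 1, and it is the only segment that passes the screening.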
Example 8:
based on embodiment 2, performing a second analysis on the first recognition result of the first sub-layout based on the second recognition result of the second sub-layout, including:
carrying out layout analysis on the second identification result, and respectively obtaining first identification content of each line block;
extracting second identification content corresponding to the same line block from the first identification result;
analyzing the first identification content independently to obtain first semantics;
comprehensively analyzing the third identification content obtained by combining the first identification content and the second identification content according to the position to obtain second semantics;
when the similarity of the first semantic meaning and the second semantic meaning is larger than the preset similarity, reserving second identification content corresponding to the same line block, and carrying out first qualified calibration on the second identification content in the same line block;
otherwise, reserving second identification content corresponding to the same line block, and performing first disqualification calibration on the second identification content in the same line block;
determining a first number of all first unqualified calibration results and a second number of all first qualified calibration results;
calculating the influence value of the unqualified calibration result on the electronic form;
wherein m1 represents the first number; m2 represents the second number; d_{j1} represents the position weight, based on the electronic form, of the j1-th first unqualified calibration result; d_{j2} represents the position weight, based on the electronic form, of the j2-th first qualified calibration result; ln represents the natural logarithm function;
when the influence value is larger than a preset value, importing and transmitting the electronic form to the unified processing platform;
otherwise, the second sub-layout is rejected.
In this embodiment, the layout analysis, for example, has lines 1, 2, 3, where lines 1 and 2 form a line block and lines 2 and 3 form a line block.
For example, line block 11 contains first identification content A and second identification content B. The meaning of content A is determined by analyzing A alone, and a second meaning is obtained by analyzing content C, which is formed by combining A and B in position order. Whether the two meanings are the same is then decided from their similarity, which in turn determines the retention result and the calibration result.
In this embodiment, the preset value is preset, and is generally 0.2.
The beneficial effects of the technical scheme are as follows: through the analysis of the first identification content and the second identification content in the same line block, the qualified calibration quantity and the unqualified calibration quantity existing in the whole line block can be effectively determined, and then an effective basis is provided for calculating an influence value by combining the position weight of the unqualified calibration result, whether the second sub-layout is removed or not is reasonably judged, and an effective reference is provided for whether the second sub-layout is imported into a system or transmitted to a platform for processing.
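The calibration step that produces the first number m1 and the second number m2 can be sketched as below. The sketch makes several assumptions: semantic analysis is replaced by a caller-supplied `analyze` function, similarity is approximated with `difflib.SequenceMatcher`, the third identification content is formed by simple concatenation, and the preset similarity of 0.8 is hypothetical. The influence value itself is not computed here.

```python
from difflib import SequenceMatcher
from typing import Callable, List, Tuple

PRESET_SIMILARITY = 0.8  # hypothetical; the source gives no value


def calibrate_blocks(
    blocks: List[Tuple[str, str]],
    analyze: Callable[[str], str],
    preset: float = PRESET_SIMILARITY,
) -> Tuple[List[bool], int, int]:
    """Label each line block as qualified (True) or unqualified (False).

    Each block is (first identification content, second identification content).
    Returns the labels, m1 (unqualified count) and m2 (qualified count).
    """
    labels: List[bool] = []
    for first_content, second_content in blocks:
        first_semantics = analyze(first_content)
        # Third content: first and second content combined by position
        # (assumed here to be plain concatenation).
        second_semantics = analyze(first_content + second_content)
        similarity = SequenceMatcher(None, first_semantics, second_semantics).ratio()
        labels.append(similarity > preset)
    m1 = labels.count(False)
    m2 = labels.count(True)
    return labels, m1, m2


# Usage with a trivial stand-in for semantic analysis.
labels, m1, m2 = calibrate_blocks(
    [("total amount", " due"), ("a", "completely different text")],
    analyze=str.lower,
)
```

A real system would replace `str.lower` and the character-level similarity with an actual semantic model; the counting logic for m1 and m2 would stay the same.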
Example 9:
based on the embodiment 1, in the process of adding the electronic form added with the first field to the unified processing platform, the method further includes:
acquiring a source of a first field, and performing format conversion on the first field according to a data structure of the source;
and calling the processing interface of the first field and the unified processing platform, and performing second format conversion on the first field after format conversion according to an interface structure of the processing interface to obtain a field matched with the format of the unified processing platform.
In this embodiment, the source of the first field may be a mobile phone end, a computer end, etc. of the user, so that format conversion is performed on the first field according to a format conversion structure of the corresponding end, for example, the first field is converted into data in format 1;
in this embodiment, the processing interface of the unified processing platform will convert the fields of format 1 into data of format 2.
The beneficial effects of the technical scheme are as follows: through format conversion, the transmission reliability between different ends and the platform is ensured.
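The two-stage conversion of this embodiment can be sketched as a source-specific converter followed by a platform interface converter. The wire formats of the mobile and computer ends, the dictionary used as format 1, and the key names of format 2 are all assumptions for illustration; the source only states that a first and a second format conversion take place.

```python
import json
from typing import Any, Callable, Dict


def from_mobile(raw: str) -> Dict[str, Any]:
    """Stage 1 for the mobile end (assumed to send 'name=value' text)."""
    name, _, value = raw.partition("=")
    return {"name": name.strip(), "value": value.strip()}


def from_computer(raw: str) -> Dict[str, Any]:
    """Stage 1 for the computer end (assumed to send JSON)."""
    data = json.loads(raw)
    return {"name": data["name"], "value": data["value"]}


# Hypothetical registry: source -> converter producing format 1.
SOURCE_CONVERTERS: Dict[str, Callable[[str], Dict[str, Any]]] = {
    "mobile": from_mobile,
    "computer": from_computer,
}


def to_platform_format(field: Dict[str, Any]) -> Dict[str, str]:
    """Stage 2: convert format 1 into format 2 (assumed platform interface structure)."""
    return {"fieldName": str(field["name"]), "fieldValue": str(field["value"])}


def import_first_field(source: str, raw: str) -> Dict[str, str]:
    """Run both conversions so the field matches the platform format."""
    format_1 = SOURCE_CONVERTERS[source](raw)
    return to_platform_format(format_1)
```

Keeping the source converters in a registry means a new end only needs one new stage-1 function; the stage-2 conversion to the platform format is shared.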
Example 10:
the invention provides a system for data import transmission, as shown in fig. 2, comprising:
the type determining module is used for determining the target type of the target data and constructing a unified processing platform;
The electronic form processing module is used for adding a first field to the electronic form in a self-definition mode when the target type is the electronic form type, identifying and judging whether a unified processing platform needs to be added, adding the electronic form added with the first field to the unified processing platform if the unified processing platform needs to be added, acquiring supplementary information, and displaying the first field at a designated position of a detail page corresponding to the electronic form;
and the paper bill processing module is used for identifying and judging whether a unified processing platform is needed to be added when the target type is the paper bill type, and displaying a user-defined setting block to a user based on the unified processing platform if needed, and adding related second fields.
The beneficial effects of the technical scheme are as follows: through determining the data type, constructing a unified processing platform, identifying and judging different types of data, receiving different types of data based on the unified processing platform, and finally effectively guaranteeing the integrity of the data by displaying different fields, the requirement of standard importing is met to a great extent, the operation time of a user can be effectively reduced, and the working efficiency is improved.
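The division of work between the three modules can be pictured with a small dispatch sketch. The class and function names, the `scanned` flag used to tell paper bills from electronic forms, and the shape of the stored records are hypothetical; the source describes the modules only functionally.

```python
from enum import Enum
from typing import Any, Dict, List


class TargetType(Enum):
    ELECTRONIC_FORM = "electronic_form"
    PAPER_BILL = "paper_bill"


class UnifiedProcessingPlatform:
    """Minimal stand-in that only stores what is added to it."""

    def __init__(self) -> None:
        self.items: List[Dict[str, Any]] = []

    def add(self, item: Dict[str, Any]) -> None:
        self.items.append(item)


def determine_target_type(data: Dict[str, Any]) -> TargetType:
    """Type determining module (assumes a 'scanned' flag marks paper bills)."""
    return TargetType.PAPER_BILL if data.get("scanned") else TargetType.ELECTRONIC_FORM


def process(data: Dict[str, Any], platform: UnifiedProcessingPlatform,
            needs_platform: bool = True) -> TargetType:
    """Route the data to the electronic form or paper bill handling."""
    target = determine_target_type(data)
    if not needs_platform:
        return target
    if target is TargetType.ELECTRONIC_FORM:
        # Electronic form module: the form is added together with its first fields.
        platform.add({**data, "first_fields": data.get("first_fields", [])})
    else:
        # Paper bill module: setting blocks are shown and second fields collected.
        platform.add({**data, "setting_blocks": [], "second_fields": []})
    return target
```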
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (8)

1. A method for importing and uploading data, comprising:
step 1: determining a target type of target data and constructing a unified processing platform;
step 2: when the target type is an electronic form type, a first field is added to the electronic form in a self-defining way, whether a unified processing platform needs to be added is identified and judged, if so, the electronic form added with the first field is added to the unified processing platform, supplementary information is acquired, and the first field is displayed at a designated position of a detail page corresponding to the electronic form;
step 3: when the target type is a paper bill type, identifying and judging whether a unified processing platform needs to be added, if so, displaying a custom setting block to a user based on the unified processing platform, and adding related second fields;
the step of displaying the custom setting block to the user based on the unified processing platform and adding the relevant second field comprises the following steps:
when the unified processing platform receives a data bill, identifying and acquiring the bill line layout of the data bill, and matching from a layout database according to the line layout to obtain a corresponding set block set;
sequentially displaying custom setting blocks based on the setting block set, and setting addition guidance in the custom setting blocks;
when a user adds a second field in the corresponding custom setting block, an addition request is sent to the unified processing platform, and when the request passes, the addition guidance of the corresponding custom setting block is automatically amplified and displayed;
meanwhile, checking the second field added by the user in a self-definition mode according to the block attribute of the corresponding self-definition setting block;
if the check passes, reserving the added second field;
if the check fails, acquiring failed information, pulling out a display block from the corresponding custom setting block, and displaying the failed information;
establishing an association relation between the failed information and the added guidance to obtain a scheme to be passed, marking a re-guidance step in the added guidance, and numbering and displaying the sequence of each re-guidance step;
the establishing the association relation between the failed information and the added guidance to obtain a scheme to be passed comprises the following steps:
acquiring a first position of the failed information based on a second field, performing sequence standard conversion on the second field to obtain a first information sequence, and marking a sequence corresponding to the first position in the first information sequence;
Determining a cutting segment of each labeling sequence based on the first information sequence, and determining a first association weight of the corresponding labeling sequence based on the determined cutting segment;
acquiring the total association weight of each cutting segment, and screening a first segment with the total association weight greater than a preset weight;
the method comprises the steps of respectively obtaining segment meanings of each first segment, determining ambiguity information of the segment meanings and guiding meanings corresponding to the added guiding, and establishing an association relation according to ambiguity degree;
and rearranging and obtaining an adjustment step based on all the ambiguity information and the association relation, and obtaining a scheme to be passed.
2. The method of claim 1, wherein custom adding a first field to the electronic form and identifying and determining whether a unified processing platform needs to be added comprises:
scanning the electronic form to obtain form arrangement of the electronic form;
identifying the form arrangement, and according to the identification result, performing a first judgment on the identification definition of each form character in the form arrangement and performing a second judgment on the identification integrity of each form character;
determining character recognition qualification of the corresponding form characters according to the first judging result and the second judging result;
Locking the form characters with the character recognition qualification more than the preset qualification, obtaining a first sub-layout of the form arrangement according to a locking result, and obtaining a second sub-layout of the form arrangement according to a locking result of the remaining unqualified form characters;
performing first analysis on the layout position of the first sub-layout based on the second sub-layout, and performing second analysis on the first recognition result of the first sub-layout based on the second recognition result of the second sub-layout;
judging whether the second sub-layout can be eliminated according to the first analysis result and the second analysis result;
if yes, the second sub-layout is removed, and the first identification result of the first sub-layout is reserved and stored;
and if not, transmitting the electronic form import to the unified processing platform.
3. The method of uploading data import according to claim 2, wherein determining character recognition eligibility of the corresponding form character according to the first determination result and the second determination result comprises:
judging whether the corresponding form character is an electronic character, if so, determining eligibility according to the following mode, wherein the method comprises the following steps of:
wherein s_{i1,0} represents the standard line of the i1-th stroke of the corresponding electronic character before the electronic form is recognized; s_{i1,1} represents the recognition line of the i1-th stroke of the corresponding electronic character after the electronic form is recognized; oc 2 represents the line conversion coefficient for the electronic form; e represents the exponential function; the sharpness coefficient of the corresponding electronic character takes the value 1 when the ratio of the recognition sharpness S1, determined based on the second judgment result, to the preset sharpness S0 lies within (0.4, 1], and takes the value 0 when that ratio lies within (0, 0.4]; the recognition integrity of the i1-th stroke is determined by the second judgment result; Y represents the recognition qualification of the corresponding electronic character; n1 represents the total number of strokes of the corresponding electronic character.
4. The method of data import upload of claim 3, wherein determining whether the corresponding form character is an electronic character, and if not and at this time is a handwritten character, determining eligibility comprises:
wherein sim(r0, r1) represents the outline similarity between the corresponding handwritten character r0 before recognition and the character r1 after recognition of the electronic form; sim(r_{i2,0}, r_{i2,1}) represents the similarity between the handwriting line r_{i2,0} of the i2-th stroke of the corresponding handwritten character before the electronic form is recognized and the recognition line r_{i2,1} of the i2-th stroke after recognition; n2 represents the total number of strokes of the corresponding handwritten character before the electronic form is recognized; Y1 represents the recognition eligibility of the corresponding handwritten character.
5. The method for importing and uploading data according to claim 1, wherein adding the electronic form added with the first field to the unified processing platform, obtaining the supplementary information, and displaying the first field at a specified position of a detail page corresponding to the electronic form, includes:
acquiring supplementary information according to each first field;
acquiring keywords of the supplemental information, and retrieving a pointing position matched with the keywords based on the unified processing platform;
and placing a first field matched with the keyword at the designated position for display.
6. The method of data import upload of claim 2, wherein performing a second analysis of the first recognition result of the first sub-layout based on the second recognition result of the second sub-layout comprises:
carrying out layout analysis on the second identification result, and respectively obtaining first identification content of each line block;
Extracting second identification content corresponding to the same line block from the first identification result;
analyzing the first identification content independently to obtain first semantics;
comprehensively analyzing the third identification content obtained by combining the first identification content and the second identification content according to the position to obtain second semantics;
when the similarity of the first semantic meaning and the second semantic meaning is larger than the preset similarity, reserving second identification content corresponding to the same line block, and carrying out first qualified calibration on the second identification content in the same line block;
otherwise, reserving second identification content corresponding to the same line block, and performing first disqualification calibration on the second identification content in the same line block;
determining a first number of all first unqualified calibration results and a second number of all first qualified calibration results;
calculating the influence value of the unqualified calibration result on the electronic form;
wherein m1 represents the first number; m2 represents the second number; d_{j1} represents the position weight, based on the electronic form, of the j1-th first unqualified calibration result; d_{j2} represents the position weight, based on the electronic form, of the j2-th first qualified calibration result; ln represents the natural logarithm function;
When the influence value is larger than a preset value, importing and transmitting the electronic form to the unified processing platform;
otherwise, the second sub-layout is rejected.
7. The method for importing and uploading data according to claim 1, wherein adding the electronic form added with the first field to the unified processing platform further comprises:
acquiring a source of a first field, and performing format conversion on the first field according to a data structure of the source;
and calling the processing interface of the first field and the unified processing platform, and performing second format conversion on the first field after format conversion according to an interface structure of the processing interface to obtain a field matched with the format of the unified processing platform.
8. A system for data import upload, comprising:
the type determining module is used for determining the target type of the target data and constructing a unified processing platform;
the electronic form processing module is used for adding a first field to the electronic form in a self-definition mode when the target type is the electronic form type, identifying and judging whether a unified processing platform needs to be added, adding the electronic form added with the first field to the unified processing platform if the unified processing platform needs to be added, acquiring supplementary information, and displaying the first field at a designated position of a detail page corresponding to the electronic form;
The paper bill processing module is used for identifying and judging whether a unified processing platform is needed to be added when the target type is the paper bill type, if so, displaying a user-defined setting block to a user based on the unified processing platform, and adding related second fields; the step of displaying the custom setting block to the user based on the unified processing platform and adding the relevant second field comprises the following steps: when the unified processing platform receives a data bill, identifying and acquiring the bill line layout of the data bill, and matching from a layout database according to the line layout to obtain a corresponding set block set; sequentially setting and adding guidance to the display custom setting blocks based on the setting block set; when a user adds a second field in the corresponding custom setting block, an addition request is sent to the unified processing platform, and when the request passes, the addition guidance of the corresponding custom setting block is automatically amplified and displayed; meanwhile, checking the second field added by the user in a self-definition mode according to the block attribute of the corresponding self-definition setting block; if the check passes, reserving the added second field; if the check fails, acquiring failed information, pulling out a display block from the corresponding custom setting block, and displaying the failed information; establishing an association relation between the failed information and the added guidance to obtain a scheme to be passed, marking a re-guidance step in the added guidance, and numbering and displaying the sequence of each re-guidance step; the establishing the association relation between the failed information and the added guidance to obtain a scheme to be passed comprises the following steps: acquiring a first position of the failed information based on a second field, performing sequence standard conversion on the 
second field to obtain a first information sequence, and marking a sequence corresponding to the first position in the first information sequence; determining a cutting segment of each labeling sequence based on the first information sequence, and determining a first association weight of the corresponding labeling sequence based on the determined cutting segment; acquiring the total association weight of each cutting segment, and screening a first segment with the total association weight greater than a preset weight; the method comprises the steps of respectively obtaining segment meanings of each first segment, determining ambiguity information of the segment meanings and guiding meanings corresponding to the added guiding, and establishing an association relation according to ambiguity degree; and rearranging and obtaining an adjustment step based on all the ambiguity information and the association relation, and obtaining a scheme to be passed.
CN202211042463.5A 2022-08-29 2022-08-29 Method and system for importing and uploading data Active CN115640952B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211042463.5A CN115640952B (en) 2022-08-29 2022-08-29 Method and system for importing and uploading data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211042463.5A CN115640952B (en) 2022-08-29 2022-08-29 Method and system for importing and uploading data

Publications (2)

Publication Number Publication Date
CN115640952A CN115640952A (en) 2023-01-24
CN115640952B true CN115640952B (en) 2023-09-26

Family

ID=84939783

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211042463.5A Active CN115640952B (en) 2022-08-29 2022-08-29 Method and system for importing and uploading data

Country Status (1)

Country Link
CN (1) CN115640952B (en)

Also Published As

Publication number Publication date
CN115640952A (en) 2023-01-24

Similar Documents

Publication Title
US9639751B2 (en) Property record document data verification systems and methods
US11232300B2 (en) System and method for automatic detection and verification of optical character recognition data
US5555101A (en) Forms creation and interpretation system
US8064703B2 (en) Property record document data validation systems and methods
US5251273A (en) Data processing system and method for sequentially repairing character recognition errors for scanned images of document forms
US10318804B2 (en) System and method for data extraction and searching
US20050289182A1 (en) Document management system with enhanced intelligent document recognition capabilities
CN111582169B (en) Image recognition data error correction method, device, computer equipment and storage medium
US10482170B2 (en) User interface for contextual document recognition
CN112508011A (en) OCR (optical character recognition) method and device based on neural network
CN113569863B (en) Document checking method, system, electronic equipment and storage medium
CN111858977B (en) Bill information acquisition method, device, computer equipment and storage medium
CN111191153A (en) Information technology consultation service display device
US11080808B2 (en) Automatically attaching optical character recognition data to images
CN116798061A (en) Bill auditing and identifying method, device, terminal and storage medium
CN115640952B (en) Method and system for importing and uploading data
US20070217691A1 (en) Property record document title determination systems and methods
CN112348022B (en) Free-form document identification method based on deep learning
CN111241955B (en) Bill information extraction method and system
US11989693B2 (en) Image-processing device, image processing method, and storage medium on which program is stored
CN117688162B (en) Full text retrieval method and system based on OCR (optical character recognition)
KR102561878B1 (en) Ai blue ocr reading system and method based on machine learning
CN114091876A (en) Method and tool for automatically checking and cutting converted bank receipt
CN116152480A (en) Data extraction and structuring processing system and implementation method
CN116205672A (en) Client information management method, device, computer equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant