CN107870827A - Data quality control method and device based on verification - Google Patents

Data quality control method and device based on verification Download PDF

Info

Publication number
CN107870827A
CN107870827A CN201711083526.0A CN201711083526A CN107870827A CN 107870827 A CN107870827 A CN 107870827A CN 201711083526 A CN201711083526 A CN 201711083526A CN 107870827 A CN107870827 A CN 107870827A
Authority
CN
China
Prior art keywords
verification
data
full dose
data record
record
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201711083526.0A
Other languages
Chinese (zh)
Other versions
CN107870827B (en
Inventor
杨洁
吴永华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Bank of China Ltd
Original Assignee
Bank of China Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Bank of China Ltd filed Critical Bank of China Ltd
Priority to CN201711083526.0A priority Critical patent/CN107870827B/en
Publication of CN107870827A publication Critical patent/CN107870827A/en
Application granted granted Critical
Publication of CN107870827B publication Critical patent/CN107870827B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0751Error or fault detection not based on redundancy

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Automatic Analysis And Handling Materials Therefor (AREA)
  • Stored Programmes (AREA)

Abstract

The present invention discloses a kind of data quality control method and device based on verification, and methods described includes:Obtain full dose data;Checking parameter table corresponding to the full dose data is read, passes through combination verification logic generation the first verification rule set;The full dose data are verified with the described first verification rule set, obtain the first result set;Data record corresponding to check errors information in first result set is deposited to error library;The filter operation instruction is responded, reads the verification type parameter in the data record and the checking parameter table in the error library, it is determined that corresponding to the data record of certainty verification, generates certainty error library;The full dose data are screened with the certainty error library, using the full dose data Jing Guo the screening as the data for reporting and submitting storehouse.Technical scheme provided by the invention, it can be good at being applied to the ageing system for requiring that height, data volume are big and verification rule is complicated.

Description

Data quality control method and device based on verification
Technical field
The present invention relates to software technology field, more particularly to a kind of data quality control method and device based on verification.
Background technology
For traditional verification mode in the complicated verification rule of processing, the scheme of generally use is desk checking or school one by one Test.Desk checking refers to allow the attendant for being familiar with system check rule according to data cases voluntarily amended record, that is, by manpower Error in data investigation is carried out, the personnel requirement safeguarded to manpower is higher.Verify, refer to every verification rule is single one by one one by one Only write-in program, it is desirable to which complete verification rule is converted into code statement and write in a program by developer, so as to realize verification rule Then relative complex data quality control.
And requirement of the desk checking to attendant is very high, it is necessary to very familiar every verification rule, and workload is very big, It is less efficient, it can not also ensure the accuracy of final data, usually occur needing situation about doing over again, misarrangement difficulty height.The opposing party Face, very low using the running efficiency of system verified one by one, verification is all a tables of data read-write operation every time, and IO expenses are big, Reaction speed is slow, can not realize the timely feedback of check results, also more difficult for developer, it is necessary to carry out a large amount of The duplication of labour, the bit error rate is high, also influences the accuracy of final data.
Therefore, in existing technology, it is low low with the accuracy rate of data check efficiency to be all present, so as to can not be effectively to data The problem of quality is controlled, so as to which the ageing system for requiring that height, data volume are big and verification rule is complicated can not be applied to.
The content of the invention
In view of this, the invention provides a kind of data quality control method and device based on verification, can effectively solve Certainly efficiency present in prior art is low low with the accuracy rate of data check, so as to what effectively can not be controlled to the quality of data Problem, it can be good at being applied to the ageing system for requiring that height, data volume are big and verification rule is complicated.
To achieve the above object, the present invention provides following technical scheme:
A kind of data quality control method based on verification, including:
Full dose data are obtained, the full dose data include data record;
Checking parameter table corresponding to the full dose data is read, passes through combination verification logic generation the first verification rule set;
The full dose data are verified with the described first verification rule set, obtain the first result set;
Data record corresponding to check errors information in first result set is deposited to error library;
Obtain filter operation instruction;
The filter operation instruction is responded, reads the school in the data record and the checking parameter table in the error library Type parameter is tested, it is determined that corresponding to the data record of certainty verification, generates certainty error library;
The full dose data are screened with the certainty error library, using the full dose data Jing Guo the screening as reporting and submitting storehouse Data.
Further, checking parameter table corresponding to the full dose data is read, the first school is generated by combination verification logic Rule set is tested, including:
Read the checking mark number being located at corresponding to the full dose data in checking parameter table, check information description, verification Object table, verification object field, the configuration of verification sentence and verification switch, pass through combination verification logic generation the first verification rule Collection.
Further, the full dose data arrange including data item standard, described to verify rule set to described with described first Full dose data are verified, and obtain the first result set, including:
The full dose data are verified with the described first verification rule set, obtain including data item standard row, verification First result set of identification number and verification state;The verification state includes verification correct information and check errors information.
Further, data record corresponding to the check errors information by first result set is deposited to mistake Storehouse, including:
By the data item standard row corresponding with the check errors information difference and check mark number, and the verification The data record that data item standard row corresponding to error message navigate to is deposited to error library.
Further, before the acquisition filter operation instruction, in addition to:
Show the data record in error library.
Further, after the data record shown in error library, before the acquisition filter operation instruction, also wrap Include:
Obtain the data record of user's amended record;The data record of the amended record is relative with the data record in the error library Should;
The data record of the amended record is write into full dose data, the data record of renewal is made marks;
Pass through the checking parameter table and marked data record combination verification logic, the verification rule set of generation second;
Marked data record is verified with the described second verification rule set, obtains the second result set;
The data in the error library are deleted, by data record corresponding to the check errors information in second result set Deposit to error library.
A kind of data quality control device based on verification, including:
First acquisition module, for obtaining full dose data, the full dose data include data record;
Read module, for reading checking parameter table corresponding to the full dose data, pass through combination verification logic generation the One verification rule set;
First correction verification module, for being verified with the described first verification rule set to the full dose data, obtain first Result set;
Storage module, for data record corresponding to the check errors information in first result set to be deposited to mistake Storehouse;
Second acquisition module, for obtaining filter operation instruction;
First generation module, for responding the filter operation instruction, read the data record in the error library and institute The verification type parameter in checking parameter table is stated, it is determined that corresponding to the data record of certainty verification, generates certainty error library;
Filtering module, for screening the full dose data with the certainty error library, by the full dose Jing Guo the screening Data are as the data for reporting and submitting storehouse.
Further, the read module is specifically used for:
Read the checking mark number being located at corresponding to the full dose data in checking parameter table, check information description, verification Object table, verification object field, the configuration of verification sentence and verification switch, pass through combination verification logic generation the first verification rule Collection.
Further, the full dose data arrange including data item standard, and first correction verification module is specifically used for:
The full dose data are verified with the described first verification rule set, obtain including data item standard row, verification First result set of identification number and verification state;The verification state includes verification correct information and check errors information.
Further, the storage module is specifically used for:
By the data item standard row corresponding with the check errors information difference and check mark number, and the verification The data record that data item standard row corresponding to error message navigate to is deposited to error library.
Further, in addition to:
Display module, for showing the data record in error library.
Further, in addition to:
3rd acquisition module, for obtaining the data record of user's amended record;The data record of the amended record and the mistake Data record in storehouse is corresponding;
Mark module, for the data record of the amended record to be write into full dose data, the data record of renewal is made marks;
Second generation module, for passing through the checking parameter table and marked data record combination verification logic, life Into the second verification rule set;
Second correction verification module, for being verified with the described second verification rule set to marked data record, obtain Second result set;
Update module, for deleting the data in the error library, by the check errors information in second result set Corresponding data record is deposited to error library.
Understand that compared with prior art, the invention provides a kind of data based on verification via above-mentioned technical scheme Method of quality control and device.Technical scheme provided by the invention, full dose data are obtained first, wherein, the full dose packet Data record is included, then reads checking parameter table corresponding to the full dose data, passes through the verification of combination verification logic generation first Rule set, disposably all full dose data are verified with the described first verification rule set, obtain the first result set, only once I/O operation, relative to the mode verified one by one in the prior art, IO expenses can be greatly lowered, effectively improve efficiency, meanwhile, Relative to the mode of desk checking, efficiency can be also effectively improved, then by the check errors information in first result set Corresponding data record is deposited to error library, obtains filter operation instruction, is responded the filter operation instruction, is read the mistake The verification type parameter in data record and the checking parameter table in storehouse, it is determined that remembering corresponding to the data of certainty verification Record, certainty error library is generated, the full dose data are finally screened with the certainty error library, by by the complete of the screening Data are measured as the data for reporting and submitting storehouse, can in filter false storehouse certainty mistake data record, in addition, on the one hand, due to Technical scheme provided by the invention performs automatic Verification and not relies primarily on desk checking, can avoid relying primarily on artificial school The problem of testing easy error, on the other hand, technical scheme provided by the invention is relative to prior art, it is not necessary to which developer enters The substantial amounts of duplication of labour of row, can reduce the bit error rate, and so as to improve the accuracy of data check, therefore, the present invention provides Technical scheme can effectively improve the accuracy rate of data check, so as to be effectively controlled to the quality of data.That is this hair The technical scheme of bright offer, it can effectively solve the problem that efficiency present in prior art is low low with the accuracy rate of data check, so as to The problem of effectively quality of data can not being controlled, it can be good at being applied to ageing requiring that high, data volume is big and school The system for testing regular complexity.
Brief description of the drawings
In order to illustrate more clearly about the embodiment of the present invention or technical scheme of the prior art, below will be to embodiment or existing There is the required accompanying drawing used in technology description to be briefly described, it should be apparent that, drawings in the following description are only this The embodiment of invention, for those of ordinary skill in the art, on the premise of not paying creative work, can also basis The accompanying drawing of offer obtains other accompanying drawings.
Fig. 1 is a kind of flow chart of the data quality control method based on verification provided in an embodiment of the present invention;
Fig. 2 is the flow chart of another data quality control method based on verification provided in an embodiment of the present invention;
Fig. 3 is a kind of structure chart of the data quality control device based on verification provided in an embodiment of the present invention;
Fig. 4 is the structure chart of another data quality control device based on verification provided in an embodiment of the present invention.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete Site preparation describes, it is clear that described embodiment is only part of the embodiment of the present invention, rather than whole embodiments.It is based on Embodiment in the present invention, those of ordinary skill in the art are obtained every other under the premise of creative work is not made Embodiment, belong to the scope of protection of the invention.
In order to facilitate the understanding of the purposes, features and advantages of the present invention, it is below in conjunction with the accompanying drawings and specific real Applying mode, the present invention is further detailed explanation.
Embodiment
Referring to Fig. 1, Fig. 1 is a kind of flow of the data quality control method based on verification provided in an embodiment of the present invention Figure.As shown in figure 1, this method includes:
Step S101, obtain full dose data;
Optionally, obtain and lifted automatically by batch program or initiated by attendant, be passed to verification data area in preparation Newest full dose data.Wherein, the full dose data are used to provide data to be reported and submitted, including multiple tables of data, are wrapped in tables of data Include data record.
Optionally, the full dose data include data item standard row (DATA_ID) and data item remaining information.Data item mark Quasi- row include a series of fields, for positioning the entry of specific data record in different pieces of information table;Remaining packet of data item contains Specific data record details.
Step S102, checking parameter table corresponding to the full dose data is read, the first school is generated by combination verification logic Test rule set;
Optionally, the checking parameter table is used to deposit verification Rule Information and relevant configuration, including:
Checking mark number (CHK_ID), for positioning the identification number of every verification rule;
Check information describes, for being expressly recited specific verification rule;
Verification object table, the appropriate table for clearly current verification rule;
Verification object field, for field in the clearly current regular applicable table of verification;
Sentence configuration is verified, for check information description to be translated into code, is called during to perform checking routine;
Type is verified, for setting the severity level of verification, including " certainty mistake " and " indicative mistake ";
Verification switch, for controlling whether every verification rule comes into force;This field realizes jointly with " verification type " Flexible configuration of the attendant to verification rule.
Optionally, the step S102 includes:
Read the checking mark number being located at corresponding to the full dose data in checking parameter table, check information description, verification Object table, verification object field, the configuration of verification sentence and verification switch, pass through combination verification logic generation the first verification rule Collection.
Step S103, the full dose data are verified with the described first verification rule set, obtain the first result set;
Optionally, the step S103 includes:
The full dose data are verified with the described first verification rule set, obtain including data item standard row, verification First result set of identification number and verification state;The verification state includes verification correct information and check errors information.
Specifically, illustrating the step S102 and step S103 in order to clearer, it is exemplified below:
For example, thering are two fields a and b to need to be verified and (verification rule be present) in tables of data A, saying for rule is verified It is bright to be shown in Table 1:
Table 1
After rule supplying checking parameter table will be verified, the extraction verification rule set in units of checking parameter table, then data The rule set of Table A is exactly two rules, is shown in Table 2:
Table 2
After being extracted the verification rule set of checking parameter table, the content in verification rule set combination full dose data is given birth to Into result set.
Consideration based on system high efficiency, is designed as a checking parameter table, i.e., only needs an I/O operation.Therefore, exist Configuration parameter is made into ranks by unpivot functions during generated statement mutually to turn, verification object identical verification rule set is put into Numerical value extraction is carried out to source table in one DML (data manipulation language, DML) sentence, realized Disposable unified whole rules to the table are handled, that is, realize every physical table only once I/O operation, so as to effectively reduce The IO expenses of large data volume table, effectively improve running efficiency of system.
Verification rule set is evened up first, is associated with full dose data, it is assumed that there are two records in tables of data A, it is false The situation for determining field a and field b is as shown in table 3:
Table 3
Data item standard arranges Field a Field b
Record 1 1 1
Record 2 2 22
It can be seen that record 1 meets verification rule 1, but verification rule 2 is not met;2 two verification rules are recorded to meet.Will It verifies rule set and evens up rear form such as table 4:
Table 4
Data item standard arranges RULE-01 RULE-02
Record 1 Correctly Mistake
Record 2 Correctly Correctly
Because after verification rule set is arranged in into the data record of full dose data in the form of fields, in this process Full dose data, which are only needed once to read, to be completed, and effectively increase operational efficiency.Then by the data record evened up to verify Identification number is that dimension carries out transposition, is formed result set, and result set can be with " data item standard row " " checking mark number " " verification The form displaying of three abstract dimensions of state ".Take record 1 and record 2, example such as table 5:
Table 5
Data item standard arranges Checking mark number Verification state
Record 1 RULE-01 Correctly
Record 1 RULE-02 Mistake
Record 2 RULE-01 Correctly
Record 2 RULE-02 Correctly
Result set can intuitively embody the result correctness of the corresponding every verification of record per data in table.
Step S104, data record corresponding to the check errors information in first result set is deposited to error library;
Optionally, the step S104 includes:
By the data item standard row corresponding with the check errors information difference and check mark number, and the verification The data record that data item standard row corresponding to error message navigate to is deposited to error library.
Specifically, still continuing to use above-mentioned example, then will wherein verify state extracts for the data record of mistake, is just formed Error library.Final error library only preserves the data record of one [record 1- verifications rule 2] in above-mentioned example.
Step S105, obtain filter operation instruction;
Specifically, obtain the filter operation instruction of attendant's input.
Step S106, the filter operation instruction is responded, read the data record in the error library and the verification ginseng Verification type parameter in number table, it is determined that corresponding to the data record of certainty verification, generate certainty error library;
Specifically, the verification type parameter in the data record and the checking parameter table in the error library is read, really The fixed data record corresponding to " certainty mistake ", extracts the data record corresponding to " certainty mistake ", and generation certainty is wrong Storehouse by mistake.
Step S107, the full dose data are screened with the certainty error library, by the full dose data Jing Guo the screening As the data for reporting and submitting storehouse;
Specifically, because verification rule seriousness it is generally different, be divided into seriousness higher " certainty verification " and Seriousness is relatively low, only needs to make " the indicative verification " of prompting, that is, the data record for certainty verification class mistake occur can not be reported Send, the data record for indicative verification class mistake only occur can report and submit.Therefore in screening process, system can be by checking parameter Table combination error library, if any data record one or more than one certainty check errors be present if need it is to be filtered, if Data record is correct or only exists indicative check errors then by filtering, and the data record by filtering is placed on and reports and submits storehouse In in case report and submit at any time, reach the target of data filtering.
Technical scheme provided in an embodiment of the present invention, full dose data are obtained first, wherein, the full dose data include data Record, then reads checking parameter table corresponding to the full dose data, and rule set is verified by combination verification logic generation first, Disposably all full dose data are verified with the described first verification rule set, obtain the first result set, only once I/O operation, Relative to the mode verified one by one in the prior art, IO expenses can be greatly lowered, effectively improve efficiency, meanwhile, relative to The mode of desk checking, efficiency can be also effectively improved, then by corresponding to the check errors information in first result set Data record is deposited to error library, obtains filter operation instruction, is responded the filter operation instruction, is read in the error library Verification type parameter in data record and the checking parameter table, it is determined that corresponding to the data record of certainty verification, generation Certainty error library, the full dose data are finally screened with the certainty error library, by the full dose data Jing Guo the screening As the data for reporting and submitting storehouse, can in filter false storehouse certainty mistake data record, in addition, on the one hand, due to the present invention The technical scheme that embodiment provides performs automatic Verification and not relies primarily on desk checking, can avoid relying primarily on artificial school The problem of testing easy error, on the other hand, technical scheme provided in an embodiment of the present invention is relative to prior art, it is not necessary to develops Personnel carry out the substantial amounts of duplication of labour, the bit error rate can be reduced, so as to improve the accuracy of data check, therefore, this hair The technical scheme that bright embodiment provides can effectively improve the accuracy rate of data check, so as to effectively be carried out to the quality of data Control.Technical scheme i.e. provided in an embodiment of the present invention, it can effectively solve the problem that efficiency present in prior art is low and data school The accuracy rate tested is low, so as to the problem of effectively can not being controlled the quality of data, can be good at being applied to ageing requirement System high, data volume is big and verification rule is complicated.
In addition, technical scheme provided in an embodiment of the present invention, verifies rule set, matching verifies object and implements school in generation During testing, the operation such as extraction, matching is by the way of parameter configuration.On the one hand, matched somebody with somebody by the form of checking parameter table Verification rule is put can timely and conveniently to make the renewal of simplicity in verification rule change, can effectively save subsequent development Cost;On the other hand, it can also realize that attendant easily carries out autonomous control to verification rule, so as to agree with practical business Development condition adjusts, and meets some individual demands.
Optionally, the data quality control method based on verification that another embodiment of the present invention provides, the step Before S105, in addition to:
Show the data record in error library.
Specifically, the data record in error library is shown into attendant, the personnel that can maintain easily check error library In data record, so as to decide whether initiate filter operation, be determined whether input filter operation instruction.
Optionally, if the data record in attendant's decision error storehouse is more, filter operation can't temporarily be initiated When, it is necessary to corresponding to the data record in error library, the new data record of correspondence position amended record in full dose data, and to it Verification is re-started, the embodiment of the present invention is defined as increment verification.And it is base to define the step S101~step S104 Plinth verifies.
Referring to Fig. 2, Fig. 2 is another data quality control method based on verification provided in an embodiment of the present invention Flow chart.Another data quality control method based on verification provided in an embodiment of the present invention verifies comprising increment, such as Fig. 2 Shown, this method includes:
Step S201, obtain full dose data;
Optionally, obtain and lifted automatically by batch program or initiated by attendant, be passed to verification data area in preparation Newest full dose data.Wherein, the full dose data are used to provide data to be reported and submitted, including multiple tables of data, are wrapped in tables of data Include data record.
Optionally, the full dose data include data item standard row (DATA_ID) and data item remaining information.Data item mark Quasi- row include a series of fields, for positioning the entry of specific data record in different pieces of information table;Remaining packet of data item contains Specific data record details.
Step S202, checking parameter table corresponding to the full dose data is read, the first school is generated by combination verification logic Test rule set;
Optionally, the checking parameter table is used to deposit verification Rule Information and relevant configuration, including:
Checking mark number (CHK_ID), for positioning the identification number of every verification rule;
Check information describes, for being expressly recited specific verification rule;
Verification object table, the appropriate table for clearly current verification rule;
Verification object field, for field in the clearly current regular applicable table of verification;
Sentence configuration is verified, for check information description to be translated into code, is called during to perform checking routine;
Type is verified, for setting the severity level of verification, including " certainty mistake " and " indicative mistake ";
Verification switch, for controlling whether every verification rule comes into force;This field realizes jointly with " verification type " Flexible configuration of the attendant to verification rule.
Optionally, the step S202 includes:
Read the checking mark number being located at corresponding to the full dose data in checking parameter table, check information description, verification Object table, verification object field, the configuration of verification sentence and verification switch, pass through combination verification logic generation the first verification rule Collection.
Step S203, the full dose data are verified with the described first verification rule set, obtain the first result set;
Optionally, the step S203 includes:
The full dose data are verified with the described first verification rule set, obtain including data item standard row, verification First result set of identification number and verification state;The verification state includes verification correct information and check errors information.
Step S204, data record corresponding to the check errors information in first result set is deposited to error library;
Optionally, the step S204 includes:
By the data item standard row corresponding with the check errors information difference and check mark number, and the verification The data record that data item standard row corresponding to error message navigate to is deposited to error library.
Step S205, show the data record in error library;
Step S206, obtain the data record of user's amended record;
Specifically, the data record of the amended record is corresponding with the data record in the error library.Such as in error library Data record be A tables in record 1, then obtain user's amended record A tables in record 1.
Step S207, the data record of the amended record is write into full dose data, the data record of renewal is made marks;
Specifically, the data record of the amended record is write into full dose data, as the full dose data after renewal.To renewal Data record makes marks, and can facilitate the data record by marking fast positioning to update.
Step S208, by the checking parameter table and marked data record combination verification logic, generate the second school Test rule set;
Step S209, marked data record is verified with the described second verification rule set, obtains the second result Collection;
Step S210, the data in the error library are deleted, the check errors information in second result set is corresponding Data record deposit to error library;
It should be noted that the concrete methods of realizing of the step S208~step S210 and the step S102~step S104 concrete methods of realizing is identical, and difference is:On the one hand, it is necessary to marked in the step S208~step S210 Data record substitutes full dose data and carries out subsequent check, and verifying work amount is much smaller, and efficiency is higher;On the other hand, it is necessary to delete Except the data in former error library, data record corresponding to the check errors information in second result set is deposited to mistake Storehouse.As for other specific implementation parts of the step S208~step S210, the step S102~step S104 can refer to Specific implementation part, the present embodiment repeats no more.
Step S211, obtain filter operation instruction;
Specifically, obtain the filter operation instruction of attendant's input.
Step S212, the filter operation instruction is responded, read the data record in the error library and the verification ginseng Verification type parameter in number table, it is determined that corresponding to the data record of certainty verification, generate certainty error library;
Specifically, the verification type parameter in the data record and the checking parameter table in the error library is read, really The fixed data record corresponding to " certainty mistake ", extracts the data record corresponding to " certainty mistake ", and generation certainty is wrong Storehouse by mistake.
Step S213, the full dose data are screened with the certainty error library, by the full dose data Jing Guo the screening As the data for reporting and submitting storehouse;
Specifically, because verification rule seriousness it is generally different, be divided into seriousness higher " certainty verification " and Seriousness is relatively low, only needs to make " the indicative verification " of prompting, that is, the data record for certainty verification class mistake occur can not be reported Send, the data record for indicative verification class mistake only occur can report and submit.Therefore in screening process, system can be by checking parameter Table combination error library, if any data record one or more than one certainty check errors be present if need it is to be filtered, if Data record is correct or only exists indicative check errors then by filtering, and the data record by filtering is placed on and reports and submits storehouse In in case report and submit at any time, reach the target of data filtering.
Technical scheme provided in an embodiment of the present invention, the mode of increment verification is with the addition of, maintain easily personnel's amended record verification State is wrong data record, and marks the data record of amended record, and labeled data record is only verified in subsequent check, Full dose data need not be re-started with verification, the data volume of verification is smaller, so as to which operational efficiency is higher.
In order to illustrate technical scheme provided by the invention more fully hereinafter, school is based on corresponding to provided in an embodiment of the present invention The data quality control method tested, the present invention disclose a kind of data quality control device based on verification.
Referring to Fig. 3, Fig. 3 is a kind of structure of the data quality control device based on verification provided in an embodiment of the present invention Figure.As shown in figure 3, the device includes:
First acquisition module 301, for obtaining full dose data, the full dose data include data record;
Read module 302, for checking parameter table corresponding to reading the full dose data, generated by combination verification logic First verification rule set;
Optionally, the read module 302 is specifically used for:
Read the checking mark number being located at corresponding to the full dose data in checking parameter table, check information description, verification Object table, verification object field, the configuration of verification sentence and verification switch, pass through combination verification logic generation the first verification rule Collection.
First correction verification module 303, for being verified with the described first verification rule set to the full dose data, obtain the One result set;
Optionally, the full dose data arrange including data item standard, and first correction verification module 303 is specifically used for:
The full dose data are verified with the described first verification rule set, obtain including data item standard row, verification First result set of identification number and verification state;The verification state includes verification correct information and check errors information.
Storage module 304, for by data record corresponding to the check errors information in first result set deposit to Error library;
Optionally, the storage module 304 is specifically used for:
By the data item standard row corresponding with the check errors information difference and check mark number, and the verification The data record that data item standard row corresponding to error message navigate to is deposited to error library.
Second acquisition module 305, for obtaining filter operation instruction;
First generation module 306, for responding filter operation instruction, read data record in the error library and Verification type parameter in the checking parameter table, it is determined that corresponding to the data record of certainty verification, generate certainty mistake Storehouse;
Filtering module 307, for screening the full dose data with the certainty error library, by by the complete of the screening Data are measured as the data for reporting and submitting storehouse.
Using the data quality control device provided in an embodiment of the present invention based on verification, data check can be effectively improved Accuracy rate, so as to be effectively controlled to the quality of data.Data matter i.e. provided in an embodiment of the present invention based on verification Amount control device, it can effectively solve the problem that efficiency present in prior art is low low with the accuracy rate of data check, so as to have The problem of effect is controlled the quality of data, it can be good at requiring that height, data volume are big and verification is regular suitable for ageing Complicated system.
In addition, the data quality control device provided in an embodiment of the present invention based on verification, rule set is verified in generation, With verification object and implement during verifying, the operation such as extraction, matching is by the way of parameter configuration.On the one hand, pass through The form configuration verification rule of checking parameter table can timely and conveniently make the renewal of simplicity, energy in verification rule change It is enough effectively to save subsequent development cost;On the other hand, it can also realize that attendant is easily carried out from master control to verification rule System, adjusts so as to agree with practical business development condition, meets some individual demands.
Optionally, the data quality control device based on verification that another embodiment of the present invention provides, in addition to:
Display module, for showing the data record in error library.
Optionally, if the data record in attendant's decision error storehouse is more, filter operation can't temporarily be initiated When, it is necessary to corresponding to the data record in error library, the new data record of correspondence position amended record in full dose data, and to it Verification is re-started, the embodiment of the present invention is defined as increment verification.And define first acquisition module, read module, Verified based on the part of first correction verification module and storage module.
Referring to Fig. 4, Fig. 4 is another data quality control device based on verification provided in an embodiment of the present invention Flow chart.Another data quality control device based on verification provided in an embodiment of the present invention verifies comprising increment, such as Fig. 4 Shown, the device includes:
First acquisition module 401, for obtaining full dose data, the full dose data include data record;
Read module 402, for checking parameter table corresponding to reading the full dose data, generated by combination verification logic First verification rule set;
First correction verification module 403, for being verified with the described first verification rule set to the full dose data, obtain the One result set;
Storage module 404, for by data record corresponding to the check errors information in first result set deposit to Error library;
Display module 405, for showing the data record in error library;
3rd acquisition module 406, for obtaining the data record of user's amended record;The data record of the amended record and the mistake Data record by mistake in storehouse is corresponding;
Mark module 407, for the data record of the amended record to be write into full dose data, the data record of renewal is marked Note;
Second generation module 408, for by the checking parameter table and marked data record combination verification logic, The verification rule set of generation second;
Second correction verification module 409, for being verified with the described second verification rule set to marked data record, obtain To the second result set;
Update module 410, for deleting the data in the error library, the check errors in second result set are believed Data record corresponding to breath is deposited to error library;
Second acquisition module 411, for obtaining filter operation instruction;
First generation module 412, for responding filter operation instruction, read data record in the error library and Verification type parameter in the checking parameter table, it is determined that corresponding to the data record of certainty verification, generate certainty mistake Storehouse;
Filtering module 413, for screening the full dose data with the certainty error library, by by the complete of the screening Data are measured as the data for reporting and submitting storehouse.
Data quality control device provided in an embodiment of the present invention based on verification, it with the addition of the mode of increment verification, side Just attendant's amended record verification state is the data record of mistake, and marks the data record of amended record, the only school in subsequent check The data record that standard inspection is recorded a demerit, it is not necessary to full dose data are re-started with verification, the data volume of verification is smaller, so as to operational efficiency compared with It is high.
Understand that compared with prior art, the invention provides a kind of data based on verification via above-mentioned technical scheme Method of quality control and device.Technical scheme provided by the invention, full dose data are obtained first, wherein, the full dose packet Data record is included, then reads checking parameter table corresponding to the full dose data, passes through the verification of combination verification logic generation first Rule set, disposably all full dose data are verified with the described first verification rule set, obtain the first result set, only once I/O operation, relative to the mode verified one by one in the prior art, IO expenses can be greatly lowered, effectively improve efficiency, meanwhile, Relative to the mode of desk checking, efficiency can be also effectively improved, then by the check errors information in first result set Corresponding data record is deposited to error library, obtains filter operation instruction, is responded the filter operation instruction, is read the mistake The verification type parameter in data record and the checking parameter table in storehouse, it is determined that remembering corresponding to the data of certainty verification Record, certainty error library is generated, the full dose data are finally screened with the certainty error library, by by the complete of the screening Data are measured as the data for reporting and submitting storehouse, can in filter false storehouse certainty mistake data record, in addition, on the one hand, due to Technical scheme provided by the invention performs automatic Verification and not relies primarily on desk checking, can avoid relying primarily on artificial school The problem of testing easy error, on the other hand, technical scheme provided by the invention is relative to prior art, it is not necessary to which developer enters The substantial amounts of duplication of labour of row, can reduce the bit error rate, and so as to improve the accuracy of data check, therefore, the present invention provides Technical scheme can effectively improve the accuracy rate of data check, so as to be effectively controlled to the quality of data.That is this hair The technical scheme of bright offer, it can effectively solve the problem that efficiency present in prior art is low low with the accuracy rate of data check, so as to The problem of effectively quality of data can not being controlled, it can be good at being applied to ageing requiring that high, data volume is big and school The system for testing regular complexity.
In addition, technical scheme provided in an embodiment of the present invention, verifies rule set, matching verifies object and implements school in generation During testing, the operation such as extraction, matching is by the way of parameter configuration.On the one hand, matched somebody with somebody by the form of checking parameter table Verification rule is put can timely and conveniently to make the renewal of simplicity in verification rule change, can effectively save subsequent development Cost;On the other hand, it can also realize that attendant easily carries out autonomous control to verification rule, so as to agree with practical business Development condition adjusts, and meets some individual demands.
In addition, another technical scheme provided in an embodiment of the present invention, also added the mode of increment verification, convenient dimension Shield personnel amended record verification state is wrong data record, and marks the data record of amended record, and mark is only verified in subsequent check The data record recorded a demerit, it is not necessary to full dose data are re-started with verification, the data volume of verification is smaller, so as to which operational efficiency is higher.
Finally, it is to be noted that, herein, such as first and second or the like relational terms be used merely to by One entity or operation make a distinction with another entity or operation, and not necessarily require or imply these entities or operation Between any this actual relation or order be present.Moreover, term " comprising ", "comprising" or its any other variant meaning Covering including for nonexcludability, so that process, method, article or equipment including a series of elements not only include that A little key elements, but also the other element including being not expressly set out, or also include for this process, method, article or The intrinsic key element of equipment.In the absence of more restrictions, the key element limited by sentence "including a ...", is not arranged Except other identical element in the process including the key element, method, article or equipment being also present.
Each embodiment is described by the way of progressive in this specification, what each embodiment stressed be and other The difference of embodiment, between each embodiment identical similar portion mutually referring to.For device disclosed in embodiment For, because it is corresponded to the method disclosed in Example, so description is fairly simple, related part is said referring to method part It is bright.
Directly it can be held with reference to the step of method or algorithm that the embodiments described herein describes with hardware, processor Capable software module, or the two combination are implemented.Software module can be placed in random access memory (RAM), internal memory, read-only deposit Any other shape well known in reservoir (ROM), electrically programmable ROM, electrically erasable ROM, register or technical field In the storage medium of formula.
The foregoing description of the disclosed embodiments, professional and technical personnel in the field are enable to realize or using the present invention. A variety of modifications to these embodiments will be apparent for those skilled in the art, as defined herein General Principle can be realized in other embodiments without departing from the spirit or scope of the present invention.Therefore, it is of the invention The embodiments shown herein is not intended to be limited to, and is to fit to and principles disclosed herein and features of novelty phase one The most wide scope caused.

Claims (12)

  1. A kind of 1. data quality control method based on verification, it is characterised in that including:
    Full dose data are obtained, the full dose data include data record;
    Checking parameter table corresponding to the full dose data is read, passes through combination verification logic generation the first verification rule set;
    The full dose data are verified with the described first verification rule set, obtain the first result set;
    Data record corresponding to check errors information in first result set is deposited to error library;
    Obtain filter operation instruction;
    The filter operation instruction is responded, reads the verification class in the data record and the checking parameter table in the error library Shape parameter, it is determined that corresponding to the data record of certainty verification, generate certainty error library;
    The full dose data are screened with the certainty error library, using the full dose data Jing Guo the screening as the number for reporting and submitting storehouse According to.
  2. 2. according to the method for claim 1, it is characterised in that read checking parameter table corresponding to the full dose data, lead to Combination verification logic generation the first verification rule set is crossed, including:
    Read the checking mark number being located at corresponding to the full dose data in checking parameter table, check information description, verification object Table, verification object field, the configuration of verification sentence and verification switch, pass through combination verification logic generation the first verification rule set.
  3. 3. according to the method for claim 1, it is characterised in that the full dose data include data item standard row, it is described with The first verification rule set verifies to the full dose data, obtains the first result set, including:
    The full dose data are verified with the described first verification rule set, obtain including data item standard row, checking mark Number and verification state the first result set;The verification state includes verification correct information and check errors information.
  4. 4. according to the method for claim 3, it is characterised in that the check errors information by first result set Corresponding data record is deposited to error library, including:
    By the data item standard row corresponding with the check errors information difference and check mark number, and the check errors The data record that data item standard row corresponding to information navigate to is deposited to error library.
  5. 5. according to the method described in any one of Claims 1 to 4, it is characterised in that before the acquisition filter operation instruction, also Including:
    Show the data record in error library.
  6. 6. according to the method for claim 5, it is characterised in that it is described display error library in data record after, it is described Before obtaining filter operation instruction, in addition to:
    Obtain the data record of user's amended record;The data record of the amended record is corresponding with the data record in the error library;
    The data record of the amended record is write into full dose data, the data record of renewal is made marks;
    Pass through the checking parameter table and marked data record combination verification logic, the verification rule set of generation second;
    Marked data record is verified with the described second verification rule set, obtains the second result set;
    The data in the error library are deleted, data record corresponding to the check errors information in second result set is deposited To error library.
  7. A kind of 7. data quality control device based on verification, it is characterised in that including:
    First acquisition module, for obtaining full dose data, the full dose data include data record;
    Read module, for reading checking parameter table corresponding to the full dose data, the first school is generated by combination verification logic Test rule set;
    First correction verification module, for being verified with the described first verification rule set to the full dose data, obtain the first result Collection;
    Storage module, for data record corresponding to the check errors information in first result set to be deposited to error library;
    Second acquisition module, for obtaining filter operation instruction;
    First generation module, for responding the filter operation instruction, read the data record in the error library and the school The verification type parameter tested in parameter list, it is determined that corresponding to the data record of certainty verification, generate certainty error library;
    Filtering module, for screening the full dose data with the certainty error library, by the full dose data Jing Guo the screening As the data for reporting and submitting storehouse.
  8. 8. device according to claim 7, it is characterised in that the read module is specifically used for:
    Read the checking mark number being located at corresponding to the full dose data in checking parameter table, check information description, verification object Table, verification object field, the configuration of verification sentence and verification switch, pass through combination verification logic generation the first verification rule set.
  9. 9. device according to claim 7, it is characterised in that the full dose data include data item standard row, and described the One correction verification module is specifically used for:
    The full dose data are verified with the described first verification rule set, obtain including data item standard row, checking mark Number and verification state the first result set;The verification state includes verification correct information and check errors information.
  10. 10. device according to claim 9, it is characterised in that the storage module is specifically used for:
    By the data item standard row corresponding with the check errors information difference and check mark number, and the check errors The data record that data item standard row corresponding to information navigate to is deposited to error library.
  11. 11. according to the device described in any one of claim 7~10, it is characterised in that also include:
    Display module, for showing the data record in error library.
  12. 12. device according to claim 11, it is characterised in that also include:
    3rd acquisition module, for obtaining the data record of user's amended record;In the data record of the amended record and the error library Data record it is corresponding;
    Mark module, for the data record of the amended record to be write into full dose data, the data record of renewal is made marks;
    Second generation module, for by the checking parameter table and marked data record combination verification logic, generation the Two verification rule sets;
    Second correction verification module, for being verified with the described second verification rule set to marked data record, obtain second Result set;
    Update module, it is for deleting the data in the error library, the check errors information in second result set is corresponding Data record deposit to error library.
CN201711083526.0A 2017-11-07 2017-11-07 Data quality control method and device based on verification Active CN107870827B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711083526.0A CN107870827B (en) 2017-11-07 2017-11-07 Data quality control method and device based on verification

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711083526.0A CN107870827B (en) 2017-11-07 2017-11-07 Data quality control method and device based on verification

Publications (2)

Publication Number Publication Date
CN107870827A true CN107870827A (en) 2018-04-03
CN107870827B CN107870827B (en) 2021-01-08

Family

ID=61753667

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711083526.0A Active CN107870827B (en) 2017-11-07 2017-11-07 Data quality control method and device based on verification

Country Status (1)

Country Link
CN (1) CN107870827B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108694522A (en) * 2018-07-06 2018-10-23 中国银行股份有限公司 A kind of data analysing method and device
CN109359277A (en) * 2018-10-22 2019-02-19 众安在线财产保险股份有限公司 Data monitoring method, equipment and computer storage medium
CN110427277A (en) * 2019-07-30 2019-11-08 腾讯科技(深圳)有限公司 Data verification method, device, equipment and storage medium
CN112148721A (en) * 2020-09-25 2020-12-29 新华三大数据技术有限公司 Data checking method and device, electronic equipment and storage medium
CN112948429A (en) * 2021-02-02 2021-06-11 中国工商银行股份有限公司 Data reporting method, device and equipment
CN113076285A (en) * 2021-03-05 2021-07-06 上海明略人工智能(集团)有限公司 Processing method and system for uploading data files

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105183609A (en) * 2015-09-16 2015-12-23 焦点科技股份有限公司 Real-time monitoring system and method applied to software system
CN105740121A (en) * 2016-01-26 2016-07-06 中国银行股份有限公司 Log text monitoring and early-warning method and apparatus
CN106254045A (en) * 2016-08-09 2016-12-21 中国银行股份有限公司 A kind of data verification method and device
CN106407028A (en) * 2016-08-26 2017-02-15 佛山智能装备技术研究院 Double-buffering type robot software log storage method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105183609A (en) * 2015-09-16 2015-12-23 焦点科技股份有限公司 Real-time monitoring system and method applied to software system
CN105740121A (en) * 2016-01-26 2016-07-06 中国银行股份有限公司 Log text monitoring and early-warning method and apparatus
CN106254045A (en) * 2016-08-09 2016-12-21 中国银行股份有限公司 A kind of data verification method and device
CN106407028A (en) * 2016-08-26 2017-02-15 佛山智能装备技术研究院 Double-buffering type robot software log storage method

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108694522A (en) * 2018-07-06 2018-10-23 中国银行股份有限公司 A kind of data analysing method and device
CN108694522B (en) * 2018-07-06 2023-05-09 中国银行股份有限公司 Data analysis method and device
CN109359277A (en) * 2018-10-22 2019-02-19 众安在线财产保险股份有限公司 Data monitoring method, equipment and computer storage medium
CN110427277A (en) * 2019-07-30 2019-11-08 腾讯科技(深圳)有限公司 Data verification method, device, equipment and storage medium
CN112148721A (en) * 2020-09-25 2020-12-29 新华三大数据技术有限公司 Data checking method and device, electronic equipment and storage medium
CN112148721B (en) * 2020-09-25 2022-08-19 新华三大数据技术有限公司 Data checking method and device, electronic equipment and storage medium
CN112948429A (en) * 2021-02-02 2021-06-11 中国工商银行股份有限公司 Data reporting method, device and equipment
CN112948429B (en) * 2021-02-02 2024-04-26 中国工商银行股份有限公司 Data reporting method, device and equipment
CN113076285A (en) * 2021-03-05 2021-07-06 上海明略人工智能(集团)有限公司 Processing method and system for uploading data files

Also Published As

Publication number Publication date
CN107870827B (en) 2021-01-08

Similar Documents

Publication Publication Date Title
CN107870827A (en) Data quality control method and device based on verification
CN106846203B (en) Intelligent marking method and system
CN110008254B (en) Transformer equipment standing book checking processing method
CN106503222A (en) Batch based on Excel imports the method and device of management data base
CN104182484B (en) A kind of method and apparatus for realizing HBase data and the mapping of Java field object
CN103942143B (en) A kind of PC Lint code check result screening techniques compared based on version repository
CN108255702A (en) A kind of test case creation method, apparatus, equipment and storage medium
CN106469215B (en) Data lead-in method and system based on page end
CN109669917B (en) Waveform storage method based on priority grouping
CN105956410B (en) A kind of Universal-purpose quick detection method of IEC61850 full models
CN102087657B (en) IO (Input/output) to point method and IO (Input/output) to point for nuclear power plant simulation platform
CN108763185A (en) The method of calibration and system of Excel file
CN109359277A (en) Data monitoring method, equipment and computer storage medium
CN111309581B (en) Application performance detection method and device in database upgrading scene
CN110209585A (en) A kind of software test case intelligent training method, terminal and storage medium
CN102945262B (en) A kind of RNC configuration data control methods and device
CN109597763B (en) Consistency verification method and device for normalizing multiple items of data
CN110211581A (en) A kind of laboratory automatic speech recognition record identification system and method
CN105183641B (en) The data consistency verification method and system of a kind of kernel module
CN106294128A (en) A kind of automated testing method deriving report data and device
CN110377566A (en) Arrange the method and system of the S parameter test file of PCB
CN107783890A (en) Software defect data processing method and device
CN108427675A (en) Build the method and apparatus of index
CN110071844A (en) A kind of detection script creation system, method and relevant apparatus
CN106649501B (en) Consistency checking system and method for access model of substation monitoring database and SCD

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant