CN113095563A - Method and device for reviewing prediction result of artificial intelligence model - Google Patents

Method and device for reviewing prediction result of artificial intelligence model Download PDF

Info

Publication number
CN113095563A
CN113095563A CN202110374361.2A CN202110374361A CN113095563A CN 113095563 A CN113095563 A CN 113095563A CN 202110374361 A CN202110374361 A CN 202110374361A CN 113095563 A CN113095563 A CN 113095563A
Authority
CN
China
Prior art keywords
review
prediction result
prediction
reviewing
artificial intelligence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110374361.2A
Other languages
Chinese (zh)
Inventor
王博
郑文杰
陈江琦
刘思言
夏卫尚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Corp of China SGCC
Electric Power Research Institute of State Grid Shandong Electric Power Co Ltd
Global Energy Interconnection Research Institute
Original Assignee
State Grid Corp of China SGCC
Electric Power Research Institute of State Grid Shandong Electric Power Co Ltd
Global Energy Interconnection Research Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Corp of China SGCC, Electric Power Research Institute of State Grid Shandong Electric Power Co Ltd, Global Energy Interconnection Research Institute filed Critical State Grid Corp of China SGCC
Priority to CN202110374361.2A priority Critical patent/CN113095563A/en
Publication of CN113095563A publication Critical patent/CN113095563A/en
Priority to PCT/CN2021/123302 priority patent/WO2022213565A1/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • G06Q10/103Workflow collaboration or project management

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Strategic Management (AREA)
  • Human Resources & Organizations (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Data Mining & Analysis (AREA)
  • Economics (AREA)
  • Physics & Mathematics (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • General Business, Economics & Management (AREA)
  • Tourism & Hospitality (AREA)
  • Quality & Reliability (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Game Theory and Decision Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Computation (AREA)
  • Development Economics (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a review method and a review device for a prediction result of an artificial intelligence model, wherein the review method comprises the following steps: obtaining a prediction result of the artificial intelligence model and a prediction category and a confidence coefficient corresponding to the prediction result; determining a review sequence and a review range according to the prediction category and the confidence coefficient; and carrying out the reexamination on the prediction result based on the preset optimization condition, the reexamination sequence and the reexamination range to obtain a reexamination result. By implementing the method and the device, the prediction result of the artificial intelligent model is reviewed, the process of manual review is avoided, the defect of manual review of the sample based on a preset scheme can be effectively overcome, and the review result is more accurate.

Description

Method and device for reviewing prediction result of artificial intelligence model
Technical Field
The invention relates to the technical field of equipment detection, in particular to a method and a device for reviewing a prediction result of an artificial intelligence model.
Background
At present, artificial intelligence models are used in a number of industries. For example, in the field of power transmission inspection, an unmanned aerial vehicle shoots equipment such as a power transmission tower, and possible problems in a shot picture are screened out based on a target detection model; in the field of security protection, a possible criminal suspect escaping is found based on a face detection and face comparison model; in the Internet field, based on a text classification model, user speeches related to yellow, politics and violence are found; in the industrial field, defective products are found based on a picture target detection model. However, since the artificial intelligence model is not completely accurate, after the artificial intelligence model is applied to find potential problems, a manual review or a manual review is often required, and therefore, a decision needs to be made as to which samples are to be reviewed. The current general method is as follows: and estimating the false detection rate and the missed detection rate of the model under different confidence coefficient conditions based on the verification set, and setting a confidence coefficient threshold value of manual review/manual review by combining actual requirements.
However, when the system of model prediction and manual review is actually applied, because the sample collection mode, the sample source, the sample characteristics, and the like are different from the training set, the false detection rate and the false detection rate of the model prediction result at different confidence levels are different from the value estimated based on the verification set, so the manual review scheme based on the preset confidence threshold causes the waste of manpower: on one hand, the number of samples of the manual review is possibly beyond/lower than the expected number, so that the review cost is beyond the expected cost or the problem detection number is lower; on the other hand, for multi-class prediction results, the workload of manual review may be concentrated on a few classes that are frequently misreported, and the number of problem samples found by review or the number of filtered false-reported samples is low due to the mismatch of review resources.
Disclosure of Invention
In view of this, embodiments of the present invention provide a review method and an apparatus for a prediction result of an artificial intelligence model, so as to solve the problems of low review number, low review accuracy and the like in the prior art when a manual review mode is used to review the prediction result of the artificial intelligence model.
The embodiment of the invention provides a review method for a prediction result of an artificial intelligence model, which comprises the following steps: obtaining a prediction result of an artificial intelligence model and a prediction type and a confidence coefficient corresponding to the prediction result; determining a review sequence and a review range according to the prediction category and the confidence coefficient; and reviewing the prediction result based on preset optimization conditions, the review sequence and the review range to obtain a review result.
Optionally, the review method further includes: and adjusting the review sequence and/or review range of the prediction result according to the review result.
Optionally, the adjusting the review order and/or the review range of the prediction result according to the review result includes: determining a prediction result sample which is subjected to the review according to the review result; calculating the number of correct and wrong samples predicted in a preset range and the corresponding proportion in the prediction result samples after the review; the preset range includes: and at least one of a preset confidence interval, a preset prediction type result and a type marked by the prediction result sample.
Optionally, the preset optimization condition includes at least one of the following three items: under the condition that the total review quantity is determined, the detection quantity of the problem samples reaches the maximum; when the detection number of the problem samples is determined, the total review number reaches the minimum; and under the condition that the total review quantity is determined, the false alarm quantity of the artificial intelligent model determined by the review reaches the maximum.
Optionally, the reviewing the prediction result based on the preset optimization condition and the review order includes: based on the prediction confidence of the artificial intelligence model, carrying out a review from high to low, or carrying out a review from low to high from a preset threshold value; and when the number of the review reaches the preset number or the number of the detected problems reaches the preset number, terminating the review.
Optionally, the reviewing the prediction result based on the preset optimization condition and the review order includes: setting prior distribution of accuracy rates of the artificial intelligence model in different confidence degrees of each category by using verification set data based on a Bayesian method; determining the review sequence based on a multi-arm slot machine method, and calculating posterior distribution of model accuracy in the review process; determining a termination condition of the review based on the optimization condition, wherein the termination condition comprises: when the number of the review reaches the preset number or the number of the detected problems reaches the preset number, terminating the review; and reviewing the prediction result based on the prior distribution, the posterior distribution and the termination condition.
Optionally, the reviewing the prediction result based on the preset optimization condition and the review order includes: and selecting a prediction result sample with the highest confidence coefficient of the non-review in the prediction category for review.
Optionally, the review method further includes: for the prediction result samples outside the review range, judging the prediction result samples with the confidence coefficient lower than that of the manual review samples as being problem-free; the predicted result samples with higher confidence than the manually reviewed samples are considered problematic.
The embodiment of the invention also provides a review device for the prediction result of the artificial intelligence model, which comprises: the prediction result acquisition module is used for acquiring the prediction result of the artificial intelligence model and the prediction type and the confidence coefficient corresponding to the prediction result; a review order and range determining module for determining a review order and a review range according to the prediction category and the confidence; and the review module is used for reviewing the prediction result based on preset optimization conditions, the review sequence and the review range to obtain a review result.
An embodiment of the present invention further provides an electronic device/mobile terminal/server, including: the artificial intelligence model prediction review method comprises a memory and a processor, wherein the memory and the processor are connected in a communication mode, the memory stores computer instructions, and the processor executes the computer instructions to execute the artificial intelligence model prediction result review method in the first aspect or any one implementation manner of the first aspect.
An embodiment of the present invention further provides a computer-readable storage medium, where the computer-readable storage medium stores a computer instruction, where the computer instruction is configured to enable the computer to execute the method for reviewing the artificial intelligence model prediction result in the first aspect or any implementation manner of the first aspect.
The embodiment of the invention has the advantages that the review method of the embodiment of the invention carries out review aiming at the prediction result of the artificial intelligent model, avoids the process of artificial review, can effectively avoid the defect of carrying out sample artificial review based on the preset scheme, has more accurate review result, can adjust the review strategy according to the data characteristics in real time, can more accurately estimate the review workload and more efficiently search problem samples or false-alarm samples.
Drawings
The features and advantages of the present invention will be more clearly understood by reference to the accompanying drawings, which are illustrative and not to be construed as limiting the invention in any way, and in which:
FIG. 1 is a flow chart of a review method for the prediction results of an artificial intelligence model according to an embodiment of the present invention;
FIG. 2 is a schematic structural diagram of a review device for artificial intelligence model prediction results according to an embodiment of the present invention;
fig. 3 shows a hardware configuration diagram of a computer device according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The embodiment of the invention provides a review method for a prediction result of an artificial intelligence model, which comprises the following steps of:
step S101: and obtaining a prediction result of the artificial intelligence model and a prediction type and a confidence degree corresponding to the prediction result. In this embodiment, the review method is mainly a scheme of reviewing the prediction result of the artificial intelligence model, and therefore, the prediction result of the artificial intelligence model and the prediction type and the confidence degree corresponding to the prediction result need to be obtained first. Optionally, in this embodiment, an object to which a prediction result of the artificial intelligence model is directed is not limited, and the prediction result may be, for example, a defect detection result applied in the power transmission inspection field, or an identification result applied in the internet field and used for identifying data, information, and the like appearing in the network.
Step S102: and determining a review sequence and a review range according to the prediction category and the confidence coefficient.
Step S103: and reviewing the prediction result based on preset optimization conditions, the review sequence and the review range to obtain a review result. In this embodiment, the preset optimization condition may mainly include at least one of the following three items: under the condition that the total review quantity is determined, the detection quantity of the problem samples reaches the maximum; when the detection number of the problem samples is determined, the total review number reaches the minimum; and under the condition that the total review quantity is determined, the false alarm quantity of the artificial intelligent model determined by the review reaches the maximum.
By means of the review method, the defects of manual review of samples based on the preset scheme can be effectively overcome by reviewing the prediction result of the artificial intelligent model, the review result is more accurate, the review strategy can be adjusted in real time according to the data characteristics, the review workload can be estimated more accurately, and problem samples or false-reported samples can be searched more efficiently.
Optionally, in some embodiments of the present invention, the review method further includes: and adjusting the review sequence and/or review range of the prediction result according to the review result. Specifically, this adjustment process mainly includes:
determining a prediction result sample which is subjected to the review according to the review result; calculating the number of correct and wrong samples predicted in a preset range and the corresponding proportion in the prediction result samples after the review; the preset range includes: and at least one of a preset confidence interval, a preset prediction type result and a type marked by the prediction result sample.
Through the process, the defect of sample review based on the preset scheme can be effectively overcome, the review strategy is adjusted in real time according to the data characteristics, the review workload is accurately estimated, and problem samples or false-reported samples are effectively searched.
Optionally, in some embodiments of the present invention, in the step S103, a process of reviewing the prediction result based on a preset optimization condition and the review order includes: based on the prediction confidence of the artificial intelligence model, carrying out a review from high to low, or carrying out a review from low to high from a preset threshold value; and when the number of the review reaches the preset number or the number of the detected problems reaches the preset number, terminating the review. Based on this process, the optimization conditions can be achieved: when the detection number of the problem samples is determined, the total review number reaches the minimum; and under the condition that the total review quantity is determined, the false alarm quantity of the artificial intelligent model determined by the review reaches the maximum.
Optionally, in some embodiments of the present invention, in the step S103, a process of reviewing the prediction result based on a preset optimization condition and the review order includes: setting prior distribution of accuracy rates of the artificial intelligence model in different confidence degrees of each category by using verification set data based on a Bayesian method; determining the review sequence based on a multi-arm slot machine method, and calculating posterior distribution of model accuracy in the review process; determining a termination condition of the review based on the optimization condition, wherein the termination condition comprises: when the number of the review reaches the preset number or the number of the detected problems reaches the preset number, terminating the review; and reviewing the prediction result based on the prior distribution, the posterior distribution and the termination condition.
In this embodiment, the validation set data is used to set a prior distribution of the model at different confidence levels for each class (assuming that the ratio of the model accuracy to the validation set accuracy is constant at different confidence levels for the different classes, and assuming that the prior of the model accuracy at confidence level P0 obeys Beta (tA, (1-t) a), where t is the validation set accuracy and a is a predetermined integer, typically 10 to 50); based on the multi-arm slot machine method, a review sequence is determined (the category with the maximum estimation value of 90% quantile points on the accuracy is selected, the sample which is not reviewed in the category and has the maximum confidence coefficient is used as the next sample to be reviewed), and the posterior distribution of the model accuracy is calculated in the review process (for each category, the picture with the maximum confidence coefficient in the unexamined samples is selected, the confidence coefficient is recorded as P, and the upper 90% quantile points of the model accuracy when the confidence coefficient of the category is P are calculated according to the reviewed pictures).
Through this process, the optimization conditions can be achieved: under the condition that the total review quantity is determined, the detection quantity of the problem samples reaches the maximum; and when the detection number of the problem samples is determined, the total review number reaches the minimum.
Optionally, in some embodiments of the present invention, in the step S103, a process of reviewing the prediction result based on a preset optimization condition and the review order includes: and selecting a prediction result sample with the highest confidence coefficient of the non-review in the prediction category for review.
For the multi-class problem, n (n > ═ 1) conditions are preset, wherein the condition i is continuous ki (ki > ═ 1) review samples of the class, and the number of samples with correct model prediction results does not exceed ri. Firstly, selecting a category, and selecting a sample with the highest confidence coefficient of the non-review in the category for review. If at least one condition is met, switching to the next category; otherwise, the review of the category is continued. And if the total review amount or the number of detected problem/defect samples reaches a given value, stopping the review. For each preset condition, ki and ri are preset positive integers, for example: two conditions, k 1-3, r 1-0, k 2-10, and r 2-4, may be preset, i.e., if for a certain category, there is no problem sample in the first 3 samples reviewed (r 1-0), or the number of problem samples in 10 consecutive samples does not exceed 4, then switching to the next category.
Through this process, the optimization conditions can be achieved: under the condition that the total review quantity is determined, the detection quantity of the problem samples reaches the maximum; and when the detection number of the problem samples is determined, the total review number reaches the minimum.
Further, for the optimization conditions: and under the condition that the total review quantity is determined, the false alarm quantity of the artificial intelligent model determined by the review reaches the maximum. The false alarm condition can be found out as much as possible by repeating the test from low to high according to the confidence coefficient, at this time, n (n > -1) conditions are preset, wherein the condition i is continuous ki (ki > -1) review samples of the category, and the number of samples with correct model prediction results is not less than ri.
Optionally, in some embodiments of the present invention, the review method further includes: for the prediction result samples outside the review range, judging the prediction result samples with the confidence coefficient lower than that of the manual review samples as being problem-free; the predicted result samples with higher confidence than the manually reviewed samples are considered problematic.
The review method for the artificial intelligence model prediction result of the embodiment of the invention is further explained by combining the specific application example.
Application example 1: the intelligent defect detection system is applied to the field of power transmission inspection and carries out intelligent defect detection on power transmission equipment. The system mainly comprises the following modules:
(1) and the picture acquisition module. The picture to be detected can be input into the system through modes of remote manual uploading of a user, uploading of the image acquired by the unmanned aerial vehicle through wifi or a mobile communication network, copying by using a mobile storage device and the like. The picture to be detected can be a picture shot for the inspection of all or part of the towers of one power transmission line.
(2) And a defect intelligent detection module. And predicting the position, the category and the confidence coefficient of a target frame of the hidden danger of the power transmission equipment possibly contained in the picture by using a target detection model (namely the artificial intelligence model in the embodiment of the invention) to obtain a prediction result.
(3) And a review module. And obtaining the accuracy of each category of the intelligent defect detection model on different confidence levels based on the verification set. Assuming that the ratio of model accuracy to validation set accuracy is constant for different classes at different confidence levels, and assuming that the prior of model accuracy at confidence level P0 obeys Beta (tA, (1-t) A), where t is the validation set accuracy and A is a predetermined integer, typically 10 to 50. When the picture is reviewed, if the ratio of the number of reviewed samples (or the number of false alarms in the reviewed samples) to the total picture number reaches a given value, the review is finished; otherwise, firstly, selecting the picture with the maximum confidence level in the unexamined samples for each category, recording the confidence level as P, and calculating the upper 90% quantile point of the model accuracy rate when the confidence level of the category is P according to the reviewed pictures; selecting the category with the maximum estimation value of 90% quantile points on the accuracy rate, returning an unexamined sample with the maximum confidence coefficient in the category as the next sample to be reviewed, and displaying the sample to a user; and after the user finishes the review, collecting and recording the review result of the user on the sample.
(4) And a report generation module. And obtaining a defect list based on the review result, and generating a defect report.
Application example 2: the data auditing system is applied to the field of the Internet and is used for network transmission data. The system comprises the following modules:
(1) and a primary examination module. The initial examination module calls an AI classification model (namely an artificial intelligence model), predicts the input data analysis request, determines the requests of violation categories such as yellow and storm related and with confidence coefficient exceeding a corresponding category given threshold 2 as not passing the examination according to the classification result and the confidence coefficient, determines the requests of the violation categories with confidence coefficient not lower than the corresponding threshold 1 and not higher than the corresponding threshold 2 as needing manual review, and determines the classification results of other categories as passing the examination. And returning the initial examination result, and sending a request to be reexamined to a reexamination module to wait for reexamination.
(2) And a review module. And providing the text contained in the received review request to a review module, and placing the text in a to-be-reviewed list. Calculating (a) review speed every fixed time; (b) estimating the generation speed of the review request of each confidence interval of each category; (c) and estimating the model false alarm rate of each confidence interval of each category. The length of the confidence interval is a preset value, and may be 1%, 5%, or the like. And calculating the maximum review workload according to the (a) and the (b) and a preset condition. Then, the threshold 2 of each category is calculated according to the maximum review workload, so that the number of model false reports of requests of each category between the thresholds 1 and 2 reaches the maximum. And resetting the threshold 2 of the initial review module according to the calculation result, moving the sample with the confidence coefficient exceeding the threshold 2 out of the list to be reviewed in the request to be reviewed, and setting the state of the sample to be reviewed as pass. And displaying the samples in the list to be reviewed to the reviewers in sequence according to the time sequence of entering the list, and setting the state of the corresponding sample as approved or not approved according to the judgment result of the reviewers on the samples.
The embodiment of the present invention further provides a review device for the prediction result of the artificial intelligence model, as shown in fig. 2, the review device includes:
the prediction result obtaining module 1 is used for obtaining a prediction result of the artificial intelligence model and a prediction type and a confidence coefficient corresponding to the prediction result; for details, reference may be made to the related description of step S101 in the above method embodiment, and details are not repeated herein.
A review order and range determining module 2, configured to determine a review order and a review range according to the prediction category and the confidence; for details, reference may be made to the related description of step S102 in the above method embodiment, and details are not repeated herein.
And the review module 3 is used for reviewing the prediction result based on preset optimization conditions, the review sequence and the review range to obtain a review result. For details, reference may be made to the related description of step S103 in the above method embodiment, and details are not repeated here.
By means of the review device, review is conducted according to the prediction result of the artificial intelligent model, the defect that manual review of the sample is conducted based on the preset scheme can be effectively overcome, the review result is more accurate, the review strategy can be adjusted in real time according to the data characteristics, the review workload can be estimated more accurately, and the problem sample or the false-reported sample can be found more efficiently.
An embodiment of the present invention further provides a computer device, as shown in fig. 3, the computer device may include a processor 31 and a memory 32, where the processor 31 and the memory 32 may be connected by a bus or in another manner, and fig. 3 takes the example of being connected by a bus as an example.
The processor 31 may be a Central Processing Unit (CPU). The Processor 31 may also be other general purpose processors, Digital Signal Processors (DSPs), Application Specific Integrated Circuits (ASICs), Field Programmable Gate Arrays (FPGAs) or other Programmable logic devices, discrete Gate or transistor logic devices, discrete hardware components, or combinations thereof.
The memory 32 is a non-transitory computer readable storage medium, and can be used to store non-transitory software programs, non-transitory computer executable programs, and modules, such as program instructions/modules corresponding to the review method for the artificial intelligence model prediction results in the embodiment of the present invention (for example, the prediction result obtaining module 1, the review order and scope determining module 2, and the review module 3 shown in fig. 2). The processor 31 executes various functional applications and data processing of the processor by running the non-transitory software programs, instructions and modules stored in the memory 32, namely, implements the review method of the prediction result of the artificial intelligence model in the above method embodiment.
The memory 32 may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required for at least one function; the storage data area may store data created by the processor 31, and the like. Further, the memory 32 may include high speed random access memory, and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid state storage device. In some embodiments, the memory 32 may optionally include memory located remotely from the processor 31, and these remote memories may be connected to the processor 31 via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The one or more modules are stored in the memory 32 and, when executed by the processor 31, perform a review method of artificial intelligence model predictions as in the embodiment of FIG. 1.
The details of the computer device may be understood by referring to the corresponding descriptions and effects in the embodiments shown in fig. 1 to fig. 2, and are not described herein again.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by a computer program, which can be stored in a computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. The storage medium may be a magnetic Disk, an optical Disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a Flash Memory (Flash Memory), a Hard Disk (Hard Disk Drive, abbreviated as HDD), a Solid State Drive (SSD), or the like; the storage medium may also comprise a combination of memories of the kind described above.
Although the embodiments of the present invention have been described in conjunction with the accompanying drawings, those skilled in the art may make various modifications and variations without departing from the spirit and scope of the invention, and such modifications and variations fall within the scope defined by the appended claims.

Claims (11)

1. A review method for the prediction result of an artificial intelligence model is characterized by comprising the following steps:
obtaining a prediction result of an artificial intelligence model and a prediction type and a confidence coefficient corresponding to the prediction result;
determining a review sequence and a review range according to the prediction category and the confidence coefficient;
and reviewing the prediction result based on preset optimization conditions, the review sequence and the review range to obtain a review result.
2. The method for reviewing the prediction results of the artificial intelligence model according to claim 1, wherein the reviewing method further comprises:
and adjusting the review sequence and/or review range of the prediction result according to the review result.
3. The method for reviewing the artificial intelligence model prediction result of claim 2, wherein the adjusting the review order and/or the review range of the prediction result according to the review result comprises:
determining a prediction result sample which is subjected to the review according to the review result;
calculating the number of correct and wrong samples predicted in a preset range and the corresponding proportion in the prediction result samples after the review; the preset range includes: and at least one of a preset confidence interval, a preset prediction type result and a type marked by the prediction result sample.
4. The method for reviewing the prediction result of the artificial intelligence model according to claim 1, wherein the preset optimization condition comprises at least one of the following three items:
under the condition that the total review quantity is determined, the detection quantity of the problem samples reaches the maximum;
when the detection number of the problem samples is determined, the total review number reaches the minimum;
and under the condition that the total review quantity is determined, the false alarm quantity of the artificial intelligent model determined by the review reaches the maximum.
5. The method for reviewing the prediction result of the artificial intelligence model according to claim 1 or 4, wherein reviewing the prediction result based on the preset optimization condition and the review order comprises:
based on the prediction confidence of the artificial intelligence model, carrying out a review from high to low, or carrying out a review from low to high from a preset threshold value;
and when the number of the review reaches the preset number or the number of the detected problems reaches the preset number, terminating the review.
6. The method for reviewing the prediction result of the artificial intelligence model according to claim 1 or 4, wherein reviewing the prediction result based on the preset optimization condition and the review order comprises:
setting prior distribution of accuracy rates of the artificial intelligence model in different confidence degrees of each category by using verification set data based on a Bayesian method;
determining the review sequence based on a multi-arm slot machine method, and calculating posterior distribution of model accuracy in the review process;
determining a termination condition of the review based on the optimization condition, wherein the termination condition comprises: when the number of the review reaches the preset number or the number of the detected problems reaches the preset number, terminating the review;
and reviewing the prediction result based on the prior distribution, the posterior distribution and the termination condition.
7. The method for reviewing the prediction result of the artificial intelligence model according to claim 1 or 4, wherein reviewing the prediction result based on the preset optimization condition and the review order comprises:
and selecting a prediction result sample with the highest confidence coefficient of the non-review in the prediction category for review.
8. The method for reviewing the prediction results of the artificial intelligence model according to claim 1, wherein the reviewing method further comprises:
for the prediction result samples outside the review range, judging the prediction result samples with the confidence coefficient lower than that of the manual review samples as being problem-free; the predicted result samples with higher confidence than the manually reviewed samples are considered problematic.
9. A review device for the prediction result of an artificial intelligence model is characterized by comprising:
the prediction result acquisition module is used for acquiring the prediction result of the artificial intelligence model and the prediction type and the confidence coefficient corresponding to the prediction result;
a review order and range determining module for determining a review order and a review range according to the prediction category and the confidence;
and the review module is used for reviewing the prediction result based on preset optimization conditions, the review sequence and the review range to obtain a review result.
10. A computer device, comprising:
a memory and a processor, the memory and the processor being communicatively connected to each other, the memory having stored therein computer instructions, the processor executing the computer instructions to perform the method for reviewing the artificial intelligence model prediction result according to any one of claims 1-8.
11. A computer-readable storage medium storing computer instructions for causing a computer to perform the method of reviewing artificial intelligence model predictors according to any one of claims 1 to 8.
CN202110374361.2A 2021-04-07 2021-04-07 Method and device for reviewing prediction result of artificial intelligence model Pending CN113095563A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202110374361.2A CN113095563A (en) 2021-04-07 2021-04-07 Method and device for reviewing prediction result of artificial intelligence model
PCT/CN2021/123302 WO2022213565A1 (en) 2021-04-07 2021-10-12 Review method and apparatus for prediction result of artificial intelligence model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110374361.2A CN113095563A (en) 2021-04-07 2021-04-07 Method and device for reviewing prediction result of artificial intelligence model

Publications (1)

Publication Number Publication Date
CN113095563A true CN113095563A (en) 2021-07-09

Family

ID=76674770

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110374361.2A Pending CN113095563A (en) 2021-04-07 2021-04-07 Method and device for reviewing prediction result of artificial intelligence model

Country Status (2)

Country Link
CN (1) CN113095563A (en)
WO (1) WO2022213565A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114612752A (en) * 2022-03-02 2022-06-10 捻果科技(深圳)有限公司 Method and system for intelligently reviewing recognition result of video analysis technology
WO2022213565A1 (en) * 2021-04-07 2022-10-13 全球能源互联网研究院有限公司 Review method and apparatus for prediction result of artificial intelligence model

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117579625B (en) * 2024-01-17 2024-04-09 中国矿业大学 Inspection task pre-distribution method for double prevention mechanism

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107196953A (en) * 2017-06-14 2017-09-22 上海丁牛信息科技有限公司 A kind of anomaly detection method based on user behavior analysis
CN107729924A (en) * 2017-09-25 2018-02-23 平安科技(深圳)有限公司 Picture review probability interval generation method and picture review decision method
CN109658042A (en) * 2018-10-08 2019-04-19 平安科技(深圳)有限公司 Review method, apparatus, equipment and storage medium based on artificial intelligence
CN110689531A (en) * 2019-09-23 2020-01-14 云南电网有限责任公司电力科学研究院 Automatic power transmission line machine inspection image defect identification method based on yolo
CN110992329A (en) * 2019-11-28 2020-04-10 上海微创医疗器械(集团)有限公司 Product surface defect detection method, electronic device and readable storage medium
CN111597958A (en) * 2020-05-12 2020-08-28 西安网算数据科技有限公司 Highly automated bill classification method and system

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10558920B2 (en) * 2017-10-02 2020-02-11 Servicenow, Inc. Machine learning classification with confidence thresholds
CN108805173A (en) * 2018-05-16 2018-11-13 苏州迈为科技股份有限公司 Solar battery sheet aberration method for separating
CN108875963A (en) * 2018-06-28 2018-11-23 北京字节跳动网络技术有限公司 Optimization method, device, terminal device and the storage medium of machine learning model
CN112508044A (en) * 2019-09-16 2021-03-16 华为技术有限公司 Artificial intelligence AI model evaluation method, system and equipment
CN111124863B (en) * 2019-12-24 2024-02-13 北京安兔兔科技有限公司 Intelligent device performance testing method and device and intelligent device
CN111753914B (en) * 2020-06-29 2024-04-16 北京百度网讯科技有限公司 Model optimization method and device, electronic equipment and storage medium
CN113095563A (en) * 2021-04-07 2021-07-09 全球能源互联网研究院有限公司 Method and device for reviewing prediction result of artificial intelligence model

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107196953A (en) * 2017-06-14 2017-09-22 上海丁牛信息科技有限公司 A kind of anomaly detection method based on user behavior analysis
CN107729924A (en) * 2017-09-25 2018-02-23 平安科技(深圳)有限公司 Picture review probability interval generation method and picture review decision method
CN109658042A (en) * 2018-10-08 2019-04-19 平安科技(深圳)有限公司 Review method, apparatus, equipment and storage medium based on artificial intelligence
CN110689531A (en) * 2019-09-23 2020-01-14 云南电网有限责任公司电力科学研究院 Automatic power transmission line machine inspection image defect identification method based on yolo
CN110992329A (en) * 2019-11-28 2020-04-10 上海微创医疗器械(集团)有限公司 Product surface defect detection method, electronic device and readable storage medium
CN111597958A (en) * 2020-05-12 2020-08-28 西安网算数据科技有限公司 Highly automated bill classification method and system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022213565A1 (en) * 2021-04-07 2022-10-13 全球能源互联网研究院有限公司 Review method and apparatus for prediction result of artificial intelligence model
CN114612752A (en) * 2022-03-02 2022-06-10 捻果科技(深圳)有限公司 Method and system for intelligently reviewing recognition result of video analysis technology

Also Published As

Publication number Publication date
WO2022213565A1 (en) 2022-10-13

Similar Documents

Publication Publication Date Title
US9129228B1 (en) Robust and fast model fitting by adaptive sampling
CN109726763B (en) Information asset identification method, device, equipment and medium
CN111210399B (en) Imaging quality evaluation method, device and equipment
CN113095563A (en) Method and device for reviewing prediction result of artificial intelligence model
CN111309539A (en) Abnormity monitoring method and device and electronic equipment
CN112528801B (en) Abnormal event detection method, model training method and device
CN112613569B (en) Image recognition method, training method and device for image classification model
CN110869942B (en) Self-feed deep learning method and system
US20200349364A1 (en) Method and apparatus for training lane line identifying model, device, and storage medium
KR20200123584A (en) Apparatus and method for predicting error of annotation
CN113377484A (en) Popup window processing method and device
CN112307900A (en) Method and device for evaluating facial image quality and electronic equipment
CN113516251B (en) Machine learning system and model training method
CN114218505A (en) Abnormal space-time point identification method and device, electronic equipment and storage medium
CN109960626A (en) Recognition methods, device, equipment and the medium of port exception
CN112434717A (en) Model training method and device
CN116863288A (en) Target detection and alarm method, device and equipment based on deep learning
CN111783883A (en) Abnormal data detection method and device
CN112149698A (en) Method and device for screening difficult sample data
CN112152968B (en) Network threat detection method and device
CN113807209A (en) Parking space detection method and device, electronic equipment and storage medium
CN111523826A (en) Data acquisition method, device and equipment
CN111191593A (en) Image target detection method and device, storage medium and sewage pipeline detection device
CN115861321B (en) Production environment detection method and system applied to industrial Internet
CN116450632B (en) Geographic sample data quality evaluation method, device and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination