CN104331513A - High-efficiency prediction method for image retrieval performance - Google Patents

High-efficiency prediction method for image retrieval performance Download PDF

Info

Publication number
CN104331513A
CN104331513A CN201410685896.1A CN201410685896A CN104331513A CN 104331513 A CN104331513 A CN 104331513A CN 201410685896 A CN201410685896 A CN 201410685896A CN 104331513 A CN104331513 A CN 104331513A
Authority
CN
China
Prior art keywords
picture
retrieval
sample picture
result
pictures
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410685896.1A
Other languages
Chinese (zh)
Inventor
贾强槐
田新梅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Science and Technology of China USTC
Original Assignee
University of Science and Technology of China USTC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Science and Technology of China USTC filed Critical University of Science and Technology of China USTC
Priority to CN201410685896.1A priority Critical patent/CN104331513A/en
Publication of CN104331513A publication Critical patent/CN104331513A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/56Information retrieval; Database structures therefor; File system structures therefor of still image data having vectorial format
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a high-efficiency prediction method for image retrieval performance. The method comprises the following steps: adaptively selecting an example image from a retrieval result upon the spurious correlation feedback technology; determining the vote of each image upon the voting decision-making mechanism and according to the visual similarity between each image in the retrieval result and the example image, and further estimating the association probability between each image in the retrieval result and an input search request; calculating the mean average precision of the image retrieval by synthesizing the association probabilities between all the images in the retrieval result and the input search request to realize high-efficiency prediction on the image retrieval performance. The method disclosed by the invention is lower in complexity, is convenient to realize, cannot be easily influenced by noise, and is capable of accurately performing prediction on the image retrieval performance.

Description

A kind of image retrieval performance prediction method efficiently
Technical field
The present invention relates to image retrieval technologies field, particularly relate to a kind of image retrieval performance prediction method efficiently.
Background technology
Image retrieval technologies is under the prerequisite of given query text or image, according to content information or given query standard, from image data base, search inquires about relevant picture to user, and sorts to picture from high to low according to the degree of correlation between picture and inquiry and ranking results is returned to user.The research of image retrieval technologies has achieved huge progress, many retrieval models are in succession proposed and are constantly obtained verification, improvement and checking in practice, but also there is serious robustness problem in most of searching system, such as, for some inquiry, result for retrieval quality is high, and for other inquiry, result for retrieval contains the irrelevant image of a lot of inquiry.Inquire about between its result for retrieval for difference and often there is larger difference; What is more, even if the good searching system of those average retrieval performances, but to some inquiry, their result for retrieval can not be satisfactory.Therefore, we wish that searching system can identify the inquiry that those result for retrieval are poor automatically, and do corresponding process to them.
Query performance prediction is the advanced problems of information retrieval field research.This technology is attempted when not having relevant information (namely not knowing the degree of correlation of result for retrieval and inquiry), the assessment searching system fine or not degree that it returns results to a certain inquiry.Query performance prediction can produce wholesome effect to user and searching system simultaneously.From the angle of user, this technology can provide valuable feedback information, such as, when the Query Result that searching system dopes certain user is poor time, can by this information feed back to user, then user can reconstruct his inquiry, interactive to obtain result for retrieval better with searching system even further; Concerning searching system, in the ideal case, if searching system can predict the performance of a certain inquiry, so it just automatically can adjust its parameter or algorithm to adapt to different inquiries, thus obtains better retrieval performance.
Relative to long-term for improving the exploration of searching system performance on retrieval model, the research of image retrieval query performance prediction is scarcely out of swaddling-clothes.The initial research about image retrieval query performance prediction is based on text message, text message (the context text returned around picture is retrieved mainly through research, picture URL etc.) and input inquiry text between mutual relationship, the specifics of such as text, generality, ambiguousness and iconicity etc.These researchs are main exists 2 deficiencies: the vision content 1) ignoring image only depends on text message, but text is not enough to the abundant visual information of description and text message often comprises a lot " noise "; 2) do not estimate real image searching result quality, and be simple inquiry being divided into easily and difficult two classes.
Current major part research is the query performance prediction returning picture vision content for retrieval.The main thought of this research is " document " that be first expressed as by picture being made up of vision word, then using some characteristics of method statistic of text analyzing, such as calculating by returning the probability distribution variances between language model and the language model be made up of whole pictures that picture forms; Estimate the Space Consistency returned between picture; Study for the forward picture visual consistency of retrieval return-list rank; And the research visual similarity distribution returning picture etc.These researchs first propose reasonably hypothesis, then make full use of the visual information of picture, estimate the query performance of result for retrieval, promoted the development of image retrieval query performance prediction technology.But these methods just give the indicated value of a query performance height, do not estimate the true value of result for retrieval quality.
In current research method, in addition to the above method, the method from the true retrieval performance aspect research of direct estimation is also had.The roughly thought of the method is, first estimates each correlation probabilities returning picture and given inquiry, then utilizes the correlation probabilities obtained to estimate Average Accuracy, and how accurate its key point be exactly must estimate correlation probabilities.Due to a kind of method that the method is direct estimation result for retrieval quality true value, if so accurately correlation probabilities cannot be estimated, the accuracy of net result so will certainly be affected.
Summary of the invention
The object of this invention is to provide a kind of image retrieval performance prediction method efficiently, the method complexity is lower, is convenient to realize, not easily affected by noise and can predict image retrieval performance comparatively accurately.
The object of the invention is to be achieved through the following technical solutions:
A kind of image retrieval performance prediction method efficiently, the method comprises:
Based on pseudo-linear filter technology adaptive selection sample picture from result for retrieval;
Based on ballot decision-making mechanism, and determine the votes of each pictures according to the vision similarity of each pictures in described result for retrieval and described sample picture, and then estimate the correlation probabilities in described result for retrieval between each pictures and inquiry request of input;
Correlation probabilities in integrated retrieval result between all pictures and the inquiry request of input calculates the Average Accuracy of this image retrieval, realizes efficient image retrieval performance prediction.
Further, describedly to comprise based on pseudo-linear filter technology adaptive selection sample picture from result for retrieval:
Image retrieval method for reordering is utilized to resequence to result for retrieval, then based on pseudo-linear filter technology adaptive selection sample picture from the result for retrieval of rearrangement; Described sample picture comprises the set of positive sample picture and the set of negative sample picture, or only comprises the set of positive sample picture; It is the picture of front K that described positive sample pictures are combined into rank in the result for retrieval of rearrangement, and it is the picture of rear K that described negative sample pictures are combined into rank in the result for retrieval of rearrangement; Wherein, the numerical value of K is determined by the method for consistance score C oS, is expressed as:
K * = max K ∈ [ L , M ] CoS ( K ) ;
Wherein, K *represent the optimal value of K, L and M represents minimum value and the maximal value of K value respectively; CoS (K), for meeting conforming picture to shared number percent in selected positive sample picture, is expressed as:
CoS ( K ) = 1 | K ( K - 1 ) | Σ i , j = 1 , . . . , K ; i ≠ j δ ( I i , I j ) ;
δ ( I i , I j ) = 1 , if s ( I i , I j ) > μ 0 , else ;
In above formula, s (I i, I j) represent picture I iand I jbetween vision similarity, μ is default threshold value.
Further, the step calculating the vision similarity of each pictures and described sample picture in result for retrieval comprises:
Adopt visual word bag model and vector space model all pictures to be expressed as one group of vector, comprising: adopt intensive sampling scale invariant feature switch technology Dense SIFT to extract the SIFT feature of each picture; Then, use clustering algorithm K-means that all SIFT feature obtained are polymerized to the code book comprising S vision word visual word; According to Nearest neighbor rule, all SIFT feature are quantized on corresponding visual word; Re-use TF-IDF weight mechanism and weigh the importance of each visual word in every pictures; Finally, every pictures is represented with vector space model;
Picture I ivector expression be x iin every dimensional vector be calculated as follows:
x l i = tf l · idf l , l = 1,2 , . . . , S ;
Wherein, tf lrepresent vision word w lat picture I ithe frequency of middle appearance, idf lrepresent inverse document frequency, for weighing vision word w limportance in whole pictures;
Cosine function cosine is adopted to calculate picture I in result for retrieval iwith the vision similarity s (I of sample picture Dk i, D k), be expressed as:
s ( I i , D k ) = x i · x D k | | x i | | | | x D k | | , k = 1,2 , . . . , K .
Further, described based on ballot decision-making mechanism, and determine that the votes of each pictures comprises according to the similarity of each pictures in described result for retrieval and described sample picture:
Each pictures in result for retrieval is carried out vision similarity calculating with each sample picture in each the sample picture in the positive sample picture set P in sample picture and negative sample picture set N respectively, if result of calculation is greater than threshold value, then this picture obtains a corresponding initial ballot, is expressed as:
vote ik + = 1 , s ( I i , P k ) > μ 0 , else , i = 1,2 , . . . M , k = 1,2 , . . . , K ;
vote ik - = 1 , s ( I i , N k ) > μ 0 , else , i = 1,2 , . . . M , k = 1,2 , . . . , K ;
Wherein, with represent picture I respectively icarry out vision similarity with a kth picture in a kth picture in positive sample picture set P and negative sample picture set N and calculate the rear initial positive and negative poll obtained, M represents the picture number in result for retrieval, and μ is default threshold value;
The initial ballot sum obtained according to each picture again confirms its final vote number, is expressed as:
r i + = 1 , Σ k = 1 K vote ik + ≥ K / 2 0 , else , i = 1,2 , . . . M ;
r i - = 1 , Σ k = 1 K vote ik + ≥ K / 2 0 , else , i = 1,2 , . . . M ;
Wherein, with represent picture I respectively ifinal positive votes and negative votes.
Further, the correlation probabilities in the described result for retrieval of described estimation between each pictures and inquiry request of input comprises:
If described sample picture comprises the set of positive sample picture and the set of negative sample picture, then picture I iand the correlation probabilities p between the inquiry request of input icomputing formula is:
p i = 1 , if r i + = 1 and r i - = 0 0 , else , i = 1,2 , . . . , M ;
If described sample picture only comprises the set of positive sample picture, then by each positive sample picture in the set of described positive sample picture as a sorter, recycling regression algorithm integrates the classification results of each sorter, obtains picture I iand the correlation probabilities p between the inquiry request of input i:
p i = exp ( Σ k = 1 K f k ( I i ) / K - 0.5 ) 1 + exp ( Σ k = 1 K f k ( I i ) / K - 0.5 ) , i = 1,2 , . . . , M ;
Wherein, f k(I i) represent that a kth sorter is to picture I iclassification results, when picture obtain one just vote time, f k(I i)=1, otherwise, f k(I i)=0.
Further, the Average Accuracy that the correlation probabilities in described integrated retrieval result between all pictures and the inquiry request of input calculates this image retrieval comprises:
Sample picture is comprised to the situation of the set of positive sample picture and the set of negative sample picture, the formula calculating the Average Accuracy of this image retrieval is:
EAP @ T = 1 Z T Σ i = 1 T rel ( i ) · 1 i Σ j = 1 i rel ( j ) ;
Wherein, T represent result for retrieval is resequenced after all pictures of T before rank, rel (i) is a two-valued function, represents that the picture when rank i-th is relevant to the inquiry request of input, then rel (i)=1, otherwise, rel (i)=0, i belongs to [1, T], j belongs to [1, i]; Z tit is a normalization coefficient;
Sample picture is only comprised to the situation of positive sample picture set, the formula calculating the Average Accuracy of this image retrieval is:
E ( AP @ T ) = E [ 1 Z T Σ i = 1 T rel ( i ) Σ j = 1 i rel ( j ) i ] ;
Wherein, E represents mathematical expectation.
As seen from the above technical solution provided by the invention, from result for retrieval, positive negative correlation sample picture is selected based on enhancement mode pseudo-linear filter technology, estimate the correlation probabilities of the every pictures in result for retrieval and inquiry request again based on ballot decision-making mechanism, last comprehensive all correlation probabilities calculate this Average Accuracy retrieved; Compared to existing method, the method complexity is lower, is convenient to realize, not easily affected by noise and can predict image retrieval performance comparatively accurately.
Accompanying drawing explanation
In order to be illustrated more clearly in the technical scheme of the embodiment of the present invention, below the accompanying drawing used required in describing embodiment is briefly described, apparently, accompanying drawing in the following describes is only some embodiments of the present invention, for those of ordinary skill in the art, under the prerequisite not paying creative work, other accompanying drawings can also be obtained according to these accompanying drawings.
The process flow diagram of a kind of efficient image retrieval performance prediction method that Fig. 1 provides for the embodiment of the present invention.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, be clearly and completely described the technical scheme in the embodiment of the present invention, obviously, described embodiment is only the present invention's part embodiment, instead of whole embodiments.Based on embodiments of the invention, those of ordinary skill in the art, not making the every other embodiment obtained under creative work prerequisite, belong to protection scope of the present invention.
Embodiment one
The efficient image retrieval performance prediction method of one that Fig. 1 provides for the embodiment of the present invention.As shown in Figure 1, the method mainly comprises the steps:
Step 11, based on pseudo-linear filter technology adaptive selection sample picture from result for retrieval.
Sample picture selects there is important effect in the present invention, and can have a strong impact on correlation probabilities and estimate this step.Pseudo-linear filter technology is a concept from text retrieval, and the document that its hypothesis has high rank in result for retrieval list is associated with the query often, thinks that the document that those come list end is incoherent with inquiry often simultaneously.
The embodiment of the present invention, directly select those pictures in result for retrieval list with high rank or low rank respectively as positive sample picture and negative sample picture by pseudo-linear filter technology, but those pictures selected may contain " noise ", determine that how many pictures should be selected also to be a difficult thing simultaneously.In order to address these problems, we mainly improve pseudo-linear filter application performance in the present invention from three aspects:
1) initial retrieval result reordering.
In general, image searching result is better, calculates selected picture truer by pseudo-linear filter.Therefore, we reorder to initial retrieval result, then adopt pseudo-linear filter technology to carry out selected positive and negative sample picture reordering on result for retrieval.In the embodiment of the present invention, arbitrary image retrieval method for reordering can adopt, such as Visual Rank, Bayesian Reranking method etc.
2) positive and negative sample picture is selected.
Usually, pseudo-linear filter technology can choose positive and negative sample simultaneously.In image retrieval, often more similar between picture associated with the query, but often vary between uncorrelated picture.In other words, huge with the incoherent number of pictures of inquiry, by the negative sample picture chosen a just part wherein, may comprise " noise " in negative sample picture simultaneously.By the further research to the effect of negative sample picture, we are to whether being taken into account that the present invention exists query.Therefore, in order to address this problem, we have proposed two kinds of strategies: rank in list of 1) simultaneously selecting to reorder is front K (top-K) and rank is that the picture of rear K (bottom-K) is respectively as the set of positive sample picture and the set of negative sample picture; 2) only select the picture of top-K as pseudo-positive example picture.
3) sample picture number is adaptively selected.
Because follow-up step needs to carry out based on the sample picture chosen, therefore sample picture number plays considerable effect.When K value is too large, positive sample pictures credit union introduces " noise " (negative sample); When K value is too small, selected positive sample picture set is not enough to well set forth user's inquiry.In other words, return results reasonable inquiry for those, we can compose a larger K value, and return results poor inquiry for those, and we just select a less K value.In order to address this problem, the embodiment of the present invention proposes a kind of method of adaptively selected sample number, namely CoS (Coherence Score, consistance score) method can be adopted to be respectively the adaptively selected K value of each inquiry, to be formulated as follows:
K * = max K ∈ [ L , M ] CoS ( K ) ;
Wherein, K *represent the optimal value of K, L and M represents minimum value and the maximal value of K value respectively; CoS (K), for meeting conforming picture to shared number percent in selected positive sample picture, is expressed as:
CoS ( K ) = 1 | K ( K - 1 ) | Σ i , j = 1 , . . . , K ; i ≠ j δ ( I i , I j ) ;
δ ( I i , I j ) = 1 , if s ( I i , I j ) > μ 0 , else ;
In above formula, δ (I i, I j) be similar to an impulse function, s (I i, I j) represent picture I iand I jbetween vision similarity; μ is default threshold value, and the similar value between the picture pair being generally set to data centralization 80% is all less than this threshold value.
Step 12, based on ballot decision-making mechanism, and determine the votes of each pictures according to the vision similarity of each pictures in described result for retrieval and described sample picture, and then estimate the correlation probabilities in described result for retrieval between each pictures and inquiry request of input.
Along with sample picture adaptively selected after, the present invention also needs to estimate the correlation probabilities of the inquiry request of every pictures in result for retrieval and input.At present, in image rearrangement sequence, this correlation probabilities estimation problem is regarded as a classification problem by many work; A lot of sorter, such as SVM (support vector machine, support vector machine) sorter, can training in above-mentioned selected positive and negative sample and obtain, then carrying out two classification (relevant or uncorrelated) to retrieving the every pictures returned.Be different from existing method, the embodiment of the present invention proposes a kind of simple voting mechanism efficiently, then designs two kinds of correlation probabilities methods of estimation.
1) vision similarity calculates.
The voting mechanism of the embodiment of the present invention is voted based on similarity, therefore, needs first to calculate the vision similarity of each pictures and described sample picture in result for retrieval.Specifically, the embodiment of the present invention adopts visual word bag model and vector space model all pictures to be expressed as one group of vector, comprise: adopt Dense SIFT (intensive sampling scale invariant feature switch technology) extract each picture SIFT feature (can according to the patch size of 16*16, step-length be 6 criterion extract); Then, K-means (clustering algorithm) is used all SIFT feature obtained to be polymerized to the code book comprising the individual visual word (vision word) of S (such as, S can be 1000); According to Nearest neighbor rule, all SIFT feature are quantized in corresponding vision word; Re-use TF-IDF weight mechanism and weigh the importance of each visual word in every pictures; Finally, every pictures is represented with vector space model;
Picture I ivector expression be x iin every dimensional vector be calculated as follows:
x l i = tf l · idf l , l = 1,2 , . . . , S ;
Wherein, tf lrepresent vision word w lat picture I ithe frequency of middle appearance, idf lrepresent inverse document frequency, for weighing vision word w limportance in whole pictures; Described whole pictures are the result for retrieval set that current searching system returns for several inquiry request inputted, and also can use typical Web353 data set or typical MSRA-MM_V1.0 data set etc.
Cosine function cosine is adopted to calculate picture I in result for retrieval iwith sample picture D k(its vector expression is x D k = [ x D 1 k , x D 2 k , . . . , x DS k ] , k = 1,2 , . . . , K ) Vision similarity s (I i, D k), be expressed as:
s ( I i , D k ) = x i · x D k | | x i | | | | x D k | | .
2) ballot decision-making mechanism.
Our hypothesis: picture associated with the query, should to inquire about picture concerned visually similar to other, and picture uncorrelated from other is different.Based on this hypothesis, we have proposed a simple voting mechanism efficiently.Every pictures in result for retrieval is carried out similarity-rough set with each picture in each picture in selected positive sample picture set P and negative sample picture set N by respectively, if similarity is between the two greater than a certain given threshold value, then this pictures obtains a corresponding initial ballot, is formulated as follows:
vote ik + = 1 , s ( I i , P k ) > μ 0 , else , i = 1,2 , . . . M , k = 1,2 , . . . , K ;
vote ik - = 1 , s ( I i , N k ) > μ 0 , else , i = 1,2 , . . . M , k = 1,2 , . . . , K ;
Wherein, with represent picture I respectively icarry out vision similarity with a kth picture in a kth picture in positive sample picture set P and negative sample picture set N and calculate the rear initial positive and negative poll obtained, M represents the picture number in result for retrieval, and μ is default threshold value.
Due to result for retrieval often more complicated, and may contain " noise " by the sample picture that pseudo-linear filter technology is selected, the ballot that some pictures obtain from these selected sample pictures may be invalid.In order to build a decision-making mechanism of voting efficiently, we add a decision-making mechanism here, it can be described as: if a pictures obtains most ballots (such as from selected sample picture, be at least half), so it just really obtains a ballot, is formulated as follows:
r i + = 1 , Σ k = 1 K vote ik + ≥ K / 2 0 , else , i = 1,2 , . . . M ;
r i - = 1 , Σ k = 1 K vote ik + ≥ K / 2 0 , else , i = 1,2 , . . . M ;
Wherein, with represent picture I respectively ifinal positive votes and negative votes.
The situation that presented hereinbefore is for comprising the set of positive sample picture and the set of negative sample picture in sample picture, applicable equally for the situation only comprising the set of positive sample picture in sample picture; That is, do not consider that negative sample (even if considering that its result is also 0) only calculates positive voting results when calculating, concrete formula please refer to foregoing teachings, repeats no more herein.
3) correlation probabilities is estimated.
After the every pictures in result for retrieval all obtains corresponding voting results, the embodiment of the present invention devises two kinds of methods to estimate the dependent probability of every pictures, is called rigid judgement and soft estimation.Here p is used ibe denoted as the correlation probabilities of the i-th pictures and given inquiry in retrieval list.
A) rigid judgement.
If described sample picture comprises the set of positive sample picture and the set of negative sample picture, be then applicable to the situation of rigid judgement.The embodiment of the present invention follows following hypothesis: picture concerned should vision similar with other picture concerned, simultaneously away from other irrelevant picture, thus our rigid decision method can be defined as follows:
p i = 1 , if r i + = 1 and r i - = 0 0 , else , i = 1,2 , . . . , M ;
As can be seen from the above equation, the present invention marks a pictures and if only if that its only obtains one and just votes for relevant.
B) soft estimation.
If described sample picture only comprises the set of positive sample picture, be then applicable to the situation of soft estimation.
We discuss in a step 11, and we know only may use the meeting of pseudo-positive example sample better, thus we only utilize pseudo-positive example picture to estimate correlation probabilities here.We use the thought of classification, by each positive sample picture in the set of positive sample picture alone as a sorter, then utilize a logistic regression algorithm to integrate the classification results of each sorter, are formulated as follows:
p i = exp ( Σ k = 1 K f k ( I i ) / K - 0.5 ) 1 + exp ( Σ k = 1 K f k ( I i ) / K - 0.5 ) , i = 1,2 , . . . , M ;
Wherein, f k(I i) represent that a kth sorter is to picture I iclassification results, when picture obtain one just vote time, f k(I i)=1, otherwise, f k(I i)=0.
Correlation probabilities in step 13, integrated retrieval result between all pictures and the inquiry request of input calculates the Average Accuracy of this image retrieval, realizes efficient image retrieval performance prediction.
According to judgement rigid in step 12 and the soft correlation probabilities estimating to obtain, the embodiment of the present invention also proposes two kinds of Average Accuracy computing method.
1) for rigid judgement, the correlation probabilities p that we obtain ivalue non-zero that is one, thus we directly use the formula calculating Average Accuracy, as follows:
EAP @ T = 1 Z T Σ i = 1 T rel ( i ) · 1 i Σ j = 1 i rel ( j ) ;
Wherein, T represents in step 11 all pictures carrying out T before rank after initial retrieval result reordering; Rel (i) is a two-valued function, represent the picture relevant to the inquiry request of input (judging according to rigid judgement) when rank i-th, then rel (i)=1, otherwise, rel (i)=0, rel (j) is similar with the implication of rel (i), difference is only that the value of i and j varies in size, and i belongs to [1, T], j belongs to [1, i]; Z tit is a normalization coefficient.
2) for soft estimation, the correlation probabilities p that we obtain ivalue belongs between 0 to 1, thus can not directly apply above-mentioned formula, and therefore we are from the angle of mathematical expectation, has carried out again deriving, shown in specific as follows to above-mentioned formula:
E ( AP @ T ) = E [ 1 Z T Σ i = 1 T rel ( i ) Σ j = 1 i rel ( j ) i ] = 1 Z T Σ i = 1 T Σ j = 1 i E [ rel ( i ) rel ( j ) ] i = 1 Z T Σ i = 1 T 1 i { E [ rel ( i ) 2 ] + Σ j = 1 i - 1 E [ rel ( i ) rel ( j ) ] } = 1 Z T Σ i = 1 T 1 i { p ( rel ( i ) = 1 ) + Σ j = 1 i - 1 p ( rel ( i ) = 1 , rel ( j ) = 1 ) } = 1 Z T Σ i = 1 T 1 i { p ( rel ( i ) = 1 ) + Σ j = 1 i - 1 p ( rel ( i ) = 1 ) p ( rel ( j ) = 1 ) }
E (AP@T) expects to obtain to EAP@T computational mathematics, is an approximate value; E represents mathematical expectation, and p represents the correlation probabilities between picture and the inquiry request of input, such as uses p ibe denoted as the correlation probabilities of the i-th pictures and given inquiry in retrieval list.
In the process of deriving above-mentioned formula, our supposition is between two pictures of the return-list diverse location probability relevant to given inquiry separate.
On the other hand, also many experiments has been carried out for the solution of the present invention.
1) twice experiment is carried out for relative coefficient, once experiment compares to adopt positive and negative sample picture and the relative coefficient comparing result (as shown in table 1) only using positive sample picture, and table 1 tables of data Benq is in only using the query performance prediction method of positive sample picture more effective; The relative coefficient comparative result (as shown in table 2) using the adaptively selected positive sample quantity shown in step 11 and existing fixed qty is compared in another experiment, and table 2 tables of data Benq is better in the method for adaptively selected positive sample number.In table 1, EAP_PN represents the Average Accuracy using positive and negative sample to estimate, EAP_P represents the Average Accuracy only using positive sample to estimate; In table 2, Fixed represents existing fixed qty, and Adaptive represents adaptively selected positive sample quantity of the present invention; Table 1 is with table 2, and Kendall ' s τ is Kendall's coefficient, and Pearson ' s r is Pearson's coefficient, and Spearman ' s ρ is Spearman coefficient; Three kinds of related coefficient balancing methods above, it is the recognised standard evaluating query performance prediction method, their span is all [-1,1] between, 1 represents best positive correlation, and-1 represents perfect negative correlation, and 0 represents completely uncorrelated, the larger expression degree of correlation of coefficient value is better, illustrates that coupling is between the two also better; Hard represents rigid decision method, and Soft represents soft estimation.
Table 1 adopts positive and negative sample picture and the relative coefficient comparative result only using positive sample picture
The relative coefficient comparative result of the adaptively selected positive sample quantity of table 2 and existing fixed qty
2) scheme of the solution of the present invention and prior art is utilized to contrast.Comparing result is as table 3-table 4.Wherein, table 3 is the contrast carried out based on typical Web353 data set, and table 4 is the contrast carried out based on typical MSRA-MM_V1.0 data set; Table 3 represents vision definition score with VCS in table 4, and COS represents consistance score, and RS represents representative score, and QAPE represents self-adaptation query performance prediction, and Ours represents the solution of the present invention; In table 4, Method represents control methods, NDCG: storage gain (Normalized Discounted Cumulative Gain) is lost in normalization, and the same with AP method of the present invention is also a kind of standard method weighing true result for retrieval quality.The comparing result of table 3 and table 4 shows method of the present invention, compares other method, has better effect in query performance prediction.
Table 3 is based on the comparing result of typical Web353 data set
Table 4 is based on the comparing result of typical MSRA-MM_V1.0 data set
Advantage and the good effect of the embodiment of the present invention are as follows:
(1) the present invention proposes a kind of brand-new query performance prediction method, compared to most methods in the past, this algorithm can the retrieval performance of the given inquiry searching system of automatic Prediction, and not only as an instruction, the value approaching to reality value namely estimated.
(2) the present invention proposes a kind of positive sample adaptive selection method, solve the problem how selecting positive sample number for difference inquiry.
(3) the present invention proposes two kinds of brand-new correlation probabilities methods of estimation.Existing method, returns vision hyperlink between picture by dissecting needle to given query and search, estimates the correlation probabilities of every pictures.Will analyze the similarity degree returned between all pictures compared to existing method, the method complexity that the present invention adopts is lower, is convenient to realize, and have benefited from simply voting efficiently decision-making mechanism, the method is more not easily affected by noise simultaneously.
Through the above description of the embodiments, those skilled in the art can be well understood to above-described embodiment can by software simulating, and the mode that also can add necessary general hardware platform by software realizes.Based on such understanding, the technical scheme of above-described embodiment can embody with the form of software product, it (can be CD-ROM that this software product can be stored in a non-volatile memory medium, USB flash disk, portable hard drive etc.) in, comprise some instructions and perform method described in each embodiment of the present invention in order to make a computer equipment (can be personal computer, server, or the network equipment etc.).
The above; be only the present invention's preferably embodiment, but protection scope of the present invention is not limited thereto, is anyly familiar with those skilled in the art in the technical scope that the present invention discloses; the change that can expect easily or replacement, all should be encompassed within protection scope of the present invention.Therefore, protection scope of the present invention should be as the criterion with the protection domain of claims.

Claims (6)

1. an efficient image retrieval performance prediction method, it is characterized in that, the method comprises:
Based on pseudo-linear filter technology adaptive selection sample picture from result for retrieval;
Based on ballot decision-making mechanism, and determine the votes of each pictures according to the vision similarity of each pictures in described result for retrieval and described sample picture, and then estimate the correlation probabilities in described result for retrieval between each pictures and inquiry request of input;
Correlation probabilities in integrated retrieval result between all pictures and the inquiry request of input calculates the Average Accuracy of this image retrieval, realizes efficient image retrieval performance prediction.
2. method according to claim 1, is characterized in that, describedly comprises based on pseudo-linear filter technology adaptive selection sample picture from result for retrieval:
Image retrieval method for reordering is utilized to resequence to result for retrieval, then based on pseudo-linear filter technology adaptive selection sample picture from the result for retrieval of rearrangement; Described sample picture comprises the set of positive sample picture and the set of negative sample picture, or only comprises the set of positive sample picture; It is the picture of front K that described positive sample pictures are combined into rank in the result for retrieval of rearrangement, and it is the picture of rear K that described negative sample pictures are combined into rank in the result for retrieval of rearrangement; Wherein, the numerical value of K is determined by the method for consistance score C oS, is expressed as:
K * = max K ∈ [ L , M ] CoS ( K ) ;
Wherein, K *represent the optimal value of K, L and M represents minimum value and the maximal value of K value respectively; CoS (K), for meeting conforming picture to shared number percent in selected positive sample picture, is expressed as:
CoS ( K ) = 1 | K ( K - 1 ) | Σ i , j = 1 , . . . , K ; i ≠ j δ ( I i , I j ) ;
δ ( I i , I j ) = 1 , if s ( I i , I j ) > μ 0 , else ;
In above formula, s (I i, I j) represent picture I iand I jbetween vision similarity, μ is default threshold value.
3. method according to claim 1 and 2, is characterized in that, the step calculating the vision similarity of each pictures and described sample picture in result for retrieval comprises:
Adopt visual word bag model and vector space model all pictures to be expressed as one group of vector, comprising: adopt intensive sampling scale invariant feature switch technology Dense SIFT to extract the SIFT feature of each picture; Then, use clustering algorithm K-means that all SIFT feature obtained are polymerized to the code book comprising S vision word visual word; According to Nearest neighbor rule, all SIFT feature are quantized on corresponding visual word; Re-use TF-IDF weight mechanism and weigh the importance of each visual word in every pictures; Finally, every pictures is represented with vector space model;
Picture I ivector expression be x iin every dimensional vector be calculated as follows:
x l i = tf l · i df l , l = 1,2 , . . . , S ;
Wherein, tf lrepresent vision word w lat picture I ithe frequency of middle appearance, idf lrepresent inverse document frequency, for weighing vision word w limportance in whole pictures;
Cosine function cosine is adopted to calculate picture I in result for retrieval iwith sample picture D kvision similarity s (I i, D k), be expressed as:
s ( I i , D k ) = x i · x D k | | x i | | | | x D k | | , k = 1,2 , . . . , K .
4. method according to claim 1 and 2, is characterized in that, described based on ballot decision-making mechanism, and determines that the votes of each pictures comprises according to the similarity of each pictures in described result for retrieval and described sample picture:
Each pictures in result for retrieval is carried out vision similarity calculating with each sample picture in each the sample picture in the positive sample picture set P in sample picture and negative sample picture set N respectively, if result of calculation is greater than threshold value, then this picture obtains a corresponding initial ballot, is expressed as:
vote ik + = 1 , s ( I i , P k ) > μ 0 , else , i = 1,2 , . . . M , k = 1,2 , . . . , K ;
vote ik - = 1 , s ( I i , N k ) > μ 0 , else , i = 1,2 , . . . M , k = 1,2 , . . . , K ;
Wherein, with represent picture I respectively icarry out vision similarity with a kth picture in a kth picture in positive sample picture set P and negative sample picture set N and calculate the rear initial positive and negative poll obtained, M represents the picture number in result for retrieval, and μ is default threshold value;
The initial ballot sum obtained according to each picture again confirms its final vote number, is expressed as:
r i + = 1 , Σ k = 1 K vote ik + ≥ K / 2 0 , else , i = 1,2 , . . . M ;
r i - = 1 , Σ k = 1 K vote ik - ≥ K / 2 0 , else , i = 1,2 , . . . M ;
Wherein, r i +with r i -represent picture I respectively ifinal positive votes and negative votes.
5. method according to claim 4, is characterized in that, the correlation probabilities in the described result for retrieval of described estimation between each pictures and inquiry request of input comprises:
If described sample picture comprises the set of positive sample picture and the set of negative sample picture, then picture I iand the correlation probabilities p between the inquiry request of input icomputing formula is:
p i = 1 , if r i + = 1 and r i - = 0 0 , else , i = 1,2 , . . . , M ;
If described sample picture only comprises the set of positive sample picture, then by each positive sample picture in the set of described positive sample picture as a sorter, recycling regression algorithm integrates the classification results of each sorter, obtains picture I iand the correlation probabilities p between the inquiry request of input i:
p i = exp ( Σ k = 1 K f k ( I i ) / K - 0.5 ) 1 + exp ( Σ k = 1 K f k ( I i ) / K - 0.5 ) , i = 1,2 , . . . , M ;
Wherein, f k(I i) represent that a kth sorter is to picture I iclassification results, when picture obtain one just vote time, f k(I i)=1, otherwise, f k(I i)=0.
6. method according to claim 5, is characterized in that, the Average Accuracy that the correlation probabilities in described integrated retrieval result between all pictures and the inquiry request of input calculates this image retrieval comprises:
Sample picture is comprised to the situation of the set of positive sample picture and the set of negative sample picture, the formula calculating the Average Accuracy of this image retrieval is:
EAP @ T = 1 Z T Σ i = 1 T rel ( i ) · 1 i Σ j = 1 i rel ( j ) ;
Wherein, T represent result for retrieval is resequenced after all pictures of T before rank, rel (i) is a two-valued function, represents that the picture when rank i-th is relevant to the inquiry request of input, then rel (i)=1, otherwise, rel (i)=0, i belongs to [1, T], j belongs to [1, i]; Z tit is a normalization coefficient;
Sample picture is only comprised to the situation of positive sample picture set, the formula calculating the Average Accuracy of this image retrieval is:
E ( AP @ T ) = E [ 1 Z T Σ i = 1 T rel ( i ) Σ j = 1 i rel ( j ) i ] ;
Wherein, E represents mathematical expectation.
CN201410685896.1A 2014-11-24 2014-11-24 High-efficiency prediction method for image retrieval performance Pending CN104331513A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410685896.1A CN104331513A (en) 2014-11-24 2014-11-24 High-efficiency prediction method for image retrieval performance

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410685896.1A CN104331513A (en) 2014-11-24 2014-11-24 High-efficiency prediction method for image retrieval performance

Publications (1)

Publication Number Publication Date
CN104331513A true CN104331513A (en) 2015-02-04

Family

ID=52406240

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410685896.1A Pending CN104331513A (en) 2014-11-24 2014-11-24 High-efficiency prediction method for image retrieval performance

Country Status (1)

Country Link
CN (1) CN104331513A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104731948A (en) * 2015-03-31 2015-06-24 北京奇虎科技有限公司 High-quality image search resource collection method and device
CN106202583A (en) * 2016-08-31 2016-12-07 北京交通大学 The visual signature optimization method of image and device
CN106776849A (en) * 2016-11-28 2017-05-31 西安交通大学 A kind of method and guide system to scheme quick-searching sight spot
CN107704509A (en) * 2017-08-31 2018-02-16 北京联合大学 A kind of method for reordering for combining stability region and deep learning
CN108780558A (en) * 2016-03-11 2018-11-09 电子湾有限公司 List is removed based on similitude
CN111475666A (en) * 2020-03-27 2020-07-31 深圳市墨者安全科技有限公司 Dense vector-based media accurate matching method and system
CN116662588A (en) * 2023-08-01 2023-08-29 山东省大数据中心 Intelligent searching method and system for mass data

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006023834A (en) * 2004-07-06 2006-01-26 Canon Inc Image processor and its control method, and program
CN103955543A (en) * 2014-05-20 2014-07-30 电子科技大学 Multimode-based clothing image retrieval method

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006023834A (en) * 2004-07-06 2006-01-26 Canon Inc Image processor and its control method, and program
CN103955543A (en) * 2014-05-20 2014-07-30 电子科技大学 Multimode-based clothing image retrieval method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
QIANGHUAI JIA等: "Query difficulty estimation via relevance prediction for image retrieval", 《SIGNAL PROCESSING》 *
XINMEI TIAN等: "Query Difficulty Prediction for Web Image Search", 《IEEE TRANSACTIONS ON MULTIMEDIA》 *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104731948A (en) * 2015-03-31 2015-06-24 北京奇虎科技有限公司 High-quality image search resource collection method and device
CN108780558A (en) * 2016-03-11 2018-11-09 电子湾有限公司 List is removed based on similitude
CN108780558B (en) * 2016-03-11 2023-05-30 电子湾有限公司 Removing lists based on similarity
CN106202583A (en) * 2016-08-31 2016-12-07 北京交通大学 The visual signature optimization method of image and device
CN106202583B (en) * 2016-08-31 2019-06-07 北京交通大学 The visual signature optimization method and device of image
CN106776849A (en) * 2016-11-28 2017-05-31 西安交通大学 A kind of method and guide system to scheme quick-searching sight spot
CN106776849B (en) * 2016-11-28 2020-01-10 西安交通大学 Method for quickly searching scenic spots by using pictures and tour guide system
CN107704509A (en) * 2017-08-31 2018-02-16 北京联合大学 A kind of method for reordering for combining stability region and deep learning
CN111475666A (en) * 2020-03-27 2020-07-31 深圳市墨者安全科技有限公司 Dense vector-based media accurate matching method and system
CN111475666B (en) * 2020-03-27 2023-10-10 深圳市墨者安全科技有限公司 Dense vector-based media accurate matching method and system
CN116662588A (en) * 2023-08-01 2023-08-29 山东省大数据中心 Intelligent searching method and system for mass data
CN116662588B (en) * 2023-08-01 2023-10-10 山东省大数据中心 Intelligent searching method and system for mass data

Similar Documents

Publication Publication Date Title
CN104331513A (en) High-efficiency prediction method for image retrieval performance
CN107862027B (en) Retrieve intension recognizing method, device, electronic equipment and readable storage medium storing program for executing
CN104199965B (en) Semantic information retrieval method
CN101853272B (en) Search engine technology based on relevance feedback and clustering
US9589277B2 (en) Search service advertisement selection
CN103810299B (en) Image retrieval method on basis of multi-feature fusion
US8108374B2 (en) Optimization framework for tuning ranking engine
US7743062B2 (en) Apparatus for selecting documents in response to a plurality of inquiries by a plurality of clients by estimating the relevance of documents
CN1725212A (en) Adaptation of exponential models
US20120253945A1 (en) Bid traffic estimation
CN104572631B (en) The training method and system of a kind of language model
CN103593353A (en) Information search method and display information sorting weight value determination method and device
CN110990533B (en) Method and device for determining standard text corresponding to query text
CN103123653A (en) Search engine retrieving ordering method based on Bayesian classification learning
CN108182182B (en) Method and device for matching documents in translation database and computer readable storage medium
CN106844788B (en) Library intelligent search sorting method and system
CN104156433A (en) Image retrieval method based on semantic mapping space construction
CN110083774B (en) Method and device for determining application recommendation list, computer equipment and storage medium
KR20190128246A (en) Searching methods and apparatus and non-transitory computer-readable storage media
CN105095311A (en) Method, device and system for processing promotion information
CN114139634A (en) Multi-label feature selection method based on paired label weights
CN113987161A (en) Text sorting method and device
CN103488782A (en) Method for recognizing musical emotion through lyrics
CN106776782B (en) Semantic similarity obtaining method and device based on artificial intelligence
CN114817655A (en) Cross-modal retrieval method, network training method, device, equipment and medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20150204

RJ01 Rejection of invention patent application after publication