CN102663447A - Cross-media searching method based on discrimination correlation analysis - Google Patents
Cross-media searching method based on discrimination correlation analysis Download PDFInfo
- Publication number
- CN102663447A CN102663447A CN2012101334886A CN201210133488A CN102663447A CN 102663447 A CN102663447 A CN 102663447A CN 2012101334886 A CN2012101334886 A CN 2012101334886A CN 201210133488 A CN201210133488 A CN 201210133488A CN 102663447 A CN102663447 A CN 102663447A
- Authority
- CN
- China
- Prior art keywords
- data
- image
- sigma
- text
- point set
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 60
- 238000010219 correlation analysis Methods 0.000 title claims abstract description 30
- 239000013598 vector Substances 0.000 claims abstract description 36
- 238000012360 testing method Methods 0.000 claims abstract description 20
- 230000001174 ascending effect Effects 0.000 claims abstract description 5
- 238000000605 extraction Methods 0.000 claims abstract description 5
- 238000006243 chemical reaction Methods 0.000 claims description 14
- 230000008569 process Effects 0.000 claims description 11
- 239000000284 extract Substances 0.000 claims description 7
- 239000011159 matrix material Substances 0.000 claims description 4
- 230000003247 decreasing effect Effects 0.000 claims description 2
- 230000001276 controlling effect Effects 0.000 claims 1
- 238000013507 mapping Methods 0.000 claims 1
- 230000001105 regulatory effect Effects 0.000 claims 1
- 230000009467 reduction Effects 0.000 abstract description 8
- 238000012549 training Methods 0.000 abstract description 4
- 230000009466 transformation Effects 0.000 abstract 2
- 230000006870 function Effects 0.000 description 15
- 238000004458 analytical method Methods 0.000 description 14
- 238000013480 data collection Methods 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- 239000012141 concentrate Substances 0.000 description 2
- 238000000513 principal component analysis Methods 0.000 description 2
- 238000012847 principal component analysis method Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 241001269238 Data Species 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- HUTDUHSNJYTCAR-UHFFFAOYSA-N ancymidol Chemical compound C1=CC(OC)=CC=C1C(O)(C=1C=NC=NC=1)C1CC1 HUTDUHSNJYTCAR-UHFFFAOYSA-N 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000011551 log transformation method Methods 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000000491 multivariate analysis Methods 0.000 description 1
- 238000003909 pattern recognition Methods 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a cross-media searching method based on discrimination correlation analysis. The method comprises the following steps of establishing a cross-media training database, carrying out feature extraction, mean-value pretreatment and linear projection transformation sequentially for different modal samples, and setting a target function according to a projection space; solving the target function to acquire a linear projection vector; establishing a cross-media test database; sequentially performing the feature extraction and mean-value pretreatment for a target to be searched; utilizing the linear projection vector to perform the linear projection transformation for the feature data after the mean-value pretreatment; and calculating an Euclidean distance between two modal data projection vectors, listing the Euclidean distance in ascending order, and acquiring a cross-media searching result. Due to the adoption of the method, dimensional reduction can be effectively performed for the feature data, so that the feature data can be widely applied to other multi-modal work, for example the multi-modal biological feature recognition.
Description
Technical field
The present invention relates to pattern-recognition and machine learning field, especially a kind ofly stride the medium search method based on what differentiate correlation analysis.
Background technology
In recent years, a large amount of multi-medium datas that occur present two tangible characteristics: high-dimensional property and polyphyly, for example same semantic concept can be represented by plurality of kinds of contents such as the literal on the network, picture, videos.In addition, the Internet user also mainly searches for needed information through text keyword, and this mainly is because search engine can't be understood the mutual relationship between the different modalities medium, thereby has limited the development of search engine.The characteristic dimensionality reduction has disclosed manifold structure and correlativity different modalities data between of high dimensional data in lower dimensional space, and in information retrieval, pattern classification, great function has been brought into play in fields such as information visualization.
The feature dimension reduction method of single mode data has a lot, and principal component analysis (Principal Component Analysis) projects to raw data on the principal direction with maximum variance; (Linear Discriminant Analysis LDA) is a kind of supervision dimension reduction method that has, and finds a projection subspace under the condition of classification information making full use of, and makes different classes of characteristic have optimum identification in linear discriminant analysis; Local linear embedding (Locally Linear Embedding) is a non-linear local reservation method the earliest, and the linear relationship of each data point and its arest neighbors data point is able to keep in projector space; LE (Laplacian Eigenmaps) has kept the distance of local two data points in projector space, LPP (Locality Preserving Projection) is its linear-apporximation algorithm; Multilayer own coding network (Multilayer Autoencoder Network) is the nonlinear stretch of principal component analysis method.Have research work to point out, though nonlinear method treatment of simulated data performance is fine, but not necessarily the principal component analysis method than traditional is good for real data, and more than these methods of mentioning all can not directly apply to the multi-modal medium retrieval of striding.
The feature dimension reduction method research of multi-modal data is not a lot; Canonical correlation analysis (Canonical Correlation Analysis; CCA) be wherein the most famous multivariate data analysis method; It to same subspace, makes multi-modal data difference linear projection multi-modal variable have maximum correlation; Relevant with typical linear different, PLS (Partial Least Square) makes multi-modal variable have maximum covariance in projector space; Under the inspiration of multilayer own coding network, multi-modal degree of depth learning network is suggested and is the common expression of different modalities data study.In a word; Above method more is to be that target removes to seek projector space with the correlativity that maximizes multi-modal variable; And ignored the identification that maximizes different classes of data in the multi-modal data, and identification is often extremely important in multi-modal data retrieval and classification task.
Summary of the invention
Existing multi-modal data analysing method is not generally considered the identification of data; The invention provides a kind of based on differentiating correlation analysis (Discriminant Correlation Analysis; DCA) method; It has merged the thought of canonical correlation analysis and linear discriminant analysis, optimizes the identification of multiple modalities correlation of data and different classes of data simultaneously.
Proposed by the invention a kind ofly stride the medium search method, it is characterized in that this method may further comprise the steps based on what differentiate correlation analysis:
Step 1 is set up and to be comprised right the striding the medium tranining database and extract the proper vector of different modalities sample in this database of image and text one to one, obtains corresponding characteristic point set;
Step 2, the characteristic point set to image and two mode of text carries out the average pre-service respectively, makes that the average of characteristic point set of each mode is 0;
Step 3 will be passed through the pretreated characteristic point set of average and carried out the linear projection conversion, and set an objective function about the linear projection variable according to the projector space that obtains;
Step 4, use characteristic value solving method is found the solution said objective function, obtains linear projection vector a and b;
Step 5, set up comprise image and text one to one right stride the medium test database;
Step 6 is imported object to be retrieved, and extracts the proper vector of object to be retrieved respectively and stride in the medium test database characteristic point set that belongs to the object set of different modalities with object to be retrieved;
Step 7, proper vector and characteristic point set that step 6 is obtained carry out said average pre-service respectively;
Step 8, the linear projection vector a and the b that use said step 4 to obtain carry out the linear projection conversion respectively to process pretreated proper vector of average and characteristic point set;
Step 9; Calculate the Euclidean distance between the projection variable of projection variable and object set of object to be retrieved; And all Euclidean distances are carried out ascending sort, preceding n the corresponding object data of Euclidean distance promptly is the object of striding another mode relevant with image to be retrieved that retrieval obtains in the medium test database said.
The inventive method can be carried out dimensionality reduction effectively to characteristic, thereby is widely used in other a lot of multi-modal work, discerns such as multi-modal biological characteristic.Experiment showed, the inventive method in striding medium retrievals than canonical correlation analysis, and the simple combination performance of canonical correlation analysis and linear discriminant analysis all will be got well.
Description of drawings
Fig. 1 is the realization flow figure of the inventive method;
Fig. 2 be the inventive method on a simulated data collection with the comparing result of other correlation techniques.
Embodiment
For making the object of the invention, technical scheme and advantage clearer, below in conjunction with specific embodiment, and with reference to accompanying drawing, to further explain of the present invention.
Fig. 1 is the realization flow figure of the inventive method; As shown in Figure 1, proposed by the invention a kind ofly comprise training process (Fig. 1 (a)) and test process (Fig. 1 (b) and (c)), particularly based on the medium search method of striding of differentiating correlation analysis; Fig. 1 (a) is for utilizing among the present invention image text in the tranining database to study projection vector a; The process flow diagram of b, shown in Fig. 1 (a), training process of the present invention may further comprise the steps:
Step 1 is set up and to be comprised right the striding the medium tranining database and extract the proper vector of different modalities sample in this database of image and text one to one, obtains corresponding characteristic point set.
The present invention at first sets up image and text is striden the medium tranining database one to one; Use yardstick invariant features conversion (Scale-Invariant Feature Transform then respectively; SIFT) algorithm and latent Di Lei Cray distribute, and (Latent Dirichlet Allocation, LDA) algorithm carries out feature extraction to image and text.
Step 2, the characteristic point set to image and two mode of text carries out the average pre-service respectively, makes that the average of characteristic point set of each mode is 0:
x←x-E(x) (1)
y←y-E(y)
Wherein, x and y are two given mode characteristic point sets, and such as image and text characteristic of correspondence data acquisition, its corresponding respectively data point set is { x
1... x
nAnd { y
1... y
n, the data in each data point set belong to k common classification respectively
E (x), E (y) is the average of original set of data points.
Step 3; To pass through pretreated image of average and text feature data point set carries out the linear projection conversion and obtains projector space; Set an objective function according to said projector space, this objective function is the objective function about the linear projection variable that is used to carry out the linear projection conversion.
Given projection vector a and b, variable set x and y that two mode characteristics of image and text point set is corresponding carry out the linear projection conversion, obtain respective projection variable u and v:
u=a
Tx (2)
v=b
Ty
The step of the said projector space target setting function that conversion obtains according to linear projection further may further comprise the steps:
Step 3.1, the covariance cov of projection variable u and v in the calculating projector space (u, v):
Wherein, ∑ defines the eigenmatrix of covariance for this reason.
Step 3.2, computed image and the inter-class variance of two mode characteristics of text point set in projector space and a type internal variance σ
BAnd σ
W:
Wherein, n representes the number of each data point intensive data, n
mThe number of representing the data of m class in each data point set, k are the number of classification, ω
mThe average of representing the concentrated m class data of two data points:
Be brought into formula (4) and (5), then σ to projection formula (2)
BAnd σ
WCan be rewritten as:
Wherein, S
BAnd S
WBe called " the hash matrix between type " and " hash matrix in type " of multi-modal data, be respectively:
Wherein, E
m(x) and E
m(y) be the average that raw data points is concentrated m class data respectively, C
mRepresent m class data set:
Step 3.3, according to the covariance cov that calculates (u, v), inter-class variance σ
BWith class internal variance σ
WThe target setting function.
The objective function that the present invention differentiates correlation analysis is defined as:
Wherein, σ
BAnd σ
WBe respectively " inter-class variance " and " type internal variance " of two data points collection in projector space, (u v) is the covariance of variable u and v in the projector space to cov, and μ is the adjusting parameter, and it is controlling σ
BAnd cov (u, relative weighting v).
Step 4, use characteristic value solving method is found the solution said objective function, is finally learnt the linear projection vector a and the b that obtain.
In order to find the solution said objective function, need convert said objective function into a generalized eigenvalue problem:
At first define f=(a, b), then objective function (12) can be rewritten as:
Can see that objective function (13) is very similar with the objective function of linear discriminant analysis, adopt lagrange's method of multipliers promptly can convert a generalized eigenvalue problem to (13) into, be shown below:
(μS
B+(1-μ)∑)f=λS
Wf (14)
Find the solution the eigenwert and the proper vector of (14); And arrange proper vector again according to the order that eigenwert is successively decreased; Get linear projection vector a and b that big eigenwert characteristic of correspondence vector obtains as final study; The linear projection vector a and the b that promptly utilize said study to obtain carry out the linear projection conversion respectively to multi-modal characteristic point set, can realize the dimensionality reduction to said multi-modal characteristic point set.
Step 5, set up comprise image and text one to one right stride the medium test database.
Fig. 1 (b) is for concentrating the process flow diagram of retrieving the text relevant with image at text data among the present invention; Fig. 1 (c) is for concentrating the process flow diagram of retrieving the image relevant with text in view data among the present invention; Shown in Fig. 1 (b) and Fig. 1 (c), test process of the present invention may further comprise the steps:
Step 6 is imported object to be retrieved, and extracts the proper vector of object to be retrieved respectively and stride in the medium test database characteristic point set that belongs to the object set of different modalities with object to be retrieved.
In this step; Similar with step 1; (Scale-Invariant Feature Transform, SIFT) algorithm and latent Di Lei Cray distribute, and (Latent Dirichlet Allocation, LDA) algorithm carries out feature extraction to image and text to use the conversion of yardstick invariant features respectively.
For instance, when needs were retrieved a series of text object relevant with certain image, object to be retrieved was an image, extracts the SIFT proper vector x of image respectively
iLDA characteristic point set with test database Chinese version data set
Wherein, N is the number of test database Chinese version data.
Step 7, similar with said step 2, proper vector and characteristic point set that step 6 is obtained carry out the average pre-service respectively.
Step 8, the linear projection vector a and the b that use said step 4 to obtain carry out the linear projection conversion respectively to process pretreated proper vector of average and characteristic point set, so that the pretreated characteristic of process average is carried out dimensionality reduction.
The linear projection vector a and the b that use said step 4 to obtain are with the SIFT proper vector x of image
iLDA characteristic set with test database Chinese version data set
Carry out the linear projection conversion respectively, obtain respective projection variable u
iWith
u
i=a
Tx
i
(15)
Step 9; Calculate the Euclidean distance between the projection variable of projection variable and object set of object to be retrieved; And all Euclidean distances are carried out ascending sort, preceding n the corresponding object data of Euclidean distance promptly is the object of striding another mode relevant with image to be retrieved that retrieval obtains in the medium test database said.
If object to be retrieved is an image; In this step; The Euclidean distance between the projection variable of each text data in the projection variable of computed image and the test database at first; And all Euclidean distances are carried out ascending sort, preceding n the corresponding text data of Euclidean distance promptly is the text object relevant with image to be retrieved that retrieval obtains.Here, result for retrieval quantity n can be set up on their own by the user as required.
What need to specify is, except the retrieval of cross-module attitude, the inventive method also may be used on other anyly need carry out dimension-reduction treatment to carry out the field of feature identification to multi-modal data, discerns such as multi-modal biological characteristic.
Prove that with the test result on simulated data collection and the True Data the inventive method is superior to the combination of canonical correlation analysis, linear discriminant analysis and canonical correlation analysis and linear discriminant analysis respectively below.
Simulated data collection instance is as shown in Figure 1, has generated two two-dimentional point sets among Fig. 1 (a), and asterism (the 1st type) is a point set with crunode (the 2nd type), and square frame (the 1st type) is the another one point set with rhombus (the 2nd type), and these two point sets belong to 2 types respectively; (b) provided the projection result of canonical correlation analysis (CCA) on simulated data; Though these two point sets are very relevant; But they but have a large amount of overlapping regions on low dimension projector space (here data projection to transverse axis), and the projecting direction that obtains of canonical correlation analysis does not have identification thus; (c) provided the projection result of linear discriminant analysis (LDA) on simulated data, though two types after the projection have good identification, the correlativity of two point sets after the projection is very poor; (d) provided the result of linear discriminant analysis (LDA) and a kind of combination of canonical correlation analysis (CCA), promptly earlier each point set has been done linear discriminant analysis, and then do canonical correlation analysis, the result who obtains is with directly to do canonical correlation analysis (b) closely similar; (e) provided the result of linear discriminant analysis (LDA) with other a kind of combination of canonical correlation analysis (CCA); Promptly earlier two point sets are done canonical correlation analysis; And then doing linear discriminant analysis, its result seems more similar with the result (g) of the inventive method (DCA), yet carries out obtaining after the log-transformation (f) and (h) to two results (e) and transverse axis coordinate (g); The result that can see canonical correlation analysis and linear discriminant analysis combination is linear inseparable on horizontal axis; Like P data point in (f) and Q data point, and the result of the inventive method is a linear separability, has explained that the inventive method has more identification.
On an image text data set, tested the performance of differentiating correlation analysis below the True Data collection instance; It is right that this data set comprises 2866 image texts; It is right that wherein training set has 2173 image texts; Test set has 693 image texts right, and each image text belongs to a certain type in following 10 types: art, biology, geography, history, literature, medium, music, royal family, physical culture, military affairs to a class label is arranged.Wherein, image adopts the SIFT characteristic of 128 dimensions, and text adopts the LDA text semantic characteristic of 10 dimensions.Project to the characteristic of these two types of data with two kinds of combinations differentiating correlation analysis, canonical correlation analysis and canonical correlation analysis and linear discriminant analysis the lower dimensional space of 9 dimensions then respectively; In this 9 dimension space, carry out cross-module attitude retrieval tasks; Promptly concentrate the retrieval image relevant, perhaps concentrate the retrieval text relevant with certain image at text data with certain text in view data.The result of cross-module attitude retrieval measures with mean accuracy (MAP, mean average precision), and mean accuracy is the bigger the better, and the mean accuracy here is meant the mean value of each query and search precision.Table 1 has provided the classification results of four kinds of algorithms, can see, differentiates correlation analysis and is superior to additive method.
Table 1
Method | Image is as test data | Text is as test data |
DCA | 0.2108 | 0.2482 |
CCA | 0.2032 | 0.2032 |
CCA+LDA | 0.2020 | 0.2011 |
LDA+CCA | 0.2031 | 0.2034 |
Above-described specific embodiment; The object of the invention, technical scheme and beneficial effect have been carried out further explain, and institute it should be understood that the above is merely specific embodiment of the present invention; Be not limited to the present invention; All within spirit of the present invention and principle, any modification of being made, be equal to replacement, improvement etc., all should be included within protection scope of the present invention.
Claims (10)
1. stride the medium search method based on what differentiate correlation analysis for one kind, it is characterized in that this method may further comprise the steps:
Step 1 is set up and to be comprised right the striding the medium tranining database and extract the proper vector of different modalities sample in this database of image and text one to one, obtains corresponding characteristic point set;
Step 2, the characteristic point set to image and two mode of text carries out the average pre-service respectively, makes that the average of characteristic point set of each mode is 0;
Step 3 will be passed through the pretreated characteristic point set of average and carried out the linear projection conversion, and set an objective function about the linear projection variable according to the projector space that obtains;
Step 4, use characteristic value solving method is found the solution said objective function, obtains linear projection vector a and b;
Step 5, set up comprise image and text one to one right stride the medium test database;
Step 6 is imported object to be retrieved, and extracts the proper vector of object to be retrieved respectively and stride in the medium test database characteristic point set that belongs to the object set of different modalities with object to be retrieved;
Step 7, proper vector and characteristic point set that step 6 is obtained carry out said average pre-service respectively;
Step 8, the linear projection vector a and the b that use said step 4 to obtain carry out the linear projection conversion respectively to process pretreated proper vector of average and characteristic point set;
Step 9; Calculate the Euclidean distance between the projection variable of projection variable and object set of object to be retrieved; And all Euclidean distances are carried out ascending sort, preceding n the corresponding object data of Euclidean distance promptly is the object of striding another mode relevant with image to be retrieved that retrieval obtains in the medium test database said.
2. method according to claim 1 is characterized in that, in the said step 1 and 6, uses yardstick invariant features mapping algorithm and latent Di Lei Cray Distribution Algorithm that image and text are carried out feature extraction respectively.
3. method according to claim 1 is characterized in that, the linear projection map table in the said step 3 is shown:
u=a
Tx
,
V=b
TY wherein, x and y are respectively the set of the corresponding variable of two mode characteristics of image and text point set, a and b are respectively corresponding projection vector, u and v pass through the projection variable that the linear projection conversion obtains.
4. method according to claim 3 is characterized in that, further may further comprise the steps according to the step of the projector space target setting function that obtains:
Step 3.1, and the covariance cov of projection variable u and v in the calculating projector space (u, v);
Step 3.2, computed image and the inter-class variance of two mode characteristics of text point set in projector space and a type internal variance σ
BAnd σ
W
Step 3.3, according to the covariance cov that calculates (u, v), inter-class variance σ
BWith class internal variance σ
WThe target setting function.
5. method according to claim 4 is characterized in that, in the said step 3.1, the covariance cov of projection variable u and v (u v) is expressed as:
Wherein, ∑ defines the eigenmatrix of covariance for this reason.
6. method according to claim 4 is characterized in that, in the said step 3.2, and a said inter-class variance and a type internal variance σ
BAnd σ
WBe expressed as:
Wherein, S
BAnd S
W" the hash matrix between type " and " hash matrix in type " that is called multi-modal data:
Wherein, n representes the number of each data point intensive data, n
mThe number of representing the data of m class in each data point set, k are the number of classification,
C
mRepresent m class data set, E
m(x) and E
m(y) be the average that raw data points is concentrated m class data respectively.
7. method according to claim 4 is characterized in that, said objective function is defined as:
Wherein, μ is for regulating parameter, and it is controlling σ
BAnd cov (u, relative weighting v).
8. method according to claim 1 is characterized in that, in the said step 4, the step that use characteristic value solving method is found the solution said objective function further may further comprise the steps:
At first, (a b), rewrites said objective function to definition f=;
Then, the objective function after adopting lagrange's method of multipliers to rewrite convert into one can try to achieve generalized eigenvalue equality;
At last, find the solution the eigenwert and the proper vector of this equality, and arrange proper vector again, get linear projection vector a and b that big eigenwert characteristic of correspondence vector obtains as final study according to the order that eigenwert is successively decreased.
9. method according to claim 1 is characterized in that, the object to be retrieved in the said step 6 is image or text.
10. method according to claim 1 is characterized in that, result for retrieval quantity n is set up on their own by the user as required in the said step 9.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210133488.6A CN102663447B (en) | 2012-04-28 | 2012-04-28 | Cross-media searching method based on discrimination correlation analysis |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210133488.6A CN102663447B (en) | 2012-04-28 | 2012-04-28 | Cross-media searching method based on discrimination correlation analysis |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102663447A true CN102663447A (en) | 2012-09-12 |
CN102663447B CN102663447B (en) | 2014-04-23 |
Family
ID=46772930
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201210133488.6A Active CN102663447B (en) | 2012-04-28 | 2012-04-28 | Cross-media searching method based on discrimination correlation analysis |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102663447B (en) |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103793507A (en) * | 2014-01-26 | 2014-05-14 | 北京邮电大学 | Method for obtaining bimodal similarity measure with deep structure |
CN103838836A (en) * | 2014-02-25 | 2014-06-04 | 中国科学院自动化研究所 | Multi-modal data fusion method and system based on discriminant multi-modal deep confidence network |
CN103995903A (en) * | 2014-06-12 | 2014-08-20 | 武汉科技大学 | Cross-media search method based on isomorphic subspace mapping and optimization |
CN104077319A (en) * | 2013-03-29 | 2014-10-01 | 南京邮电大学 | Method for annotating images on basis of measurement of difference among non-Euclidean spaces |
CN103870500B (en) * | 2012-12-14 | 2017-05-24 | 联想(北京)有限公司 | Searching method and searching device |
CN107220475A (en) * | 2016-11-01 | 2017-09-29 | 重庆交通大学 | A kind of bearing features data analysing method based on linear discriminant analysis |
CN107657008A (en) * | 2017-09-25 | 2018-02-02 | 中国科学院计算技术研究所 | Across media training and search method based on depth discrimination sequence study |
CN108399414A (en) * | 2017-02-08 | 2018-08-14 | 南京航空航天大学 | Method of Sample Selection and device |
CN109213876A (en) * | 2018-08-02 | 2019-01-15 | 宁夏大学 | Based on the cross-module state search method for generating confrontation network |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5647058A (en) * | 1993-05-24 | 1997-07-08 | International Business Machines Corporation | Method for high-dimensionality indexing in a multi-media database |
CN101021849A (en) * | 2006-09-14 | 2007-08-22 | 浙江大学 | Transmedia searching method based on content correlation |
CN101334796A (en) * | 2008-02-29 | 2008-12-31 | 浙江师范大学 | Personalized and synergistic integration network multimedia search and enquiry method |
CN101996191A (en) * | 2009-08-14 | 2011-03-30 | 北京大学 | Method and system for searching for two-dimensional cross-media element |
-
2012
- 2012-04-28 CN CN201210133488.6A patent/CN102663447B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5647058A (en) * | 1993-05-24 | 1997-07-08 | International Business Machines Corporation | Method for high-dimensionality indexing in a multi-media database |
CN101021849A (en) * | 2006-09-14 | 2007-08-22 | 浙江大学 | Transmedia searching method based on content correlation |
CN101334796A (en) * | 2008-02-29 | 2008-12-31 | 浙江师范大学 | Personalized and synergistic integration network multimedia search and enquiry method |
CN101996191A (en) * | 2009-08-14 | 2011-03-30 | 北京大学 | Method and system for searching for two-dimensional cross-media element |
Cited By (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103870500B (en) * | 2012-12-14 | 2017-05-24 | 联想(北京)有限公司 | Searching method and searching device |
CN104077319B (en) * | 2013-03-29 | 2018-03-06 | 南京邮电大学 | Image labeling method based on non-Euclidean space Diversity measure |
CN104077319A (en) * | 2013-03-29 | 2014-10-01 | 南京邮电大学 | Method for annotating images on basis of measurement of difference among non-Euclidean spaces |
CN103793507B (en) * | 2014-01-26 | 2016-10-05 | 北京邮电大学 | A kind of method using deep structure to obtain bimodal similarity measure |
CN103793507A (en) * | 2014-01-26 | 2014-05-14 | 北京邮电大学 | Method for obtaining bimodal similarity measure with deep structure |
CN103838836A (en) * | 2014-02-25 | 2014-06-04 | 中国科学院自动化研究所 | Multi-modal data fusion method and system based on discriminant multi-modal deep confidence network |
CN103838836B (en) * | 2014-02-25 | 2016-09-28 | 中国科学院自动化研究所 | Based on discriminant multi-modal degree of depth confidence net multi-modal data fusion method and system |
CN103995903B (en) * | 2014-06-12 | 2017-04-12 | 武汉科技大学 | Cross-media search method based on isomorphic subspace mapping and optimization |
CN103995903A (en) * | 2014-06-12 | 2014-08-20 | 武汉科技大学 | Cross-media search method based on isomorphic subspace mapping and optimization |
CN107220475A (en) * | 2016-11-01 | 2017-09-29 | 重庆交通大学 | A kind of bearing features data analysing method based on linear discriminant analysis |
CN108399414A (en) * | 2017-02-08 | 2018-08-14 | 南京航空航天大学 | Method of Sample Selection and device |
CN108399414B (en) * | 2017-02-08 | 2021-06-01 | 南京航空航天大学 | Sample selection method and device applied to cross-modal data retrieval field |
CN107657008A (en) * | 2017-09-25 | 2018-02-02 | 中国科学院计算技术研究所 | Across media training and search method based on depth discrimination sequence study |
CN109213876A (en) * | 2018-08-02 | 2019-01-15 | 宁夏大学 | Based on the cross-module state search method for generating confrontation network |
CN109213876B (en) * | 2018-08-02 | 2022-12-02 | 宁夏大学 | Cross-modal retrieval method based on generation of countermeasure network |
Also Published As
Publication number | Publication date |
---|---|
CN102663447B (en) | 2014-04-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102663447B (en) | Cross-media searching method based on discrimination correlation analysis | |
CN107515895B (en) | Visual target retrieval method and system based on target detection | |
Chen et al. | Model Metric Co-Learning for Time Series Classification. | |
Gao et al. | Database saliency for fast image retrieval | |
Rodríguez-Serrano et al. | A model-based sequence similarity with application to handwritten word spotting | |
Zhao et al. | Spectral feature selection for data mining | |
CN101187927B (en) | Criminal case joint investigation intelligent analysis method | |
CN103049526B (en) | Based on the cross-media retrieval method of double space study | |
CN103473327A (en) | Image retrieval method and image retrieval system | |
CN105930873B (en) | A kind of walking across mode matching method certainly based on subspace | |
CN106250925B (en) | A kind of zero Sample video classification method based on improved canonical correlation analysis | |
CN103559191A (en) | Cross-media sorting method based on hidden space learning and two-way sorting learning | |
CN105740378B (en) | Digital pathology full-section image retrieval method | |
CN106844481B (en) | Font similarity and font replacement method | |
Trstenjak et al. | Determining the impact of demographic features in predicting student success in Croatia | |
CN103778206A (en) | Method for providing network service resources | |
WO2013159356A1 (en) | Cross-media searching method based on discrimination correlation analysis | |
CN103177121B (en) | Add the locality preserving projections method of Pearson correlation coefficient | |
CN108319959A (en) | A kind of corps diseases image-recognizing method compressed based on characteristics of image with retrieval | |
CN102831161A (en) | Semi-supervision sequencing study method for image searching based on manifold regularization | |
CN104143088A (en) | Face identification method based on image retrieval and feature weight learning | |
Ahsan et al. | Clustering social event images using kernel canonical correlation analysis | |
Eravci et al. | Diversity based relevance feedback for time series search | |
Cao et al. | Research on dynamic time warping multivariate time series similarity matching based on shape feature and inclination angle | |
CN103049570B (en) | Based on the image/video search ordering method of relevant Preserving map and a sorter |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant |