CN103678694A - Method and system for establishing reverse index file of video resources - Google Patents
- Publication number
- CN103678694A (application number CN201310739955.4A)
- Authority
- CN
- China
- Prior art keywords
- keyword
- information
- file
- video
- dictionary
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/31—Indexing; Data structures therefor; Storage structures
- G06F16/316—Indexing structures
- G06F16/319—Inverted lists
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/78—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/7867—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, title and artist information, manually generated time, location and usage information, user ratings
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Library & Information Science (AREA)
- Multimedia (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a method and system for establishing a reverse index file of video resources. The method includes the first step of carrying out word segmentation processing on video file information in a preset word segmentation mode to obtain keywords, the second step of establishing an index relation between the keywords and the video file information containing the keywords so as to establish the reverse index file of a video file. According to the method and system for establishing the reverse index file of the video resources, index efficiency on mass video data can be improved.
Description
Technical field
The present invention relates to information retrieval technology, and in particular to a method and system for establishing an inverted index file of video resources.
Background art
With the development of science and technology, more and more users search for and watch videos over the Internet. Because the video information available on the Internet is abundant and is constantly changing and being updated, a variety of search engines have emerged for video information retrieval.
In a relational database system, an index is the most efficient way to retrieve data. For a web-wide video search engine, however, a database index cannot meet the following special requirements:
(1) A search engine faces the massive video data of the entire web. For example, the search engine indexes of large video websites such as LeTV cover hundreds of millions or even hundreds of billions of web pages. With video data on this scale, a database system is difficult to manage effectively.
(2) The data operations a search engine uses are simple. In general only insert, delete, update, and query are needed, and the data has a specific format, so simple and efficient applications can be designed for these uses. A general-purpose database system supports a large and comprehensive feature set at the cost of speed and space.
(3) A search engine faces a large volume of user queries. This requires that as much of the computation-heavy work as possible be done when the index is built, so that retrieval involves as little computation as possible. A general-purpose database system struggles to handle so many user requests and cannot meet the requirements on retrieval response time and retrieval concurrency.
In summary, the data indexing schemes of the prior art cannot meet the requirements of massive video information in terms of data volume, response time, and efficiency, so an improved technical solution is needed to solve these problems.
Summary of the invention
The main purpose of the present invention is to provide a method and system for establishing an inverted index file of video resources, so as to solve the problem in the prior art that searching massive data is slow and inefficient, wherein:
According to one aspect of the present invention, a method for establishing an inverted index file of video resources is provided, comprising: performing word segmentation on video file information in a preset word segmentation mode to obtain keywords; and establishing an index relationship between each keyword and the video file information containing that keyword, thereby establishing the inverted index file of the video files.
The method further comprises: providing a dictionary whose data sources comprise a basic dictionary, a video copyright dictionary, and user-generated content. The step of performing word segmentation on video file information in a preset word segmentation mode comprises: performing word segmentation on the video file information according to the dictionary and in the preset word segmentation mode.
The word segmentation modes comprise: bigram segmentation, maximum matching, and statistical methods.
The step of establishing the index relationship between each keyword and the video file information containing that keyword comprises: recording and storing index information of the keyword, the index information comprising identification information of the video files containing the keyword, position information of the keyword's occurrences, and frequency information of the keyword's occurrences; and establishing an association between the keyword and its index information.
The method further comprises: collecting statistics on the retrieval results obtained from the inverted index file, and moving keywords whose search frequency exceeds a set threshold to the beginning of the inverted index file.
According to another aspect of the present invention, a system for establishing an inverted index file is also provided, comprising: a keyword acquisition module for performing word segmentation on video file information in a preset word segmentation mode to obtain keywords; and an inverted index building module for establishing an index relationship between each keyword and the video file information containing that keyword, thereby establishing the inverted index file.
The system further comprises: a dictionary maintenance module for establishing and maintaining a dictionary whose data sources comprise a basic dictionary, a video copyright library, and user-generated content. The keyword acquisition module performs word segmentation on the video file information according to the dictionary and in the preset word segmentation mode.
The word segmentation modes comprise: bigram segmentation, maximum matching, and statistical methods.
The inverted index building module comprises: a recording module for recording and storing index information of the keyword, the index information comprising identification information of the video files containing the keyword, position information of the keyword's occurrences, and frequency information of the keyword's occurrences; and an association building module for establishing an association between the keyword and its index information.
The system further comprises: a retrieval result statistics module for collecting statistics on the retrieval results obtained from the inverted index file; and a processing module for moving keywords whose search frequency exceeds a set threshold to the beginning of the inverted index file.
According to the technical solution of the present invention, keywords are obtained by performing word segmentation on video file information, and an index relationship is established between each keyword and the video file information containing it, thereby establishing the inverted index file. When a user searches for video files by keyword, the corresponding information can be provided quickly and accurately.
Brief description of the drawings
The drawings described here provide a further understanding of the present invention and form part of this application. The illustrative embodiments of the invention and their descriptions explain the invention and do not unduly limit it. In the drawings:
Fig. 1 is a flowchart of the method for establishing an inverted index file according to an embodiment of the present invention;
Fig. 2 is a structural block diagram of the system for establishing an inverted index file according to one embodiment of the present invention;
Fig. 3 is a structural block diagram of the system for establishing an inverted index file according to another embodiment of the present invention.
Detailed description of the embodiments
An ordinary index is a forward index, which determines attribute values from a record. An inverted index works the other way round: it determines the location of records from an attribute value, hence the name. The present invention is intended for storing and retrieving the video resources of a video website that holds massive video resources. An inverted index from words to documents is built over the documents of the whole web (video files on the Internet), so that when a user queries documents (web pages) with a keyword, the system returns the documents (web pages) containing that keyword.
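The difference between the two index directions can be sketched in a few lines of Python. This is only an illustration under assumed data: the two sample documents and the names `forward_index` and `invert` are hypothetical and do not come from the patent.

```python
from collections import defaultdict

# Hypothetical forward index: document ID -> the words it contains.
forward_index = {
    1: ["tom", "live", "guangzhou"],
    2: ["he", "live", "shanghai"],
}

def invert(forward):
    """Turn a document -> words mapping into a word -> sorted document IDs mapping."""
    inverted = defaultdict(set)
    for doc_id, words in forward.items():
        for word in words:
            inverted[word].add(doc_id)
    return {word: sorted(ids) for word, ids in inverted.items()}

inverted_index = invert(forward_index)
print(inverted_index["live"])  # IDs of the documents that contain "live"
```

A keyword lookup in `inverted_index` then costs one dictionary access instead of a scan over every document.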
To make the objects, technical solutions, and advantages of the present invention clearer, the invention is described in further detail below with reference to the drawings and specific embodiments.
According to an embodiment of the present invention, a method for establishing an inverted index file of video resources is provided. Fig. 1 is a flowchart of the method according to the embodiment, which comprises the following steps (steps S102 to S104):
Step S102: perform word segmentation on video file information in a preset word segmentation mode to obtain keywords.
Video file information refers to textual information belonging to a video file, such as its title, descriptors, and synopsis. The keywords of the video file information are obtained by word segmentation. Generally speaking, word segmentation recombines a continuous character sequence into a word sequence according to a certain standard. Its purpose is to analyze each document and extract the words that are likely to become the objects of user queries.
Depending on the language used in the video file information, word segmentation can be broadly divided into Chinese word segmentation and foreign-language word segmentation (English is used as the representative below). English uses the space as a natural separator, so words can be distinguished by spaces. Redundant words (such as "a" and "the") are then removed, which completes the word segmentation, as illustrated below.
For example, suppose there are two files, 1 and 2. The content of file 1 is "Tom lives in Guangzhou, I live in Guangzhou too." After word segmentation, the keywords of file 1 are: [tom] [live] [guangzhou] [i] [live] [guangzhou].
The content of file 2 is "He once lived in Shanghai." After word segmentation, the keywords of file 2 are: [he] [live] [shanghai].
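The English example above can be reproduced with a short Python sketch. The source names only "a" and "the" as redundant words and does not say how "lives" and "lived" become "live", so the stop-word set `STOP_WORDS` and the lookup table `LEMMAS` below are assumptions chosen to match the example, not the patent's rules.

```python
import re

# Assumed stop words and lemma table; both are illustrative, not from the patent.
STOP_WORDS = {"a", "the", "in", "too", "once"}
LEMMAS = {"lives": "live", "lived": "live"}

def tokenize_english(text):
    """Lower-case the text, split it into words, drop stop words, normalize inflections."""
    words = re.findall(r"[a-z]+", text.lower())
    return [LEMMAS.get(word, word) for word in words if word not in STOP_WORDS]

file1 = "Tom lives in Guangzhou, I live in Guangzhou too."
file2 = "He once lived in Shanghai."

print(tokenize_english(file1))
print(tokenize_english(file2))
```

A production system would more likely use a full stop-word list and a stemmer or lemmatizer in place of the two hand-written tables.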
Chinese word segmentation is more complex than English word segmentation because there is no obvious delimiter between Chinese words. The present invention performs word segmentation by introducing a dictionary. In practice, the data sources of the dictionary include but are not limited to the following: a basic dictionary, a video copyright library, and user-generated content (UGC). The basic dictionary comprises various general dictionaries and lexicons, but the titles of video files do not strictly match dictionary entries, so a video copyright dictionary is also needed. The video copyright dictionary is built from the information of copyrighted video resources and can meet the needs of segmenting video file information. UGC is content generated, provided, or originally created by users, and it supplements the dictionary with new words that appear on the web. With these dictionaries working together and complementing one another, word segmentation yields reasonably good keywords.
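As a rough sketch of how the three data sources might be combined into a single segmentation dictionary, the Python below merges them into one set. The sample entries and the function name `build_dictionary` are hypothetical; the patent does not specify how the sources are stored or merged.

```python
# Hypothetical sample entries for each data source; real sources would be
# loaded from files or services.
basic_dictionary = ["电影", "北京", "大学"]
video_copyright_dictionary = ["甄嬛传", "北京爱情故事"]
user_generated_content = ["给力", "甄嬛传 "]  # trailing space and duplicate on purpose

def build_dictionary(*sources):
    """Merge any number of word sources into one set, trimming whitespace and dropping blanks."""
    dictionary = set()
    for source in sources:
        dictionary.update(word.strip() for word in source if word.strip())
    return dictionary

dictionary = build_dictionary(
    basic_dictionary, video_copyright_dictionary, user_generated_content
)
print(sorted(dictionary))
```

Using a set means an entry that appears in several sources is stored once, and membership tests during segmentation stay fast.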
In addition, because of the complexity of the Chinese language, segmentation algorithms are also needed to resolve the ambiguities that arise during word segmentation, for example bigram segmentation, maximum matching, and statistical methods. Bigram segmentation cuts a title into two-character units, so that a title of n characters is split into n-1 two-character words, and each pair of adjacent words shares one character. Maximum matching includes forward maximum matching, backward maximum matching, and so on, which are not described further here.
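Two of the segmentation modes named above can be sketched in Python as follows. This is a minimal illustration, not the patent's implementation: the sample title, the contents of `DICTIONARY`, and the `max_len` parameter are assumptions, and the forward maximum matching shown is the common greedy variant.

```python
# Hypothetical dictionary used only for this illustration.
DICTIONARY = {"北京", "北京大学", "大学", "大学生", "学生"}

def bigram_segment(title):
    """Slide a two-character window over the title: n characters give n-1 bigrams."""
    return [title[i:i + 2] for i in range(len(title) - 1)]

def forward_max_match(text, dictionary, max_len=4):
    """Greedy forward maximum matching.

    At each position, take the longest dictionary word (up to max_len characters);
    if none matches, emit a single character and move on.
    """
    words, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in dictionary:
                words.append(candidate)
                i += size
                break
    return words

title = "北京大学生"
print(bigram_segment(title))
print(forward_max_match(title, DICTIONARY))
```

The five-character title yields four bigrams, each sharing a character with its neighbor. Forward maximum matching instead commits to the longest prefix found in the dictionary, which shows the kind of ambiguity the text refers to: a different reading of the same title would split it as 北京 followed by 大学生.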
Preferably, after the video file information has been segmented by a method such as bigram segmentation, maximum matching, or a statistical method, the words produced by segmentation are checked against the dictionary to determine whether they are accurate.
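The verification step can be sketched as a simple dictionary filter. The candidate list, the dictionary contents, and the function name `verify_against_dictionary` are hypothetical, and treating "accurate" as plain dictionary membership is an assumption, since the source does not define the check.

```python
def verify_against_dictionary(candidates, dictionary):
    """Split candidate words into those found in the dictionary and those that are not."""
    accepted = [word for word in candidates if word in dictionary]
    rejected = [word for word in candidates if word not in dictionary]
    return accepted, rejected

# Hypothetical bigram output for the title "北京大学生" and a small dictionary.
candidates = ["北京", "京大", "大学", "学生"]
dictionary = {"北京", "大学", "学生"}

accepted, rejected = verify_against_dictionary(candidates, dictionary)
print(accepted)
print(rejected)
```

Here the bigram 京大, which straddles two real words, is rejected, while the other three candidates are kept as keywords.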
Step S104: establish the index relationship between each keyword and the video file information containing that keyword, thereby establishing the inverted index file of the video files.
After word segmentation yields the keywords, each keyword is stored in the inverted index file together with the identifier (ID) of the corresponding file. After all files have been analyzed, the keywords are sorted and merged in keyword order, and how often each keyword occurs in each individual file is counted. The index file may also contain other index information, for example: the file count, which shows how many files a keyword occurs in; the total frequency, which shows how many times a keyword occurs across all files; and the frequency, which shows how many times a keyword occurs in one file. The association between a keyword and its index information is thereby established.
Continuing the example above, the keywords and their corresponding index information are shown in Table 1. That is, each keyword together with its "frequency of occurrence" and "position of occurrence" information forms the final index structure.
Table 1
Keyword | File number [frequency of occurrence] | Positions of occurrence |
---|---|---|
guangzhou | 1[2] | 3,6 |
he | 2[1] | 1 |
i | 1[1] | 4 |
live | 1[2],2[1] | 2,5,2 |
shanghai | 2[1] | 3 |
tom | 1[1] | 1 |
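The index structure in Table 1 can be rebuilt from the two segmented files with a short Python sketch. The in-memory layout (keyword mapped to file ID mapped to a list of 1-based positions) and the helper names are assumptions for illustration; the patent does not prescribe a storage format.

```python
from collections import defaultdict

# Keywords of files 1 and 2 after word segmentation, as given in the example.
docs = {
    1: ["tom", "live", "guangzhou", "i", "live", "guangzhou"],
    2: ["he", "live", "shanghai"],
}

def build_inverted_index(docs):
    """Map each keyword to {file ID: [1-based positions]}, with keywords in sorted order."""
    index = defaultdict(dict)
    for doc_id, keywords in docs.items():
        for position, keyword in enumerate(keywords, start=1):
            index[keyword].setdefault(doc_id, []).append(position)
    return dict(sorted(index.items()))

def keyword_stats(index, keyword):
    """Return (file count, total frequency) for one keyword."""
    postings = index.get(keyword, {})
    return len(postings), sum(len(positions) for positions in postings.values())

index = build_inverted_index(docs)
for keyword, postings in index.items():
    files = ",".join(f"{doc_id}[{len(pos)}]" for doc_id, pos in postings.items())
    positions = ",".join(str(p) for pos in postings.values() for p in pos)
    print(f"{keyword} | {files} | {positions}")
```

The per-file frequency is simply the length of each position list, so storing positions is enough to derive the frequency, the file count, and the total frequency described in the text.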
After the inverted index file has been established according to the above embodiment, a user enters a query condition, the inverted index file is scanned to obtain a candidate file set, and video files are output according to certain requirements. This achieves fast and accurate retrieval of video resources and meets the storage and retrieval requirements of massive video resources.
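A query against such an index can be sketched as follows. The source says only that a candidate file set is obtained and output "according to certain requirements", so requiring every query keyword to match and ranking by total occurrences are assumptions made for this illustration, as is the function name `search`.

```python
# Inverted index from the running example: keyword -> {file ID: [positions]}.
index = {
    "guangzhou": {1: [3, 6]},
    "he": {2: [1]},
    "i": {1: [4]},
    "live": {1: [2, 5], 2: [2]},
    "shanghai": {2: [3]},
    "tom": {1: [1]},
}

def search(index, query_keywords):
    """Return IDs of files containing every query keyword, most occurrences first."""
    posting_sets = [set(index.get(keyword, {})) for keyword in query_keywords]
    if not posting_sets:
        return []
    candidates = set.intersection(*posting_sets)

    def score(doc_id):
        return sum(len(index[keyword][doc_id]) for keyword in query_keywords)

    return sorted(candidates, key=lambda doc_id: (-score(doc_id), doc_id))

print(search(index, ["live"]))
print(search(index, ["live", "shanghai"]))
```

Because only the posting lists of the query keywords are touched, the cost of a query depends on how many files contain those keywords rather than on the total number of files.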
In practice, searches for video resources tend to come in bursts. For example, when a popular video (such as a film, TV series, or variety show) is released or a hot event (such as a news event) occurs, a large number of search requests arrive within a short time. In this case, statistics are collected on the retrieval results obtained from the inverted index file, and keywords whose search frequency exceeds a set threshold are moved to the beginning of the inverted index file to improve retrieval efficiency.
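The reordering of hot keywords can be sketched with an ordered mapping standing in for the inverted index file. The search counts, the threshold value, and the function name `promote_hot_keywords` are hypothetical; how the real file is laid out and rewritten is not described in the source.

```python
from collections import Counter

def promote_hot_keywords(index, search_counts, threshold):
    """Return a new ordered index with keywords searched more than `threshold` times first.

    Hot keywords are ordered by descending search count; the remaining keywords
    keep their original relative order.
    """
    hot = [k for k in index if search_counts.get(k, 0) > threshold]
    hot.sort(key=lambda k: -search_counts[k])
    cold = [k for k in index if search_counts.get(k, 0) <= threshold]
    return {k: index[k] for k in hot + cold}

# Illustrative index (keyword -> file IDs) and assumed search statistics.
index = {"guangzhou": [1], "he": [2], "live": [1, 2], "shanghai": [2]}
search_counts = Counter({"shanghai": 120, "live": 30, "he": 2})

reordered = promote_hot_keywords(index, search_counts, threshold=20)
print(list(reordered))
```

Placing frequently searched keywords at the front means a sequential scan of the file reaches them sooner during a burst of requests.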
According to an embodiment of the present invention, a system for establishing an inverted index file is also provided. As shown in Fig. 2, the system comprises at least a keyword acquisition module 10 and an inverted index building module 20. The structure and connections of the modules are described in detail below.
The inverted index building module 20 is coupled to the keyword acquisition module 10 and establishes the index relationship between each keyword and the video file information containing that keyword, thereby establishing the inverted index file.
Referring to Fig. 3, in one embodiment of the present invention the system further comprises a dictionary maintenance module 30 for establishing and maintaining a dictionary, whose data sources include but are not limited to: a basic dictionary, a video copyright library, and user-generated content.
On this basis, the keyword acquisition module 10 performs word segmentation on the video file information according to the dictionary and in the preset word segmentation mode.
Still referring to Fig. 3, the inverted index building module 20 further comprises a recording module 210 and an association building module 220, wherein:
The recording module 210 records and stores the index information of the keyword, the index information comprising identification information of the video files containing the keyword, position information of the keyword's occurrences, and frequency information of the keyword's occurrences. The association building module 220 is coupled to the recording module 210 and establishes the association between the keyword and its index information.
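One way to picture how modules 10, 20, 210, and 220 fit together is the Python sketch below. The class names, method names, and record layout are all hypothetical; the patent describes the modules' responsibilities but not their interfaces. `str.split` stands in for a real segmenter and assumes the input text is already space-separated.

```python
class KeywordAcquisitionModule:
    """Module 10: turns video file information into keywords via a segmenter callable."""
    def __init__(self, segmenter):
        self.segmenter = segmenter

    def acquire(self, video_info):
        return self.segmenter(video_info)

class RecordingModule:
    """Module 210: records file ID, positions, and frequency for each keyword."""
    def record(self, doc_id, keywords):
        positions = {}
        for position, keyword in enumerate(keywords, start=1):
            positions.setdefault(keyword, []).append(position)
        return {
            keyword: {"doc_id": doc_id, "positions": pos, "frequency": len(pos)}
            for keyword, pos in positions.items()
        }

class AssociationBuildingModule:
    """Module 220: associates each keyword with its recorded index information."""
    def __init__(self):
        self.index = {}

    def associate(self, records):
        for keyword, info in records.items():
            self.index.setdefault(keyword, []).append(info)

class InvertedIndexBuildingModule:
    """Module 20: coupled to module 10; delegates to modules 210 and 220."""
    def __init__(self, acquisition):
        self.acquisition = acquisition
        self.recording = RecordingModule()
        self.association = AssociationBuildingModule()

    def add_video(self, doc_id, video_info):
        keywords = self.acquisition.acquire(video_info)
        self.association.associate(self.recording.record(doc_id, keywords))
        return self.association.index

builder = InvertedIndexBuildingModule(KeywordAcquisitionModule(str.split))
builder.add_video(1, "tom live guangzhou live")
builder.add_video(2, "he live shanghai")
print(builder.association.index["live"])
```

Passing the segmenter in as a callable keeps the index-building code independent of whether bigram segmentation, maximum matching, or another mode is used.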
In one embodiment of the present invention, the system for establishing an inverted index file further comprises: a retrieval result statistics module (not shown) for collecting statistics on the retrieval results obtained from the inverted index file; and a processing module (not shown) for moving keywords whose search frequency exceeds a set threshold to the beginning of the inverted index file, thereby improving retrieval efficiency.
The operating steps of the method of the present invention correspond to the structural features of the system and may be cross-referenced, so they are not repeated one by one.
In summary, according to the technical solution of the present invention, keywords are obtained by performing word segmentation on video file information, and an index relationship is established between each keyword and the video file information containing it, thereby establishing the inverted index file. When a user searches for video files by keyword, the corresponding information can be provided quickly and accurately.
The foregoing describes only embodiments of the present invention and is not intended to limit it. For those skilled in the art, the present invention may have various modifications and variations. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within the scope of the claims of the present invention.
Claims (10)
1. A method for establishing an inverted index file of video resources, characterized by comprising:
performing word segmentation on video file information in a preset word segmentation mode to obtain keywords;
establishing an index relationship between each keyword and the video file information containing that keyword, thereby establishing the inverted index file of the video files.
2. The method according to claim 1, characterized by further comprising:
providing a dictionary, the data sources of which comprise: a basic dictionary, a video copyright dictionary, and user-generated content;
wherein the step of performing word segmentation on video file information in a preset word segmentation mode comprises: performing word segmentation on the video file information according to the dictionary and in the preset word segmentation mode.
3. The method according to claim 1 or 2, characterized in that the word segmentation modes comprise: bigram segmentation, maximum matching, and statistical methods.
4. The method according to claim 1, characterized in that the step of establishing the index relationship between each keyword and the video file information containing that keyword comprises:
recording and storing index information of the keyword, the index information comprising: identification information of the video files containing the keyword, position information of the keyword's occurrences, and frequency information of the keyword's occurrences;
establishing an association between the keyword and its index information.
5. The method according to claim 1, characterized by further comprising:
collecting statistics on the retrieval results obtained from the inverted index file, and moving keywords whose search frequency exceeds a set threshold to the beginning of the inverted index file.
6. A system for establishing an inverted index file, characterized by comprising:
a keyword acquisition module for performing word segmentation on video file information in a preset word segmentation mode to obtain keywords;
an inverted index building module for establishing an index relationship between each keyword and the video file information containing that keyword, thereby establishing the inverted index file.
7. The system according to claim 6, characterized by further comprising:
a dictionary maintenance module for establishing and maintaining a dictionary, the data sources of which comprise: a basic dictionary, a video copyright library, and user-generated content;
wherein the keyword acquisition module performs word segmentation on the video file information according to the dictionary and in the preset word segmentation mode.
8. The system according to claim 6 or 7, characterized in that the word segmentation modes comprise: bigram segmentation, maximum matching, and statistical methods.
9. The system according to claim 6, characterized in that the inverted index building module comprises:
a recording module for recording and storing index information of the keyword, the index information comprising: identification information of the video files containing the keyword, position information of the keyword's occurrences, and frequency information of the keyword's occurrences;
an association building module for establishing an association between the keyword and its index information.
10. The system according to claim 6, characterized by further comprising:
a retrieval result statistics module for collecting statistics on the retrieval results obtained from the inverted index file;
a processing module for moving keywords whose search frequency exceeds a set threshold to the beginning of the inverted index file.
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310739955.4A CN103678694A (en) | 2013-12-26 | 2013-12-26 | Method and system for establishing reverse index file of video resources |
PCT/CN2014/093176 WO2015096609A1 (en) | 2013-12-26 | 2014-12-05 | Method and system for creating inverted index file of video resource |
US15/101,698 US20160306811A1 (en) | 2013-12-26 | 2014-12-05 | Method and system for creating inverted index file of video resource |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310739955.4A CN103678694A (en) | 2013-12-26 | 2013-12-26 | Method and system for establishing reverse index file of video resources |
Publications (1)
Publication Number | Publication Date |
---|---|
CN103678694A true CN103678694A (en) | 2014-03-26 |
Family
ID=50316238
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310739955.4A Pending CN103678694A (en) | 2013-12-26 | 2013-12-26 | Method and system for establishing reverse index file of video resources |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103678694A (en) |
Cited By (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2015096609A1 (en) * | 2013-12-26 | 2015-07-02 | 乐视网信息技术(北京)股份有限公司 | Method and system for creating inverted index file of video resource |
CN104933120A (en) * | 2015-06-04 | 2015-09-23 | 无锡天脉聚源传媒科技有限公司 | Keyword setting method and device for video album |
CN104978402A (en) * | 2015-06-04 | 2015-10-14 | 无锡天脉聚源传媒科技有限公司 | Keyword setting method and apparatus of video album |
CN104978401A (en) * | 2015-06-04 | 2015-10-14 | 无锡天脉聚源传媒科技有限公司 | Keyword setting method and apparatus of video album |
CN105005576A (en) * | 2015-03-27 | 2015-10-28 | 合一信息技术(北京)有限公司 | System and method for searching similar users of video website |
CN106156155A (en) * | 2015-04-15 | 2016-11-23 | 厦门简帛信息科技有限公司 | A kind of method and system that e-book resource is provided |
CN106874443A (en) * | 2017-02-09 | 2017-06-20 | 北京百家互联科技有限公司 | Based on information query method and device that video text message is extracted |
CN107704628A (en) * | 2017-10-31 | 2018-02-16 | 福建中金在线信息科技有限公司 | Data retrieval method, index relative method for building up and server |
WO2018113673A1 (en) * | 2016-12-23 | 2018-06-28 | 北京奇虎科技有限公司 | Method and apparatus for pushing search result of variety show query |
CN109299466A (en) * | 2018-10-22 | 2019-02-01 | 中国船舶工业综合技术经济研究院 | A kind of document retrieval method and system towards science and techniques of defence field |
CN109783444A (en) * | 2018-12-26 | 2019-05-21 | 亚信科技(中国)有限公司 | Multichannel file index method, device, computer equipment and storage medium |
CN110825913A (en) * | 2019-09-03 | 2020-02-21 | 上海擎测机电工程技术有限公司 | Professional word extraction and part-of-speech tagging method |
CN112541115A (en) * | 2020-12-02 | 2021-03-23 | 创盛视联数码科技(北京)有限公司 | Method for recommending teaching video, electronic equipment and computer readable medium |
CN114707007A (en) * | 2022-06-07 | 2022-07-05 | 苏州大学 | Image text retrieval method and device and computer storage medium |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080059420A1 (en) * | 2006-08-22 | 2008-03-06 | International Business Machines Corporation | System and Method for Providing a Trustworthy Inverted Index to Enable Searching of Records |
CN102201001A (en) * | 2011-04-29 | 2011-09-28 | 西安交通大学 | Fast retrieval method based on inverted technology |
CN103428525A (en) * | 2013-07-22 | 2013-12-04 | 华中科技大学 | Online inquiry and play control method and system for network videos and television programs |
-
2013
- 2013-12-26 CN CN201310739955.4A patent/CN103678694A/en active Pending
Non-Patent Citations (2)
Title |
---|
匡振国 等: "一种基于Lucene的影片搜索引擎的研究和应用", 《计算机工程与应用》 * |
郑榕增 等: "基于Lucene 的中文倒排索引技术的研究", 《计算机技术与发展》 * |
Cited By (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2015096609A1 (en) * | 2013-12-26 | 2015-07-02 | 乐视网信息技术(北京)股份有限公司 | Method and system for creating inverted index file of video resource |
CN105005576B (en) * | 2015-03-27 | 2018-03-09 | 合一信息技术(北京)有限公司 | A kind of video website similar users search system and method |
CN105005576A (en) * | 2015-03-27 | 2015-10-28 | 合一信息技术(北京)有限公司 | System and method for searching similar users of video website |
CN106156155A (en) * | 2015-04-15 | 2016-11-23 | 厦门简帛信息科技有限公司 | A kind of method and system that e-book resource is provided |
CN104978401A (en) * | 2015-06-04 | 2015-10-14 | 无锡天脉聚源传媒科技有限公司 | Keyword setting method and apparatus of video album |
CN104978401B (en) * | 2015-06-04 | 2019-07-02 | 无锡天脉聚源传媒科技有限公司 | A kind of the keyword setting method and device of video album |
CN104978402A (en) * | 2015-06-04 | 2015-10-14 | 无锡天脉聚源传媒科技有限公司 | Keyword setting method and apparatus of video album |
CN104933120A (en) * | 2015-06-04 | 2015-09-23 | 无锡天脉聚源传媒科技有限公司 | Keyword setting method and device for video album |
WO2018113673A1 (en) * | 2016-12-23 | 2018-06-28 | 北京奇虎科技有限公司 | Method and apparatus for pushing search result of variety show query |
CN106874443A (en) * | 2017-02-09 | 2017-06-20 | 北京百家互联科技有限公司 | Based on information query method and device that video text message is extracted |
CN107704628A (en) * | 2017-10-31 | 2018-02-16 | 福建中金在线信息科技有限公司 | Data retrieval method, index relative method for building up and server |
CN109299466A (en) * | 2018-10-22 | 2019-02-01 | 中国船舶工业综合技术经济研究院 | A kind of document retrieval method and system towards science and techniques of defence field |
CN109299466B (en) * | 2018-10-22 | 2023-07-07 | 中国船舶工业综合技术经济研究院 | Document retrieval method and system oriented to national defense science and technology field |
CN109783444A (en) * | 2018-12-26 | 2019-05-21 | 亚信科技(中国)有限公司 | Multichannel file index method, device, computer equipment and storage medium |
CN110825913A (en) * | 2019-09-03 | 2020-02-21 | 上海擎测机电工程技术有限公司 | Professional word extraction and part-of-speech tagging method |
CN112541115A (en) * | 2020-12-02 | 2021-03-23 | 创盛视联数码科技(北京)有限公司 | Method for recommending teaching video, electronic equipment and computer readable medium |
CN114707007A (en) * | 2022-06-07 | 2022-07-05 | 苏州大学 | Image text retrieval method and device and computer storage medium |
CN114707007B (en) * | 2022-06-07 | 2022-08-30 | 苏州大学 | Image text retrieval method and device and computer storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103678694A (en) | Method and system for establishing reverse index file of video resources | |
US11580176B2 (en) | Search infrastructure | |
Ali et al. | Comparison between SQL and NoSQL databases and their relationship with big data analytics | |
CN110489445B (en) | Rapid mass data query method based on polymorphic composition | |
US8244767B2 (en) | Composite locality sensitive hash based processing of documents | |
US20040205044A1 (en) | Method for storing inverted index, method for on-line updating the same and inverted index mechanism | |
TW201530328A (en) | Method and device for constructing NoSQL database index for semi-structured data | |
CN109857898A (en) | A kind of method and system of mass digital audio-frequency fingerprint storage and retrieval | |
CN111563095B (en) | HBase-based data retrieval device | |
WO2015096609A1 (en) | Method and system for creating inverted index file of video resource | |
CN105631003A (en) | Intelligent index establishing, inquiring and maintaining method supporting mass data classification and counting | |
US9262511B2 (en) | System and method for indexing streams containing unstructured text data | |
US20080010238A1 (en) | Index having short-term portion and long-term portion | |
CN111367991B (en) | MongoDB data real-time synchronization method and system based on message queue | |
CN105117433A (en) | Method and system for statistically querying HBase based on analysis performed by Hive on HFile | |
JP2019512124A (en) | Method and apparatus for archiving database generating index information, search method and apparatus for archived database including index information | |
CN103714158A (en) | Vertical search method and system for video websites | |
CN114139040A (en) | Data storage and query method, device, equipment and readable storage medium | |
CN111782663A (en) | Aggregation index structure and aggregation index method for improving aggregation query efficiency | |
Zhou et al. | Adaptive subspace symbolization for content-based video detection | |
CN103699659A (en) | Method and system for managing word library of video resources | |
CN103678697A (en) | Reverse index storage method and system thereof | |
CN111563123A (en) | Live warehouse metadata real-time synchronization method | |
CN103714147A (en) | Video resource data source processing method and system thereof | |
CN103699658A (en) | Method and system for sorting information of video resources |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C41 | Transfer of patent application or patent right or utility model | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20151228 Address after: Room six, building 19, building 68, No. 100089 South Road, Haidian District, Beijing Applicant after: LETV CLOUD COMPUTING CO., LTD. Address before: Room six, building 19, building 68, No. 100089 South Road, Haidian District, Beijing Applicant before: LeTV Information Technology (Beijing) Co., Ltd. |
|
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20140326 |