CN111008288A - Method for realizing media asset retrieval and display based on media asset content annotation - Google Patents
- Publication number
- CN111008288A (application CN201911356802.5A)
- Authority
- CN
- China
- Prior art keywords
- assets
- media asset
- media
- retrieval
- database
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/48—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/41—Indexing; Data structures therefor; Storage structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/43—Querying
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Library & Information Science (AREA)
- Software Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a method for media asset retrieval and display based on media asset content annotation, comprising the following steps: warehousing the media assets; synchronizing the media assets to an annotation platform, where the different types of media assets are tagged; after tagging is finished, returning the tag data to the media asset library through a message queue (MQ), the media asset library then synchronizing the tag data to the OTS database; incrementally synchronizing the tag data to the ES database to support user retrieval; and retrieving the media assets through the tags in the ES database. The method is compatible with the storage of many types of media asset tags and makes media asset retrieval and segment positioning more convenient.
Description
Technical Field
The invention relates to a method for media asset retrieval and display based on media asset content annotation.
Background
Online media has become an important way for people to obtain information. Traditional media asset platforms are limited in function, and their shortcomings in tagging, management mechanisms, retrieval mechanisms and the like reduce the accuracy with which information is organized, so news production efficiency is low. Media asset tagging technology has therefore received growing attention and development.
Traditional schemes cannot locate the desired media asset data well and are inconvenient to use; retrieval of untagged media assets is slow, which greatly increases labor cost.
Disclosure of Invention
To overcome the shortcomings of the prior art, the invention provides a method for media asset retrieval and display based on media asset content annotation, which is compatible with the storage of many types of media asset tags and makes media asset retrieval and segment positioning more effective and convenient.
A method for media asset retrieval and display based on media asset content annotation comprises the following steps:
warehousing the media assets;
synchronizing the media assets to an annotation platform and tagging the different types of media assets;
after tagging is finished, returning the tag data to the media asset library through the MQ, the media asset library then synchronizing the tag data to the OTS database;
incrementally synchronizing the tag data to the ES database to support user retrieval;
and retrieving the media assets through the tags in the ES database.
Further, when a media asset enters the OTS database, a media asset ID is generated through a UUID, and tags are associated with the asset through the media asset ID.
Further, the tags in the OTS database include manually annotated tags and tags identified by an artificial intelligence algorithm.
Further, the tags identified by the artificial intelligence algorithm include faces, speech and text.
Further, the OTS database has an event model for storing tag information and is compatible with the warehousing of multiple tag forms.
Further, picture media assets are tagged by viewing the picture content.
Further, for audio media assets, time points are located by listening to the audio content and then tagged;
for video media assets, time points are located by viewing the video content and then tagged.
Further, a global tag is set for picture media assets; audio and video media assets can be given time-point tags or global tags.
Further, when different types of tags are synchronized to the ES database, the stored field names are unified (isomorphic processing).
Further, multiple records of the same media asset are merged into an array list and stored in an index column created in the ES database.
The invention is compatible with the storage of many types of media asset tags and makes media asset retrieval and segment positioning more effective and convenient.
Through manual tagging and algorithm-generated tags, the method lets users accurately retrieve and manage all kinds of media assets. It supports tagging of multiple media asset types, including picture, audio, video, and mixed text, picture, audio and video assets. By using the event model of the NoSQL store together with ES retrieval, new tag types can be added at any time.
Drawings
FIG. 1 is a flow chart of the method for media asset retrieval and display based on media asset content annotation, showing the flow from tagging a media asset to retrieving it by tag;
FIG. 2 is a schematic diagram of the OTS database storage structure used by the method of FIG. 1;
FIG. 3 is a schematic diagram of the ES database storage structure used by the method of FIG. 1.
Detailed Description
The invention is described in detail below with reference to the figures and the embodiments.
As shown in FIG. 1 to FIG. 3, a method for media asset retrieval and display based on media asset content annotation includes the following steps:
warehousing the media assets;
synchronizing the media assets to an annotation platform and tagging the different types of media assets;
after tagging is finished, returning the tag data to the media asset library through the MQ, the media asset library then synchronizing the tag data to the OTS database;
incrementally synchronizing the tag data to the ES database to support user retrieval;
the user may then retrieve the media assets through the tags in the ES database.
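The five steps above can be sketched end to end with in-memory stand-ins. This is a minimal illustration under stated assumptions, not the implementation described by the invention: `queue.Queue` stands in for the MQ, plain Python containers stand in for the media asset library, the OTS database and the ES database, and every function name is hypothetical.

```python
import queue
import uuid

# In-memory stand-ins (assumptions): a real deployment would use a message
# queue, a wide-column store (OTS) and a search engine (ES).
mq = queue.Queue()
asset_library = {}   # asset_id -> asset metadata
ots_rows = []        # tag rows, one per tagging event
es_index = {}        # asset_id -> searchable document


def warehouse_asset(title, media_type):
    """Step 1: store the asset and return its generated ID."""
    asset_id = str(uuid.uuid4())
    asset_library[asset_id] = {"title": title, "type": media_type}
    return asset_id


def annotate(asset_id, tags):
    """Step 2: the annotation platform publishes finished tags to the MQ."""
    mq.put({"asset_id": asset_id, "tags": list(tags)})


def drain_mq_into_ots():
    """Step 3: the asset library consumes tag messages and writes OTS rows."""
    while not mq.empty():
        msg = mq.get()
        for tag in msg["tags"]:
            ots_rows.append({"asset_id": msg["asset_id"], "tag": tag})


def sync_ots_to_es(start=0):
    """Step 4: copy rows added since `start` into the search index.

    Returns the next start position so a later call only copies new rows.
    """
    for row in ots_rows[start:]:
        asset = asset_library[row["asset_id"]]
        doc = es_index.setdefault(
            row["asset_id"], {"title": asset["title"], "tag": []}
        )
        doc["tag"].append(row["tag"])
    return len(ots_rows)


def search_by_tag(tag):
    """Step 5: return the IDs of assets that carry the tag."""
    return [aid for aid, doc in es_index.items() if tag in doc["tag"]]


video_id = warehouse_asset("Evening news", "video")
annotate(video_id, ["anchor", "weather"])
drain_mq_into_ots()
next_start = sync_ots_to_es()
hits = search_by_tag("weather")
```

Passing `next_start` to the following `sync_ots_to_es` call keeps the copy incremental in this sketch.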
Specifically, picture media assets are tagged by viewing the picture content. For audio media assets, time points are located by listening to the audio content and then tagged; for video media assets, time points are located by viewing the video content and then tagged. Further, a global tag is set for picture media assets; audio and video media assets can be given time-point tags or global tags.
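The rule that pictures take only global tags while audio and video take either kind can be expressed as a small validated record. This is a sketch only; the class name `Tag`, its field names and the use of seconds for time points are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional

# Asset types that may carry time-point tags, per the description above.
TIMED_TYPES = {"audio", "video"}


@dataclass(frozen=True)
class Tag:
    asset_type: str                        # "picture", "audio" or "video"
    value: str                             # the tag text
    time_point_s: Optional[float] = None   # None means a global tag

    def __post_init__(self):
        if self.asset_type not in TIMED_TYPES | {"picture"}:
            raise ValueError(f"unknown asset type: {self.asset_type}")
        if self.time_point_s is not None:
            if self.asset_type not in TIMED_TYPES:
                raise ValueError("picture assets only take global tags")
            if self.time_point_s < 0:
                raise ValueError("time point must be non-negative")

    @property
    def is_global(self):
        """True when the tag applies to the whole asset."""
        return self.time_point_s is None


goal = Tag("video", "goal", time_point_s=12.5)   # time-point tag
sunset = Tag("picture", "sunset")                # global tag
```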
AliOTS storage structure
The OTS has an event model for storing tag information and is compatible with the warehousing of multiple tag forms: supporting a new form only requires creating a new event. When a media asset enters, its media asset ID is generated through a UUID, and tag attributes are associated with the asset through that ID.
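A minimal sketch of generating the media asset ID from a UUID and attaching tag events to it follows. The event fields (`source`, `label`, `time_point_s`) and both function names are hypothetical, and a dictionary stands in for the OTS table.

```python
import uuid
from collections import defaultdict

# Stand-in for the OTS tag table: asset_id -> list of tag events.
tag_events = defaultdict(list)


def register_asset():
    """Generate a media asset ID from a random UUID (32 hex characters)."""
    return uuid.uuid4().hex


def add_tag_event(asset_id, source, label, time_point_s=None):
    """Record one tag event; the asset ID is the only link to the asset."""
    event = {"source": source, "label": label, "time_point_s": time_point_s}
    tag_events[asset_id].append(event)
    return event


asset_id = register_asset()
add_tag_event(asset_id, "manual", "interview")
add_tag_event(asset_id, "face_recognition", "person_A", time_point_s=3.0)
```

Because every event carries only the asset ID, a new kind of event can be added without touching the existing ones.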
The tags in the OTS database include manually annotated tags and tags identified by an artificial intelligence algorithm. Specifically, the tag table holds not only manually annotated tags but also extended tags such as faces, speech and text identified by an artificial intelligence algorithm, which greatly improves the efficiency of retrieving and locating media assets.
The Aliyun OTS database service is a column-oriented storage service that supports adding columns dynamically. If a new tagging event is added to a media asset, the data only needs to be inserted directly into the new column, and OTS generates the column information automatically.
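The dynamic-column behaviour described above can be imitated with a small in-memory table. This is not the OTS SDK; `WideColumnTable` and the column names are hypothetical and only illustrate that writing to an unseen column needs no schema change.

```python
class WideColumnTable:
    """Toy wide-column table: each row holds its own set of columns."""

    def __init__(self):
        self.rows = {}        # primary key -> {column name: value}
        self.columns = set()  # every column name seen so far

    def put(self, primary_key, **attribute_columns):
        """Insert or update columns; unseen column names are created."""
        row = self.rows.setdefault(primary_key, {})
        row.update(attribute_columns)
        self.columns.update(attribute_columns)

    def get(self, primary_key, column, default=None):
        """Read one column; rows lacking it return `default`."""
        return self.rows.get(primary_key, {}).get(column, default)


table = WideColumnTable()
table.put("asset-1", manual_tag="interview")
# A new tagging event arrives: write to a column that did not exist before.
table.put("asset-1", face_tag="person_A")
table.put("asset-2", manual_tag="concert")
```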
ES storage structure
After the media asset tag data is stored in the OTS, it is incrementally synchronized into ES (Elasticsearch) in real time to support user retrieval.
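The text does not say how the incremental synchronization is driven. One common approach, assumed here purely for illustration, is a watermark: each source row carries a monotonically increasing sequence number, and each run copies only rows newer than the last watermark. The row fields `seq`, `pk` and `doc` are hypothetical.

```python
def incremental_sync(source_rows, index, watermark):
    """Copy rows whose `seq` is greater than `watermark` into `index`.

    Returns the new watermark so the next run only sees later rows.
    """
    newest = watermark
    for row in source_rows:
        if row["seq"] > watermark:
            index[row["pk"]] = row["doc"]
            newest = max(newest, row["seq"])
    return newest


rows = [
    {"seq": 1, "pk": "asset-1", "doc": {"tag": ["interview"]}},
    {"seq": 2, "pk": "asset-2", "doc": {"tag": ["concert"]}},
]
index = {}
wm = incremental_sync(rows, index, watermark=0)

# A later tagging event updates asset-1; only this row is copied next time.
rows.append({"seq": 3, "pk": "asset-1", "doc": {"tag": ["interview", "studio"]}})
wm2 = incremental_sync(rows, index, watermark=wm)
```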
Tag data differs between media asset types. To let users search across all asset types together, the field names stored when each type of tag is synchronized to the ES database must be unified (isomorphic processing). For example, the index name corresponding to the title of a video asset is the same as the index name corresponding to the title of any other type of asset.
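Unifying field names can be sketched as a per-type rename table applied before indexing. All source field names below (`video_title`, `pic_tags` and so on) and the added `media_type` field are invented for this example; only the idea of a shared `title` index name comes from the text.

```python
# Hypothetical per-type source field names mapped to shared index names.
FIELD_MAP = {
    "video": {"video_title": "title", "video_tags": "tag"},
    "audio": {"audio_title": "title", "audio_tags": "tag"},
    "picture": {"pic_title": "title", "pic_tags": "tag"},
}


def to_unified(media_type, record):
    """Rename type-specific fields so every asset type shares one schema."""
    mapping = FIELD_MAP[media_type]
    doc = {mapping.get(key, key): value for key, value in record.items()}
    doc["media_type"] = media_type   # keep the type so it can be filtered on
    return doc


video_doc = to_unified("video", {"video_title": "Evening news", "video_tags": ["anchor"]})
picture_doc = to_unified("picture", {"pic_title": "Sunset", "pic_tags": ["sky"]})
```

After the rename, a single query on `title` or `tag` covers video, audio and picture documents alike.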
The information in the media asset tag table needs some data fusion: one video may have multiple records, which are merged into an array list and stored in an index column created in ES, with the index name tag. The aim is to apply unified word segmentation to the merged result and make it easier for users to retrieve tag information.
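The fusion step can be sketched as grouping tag records by asset and collecting their labels into one list under the `tag` field. The record fields `asset_id` and `label` are hypothetical, and dropping duplicate labels is an assumption the text does not state.

```python
def merge_tag_records(records):
    """Merge many tag records per asset into one document with a `tag` list."""
    merged = {}
    for rec in records:
        doc = merged.setdefault(
            rec["asset_id"], {"asset_id": rec["asset_id"], "tag": []}
        )
        if rec["label"] not in doc["tag"]:   # assumption: skip repeated labels
            doc["tag"].append(rec["label"])
    return list(merged.values())


records = [
    {"asset_id": "v1", "label": "anchor"},
    {"asset_id": "v1", "label": "weather"},
    {"asset_id": "v2", "label": "concert"},
    {"asset_id": "v1", "label": "anchor"},
]
docs = merge_tag_records(records)
```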
Once the data has been converted and the data in the OTS has been synchronized to ES, ES can serve users' tag retrieval requests.
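Assuming ES refers to Elasticsearch, a tag search could be expressed as a query DSL body like the one built below. The sketch only constructs and serializes the request body and does not contact a cluster; the `media_type` field and the function name are hypothetical, while `tag` is the index name given in the text.

```python
import json


def build_tag_query(text, media_type=None, size=10):
    """Build a search body that full-text matches `text` against `tag`."""
    bool_query = {"must": [{"match": {"tag": text}}]}
    if media_type is not None:
        # Optional exact filter on a hypothetical `media_type` field.
        bool_query["filter"] = [{"term": {"media_type": media_type}}]
    return {"size": size, "query": {"bool": bool_query}}


body = build_tag_query("weather forecast", media_type="video")
payload = json.dumps(body)   # what would be sent as the search request body
```

A `match` query is used because the text says the merged tags are word-segmented; an exact `term` query would bypass that analysis.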
MySQL may be used for data volumes below the ten-million level. For basic storage of the media asset data, the Aliyun OTS database can also be replaced by another NoSQL database such as HBase.
The foregoing illustrates and describes the principles, general features, and advantages of the present invention. It should be understood by those skilled in the art that the above embodiments do not limit the present invention in any way, and all technical solutions obtained by using equivalent alternatives or equivalent variations fall within the scope of the present invention.
Claims (10)
1. A method for media asset retrieval and display based on media asset content annotation, characterized by comprising the following steps:
warehousing the media assets;
synchronizing the media assets to an annotation platform and tagging the different types of media assets;
after tagging is finished, returning the tag data to the media asset library through the MQ, the media asset library then synchronizing the tag data to the OTS database;
incrementally synchronizing the tag data to the ES database to support user retrieval;
and retrieving the media assets through the tags in the ES database.
2. The method for media asset retrieval and display based on media asset content annotation according to claim 1, wherein a media asset ID is generated through a UUID when a media asset enters the OTS database, and tags are associated through the media asset ID.
3. The method for media asset retrieval and display based on media asset content annotation according to claim 2, wherein the tags in the OTS database include manually annotated tags and tags identified by an artificial intelligence algorithm.
4. The method for media asset retrieval and display based on media asset content annotation according to claim 3, wherein the tags identified by the artificial intelligence algorithm include faces, speech and text.
5. The method for media asset retrieval and display based on media asset content annotation according to claim 1, wherein the OTS database has an event model for storing tag information and is compatible with the warehousing of multiple tag forms.
6. The method for media asset retrieval and display based on media asset content annotation according to claim 1, wherein picture media assets are tagged by viewing the picture content.
7. The method for media asset retrieval and display based on media asset content annotation according to claim 1, wherein for audio media assets, time points are located by listening to the audio content and then tagged;
and for video media assets, time points are located by viewing the video content and then tagged.
8. The method for media asset retrieval and display based on media asset content annotation according to claim 1, wherein a global tag is set for picture media assets, and audio and video media assets can be given time-point tags or global tags.
9. The method for media asset retrieval and display based on media asset content annotation according to claim 1, wherein the stored field names are unified (isomorphic processing) when tags of different types are synchronized to the ES database.
10. The method for media asset retrieval and display based on media asset content annotation according to claim 9, wherein multiple records of the same media asset are merged into an array list and stored in an index column created in the ES database.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911356802.5A CN111008288A (en) | 2019-12-25 | 2019-12-25 | Method for realizing media asset retrieval and display based on media asset content annotation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911356802.5A CN111008288A (en) | 2019-12-25 | 2019-12-25 | Method for realizing media asset retrieval and display based on media asset content annotation |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111008288A (en) | 2020-04-14 |
Family
ID=70118497
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911356802.5A Pending CN111008288A (en) | 2019-12-25 | 2019-12-25 | Method for realizing media asset retrieval and display based on media asset content annotation |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111008288A (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106446051A (en) * | 2016-08-31 | 2017-02-22 | 北京新奥特云视科技有限公司 | Deep search method of Eagle media assets |
CN110427498A (en) * | 2019-07-24 | 2019-11-08 | 新华智云科技有限公司 | Storage method, device, storage equipment and the storage medium of media information |
-
2019
- 2019-12-25 CN CN201911356802.5A patent/CN111008288A/en active Pending
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||