CN111008288A - Method for realizing media asset retrieval and display based on media asset content annotation - Google Patents

Method for realizing media asset retrieval and display based on media asset content annotation Download PDF

Info

Publication number
CN111008288A
CN111008288A CN201911356802.5A CN201911356802A CN111008288A CN 111008288 A CN111008288 A CN 111008288A CN 201911356802 A CN201911356802 A CN 201911356802A CN 111008288 A CN111008288 A CN 111008288A
Authority
CN
China
Prior art keywords
assets
media asset
media
retrieval
database
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911356802.5A
Other languages
Chinese (zh)
Inventor
孟北北
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xinhua Zhiyun Technology Co ltd
Original Assignee
Xinhua Zhiyun Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xinhua Zhiyun Technology Co ltd filed Critical Xinhua Zhiyun Technology Co ltd
Priority to CN201911356802.5A priority Critical patent/CN111008288A/en
Publication of CN111008288A publication Critical patent/CN111008288A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/48Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/41Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/43Querying

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a method for realizing media asset retrieval and display based on media asset content marking, which comprises the following steps: warehousing the media assets; media assets are synchronized to a marking platform, and marking is carried out on the media assets of different types; after marking is finished, the label data flow back to the media asset library through the MQ, and the media asset library synchronizes the label data to the OTS database; incrementally synchronizing tag data to the ES database to support user retrieval; and retrieving the media assets through the tags of the ES database. The invention has the advantages of being compatible with various types of media asset tag storage and simultaneously enabling the media asset retrieval and the fragment positioning to be more convenient.

Description

Method for realizing media asset retrieval and display based on media asset content annotation
Technical Field
The invention relates to a method for realizing media asset retrieval and display based on media asset content marking.
Background
The current network media becomes an important way for people to know information, the traditional media resource platform is single, and the accuracy of information organization is influenced to a certain extent due to the defects of the traditional media resource platform in the aspects of labels, management mechanisms, retrieval mechanisms and the like, so that the news production efficiency is low. Therefore, the media asset marking technology is also valued and developed.
The traditional scheme can not well position the desired media asset data and is inconvenient to use; the cost is high, the medium resource retrieval without the label is slow, and the labor cost is greatly increased.
Disclosure of Invention
In order to solve the defects of the prior art, the invention provides a method for realizing the retrieval and display of the media assets based on the marking of the media asset content, which can be compatible with various types of media asset label storage and simultaneously enables the retrieval of the media assets and the positioning of the segments to be more effective and more convenient.
A method for realizing media asset retrieval and display based on media asset content annotation comprises the following steps:
warehousing the media assets;
media assets are synchronized to a marking platform, and marking is carried out on the media assets of different types;
after marking is finished, the label data flow back to the media asset library through the MQ, and the media asset library synchronizes the label data to the OTS database;
incrementally synchronizing tag data to the ES database to support user retrieval;
and retrieving the media assets through the tags of the ES database.
Further, when the media assets enter the OTS database, the media asset ID is generated through the UUID, and the label is associated through the media asset ID.
Further, the tags in the OTS database include manually labeled tags and tags identified by an artificial intelligence algorithm.
Further, the labels recognized by the artificial intelligence algorithm include human faces, speech and text.
Furthermore, the OTS database is provided with an event model for storing the information of the labels, and can be compatible with warehousing of various label forms.
Furthermore, marking is carried out on the picture media assets in a mode of viewing the picture contents.
Furthermore, for the audio media assets, time points are positioned in a mode of listening audio contents, and marking is carried out;
for video media assets, time points are positioned and marked in a mode of viewing video contents.
Further, setting a global label for the picture media asset; the audio media assets and the video media assets can be provided with time point labels or global labels.
Further, when synchronizing different types of tags to the ES database, the stored field names are isomorphically processed.
Furthermore, a plurality of records of the same medium asset are integrated into an array list and stored in an index column established by the ES database.
The invention has the advantages of being compatible with various types of media asset tag storage, and simultaneously enabling the media asset retrieval and the fragment positioning to be more effective and convenient.
The method can meet the requirement that a user can accurately retrieve and manage various media assets through manual marking and algorithm labels, support various media asset types including picture, audio, video and text picture audio and video mixed media asset labeling modes, and increase the label types at any time through ES retrieval by utilizing an event model of nosql.
Drawings
FIG. 1 is a flow chart of a method for implementing media asset retrieval and display based on media asset content tagging, illustrating a flow from a media asset tagging process to a media asset retrieval by a tag;
FIG. 2 is a schematic diagram of an OTS database storage structure of the method for implementing media asset retrieval and display based on media asset content annotation in FIG. 1;
fig. 3 is a schematic diagram of an ES database storage structure of the method for implementing asset retrieval and display based on asset content annotation in fig. 1.
Detailed Description
The invention is described in detail below with reference to the figures and the embodiments.
As shown in fig. 1 to fig. 3, a method for implementing retrieval and display of a medium asset based on marking of medium asset content includes the following steps:
warehousing the media assets;
media assets are synchronized to a marking platform, and marking is carried out on the media assets of different types;
after marking is finished, the label data flow back to the media asset library through the MQ, and the media asset library synchronizes the label data to the OTS database;
incrementally synchronizing tag data to the ES database to support user retrieval;
the user may retrieve the assets via the tags of the ES database.
Specifically, for the picture media assets, marking is carried out in a mode of viewing the picture contents. For audio media assets, positioning time points and marking in a mode of listening audio contents; for video media assets, time points are positioned and marked in a mode of viewing video contents. Further, setting a global label for the picture media asset; the audio media assets and the video media assets can be provided with time point labels or global labels.
AliOTS storage structure
The OTS has an event (event) model for storing tag information, and can be compatible with warehousing of multiple tag forms. Only one event needs to be newly established, the media asset id is generated through a UUID when one media asset enters, and the tag attribute is associated through the media asset id.
The tags in the OTS database include manually labeled tags and tags identified by an artificial intelligence algorithm. Specifically, the labels are not only manually marked in the label table, but also extended labels such as human faces, voice and texts identified by an artificial intelligent algorithm exist together, so that the retrieval and positioning efficiency of the media resources is greatly improved.
The Aliyun OTS database service is a column-oriented storage database service, the service supports dynamic column information increase, and if a tagged event is newly added to media assets, only data needs to be directly inserted into the newly added column, and OTS can automatically generate column information.
ES storage structure
After the media asset tag data is stored in the OTS, the media asset tag data is incrementally synchronized into the ES in real time to support the retrieval of the user.
Different types of asset tag data are different, and in order to support a user to search all types of assets in a mixed manner, it is necessary to perform isomorphic processing on field names stored when all types of tags are synchronized to an ES database, for example: the index name corresponding to the title of the video asset is the same as the index name corresponding to the title of other types of assets.
Some data fusion is needed for the information of the media asset tag table, a plurality of records may exist in a video, the records are integrated into an array List and are stored in an index column established by the ES, and the index name is tag. The method aims to perform unified word segmentation on the merged result and facilitate the retrieval of the label data information by a user.
After the conversion of the data is completed and the data in the OTS is synchronized to the ES, the ES can support the retrieval requirement of the user for the tag.
Mysql may be used for data volumes below the tens of millions level. For basic storage of the media asset data, in addition to an OTS storage database in Ali, HBase, etc. nosql databases can be used for replacement.
The foregoing illustrates and describes the principles, general features, and advantages of the present invention. It should be understood by those skilled in the art that the above embodiments do not limit the present invention in any way, and all technical solutions obtained by using equivalent alternatives or equivalent variations fall within the scope of the present invention.

Claims (10)

1. A method for realizing media asset retrieval and display based on media asset content annotation is characterized by comprising the following steps:
warehousing the media assets;
media assets are synchronized to a marking platform, and marking is carried out on the media assets of different types;
after marking is finished, the label data flow back to the media asset library through the MQ, and the media asset library synchronizes the label data to the OTS database;
incrementally synchronizing tag data to the ES database to support user retrieval;
and retrieving the media assets through the tags of the ES database.
2. The method for facilitating retrieval and display of assets based on annotation of content to assets of claim 1,
and generating a media asset ID through the UUID when the media asset enters the OTS database, and associating the label through the media asset ID.
3. The method for facilitating retrieval and display of assets based on annotation of content to assets of claim 2,
the tags in the OTS database include manually labeled tags and tags identified by an artificial intelligence algorithm.
4. The method for performing asset retrieval and display based on asset content tagging of claim 3, wherein the tags identified by the artificial intelligence algorithm comprise face, voice and text.
5. The method for facilitating retrieval and display of assets based on annotation of content to assets of claim 1,
the OTS database is provided with an event model for storing the information of the labels, and can be compatible with warehousing of various label forms.
6. The method for realizing media asset retrieval and display based on media asset content annotation as claimed in claim 1, wherein for the picture media asset, the marking is performed by looking at the picture content.
7. The method for facilitating retrieval and display of assets based on annotation of content to assets of claim 1,
for audio media assets, positioning time points and marking in a mode of listening audio contents;
for video media assets, time points are positioned and marked in a mode of viewing video contents.
8. The method for realizing media asset retrieval and display based on media asset content annotation as claimed in claim 1, wherein the picture media asset is provided with a global label; the audio media assets and the video media assets can be provided with time point labels or global labels.
9. The method for retrieving and displaying assets based on annotation of media content as claimed in claim 1, wherein tags of different types are isomorphically processed with the stored field names when synchronized to the ES database.
10. The method for facilitating retrieval and display of assets based on annotation of content to assets of claim 9,
and integrating a plurality of records of the same medium resource into an array list, and storing the array list in an index column established by the ES database.
CN201911356802.5A 2019-12-25 2019-12-25 Method for realizing media asset retrieval and display based on media asset content annotation Pending CN111008288A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911356802.5A CN111008288A (en) 2019-12-25 2019-12-25 Method for realizing media asset retrieval and display based on media asset content annotation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911356802.5A CN111008288A (en) 2019-12-25 2019-12-25 Method for realizing media asset retrieval and display based on media asset content annotation

Publications (1)

Publication Number Publication Date
CN111008288A true CN111008288A (en) 2020-04-14

Family

ID=70118497

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911356802.5A Pending CN111008288A (en) 2019-12-25 2019-12-25 Method for realizing media asset retrieval and display based on media asset content annotation

Country Status (1)

Country Link
CN (1) CN111008288A (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106446051A (en) * 2016-08-31 2017-02-22 北京新奥特云视科技有限公司 Deep search method of Eagle media assets
CN110427498A (en) * 2019-07-24 2019-11-08 新华智云科技有限公司 Storage method, device, storage equipment and the storage medium of media information

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106446051A (en) * 2016-08-31 2017-02-22 北京新奥特云视科技有限公司 Deep search method of Eagle media assets
CN110427498A (en) * 2019-07-24 2019-11-08 新华智云科技有限公司 Storage method, device, storage equipment and the storage medium of media information

Similar Documents

Publication Publication Date Title
CN108052577B (en) Universal text content mining method, device, server and storage medium
US9645787B1 (en) Tag-based electronic media playlist processing
CN105446946B (en) Rearrangement method, system and the electronic reading terminal of format document
CN110866383B (en) Interactive electronic data list generation method and system
CN102193906A (en) Method for automatically introducing examination paper in WORD format into database system
CN108255915B (en) File management method and device and machine-readable storage medium
CN102207948A (en) Method for generating incident statement sentence material base
CN111090656B (en) Method and system for dynamically constructing object portrait
CN106021393B (en) The grid equipment Standard Information Searching method and system of facing mobile apparatus
US20150278248A1 (en) Personal Information Management Service System
CN102314412A (en) Method and system for recording contextual information and tracing new word context
US20180357594A1 (en) Computer Vision and Digital Image Scanning based Inventory Management System
CN104765849A (en) Method and system for acquiring copied data source information
CN104349173A (en) Video repeating method and device
CN110929120B (en) Method and apparatus for managing technical metadata
CN110879799B (en) Method and device for labeling technical metadata
CN111008288A (en) Method for realizing media asset retrieval and display based on media asset content annotation
CN105740406A (en) Information indexing and searching method
CN110134817A (en) A kind of storage method of recording file, searching method and relevant apparatus
Theofanous et al. Cultural heritage content re-use: An aggregators's point of view
CN103186583B (en) A kind of information record based on mobile terminal and search method and device
CN111078905A (en) Data processing method, device, medium and equipment
CN108536872B (en) Method and device for optimizing knowledge base structure
CN101281524A (en) Method and apparatus for acquiring material
CN110457468B (en) Event classification method and device and storage device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination