ATE512411T1 - System und verfahren zur integrierten analyse von intrinsischen und extrinsischen audiovisuellen daten - Google Patents

System und verfahren zur integrierten analyse von intrinsischen und extrinsischen audiovisuellen daten

Info

Publication number
ATE512411T1
ATE512411T1 AT04799283T AT04799283T ATE512411T1 AT E512411 T1 ATE512411 T1 AT E512411T1 AT 04799283 T AT04799283 T AT 04799283T AT 04799283 T AT04799283 T AT 04799283T AT E512411 T1 ATE512411 T1 AT E512411T1
Authority
AT
Austria
Prior art keywords
extrinsic
intrinsic
film
data
information
Prior art date
Application number
AT04799283T
Other languages
English (en)
Inventor
Nevenka Dimitrova
Robert Turetsky
Original Assignee
Koninkl Philips Electronics Nv
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninkl Philips Electronics Nv filed Critical Koninkl Philips Electronics Nv
Application granted granted Critical
Publication of ATE512411T1 publication Critical patent/ATE512411T1/de

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/7844Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using original textual content or text extracted from visual content or transcript of audio data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/685Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using automatically derived transcript of audio data, e.g. lyrics
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/7834Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using audio features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/7837Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using objects detected or recognised in the video content
    • G06F16/784Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using objects detected or recognised in the video content the detected or recognised objects being people
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/7847Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using low-level visual features of the video content
    • G06F16/785Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using low-level visual features of the video content using colour or luminescence
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/41Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Library & Information Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Transfer Between Computers (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
AT04799283T 2003-12-05 2004-11-30 System und verfahren zur integrierten analyse von intrinsischen und extrinsischen audiovisuellen daten ATE512411T1 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US52747603P 2003-12-05 2003-12-05
EP04100622 2004-02-17
PCT/IB2004/052601 WO2005055196A2 (en) 2003-12-05 2004-11-30 System & method for integrative analysis of intrinsic and extrinsic audio-visual data

Publications (1)

Publication Number Publication Date
ATE512411T1 true ATE512411T1 (de) 2011-06-15

Family

ID=44122679

Family Applications (1)

Application Number Title Priority Date Filing Date
AT04799283T ATE512411T1 (de) 2003-12-05 2004-11-30 System und verfahren zur integrierten analyse von intrinsischen und extrinsischen audiovisuellen daten

Country Status (5)

Country Link
US (1) US20070061352A1 (de)
EP (1) EP1692629B1 (de)
JP (1) JP2007519987A (de)
AT (1) ATE512411T1 (de)
WO (1) WO2005055196A2 (de)

Families Citing this family (38)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009528756A (ja) 2006-03-03 2009-08-06 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ 複数の画像の要約の自動生成のための方法及び装置
US8732154B2 (en) 2007-02-28 2014-05-20 Samsung Electronics Co., Ltd. Method and system for providing sponsored information on electronic devices
US8200688B2 (en) 2006-03-07 2012-06-12 Samsung Electronics Co., Ltd. Method and system for facilitating information searching on electronic devices
US9100723B2 (en) 2006-03-07 2015-08-04 Samsung Electronics Co., Ltd. Method and system for managing information on a video recording
US8843467B2 (en) 2007-05-15 2014-09-23 Samsung Electronics Co., Ltd. Method and system for providing relevant information to a user of a device in a local network
US8863221B2 (en) 2006-03-07 2014-10-14 Samsung Electronics Co., Ltd. Method and system for integrating content and services among multiple networks
US8195650B2 (en) 2007-02-28 2012-06-05 Samsung Electronics Co., Ltd. Method and system for providing information using a supplementary device
US8115869B2 (en) 2007-02-28 2012-02-14 Samsung Electronics Co., Ltd. Method and system for extracting relevant information from content metadata
US8209724B2 (en) * 2007-04-25 2012-06-26 Samsung Electronics Co., Ltd. Method and system for providing access to information of potential interest to a user
US8510453B2 (en) * 2007-03-21 2013-08-13 Samsung Electronics Co., Ltd. Framework for correlating content on a local network with information on an external network
US8041025B2 (en) * 2006-08-07 2011-10-18 International Business Machines Corporation Systems and arrangements for controlling modes of audio devices based on user selectable parameters
US8935269B2 (en) * 2006-12-04 2015-01-13 Samsung Electronics Co., Ltd. Method and apparatus for contextual search and query refinement on consumer electronics devices
US20090055393A1 (en) * 2007-01-29 2009-02-26 Samsung Electronics Co., Ltd. Method and system for facilitating information searching on electronic devices based on metadata information
US20080221892A1 (en) * 2007-03-06 2008-09-11 Paco Xander Nathan Systems and methods for an autonomous avatar driver
US9286385B2 (en) 2007-04-25 2016-03-15 Samsung Electronics Co., Ltd. Method and system for providing access to information of potential interest to a user
US8111281B2 (en) * 2007-06-29 2012-02-07 Sony Ericsson Mobile Communications Ab Methods and terminals that control avatars during videoconferencing and other communications
US8781996B2 (en) 2007-07-12 2014-07-15 At&T Intellectual Property Ii, L.P. Systems, methods and computer program products for searching within movies (SWiM)
US8176068B2 (en) 2007-10-31 2012-05-08 Samsung Electronics Co., Ltd. Method and system for suggesting search queries on electronic devices
US8001561B2 (en) 2007-11-20 2011-08-16 Samsung Electronics Co., Ltd. System and method for automatically rating video content
US9378286B2 (en) * 2008-03-14 2016-06-28 Microsoft Technology Licensing, Llc Implicit user interest marks in media content
US8237742B2 (en) 2008-06-12 2012-08-07 International Business Machines Corporation Simulation method and system
US8259992B2 (en) 2008-06-13 2012-09-04 International Business Machines Corporation Multiple audio/video data stream simulation method and system
US8332414B2 (en) 2008-07-01 2012-12-11 Samsung Electronics Co., Ltd. Method and system for prefetching internet content for video recorders
US8938465B2 (en) * 2008-09-10 2015-01-20 Samsung Electronics Co., Ltd. Method and system for utilizing packaged content sources to identify and provide information based on contextual information
DE102009060687A1 (de) 2009-11-04 2011-05-05 Siemens Aktiengesellschaft Verfahren und Vorrichtung zum rechnergestützten Annotieren von Multimediadaten
US20120246732A1 (en) * 2011-03-22 2012-09-27 Eldon Technology Limited Apparatus, systems and methods for control of inappropriate media content events
US20150356353A1 (en) * 2013-01-10 2015-12-10 Thomson Licensing Method for identifying objects in an audiovisual document and corresponding device
US9123330B1 (en) 2013-05-01 2015-09-01 Google Inc. Large-scale speaker identification
US10535330B2 (en) 2013-08-05 2020-01-14 Crackle, Inc. System and method for movie karaoke
US9489360B2 (en) * 2013-09-05 2016-11-08 Audible, Inc. Identifying extra material in companion content
FR3035530A1 (fr) * 2015-04-23 2016-10-28 Orange Identification des locuteurs d'un contenu multimedia par l'analyse conjointe de donnees audio et de donnees de sous-titres
CN109788346B (zh) * 2018-08-19 2021-01-22 深圳市量籽科技有限公司 视频文件配置解析方法
CN109309868B (zh) * 2018-08-19 2019-06-18 上海极链网络科技有限公司 视频文件配置解析***
US11024288B2 (en) 2018-09-04 2021-06-01 Gracenote, Inc. Methods and apparatus to segment audio and determine audio segment similarities
CN111078952B (zh) * 2019-11-20 2023-07-21 重庆邮电大学 一种基于层次结构的跨模态可变长度哈希检索方法
CN111104546B (zh) * 2019-12-03 2021-08-27 珠海格力电器股份有限公司 一种构建语料库的方法、装置、计算设备及存储介质
US11381797B2 (en) 2020-07-16 2022-07-05 Apple Inc. Variable audio for audio-visual content
US11770590B1 (en) 2022-04-27 2023-09-26 VoyagerX, Inc. Providing subtitle for video content in spoken language

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5596705A (en) * 1995-03-20 1997-01-21 International Business Machines Corporation System and method for linking and presenting movies with their underlying source information
US6829368B2 (en) * 2000-01-26 2004-12-07 Digimarc Corporation Establishing and interacting with on-line media collections using identifiers in media signals
US5969755A (en) * 1996-02-05 1999-10-19 Texas Instruments Incorporated Motion based event detection system and method
US6363380B1 (en) * 1998-01-13 2002-03-26 U.S. Philips Corporation Multimedia computer system with story segmentation capability and operating program therefor including finite automation video parser
US6243676B1 (en) * 1998-12-23 2001-06-05 Openwave Systems Inc. Searching and retrieving multimedia information
US20040220791A1 (en) * 2000-01-03 2004-11-04 Interactual Technologies, Inc. A California Corpor Personalization services for entities from multiple sources
US6519648B1 (en) * 2000-01-24 2003-02-11 Friskit, Inc. Streaming media search and continuous playback of multiple media resources located on a network
US6834308B1 (en) * 2000-02-17 2004-12-21 Audible Magic Corporation Method and apparatus for identifying media content presented on a media playing device
US6763148B1 (en) * 2000-11-13 2004-07-13 Visual Key, Inc. Image recognition methods
US6925455B2 (en) * 2000-12-12 2005-08-02 Nec Corporation Creating audio-centric, image-centric, and integrated audio-visual summaries
US20030107592A1 (en) 2001-12-11 2003-06-12 Koninklijke Philips Electronics N.V. System and method for retrieving information related to persons in video programs
US8285727B2 (en) * 2003-03-06 2012-10-09 Thomson Licensing S.A. Simplified searching for media services using a control device
JP2004287965A (ja) * 2003-03-24 2004-10-14 Fujitsu Ltd 推奨音楽検索装置、および推奨音楽検索方法、および推奨音楽検索プログラムを格納したコンピュータ読み取り可能な記憶媒体
WO2005069171A1 (ja) * 2004-01-14 2005-07-28 Nec Corporation 文書対応付け装置、および文書対応付け方法

Also Published As

Publication number Publication date
WO2005055196A2 (en) 2005-06-16
US20070061352A1 (en) 2007-03-15
EP1692629B1 (de) 2011-06-08
WO2005055196A3 (en) 2006-02-23
JP2007519987A (ja) 2007-07-19
EP1692629A2 (de) 2006-08-23

Similar Documents

Publication Publication Date Title
ATE512411T1 (de) System und verfahren zur integrierten analyse von intrinsischen und extrinsischen audiovisuellen daten
CN103761261B (zh) 一种基于语音识别的媒体搜索方法及装置
JP4484252B2 (ja) ストーリーセグメンテーション機能を有するマルチメディアコンピュータシステム及びその動作プログラム
CN110008378B (zh) 基于人工智能的语料收集方法、装置、设备及存储介质
US20080201314A1 (en) Method and apparatus for using multiple channels of disseminated data content in responding to information requests
CN116801003A (zh) 用于根据脚本自动制作视频节目的方法和***
US20100008547A1 (en) Method and System for Automated Annotation of Persons in Video Content
US8965916B2 (en) Method and apparatus for providing media content
US20140278845A1 (en) Methods and Systems for Identifying Target Media Content and Determining Supplemental Information about the Target Media Content
Poignant et al. Unsupervised speaker identification using overlaid texts in TV broadcast
US20090132074A1 (en) Automatic segment extraction system for extracting segment in music piece, automatic segment extraction method, and automatic segment extraction program
CN102411578A (zh) 一种多媒体播放***和方法
WO2008054960B1 (en) Use of information correlation for relevant information
US20140114656A1 (en) Electronic device capable of generating tag file for media file based on speaker recognition
CN112468754B (zh) 一种基于音视频识别技术的笔录数据采集方法及装置
JP2005115607A (ja) 映像検索装置
WO2006031466A3 (en) Functionality and system for converting data from a first to a second form
US20150178387A1 (en) Method and system of audio retrieval and source separation
TW201039149A (en) Robust algorithms for video text information extraction and question-answer retrieval
Schmiedeke et al. Overview of mediaeval 2012 genre tagging task
KR100916310B1 (ko) 오디오 신호처리 기반의 음악 및 동영상간의 교차 추천 시스템 및 방법
Hauptmann et al. Artificial intelligence techniques in the interface to a digital video library
CN113204670B (zh) 一种基于注意力模型的视频摘要描述生成方法及装置
Adcock et al. TalkMiner: a search engine for online lecture video
Kale et al. Video Retrieval Using Automatically Extracted Audio

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties