CN1604624A - 用于分析一个图象中的字幕的方法和设备 - Google Patents
用于分析一个图象中的字幕的方法和设备 Download PDFInfo
- Publication number
- CN1604624A CN1604624A CN200410082430.9A CN200410082430A CN1604624A CN 1604624 A CN1604624 A CN 1604624A CN 200410082430 A CN200410082430 A CN 200410082430A CN 1604624 A CN1604624 A CN 1604624A
- Authority
- CN
- China
- Prior art keywords
- program data
- multimedia program
- captions
- multimedia
- text
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 18
- 238000012545 processing Methods 0.000 claims abstract description 21
- 238000004458 analytical method Methods 0.000 claims abstract description 20
- 238000012937 correction Methods 0.000 claims abstract description 13
- 238000004891 communication Methods 0.000 claims description 7
- 230000008569 process Effects 0.000 claims description 7
- 230000003287 optical effect Effects 0.000 claims description 3
- 238000004590 computer program Methods 0.000 claims 8
- 238000009432 framing Methods 0.000 claims 3
- 239000003607 modifier Substances 0.000 claims 1
- 239000012634 fragment Substances 0.000 description 30
- 238000001914 filtration Methods 0.000 description 12
- 230000006870 function Effects 0.000 description 10
- 238000010586 diagram Methods 0.000 description 7
- 230000007246 mechanism Effects 0.000 description 4
- 230000005540 biological transmission Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000002715 modification method Methods 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000004321 preservation Methods 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000005236 sound signal Effects 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/439—Processing of audio elementary streams
- H04N21/4396—Processing of audio elementary streams by muting the audio signal
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/41—Structure of client; Structure of client peripherals
- H04N21/414—Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance
- H04N21/4147—PVR [Personal Video Recorder]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/44008—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/4402—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
- H04N21/440236—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by media transcoding, e.g. video is transformed into a slideshow of still pictures, audio is converted into text
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/45—Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
- H04N21/4508—Management of client data or end-user data
- H04N21/4532—Management of client data or end-user data involving end-user characteristics, e.g. viewer profile, preferences
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/45—Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
- H04N21/454—Content or additional data filtering, e.g. blocking advertisements
- H04N21/4542—Blocking scenes or portions of the received content, e.g. censoring scenes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/45—Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
- H04N21/466—Learning process for intelligent management, e.g. learning user preferences for recommending movies
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/45—Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
- H04N21/466—Learning process for intelligent management, e.g. learning user preferences for recommending movies
- H04N21/4662—Learning process for intelligent management, e.g. learning user preferences for recommending movies characterized by learning algorithms
- H04N21/4663—Learning process for intelligent management, e.g. learning user preferences for recommending movies characterized by learning algorithms involving probabilistic networks, e.g. Bayesian networks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/488—Data services, e.g. news ticker
- H04N21/4884—Data services, e.g. news ticker for displaying subtitles
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/76—Television signal recording
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/16—Analogue secrecy systems; Analogue subscription systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/08—Systems for the simultaneous or sequential transmission of more than one television signal, e.g. additional information signals, the signals occupying wholly or partially the same frequency band, e.g. by time division
- H04N7/087—Systems for the simultaneous or sequential transmission of more than one television signal, e.g. additional information signals, the signals occupying wholly or partially the same frequency band, e.g. by time division with signal insertion during the vertical blanking interval only
- H04N7/088—Systems for the simultaneous or sequential transmission of more than one television signal, e.g. additional information signals, the signals occupying wholly or partially the same frequency band, e.g. by time division with signal insertion during the vertical blanking interval only the inserted signal being digital
- H04N7/0884—Systems for the simultaneous or sequential transmission of more than one television signal, e.g. additional information signals, the signals occupying wholly or partially the same frequency band, e.g. by time division with signal insertion during the vertical blanking interval only the inserted signal being digital for the transmission of additional display-information, e.g. menu for programme or channel selection
- H04N7/0885—Systems for the simultaneous or sequential transmission of more than one television signal, e.g. additional information signals, the signals occupying wholly or partially the same frequency band, e.g. by time division with signal insertion during the vertical blanking interval only the inserted signal being digital for the transmission of additional display-information, e.g. menu for programme or channel selection for the transmission of subtitles
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- Probability & Statistics with Applications (AREA)
- Television Systems (AREA)
- Television Signal Processing For Recording (AREA)
- Editing Of Facsimile Originals (AREA)
Abstract
本发明提供了用于分析一个图象中的字幕的方法和设备。多媒体节目数据中字幕中的文本被标识以生成一组文本。对该组文本进行分析以形成一个分析。根据该分析标识需要进行修改的视频片段来形成一个标识的视频片段,并修改这个标识的片段。另外,还可进行颜色校正,以提高字幕中文本的清晰度。
Description
技术领域
本发明涉及经改进的数据处理***,并特别涉及用于处理数据的方法和设备。具体而言,本发明涉及用于处理视频数据的方法、设备和计算机指令。
背景技术
个人录像机(PVR)正越来越受到消费者的青睐。该设备也被称为数字录像机(DVR),允许用户在录制新的节目的同时,回放已经录制的节目。在有些情况下,用户可以一边在一个频道进行收看实况转播,一边从另一个频道录制节目。并且,用户也能在收看实况转播的同时暂停或重放录制的节目。在通常情况下,PVR通过与有线或卫星接收***相连,来接收数字视频和音频内容。与盒式磁带录像机相同,PVR也允许节目的时移,但它还拥有其他一些特点,如录制一个节目的所有剧集。该***包括一个用来存储节目的硬盘驱动器。
PVR同时还提供了多种特性,如通过网络与其他PVR共享录制的节目,存储数码照片,存储MP3文件。但PVR缺乏滤除不良内容的功能。某些情况下,用户希望观看节目,同时希望滤除节目中的不良内容,目前的PVR尚未提供该项特性。
因此,提供用于管理PVR上的节目的改良方法、设备和计算机指令是非常有利的。
发明内容
本发明提供用于处理视频数据的方法、设备和计算机指令。多媒体节目数据中字幕的文本被标识来生成一组文本。对该组文本进行分析以创建一个分析。根据该分析需要进行修改的视频片段被标识来形成一个标识的视频片段,并且这个标识的视频片段被改变。另外,还可进行颜色校正,以提高字幕文本的清晰度。
附图说明
在附录中列出了本发明突出的创新性特点。然而,当结合附图进行阅读时,通过参照图示实施例的详细说明能够最好地理解本发明本身,以及最佳实施方式、目标和优势,其中:
图1是在其中可以实施本发明的数据处理***的示意图;
图2是根据本发明的一个优选实施例的用于过滤多媒体节目的程序流程图;
图3是根据本发明的一个优选实施例的用于执行字幕颜色校正的程序流程图。
具体实施方式
下面参照附图(特别参照附图1)对可在其中实现本发明的数据处理***进行描述。数据处理***100以个人录像机(PVR)为例,它也可被称作数字录像机(DVR)。根据图示,数据处理***100中的元件通过总线***102互联。
数据处理***100包括处理单元104、存储器106、音频单元108、视频单元110、通信单元112、存储设备114和字幕和视频分析单元116。存储器106包含由处理单元104执行的用来提供各种PVR功能的指令。这些功能包括:例如,节目的录制、节目播放分析要处理的图像、以及管理可存储在数据处理***100中的节目等。
音频单元108包括用于从输入端口接收音频信号,并输出音频的元件。这些元件包括:例如,一个音频模数转换器(ADC)和一个音频数模转换器(DAC)等。视频单元110用于接收视频信号,并在数据处理***100中输出视频。视频单元110包括:一个视听(AV)编码器/解码器(编码译码器)。视频单元110能够输出视频信号以在显示器上进行显示,例如与数据处理***100相连的显示器118上。
根据特定的实施方案,音频单元108和视频单元110中的元件可作为硬件元件被敷设到处理单元104中。通信单元112提供一个连接,用于接收多媒体节目。在本实例中,一个多媒体节目包括视频和音频数据。多媒体节目亦可包含闭路标题数据,如字幕等。这些字幕根据用户喜好可以显示,也可以不显示。多媒体节目的实例包括:电视节目、电影和音乐视频。这些多媒体节目可以通过连接通信单元112至各种程序设计资源(如通过因特网、电缆网或卫星)获得。
存储设备114提供了一个位置用于存储多媒体节目。字幕和视频分析单元116提供一种用来分析多媒体节目字幕中的文本,并标识这些程序的特定片段是否应该被静音、成为空白或被完全删除的机构。通过这种方式,用户就能够观看到不含不良内容的多媒体节目。
字幕和视频分析单元116能够解码用于处理的多媒体节目的视频部分。在视频流中,字幕信息一般位于一个与视频数据分离的信道中。如果用户希望观看字幕,字幕信息就在视频适配器或显示单元的帧缓存区中被叠加到视频图象上。该字幕亦被称为该视频的一个闭路标题部分。
字幕中的文本被标识。根据具体方案不同,文本能够通过多种方式标识。在图例中,针对进行字幕输出的视频图象的闭路标题部分执行光学文字标识。从该处理中得到的文本将被输入到过滤器中,以标识多媒体节目中的不良部分。
在这些实例中,过滤操作采用在字幕和视频分析单元16中实现的baysean过滤器进行。Baysean过滤器目前被用于过滤电子邮件信息中的垃圾邮件(SPAM)。该类过滤器适用于评定多媒体节目的不同部分的级别。使用baysean过滤器,可采用baysean推论,即如果在一个场景中字幕或者多媒体节目的片段中要显示的文本经常出现在一个PG级电影中,而很少出现在一个G级电影中,则该多媒体节目片段的文本通常被评定为PG级。如果观看喜好被设定为G级多媒体节目,那么特定的场景将被修改或审查。该段视频图象将被刷白、静音、或被同时刷白和静音。
在这些实例中,一个视频图象片段是指在视频图象中显示字幕的某一部分。当显示新的字幕时,将遇到一个新的多媒体节目片段。
多媒体节目过滤用信息可以由数据处理***100的用户配置。可以创建用于不同电影级别(如G,PG,PG-13和R级)的默认文件设置。这些默认文件可被存储在存储设备114中。另外,供在baysean过滤器中使用的用户提供文件也被存储在存储设备114中。该用户文件可有各种来源。例如:一个包含baysean过滤功能的电子邮件公共程序可被用作一个来源。用于过滤SPAM邮件的文件可被下载到数据处理***100中。当然,任何外部信源可被用于该文件。
此外,字幕和视频分析单元116亦可用于对视频图象进行修改,以提高字幕的清晰度。这些修改包括:颜色校正,以调节屏幕上显示字幕的部分的颜色或调整字幕文本的显示。例如,如果文本颜色与背景颜色相近,可使用不同于背景颜色的颜色来描画组成字幕的文字的轮廓。另外,也可以改变字幕显示区域的背景颜色来提供针对更佳的字幕清晰度的对比度。
在这些实例中,字幕和视频分析单元116能够以多种形式实现。例如,该视频单元能够实现为一个具有合适的特定用途集成电路(ASIC)和指令的独立处理单元,以执行本发明图例中的功能。或者,字幕和视频分析单元116可包含由处理单元104执行的指令,来提供这些功能。
在这些实例中,数据处理***100采用PVR形式。该图示并非表示关于在其中可实现本发明的机构的本发明的体系结构限制。数据处理***100亦可采用具有软件的计算机和适当的适配器卡来实现,以允许使用PVR中的功能对多媒体节目进行接收和处理。
采用这种方式,本发明的机构具备过滤多媒体节目各部分的能力。虽然一个多媒体节目可能整个被定为不良级别,但该节目也可在滤除其中的不良片段后进行观看。可对其进行静音、画面刷白或同时进行上述两种操作。
现在来看图2,根据本发明的优选实施例描述用于过滤节目的程序流程图。图2中所示的程序能够在过滤***,例如图1中的字幕和视频分析单元116中实现。
该处理以对多媒体节目进行解码(步骤200)为开始。在这些实例中,视频流以MPEG2、MPEG3或JPEG等格式被接收。在这些多媒体文件中,音频和视频信道被分离为不同的信道。
包含字幕的闭路标题部分在不同于音频和视频的另一信道中。如需要,闭路标题部分能够被叠加到视频上以显示字幕信息。
该数据的解码可通过使用处理单元,例如图1中处理单元104等元件中的编码/解码处理器进行。根据具体方案不同,编码和解码可实现为如实例或硬件所述,例如包含编码和解码功能的逻辑。
选择一个经过解码的多媒体节目数据的片段(步骤202)。在这些示例中,多媒体节目数据中的数据片段被定义为多个帧。视频数据通常按每个片段30帧来进行显示。
接着,对多媒体节目数据的一个片段进行光学文字标识,以从用于那个片段数据的闭路标题部分中的字幕中获得文本(步骤204)。该文本将被输送到baysean和滤除算法(步骤206)。然后可获得评级(步骤208)。将该片段的评级结果与用户选定的喜好相比较(步骤210)。该喜好可以是电影评级,例如PG-13或R级。
对照用户选定的喜好(步骤212)来判定该片段是否恰当。例如,如果用户选定级别PG-13为恰当,并且根据对片段中文本的过滤标识结果,该片段被评定为R级,则该片段将被判定为不恰当。如果该片段被判定为不恰当,某些音频或视频的组合将被刷白画面或静音(步骤214)。虽然只对某个片段进行处理,但实际上步骤214能够对该片段中的每一个帧进行刷白画面或静音处理。经过修改的多媒体节目数据被存储(步骤216)。
接着,将判定是否有更多未经处理的片段(步骤218)。如果存在更多未经处理的片段,处理将返回到步骤202。否则,多媒体节目数据被重新编码(步骤220),并且在处理结束后保存经过处理的多媒体节目(步骤222)。
在图2的图例中,对片段进行了处理。当然,根据具体方案的不同,处理也可以逐帧地进行。另外,如果编码和解码功能以硬件方式实现,那么其他功能、例如baysean过滤器和帧缓冲器也可位于相同的硬件单元中。
往下翻到图3,是根据本发明的一个优选实施例的字幕颜色校正的程序流程图。图3中所示的处理可在过滤***,例如图1中的字幕和视频分析单元116中实现。
该处理以对多媒体节目进行解码(步骤300)为开始。在这些实例中,多媒体节目中的视频部分保持不变。解码信息被存储(步骤302)。选择多媒体节目中经解码的视频数据的一个片段进行处理(步骤304)。判定该片段是否需要进行颜色校正,以增强所选定片段中字幕的清晰度(步骤306)。根据方案的不同,步骤306可确定字幕中的文本是否需要被屏蔽或变模糊。执行该步骤可以屏蔽坏的或不良的语言。如需要校正,执行颜色校正(步骤308)。根据方案不同,所执行的具体的颜色校正类型各异。例如,可改变文本的背景,以增强文本相对于背景的对比度。
然后,确定在视频数据中是否有更多未经处理的片段(步骤310)。如果存在更多未经处理的片段,则处理将返回到步骤304。否则,数据被重新编码(步骤312),且在处理结束后保存经处理的多媒体节目以备将来的回放(步骤314)。再次参照步骤306,如果不需要执行颜色校正,则处理将如前所述前进到步骤310。
因此,本发明提供了用于多媒体节目过滤的改良方法、设备和计算机指令。图例中所列的本发明的机构允许在保持其它部分不变的情况下,根据用户的个人喜好对多媒体节目的某些部分或片段进行修改。在示例中,这些修改包括刷白视频图象片段中的画面,消除该片段的声音,或同时消除该片段的声音或画面。
需要重点提出的是,虽然在上下文中全部以功能性的数据处理***对本发明进行了描述,但本领域的普通技术人员应当理解,本发明的过程能够以计算机可读介质指令的形式和多种形式散布,并且本发明能够等效地应用而与实际用于执行散布的信号承载介质的特定类型无关。计算机可读介质的实例包括可记录型介质,例如软盘、硬磁盘驱动器、RAM、CD-ROM、DVD-ROM和传输型介质,例如使用如无线电频率和光波传输等传输形式的数字和模拟通信链路、有线或无线通信链路。计算机可读介质可采用这样的编码格式,即解码后可用于特定数据处理***的实际应用。
本发明说明书的提出目的在于给出例图和说明,并非在于详尽介绍或限于本发明散布时的状态。对于本领域内的一般技术人员而言,许多改进和变化将是非常明显的。选择本实施例并对其描述,其目的在于对本发明的原理、实际应用进行最佳说明,并使其他本领域内一般技术人员能够理解本发明可应用于施以各种改进的不同实施例,就如适用于所期望的特殊用途。
Claims (22)
1.一种数据处理***中用于处理多媒体节目数据的方法,该方法包括:
标识多媒体节目数据中字幕中的文本,以生成一组文本;
对该组文本进行分析以形成一个分析;
基于该分析,标识应该进行修改的多媒体节目数据的部分来形成一个标识部分;以及
修改该标识部分。
2.权利要求1的方法,其中该标识步骤包括:
针对多媒体节目数据中的字幕执行光学文字标识,以生成一组文本。
3.权利要求1的方法,其中该多媒体节目数据部分包括视频部分和音频部分,并且通过使视频部分或音频部分中的至少一个为空来修改该标识部分。
4.权利要求1的方法,其中该分析步骤包括:
对该组文本执行baysean过滤。
5.权利要求1的方法,进一步包括:
在开始执行步骤前,对该多媒体节目数据进行解码;以及
在修改该标识部分后,对该多媒体节目数据进行重新编码。
6.权利要求1的方法,其中该多媒体节目数据的该部分为一帧或一组帧。
7.权利要求1的方法,其中多媒体节目为电影。
8.一种数据处理***中用来处理多媒体节目的方法,该方法包括:
对多媒体节目解码,以形成解码的多媒体节目数据;
分析该多媒体节目数据的一部分;
确定该部分多媒体节目数据中字幕的清晰度是否需要提高;以及
响应该部分多媒体节目数据中字幕的清晰度需要提高的情况,针对该多媒体节目数据中包含清晰度需要提高的字幕的一部分执行颜色校正,以提高该字幕的清晰度。
9.一种用于处理多媒体节目数据的数据处理***,该数据处理***包括:
标识装置,用于标识多媒体节目数据中字幕中的文本,以生成一组文本;
分析装置,用于对该组文本进行分析以形成一个分析;
标识装置,用于根据该分析,标识应进行修改的多媒体节目数据的一部分以形成一个标识部分;以及
修改装置,用于修改该标识部分。
10.权利要求9的数据处理***,其中该多媒体节目数据部分包括视频部分和音频部分,并且通过使视频部分或音频部分中的至少一个为空来修改该标识部分。
11.权利要求9的数据处理***,其中该分析装置包括:
执行装置,用于对该组文本进行baysean过滤。
12.权利要求9的数据处理***进一步包括:
解码装置,用于在开始执行步骤前,对多媒体节目数据进行解码;以及
重新编码装置,用于在修改该标识部分后,对该多媒体节目数据进行重新编码。
13.权利要求9的数据处理***,其中该部分多媒体节目数据为一帧或一组帧。
14.一种用于处理多媒体节目的数据处理***,该数据处理***包括:
解码装置,用于对多媒体节目解码,以形成解码的多媒体节目数据;
分析装置,用于分析该多媒体节目数据的一部分;
测定装置,用于确定该部分多媒体节目数据中字幕的清晰度是否需要提高;并且
执行装置,用于响应该部分多媒体节目数据中字幕的清晰度需要提高的情况,针对该多媒体节目数据中包含清晰度需要提高的字幕的部分执行颜色校正,以提高该字幕的清晰度。
15.一种用于处理多媒体节目数据的、计算机可读介质中的计算机程序产品,其中计算机程序产品包括:
第一指令,用于标识多媒体节目数据中的文本,以生成一组文本;
第二指令,用于分析该组文本以形成一个分析;
第三指令,用于基于该分析,标识应进行修改的多媒体节目数据的一部分以形成一个标识部分;并且
第四指令,用于修改该标识部分。
16.权利要求15的计算机程序产品,其中该多媒体节目数据部分包括视频部分和音频部分,并且通过使视频部分或音频部分中的至少一个为空来修改该标识部分。
17.权利要求15的计算机程序产品,其中第二指令包括:
用于对该组文本执行baysean过滤的子指令。
18.权利要求15的计算机程序产品,进一步包括:
用于在开始执行步骤前,解码该多媒体节目数据的第五指令;以及
用于在修改该标识部分后,重新解码该多媒体节目数据的第六指令。
19.权利要求15的计算机程序产品,其中该部分多媒体节目数据为一帧或一组帧。
20.用于处理多媒体节目数据的、计算机可读介质中的计算机程序产品,该计算机程序产品包括:
用于解码多媒体节目以形成解码的多媒体节目数据的第一指令;
用于分析该多媒体节目数据的一部分的第二指令;
用于确定该部分多媒体节目数据中字幕的清晰度是否需要提高的第三指令;以及
响应该部分多媒体节目数据中字幕的清晰度需要提高,针对该多媒体节目数据中包含清晰度需要提高的字幕的部分执行颜色校正,来提高该字幕的清晰度的第四指令。
21.一种数据处理***,包括:
一个总线***;
一个与该总线***连接的通信单元;
一个与该总线***连接的存储器,其中该存储器包括一组指令;
一个与该总线***连接的处理单元,其中该处理单元执行该组指令以标识多媒体节目数据中字幕中的文本,以生成一组文本;对该组文本进行分析以形成一个分析;根据该分析,标识应进行修改的多媒体节目数据的一部分以形成一个标识部分;以及修改该标识部分。
22.一种数据处理***,包括:
一个总线***;
一个与该总线***连接的通信单元;
一个与该总线***连接的存储器,其中该存储器包括一组指令;以及
一个与该总线***连接的处理单元,其中该处理单元执行该组指令以解码多媒体节目来形成解码的多媒体节目数据;分析该多媒体节目数据的一部分;确定该部分多媒体节目数据中字幕的清晰度是否需要提高;并且响应该部分多媒体节目数据中字幕清晰度需要提高的情况,针对该多媒体节目数据中包含清晰度需要提高的字幕的部分执行颜色校正,以提高字幕的清晰度。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/674,975 | 2003-09-30 | ||
US10/674,975 US20050071888A1 (en) | 2003-09-30 | 2003-09-30 | Method and apparatus for analyzing subtitles in a video |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1604624A true CN1604624A (zh) | 2005-04-06 |
CN100382577C CN100382577C (zh) | 2008-04-16 |
Family
ID=34377001
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB2004100824309A Expired - Fee Related CN100382577C (zh) | 2003-09-30 | 2004-09-21 | 用于分析一个图象中的字幕的方法和设备 |
Country Status (3)
Country | Link |
---|---|
US (1) | US20050071888A1 (zh) |
JP (1) | JP2005110263A (zh) |
CN (1) | CN100382577C (zh) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN100442829C (zh) * | 2005-04-28 | 2008-12-10 | 索尼株式会社 | 字幕产生设备和方法 |
CN101753902B (zh) * | 2008-12-10 | 2012-05-16 | 晨星软件研发(深圳)有限公司 | 自动调整屏幕上显示信息的装置与方法 |
CN103945141A (zh) * | 2013-01-23 | 2014-07-23 | 索尼公司 | 视频处理装置、方法和服务器 |
CN109076246A (zh) * | 2016-04-06 | 2018-12-21 | 英特尔公司 | 使用图像数据校正掩码的视频编码方法和*** |
Families Citing this family (46)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060062552A1 (en) * | 2004-09-23 | 2006-03-23 | Richard Lesser | System and method of adapting sub-picture data for being displayed on mini-screens |
CN100583282C (zh) * | 2004-09-29 | 2010-01-20 | 彩色印片公司 | 色彩判定元数据生成的方法及设备 |
US8041190B2 (en) * | 2004-12-15 | 2011-10-18 | Sony Corporation | System and method for the creation, synchronization and delivery of alternate content |
US20060130119A1 (en) * | 2004-12-15 | 2006-06-15 | Candelore Brant L | Advanced parental control for digital content |
US20090300480A1 (en) * | 2005-07-01 | 2009-12-03 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Media segment alteration with embedded markup identifier |
US20090037278A1 (en) * | 2005-07-01 | 2009-02-05 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Implementing visual substitution options in media works |
US20080086380A1 (en) * | 2005-07-01 | 2008-04-10 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Alteration of promotional content in media works |
US9230601B2 (en) | 2005-07-01 | 2016-01-05 | Invention Science Fund I, Llc | Media markup system for content alteration in derivative works |
US9583141B2 (en) * | 2005-07-01 | 2017-02-28 | Invention Science Fund I, Llc | Implementing audio substitution options in media works |
US20090151004A1 (en) * | 2005-07-01 | 2009-06-11 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Media markup for visual content alteration |
US20100154065A1 (en) * | 2005-07-01 | 2010-06-17 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Media markup for user-activated content alteration |
US20080013859A1 (en) * | 2005-07-01 | 2008-01-17 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Implementation of media content alteration |
US9092928B2 (en) * | 2005-07-01 | 2015-07-28 | The Invention Science Fund I, Llc | Implementing group content substitution in media works |
US20080010083A1 (en) * | 2005-07-01 | 2008-01-10 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Approval technique for media content alteration |
US20070294720A1 (en) * | 2005-07-01 | 2007-12-20 | Searete Llc | Promotional placement in media works |
US20070263865A1 (en) * | 2005-07-01 | 2007-11-15 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Authorization rights for substitute media content |
US8732087B2 (en) * | 2005-07-01 | 2014-05-20 | The Invention Science Fund I, Llc | Authorization for media content alteration |
US20090037243A1 (en) * | 2005-07-01 | 2009-02-05 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Audio substitution options in media works |
US8910033B2 (en) * | 2005-07-01 | 2014-12-09 | The Invention Science Fund I, Llc | Implementing group content substitution in media works |
US9065979B2 (en) * | 2005-07-01 | 2015-06-23 | The Invention Science Fund I, Llc | Promotional placement in media works |
US20070005423A1 (en) * | 2005-07-01 | 2007-01-04 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Providing promotional content |
US20090235364A1 (en) * | 2005-07-01 | 2009-09-17 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Media markup for promotional content alteration |
US7860342B2 (en) | 2005-07-01 | 2010-12-28 | The Invention Science Fund I, Llc | Modifying restricted images |
US20070276757A1 (en) * | 2005-07-01 | 2007-11-29 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Approval technique for media content alteration |
US20080052161A1 (en) * | 2005-07-01 | 2008-02-28 | Searete Llc | Alteration of promotional content in media works |
US9426387B2 (en) * | 2005-07-01 | 2016-08-23 | Invention Science Fund I, Llc | Image anonymization |
US20080052104A1 (en) * | 2005-07-01 | 2008-02-28 | Searete Llc | Group content substitution in media works |
US20070266049A1 (en) * | 2005-07-01 | 2007-11-15 | Searete Llc, A Limited Liability Corportion Of The State Of Delaware | Implementation of media content alteration |
US20090210946A1 (en) * | 2005-07-01 | 2009-08-20 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Media markup for promotional audio content |
US20090150199A1 (en) * | 2005-07-01 | 2009-06-11 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Visual substitution options in media works |
US20100017885A1 (en) * | 2005-07-01 | 2010-01-21 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Media markup identifier for alterable promotional segments |
US20090204475A1 (en) * | 2005-07-01 | 2009-08-13 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Media markup for promotional visual content |
US20090150444A1 (en) * | 2005-07-01 | 2009-06-11 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Media markup for audio content alteration |
US8185921B2 (en) * | 2006-02-28 | 2012-05-22 | Sony Corporation | Parental control of displayed content using closed captioning |
US20080180539A1 (en) * | 2007-01-31 | 2008-07-31 | Searete Llc, A Limited Liability Corporation | Image anonymization |
JP4899908B2 (ja) * | 2007-02-14 | 2012-03-21 | セイコーエプソン株式会社 | 情報処理装置、情報処理方法、プログラムおよび記録媒体 |
US20080244755A1 (en) * | 2007-03-30 | 2008-10-02 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Authorization for media content alteration |
US20080259211A1 (en) * | 2007-04-23 | 2008-10-23 | Nokia Corporation | Using Subtitles for Other Purposes |
US20080270161A1 (en) * | 2007-04-26 | 2008-10-30 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Authorization rights for substitute media content |
US9215512B2 (en) * | 2007-04-27 | 2015-12-15 | Invention Science Fund I, Llc | Implementation of media content alteration |
JP5393025B2 (ja) * | 2007-12-21 | 2014-01-22 | 帝国繊維株式会社 | 消防用ホース |
US8615596B1 (en) * | 2009-01-14 | 2013-12-24 | Sprint Communications Company L.P. | Communication method and system for providing content to a communication device according to a user preference |
JP2011053468A (ja) * | 2009-09-02 | 2011-03-17 | Sony Corp | 映像/文字同時表示装置及び頭部装着型ディスプレイ |
EP2579609A1 (en) * | 2011-10-06 | 2013-04-10 | Thomson Licensing | Method and apparatus for providing information for a multimedia content item |
US10476923B2 (en) * | 2013-04-05 | 2019-11-12 | Arris Enterprises Llc | Filtering content for adaptive streaming |
US10268729B1 (en) | 2016-06-08 | 2019-04-23 | Wells Fargo Bank, N.A. | Analytical tool for evaluation of message content |
Family Cites Families (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6115057A (en) * | 1995-02-14 | 2000-09-05 | Index Systems, Inc. | Apparatus and method for allowing rating level control of the viewing of a program |
JPH08317301A (ja) * | 1995-05-22 | 1996-11-29 | Hitachi Ltd | 映像出力装置 |
JPH0951489A (ja) * | 1995-08-04 | 1997-02-18 | Sony Corp | データ符号化/復号化方法および装置 |
JPH0965230A (ja) * | 1995-08-21 | 1997-03-07 | Ekushingu:Kk | 字幕表示方法及び字幕表示装置 |
US20030093790A1 (en) * | 2000-03-28 | 2003-05-15 | Logan James D. | Audio and video program recording, editing and playback systems using metadata |
US6097442A (en) * | 1996-12-19 | 2000-08-01 | Thomson Consumer Electronics, Inc. | Method and apparatus for reformatting auxiliary information included in a television signal |
US6181364B1 (en) * | 1997-05-16 | 2001-01-30 | United Video Properties, Inc. | System for filtering content from videos |
US6166780A (en) * | 1997-10-21 | 2000-12-26 | Principle Solutions, Inc. | Automated language filter |
US6075550A (en) * | 1997-12-23 | 2000-06-13 | Lapierre; Diane | Censoring assembly adapted for use with closed caption television |
US20020083441A1 (en) * | 2000-08-31 | 2002-06-27 | Flickinger Gregory C. | Advertisement filtering and storage for targeted advertisement systems |
US6351596B1 (en) * | 2000-01-07 | 2002-02-26 | Time Warner Entertainment Co, Lp | Content control of broadcast programs |
US20020009285A1 (en) * | 2000-03-08 | 2002-01-24 | General Instrument Corporation | Personal versatile recorder: enhanced features, and methods for its use |
US20020065678A1 (en) * | 2000-08-25 | 2002-05-30 | Steven Peliotis | iSelect video |
US6798912B2 (en) * | 2000-12-18 | 2004-09-28 | Koninklijke Philips Electronics N.V. | Apparatus and method of program classification based on syntax of transcript information |
US7210157B2 (en) * | 2000-12-18 | 2007-04-24 | Koninklijke Philips Electronics N.V. | Apparatus and method of program classification using observed cues in the transcript information |
US7050109B2 (en) * | 2001-03-02 | 2006-05-23 | General Instrument Corporation | Methods and apparatus for the provision of user selected advanced close captions |
US20030053798A1 (en) * | 2001-03-22 | 2003-03-20 | Magenya Roshanski | Personal video recorder |
US8949878B2 (en) * | 2001-03-30 | 2015-02-03 | Funai Electric Co., Ltd. | System for parental control in video programs based on multimedia content information |
US6901603B2 (en) * | 2001-07-10 | 2005-05-31 | General Instrument Corportion | Methods and apparatus for advanced recording options on a personal versatile recorder |
US7950033B2 (en) * | 2001-10-10 | 2011-05-24 | Opentv, Inc. | Utilization of relational metadata in a television system |
US20030107592A1 (en) * | 2001-12-11 | 2003-06-12 | Koninklijke Philips Electronics N.V. | System and method for retrieving information related to persons in video programs |
US7054804B2 (en) * | 2002-05-20 | 2006-05-30 | International Buisness Machines Corporation | Method and apparatus for performing real-time subtitles translation |
US7360234B2 (en) * | 2002-07-02 | 2008-04-15 | Caption Tv, Inc. | System, method, and computer program product for selective filtering of objectionable content from a program |
-
2003
- 2003-09-30 US US10/674,975 patent/US20050071888A1/en not_active Abandoned
-
2004
- 2004-09-21 CN CNB2004100824309A patent/CN100382577C/zh not_active Expired - Fee Related
- 2004-09-28 JP JP2004280898A patent/JP2005110263A/ja active Pending
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN100442829C (zh) * | 2005-04-28 | 2008-12-10 | 索尼株式会社 | 字幕产生设备和方法 |
CN101753902B (zh) * | 2008-12-10 | 2012-05-16 | 晨星软件研发(深圳)有限公司 | 自动调整屏幕上显示信息的装置与方法 |
CN103945141A (zh) * | 2013-01-23 | 2014-07-23 | 索尼公司 | 视频处理装置、方法和服务器 |
CN109076246A (zh) * | 2016-04-06 | 2018-12-21 | 英特尔公司 | 使用图像数据校正掩码的视频编码方法和*** |
Also Published As
Publication number | Publication date |
---|---|
JP2005110263A (ja) | 2005-04-21 |
US20050071888A1 (en) | 2005-03-31 |
CN100382577C (zh) | 2008-04-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN100382577C (zh) | 用于分析一个图象中的字幕的方法和设备 | |
EP1610557A1 (en) | System and method for embedding multimedia processing information in a multimedia bitstream | |
CN102771109B (zh) | 通过盖写视频数据进行视频传递和控制的方法、设备和*** | |
EP1635575A1 (en) | System and method for embedding scene change information in a video bitstream | |
CN102893602B (zh) | 具有使用嵌入在比特流中的元数据的呈现控制的视频显示 | |
US8798170B2 (en) | Program recommendation apparatus | |
US7706663B2 (en) | Apparatus and method for embedding content information in a video bit stream | |
US20090034937A1 (en) | Video recording apparatus, scene change extraction method, and video audio recording apparatus | |
JP2006115457A (ja) | マルチメディア編集情報をマルチメディアビットストリームに埋め込むシステムおよびその方法 | |
US20030068087A1 (en) | System and method for generating a character thumbnail sequence | |
US20100145488A1 (en) | Dynamic transrating based on audio analysis of multimedia content | |
CN1505392A (zh) | 记录装置和记录方法 | |
CN1178476C (zh) | 图象处理装置和图象处理方法 | |
US20090196569A1 (en) | Video trailer | |
US20110234900A1 (en) | Method and apparatus for identifying video program material or content via closed caption data | |
US20060059509A1 (en) | System and method for embedding commercial information in a video bitstream | |
CN103024607A (zh) | 用于显示摘要视频的方法和设备 | |
CN1798269A (zh) | 字幕服务菜单显示装置和方法 | |
GB2352915A (en) | A method of retrieving text data from a broadcast image | |
EP1701543A1 (en) | File recording device, file recording method, file recording method program, recording medium containing program of file recording method, file reproduction device, file reproduction method, file reproduction method program, and recording medium containing file reproduction method program | |
CN1867992A (zh) | 具有用于管理文本字幕重现的数据结构的记录介质以及记录和重现的方法和装置 | |
EP1858017A1 (en) | Image processing apparatus and file reproduce method | |
US20060080591A1 (en) | Apparatus and method for automated temporal compression of multimedia content | |
CN101686307A (zh) | 图像处理装置、图像处理方法和程序 | |
CN1433216A (zh) | 用于处理闭路字幕的装置和方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C17 | Cessation of patent right | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20080416 Termination date: 20091021 |