CN113709521A - 一种根据视频内容自动匹配背景的*** - Google Patents

一种根据视频内容自动匹配背景的*** Download PDF

Info

Publication number
CN113709521A
CN113709521A CN202111101320.2A CN202111101320A CN113709521A CN 113709521 A CN113709521 A CN 113709521A CN 202111101320 A CN202111101320 A CN 202111101320A CN 113709521 A CN113709521 A CN 113709521A
Authority
CN
China
Prior art keywords
module
background
video
app
automatically matching
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202111101320.2A
Other languages
English (en)
Other versions
CN113709521B (zh
Inventor
付金龙
付译虹
邢硕
蒋昌杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuxin Intelligent Technology Co ltd
Original Assignee
Wuxin Intelligent Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuxin Intelligent Technology Co ltd filed Critical Wuxin Intelligent Technology Co ltd
Priority to CN202111101320.2A priority Critical patent/CN113709521B/zh
Publication of CN113709521A publication Critical patent/CN113709521A/zh
Application granted granted Critical
Publication of CN113709521B publication Critical patent/CN113709521B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/23418Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/23424Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving splicing one content stream with another content stream, e.g. for inserting or substituting an advertisement
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/235Processing of additional data, e.g. scrambling of additional data or processing content descriptors
    • H04N21/2355Processing of additional data, e.g. scrambling of additional data or processing content descriptors involving reformatting operations of additional data, e.g. HTML pages
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/4402Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47205End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for manipulating displayed content, e.g. interacting with MPEG-4 objects, editing locally
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/84Generation or processing of descriptive data, e.g. content descriptors
    • H04N21/8405Generation or processing of descriptive data, e.g. content descriptors represented by keywords
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Marketing (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Television Signal Processing For Recording (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Studio Circuits (AREA)

Abstract

本发明涉及视频合成技术领域,具体涉及一种根据视频内容自动匹配背景的***,包括:前端APP,所述前端APP设有摄像模块、存储模块、绿幕模块、编辑模块和上传模块;APP后台,所述APP后台设有语音识别模块、语义分析模块、素材库、合成模块;所述摄像模块依次通过所述绿幕模块及所述编辑模块与所述存储模块连接;所述存储模块通过所述上传模块与所述语音识别模块及所述语义分析模块连接;所述上传模块所述素材库分别与所述合成模块连接。本发明通过对录制视频中的信息分析,将素材库中相关素材与之进行合成,从而给视频制作者更多的想象空间。

Description

一种根据视频内容自动匹配背景的***
技术领域
本发明涉及视频合成技术领域,具体涉及一种根据视频内容自动匹配背景的***。
背景技术
现有技术中,通过电脑或手机等,可以将现实的图像和虚拟的背景模版简单地合并,制作成头像等,称之为“大头贴”。即使有些网站能提供制作动感影集的功能,也只是简单地将多个“大头贴”按较大的时间间隔播放。这种简单的静态合成方式,已不能满足用户的需求。人们更希望看到动态的现实生活与虚拟的背景模版相结合。如何将动态的现实生活与虚拟的背景模版灵活地合并起来,制作成动态的视频,是本技术领域亟需解决的技术问题。
发明内容
本发明提供一种根据视频内容自动匹配背景的***,通过对录制视频中的信息分析,将素材库中相关素材与之进行合成,从而给视频制作者更多的想象空间。
为了达到上述目的,本发明提供如下技术方案:一种根据视频内容自动匹配背景的***,其包括:前端APP,所述前端APP设有摄像模块、存储模块、绿幕模块、编辑模块和上传模块;APP后台,所述APP后台设有语音识别模块、语义分析模块、素材库、合成模块;所述摄像模块依次通过所述绿幕模块及所述编辑模块与所述存储模块连接;所述存储模块通过所述上传模块与所述语音识别模块及所述语义分析模块连接;所述语音识别模块以及所述语义分析模块通过所述素材库与所述合成模块连接。
优选的,还包括筛选模块,所述素材库通过所述筛选模块与所述合成模块连接。
优选的,一种根据视频内容自动匹配背景的方法,包括以下步骤:
步骤一、对录制视频进行信息提取生成文本内容,并对文本内容进行语义分析获得场景关键词;
步骤二、通过场景关键词在素材库中定位对应的背景影像;
步骤三、将背景影像与录制视频进行合成。
优选的,所述录制视频为通过前端APP录制的绿幕视频,并且所述前端APP对每一所述录制视频进行标签添加和背景限定。
优选的,所述步骤一中包括:通过APP后台的TTS技术对录制视频中的音频信息进行文字提取,并对提取的所述文字以及录制视频的标签进行场景关键词划分。
优选的,所述步骤三中,还包括通过APP后台对所述录制视频进行字幕添加。
优选的,所述背景限定,包括图片背景、视频背景、音频背景。
本发明有益效果为:本发明有益效果为:前端APP通过摄像模块录制视频,并在绿幕模块的处理下,将录制视频的背景处理为绿色,并存入存储模块中,期间使用者还通过编辑模块对录制视频进行设定,带有设定信息的录制视频经上传模块发送至APP后端,并在语音分析模块、语义分析模块的作用下得出场景关键词。素材库根据场景关键词以及录制视频的编辑信息定位背景影像,最终在合成模块的处理下输出合成视频。筛选模块对语音分析模块及语义分析模块划出的多个背景关键词进行一一示例,因此为使用者提供偏好选择空间。使用者通过前端APP对录制视频进行文字输入标签,该标签可以为使用者预订的背景关键词,从而当语义分析时,***能够将对应该背景关键词的背景影像输出。合成模块不但可以从素材库中筛选对应背景关键词的背景素材,还可以通过字幕追加模块直接编制背景字幕,从而更进一步的方便使用者自定义个性内容。
附图说明
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。
图1为本发明***结构示意图;
图2为本发明背景匹配方法流程示意图。
具体实施方式
下面将结合本发明的附图,对本发明的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。
根据图1所示,一种根据视频内容自动匹配背景的***,包括:前端APP1,所述前端APP1设有摄像模块2、存储模块3、绿幕模块4、编辑模块5和上传模块6;APP后台7,所述APP后台7设有语音识别模块8、语义分析模块9、素材库10、合成模块11;所述摄像模块2依次通过所述绿幕模块4及所述编辑模块5与所述存储模块3连接;所述存储模块3通过所述上传模块6与所述语音识别模块8及所述语义分析模块9连接;所述上传模块6所述素材库10分别与所述合成模块11连接。
通过上述设置,前端APP1通过摄像模块2录制视频,并在绿幕模块4的处理下,将录制视频的背景处理为绿色,并存入存储模块3中,期间使用者还通过编辑模块5对录制视频进行设定,带有设定信息的录制视频经上传模块6发送至APP后端,并在语音分析模块、语义分析模块9的作用下得出场景关键词。素材库10根据场景关键词以及录制视频的编辑信息定位背景影像,最终在合成模块11的处理下输出合成视频。
还包括筛选模块,所述素材库10通过所述筛选模块与所述合成模块11连接。
该设置中,筛选模块对语音分析模块及语义分析模块9划出的多个背景关键词进行一一示例,因此为使用者提供偏好选择空间。
所述编辑模块5包括标签编辑模块5、背景类型选择模块。
该设置中,使用者通过前端APP对录制视频进行文字输入标签,该标签可以为使用者预订的背景关键词,从而当语义分析时,***能够将对应该背景关键词的背景影像输出。
所述素材库10包括静态库、动态库和音频库;所述合成模块11设有第一输入端14、第二输入端15、第三输入端16和合成输出端17;所述第一输入端14与所述上传模块6连接、所述第二输入模块与所述筛选模块连接,所述第三输入端16设有字幕追加模块18。
通过该设置,合成模块11不但可以从素材库10中筛选对应背景关键词的背景素材,还可以通过字幕追加模块18直接编制背景字幕,从而更进一步的方便使用者自定义个性内容。
以上所述,仅为本发明的具体实施方式,但本发明的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本发明揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本发明的保护范围之内。因此,本发明的保护范围应所述以权利要求的保护范围为准。

Claims (7)

1.一种根据视频内容自动匹配背景的***,其特征在于,包括:
前端APP,所述前端APP设有摄像模块、存储模块、绿幕模块、编辑模块和上传模块;
APP后台,所述APP后台设有语音识别模块、语义分析模块、素材库、合成模块;
所述摄像模块依次通过所述绿幕模块及所述编辑模块与所述存储模块连接;
所述存储模块通过所述上传模块与所述语音识别模块及所述语义分析模块连接;
所述语音识别模块以及所述语义分析模块通过所述素材库与所述合成模块连接。
2.根据权利要求1所述的视频内容自动匹配背景的***,其特征在于:还包括筛选模块,所述素材库通过所述筛选模块与所述合成模块连接。
3.一种根据视频内容自动匹配背景的方法,用于权利要求1中的***,其特征在于,包括以下步骤:
步骤一、对录制视频进行信息提取生成文本内容,并对文本内容进行语义分析获得场景关键词;
步骤二、通过场景关键词在素材库中定位对应的背景影像;
步骤三、将背景影像与录制视频进行合成。
4.根据权利要求1所述的一种根据视频内容自动匹配背景的方法,其特征在于:所述录制视频为通过前端APP录制的绿幕视频,并且所述前端APP对每一所述录制视频进行标签添加和背景限定。
5.根据权利要求2所述的一种根据视频内容自动匹配背景的方法,其特征在于:所述步骤一中包括:通过APP后台的TTS技术对录制视频中的音频信息进行文字提取,并对提取的所述文字以及录制视频的标签进行场景关键词划分。
6.根据权利要求3所述的一种根据视频内容自动匹配背景的方法,其特征在于:所述步骤三中,还包括通过APP后台对所述录制视频进行字幕添加。
7.根据权利要求2所述的一种根据视频内容自动匹配背景的方法,其特征在于:所述背景限定,包括图片背景、视频背景、音频背景。
CN202111101320.2A 2021-09-18 2021-09-18 一种根据视频内容自动匹配背景的*** Active CN113709521B (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111101320.2A CN113709521B (zh) 2021-09-18 2021-09-18 一种根据视频内容自动匹配背景的***

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111101320.2A CN113709521B (zh) 2021-09-18 2021-09-18 一种根据视频内容自动匹配背景的***

Publications (2)

Publication Number Publication Date
CN113709521A true CN113709521A (zh) 2021-11-26
CN113709521B CN113709521B (zh) 2023-08-29

Family

ID=78661281

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111101320.2A Active CN113709521B (zh) 2021-09-18 2021-09-18 一种根据视频内容自动匹配背景的***

Country Status (1)

Country Link
CN (1) CN113709521B (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114339446A (zh) * 2021-12-28 2022-04-12 北京百度网讯科技有限公司 音视频编辑方法、装置、设备、存储介质及程序产品

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20130032653A (ko) * 2011-09-23 2013-04-02 브로드밴드미디어주식회사 동영상 자막을 키워드로 이용한 영상 검색 시스템 및 방법
KR20150022088A (ko) * 2013-08-22 2015-03-04 주식회사 엘지유플러스 컨텍스트 기반 브이오디 검색 시스템 및 이를 이용한 브이오디 검색 방법
KR101894956B1 (ko) * 2017-06-21 2018-10-24 주식회사 미디어프론트 실시간 증강 합성 기술을 이용한 영상 생성 서버 및 방법
CN111327839A (zh) * 2020-02-27 2020-06-23 江苏尚匠文化传播有限公司 一种基于虚拟视频技术的视频后期制作方法及***

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20130032653A (ko) * 2011-09-23 2013-04-02 브로드밴드미디어주식회사 동영상 자막을 키워드로 이용한 영상 검색 시스템 및 방법
KR20150022088A (ko) * 2013-08-22 2015-03-04 주식회사 엘지유플러스 컨텍스트 기반 브이오디 검색 시스템 및 이를 이용한 브이오디 검색 방법
KR101894956B1 (ko) * 2017-06-21 2018-10-24 주식회사 미디어프론트 실시간 증강 합성 기술을 이용한 영상 생성 서버 및 방법
CN111327839A (zh) * 2020-02-27 2020-06-23 江苏尚匠文化传播有限公司 一种基于虚拟视频技术的视频后期制作方法及***

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114339446A (zh) * 2021-12-28 2022-04-12 北京百度网讯科技有限公司 音视频编辑方法、装置、设备、存储介质及程序产品
CN114339446B (zh) * 2021-12-28 2024-04-05 北京百度网讯科技有限公司 音视频编辑方法、装置、设备、存储介质及程序产品

Also Published As

Publication number Publication date
CN113709521B (zh) 2023-08-29

Similar Documents

Publication Publication Date Title
CN107770626B (zh) 视频素材的处理方法、视频合成方法、装置及存储介质
JP4250301B2 (ja) 映像シーケンスを編集する方法及びシステム
US8930817B2 (en) Theme-based slideshows
US9317531B2 (en) Autocaptioning of images
CN101202864B (zh) 动画再现装置
US8923654B2 (en) Information processing apparatus and method, and storage medium storing program for displaying images that are divided into groups
KR100493674B1 (ko) 멀티미디어 데이터 검색 및 브라우징 시스템
CN110781328A (zh) 基于语音识别的视频生成方法、***、装置和存储介质
CN112579826A (zh) 视频显示及处理方法、装置、***、设备、介质
US7844115B2 (en) Information processing apparatus, method, and program product
KR20090091311A (ko) 스토리 테마를 생성하는 컴퓨터로 구현되는 방법 및 이러한 방법을 수행하기 위한 프로그램 저장 디바이스와 스토리 구성 시스템
KR20090094826A (ko) 스토리 셰어 제품을 자동으로 생성하는 시스템, 컴퓨터로 구현되는 방법 및 프로그램 저장 디바이스
US9666211B2 (en) Information processing apparatus, information processing method, display control apparatus, and display control method
CN112929746A (zh) 视频生成方法和装置、存储介质和电子设备
CN113709521A (zh) 一种根据视频内容自动匹配背景的***
US20150221114A1 (en) Information processing apparatus, information processing method, and program
JP6603929B1 (ja) 動画編集サーバおよびプログラム
CN108255917B (zh) 图像管理方法、设备及电子设备
CN113269855A (zh) 一种文字语义转场景动画的方法、设备及存储介质
JP2021119662A (ja) サーバおよびデータ割り当て方法
JP7133367B2 (ja) 動画編集装置、動画編集方法、及び動画編集プログラム
KR101477492B1 (ko) 동영상 콘텐츠 편집 및 재생을 위한 장치 및 그 방법
KR20080084303A (ko) 멀티미디어 파일에서 원하는 부분만 쉽고 빠르게 정확히 추출하여 u-컨텐츠 만드는 방법
AU745436B2 (en) Automated visual image editing system
KR102523746B1 (ko) 프레젠테이션 문서를 구성하는 슬라이드에 음성 데이터의 삽입을 가능하게 하는 전자 장치 및 그 동작 방법

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant