CN110381266A - Video generation method, device and terminal

Info

Publication number: CN110381266A
Application number: CN201910700099.9A
Authority: CN
Inventors: 杜念冬; 鲍冠伯; 杨杰
Original assignee: Beijing Baidu Netcom Science and Technology Co Ltd
Current assignee: Baidu Online Network Technology Beijing Co Ltd; Beijing Baidu Netcom Science and Technology Co Ltd
Priority date: 2019-07-31
Filing date: 2019-07-31
Publication date: 2019-10-25

Abstract

This application discloses a video generation method and relates to the video field. In a specific implementation, video content material and a video template are obtained, the video content material including an image of a virtual character, dynamic elements of the virtual character, and broadcast content; a broadcast segment of the virtual character is generated according to the broadcast content, the image of the virtual character, and the dynamic elements of the virtual character; and the broadcast segment of the virtual character is added to the video template to obtain a broadcast video. The method improves the efficiency of video generation and reduces cost, and it also enriches the video content and meets viewers' diverse needs.

Description

Video generation method, device and terminal

Technical field

The present application relates to the field of computer technology, and in particular to the video field.

Background art

Traditional news broadcasting takes a long time to complete, passing through stages such as program planning, material preparation, reporting and editing, and production, and it requires substantial manual work. As a result, traditional news broadcasting is costly and its overall production efficiency is low. At present, a virtual anchor can read out the news content, and a video of the virtual anchor broadcasting the news can be generated to replace traditional news broadcasting. However, such a video contains only two elements: a background picture and the virtual anchor. The elements in the video are monotonous and the news is not rich enough, so the video cannot meet viewers' diverse needs for news broadcasts.

Summary of the invention

The present application provides a video generation method, device and terminal to solve one or more technical problems in the prior art.

In a first aspect, the present application provides a video generation method, comprising:

obtaining video content material and a video template, the video content material including an image of a virtual character, dynamic elements of the virtual character, and broadcast content;

generating a broadcast segment of the virtual character according to the broadcast content, the image of the virtual character, and the dynamic elements of the virtual character;

adding the broadcast segment of the virtual character to the video template to obtain a broadcast video.

This improves the efficiency of video generation, reduces cost, enriches the video content, and meets viewers' diverse needs.

In one embodiment, the broadcast content includes broadcast text, and the dynamic elements of the virtual character include lip movements, expressions, and actions of the virtual character. Generating the broadcast segment of the virtual character according to the broadcast content, the image of the virtual character, and the dynamic elements of the virtual character comprises:

obtaining a broadcast voice type according to the video template;

generating broadcast speech according to the broadcast text and the broadcast voice type;

inputting the broadcast speech into a lip movement model, which outputs the lip movements of the virtual character corresponding to the broadcast speech;

obtaining the expressions and actions of the virtual character corresponding to the broadcast text;

generating the broadcast segment of the virtual character according to the image of the virtual character, the broadcast speech, and the lip movements, expressions, and actions of the virtual character.

In this embodiment, the image, lip movements, expressions, and actions of the virtual character correspond to the broadcast speech, so that in the generated broadcast segment the mood shown by the virtual character's expressions and actions, together with the broadcast speech, better matches the logic of the broadcast text. This also makes the broadcast video more refined.

In one embodiment, adding the broadcast segment of the virtual character to the video template to obtain the broadcast video comprises:

obtaining a placement position of the virtual character according to the video template;

adding the broadcast segment of the virtual character at the placement position of the virtual character to obtain the broadcast video. By adding the broadcast segment at the placement position of the virtual character, this embodiment increases the speed at which the broadcast video is generated.

In one embodiment, the broadcast content further includes pictures and/or videos related to the broadcast text, and the method further comprises:

obtaining, according to the video template, a first display area for the pictures and/or videos related to the broadcast text;

adding the pictures and/or videos related to the broadcast text to the first display area.

This embodiment displays the pictures and/or videos related to the broadcast text in the first display area, which enriches the content of the broadcast video.

In one embodiment, the broadcast text includes a broadcast title, a broadcast body, and a broadcast date, and the method further comprises:

obtaining a second display area for the broadcast text according to the video template, the second display area including a first sub-region for the broadcast title, a second sub-region for the broadcast body, and a third sub-region for the broadcast date;

adding the broadcast title to the first sub-region, the broadcast body to the second sub-region, and the broadcast date to the third sub-region.

In one embodiment, the method further comprises:

obtaining at least one of a broadcast scene, an opening sequence, a closing sequence, and clothing of the virtual character according to the video template.

In a second aspect, the present application provides a video generation device, comprising:

a material and template acquisition module, configured to obtain video content material and a video template, the video content material including an image of a virtual character, dynamic elements of the virtual character, and broadcast content;

a broadcast segment generation module, configured to generate a broadcast segment of the virtual character according to the broadcast content, the image of the virtual character, and the dynamic elements of the virtual character;

a broadcast video generation module, configured to add the broadcast segment of the virtual character to the video template to obtain a broadcast video.

In one embodiment, the broadcast content includes broadcast text, and the dynamic elements of the virtual character include lip movements, expressions, and actions of the virtual character. The broadcast segment generation module comprises:

a broadcast voice type acquisition unit, configured to obtain a broadcast voice type according to the video template;

a broadcast speech generation unit, configured to generate broadcast speech according to the broadcast text and the broadcast voice type;

a lip movement generation unit, configured to input the broadcast speech into a lip movement model and output the lip movements of the virtual character corresponding to the broadcast speech;

an expression and action acquisition unit, configured to obtain the expressions and actions of the virtual character corresponding to the broadcast text;

a broadcast segment generation unit, configured to generate the broadcast segment of the virtual character according to the image of the virtual character, the broadcast speech, and the lip movements, expressions, and actions of the virtual character.

In one embodiment, the broadcast video generation module comprises:

a character placement position acquisition unit, configured to obtain a placement position of the virtual character according to the video template;

a broadcast segment adding unit, configured to add the broadcast segment of the virtual character at the placement position of the virtual character to obtain the broadcast video.

In one embodiment, the broadcast content further includes pictures and/or videos related to the broadcast text, and the device further comprises:

a first display area acquisition module, configured to obtain, according to the video template, a first display area for the pictures and/or videos related to the broadcast text;

a picture and video adding module, configured to add the pictures and/or videos related to the broadcast text to the first display area.

In one embodiment, the broadcast text includes a broadcast title, a broadcast body, and a broadcast date, and the device further comprises:

a second display area acquisition module, configured to obtain a second display area for the broadcast text according to the video template, the second display area including a first sub-region for the broadcast title, a second sub-region for the broadcast body, and a third sub-region for the broadcast date;

a broadcast text adding module, configured to add the broadcast title to the first sub-region, the broadcast body to the second sub-region, and the broadcast date to the third sub-region.

In one embodiment, the device further comprises:

a template element acquisition module, configured to obtain at least one of a broadcast scene, an opening sequence, a closing sequence, and clothing of the virtual character according to the video template.

In a third aspect, the present application provides an electronic device. The functions of the electronic device may be implemented by hardware, or by hardware executing corresponding software. The hardware or software includes one or more modules corresponding to the above functions.

In one possible design, the structure of the electronic device includes a processor and a memory. The memory is configured to store a program that supports the electronic device in executing the above video generation method, and the processor is configured to execute the program stored in the memory. The electronic device may further include a communication interface for communicating with other devices or a communication network.

Other effects of the above optional implementations are described below with reference to specific embodiments.

Brief description of the drawings

The drawings are provided for a better understanding of the solution and do not limit the present application. In the drawings:

Fig. 1 is a flowchart of a method according to the first embodiment of the present application;

Fig. 2 is a schematic diagram of a method according to the first embodiment of the present application;

Fig. 3 is a flowchart of another method according to the first embodiment of the present application;

Fig. 4 is a diagram of a video generation scenario in which the first embodiment of the present application can be implemented;

Fig. 5 is a schematic diagram according to the second embodiment of the present application;

Fig. 6 is another schematic diagram according to the second embodiment of the present application;

Fig. 7 is another schematic diagram according to the second embodiment of the present application;

Fig. 8 is another schematic diagram according to the second embodiment of the present application;

Fig. 9 is another schematic diagram according to the second embodiment of the present application;

Fig. 10 is a block diagram of an electronic device for implementing the video generation method of the embodiments of the present application.

Detailed description

Exemplary embodiments of the present application are described below with reference to the drawings. Various details of the embodiments are included to aid understanding and should be regarded as merely exemplary. Accordingly, those of ordinary skill in the art should recognize that various changes and modifications can be made to the embodiments described herein without departing from the scope and spirit of the present application. Likewise, for clarity and conciseness, descriptions of well-known functions and structures are omitted from the following description.

Embodiment one

In a specific embodiment, a video generation method is provided. As shown in Fig. 1, the method comprises:

Step S10: obtaining video content material and a video template, the video content material including an image of a virtual character, dynamic elements of the virtual character, and broadcast content.

In one example, as shown in Fig. 2, the video content material can be input. The image of the virtual character may be a cartoon image, a real-person image, and so on, for example the host of a news channel or a Donald Duck cartoon host. The dynamic elements of the virtual character may include the lip movements, expressions, and actions of the virtual character while it delivers the broadcast content. Lip movements are, for example, the various ways a real person's lips open and close when speaking. Expressions include, for example, smiling, angry, laughing, and sad expressions. Actions may include limb movements and head movements; for example, a limb movement may be both arms folded in front of the body. The broadcast content may include news content or various program content, such as cooking programs, music introduction programs, sports programs, and children's programs. News content may include opening segue lines, closing segue lines, a news headline, a news body, a news date, and so on. The broadcast content may also include pictures, videos, and the like related to the news content or program content.

The video template may include an opening sequence, a closing sequence, the placement position of the virtual character, various scenes, and the placement positions of the pictures or videos in the broadcast content. Different video templates may include different combinations of these elements. For example, a scene can be selected to suit the image of the virtual character, which may be standing or seated. Clothing can also be selected for the virtual character, for example black, pink, or white clothing. A voice type can also be selected for the virtual character, for example a male voice or a female voice. An opening sequence and a closing sequence can be selected as well, for example a daytime opening or an evening opening, and the closing sequence may be chosen to display advertisements of different brands.
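The template contents listed above can be pictured as a plain data record. The following is a minimal sketch only; the class name `VideoTemplate`, its field names, and all sample values are illustrative assumptions and do not appear in the application.

```python
from dataclasses import dataclass, field


@dataclass
class VideoTemplate:
    """Hypothetical record of the elements a video template may carry."""
    opening: str                       # opening sequence, e.g. a daytime opening
    closing: str                       # closing sequence, e.g. one showing a brand's advertisement
    scene: str                         # broadcast scene
    character_position: tuple          # (x, y) placement of the virtual character
    media_positions: list = field(default_factory=list)  # slots for pictures/videos
    posture: str = "standing"          # "standing" or "seated"
    clothes: str = "black"             # e.g. "black", "pink", "white"
    voice_type: str = "female"         # e.g. "male" or "female"


# Two templates that differ in the elements they include (all values are made up).
day_news = VideoTemplate(
    opening="day_opening.mp4",
    closing="brand_a_closing.mp4",
    scene="studio",
    character_position=(1200, 300),
    media_positions=[(100, 100)],
)
night_news = VideoTemplate(
    opening="night_opening.mp4",
    closing="brand_b_closing.mp4",
    scene="studio",
    character_position=(200, 300),
    posture="seated",
    clothes="white",
    voice_type="male",
)
```

Keeping the optional elements as defaulted fields reflects the statement that different templates may include different elements.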

Step S20: generating a broadcast segment of the virtual character according to the broadcast content, the image of the virtual character, and the dynamic elements of the virtual character.

In one example, as shown in Fig. 2, the broadcast video is generated automatically once the video template and video content material have been obtained. The broadcast segment of the virtual character can deliver a complete piece of news or a program. Specifically, the image of the virtual character is merged with the lip movements, expressions, and actions among its dynamic elements and played together with the broadcast content. For example, while the virtual character broadcasts a news item, the broadcast speech generated from the news content corresponds to the lip movements, expressions, and actions of the virtual character. This enriches the content of the news broadcast and meets viewers' diverse needs for news broadcasts.

Step S30: adding the broadcast segment of the virtual character to the video template to obtain a broadcast video.

In one example, as shown in Fig. 2, because the video template provides the placement position of the virtual character, the various scenes, the placement positions of the pictures or videos in the broadcast content, and so on, the broadcast segment of the virtual character is merged with the video template to obtain the broadcast video. After the broadcast video is generated, it can be downloaded and saved to a local hardware device such as a hard disk, or to a cloud device. It can also be published to media platforms and shared in Internet applications.

The video generation method provided in this embodiment not only improves the efficiency of video generation and reduces cost, but also enriches the video content and meets viewers' diverse needs for videos such as news broadcasts. Because rich video content material is combined with a variety of templates, the technical problem of monotonous video content is overcome, achieving the technical effect of enriching the video content and meeting viewers' diverse needs for videos such as news broadcasts.
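Steps S10 to S30 can be sketched as three small functions chained together. This is a rough illustration under stated assumptions: the names `ContentMaterial`, `VideoTemplate`, `generate_segment`, `add_to_template`, and `generate_broadcast_video` are hypothetical, and the function bodies are stubs that only pass data along rather than render any video.

```python
from dataclasses import dataclass


@dataclass
class ContentMaterial:
    character_image: str       # image of the virtual character
    dynamic_elements: dict     # e.g. lip movements, expressions, actions
    broadcast_content: str     # news or program content to be broadcast


@dataclass
class VideoTemplate:
    opening: str
    closing: str
    character_position: tuple  # where the segment is placed in the frame


def generate_segment(material: ContentMaterial) -> dict:
    """Step S20 (stub): combine content, image, and dynamic elements."""
    return {
        "image": material.character_image,
        "dynamics": material.dynamic_elements,
        "content": material.broadcast_content,
    }


def add_to_template(segment: dict, template: VideoTemplate) -> dict:
    """Step S30 (stub): place the segment into the template."""
    return {
        "timeline": [template.opening, segment, template.closing],
        "segment_position": template.character_position,
    }


def generate_broadcast_video(material: ContentMaterial, template: VideoTemplate) -> dict:
    """Steps S10 to S30 in order; S10 is modeled as receiving both inputs."""
    segment = generate_segment(material)
    return add_to_template(segment, template)


material = ContentMaterial("anchor_a.png", {"expression": "smile"}, "Today's headline")
template = VideoTemplate("opening.mp4", "closing.mp4", (1200, 300))
video = generate_broadcast_video(material, template)
```

Because the template is an independent input, the same material could be paired with a different template without touching the segment generation step.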

In one embodiment, the broadcast content includes broadcast text, and the dynamic elements of the virtual character include lip movements, expressions, and actions of the virtual character. As shown in Fig. 3, step S20 comprises:

S201: obtaining a broadcast voice type according to the video template;

S202: generating broadcast speech according to the broadcast text and the broadcast voice type;

S203: inputting the broadcast speech into a lip movement model, which outputs the lip movements of the virtual character corresponding to the broadcast speech;

S204: obtaining the expressions and actions of the virtual character corresponding to the broadcast text;

S205: generating the broadcast segment of the virtual character according to the image of the virtual character, the broadcast speech, and the lip movements, expressions, and actions of the virtual character.

In one example, the video template includes broadcast voice types. Broadcast voice types may include an adult male voice, an adult female voice, a child's voice, and so on. They may also include types such as a deep voice, a loud and sonorous voice, a gentle voice, and a cute voice. More broadcast voice types can of course be added to the video template as needed. The broadcast text may be read aloud in the order of opening segue lines, news headline, news body, and closing segue lines. Meanwhile, the broadcast voice type corresponding to the broadcast text is selected from the broadcast voice types in the video template, and the broadcast text and the broadcast voice type are synthesized into the broadcast speech.
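The reading order and voice type selection described above might be prepared as follows before any speech synthesis takes place. This is a sketch under assumptions: the key names, the function names, and the rule of falling back to the template's first voice type are illustrative choices, not something the application specifies.

```python
# Order in which the parts of the broadcast text are read aloud.
READING_ORDER = ["opening_segue", "headline", "body", "closing_segue"]


def order_broadcast_text(parts: dict) -> list:
    """Return the available text parts in reading order, skipping missing ones."""
    return [parts[name] for name in READING_ORDER if name in parts]


def select_voice_type(template_voice_types: list, preferred: str) -> str:
    """Pick the preferred voice type if the template offers it.

    Falling back to the template's first voice type is an assumption
    made for this sketch.
    """
    if not template_voice_types:
        raise ValueError("the template defines no voice types")
    if preferred in template_voice_types:
        return preferred
    return template_voice_types[0]


def build_speech_request(parts: dict, template_voice_types: list, preferred: str) -> dict:
    """Bundle ordered text and voice type; a real system would pass this to a TTS engine."""
    return {
        "text": order_broadcast_text(parts),
        "voice_type": select_voice_type(template_voice_types, preferred),
    }


request = build_speech_request(
    {"body": "Body text.", "headline": "Headline.", "opening_segue": "Good evening."},
    ["adult_male", "adult_female", "child"],
    "gentle",
)
```

In this usage the template does not offer a "gentle" voice, so the request falls back to the first listed type, and the missing closing segue is simply skipped.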

In addition, a speech synthesis model can be trained on the voice of a specific speaker; the broadcast text is input into this speech synthesis model to obtain broadcast speech in that speaker's voice. The recordings of a specific speaker may be called a voice library, and the model trained on the voice library is a voice library model. The lip movement model is trained on material of various lip movements together with speech material, so that the lip movements it produces resemble those of a real person speaking. Inputting the broadcast speech into the lip movement model outputs the lip movements corresponding to the broadcast speech. The expressions and actions of the virtual character are adjusted according to the broadcast text so that they correspond to it. For example, when the broadcast text contains text expressing happiness, the matching expression is a smile and the action may be opening both arms. In addition, the video template may offer many clothing options for the virtual character. Finally, the broadcast segment of the virtual character is generated from the image of the virtual character, the broadcast speech, and the lip movements, expressions, and actions of the virtual character.

In this embodiment, the image, lip movements, expressions, and actions of the virtual character correspond to the broadcast speech, so that in the generated broadcast segment the mood shown by the virtual character's expressions and actions, together with the broadcast speech, better matches the logic of the broadcast text. This also makes the broadcast video more refined.
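The flow of steps S201 to S205 can be sketched end to end with stand-ins for the trained models. Everything here is an assumption made for illustration: `synthesize_speech` and `lip_model` are toy stubs rather than real speech synthesis or lip movement models, and the keyword table `MOOD_TABLE` (including the "sad" entry) is invented; only the pairing of happy text with a smile and open arms comes from the description.

```python
from dataclasses import dataclass

# Hypothetical keyword table mapping words in the text to (expression, action).
MOOD_TABLE = {
    "happy": ("smile", "open_arms"),
    "sad": ("sad", "lower_head"),
}
NEUTRAL = ("neutral", "stand_still")


@dataclass
class Speech:
    words: list        # words to be spoken, in order
    voice_type: str


def synthesize_speech(text: str, voice_type: str) -> Speech:
    """S202 stub: a real system would call a speech synthesis model."""
    return Speech(words=text.split(), voice_type=voice_type)


def lip_model(speech: Speech) -> list:
    """S203 stub: one toy mouth shape per word, not a trained model."""
    return ["open" if w[0].lower() in "aeiou" else "closed" for w in speech.words]


def select_expression_and_action(text: str) -> tuple:
    """S204: look up the first mood keyword found in the text."""
    lowered = text.lower()
    for keyword, pair in MOOD_TABLE.items():
        if keyword in lowered:
            return pair
    return NEUTRAL


def generate_segment(image: str, text: str, template: dict) -> dict:
    voice_type = template["voice_type"]                      # S201
    speech = synthesize_speech(text, voice_type)             # S202
    lips = lip_model(speech)                                 # S203
    expression, action = select_expression_and_action(text)  # S204
    return {                                                 # S205
        "image": image, "speech": speech, "lips": lips,
        "expression": expression, "action": action,
    }


segment = generate_segment(
    "anchor.png", "A happy ending arrived", {"voice_type": "adult_female"}
)
```

The point of the sketch is the data flow: lip movements are derived from the speech, while expressions and actions are derived from the text, and all of them are bundled with the character image in the last step.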

In one embodiment, as shown in Fig. 3, step S30 comprises:

Step S301: obtaining a placement position of the virtual character according to the video template;

Step S302: adding the broadcast segment of the virtual character at the placement position of the virtual character to obtain the broadcast video.

In one example, as shown in Fig. 4, the video template includes the placement position of at least one virtual character. In the video template, the placement positions and number of virtual characters can be adjusted as needed, and such adjustments fall within the protection scope of this embodiment. The broadcast segment of the virtual character obtained in the previous embodiment is added at the placement position of the virtual character in the video template to obtain the broadcast video. When the broadcast video is played, the opening sequence in the video template plays first, followed by the broadcast segment of the virtual character and finally the closing sequence.
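Steps S301 and S302, together with the playback order described above, could look like the following. The dictionary layout and the function names `place_segment` and `build_timeline` are hypothetical; the sketch only records positions and ordering and does no actual video compositing.

```python
def place_segment(template: dict, segment: dict, slot: int = 0) -> dict:
    """S301/S302: attach a segment to one of the template's character positions."""
    positions = template["character_positions"]
    if not 0 <= slot < len(positions):
        raise IndexError("template has no character position %d" % slot)
    return {"clip": segment["clip"], "position": positions[slot]}


def build_timeline(template: dict, placed: dict) -> list:
    """Playback order: opening sequence, broadcast segment, closing sequence."""
    return [
        {"clip": template["opening"], "position": (0, 0)},
        placed,
        {"clip": template["closing"], "position": (0, 0)},
    ]


template = {
    "opening": "opening.mp4",
    "closing": "closing.mp4",
    "character_positions": [(1200, 300), (200, 300)],  # at least one position
}
placed = place_segment(template, {"clip": "segment.mp4"}, slot=1)
timeline = build_timeline(template, placed)
```

Holding the positions in a list mirrors the remark that the number and placement of virtual characters in a template can be adjusted as needed.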

In one embodiment, as shown in Fig. 5, the broadcast content further includes pictures and/or videos related to the broadcast text, and the method further comprises:

Step S40: obtaining, according to the video template, a first display area for the pictures and/or videos related to the broadcast text;

Step S50: adding the pictures and/or videos related to the broadcast text to the first display area.

In one example, as shown in Fig. 4, the video template includes at least one first display area for displaying the pictures and/or videos related to the broadcast text. For example, if the broadcast text is a news item about the Tangshan earthquake, the first display area can play pictures taken after the earthquake, video captured by cameras during the earthquake, and so on. In the video template, the position and number of first display areas can be adjusted as needed, and such adjustments fall within the protection scope of this embodiment. The first display area makes the content of the broadcast video richer and reflects the content of the broadcast text more clearly and concretely.
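One practical detail of steps S40 and S50 is fitting a picture or video into a display area of a different shape. The sketch below scales each item to fit while keeping its aspect ratio and centers it in the area; this fitting rule, the function names, and the file names are assumptions for illustration, since the application does not say how media is sized.

```python
def fit_into_area(media_size: tuple, area: tuple) -> tuple:
    """Scale (width, height) to fit area (x, y, width, height), keeping aspect ratio.

    Returns (x, y, width, height) of the scaled media, centered in the area.
    """
    media_w, media_h = media_size
    area_x, area_y, area_w, area_h = area
    scale = min(area_w / media_w, area_h / media_h)
    new_w = round(media_w * scale)
    new_h = round(media_h * scale)
    x = area_x + (area_w - new_w) // 2
    y = area_y + (area_h - new_h) // 2
    return (x, y, new_w, new_h)


def assign_media(media: list, areas: list) -> list:
    """Pair each media item with a first display area; extra items are left out."""
    return [
        {"name": name, "rect": fit_into_area(size, area)}
        for (name, size), area in zip(media, areas)
    ]


layout = assign_media(
    [("quake_photo.jpg", (1600, 1200)), ("quake_video.mp4", (1920, 1080))],
    [(100, 100, 800, 450), (1000, 100, 800, 450)],
)
```

Here the 4:3 photo is narrowed to 600 by 450 and centered horizontally, while the 16:9 video fills its area exactly.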

In one embodiment, as shown in Fig. 5, the broadcast text includes a broadcast title, a broadcast body, and a broadcast date, and the method further comprises:

Step S60: obtaining a second display area for the broadcast text according to the video template, the second display area including a first sub-region for the broadcast title, a second sub-region for the broadcast body, and a third sub-region for the broadcast date;

Step S70: adding the broadcast title to the first sub-region, the broadcast body to the second sub-region, and the broadcast date to the third sub-region.

In one example, as shown in Fig. 4, when content is broadcast, the broadcast text is usually displayed as subtitles, which helps viewers understand the content of the broadcast text and enriches the content of the broadcast video. For example, the first sub-region for displaying the broadcast title and the third sub-region for displaying the broadcast date can be shown at positions such as the lower right or the left side of the screen, so that viewers can quickly find the current topic and date. The broadcast body can be shown at the bottom of the screen as subtitles that play in sync with the broadcast speech.
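Steps S60 and S70 and the subtitle synchronization can be sketched as below. The region names and sample text are made up, and splitting the speech duration across sentences in proportion to their length is an assumed timing rule; the application only states that the subtitles play in sync with the broadcast speech.

```python
def layout_text(text: dict, regions: dict) -> list:
    """Map title, body, and date to the first, second, and third sub-regions."""
    mapping = [("title", "first"), ("body", "second"), ("date", "third")]
    return [
        {"region": regions[region], "content": text[part]}
        for part, region in mapping
    ]


def subtitle_cues(sentences: list, speech_seconds: float) -> list:
    """Split the speech duration across sentences in proportion to their length."""
    total_chars = sum(len(s) for s in sentences)
    if total_chars == 0:
        return []
    cues, start = [], 0.0
    for sentence in sentences:
        duration = speech_seconds * len(sentence) / total_chars
        cues.append((round(start, 2), round(start + duration, 2), sentence))
        start += duration
    return cues


regions = {"first": "lower_right", "second": "bottom", "third": "lower_left"}
text = {"title": "Flood warning", "body": "Rain continues.", "date": "2019-07-31"}
placed = layout_text(text, regions)
cues = subtitle_cues(["abcd", "abcdefgh", "abcd"], 8.0)
```

A production system would more likely take per-sentence timestamps from the speech synthesis step; the proportional split is only a stand-in when such timestamps are unavailable.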

In one embodiment, as shown in Fig. 5, the method further comprises:

Step S80: obtaining at least one of a broadcast scene, an opening sequence, a closing sequence, and clothing of the virtual character according to the video template.

In one example, the broadcast scene in the video template may be an outdoor scene; for example, the scene for a travel program may be in front of the gate of the Forbidden City. The broadcast scene may also be an indoor scene; for example, the scene for a home decoration program may be the interior of a home decorated in Nordic style. The opening and closing sequences correspond to the broadcast text so that they echo the broadcast content. The video template also includes a large selection of clothing for the virtual character, so that suitable clothing can be selected for different broadcast themes.
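Step S80 amounts to looking up template elements by broadcast theme. A minimal sketch follows; the catalogue contents, the default values, and the function name `select_elements` are all hypothetical, with only the travel and home decoration scenes loosely based on the examples above.

```python
# Hypothetical catalogue inside a video template: theme -> template elements.
THEME_ELEMENTS = {
    "travel": {"scene": "forbidden_city_gate", "clothes": "casual"},
    "home_decoration": {"scene": "nordic_interior", "clothes": "smart_casual"},
}
DEFAULT_ELEMENTS = {
    "scene": "studio",
    "clothes": "formal",
    "opening": "generic_opening",
    "closing": "generic_closing",
}


def select_elements(theme: str, wanted: list) -> dict:
    """Return the requested elements for a theme, falling back to defaults.

    Each name in `wanted` must be one of the keys of DEFAULT_ELEMENTS.
    """
    if not wanted:
        raise ValueError("request at least one template element")
    source = THEME_ELEMENTS.get(theme, {})
    return {name: source.get(name, DEFAULT_ELEMENTS[name]) for name in wanted}


travel = select_elements("travel", ["scene", "opening"])
cooking = select_elements("cooking", ["clothes"])
```

Requiring a non-empty `wanted` list mirrors the "at least one of" wording of the step.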

Embodiment two

In another specific embodiment, as shown in Fig. 6, a video generation device 10 is provided, comprising:

a material and template acquisition module 101, configured to obtain video content material and a video template, the video content material including an image of a virtual character, dynamic elements of the virtual character, and broadcast content;

a broadcast segment generation module 102, configured to generate a broadcast segment of the virtual character according to the broadcast content, the image of the virtual character, and the dynamic elements of the virtual character;

a broadcast video generation module 103, configured to add the broadcast segment of the virtual character to the video template to obtain a broadcast video.

In one embodiment, the broadcast content includes broadcast text, and the dynamic elements of the virtual character include lip movements, expressions, and actions of the virtual character. As shown in Fig. 7, the broadcast segment generation module 102 comprises:

a broadcast voice type acquisition unit 1021, configured to obtain a broadcast voice type according to the video template;

a broadcast speech generation unit 1022, configured to generate broadcast speech according to the broadcast text and the broadcast voice type;

a lip movement generation unit 1023, configured to input the broadcast speech into a lip movement model and output the lip movements of the virtual character corresponding to the broadcast speech;

an expression and action acquisition unit 1024, configured to obtain the expressions and actions of the virtual character corresponding to the broadcast text;

a broadcast segment generation unit 1025, configured to generate the broadcast segment of the virtual character according to the image of the virtual character, the broadcast speech, and the lip movements, expressions, and actions of the virtual character.

In one embodiment, as shown in Fig. 8, the broadcast video generation module 103 comprises:

a character placement position acquisition unit 1031, configured to obtain a placement position of the virtual character according to the video template;

a broadcast segment adding unit 1032, configured to add the broadcast segment of the virtual character at the placement position of the virtual character to obtain the broadcast video.

In one embodiment, the broadcast content further includes pictures and/or videos related to the broadcast text. As shown in Fig. 9, the device further comprises:

a first display area acquisition module 104, configured to obtain, according to the video template, a first display area for the pictures and/or videos related to the broadcast text;

a picture and video adding module 105, configured to add the pictures and/or videos related to the broadcast text to the first display area.

In one embodiment, the broadcast text includes a broadcast title, a broadcast body, and a broadcast date. As shown in Fig. 9, the device further comprises:

a second display area acquisition module 106, configured to obtain a second display area for the broadcast text according to the video template, the second display area including a first sub-region for the broadcast title, a second sub-region for the broadcast body, and a third sub-region for the broadcast date;

a broadcast text adding module 107, configured to add the broadcast title to the first sub-region, the broadcast body to the second sub-region, and the broadcast date to the third sub-region.

In one embodiment, as shown in Fig. 9, the device further comprises:

a template element acquisition module 108, configured to obtain at least one of a broadcast scene, an opening sequence, a closing sequence, and clothing of the virtual character according to the video template.
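The core of the device, modules 101 to 103, can be pictured as three cooperating objects. This is a loose sketch: the class names, the `run` methods, and the dictionary shapes are assumptions, the bodies are stubs, and the optional modules 104 to 108 are omitted.

```python
class MaterialAndTemplateModule:
    """Module 101: obtains the video content material and the video template."""

    def run(self, source: dict) -> tuple:
        return source["material"], source["template"]


class SegmentGenerationModule:
    """Module 102: generates the broadcast segment of the virtual character."""

    def run(self, material: dict) -> dict:
        return {
            "segment_of": material["character_image"],
            "content": material["content"],
        }


class VideoGenerationModule:
    """Module 103: adds the segment to the template to obtain the broadcast video."""

    def run(self, segment: dict, template: dict) -> dict:
        return {"template": template["name"], "segment": segment}


class VideoGenerationDevice:
    """Device 10: wires modules 101 to 103 together."""

    def __init__(self):
        self.acquire = MaterialAndTemplateModule()
        self.generate_segment = SegmentGenerationModule()
        self.generate_video = VideoGenerationModule()

    def run(self, source: dict) -> dict:
        material, template = self.acquire.run(source)
        segment = self.generate_segment.run(material)
        return self.generate_video.run(segment, template)


device = VideoGenerationDevice()
video = device.run({
    "material": {"character_image": "anchor.png", "content": "News text"},
    "template": {"name": "evening_news"},
})
```

Each module mirrors one method step, so the device structure follows the method of embodiment one.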

Embodiment three

According to embodiments of the present application, the present application further provides an electronic device and a readable storage medium.

Fig. 10 is a block diagram of an electronic device for the video generation method according to an embodiment of the present application. The electronic device is intended to represent various forms of digital computers, such as laptop computers, desktop computers, workstations, personal digital assistants, servers, blade servers, mainframe computers, and other suitable computers. The electronic device may also represent various forms of mobile devices, such as personal digital assistants, cellular phones, smartphones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions are merely examples and are not intended to limit the implementations of the present application described and/or claimed herein.

As shown in Fig. 10, the electronic device includes one or more processors 1001, a memory 1002, and interfaces for connecting the components, including high-speed interfaces and low-speed interfaces. The components are interconnected by different buses and may be mounted on a common motherboard or in other ways as needed. The processor can process instructions executed within the electronic device, including instructions stored in or on the memory to display graphical information of a graphical user interface (GUI) on an external input/output device (such as a display device coupled to an interface). In other embodiments, multiple processors and/or multiple buses may be used together with multiple memories, if desired. Likewise, multiple electronic devices may be connected, with each device providing some of the necessary operations (for example, as a server array, a group of blade servers, or a multiprocessor system). Fig. 10 takes one processor 1001 as an example.

The memory 1002 is the non-transitory computer-readable storage medium provided by the present application. The memory stores instructions executable by at least one processor, so that the at least one processor executes the video generation method provided by the present application. The non-transitory computer-readable storage medium of the present application stores computer instructions that cause a computer to execute the video generation method provided by the present application.

As a non-transitory computer-readable storage medium, the memory 1002 can store non-transitory software programs, non-transitory computer-executable programs, and modules, such as the program instructions/modules corresponding to the video generation method in the embodiments of the present application (for example, the material and template acquisition module 101, the broadcast segment generation module 102, and the broadcast video generation module 103 shown in Fig. 6). By running the non-transitory software programs, instructions, and modules stored in the memory 1002, the processor 1001 executes the various functional applications and data processing of the server, that is, implements the video generation method in the above method embodiments.

The memory 1002 may include a program storage area and a data storage area. The program storage area can store an operating system and an application program required by at least one function; the data storage area can store data created through use of the electronic device for video generation, and so on. In addition, the memory 1002 may include high-speed random access memory and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid-state storage device. In some embodiments, the memory 1002 optionally includes memories located remotely from the processor 1001, and these remote memories can be connected to the electronic device for video generation through a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.

The electronic device for the video generation method may further include an input device 1003 and an output device 1004. The processor 1001, the memory 1002, the input device 1003, and the output device 1004 may be connected by a bus or in other ways; Fig. 10 takes connection by a bus as an example.

The input device 1003 can receive input numeric or character information and generate key signal inputs related to user settings and function control of the electronic device for video generation. Examples of input devices include a touch screen, keypad, mouse, trackpad, touchpad, pointing stick, one or more mouse buttons, trackball, and joystick. The output device 1004 may include a display device, auxiliary lighting devices (for example, LEDs), haptic feedback devices (for example, vibration motors), and so on. The display device may include, but is not limited to, a liquid crystal display (LCD), a light-emitting diode (LED) display, and a plasma display. In some embodiments, the display device may be a touch screen.

Various implementations of the systems and techniques described herein can be realized in digital electronic circuitry, integrated circuit systems, application-specific integrated circuits (ASICs), computer hardware, firmware, software, and/or combinations thereof. These implementations may include implementation in one or more computer programs that are executable and/or interpretable on a programmable system including at least one programmable processor. The programmable processor may be a special-purpose or general-purpose programmable processor that can receive data and instructions from a storage system, at least one input device, and at least one output device, and transmit data and instructions to the storage system, the at least one input device, and the at least one output device.

These computer programs (also referred to as programs, software, software applications, or code) include machine instructions for a programmable processor, and can be implemented using high-level procedural and/or object-oriented programming languages, and/or assembly/machine language. As used herein, the terms "machine-readable medium" and "computer-readable medium" refer to any computer program product, apparatus, and/or device (for example, a magnetic disk, an optical disc, a memory, or a programmable logic device (PLD)) used to provide machine instructions and/or data to a programmable processor, including a machine-readable medium that receives machine instructions as a machine-readable signal. The term "machine-readable signal" refers to any signal used to provide machine instructions and/or data to a programmable processor.

To provide interaction with a user, the systems and techniques described herein can be implemented on a computer that has a display device for displaying information to the user (for example, a cathode ray tube (CRT) or liquid crystal display (LCD) monitor), and a keyboard and a pointing device (for example, a mouse or a trackball) through which the user can provide input to the computer. Other kinds of devices can also be used to provide interaction with the user; for example, the feedback provided to the user may be any form of sensory feedback (for example, visual feedback, auditory feedback, or tactile feedback), and input from the user may be received in any form (including acoustic input, speech input, or tactile input).

The systems and techniques described herein can be implemented in a computing system that includes a back-end component (for example, a data server), or a computing system that includes a middleware component (for example, an application server), or a computing system that includes a front-end component (for example, a user computer with a graphical user interface or a web browser through which a user can interact with implementations of the systems and techniques described herein), or a computing system that includes any combination of such back-end, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (for example, a communication network). Examples of communication networks include a local area network (LAN), a wide area network (WAN), and the Internet.

A computer system may include a client and a server. The client and the server are generally remote from each other and typically interact through a communication network. The client-server relationship arises from computer programs that run on the respective computers and have a client-server relationship with each other.

According to the technical solutions of the embodiments of the present application, the efficiency of video generation is improved and cost is reduced, and the video content is enriched, meeting viewers' diverse demands for videos such as news broadcasts. Because rich video content material is used and matched with a variety of templates, the technical problem of monotonous video content is overcome, achieving the technical effect of enriching the video content and meeting viewers' diverse demands for videos such as news broadcasts.

It should be understood that steps may be reordered, added, or deleted using the various forms of processes shown above. For example, the steps described in the present application may be performed in parallel, sequentially, or in a different order, as long as the desired result of the technical solutions disclosed in the present application can be achieved; no limitation is imposed herein.
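As a non-limiting illustration of steps being performed in parallel or sequentially with the same result, the following minimal sketch runs two mutually independent steps both ways. The functions `synthesize_voice` and `lookup_expression` are hypothetical stand-ins invented for this sketch; the application does not specify how these steps are implemented.

```python
from concurrent.futures import ThreadPoolExecutor


def synthesize_voice(text):
    # Hypothetical stand-in for generating the broadcast voice from text.
    return f"voice<{text}>"


def lookup_expression(text):
    # Hypothetical stand-in for obtaining an expression matching the text.
    return "smile" if text.endswith("!") else "neutral"


def run_sequential(text):
    # The two steps executed one after the other.
    return synthesize_voice(text), lookup_expression(text)


def run_parallel(text):
    # The same two steps submitted to a thread pool; neither step
    # depends on the other's output, so the order does not matter.
    with ThreadPoolExecutor(max_workers=2) as pool:
        voice_future = pool.submit(synthesize_voice, text)
        expression_future = pool.submit(lookup_expression, text)
        return voice_future.result(), expression_future.result()
```

Only steps without data dependencies can be reordered this way; a step that consumes another step's output (for example, lip movement derived from the voice) must still run after it.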

The above specific embodiments do not limit the scope of protection of the present application. Those skilled in the art should understand that various modifications, combinations, sub-combinations, and substitutions can be made according to design requirements and other factors. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present application shall fall within the scope of protection of the present application.

Claims

1. A video generation method, characterized by comprising:

obtaining video content material and a video template, the video content material including an image of a virtual character, dynamic elements of the virtual character, and broadcast content;

generating a broadcast segment of the virtual character according to the broadcast content, the image of the virtual character, and the dynamic elements of the virtual character; and

adding the broadcast segment of the virtual character to the video template to obtain a broadcast video.
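For illustration only, the three steps of claim 1 can be sketched as a small pipeline. The class names `ContentMaterial` and `VideoTemplate`, the string-based segment, and the notion of template "layers" are assumptions made for this sketch and do not appear in the application.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ContentMaterial:
    character_image: str         # hypothetical identifier of the character's image
    dynamic_elements: List[str]  # e.g. lip movement, expression, action
    broadcast_content: str       # the content to be broadcast


@dataclass
class VideoTemplate:
    name: str
    layers: List[str] = field(default_factory=list)


def generate_segment(material):
    # Step 2: combine content, image, and dynamic elements into a segment.
    # A real system would render frames; a string stands in for the segment.
    elements = "+".join(material.dynamic_elements)
    return f"segment({material.character_image}|{elements}|{material.broadcast_content})"


def generate_broadcast_video(material, template):
    # Step 1 (obtaining material and template) is modeled by the arguments.
    segment = generate_segment(material)
    # Step 3: add the segment to the template without mutating the original.
    return VideoTemplate(template.name, template.layers + [segment])
```

Returning a new `VideoTemplate` rather than modifying the input is a design choice of the sketch; it lets one template be reused for many broadcasts.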

2. The method according to claim 1, characterized in that the broadcast content includes broadcast text, the dynamic elements of the virtual character include lip movements, expressions, and actions of the virtual character, and generating the broadcast segment of the virtual character according to the broadcast content, the image of the virtual character, and the dynamic elements of the virtual character comprises:

obtaining a broadcast pronunciation type according to the video template;

generating a broadcast voice according to the broadcast text and the broadcast pronunciation type;

inputting the broadcast voice into a lip movement model, and outputting the lip movements of the virtual character corresponding to the broadcast voice;

obtaining the expressions and actions of the virtual character corresponding to the broadcast text; and

generating the broadcast segment of the virtual character according to the image of the virtual character, the broadcast voice, and the lip movements, expressions, and actions of the virtual character.
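The order of the steps in claim 2 can be sketched as follows, purely as an illustration. The speech synthesizer, lip movement model, and expression lookup are passed in as callables; the `fake_*` functions, the `"voice_type"` key, and the dictionary-shaped segment are hypothetical placeholders, not the models or data structures of the application.

```python
def generate_segment_for_text(text, template, image, tts, lip_model, expression_lookup):
    voice_type = template["voice_type"]           # pronunciation type from the template
    voice = tts(text, voice_type)                 # broadcast voice from text and type
    lip = lip_model(voice)                        # lip movements driven by the voice
    expression, action = expression_lookup(text)  # expression and action from the text
    # Assemble the segment from the image, voice, lip movements, expression, action.
    return {
        "image": image,
        "voice": voice,
        "lip": lip,
        "expression": expression,
        "action": action,
    }


def fake_tts(text, voice_type):
    # Placeholder synthesizer: tags the text with the pronunciation type.
    return f"{voice_type}:{text}"


def fake_lip_model(voice):
    # Placeholder model: one lip frame per character of the voice string.
    return ["frame"] * len(voice)


def fake_expression_lookup(text):
    # Placeholder rule: exclamations map to a smile and a nod.
    return ("smile", "nod") if "!" in text else ("neutral", "still")
```

Note that the lip movements depend on the generated voice, while the expression and action depend only on the text, which mirrors the data flow stated in the claim.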

3. The method according to claim 1, characterized in that adding the broadcast segment of the virtual character to the video template to obtain the broadcast video comprises:

obtaining a distribution position of the virtual character according to the video template; and

adding the broadcast segment of the virtual character at the distribution position of the virtual character to obtain the broadcast video.
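A minimal sketch of claim 3 follows, assuming the distribution position is a rectangle stored in the template. The `Region` class, the `"character_region"`, `"canvas_size"`, and `"placements"` keys, and the bounds check are all assumptions of this sketch; the application does not state how the position is represented.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    x: int
    y: int
    width: int
    height: int


def fits(region, canvas_width, canvas_height):
    # True when the region lies entirely inside the canvas.
    return (
        region.x >= 0
        and region.y >= 0
        and region.x + region.width <= canvas_width
        and region.y + region.height <= canvas_height
    )


def add_segment(template, segment):
    # Obtain the character's distribution position from the template.
    region = template["character_region"]
    canvas_width, canvas_height = template["canvas_size"]
    if not fits(region, canvas_width, canvas_height):
        raise ValueError("character region exceeds the canvas")
    # Add the segment at that position, leaving the input template unchanged.
    placements = dict(template.get("placements", {}))
    placements["character"] = (segment, region)
    return {**template, "placements": placements}
```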

4. The method according to claim 2, characterized in that the broadcast content further includes a picture and/or a video related to the broadcast text, and the method further comprises:

obtaining, according to the video template, a first display area for the picture and/or video related to the broadcast text; and

adding the picture and/or video related to the broadcast text to the first display area.

5. The method according to claim 2, characterized in that the broadcast text includes a broadcast title, a broadcast body, and a broadcast date, and the method further comprises:

obtaining a second display area for the broadcast text according to the video template, the second display area including a first sub-area for the broadcast title, a second sub-area for the broadcast body, and a third sub-area for the broadcast date; and

adding the broadcast title to the first sub-area, the broadcast body to the second sub-area, and the broadcast date to the third sub-area.
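The mapping in claim 5 from text parts to sub-areas can be sketched as a simple lookup. The dictionary layout, the keys `"title"`, `"body"`, and `"date"`, and the sub-area names are hypothetical choices made for this illustration.

```python
REQUIRED_PARTS = ("title", "body", "date")


def fill_text_area(second_display_area, broadcast_text):
    # second_display_area maps each text part to the name of its sub-area;
    # broadcast_text maps each text part to the string to display.
    missing = [
        part for part in REQUIRED_PARTS
        if part not in second_display_area or part not in broadcast_text
    ]
    if missing:
        raise KeyError(f"missing text parts or sub-areas: {missing}")
    # Place the title, body, and date into their respective sub-areas.
    return {second_display_area[part]: broadcast_text[part] for part in REQUIRED_PARTS}
```

For example, an area of `{"title": "sub_area_1", "body": "sub_area_2", "date": "sub_area_3"}` routes the title to the first sub-area, the body to the second, and the date to the third.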

6. The method according to claim 1, characterized by further comprising:

obtaining, according to the video template, at least one of a broadcast scene, an opening sequence, a closing sequence, and clothing of the virtual character.

7. A video generation device, characterized by comprising:

a material and template acquisition module, configured to obtain video content material and a video template, the video content material including an image of a virtual character, dynamic elements of the virtual character, and broadcast content;

a broadcast segment generation module, configured to generate a broadcast segment of the virtual character according to the broadcast content, the image of the virtual character, and the dynamic elements of the virtual character; and

a broadcast video generation module, configured to add the broadcast segment of the virtual character to the video template to obtain a broadcast video.

8. The device according to claim 7, characterized in that the broadcast content includes broadcast text, the dynamic elements of the virtual character include lip movements, expressions, and actions of the virtual character, and the broadcast segment generation module comprises:

a broadcast pronunciation type acquisition unit, configured to obtain a broadcast pronunciation type according to the video template;

a broadcast voice generation unit, configured to generate a broadcast voice according to the broadcast text and the broadcast pronunciation type;

a lip movement generation unit, configured to input the broadcast voice into a lip movement model and output the lip movements of the virtual character corresponding to the broadcast voice;

an expression and action acquisition unit, configured to obtain the expressions and actions of the virtual character corresponding to the broadcast text; and

a broadcast segment generation unit, configured to generate the broadcast segment of the virtual character according to the image of the virtual character, the broadcast voice, and the lip movements, expressions, and actions of the virtual character.

9. The device according to claim 7, characterized in that the broadcast video generation module comprises:

a character distribution position acquisition unit, configured to obtain a distribution position of the virtual character according to the video template; and

a broadcast segment adding unit, configured to add the broadcast segment of the virtual character at the distribution position of the virtual character to obtain the broadcast video.

10. The device according to claim 9, characterized in that the broadcast content further includes a picture and/or a video related to the broadcast text, and the device further comprises:

a first display area acquisition module, configured to obtain, according to the video template, a first display area for the picture and/or video related to the broadcast text; and

a picture and video adding module, configured to add the picture and/or video related to the broadcast text to the first display area.

11. The device according to claim 9, characterized in that the broadcast text includes a broadcast title, a broadcast body, and a broadcast date, and the device further comprises:

a second display area acquisition module, configured to obtain a second display area for the broadcast text according to the video template, the second display area including a first sub-area for the broadcast title, a second sub-area for the broadcast body, and a third sub-area for the broadcast date; and

a broadcast text adding module, configured to add the broadcast title to the first sub-area, the broadcast body to the second sub-area, and the broadcast date to the third sub-area.

12. The device according to claim 7, characterized by further comprising:

a template element acquisition module, configured to obtain, according to the video template, at least one of a broadcast scene, an opening sequence, a closing sequence, and clothing of the virtual character.

13. An electronic device, characterized by comprising:

at least one processor; and

a memory communicatively connected to the at least one processor; wherein

the memory stores instructions executable by the at least one processor, and the instructions are executed by the at least one processor to enable the at least one processor to perform the method of any one of claims 1-6.

14. A non-transitory computer-readable storage medium storing computer instructions, characterized in that the computer instructions are used to cause a computer to perform the method of any one of claims 1-6.