CN107241646A

CN107241646A - The edit methods and device of multimedia video

Info

Publication number: CN107241646A
Application number: CN201710566432.2A
Authority: CN
Inventors: 邵可
Original assignee: Beijing Qihoo Technology Co Ltd
Current assignee: Beijing Qihoo Technology Co Ltd
Priority date: 2017-07-12
Filing date: 2017-07-12
Publication date: 2017-10-10
Anticipated expiration: 2037-07-12
Also published as: CN107241646B

Abstract

The invention discloses a kind of edit methods of multimedia video and device, it is related to multimedia technology field, main purpose is the problem of short-sighted frequency intercepted in existing live or small video can not be edited.Main technical schemes include：Obtain multimedia file；Decode the video data and voice data in the multimedia file；The video data is carried out to render processing, and track processing is carried out to the voice data；Video data after processing and the voice data after processing are encoded, multimedia video is obtained.It is mainly used in the editor of multimedia video.

Description

The edit methods and device of multimedia video

Technical field

The present invention relates to multimedia technology field, the edit methods and device of more particularly to a kind of multimedia video.

Background technology

With the fast development of Internet technology, people have been no longer satisfied with simple use mobile phone communication to be handed over Stream and communication, wherein, the social platform that online live, small video etc. is set up using multimedia technology is entered between having become user The Main Means that row is linked up.

At present, user, can be by intercepting one in video when using terminal equipment carries out live or recording small video Segment is preserved, for example, certain live platform just live little girl dance, in order to record little girl rotation video, it is necessary to Intercept the short-sighted frequency that little girl rotates in live video.It is right in order to strengthen the result of broadcast to video content after interception video Multimedia video enters edlin and has become urgent problem to be solved.

The content of the invention

In view of this, the present invention provides a kind of edit methods and device of multimedia video, and main purpose is existing straight Broadcast or small video in the short-sighted frequency that intercepts the problem of can not edit.

According to one aspect of the invention there is provided a kind of edit methods of multimedia video, including：

Obtain multimedia file；

Decode the video data and voice data in the multimedia file；

The video data is carried out to render processing, and track processing is carried out to the voice data；

Video data after processing and the voice data after processing are encoded, multimedia video is obtained.

Further, it is described that the video data is carried out to render processing, and the voice data is carried out at track Reason includes：

Receive in the process instruction of user's input, the process instruction and carry effective mark；

Video effect mark in being identified according to the effect renders the video data, and according in effect mark The audio frequency effect mark processing voice data.

Further, the video effect mark in the mark according to the effect, which renders the video data, includes：

The view data of each frame in the video data is extracted, and filter processing is carried out to described image data；

Target image after identification filter processing in view data is identified according to the video effect, and to the target figure Rendered as carrying out synthesis.

Further, the target image identified according to the video effect after identification filter processing in view data, And to the target image carry out synthesis render including：

If identifying, the video effect is designated synthetic stereo image, splits the target image, according to preset Color rule is to the target image after the target image, the segmentation and renders image and carries out coloring synthesis, described preset Color rule is used to react the target image after the target image, the segmentation, the position display pass rendered between image System.

Further, the audio frequency effect mark in the mark according to the effect, which handles the voice data, includes：

The discrete audio track data in the voice data is gathered according to prefixed time interval；

The discrete audio track data is effectively superimposed with default track according to audio frequency effect mark.

Further, the video data and voice data in the decoding multimedia file include：

Decode the video data and voice data in the multimedia file respectively according to video track and audio track.

Further, it is described that the video data is carried out to render processing, and the voice data is carried out at track After reason, methods described also includes：

When receiving live preview request, the video data and the voice data are shown.

Further, methods described also includes：

Speed adjust instruction is received, is adjusted according to the velocity information carried in the speed adjust instruction in multimedia video The broadcasting speed of video data and voice data.

According to one aspect of the invention there is provided a kind of editing device of multimedia video, including：

Acquiring unit, for obtaining multimedia file；

Decoding unit, for decoding video data and voice data in the multimedia file；

Processing unit, for carrying out rendering processing to the video data, and is carried out at track to the voice data Reason；

Coding unit, for the video data after processing and the voice data after processing to be encoded, obtains multimedia Video.

Further, the processing unit includes：

Effective mark is carried in receiving module, the process instruction for receiving user's input, the process instruction；

Processing module, identifies for the video effect in effect mark and renders the video data, and according to The audio frequency effect mark processing voice data in the effect mark.

Further, the processing module includes：

Extracting sub-module, the view data for extracting each frame in the video data, and described image data are entered The processing of row filter；

Submodule is synthesized, for identifying the target figure after identification filter processing in view data according to the video effect Picture, and target image progress synthesis is rendered.

The synthesis submodule, if specifically for identifying that the video effect is designated synthetic stereo image, splitting The target image, according to preset coloring rule is to the target image after the target image, the segmentation and renders image Coloring synthesis is carried out, the preset coloring rule is used to react target image, the wash with watercolours after the target image, the segmentation Contaminate the position display relation between image.

Further, the processing module also includes：

Submodule is gathered, for gathering the discrete audio track data in the voice data according to prefixed time interval；

Submodule is superimposed, for being had the discrete audio track data and default track according to audio frequency effect mark Effect superposition.

The decoding unit, specifically for being decoded respectively in the multimedia file according to video track and audio track Video data and voice data.

Further, described device also includes：

Display unit, for when receiving live preview request, showing the video data and the voice data.

Further, described device also includes：

Adjustment unit, for receiving speed adjust instruction, and according to the velocity information carried in the speed adjust instruction Adjust the broadcasting speed of video data and voice data in multimedia video.

According to one aspect of the invention there is provided a kind of storage device, wherein a plurality of instruction that is stored with, the instruction is suitable to Loaded by processor and performed：

Obtain multimedia file；

Decode the video data and voice data in the multimedia file；

According to one aspect of the invention there is provided a kind of mobile terminal, including processor, various instructions are adapted for carrying out；With And storage device, suitable for storing a plurality of instruction, the instruction is suitable to be loaded and performed by processor：

Obtain multimedia file；

Decode the video data and voice data in the multimedia file；

By above-mentioned technical proposal, technical scheme provided in an embodiment of the present invention at least has following advantages：

The invention provides a kind of edit methods of multimedia video and device, multimedia file is obtained first, is then solved Video data and voice data in the code multimedia file, then the video data is carried out to render processing, and to institute State voice data and carry out track processing, finally the video data after processing and the voice data after processing are encoded, obtained Multimedia video.Compared with the short-sighted frequency intercepted in existing live or small video can not be edited, the embodiment of the present invention passes through decoding The video data and voice data gone out in multimedia file, is handled video data and voice data respectively, is being encoded to Multimedia video, realizes and enters edlin to live or interception video, increase the result of broadcast of short-sighted frequency so that video is more given birth to Dynamic, the personage in editor's rear video more fits with rendering image, improves the service efficiency of video.

Described above is only the general introduction of technical solution of the present invention, in order to better understand the technological means of the present invention, And can be practiced according to the content of specification, and in order to allow above and other objects of the present invention, feature and advantage can Become apparent, below especially exemplified by the embodiment of the present invention.

Brief description of the drawings

By reading the detailed description of hereafter preferred embodiment, various other advantages and benefit is common for this area Technical staff will be clear understanding.Accompanying drawing is only used for showing the purpose of preferred embodiment, and is not considered as to the present invention Limitation.And in whole accompanying drawing, identical part is denoted by the same reference numerals.In the accompanying drawings：

Fig. 1 shows a kind of edit methods flow chart for multimedia video that the embodiment of the present invention one is provided；

Fig. 2 shows the edit methods flow chart for another multimedia video that the embodiment of the present invention two is provided；

Fig. 3 shows a kind of editing device block diagram for multimedia video that the embodiment of the present invention three is provided；

Fig. 4 shows the editing device block diagram for another multimedia video that the embodiment of the present invention four is provided.

Embodiment

The exemplary embodiment of the disclosure is more fully described below with reference to accompanying drawings.Although showing the disclosure in accompanying drawing Exemplary embodiment, it being understood, however, that may be realized in various forms the disclosure without should be by embodiments set forth here Limited.On the contrary, these embodiments are provided to facilitate a more thoroughly understanding of the present invention, and can be by the scope of the present disclosure Complete conveys to those skilled in the art.

The embodiments of the invention provide a kind of edit methods of multimedia video, as shown in figure 1, methods described includes：

101st, multimedia file is obtained.

Wherein, the multimedia file can be the video file of different-format, such as MP4 forms, MKV forms, 3GP forms Deng, the embodiment of the present invention is not specifically limited, the multimedia file can by terminal device camera carry out shooting obtain Take, can also be intercepted, directly can also be carried from the memory space in terminal device in online live video Take, the embodiment of the present invention is not specifically limited.

It should be noted that for the ease of in editor's VAS application -to-terminal service equipment of multimedia video, in interception video or record Need to set certain video playback time during video processed so that the multimedia video ultimately generated is a shorter video, It is easy to current video editing method to be applied in the less terminal device of memory headroom.

102nd, the video data and voice data in the multimedia file are decoded.

Wherein, the decoding video data and voice data are specifically as follows by reading regarding in multimedia file respectively Frequency according to and voice data realize the decoding of video and audio, i.e., video flowing and audio stream are reduced into analog video number According to and analog audio data.

It should be noted that in embodiments of the present invention, decoding process can be completed by a media decoder, will be many Media file is delivered in media decoder, can automatically derive video data and audio time, for video data, by In video combined by the different image of multiframe, decoded video data can be specially the corresponding image of each frame Information, for voice data, decoded voice data is then the analog signal of impulse form.

103rd, the video data is carried out rendering processing, and track processing is carried out to the voice data.

Wherein, it is described to render the image that processing includes adding bitmap in video, adding dynamic image, adjust video image Effect etc., track processing includes the different tracks of increase and is combined, increases audio etc., and the embodiment of the present invention is not specifically limited, Bitmap is dot matrix image or drawing image.

It should be noted that when video data and voice data are handled, individually separated being handled, also may be used Handled with interrelated.For example, when adding a background in video, only can carry out rendering the back of the body to video data Scape processing, without handling audio track data, still, when personage enunciates in video, it is desirable to which the word of discharge is changed into word Addition is in video, it is necessary to first handle audio track data, according to the word in identification audio track data, by corresponding figure in literal pool As data addition is in video data, at this moment it is accomplished by video data and is jointly processed by with audio track data.

104th, the video data after processing and the voice data after processing are encoded, obtains multimedia video.

Wherein, it is described to be encoded to the coding for being matched the video data after rendering with voice data, with obtain it is smooth, The corresponding multimedia video of audio frequency and video.

The invention provides a kind of edit methods of multimedia video, with the short-sighted frequency intercepted in existing live or small video Can not edit and compare, the embodiment of the present invention by decoding video data and voice data in multimedia file, respectively to regarding Frequency evidence and voice data are handled, and are being encoded to multimedia video, are realized and are entered edlin to live or interception video, increase Plus the result of broadcast of short-sighted frequency so that video is more lively, and the personage in editor's rear video more fits with rendering image, improves The service efficiency of video.

The embodiments of the invention provide the edit methods of another multimedia video, as shown in Fig. 2 methods described includes：

201st, multimedia file is obtained.

This step is identical with step 101 method shown in Fig. 1, will not be repeated here.

It should be noted that to can apply to other straight for the edit methods for the multimedia video being related in the embodiment of the present invention Broadcast or the application program of recorded video in, the editor of video is realized by calling interface, can also according to corresponding program come The application program being used alone is written as, multimedia file is obtained by calling camera directly to shoot, the embodiment of the present invention is not It is specifically limited.

202nd, the video data and audio number in the multimedia file are decoded respectively according to video track and audio track According to.

Wherein, the video track is the content tracks of video playback, and the audio track is the content rail that audio is played Mark, in order to decode the video data and audio track data in multimedia file, to enter respectively to video data and audio track data Row processing, it is therefore desirable to decoded respectively with audio track according to video track.

203rd, the process instruction of user's input is received.

Wherein, effective mark is carried in the process instruction.Shown process instruction is used to indicate that system carries out video tool The video editing of body, shown effect, which is designated mark, can reach the information of different video and audio frequency effect, for example, rendering figure Picture, language and characters conversion, addition background etc., the embodiment of the present invention is not specifically limited.

If it should be noted that the image rendered is the image that user inputs, image can be entered by process instruction Row is incoming.In addition, it is necessary to which the image rendered, the background coordinatograph of addition can be the image pre-set, or use The image that family is inputted, the embodiment of the present invention is not specifically limited.

204th, the video effect mark in being identified according to the effect renders the video data, and according to the effect mark The audio frequency effect mark processing voice data in knowledge.

Wherein, the video effect is designated the mark for carrying out process video effects in video data, the sound Yupin effect is designated the mark for carrying out processing audio frequency effect in voice data, in order to further add different-effect pair The different images answered according to video or audio frequency effect mark, it is necessary to handle video data or voice data.

Video effect mark and audio frequency effect mark in by being identified according to effect carry out wash with watercolours to video and audio respectively Dye processing and audio effect processing so that image enters edlin respectively with sound, optimize the performance to video editing.

For the embodiment of the present invention, the video effect mark during step is identified according to the effect renders the video data It can specifically include：The view data of each frame in the video data is extracted, and filter processing is carried out to described image data； Target image after identification filter processing in view data is identified according to the video effect, and the target image is closed Into rendering.

Wherein, it is made up of due to the video data after parsing image information one by one, in order to add in video Plus image to the image information in each frame, it is necessary to add image, and, it is necessary to filter before handling image information Mirror processing, so as to obtain needing the video effect after filtering.The target image is the correspondence for needing to add bitmap, or to need The object rendered, the embodiment of the present invention is not specifically limited, for example, when video effect is designated addition background image, Then target image is then character image or animal painting, if video effect is designated addition and enunciated special efficacy, target image is behaved Face or mouth.

Add it should be noted that synthesis is rendered in bitmap then for the addition needs addition in each two field picture, each frame Plus the position of bitmap is different, so as to realize that the image rendered in video playback is dynamic.

For the embodiment of the present invention, step identifies the mesh after identification filter is handled in view data according to the video effect Logo image, and synthesis is carried out to the target image render and can specifically include：If identifying, the video effect is designated conjunction Into stereo-picture, then split the target image, according to preset coloring rule to the target after the target image, the segmentation Image and render image carry out coloring synthesis, it is described it is preset coloring rule be used for react after the target image, the segmentation Target image, the position display relation rendered between image.

Wherein, it using vision difference effect by bitmap display is that band has a sense of hierarchy, virtual that the synthetic stereo image, which is, Whether real three-dimensional dynamic image, this stereo-picture shows depending on the bitmap of addition position different in each two field picture Show, how much is display, obtained from.

It should be noted that if video effect is designated synthetic stereo image, specific step is then the segmentation target Image, according to preset coloring rule is to the target image after the target image, the segmentation and renders image and colours Synthesis, wherein, general, the target image of synthetic stereo image is character image, in order to which the personage in image is entered with background Row is distinguished, it is necessary to the image information to each frame is split, described to render image to need the bitmap of addition, described preset Color rule be judge whether show when rendering image coverage goal image, show how much, and render whether image needs to hide Strategy, specific strategy set according to the position of different bitmaps and personage, and the embodiment of the present invention is not specifically limited.

For the embodiment of the present invention, the audio frequency effect mark during step is identified according to the effect handles the voice data It can specifically include：The discrete audio track data in the voice data is gathered according to prefixed time interval；Imitated according to the audio The discrete audio track data is effectively superimposed by fruit mark with default track.

In order to preferably be overlapped different tracks, rather than only it is that volume carries out simply superposition, it is necessary to sound Frequency gathers discrete audio track data according to prefixed time interval according to discretization is carried out, and the prefixed time interval can be 1 Second, 0.05 second etc., the embodiment of the present invention is not specifically limited.Effective superposition can be by the discrete sounds of multiple default tracks Rail data are overlapped, except the discrete audio track data of the audio data collecting in multimedia file, and the default track of others can Think and be stored in the caching of present terminal equipment or in hard disk, the embodiment of the present invention is not specifically limited.For example, collection from Scattered audio track data be child read poem sound, it is necessary to the default track of overlapping addition is the background music of MoonlIght on the Lotus Pond, then will Sound carries out overlapping superposition.

It should be noted that in audio frequency process, the audio-frequency processing method increased income, such as Ffmpg can be selected.

205th, when receiving live preview request, the video data and the voice data are shown.

Wherein, the state of the currently processed video of preview or audio please the need for the live preview request inputs for user Ask, live preview is asked for the video image after instruction simulation playback process, can be browsed for each frame, can also be with Visual form is played out, also the audio after simulation playback process, general, displaying live view request is additionally operable to instruction simulation displaying Untreated original image and original audio, specifically depending on the time for receiving live preview request, the embodiment of the present invention is not It is specifically limited.

206th, the video data after processing and the voice data after processing are encoded, obtains multimedia video.

This step is identical with step 104 method shown in Fig. 1, will not be repeated here.

207th, speed adjust instruction is received, multimedia video is adjusted according to the velocity information carried in the speed adjust instruction The broadcasting speed of video data and voice data in frequency.

Wherein, velocity information is carried in the speed adjust instruction, can be to accelerate speed or slow-down, specifically Data can be carried in velocity information, and the embodiment of the present invention is not specifically limited.

It should be noted that the adjustment specific method for speed can be that 1 second can be adjusted if for video data The frame number of image in clock, to realize that the fast jogging speed of video is played in regulation, can adjust preset time if for voice data The interior speed for playing track, to realize that the fast jogging speed of audio is played in regulation.

For the embodiment of the present invention, specific application scenarios can be with as follows, but not limited to this, including：Intercept boy The multimedia file read online, the video data and audio number of boy's reading are decoded according to video track and audio track According to the effect of user's input is designated switch signs of enunciating, then handles video data and voice data respectively, audio is recognized first The Chinese character that boy in data reads, extracted from preset literal pool be stored with corresponding character image, preset literal pool with The corresponding character image of text-to-speech, character image is added in video data, that is, that finds " hoe standing grain day midday " renders figure As or word sprout figure, according to audio presentation time, by " hoe ", " standing grain ", " day ", " when ", that " noon " is added separately to the time is corresponding In the image of frame, the position of addition is boy's face of identification, then encodes addition character image and audio, is compiled Short-sighted frequency after volume.

The invention provides the edit methods of another multimedia video, the embodiment of the present invention is by decoding multimedia text Video data and voice data in part, render to video data according to video effect mark, are identified according to audio frequency effect Voice data is effectively superimposed, re-encoded as multimedia video, realizes and edlin is entered to live or interception video, increase The result of broadcast of short-sighted frequency so that video is more lively, improve video content shows effect, the personage in editor's rear video with Render image more to fit, the short-sighted frequency of recording can be designed according to different requirements, increase the purposes of short-sighted frequency, improve The service efficiency of video.

Further, as the realization to method shown in above-mentioned Fig. 1, the embodiments of the invention provide a kind of multimedia video Editing device, as shown in figure 3, the device includes：Acquiring unit 31, decoding unit 32, processing unit 33, coding unit 34.

Acquiring unit 31, for obtaining multimedia file；The acquiring unit 31 is held for the editing device of multimedia video Row obtains the functional module of multimedia file.

Decoding unit 32, for decoding video data and voice data in the multimedia file；The decoding unit The function mould of video data and voice data in 32 multimedia files described in the editing device perform decoding of multimedia video Block.

Processing unit 33, for carrying out rendering processing to the video data, and carries out track to the voice data Processing；The processing unit 33 performs for the editing device of multimedia video to carry out rendering processing to the video data, and The functional module of track processing is carried out to the voice data.

Coding unit 34, for the video data after processing and the voice data after processing to be encoded, obtains many matchmakers Volumetric video.The coding unit 34 is performed the video data after processing and the sound after processing for the editing device of multimedia video Frequency obtains the functional module of multimedia video according to being encoded.

The invention provides a kind of editing device of multimedia video, with the short-sighted frequency intercepted in existing live or small video Can not edit and compare, the embodiment of the present invention by decoding video data and voice data in multimedia file, respectively to regarding Frequency evidence and voice data are handled, and are being encoded to multimedia video, are realized and are entered edlin to live or interception video, increase Plus the result of broadcast of short-sighted frequency so that video is more lively, and the personage in editor's rear video more fits with rendering image, improves The service efficiency of video.

Further, as the realization to method shown in above-mentioned Fig. 2, the embodiments of the invention provide another multimedia video The editing device of frequency, as shown in figure 4, the device includes：Acquiring unit 41, decoding unit 42, processing unit 43, coding unit 44th, display unit 45, adjustment unit 46.

Acquiring unit 41, for obtaining multimedia file；

Decoding unit 42, for decoding video data and voice data in the multimedia file；

Processing unit 43, for carrying out rendering processing to the video data, and carries out track to the voice data Processing；

Coding unit 44, for the video data after processing and the voice data after processing to be encoded, obtains many matchmakers Volumetric video.

Specifically, for the ease of carrying out processing video and audio according to the demand of user, the processing unit 43 includes：

Effective mark is carried in receiving module 4301, the process instruction for receiving user's input, the process instruction；

Processing module 4302, the video data is rendered for the video effect mark in effect mark, and Audio frequency effect mark in being identified according to the effect handles the voice data.

Specifically, in order to implement the process step to video data, the processing module 4302 includes：

Extracting sub-module 430201, the view data for extracting each frame in the video data, and to described image Data carry out filter processing；

Submodule 430202 is synthesized, for identifying the mesh after identification filter processing in view data according to the video effect Logo image, and target image progress synthesis is rendered.

The synthesis submodule 430202, if specifically for identifying that the video effect is designated synthetic stereo image, Then split the target image, according to preset coloring rule to the target image and wash with watercolours after the target image, the segmentation Contaminate image carry out coloring synthesis, it is described it is preset coloring rule be used for react the target image after the target image, the segmentation, The position display relation rendered between image.

Specifically, in order to implement the process step to voice data, the processing module 4302 also includes：

Submodule 430203 is gathered, for gathering the discrete track number in the voice data according to prefixed time interval According to；

Submodule 430204 is superimposed, for being identified according to the audio frequency effect by the discrete audio track data and default track Effectively it is superimposed.

The decoding unit 42, specifically for being decoded respectively in the multimedia file according to video track and audio track Video data and voice data.

Further, the audio of video that preview renders and processing is carried out at any time for the ease of user, described device is also wrapped Include：

Display unit 45, for when receiving live preview request, showing the video data and the voice data.

Further, in order to arbitrarily adjust the speed of broadcasting video, described device also includes：

Adjustment unit 46, believes for receiving speed adjust instruction, and according to the speed carried in the speed adjust instruction The broadcasting speed of video data and voice data in breath adjustment multimedia video.

The invention provides the editing device of another multimedia video, the embodiment of the present invention is by decoding multimedia text Video data and voice data in part, render to video data according to video effect mark, are identified according to audio frequency effect Voice data is effectively superimposed, re-encoded as multimedia video, realizes and edlin is entered to live or interception video, increase The result of broadcast of short-sighted frequency so that video is more lively, improve video content shows effect, the personage in editor's rear video with Render image more to fit, the short-sighted frequency of recording can be designed according to different requirements, increase the purposes of short-sighted frequency, improve The service efficiency of video.

The embodiments of the invention provide a kind of storage device, wherein a plurality of instruction that is stored with, the instruction is suitable to by handling Device is loaded and performed：Obtain multimedia file；Decode the video data and voice data in the multimedia file；Regarded to described Frequency renders processing according to progress, and carries out track processing to the voice data；After the video data after processing and processing Voice data encoded, obtain multimedia video.

The embodiments of the invention provide a kind of mobile terminal, including processor, various instructions are adapted for carrying out；And storage is set Standby, suitable for storing a plurality of instruction, the instruction is suitable to be loaded and performed by processor：Obtain multimedia file；Decoding is described more Video data and voice data in media file；The video data is carried out to render processing, and to the voice data Carry out track processing；Video data after processing and the voice data after processing are encoded, multimedia video is obtained.

In the above-described embodiments, the description to each embodiment all emphasizes particularly on different fields, and does not have the portion being described in detail in some embodiment Point, it may refer to the associated description of other embodiment.

It is understood that the correlated characteristic in the above method and device can be referred to mutually.In addition, in above-described embodiment " first ", " second " etc. be to be used to distinguish each embodiment, and do not represent the quality of each embodiment.

It is apparent to those skilled in the art that, for convenience and simplicity of description, the system of foregoing description, The specific work process of device and unit, may be referred to the corresponding process in preceding method embodiment, will not be repeated here.

Algorithm and display be not inherently related to any certain computer, virtual system or miscellaneous equipment provided herein. Various general-purpose systems can also be used together with based on teaching in this.As described above, construct required by this kind of system Structure be obvious.In addition, the present invention is not also directed to any certain programmed language.It is understood that, it is possible to use it is various Programming language realizes the content of invention described herein, and the description done above to language-specific is to disclose this hair Bright preferred forms.

In the specification that this place is provided, numerous specific details are set forth.It is to be appreciated, however, that the implementation of the present invention Example can be put into practice in the case of these no details.In some instances, known method, structure is not been shown in detail And technology, so as not to obscure the understanding of this description.

Similarly, it will be appreciated that in order to simplify the disclosure and help to understand one or more of each inventive aspect, exist Above in the description of the exemplary embodiment of the present invention, each feature of the invention is grouped together into single implementation sometimes In example, figure or descriptions thereof.However, the method for the disclosure should be construed to reflect following intention：It is i.e. required to protect The application claims of shield features more more than the feature being expressly recited in each claim.More precisely, such as following Claims reflect as, inventive aspect is all features less than single embodiment disclosed above.Therefore, Thus the claims for following embodiment are expressly incorporated in the embodiment, wherein each claim is in itself All as the separate embodiments of the present invention.

Those skilled in the art, which are appreciated that, to be carried out adaptively to the module in the equipment in embodiment Change and they are arranged in one or more equipment different from the embodiment.Can be the module or list in embodiment Member or component be combined into a module or unit or component, and can be divided into addition multiple submodule or subelement or Sub-component.In addition at least some in such feature and/or process or unit exclude each other, it can use any Combination is disclosed to all features disclosed in this specification (including adjoint claim, summary and accompanying drawing) and so to appoint Where all processes or unit of method or equipment are combined.Unless expressly stated otherwise, this specification (including adjoint power Profit is required, summary and accompanying drawing) disclosed in each feature can or similar purpose identical, equivalent by offer alternative features come generation Replace.

Although in addition, it will be appreciated by those of skill in the art that some embodiments described herein include other embodiments In included some features rather than further feature, but the combination of the feature of be the same as Example does not mean in of the invention Within the scope of and form different embodiments.For example, in the following claims, times of embodiment claimed One of meaning mode can be used in any combination.

The present invention all parts embodiment can be realized with hardware, or with one or more processor run Software module realize, or realized with combinations thereof.It will be understood by those of skill in the art that can use in practice Microprocessor or digital signal processor (DSP) come realize multimedia video according to embodiments of the present invention edit methods and The some or all functions of some or all parts in device.The present invention is also implemented as being used to perform being retouched here The some or all equipment or program of device (for example, computer program and computer program product) for the method stated. Such program for realizing the present invention can be stored on a computer-readable medium, or can have one or more signal Form.Such signal can be downloaded from internet website and obtained, either on carrier signal provide or with it is any its He provides form.

It should be noted that the present invention will be described rather than limits the invention for above-described embodiment, and ability Field technique personnel can design alternative embodiment without departing from the scope of the appended claims.In the claims, Any reference symbol between bracket should not be configured to limitations on claims.Word "comprising" is not excluded the presence of not Element or step listed in the claims.Word "a" or "an" before element does not exclude the presence of multiple such Element.The present invention can be by means of including the hardware of some different elements and coming real by means of properly programmed computer It is existing.In if the unit claim of equipment for drying is listed, several in these devices can be by same hardware branch To embody.The use of word first, second, and third does not indicate that any order.These words can be explained and run after fame Claim.

Embodiment of the invention discloses that：

A1, a kind of edit methods of multimedia video, including：

Obtain multimedia file；

Decode the video data and voice data in the multimedia file；

A2, the method according to A1, it is described that the video data is carried out to render processing, and to the voice data Carrying out track processing includes：

Video effect mark in A3, the method according to A2, the mark according to the effect renders the video Data include：

A4, the method according to A3, after the mark identification filter processing according to the video effect in view data Target image, and to the target image carry out synthesis render including：

Audio frequency effect mark in A5, the method according to A2, the mark according to the effect handles the audio Data include：

Video data and voice data in A6, the method according to A1, the decoding multimedia file include：

A7, the method according to A1, it is described that the video data is carried out to render processing, and to the voice data Carry out after track processing, methods described also includes：

A8, the method according to A1, methods described also include：

B9, a kind of editing device of multimedia video, including：

Acquiring unit, for obtaining multimedia file；

Decoding unit, for decoding video data and voice data in the multimedia file；

B10, the device according to B9, the processing unit include：

B11, the device according to B10, the processing module include：

B12, the device according to B11,

B13, the device according to B10, the processing module also include：

B14, the device according to B9,

B15, the device according to B9, described device also include：

B16, the device according to B9, described device also include：

C17, a kind of storage device, wherein a plurality of instruction that is stored with, the instruction is suitable to be loaded and performed by processor：

Obtain multimedia file；

Decode the video data and voice data in the multimedia file；

D18, a kind of mobile terminal, including processor, are adapted for carrying out various instructions；And storage device, it is many suitable for storing Bar is instructed, and the instruction is suitable to be loaded and performed by processor：

Obtain multimedia file；

Decode the video data and voice data in the multimedia file；

Claims

1. a kind of edit methods of multimedia video, it is characterised in that including：

Obtain multimedia file；

Decode the video data and voice data in the multimedia file；

2. according to the method described in claim 1, it is characterised in that described that the video data is carried out to render processing, and Track processing is carried out to the voice data to be included：

Video effect mark in being identified according to the effect renders the video data, and the sound in effect mark The yupin effect mark processing voice data.

3. method according to claim 2, it is characterised in that the video effect in the mark according to the effect is identified Rendering the video data includes：

Target image after identification filter processing in view data is identified according to the video effect, and the target image is entered Row synthesis is rendered.

4. method according to claim 3, it is characterised in that described according to video effect mark identification filter processing Target image in view data afterwards, and the target image is carried out synthesis render including：

If identifying, the video effect is designated synthetic stereo image, splits the target image, is advised according to preset coloring Then to the target image after the target image, the segmentation and render image carry out coloring synthesis, it is described it is preset coloring rule Then it is used to react the target image after the target image, the segmentation, the position display relation rendered between image.

5. a kind of editing device of multimedia video, it is characterised in that including：

Acquiring unit, for obtaining multimedia file；

Decoding unit, for decoding video data and voice data in the multimedia file；

Processing unit, for carrying out rendering processing to the video data, and carries out track processing to the voice data；

6. device according to claim 5, it is characterised in that the processing unit includes：

Processing module, the video data is rendered for the video effect mark in effect mark, and according to described The audio frequency effect mark processing voice data in effect mark.

7. device according to claim 6, it is characterised in that the processing module includes：

Extracting sub-module, the view data for extracting each frame in the video data, and described image data are filtered Mirror processing；

Submodule is synthesized, for identifying the target image after identification filter processing in view data according to the video effect, and Synthesis is carried out to the target image to render.

8. device according to claim 7, it is characterised in that

The synthesis submodule, if specifically for identifying that the video effect is designated synthetic stereo image, segmentation is described Target image, according to preset coloring rule is to the target image after the target image, the segmentation and renders image progress Coloring synthesis, the preset coloring rule is used to react the target image after the target image, the segmentation, described renders figure Position display relation as between.

9. a kind of storage device, wherein a plurality of instruction that is stored with, the instruction is suitable to be loaded and performed by processor：

Obtain multimedia file；

Decode the video data and voice data in the multimedia file；

10. a kind of mobile terminal, including processor, are adapted for carrying out various instructions；And storage device, suitable for storing a plurality of refer to Order, the instruction is suitable to be loaded and performed by processor：

Obtain multimedia file；

Decode the video data and voice data in the multimedia file；