CN107241646A - The edit methods and device of multimedia video - Google Patents

The edit methods and device of multimedia video Download PDF

Info

Publication number
CN107241646A
CN107241646A CN201710566432.2A CN201710566432A CN107241646A CN 107241646 A CN107241646 A CN 107241646A CN 201710566432 A CN201710566432 A CN 201710566432A CN 107241646 A CN107241646 A CN 107241646A
Authority
CN
China
Prior art keywords
video
processing
data
voice data
target image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710566432.2A
Other languages
Chinese (zh)
Other versions
CN107241646B (en
Inventor
邵可
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qihoo Technology Co Ltd filed Critical Beijing Qihoo Technology Co Ltd
Priority to CN201710566432.2A priority Critical patent/CN107241646B/en
Publication of CN107241646A publication Critical patent/CN107241646A/en
Application granted granted Critical
Publication of CN107241646B publication Critical patent/CN107241646B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/44012Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving rendering scenes according to scene graphs, e.g. MPEG-4 scene graphs

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Television Signal Processing For Recording (AREA)

Abstract

The invention discloses a kind of edit methods of multimedia video and device, it is related to multimedia technology field, main purpose is the problem of short-sighted frequency intercepted in existing live or small video can not be edited.Main technical schemes include:Obtain multimedia file;Decode the video data and voice data in the multimedia file;The video data is carried out to render processing, and track processing is carried out to the voice data;Video data after processing and the voice data after processing are encoded, multimedia video is obtained.It is mainly used in the editor of multimedia video.

Description

The edit methods and device of multimedia video
Technical field
The present invention relates to multimedia technology field, the edit methods and device of more particularly to a kind of multimedia video.
Background technology
With the fast development of Internet technology, people have been no longer satisfied with simple use mobile phone communication to be handed over Stream and communication, wherein, the social platform that online live, small video etc. is set up using multimedia technology is entered between having become user The Main Means that row is linked up.
At present, user, can be by intercepting one in video when using terminal equipment carries out live or recording small video Segment is preserved, for example, certain live platform just live little girl dance, in order to record little girl rotation video, it is necessary to Intercept the short-sighted frequency that little girl rotates in live video.It is right in order to strengthen the result of broadcast to video content after interception video Multimedia video enters edlin and has become urgent problem to be solved.
The content of the invention
In view of this, the present invention provides a kind of edit methods and device of multimedia video, and main purpose is existing straight Broadcast or small video in the short-sighted frequency that intercepts the problem of can not edit.
According to one aspect of the invention there is provided a kind of edit methods of multimedia video, including:
Obtain multimedia file;
Decode the video data and voice data in the multimedia file;
The video data is carried out to render processing, and track processing is carried out to the voice data;
Video data after processing and the voice data after processing are encoded, multimedia video is obtained.
Further, it is described that the video data is carried out to render processing, and the voice data is carried out at track Reason includes:
Receive in the process instruction of user's input, the process instruction and carry effective mark;
Video effect mark in being identified according to the effect renders the video data, and according in effect mark The audio frequency effect mark processing voice data.
Further, the video effect mark in the mark according to the effect, which renders the video data, includes:
The view data of each frame in the video data is extracted, and filter processing is carried out to described image data;
Target image after identification filter processing in view data is identified according to the video effect, and to the target figure Rendered as carrying out synthesis.
Further, the target image identified according to the video effect after identification filter processing in view data, And to the target image carry out synthesis render including:
If identifying, the video effect is designated synthetic stereo image, splits the target image, according to preset Color rule is to the target image after the target image, the segmentation and renders image and carries out coloring synthesis, described preset Color rule is used to react the target image after the target image, the segmentation, the position display pass rendered between image System.
Further, the audio frequency effect mark in the mark according to the effect, which handles the voice data, includes:
The discrete audio track data in the voice data is gathered according to prefixed time interval;
The discrete audio track data is effectively superimposed with default track according to audio frequency effect mark.
Further, the video data and voice data in the decoding multimedia file include:
Decode the video data and voice data in the multimedia file respectively according to video track and audio track.
Further, it is described that the video data is carried out to render processing, and the voice data is carried out at track After reason, methods described also includes:
When receiving live preview request, the video data and the voice data are shown.
Further, methods described also includes:
Speed adjust instruction is received, is adjusted according to the velocity information carried in the speed adjust instruction in multimedia video The broadcasting speed of video data and voice data.
According to one aspect of the invention there is provided a kind of editing device of multimedia video, including:
Acquiring unit, for obtaining multimedia file;
Decoding unit, for decoding video data and voice data in the multimedia file;
Processing unit, for carrying out rendering processing to the video data, and is carried out at track to the voice data Reason;
Coding unit, for the video data after processing and the voice data after processing to be encoded, obtains multimedia Video.
Further, the processing unit includes:
Effective mark is carried in receiving module, the process instruction for receiving user's input, the process instruction;
Processing module, identifies for the video effect in effect mark and renders the video data, and according to The audio frequency effect mark processing voice data in the effect mark.
Further, the processing module includes:
Extracting sub-module, the view data for extracting each frame in the video data, and described image data are entered The processing of row filter;
Submodule is synthesized, for identifying the target figure after identification filter processing in view data according to the video effect Picture, and target image progress synthesis is rendered.
The synthesis submodule, if specifically for identifying that the video effect is designated synthetic stereo image, splitting The target image, according to preset coloring rule is to the target image after the target image, the segmentation and renders image Coloring synthesis is carried out, the preset coloring rule is used to react target image, the wash with watercolours after the target image, the segmentation Contaminate the position display relation between image.
Further, the processing module also includes:
Submodule is gathered, for gathering the discrete audio track data in the voice data according to prefixed time interval;
Submodule is superimposed, for being had the discrete audio track data and default track according to audio frequency effect mark Effect superposition.
The decoding unit, specifically for being decoded respectively in the multimedia file according to video track and audio track Video data and voice data.
Further, described device also includes:
Display unit, for when receiving live preview request, showing the video data and the voice data.
Further, described device also includes:
Adjustment unit, for receiving speed adjust instruction, and according to the velocity information carried in the speed adjust instruction Adjust the broadcasting speed of video data and voice data in multimedia video.
According to one aspect of the invention there is provided a kind of storage device, wherein a plurality of instruction that is stored with, the instruction is suitable to Loaded by processor and performed:
Obtain multimedia file;
Decode the video data and voice data in the multimedia file;
The video data is carried out to render processing, and track processing is carried out to the voice data;
Video data after processing and the voice data after processing are encoded, multimedia video is obtained.
According to one aspect of the invention there is provided a kind of mobile terminal, including processor, various instructions are adapted for carrying out;With And storage device, suitable for storing a plurality of instruction, the instruction is suitable to be loaded and performed by processor:
Obtain multimedia file;
Decode the video data and voice data in the multimedia file;
The video data is carried out to render processing, and track processing is carried out to the voice data;
Video data after processing and the voice data after processing are encoded, multimedia video is obtained.
By above-mentioned technical proposal, technical scheme provided in an embodiment of the present invention at least has following advantages:
The invention provides a kind of edit methods of multimedia video and device, multimedia file is obtained first, is then solved Video data and voice data in the code multimedia file, then the video data is carried out to render processing, and to institute State voice data and carry out track processing, finally the video data after processing and the voice data after processing are encoded, obtained Multimedia video.Compared with the short-sighted frequency intercepted in existing live or small video can not be edited, the embodiment of the present invention passes through decoding The video data and voice data gone out in multimedia file, is handled video data and voice data respectively, is being encoded to Multimedia video, realizes and enters edlin to live or interception video, increase the result of broadcast of short-sighted frequency so that video is more given birth to Dynamic, the personage in editor's rear video more fits with rendering image, improves the service efficiency of video.
Described above is only the general introduction of technical solution of the present invention, in order to better understand the technological means of the present invention, And can be practiced according to the content of specification, and in order to allow above and other objects of the present invention, feature and advantage can Become apparent, below especially exemplified by the embodiment of the present invention.
Brief description of the drawings
By reading the detailed description of hereafter preferred embodiment, various other advantages and benefit is common for this area Technical staff will be clear understanding.Accompanying drawing is only used for showing the purpose of preferred embodiment, and is not considered as to the present invention Limitation.And in whole accompanying drawing, identical part is denoted by the same reference numerals.In the accompanying drawings:
Fig. 1 shows a kind of edit methods flow chart for multimedia video that the embodiment of the present invention one is provided;
Fig. 2 shows the edit methods flow chart for another multimedia video that the embodiment of the present invention two is provided;
Fig. 3 shows a kind of editing device block diagram for multimedia video that the embodiment of the present invention three is provided;
Fig. 4 shows the editing device block diagram for another multimedia video that the embodiment of the present invention four is provided.
Embodiment
The exemplary embodiment of the disclosure is more fully described below with reference to accompanying drawings.Although showing the disclosure in accompanying drawing Exemplary embodiment, it being understood, however, that may be realized in various forms the disclosure without should be by embodiments set forth here Limited.On the contrary, these embodiments are provided to facilitate a more thoroughly understanding of the present invention, and can be by the scope of the present disclosure Complete conveys to those skilled in the art.
The embodiments of the invention provide a kind of edit methods of multimedia video, as shown in figure 1, methods described includes:
101st, multimedia file is obtained.
Wherein, the multimedia file can be the video file of different-format, such as MP4 forms, MKV forms, 3GP forms Deng, the embodiment of the present invention is not specifically limited, the multimedia file can by terminal device camera carry out shooting obtain Take, can also be intercepted, directly can also be carried from the memory space in terminal device in online live video Take, the embodiment of the present invention is not specifically limited.
It should be noted that for the ease of in editor's VAS application -to-terminal service equipment of multimedia video, in interception video or record Need to set certain video playback time during video processed so that the multimedia video ultimately generated is a shorter video, It is easy to current video editing method to be applied in the less terminal device of memory headroom.
102nd, the video data and voice data in the multimedia file are decoded.
Wherein, the decoding video data and voice data are specifically as follows by reading regarding in multimedia file respectively Frequency according to and voice data realize the decoding of video and audio, i.e., video flowing and audio stream are reduced into analog video number According to and analog audio data.
It should be noted that in embodiments of the present invention, decoding process can be completed by a media decoder, will be many Media file is delivered in media decoder, can automatically derive video data and audio time, for video data, by In video combined by the different image of multiframe, decoded video data can be specially the corresponding image of each frame Information, for voice data, decoded voice data is then the analog signal of impulse form.
103rd, the video data is carried out rendering processing, and track processing is carried out to the voice data.
Wherein, it is described to render the image that processing includes adding bitmap in video, adding dynamic image, adjust video image Effect etc., track processing includes the different tracks of increase and is combined, increases audio etc., and the embodiment of the present invention is not specifically limited, Bitmap is dot matrix image or drawing image.
It should be noted that when video data and voice data are handled, individually separated being handled, also may be used Handled with interrelated.For example, when adding a background in video, only can carry out rendering the back of the body to video data Scape processing, without handling audio track data, still, when personage enunciates in video, it is desirable to which the word of discharge is changed into word Addition is in video, it is necessary to first handle audio track data, according to the word in identification audio track data, by corresponding figure in literal pool As data addition is in video data, at this moment it is accomplished by video data and is jointly processed by with audio track data.
104th, the video data after processing and the voice data after processing are encoded, obtains multimedia video.
Wherein, it is described to be encoded to the coding for being matched the video data after rendering with voice data, with obtain it is smooth, The corresponding multimedia video of audio frequency and video.
The invention provides a kind of edit methods of multimedia video, with the short-sighted frequency intercepted in existing live or small video Can not edit and compare, the embodiment of the present invention by decoding video data and voice data in multimedia file, respectively to regarding Frequency evidence and voice data are handled, and are being encoded to multimedia video, are realized and are entered edlin to live or interception video, increase Plus the result of broadcast of short-sighted frequency so that video is more lively, and the personage in editor's rear video more fits with rendering image, improves The service efficiency of video.
The embodiments of the invention provide the edit methods of another multimedia video, as shown in Fig. 2 methods described includes:
201st, multimedia file is obtained.
This step is identical with step 101 method shown in Fig. 1, will not be repeated here.
It should be noted that to can apply to other straight for the edit methods for the multimedia video being related in the embodiment of the present invention Broadcast or the application program of recorded video in, the editor of video is realized by calling interface, can also according to corresponding program come The application program being used alone is written as, multimedia file is obtained by calling camera directly to shoot, the embodiment of the present invention is not It is specifically limited.
202nd, the video data and audio number in the multimedia file are decoded respectively according to video track and audio track According to.
Wherein, the video track is the content tracks of video playback, and the audio track is the content rail that audio is played Mark, in order to decode the video data and audio track data in multimedia file, to enter respectively to video data and audio track data Row processing, it is therefore desirable to decoded respectively with audio track according to video track.
203rd, the process instruction of user's input is received.
Wherein, effective mark is carried in the process instruction.Shown process instruction is used to indicate that system carries out video tool The video editing of body, shown effect, which is designated mark, can reach the information of different video and audio frequency effect, for example, rendering figure Picture, language and characters conversion, addition background etc., the embodiment of the present invention is not specifically limited.
If it should be noted that the image rendered is the image that user inputs, image can be entered by process instruction Row is incoming.In addition, it is necessary to which the image rendered, the background coordinatograph of addition can be the image pre-set, or use The image that family is inputted, the embodiment of the present invention is not specifically limited.
204th, the video effect mark in being identified according to the effect renders the video data, and according to the effect mark The audio frequency effect mark processing voice data in knowledge.
Wherein, the video effect is designated the mark for carrying out process video effects in video data, the sound Yupin effect is designated the mark for carrying out processing audio frequency effect in voice data, in order to further add different-effect pair The different images answered according to video or audio frequency effect mark, it is necessary to handle video data or voice data.
Video effect mark and audio frequency effect mark in by being identified according to effect carry out wash with watercolours to video and audio respectively Dye processing and audio effect processing so that image enters edlin respectively with sound, optimize the performance to video editing.
For the embodiment of the present invention, the video effect mark during step is identified according to the effect renders the video data It can specifically include:The view data of each frame in the video data is extracted, and filter processing is carried out to described image data; Target image after identification filter processing in view data is identified according to the video effect, and the target image is closed Into rendering.
Wherein, it is made up of due to the video data after parsing image information one by one, in order to add in video Plus image to the image information in each frame, it is necessary to add image, and, it is necessary to filter before handling image information Mirror processing, so as to obtain needing the video effect after filtering.The target image is the correspondence for needing to add bitmap, or to need The object rendered, the embodiment of the present invention is not specifically limited, for example, when video effect is designated addition background image, Then target image is then character image or animal painting, if video effect is designated addition and enunciated special efficacy, target image is behaved Face or mouth.
Add it should be noted that synthesis is rendered in bitmap then for the addition needs addition in each two field picture, each frame Plus the position of bitmap is different, so as to realize that the image rendered in video playback is dynamic.
For the embodiment of the present invention, step identifies the mesh after identification filter is handled in view data according to the video effect Logo image, and synthesis is carried out to the target image render and can specifically include:If identifying, the video effect is designated conjunction Into stereo-picture, then split the target image, according to preset coloring rule to the target after the target image, the segmentation Image and render image carry out coloring synthesis, it is described it is preset coloring rule be used for react after the target image, the segmentation Target image, the position display relation rendered between image.
Wherein, it using vision difference effect by bitmap display is that band has a sense of hierarchy, virtual that the synthetic stereo image, which is, Whether real three-dimensional dynamic image, this stereo-picture shows depending on the bitmap of addition position different in each two field picture Show, how much is display, obtained from.
It should be noted that if video effect is designated synthetic stereo image, specific step is then the segmentation target Image, according to preset coloring rule is to the target image after the target image, the segmentation and renders image and colours Synthesis, wherein, general, the target image of synthetic stereo image is character image, in order to which the personage in image is entered with background Row is distinguished, it is necessary to the image information to each frame is split, described to render image to need the bitmap of addition, described preset Color rule be judge whether show when rendering image coverage goal image, show how much, and render whether image needs to hide Strategy, specific strategy set according to the position of different bitmaps and personage, and the embodiment of the present invention is not specifically limited.
For the embodiment of the present invention, the audio frequency effect mark during step is identified according to the effect handles the voice data It can specifically include:The discrete audio track data in the voice data is gathered according to prefixed time interval;Imitated according to the audio The discrete audio track data is effectively superimposed by fruit mark with default track.
In order to preferably be overlapped different tracks, rather than only it is that volume carries out simply superposition, it is necessary to sound Frequency gathers discrete audio track data according to prefixed time interval according to discretization is carried out, and the prefixed time interval can be 1 Second, 0.05 second etc., the embodiment of the present invention is not specifically limited.Effective superposition can be by the discrete sounds of multiple default tracks Rail data are overlapped, except the discrete audio track data of the audio data collecting in multimedia file, and the default track of others can Think and be stored in the caching of present terminal equipment or in hard disk, the embodiment of the present invention is not specifically limited.For example, collection from Scattered audio track data be child read poem sound, it is necessary to the default track of overlapping addition is the background music of MoonlIght on the Lotus Pond, then will Sound carries out overlapping superposition.
It should be noted that in audio frequency process, the audio-frequency processing method increased income, such as Ffmpg can be selected.
205th, when receiving live preview request, the video data and the voice data are shown.
Wherein, the state of the currently processed video of preview or audio please the need for the live preview request inputs for user Ask, live preview is asked for the video image after instruction simulation playback process, can be browsed for each frame, can also be with Visual form is played out, also the audio after simulation playback process, general, displaying live view request is additionally operable to instruction simulation displaying Untreated original image and original audio, specifically depending on the time for receiving live preview request, the embodiment of the present invention is not It is specifically limited.
206th, the video data after processing and the voice data after processing are encoded, obtains multimedia video.
This step is identical with step 104 method shown in Fig. 1, will not be repeated here.
207th, speed adjust instruction is received, multimedia video is adjusted according to the velocity information carried in the speed adjust instruction The broadcasting speed of video data and voice data in frequency.
Wherein, velocity information is carried in the speed adjust instruction, can be to accelerate speed or slow-down, specifically Data can be carried in velocity information, and the embodiment of the present invention is not specifically limited.
It should be noted that the adjustment specific method for speed can be that 1 second can be adjusted if for video data The frame number of image in clock, to realize that the fast jogging speed of video is played in regulation, can adjust preset time if for voice data The interior speed for playing track, to realize that the fast jogging speed of audio is played in regulation.
For the embodiment of the present invention, specific application scenarios can be with as follows, but not limited to this, including:Intercept boy The multimedia file read online, the video data and audio number of boy's reading are decoded according to video track and audio track According to the effect of user's input is designated switch signs of enunciating, then handles video data and voice data respectively, audio is recognized first The Chinese character that boy in data reads, extracted from preset literal pool be stored with corresponding character image, preset literal pool with The corresponding character image of text-to-speech, character image is added in video data, that is, that finds " hoe standing grain day midday " renders figure As or word sprout figure, according to audio presentation time, by " hoe ", " standing grain ", " day ", " when ", that " noon " is added separately to the time is corresponding In the image of frame, the position of addition is boy's face of identification, then encodes addition character image and audio, is compiled Short-sighted frequency after volume.
The invention provides the edit methods of another multimedia video, the embodiment of the present invention is by decoding multimedia text Video data and voice data in part, render to video data according to video effect mark, are identified according to audio frequency effect Voice data is effectively superimposed, re-encoded as multimedia video, realizes and edlin is entered to live or interception video, increase The result of broadcast of short-sighted frequency so that video is more lively, improve video content shows effect, the personage in editor's rear video with Render image more to fit, the short-sighted frequency of recording can be designed according to different requirements, increase the purposes of short-sighted frequency, improve The service efficiency of video.
Further, as the realization to method shown in above-mentioned Fig. 1, the embodiments of the invention provide a kind of multimedia video Editing device, as shown in figure 3, the device includes:Acquiring unit 31, decoding unit 32, processing unit 33, coding unit 34.
Acquiring unit 31, for obtaining multimedia file;The acquiring unit 31 is held for the editing device of multimedia video Row obtains the functional module of multimedia file.
Decoding unit 32, for decoding video data and voice data in the multimedia file;The decoding unit The function mould of video data and voice data in 32 multimedia files described in the editing device perform decoding of multimedia video Block.
Processing unit 33, for carrying out rendering processing to the video data, and carries out track to the voice data Processing;The processing unit 33 performs for the editing device of multimedia video to carry out rendering processing to the video data, and The functional module of track processing is carried out to the voice data.
Coding unit 34, for the video data after processing and the voice data after processing to be encoded, obtains many matchmakers Volumetric video.The coding unit 34 is performed the video data after processing and the sound after processing for the editing device of multimedia video Frequency obtains the functional module of multimedia video according to being encoded.
The invention provides a kind of editing device of multimedia video, with the short-sighted frequency intercepted in existing live or small video Can not edit and compare, the embodiment of the present invention by decoding video data and voice data in multimedia file, respectively to regarding Frequency evidence and voice data are handled, and are being encoded to multimedia video, are realized and are entered edlin to live or interception video, increase Plus the result of broadcast of short-sighted frequency so that video is more lively, and the personage in editor's rear video more fits with rendering image, improves The service efficiency of video.
Further, as the realization to method shown in above-mentioned Fig. 2, the embodiments of the invention provide another multimedia video The editing device of frequency, as shown in figure 4, the device includes:Acquiring unit 41, decoding unit 42, processing unit 43, coding unit 44th, display unit 45, adjustment unit 46.
Acquiring unit 41, for obtaining multimedia file;
Decoding unit 42, for decoding video data and voice data in the multimedia file;
Processing unit 43, for carrying out rendering processing to the video data, and carries out track to the voice data Processing;
Coding unit 44, for the video data after processing and the voice data after processing to be encoded, obtains many matchmakers Volumetric video.
Specifically, for the ease of carrying out processing video and audio according to the demand of user, the processing unit 43 includes:
Effective mark is carried in receiving module 4301, the process instruction for receiving user's input, the process instruction;
Processing module 4302, the video data is rendered for the video effect mark in effect mark, and Audio frequency effect mark in being identified according to the effect handles the voice data.
Specifically, in order to implement the process step to video data, the processing module 4302 includes:
Extracting sub-module 430201, the view data for extracting each frame in the video data, and to described image Data carry out filter processing;
Submodule 430202 is synthesized, for identifying the mesh after identification filter processing in view data according to the video effect Logo image, and target image progress synthesis is rendered.
The synthesis submodule 430202, if specifically for identifying that the video effect is designated synthetic stereo image, Then split the target image, according to preset coloring rule to the target image and wash with watercolours after the target image, the segmentation Contaminate image carry out coloring synthesis, it is described it is preset coloring rule be used for react the target image after the target image, the segmentation, The position display relation rendered between image.
Specifically, in order to implement the process step to voice data, the processing module 4302 also includes:
Submodule 430203 is gathered, for gathering the discrete track number in the voice data according to prefixed time interval According to;
Submodule 430204 is superimposed, for being identified according to the audio frequency effect by the discrete audio track data and default track Effectively it is superimposed.
The decoding unit 42, specifically for being decoded respectively in the multimedia file according to video track and audio track Video data and voice data.
Further, the audio of video that preview renders and processing is carried out at any time for the ease of user, described device is also wrapped Include:
Display unit 45, for when receiving live preview request, showing the video data and the voice data.
Further, in order to arbitrarily adjust the speed of broadcasting video, described device also includes:
Adjustment unit 46, believes for receiving speed adjust instruction, and according to the speed carried in the speed adjust instruction The broadcasting speed of video data and voice data in breath adjustment multimedia video.
The invention provides the editing device of another multimedia video, the embodiment of the present invention is by decoding multimedia text Video data and voice data in part, render to video data according to video effect mark, are identified according to audio frequency effect Voice data is effectively superimposed, re-encoded as multimedia video, realizes and edlin is entered to live or interception video, increase The result of broadcast of short-sighted frequency so that video is more lively, improve video content shows effect, the personage in editor's rear video with Render image more to fit, the short-sighted frequency of recording can be designed according to different requirements, increase the purposes of short-sighted frequency, improve The service efficiency of video.
The embodiments of the invention provide a kind of storage device, wherein a plurality of instruction that is stored with, the instruction is suitable to by handling Device is loaded and performed:Obtain multimedia file;Decode the video data and voice data in the multimedia file;Regarded to described Frequency renders processing according to progress, and carries out track processing to the voice data;After the video data after processing and processing Voice data encoded, obtain multimedia video.
The embodiments of the invention provide a kind of mobile terminal, including processor, various instructions are adapted for carrying out;And storage is set Standby, suitable for storing a plurality of instruction, the instruction is suitable to be loaded and performed by processor:Obtain multimedia file;Decoding is described more Video data and voice data in media file;The video data is carried out to render processing, and to the voice data Carry out track processing;Video data after processing and the voice data after processing are encoded, multimedia video is obtained.
In the above-described embodiments, the description to each embodiment all emphasizes particularly on different fields, and does not have the portion being described in detail in some embodiment Point, it may refer to the associated description of other embodiment.
It is understood that the correlated characteristic in the above method and device can be referred to mutually.In addition, in above-described embodiment " first ", " second " etc. be to be used to distinguish each embodiment, and do not represent the quality of each embodiment.
It is apparent to those skilled in the art that, for convenience and simplicity of description, the system of foregoing description, The specific work process of device and unit, may be referred to the corresponding process in preceding method embodiment, will not be repeated here.
Algorithm and display be not inherently related to any certain computer, virtual system or miscellaneous equipment provided herein. Various general-purpose systems can also be used together with based on teaching in this.As described above, construct required by this kind of system Structure be obvious.In addition, the present invention is not also directed to any certain programmed language.It is understood that, it is possible to use it is various Programming language realizes the content of invention described herein, and the description done above to language-specific is to disclose this hair Bright preferred forms.
In the specification that this place is provided, numerous specific details are set forth.It is to be appreciated, however, that the implementation of the present invention Example can be put into practice in the case of these no details.In some instances, known method, structure is not been shown in detail And technology, so as not to obscure the understanding of this description.
Similarly, it will be appreciated that in order to simplify the disclosure and help to understand one or more of each inventive aspect, exist Above in the description of the exemplary embodiment of the present invention, each feature of the invention is grouped together into single implementation sometimes In example, figure or descriptions thereof.However, the method for the disclosure should be construed to reflect following intention:It is i.e. required to protect The application claims of shield features more more than the feature being expressly recited in each claim.More precisely, such as following Claims reflect as, inventive aspect is all features less than single embodiment disclosed above.Therefore, Thus the claims for following embodiment are expressly incorporated in the embodiment, wherein each claim is in itself All as the separate embodiments of the present invention.
Those skilled in the art, which are appreciated that, to be carried out adaptively to the module in the equipment in embodiment Change and they are arranged in one or more equipment different from the embodiment.Can be the module or list in embodiment Member or component be combined into a module or unit or component, and can be divided into addition multiple submodule or subelement or Sub-component.In addition at least some in such feature and/or process or unit exclude each other, it can use any Combination is disclosed to all features disclosed in this specification (including adjoint claim, summary and accompanying drawing) and so to appoint Where all processes or unit of method or equipment are combined.Unless expressly stated otherwise, this specification (including adjoint power Profit is required, summary and accompanying drawing) disclosed in each feature can or similar purpose identical, equivalent by offer alternative features come generation Replace.
Although in addition, it will be appreciated by those of skill in the art that some embodiments described herein include other embodiments In included some features rather than further feature, but the combination of the feature of be the same as Example does not mean in of the invention Within the scope of and form different embodiments.For example, in the following claims, times of embodiment claimed One of meaning mode can be used in any combination.
The present invention all parts embodiment can be realized with hardware, or with one or more processor run Software module realize, or realized with combinations thereof.It will be understood by those of skill in the art that can use in practice Microprocessor or digital signal processor (DSP) come realize multimedia video according to embodiments of the present invention edit methods and The some or all functions of some or all parts in device.The present invention is also implemented as being used to perform being retouched here The some or all equipment or program of device (for example, computer program and computer program product) for the method stated. Such program for realizing the present invention can be stored on a computer-readable medium, or can have one or more signal Form.Such signal can be downloaded from internet website and obtained, either on carrier signal provide or with it is any its He provides form.
It should be noted that the present invention will be described rather than limits the invention for above-described embodiment, and ability Field technique personnel can design alternative embodiment without departing from the scope of the appended claims.In the claims, Any reference symbol between bracket should not be configured to limitations on claims.Word "comprising" is not excluded the presence of not Element or step listed in the claims.Word "a" or "an" before element does not exclude the presence of multiple such Element.The present invention can be by means of including the hardware of some different elements and coming real by means of properly programmed computer It is existing.In if the unit claim of equipment for drying is listed, several in these devices can be by same hardware branch To embody.The use of word first, second, and third does not indicate that any order.These words can be explained and run after fame Claim.
Embodiment of the invention discloses that:
A1, a kind of edit methods of multimedia video, including:
Obtain multimedia file;
Decode the video data and voice data in the multimedia file;
The video data is carried out to render processing, and track processing is carried out to the voice data;
Video data after processing and the voice data after processing are encoded, multimedia video is obtained.
A2, the method according to A1, it is described that the video data is carried out to render processing, and to the voice data Carrying out track processing includes:
Receive in the process instruction of user's input, the process instruction and carry effective mark;
Video effect mark in being identified according to the effect renders the video data, and according in effect mark The audio frequency effect mark processing voice data.
Video effect mark in A3, the method according to A2, the mark according to the effect renders the video Data include:
The view data of each frame in the video data is extracted, and filter processing is carried out to described image data;
Target image after identification filter processing in view data is identified according to the video effect, and to the target figure Rendered as carrying out synthesis.
A4, the method according to A3, after the mark identification filter processing according to the video effect in view data Target image, and to the target image carry out synthesis render including:
If identifying, the video effect is designated synthetic stereo image, splits the target image, according to preset Color rule is to the target image after the target image, the segmentation and renders image and carries out coloring synthesis, described preset Color rule is used to react the target image after the target image, the segmentation, the position display pass rendered between image System.
Audio frequency effect mark in A5, the method according to A2, the mark according to the effect handles the audio Data include:
The discrete audio track data in the voice data is gathered according to prefixed time interval;
The discrete audio track data is effectively superimposed with default track according to audio frequency effect mark.
Video data and voice data in A6, the method according to A1, the decoding multimedia file include:
Decode the video data and voice data in the multimedia file respectively according to video track and audio track.
A7, the method according to A1, it is described that the video data is carried out to render processing, and to the voice data Carry out after track processing, methods described also includes:
When receiving live preview request, the video data and the voice data are shown.
A8, the method according to A1, methods described also include:
Speed adjust instruction is received, is adjusted according to the velocity information carried in the speed adjust instruction in multimedia video The broadcasting speed of video data and voice data.
B9, a kind of editing device of multimedia video, including:
Acquiring unit, for obtaining multimedia file;
Decoding unit, for decoding video data and voice data in the multimedia file;
Processing unit, for carrying out rendering processing to the video data, and is carried out at track to the voice data Reason;
Coding unit, for the video data after processing and the voice data after processing to be encoded, obtains multimedia Video.
B10, the device according to B9, the processing unit include:
Effective mark is carried in receiving module, the process instruction for receiving user's input, the process instruction;
Processing module, identifies for the video effect in effect mark and renders the video data, and according to The audio frequency effect mark processing voice data in the effect mark.
B11, the device according to B10, the processing module include:
Extracting sub-module, the view data for extracting each frame in the video data, and described image data are entered The processing of row filter;
Submodule is synthesized, for identifying the target figure after identification filter processing in view data according to the video effect Picture, and target image progress synthesis is rendered.
B12, the device according to B11,
The synthesis submodule, if specifically for identifying that the video effect is designated synthetic stereo image, splitting The target image, according to preset coloring rule is to the target image after the target image, the segmentation and renders image Coloring synthesis is carried out, the preset coloring rule is used to react target image, the wash with watercolours after the target image, the segmentation Contaminate the position display relation between image.
B13, the device according to B10, the processing module also include:
Submodule is gathered, for gathering the discrete audio track data in the voice data according to prefixed time interval;
Submodule is superimposed, for being had the discrete audio track data and default track according to audio frequency effect mark Effect superposition.
B14, the device according to B9,
The decoding unit, specifically for being decoded respectively in the multimedia file according to video track and audio track Video data and voice data.
B15, the device according to B9, described device also include:
Display unit, for when receiving live preview request, showing the video data and the voice data.
B16, the device according to B9, described device also include:
Adjustment unit, for receiving speed adjust instruction, and according to the velocity information carried in the speed adjust instruction Adjust the broadcasting speed of video data and voice data in multimedia video.
C17, a kind of storage device, wherein a plurality of instruction that is stored with, the instruction is suitable to be loaded and performed by processor:
Obtain multimedia file;
Decode the video data and voice data in the multimedia file;
The video data is carried out to render processing, and track processing is carried out to the voice data;
Video data after processing and the voice data after processing are encoded, multimedia video is obtained.
D18, a kind of mobile terminal, including processor, are adapted for carrying out various instructions;And storage device, it is many suitable for storing Bar is instructed, and the instruction is suitable to be loaded and performed by processor:
Obtain multimedia file;
Decode the video data and voice data in the multimedia file;
The video data is carried out to render processing, and track processing is carried out to the voice data;
Video data after processing and the voice data after processing are encoded, multimedia video is obtained.

Claims (10)

1. a kind of edit methods of multimedia video, it is characterised in that including:
Obtain multimedia file;
Decode the video data and voice data in the multimedia file;
The video data is carried out to render processing, and track processing is carried out to the voice data;
Video data after processing and the voice data after processing are encoded, multimedia video is obtained.
2. according to the method described in claim 1, it is characterised in that described that the video data is carried out to render processing, and Track processing is carried out to the voice data to be included:
Receive in the process instruction of user's input, the process instruction and carry effective mark;
Video effect mark in being identified according to the effect renders the video data, and the sound in effect mark The yupin effect mark processing voice data.
3. method according to claim 2, it is characterised in that the video effect in the mark according to the effect is identified Rendering the video data includes:
The view data of each frame in the video data is extracted, and filter processing is carried out to described image data;
Target image after identification filter processing in view data is identified according to the video effect, and the target image is entered Row synthesis is rendered.
4. method according to claim 3, it is characterised in that described according to video effect mark identification filter processing Target image in view data afterwards, and the target image is carried out synthesis render including:
If identifying, the video effect is designated synthetic stereo image, splits the target image, is advised according to preset coloring Then to the target image after the target image, the segmentation and render image carry out coloring synthesis, it is described it is preset coloring rule Then it is used to react the target image after the target image, the segmentation, the position display relation rendered between image.
5. a kind of editing device of multimedia video, it is characterised in that including:
Acquiring unit, for obtaining multimedia file;
Decoding unit, for decoding video data and voice data in the multimedia file;
Processing unit, for carrying out rendering processing to the video data, and carries out track processing to the voice data;
Coding unit, for the video data after processing and the voice data after processing to be encoded, obtains multimedia video.
6. device according to claim 5, it is characterised in that the processing unit includes:
Effective mark is carried in receiving module, the process instruction for receiving user's input, the process instruction;
Processing module, the video data is rendered for the video effect mark in effect mark, and according to described The audio frequency effect mark processing voice data in effect mark.
7. device according to claim 6, it is characterised in that the processing module includes:
Extracting sub-module, the view data for extracting each frame in the video data, and described image data are filtered Mirror processing;
Submodule is synthesized, for identifying the target image after identification filter processing in view data according to the video effect, and Synthesis is carried out to the target image to render.
8. device according to claim 7, it is characterised in that
The synthesis submodule, if specifically for identifying that the video effect is designated synthetic stereo image, segmentation is described Target image, according to preset coloring rule is to the target image after the target image, the segmentation and renders image progress Coloring synthesis, the preset coloring rule is used to react the target image after the target image, the segmentation, described renders figure Position display relation as between.
9. a kind of storage device, wherein a plurality of instruction that is stored with, the instruction is suitable to be loaded and performed by processor:
Obtain multimedia file;
Decode the video data and voice data in the multimedia file;
The video data is carried out to render processing, and track processing is carried out to the voice data;
Video data after processing and the voice data after processing are encoded, multimedia video is obtained.
10. a kind of mobile terminal, including processor, are adapted for carrying out various instructions;And storage device, suitable for storing a plurality of refer to Order, the instruction is suitable to be loaded and performed by processor:
Obtain multimedia file;
Decode the video data and voice data in the multimedia file;
The video data is carried out to render processing, and track processing is carried out to the voice data;
Video data after processing and the voice data after processing are encoded, multimedia video is obtained.
CN201710566432.2A 2017-07-12 2017-07-12 Multimedia video editing method and device Active CN107241646B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710566432.2A CN107241646B (en) 2017-07-12 2017-07-12 Multimedia video editing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710566432.2A CN107241646B (en) 2017-07-12 2017-07-12 Multimedia video editing method and device

Publications (2)

Publication Number Publication Date
CN107241646A true CN107241646A (en) 2017-10-10
CN107241646B CN107241646B (en) 2020-08-14

Family

ID=59990913

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710566432.2A Active CN107241646B (en) 2017-07-12 2017-07-12 Multimedia video editing method and device

Country Status (1)

Country Link
CN (1) CN107241646B (en)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108234479A (en) * 2017-12-29 2018-06-29 北京百度网讯科技有限公司 For handling the method and apparatus of information
CN109168027A (en) * 2018-10-25 2019-01-08 北京字节跳动网络技术有限公司 Instant video methods of exhibiting, device, terminal device and storage medium
CN109543560A (en) * 2018-10-31 2019-03-29 百度在线网络技术(北京)有限公司 Dividing method, device, equipment and the computer storage medium of personage in a kind of video
CN109587552A (en) * 2018-11-26 2019-04-05 Oppo广东移动通信有限公司 Video personage sound effect treatment method, device, mobile terminal and storage medium
CN111343499A (en) * 2018-12-18 2020-06-26 北京奇虎科技有限公司 Video synthesis method and device
CN111355960A (en) * 2018-12-21 2020-06-30 北京字节跳动网络技术有限公司 Method and device for synthesizing video file, mobile terminal and storage medium
CN111460183A (en) * 2020-03-30 2020-07-28 北京金堤科技有限公司 Multimedia file generation method and device, storage medium and electronic equipment
CN111866404A (en) * 2019-04-25 2020-10-30 华为技术有限公司 Video editing method and electronic equipment
WO2021052130A1 (en) * 2019-09-17 2021-03-25 西安中兴新软件有限责任公司 Video processing method, apparatus and device, and computer-readable storage medium
CN113315928A (en) * 2021-05-25 2021-08-27 南京慕映影视科技有限公司 Multimedia file making system and method
WO2022017006A1 (en) * 2020-07-22 2022-01-27 Oppo广东移动通信有限公司 Video processing method and apparatus, and terminal device and computer-readable storage medium
CN114007077A (en) * 2021-11-17 2022-02-01 北京百度网讯科技有限公司 Multimedia resource processing method and device, electronic equipment and storage medium
CN114979766A (en) * 2022-05-11 2022-08-30 深圳市大头兄弟科技有限公司 Audio and video synthesis method, device, equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080002942A1 (en) * 2006-05-24 2008-01-03 Peter White Method and apparatus for creating a custom track
CN102638658A (en) * 2012-03-01 2012-08-15 盛乐信息技术(上海)有限公司 Method and system for editing audio-video
CN103049908A (en) * 2012-12-10 2013-04-17 北京百度网讯科技有限公司 Method and device for generating stereoscopic video file
CN103327361A (en) * 2012-11-22 2013-09-25 中兴通讯股份有限公司 Method, device and system for obtaining real-time video communication playback data flow
CN104732593A (en) * 2015-03-27 2015-06-24 厦门幻世网络科技有限公司 Three-dimensional animation editing method based on mobile terminal
CN106373170A (en) * 2016-08-31 2017-02-01 北京云图微动科技有限公司 Video making method and video making device

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080002942A1 (en) * 2006-05-24 2008-01-03 Peter White Method and apparatus for creating a custom track
CN102638658A (en) * 2012-03-01 2012-08-15 盛乐信息技术(上海)有限公司 Method and system for editing audio-video
CN103327361A (en) * 2012-11-22 2013-09-25 中兴通讯股份有限公司 Method, device and system for obtaining real-time video communication playback data flow
CN103049908A (en) * 2012-12-10 2013-04-17 北京百度网讯科技有限公司 Method and device for generating stereoscopic video file
CN104732593A (en) * 2015-03-27 2015-06-24 厦门幻世网络科技有限公司 Three-dimensional animation editing method based on mobile terminal
CN106373170A (en) * 2016-08-31 2017-02-01 北京云图微动科技有限公司 Video making method and video making device

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108234479A (en) * 2017-12-29 2018-06-29 北京百度网讯科技有限公司 For handling the method and apparatus of information
CN109168027B (en) * 2018-10-25 2020-12-11 北京字节跳动网络技术有限公司 Instant video display method and device, terminal equipment and storage medium
CN109168027A (en) * 2018-10-25 2019-01-08 北京字节跳动网络技术有限公司 Instant video methods of exhibiting, device, terminal device and storage medium
CN109543560A (en) * 2018-10-31 2019-03-29 百度在线网络技术(北京)有限公司 Dividing method, device, equipment and the computer storage medium of personage in a kind of video
CN109587552A (en) * 2018-11-26 2019-04-05 Oppo广东移动通信有限公司 Video personage sound effect treatment method, device, mobile terminal and storage medium
CN111343499A (en) * 2018-12-18 2020-06-26 北京奇虎科技有限公司 Video synthesis method and device
CN111355960B (en) * 2018-12-21 2021-05-04 北京字节跳动网络技术有限公司 Method and device for synthesizing video file, mobile terminal and storage medium
CN111355960A (en) * 2018-12-21 2020-06-30 北京字节跳动网络技术有限公司 Method and device for synthesizing video file, mobile terminal and storage medium
CN111866404A (en) * 2019-04-25 2020-10-30 华为技术有限公司 Video editing method and electronic equipment
CN111866404B (en) * 2019-04-25 2022-04-29 华为技术有限公司 Video editing method and electronic equipment
WO2021052130A1 (en) * 2019-09-17 2021-03-25 西安中兴新软件有限责任公司 Video processing method, apparatus and device, and computer-readable storage medium
CN111460183B (en) * 2020-03-30 2024-02-13 北京金堤科技有限公司 Method and device for generating multimedia file, storage medium and electronic equipment
CN111460183A (en) * 2020-03-30 2020-07-28 北京金堤科技有限公司 Multimedia file generation method and device, storage medium and electronic equipment
WO2022017006A1 (en) * 2020-07-22 2022-01-27 Oppo广东移动通信有限公司 Video processing method and apparatus, and terminal device and computer-readable storage medium
CN113315928A (en) * 2021-05-25 2021-08-27 南京慕映影视科技有限公司 Multimedia file making system and method
CN113315928B (en) * 2021-05-25 2022-03-22 南京慕映影视科技有限公司 Multimedia file making system and method
CN114007077A (en) * 2021-11-17 2022-02-01 北京百度网讯科技有限公司 Multimedia resource processing method and device, electronic equipment and storage medium
CN114007077B (en) * 2021-11-17 2023-09-01 北京百度网讯科技有限公司 Method and device for processing multimedia resources, electronic equipment and storage medium
CN114979766B (en) * 2022-05-11 2023-11-21 深圳市闪剪智能科技有限公司 Audio and video synthesis method, device, equipment and storage medium
CN114979766A (en) * 2022-05-11 2022-08-30 深圳市大头兄弟科技有限公司 Audio and video synthesis method, device, equipment and storage medium

Also Published As

Publication number Publication date
CN107241646B (en) 2020-08-14

Similar Documents

Publication Publication Date Title
CN107241646A (en) The edit methods and device of multimedia video
CN110457994B (en) Face image generation method and device, storage medium and computer equipment
US11736769B2 (en) Content filtering in media playing devices
US10803851B2 (en) Method and apparatus for processing speech splicing and synthesis, computer device and readable medium
CN107172476A (en) A kind of system and implementation method of interactive script recorded video resume
CN108377418A (en) A kind of video labeling treating method and apparatus
CN109496295A (en) Multimedia content generation method, device and equipment/terminal/server
JP2022550372A (en) Method and system for creating binaural immersive audio for audiovisual content
CN105872786B (en) A kind of method and device for launching advertisement by barrage in a program
CN112512649B (en) Techniques for providing audio and video effects
CN111405381A (en) Online video playing method, electronic device and computer readable storage medium
CN110400254A (en) A kind of lipstick examination cosmetic method and device
CN107633029A (en) A kind of method and device for showing electronic document
CN109241323B (en) The method for generating user's poster is commented on based on e-book and calculates equipment
CN106653077A (en) Method and device for recording voice notes as well as readable storage medium
De Lima et al. Video-based interactive storytelling using real-time video compositing techniques
CN113038185A (en) Bullet screen processing method and device
CN105872827A (en) Live broadcast method and device of application interface in mobile terminal
CN106792155A (en) A kind of method and device of the net cast of multiple video strems
CN110797001A (en) Method and device for generating voice audio of electronic book and readable storage medium
CN107135399A (en) Interactive control method, equipment and the computer-readable recording medium of video
CN106792219B (en) It is a kind of that the method and device reviewed is broadcast live
CN115690277A (en) Video generation method, system, device, electronic equipment and computer storage medium
CN105869447A (en) Generating method and device of audiobook
US20180330167A1 (en) Personalized Augmented Reality

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant