WO2009137919A1 - Video mix-down - Google Patents

Video mix-down Download PDF

Info

Publication number
WO2009137919A1
WO2009137919A1 PCT/CA2009/000647 CA2009000647W WO2009137919A1 WO 2009137919 A1 WO2009137919 A1 WO 2009137919A1 CA 2009000647 W CA2009000647 W CA 2009000647W WO 2009137919 A1 WO2009137919 A1 WO 2009137919A1
Authority
WO
WIPO (PCT)
Prior art keywords
mixed
video
frames
mixing
application
Prior art date
Application number
PCT/CA2009/000647
Other languages
French (fr)
Inventor
Richard Gratton
Keith Peters
Chris Allen
Simon Lord
Original Assignee
Xtranormal Technologie Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xtranormal Technologie Inc. filed Critical Xtranormal Technologie Inc.
Publication of WO2009137919A1 publication Critical patent/WO2009137919A1/en

Links

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/02Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
    • G11B27/031Electronic editing of digitised analogue information signals, e.g. audio or video signals

Definitions

  • the present invention relates to the field of creating videos comprising multimedia content.
  • a method for creating a video comprising: mixing down a plurality of media assets in order to obtain at least two mixed-down frames on a computer; transmitting the at least two mixed-down frames from the computer to a server; 0401
  • a system for creating a video comprising: a processor in a server; a memory accessible by the processor; communication means for receiving and transmitting data; and an application coupled to the processor, the application being configured for: receiving at least two mixed-down frames created from at least two media assets; mixing down the at least two mixed- down frames in order to obtain a mixed-down video and encoding the mixed-down video in a format adapted to be delivered as video over an Internet; and returning the mixed- down video to the computer via the communication means .
  • a system for creating a video comprising: a processor in a computer; a memory accessible by the processor; communication means for receiving and transmitting data,- and an application coupled to the processor, the application being configured for: mixing down a plurality of media assets stored in the memory in order to obtain at least two mixed-down frames; transmitting the at least two mixed-down frames to a server via the communication means,- and receiving a mixed-down video from the server via the communication means, the mixed-down video resulting from mixing down the at least two mixed-down frames and being encoded in a format adapted to be delivered as video over an Internet. 0401
  • a method for creating a video comprising: receiving, at a server and from a remote computer, at least two mixed- down frames created from at least two media assets; mixing down the at least two mixed-down frames in order to obtain a mixed-down video and encoding the mixed-down video in a format adapted to be delivered as video over an Internet; and returning the mixed-down video to the remote computer.
  • a method for creating a video comprising: mixing down, at a remote computer, a plurality of media assets stored in a memory in order to obtain at least two mixed-down frames; transmitting the at least two mixed-down frames to a server; and receiving, at the remote computer, a mixed-down video from the server, the mixed-down video resulting from mixing down the at least two mixed-down frames and being encoded in a format adapted to be delivered as video over an
  • media asset refers to visual and/or audio elements which can be stored in a memory of a computer.
  • media assets are video-only assets such as silent videos and movies, audio-only assets such as audio tracks, video-and-audio assets such as talking videos and movies, photographs, images, text, graphics, and the like.
  • frame refers to a single shot or sequence of a media asset.
  • a film frame is one of the many single photographic images in a motion picture. Usually, 24 film frames are needed for one second of film, however embodiments of the present invention should not be limited as such.
  • An audio sample is another example of a frame .
  • mixed-down refers to the action of combining several elements such as media assets and frames of media assets in order to obtain a single element.
  • a mixed-down frame is the result of the combination of several frames. Mixing-down several video frames results in a video and mixing-down several audio frames results in an audio track.
  • rendering is used to describe the process of calculating effects in a video editing file to produce final video output.
  • FIG. 1 is a flow chart illustrating a method of creating a video, in accordance with an embodiment of the present invention
  • FIG. 2 is a block diagram illustrating the splitting of a video into frames, in accordance with an embodiment of the present invention
  • FIG. 3 is a block diagram illustrating a mix-down of media assets in order to obtain a mixed-down frame, in accordance with an embodiment of the present invention
  • FIG. 4 is a block diagram illustrating a mix-down of mixed-down frames in order to obtain a mixed-down video, in accordance with an embodiment of the present invention
  • FIG. 5 is a block diagram illustrating the insertion of effects in a mixed-down video, in accordance with an embodiment of the present invention
  • Fig. 6 is a flow chart illustrating a method of creating a mixed-down video comprising a simplification step and a reconstruction step, in accordance with an embodiment of the present invention
  • Fig. 7 is a block diagram illustrating the simplification of mixed-down frames, in accordance with an embodiment of the present invention.
  • Fig. 8 is a block diagram illustrating an Edit Decision List, in accordance with an embodiment of the present invention.
  • FIG. 9 is a block diagram illustrating the reconstruction of mixed-down frames, in accordance with an embodiment of the present invention.
  • FIG. 10 is block diagram of a system for creating a mixed-down video, in accordance with an embodiment of the present invention.
  • Figure 1 illustrates a method 10 of creating a video according to one embodiment.
  • a user having a computer wants to create a personalized video.
  • the user uses a 0401
  • the browser-based application to create the video and connects to a server.
  • the application is accessed via web browser over a network such as the Internet or an intranet.
  • the user wants to add sound effects or a background image to an original video.
  • the original video may be a video filmed by the user or a video downloaded from the Internet, for example.
  • the original video and the media assets to be added to the video are stored in a memory of the user's computer. Alternatively, the original video and the media assets can be on the server.
  • the first step 12 of the method 10 is the creation of at least two mixed-down frames by combining the different media assets including the original video.
  • Frame mix-down refers to taking multiple media assets composed and/or edited into a single shot or sequence and combining them to obtain a single frame.
  • the original video is divided frame by frame, as illustrated in figure 2. If required, the media assets to be added to the original video are also divided frame by frame.
  • Figure 3 illustrates the creation of a mix- down for one frame 50a of the original video. For each frame 50a-e of the original video, the user superimposes a graphic (and/or text) 52, a picture 54, a background image 56 and/or an audio track 60. The superposition results in a mixed-down frame 60 which is encoded in a format adapted to be delivered as video over the Internet.
  • figure 1 refers to the creation of a personalized video using an original video
  • the mixed-down frame could be something other than a video-and-audio frame.
  • the mixed-down frame results from the combination of at least two media assets of the same 0401
  • the mixed-down frame could be a video-only frame or an audio-only frame.
  • An audio-only mixed-down frame results from the mix-down of at least two audio tracks having the same or a different format.
  • a video- only mixed-down frame may result from the mix-down of at least two videos having no sound track, at least two pictures, at least two graphics, or any combination of videos having no sound tracks, pictures and/or graphics.
  • original video 50 is in an flv format with or without alpha channel.
  • Original video 50 may comprise an audio track or not.
  • Graphic 52 is a vector data and may comprise text.
  • Picture 54 and background picture 56 may be in a GIF, PNG, or JPEG format, for example.
  • Audio track 60 may be in an flv format (without video) or any audio format known to a person skilled in the art such as mp3 , wav, ogg, oga, etc.
  • mixed-down frame 60 is a flattened frame, which means that it is in an flv format.
  • Other formats such as mp4 , m4v, m4a, 3gp, mov, f4v, f4p, f4a, f4b, and the like are also possible.
  • the next step 14 is the transmission of the mixed-down frames from the user's computer to a server.
  • the server receives the mixed-down frames and stores them in a memory.
  • the server combines the multiple mixed- down frames sequentially to obtain a mixed-down video which is first stored in the server memory before being sent to the user at step 18.
  • the mixed-down video is streamed directly from the server to the user's computer 0401
  • the mixed-down video is encoded in a format adapted to be delivered as video over the Internet.
  • the format of the mixed-down video is standard so that the mixed-down video can be played back by any generic media player.
  • the application that was used to create the mixed-down video is not required to render and play back the mixed-down video.
  • the mixed-down video can be played back without having to mix down the shots or timeline since the rendering has already been done during its creation. Consequently, the mixed-down video can be posted to generic video aggregation sites such as YoutubeTM, Google VideoTM, Yahoo! VideoTM, DailymotionTM, etc. Once the mixed-down video has been posted, a person can play back the mixed-down video since the application that was used to create it is not required.
  • Figure 4 illustrates one embodiment of the creation of a mixed-down video.
  • the server receives n mixed-down frames 10Oa-IOOn from the user's computer and saves them in memory.
  • the n mixed-down frames 10Oa-IOOn are combined sequentially two by two to result in m flattened videos 101 a-m.
  • the n mixed-down frames can be combined other than two by two . For example they could be combined three by three or four by four, etc.
  • the m flattened video 101a-m are combined sequentially to result in a single mixed-down video 102.
  • Mixed-down video 102 is encoded in a format adapted to be delivered as video over the Internet .
  • the mixed-down video 102 is created in a single step.
  • the n mixed-down frames 100a-n are 0401
  • the mixed-down video 102 is a flattened video, which means that it is in an flv format.
  • Other standard formats allowing the mixed-down video to be played back by any generic media player are also possible. Examples of other standard formats are mp4 , m4v, m4a, 3gp, mov, f4v, f4p, f4a, f4b, etc.
  • the user may insert effects into the video 102 as illustrated in figure 5.
  • Multiple effects can be inserted into the mixed-down video 102 such as track effects 150, 152, 154, transition effects 156 and audio effect 158.
  • Transition effects 156 are inserted between two successive frames of the mixed-down video 102. Because of the transition effects 156, two successive frames of the mixed-down video 102 overlap during the playback of the mixed-down video 102 and effects such as wiping or dissolving appear between the frames.
  • Audio effect 158 can be an audio bed behind the mixed-down video 102 and acting as background sound, for example. Other audio effects such as echo, flanger, filtering, chorus, and the like can be added.
  • Track effects 150, 152, and 154 refer to bitmap effects which have a defined duration independent of the duration of the frames constituting the mixed-down video 102.
  • Sepia tone effect, blur effect, saturation effect, brightness effect, grayscale effect, and the like are examples of track effects 150, 152, and 154.
  • the final mixed-down video comprising effects is encoded in a format adapted to be delivered as video over the Internet.
  • figure 5 refers to the insertion of effects once the mixed-down video has been created
  • the step of creating the mixed-down video using the mixed-down frames and the step of inserting effects can be combined into a single step.
  • the effects are inserted during the step of mixing down the mixed-down frames illustrated in figure 4.
  • the video obtained at the end of the creation of the mixed-down video is a mixed- down video comprising effects and being encoded in a format adapted to be delivered as video over the Internet.
  • the server offers a panel of effects that can be added during the creation of the mixed- down video.
  • the effects can be in an application on the user's computer and they are uploaded to the server.
  • the server returns in real-time the mixed-down video to the user.
  • the user can see in realtime the effects added to the video and change them if they are not satisfactory.
  • the user does not need to wait until the end of the process to view the mixed-down video.
  • the server returns the mixed-down video in the form of a video file in a format adapted to be delivered as video over the Internet. 0401
  • Figure 6 illustrates one embodiment of a method 170 of creating a mixed-down video comprising the steps of simplification and reconstruction of the mixed-down frames.
  • audio assets and visual assets are treated separately. If an asset such as a video contains both audio and video assets, this asset is broken down into an audio-only asset and a video-only asset. Any method known to a person skilled in the art to break down a video into a video-only asset and an audio-only asset may be used.
  • the first step is the creation of the video mixed-down frames using multiple media assets.
  • video-only mixed-down frames are created using the method illustrated in figure 3.
  • the audio-only assets are split in audio-only frames which are uploaded to the server if they are located on the user's computer.
  • the audio-only frames can be located on the server and the user selects the ones to be inserted into the mixed-down video.
  • Video-only mixed-down frames are encoded in a format adapted to be delivered as video over the Internet.
  • Audio frames can be in an audio format such as mp3 or wav format, for example. It should be noted that the audio frames may contain a single audio asset such as the soundtrack of a video, which is uploaded to the server from the user's computer or provided on the server.
  • the next step 174 is the simplification of the video-only mixed-down frames, of which an embodiment is illustrated in figure 7.
  • the creation of video-only mixed-down frames results in video-only mixed-down frame 200, video-only mixed- down frame 202 and video-only mixed-down frame 204 which are stored in the memory of the user's computer.
  • Video-only mixed-down frames 200, 202, 204 are subsequently converted into bitmaps which contain a plurality of bits such as tiles. Tiles of one bitmap are compared to tiles of a preceding bitmap and are stored in memory only if they are different.
  • video-only mixed-down frame 202 is converted to bitmap 206 which includes 9 tiles Tn to Ti 9 .
  • Each tile T ⁇ j of a bitmap represents a part of the corresponding video-only mixed-down frame and contains all data corresponding to that part of the video-only mixed-down frame.
  • Index i refers to the number of the video-only mixed- down frame and index j refers to the position of the tile in the bitmap.
  • Bitmap 208 is sent from the user's computer to the server. In the case of the first video-only mixed-down frame 200, all tiles Tn to Ti 9 of bitmap 208 are sent to the server.
  • the second video-only mixed-down frame 202 is also converted to a bitmap 210 comprising tiles T 21 to T 29 .
  • the tiles of bitmap 210 are compared to their corresponding tiles of the preceding bitmap 206.
  • Tile Tn is compared to tile T 2x
  • tile T 12 is compared to tile T 22
  • tile T 13 is compared to tile T 23 , etc.
  • Bitmap 212 represents the bitmap corresponding to video-only mixed-down frame 202 that is sent to the server.
  • Bitmap 212 only includes tiles T 25 and T 29 of bitmap 210 which are different from their corresponding tiles of bitmap 206. The same steps are performed for video-only mixed-down frame 15750401
  • Bitmap 214 includes tiles T 3I to T 39 which are compared to tiles T 2i to T 29 of bitmap 210. Only the tiles that are different are saved in bitmap 216 which includes tiles T 3I , T 37 , and T 39 . Bitmap 216 5 is then sent to the server.
  • tiles Tn to Ti 9 , T 25 , T 29 , T 31 , T 37 , and T 39 are compressed before being sent to the server.
  • the audio-only frames are0 sent to the server where they are mixed-down together in order to obtain an audio track for the mixed-down video (step 182) .
  • the audio-only frames are selected by the user amongst a selection of audio frames available on the server. This audio track is saved in memory as an audio file.5 Instructions relative to the audio-only frames are also sent to the server. These instructions may contain the order and timing of each audio-only frame, for example. It should be understood that the audio-only frames can be compressed before they are sent to the server and subsequently 0 decompressed by the server.
  • the instructions relative to the audio-only frames are comprised in an Edit Decision List
  • EDL of which an example is illustrated in figure 8.
  • the EDL is transmitted to the server prior to the creation of the5 mixed-down video.
  • the EDL can be sent to the server via Real Time Messaging Protocol in an amf3 format.
  • the mixed-down video is a silent video. Therefore, the steps 180 and 182 are not performed and the mixed-down video results from the mix- down of video-only mixed-down frames on the server.
  • bitmaps 208, 212, and 216 are sent to the server and stored in the server's memory. If received tiles Tn to Ti 9 , T 25 , T 29 , T 3I , T 37 , and T 39 are compressed, the server decompresses them before storing them in memory.
  • the server reconstructs a complete bitmap using tiles of a preceding bitmap, as illustrated in Figure 9. As bitmap 208 is complete (each position of the bitmap contains a tile) , the server recreates video-only mixed-down frame 200 and stores it in memory. Bitmap 212 is incomplete as only tile T 25 and T 29 are present.
  • the server fills the missing tiles T 2 i, T 22 , T 23 , T 24 , T 26 , T 27 , and T 28 with their corresponding tiles in the preceding bitmap 208.
  • Tn substitutes T 21
  • T 12 substitutes T 22
  • Ti 3 substitutes T 23 , etc, to give rise to bitmap 220.
  • Bitmap 216 is also incomplete as only tiles T 31 , T 37 and T 39 are present. The missing tiles are taken from the preceding bitmap 220.
  • Tile Ti 2 substitutes tile T 32 , tile T 13 substitutes tile T 33 , tile T 14 substitutes tile T 34 , tile T 25 substitutes tile T 35 , tile Ti6 substitutes tile T 36 , and tile T 18 substitutes tile T 38 .
  • This process results in complete bitmap 222 from which video- only mixed-down frame 204 can be reconstructed and stored in memory . 0401
  • figures 7 and 9 refer to dividing video-only mixed-down frames into tiles, it should be understood that any method for dividing the video-only mixed-down frames can be used. For example, they can be divided in columns or rows. The number of tiles, columns, or rows is chosen as part of a design choice. It should also be understood that the number of mixed-down frames 200, 202, 204 is exemplary only and can equal to any integer superior or equal to two.
  • the next step 184 is the creation of a mixed-down video using these reconstructed video-only mixed-down frames and the audio track as illustrated in figure 5. Effects can also be added to the video either during the creation of the mixed-down video or after the creation of the mixed-down video.
  • the resulting mixed-down video is encoded in a format adapted to be delivered as video over the Internet. No rendering process is required to play back the mixed-down video.
  • the last step 186 consists in returning the mixed-down video to the user.
  • the creation of the mixed-down video occurs in several steps.
  • a video track is created using the video-only mixed-down frames according to the method illustrated in figure 4.
  • the method illustrated in figure 5 can be used and effects are added to the video track. Sound effects can be added to the audio track created using the audio-only mixed-down frames.
  • the last step is the combination of the audio track with the video track to obtain the mixed-down video. 0401
  • Figure 10 illustrates one embodiment of a system 300 for creating a mixed-down video.
  • the system comprises a computer 302 in connection with a server 302.
  • Computer 302 and server 304 can be connected via an Internet connection or an intranet connection, for example.
  • Computer 302 is also connected to a monitor 320 and a command unit 322.
  • Computer 302 comprises a processor 306, a memory 308 and a communication unit 310.
  • Media assets used to create the mixed-down video are stored in memory 308.
  • An application coupled to processor 306 displays options such as a list of media assets on monitor 320. The user has access to all media assets stored in his computer via the application. A user selects the media assets to be used to create the mixed-down video via command unit 322.
  • Processor 306 has access to all media assets stored in memory 308. Furthermore, processor 306 is adapted to combine the media assets chosen by the user in order to obtain mixed-down frames according to the method illustrated in figure 3. The mixed-down frames are stored in memory 308 before being sent to server 304 through communication unit 310. 0401
  • the user clicks on a "render” button displayed on monitor 320 by the application coupled to processor 306 using command unit 322.
  • the trigger of the "render” button starts the process of creating the mixed-down video .
  • Communication unit 318 of server 304 receives the mixed-down frames and stores them in memory 316.
  • Processor 316 is adapted to combine the mixed-down frames stored in memory 316 in order to create a mixed-down video according to the method illustrated in figure 4.
  • the resulting mixed-down video is encoded in a format adapted to be delivered as video over the Internet and is stored in memory 316 before being sent to computer 302 through communication unit 318 or directly streamed to computer 302 without being saved in memory 316.
  • Communication unit 310 receives the mixed-down video and stores it in memory 308.
  • processor 306 is further connected to communication unit 310 and the mixed-down frames are directly sent to server 304 through communication unit 310 without being stored in memory 308.
  • processor 314 is further connected to communication unit 318 and the mixed-down video is sent directly to computer 302 without being stored in memory 316.
  • processor 314 is further adapted to add effects either during the creation of the mixed-down video or after the creation of the mixed-down video following 0401
  • the application coupled to processor 306 displays a list of effects that can be added to the video.
  • the user selects the effects to be inserted into the mixed-down video via command unit 322.
  • the user can also choose effect parameters such as the duration of an effect, the position of the effect within the video, etc.
  • the effects and their corresponding parameters are sent to server 318 via processor 306 and communication unit 310 prior to the creation the mixed-down video.
  • processor 306 is further adapted to break down media assets in order to obtain audio-only frames and video-only assets according to the method illustrated in figure 6.
  • Processor 306 is also adapted to simplify the video-only mixed-down frames according to the method illustrated in figure 7 and processor 314 is further adapted to reconstruct the video-only mixed-down frames according to the method illustrated in figure 8, to create the audio-only mixed-down frames, and to combine them with the audio-only mixed-down frames in order to obtain a mixed- down video encoded in a format adapted to be delivered as video over the Internet.
  • processor 306 is further adapted to create the audio-only mixed-down frames and to send them to processor 314 via communication units 310 and 318.
  • memory 308 is part of computer 302 in figure 10, it should be noted that memory 308 may be external to computer 302. 0401
  • computer 302 and server 304 are connected via the Internet.
  • the mix-down of media assets and the mix-down of frames is performed by a Flash runtime application.
  • the mixed-down format is encoded in a standard format and can be played on any generic player such as QuickTimeTM, Windows Media PlayerTM, RealplayerTM, KMplayerTM, etc.
  • the mixed-down frames are compressed by the user' s computer before being sent to the server and subsequently decompressed by the server before the creation of the mixed-down video.
  • the mixed-down video is compressed by the server before being sent to the user's computer.
  • the mixed-down video is then decompressed by the user's computer before being played back.
  • any elements transmitted between the computer 302 and the server 304 may be compressed before being sent and subsequently decompressed once received.
  • the processors 306 and 314 are adapted to compress and/or decompress the transmitted elements .

Abstract

There is described a method for creating a video comprising: mixing-down a plurality of media assets in order to obtain at least two mixed-down frames on a computer; transmitting the at least two mixed-down frames from the computer to a server; mixing-down the at least two mixed-down frames in order to obtain a mixed- down video on the server,- and returning the mixed-down video to the computer; the at least two mixed-down frames and the mixed- down video being encoded in a format adapted to be delivered as video over an Internet.

Description

VIDEO MIX-DOWN
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] The present application claims priority under 35 USC§ 119 (e) of Provisional Patent Application bearing serial number 61/052,550, filed on May 12, 2008, the contents of which are hereby incorporated by reference.
TECHNICAL FIELD
[0002] The present invention relates to the field of creating videos comprising multimedia content.
BACKGROUND OF THE INVENTION
[0003] Many web applications to create videos comprising multimedia content exist, but the video can only be displayed by the application used to create it. For example, if such a video is posted on a video sharing website and if the website does not have the application that was used to create the video, the website will not be able to render the video and it will be impossible to play back the video.
[0004] Therefore, there is a need for creating a video that does not require a specific application to be played back.
SUMMARY OF THE INVENTION
[0005] In accordance with a first broad aspect, there is provided a method for creating a video comprising: mixing down a plurality of media assets in order to obtain at least two mixed-down frames on a computer; transmitting the at least two mixed-down frames from the computer to a server; 0401
mixing down on the server the at least two mixed-down frames in order to obtain a mixed-down video; and returning the mixed-down video to the computer encoded in a format adapted to be delivered as video over an Internet.
[0006] In accordance with another broad aspect of the present invention, there is provided a system for creating a video, comprising: a processor in a server; a memory accessible by the processor; communication means for receiving and transmitting data; and an application coupled to the processor, the application being configured for: receiving at least two mixed-down frames created from at least two media assets; mixing down the at least two mixed- down frames in order to obtain a mixed-down video and encoding the mixed-down video in a format adapted to be delivered as video over an Internet; and returning the mixed- down video to the computer via the communication means .
[0007] In accordance with yet another broad aspect of the present invention, there is provided a system for creating a video, comprising: a processor in a computer; a memory accessible by the processor; communication means for receiving and transmitting data,- and an application coupled to the processor, the application being configured for: mixing down a plurality of media assets stored in the memory in order to obtain at least two mixed-down frames; transmitting the at least two mixed-down frames to a server via the communication means,- and receiving a mixed-down video from the server via the communication means, the mixed-down video resulting from mixing down the at least two mixed-down frames and being encoded in a format adapted to be delivered as video over an Internet. 0401
[0008] In accordance with another broad aspect, there is provided a method for creating a video comprising: receiving, at a server and from a remote computer, at least two mixed- down frames created from at least two media assets; mixing down the at least two mixed-down frames in order to obtain a mixed-down video and encoding the mixed-down video in a format adapted to be delivered as video over an Internet; and returning the mixed-down video to the remote computer.
[0009] In accordance with yet another broad aspect, there is provided a method for creating a video comprising: mixing down, at a remote computer, a plurality of media assets stored in a memory in order to obtain at least two mixed-down frames; transmitting the at least two mixed-down frames to a server; and receiving, at the remote computer, a mixed-down video from the server, the mixed-down video resulting from mixing down the at least two mixed-down frames and being encoded in a format adapted to be delivered as video over an
Internet .
[0010] The term "media asset" refers to visual and/or audio elements which can be stored in a memory of a computer. Examples of media assets are video-only assets such as silent videos and movies, audio-only assets such as audio tracks, video-and-audio assets such as talking videos and movies, photographs, images, text, graphics, and the like.
[0011] The term "frame" refers to a single shot or sequence of a media asset. For example, a film frame is one of the many single photographic images in a motion picture. Usually, 24 film frames are needed for one second of film, however embodiments of the present invention should not be limited as such. An audio sample is another example of a frame .
[0012] The term "mix-down" refers to the action of combining several elements such as media assets and frames of media assets in order to obtain a single element. For example, a mixed-down frame is the result of the combination of several frames. Mixing-down several video frames results in a video and mixing-down several audio frames results in an audio track.
[0013] The term "rendering" is used to describe the process of calculating effects in a video editing file to produce final video output.
BRIEF DESCRIPTION OF THE DRAWINGS
[0014] Further features and advantages of the present invention will become apparent from the following detailed description, taken in combination with the appended drawings, in which:
[0015] Fig. 1 is a flow chart illustrating a method of creating a video, in accordance with an embodiment of the present invention;
[0016] Fig. 2 is a block diagram illustrating the splitting of a video into frames, in accordance with an embodiment of the present invention;
[0017] Fig. 3 is a block diagram illustrating a mix-down of media assets in order to obtain a mixed-down frame, in accordance with an embodiment of the present invention;
- 4 - 0401
[0018] Fig. 4 is a block diagram illustrating a mix-down of mixed-down frames in order to obtain a mixed-down video, in accordance with an embodiment of the present invention;
[0019] Fig. 5 is a block diagram illustrating the insertion of effects in a mixed-down video, in accordance with an embodiment of the present invention;
[0020] Fig. 6 is a flow chart illustrating a method of creating a mixed-down video comprising a simplification step and a reconstruction step, in accordance with an embodiment of the present invention;
[0021] Fig. 7 is a block diagram illustrating the simplification of mixed-down frames, in accordance with an embodiment of the present invention;
[0022] Fig. 8 is a block diagram illustrating an Edit Decision List, in accordance with an embodiment of the present invention;
[0023] Fig. 9 is a block diagram illustrating the reconstruction of mixed-down frames, in accordance with an embodiment of the present invention; and
[0024] Fig. 10 is block diagram of a system for creating a mixed-down video, in accordance with an embodiment of the present invention.
DETAILED DESCRIPTION
[0025] Figure 1 illustrates a method 10 of creating a video according to one embodiment. A user having a computer wants to create a personalized video. The user uses a 0401
browser-based application to create the video and connects to a server. The application is accessed via web browser over a network such as the Internet or an intranet. For example, the user wants to add sound effects or a background image to an original video. The original video may be a video filmed by the user or a video downloaded from the Internet, for example. The original video and the media assets to be added to the video are stored in a memory of the user's computer. Alternatively, the original video and the media assets can be on the server. The first step 12 of the method 10 is the creation of at least two mixed-down frames by combining the different media assets including the original video. Frame mix-down refers to taking multiple media assets composed and/or edited into a single shot or sequence and combining them to obtain a single frame. In order to create a personalized video, the original video is divided frame by frame, as illustrated in figure 2. If required, the media assets to be added to the original video are also divided frame by frame. Figure 3 illustrates the creation of a mix- down for one frame 50a of the original video. For each frame 50a-e of the original video, the user superimposes a graphic (and/or text) 52, a picture 54, a background image 56 and/or an audio track 60. The superposition results in a mixed-down frame 60 which is encoded in a format adapted to be delivered as video over the Internet.
[0026] While figure 1 refers to the creation of a personalized video using an original video, it should be understood that the mixed-down frame could be something other than a video-and-audio frame. The mixed-down frame results from the combination of at least two media assets of the same 0401
or different type. For example, the mixed-down frame could be a video-only frame or an audio-only frame. An audio-only mixed-down frame results from the mix-down of at least two audio tracks having the same or a different format. A video- only mixed-down frame may result from the mix-down of at least two videos having no sound track, at least two pictures, at least two graphics, or any combination of videos having no sound tracks, pictures and/or graphics.
[0027] In one embodiment, original video 50 is in an flv format with or without alpha channel. Original video 50 may comprise an audio track or not. Graphic 52 is a vector data and may comprise text. Picture 54 and background picture 56 may be in a GIF, PNG, or JPEG format, for example. Audio track 60 may be in an flv format (without video) or any audio format known to a person skilled in the art such as mp3 , wav, ogg, oga, etc.
[0028] In one embodiment, mixed-down frame 60 is a flattened frame, which means that it is in an flv format. Other formats such as mp4 , m4v, m4a, 3gp, mov, f4v, f4p, f4a, f4b, and the like are also possible.
[0029] Referring back to figure 1, once the mixed-down frames are created, the next step 14 is the transmission of the mixed-down frames from the user's computer to a server. The server receives the mixed-down frames and stores them in a memory. At step 16, the server combines the multiple mixed- down frames sequentially to obtain a mixed-down video which is first stored in the server memory before being sent to the user at step 18. Alternatively, the mixed-down video is streamed directly from the server to the user's computer 0401
without being stored in the server's memory. The mixed-down video is encoded in a format adapted to be delivered as video over the Internet. The format of the mixed-down video is standard so that the mixed-down video can be played back by any generic media player. As a result, the application that was used to create the mixed-down video is not required to render and play back the mixed-down video. The mixed-down video can be played back without having to mix down the shots or timeline since the rendering has already been done during its creation. Consequently, the mixed-down video can be posted to generic video aggregation sites such as Youtube™, Google Video™, Yahoo! Video™, Dailymotion™, etc. Once the mixed-down video has been posted, a person can play back the mixed-down video since the application that was used to create it is not required.
[0030] Figure 4 illustrates one embodiment of the creation of a mixed-down video. The server receives n mixed-down frames 10Oa-IOOn from the user's computer and saves them in memory. The n mixed-down frames 10Oa-IOOn are combined sequentially two by two to result in m flattened videos 101 a-m. It should be understood that the n mixed-down frames can be combined other than two by two . For example they could be combined three by three or four by four, etc. Then the m flattened video 101a-m are combined sequentially to result in a single mixed-down video 102. Mixed-down video 102 is encoded in a format adapted to be delivered as video over the Internet .
[0031] In another embodiment, the mixed-down video 102 is created in a single step. The n mixed-down frames 100a-n are 0401
directly combined sequentially to result in mixed-down video 102.
[0032] In one embodiment, the mixed-down video 102 is a flattened video, which means that it is in an flv format. Other standard formats allowing the mixed-down video to be played back by any generic media player are also possible. Examples of other standard formats are mp4 , m4v, m4a, 3gp, mov, f4v, f4p, f4a, f4b, etc.
[0033] Once the mixed-down video 102 has been created on the server, the user may insert effects into the video 102 as illustrated in figure 5. Multiple effects can be inserted into the mixed-down video 102 such as track effects 150, 152, 154, transition effects 156 and audio effect 158. Transition effects 156 are inserted between two successive frames of the mixed-down video 102. Because of the transition effects 156, two successive frames of the mixed-down video 102 overlap during the playback of the mixed-down video 102 and effects such as wiping or dissolving appear between the frames. Audio effect 158 can be an audio bed behind the mixed-down video 102 and acting as background sound, for example. Other audio effects such as echo, flanger, filtering, chorus, and the like can be added. Track effects 150, 152, and 154 refer to bitmap effects which have a defined duration independent of the duration of the frames constituting the mixed-down video 102. Sepia tone effect, blur effect, saturation effect, brightness effect, grayscale effect, and the like are examples of track effects 150, 152, and 154. In the embodiment illustrated in figure 5, a sepia tone effect 150, a black-and-white effect 152, a blur effect 154, transitions 156 between two successive frames of the mixed-down video 0401
102, and an audio bed 158 are combined with the mixed-down video 102 to give rise to a final mixed-down video. The final mixed-down video comprising effects is encoded in a format adapted to be delivered as video over the Internet.
[0034] While figure 5 refers to the insertion of effects once the mixed-down video has been created, it should be understood that the step of creating the mixed-down video using the mixed-down frames and the step of inserting effects can be combined into a single step. The effects are inserted during the step of mixing down the mixed-down frames illustrated in figure 4. As a result, the video obtained at the end of the creation of the mixed-down video is a mixed- down video comprising effects and being encoded in a format adapted to be delivered as video over the Internet.
[0035] In one embodiment, the server offers a panel of effects that can be added during the creation of the mixed- down video. Alternatively, the effects can be in an application on the user's computer and they are uploaded to the server.
[0036] In one embodiment, the server returns in real-time the mixed-down video to the user. The user can see in realtime the effects added to the video and change them if they are not satisfactory. The user does not need to wait until the end of the process to view the mixed-down video. Once the user is satisfied with the result, the server returns the mixed-down video in the form of a video file in a format adapted to be delivered as video over the Internet. 0401
[0037] Figure 6 illustrates one embodiment of a method 170 of creating a mixed-down video comprising the steps of simplification and reconstruction of the mixed-down frames. In this embodiment, audio assets and visual assets are treated separately. If an asset such as a video contains both audio and video assets, this asset is broken down into an audio-only asset and a video-only asset. Any method known to a person skilled in the art to break down a video into a video-only asset and an audio-only asset may be used. As in the method illustrated in figure 1, the first step is the creation of the video mixed-down frames using multiple media assets. At step 172, video-only mixed-down frames are created using the method illustrated in figure 3. Concurrently, the audio-only assets are split in audio-only frames which are uploaded to the server if they are located on the user's computer. Alternatively, the audio-only frames can be located on the server and the user selects the ones to be inserted into the mixed-down video. Video-only mixed-down frames are encoded in a format adapted to be delivered as video over the Internet. Audio frames can be in an audio format such as mp3 or wav format, for example. It should be noted that the audio frames may contain a single audio asset such as the soundtrack of a video, which is uploaded to the server from the user's computer or provided on the server.
[0038] In one embodiment, if a single audio-only asset is present, there is no breaking down of the audio-only asset in audio-only frames and the audio-only asset is directly sent to the server.
[0039] Once the video-only mixed-down frames have been created, they are stored in a memory of the user's computer. 04 01
The next step 174 is the simplification of the video-only mixed-down frames, of which an embodiment is illustrated in figure 7. The creation of video-only mixed-down frames results in video-only mixed-down frame 200, video-only mixed- down frame 202 and video-only mixed-down frame 204 which are stored in the memory of the user's computer. Video-only mixed-down frames 200, 202, 204 are subsequently converted into bitmaps which contain a plurality of bits such as tiles. Tiles of one bitmap are compared to tiles of a preceding bitmap and are stored in memory only if they are different.
[0040] For example, video-only mixed-down frame 202 is converted to bitmap 206 which includes 9 tiles Tn to Ti9. Each tile T±j of a bitmap represents a part of the corresponding video-only mixed-down frame and contains all data corresponding to that part of the video-only mixed-down frame. Index i refers to the number of the video-only mixed- down frame and index j refers to the position of the tile in the bitmap. Bitmap 208 is sent from the user's computer to the server. In the case of the first video-only mixed-down frame 200, all tiles Tn to Ti9 of bitmap 208 are sent to the server. The second video-only mixed-down frame 202 is also converted to a bitmap 210 comprising tiles T21 to T29. The tiles of bitmap 210 are compared to their corresponding tiles of the preceding bitmap 206. Tile Tn is compared to tile T2x, tile T12 is compared to tile T22, tile T13 is compared to tile T23, etc. Bitmap 212 represents the bitmap corresponding to video-only mixed-down frame 202 that is sent to the server. Bitmap 212 only includes tiles T25 and T29 of bitmap 210 which are different from their corresponding tiles of bitmap 206. The same steps are performed for video-only mixed-down frame 15750401
204 which is converted to bitmap 214. Bitmap 214 includes tiles T3I to T39 which are compared to tiles T2i to T29 of bitmap 210. Only the tiles that are different are saved in bitmap 216 which includes tiles T3I, T37, and T39. Bitmap 216 5 is then sent to the server.
[0041] In one embodiment, tiles Tn to Ti9, T25, T29, T31, T37, and T39 are compressed before being sent to the server.
[0042] Concurrently or previously to the creation of bitmaps 206, 210, 214, at step 180, the audio-only frames are0 sent to the server where they are mixed-down together in order to obtain an audio track for the mixed-down video (step 182) . Alternatively, the audio-only frames are selected by the user amongst a selection of audio frames available on the server. This audio track is saved in memory as an audio file.5 Instructions relative to the audio-only frames are also sent to the server. These instructions may contain the order and timing of each audio-only frame, for example. It should be understood that the audio-only frames can be compressed before they are sent to the server and subsequently 0 decompressed by the server.
[0043] In one embodiment, the instructions relative to the audio-only frames are comprised in an Edit Decision List
(EDL) of which an example is illustrated in figure 8. The EDL is transmitted to the server prior to the creation of the5 mixed-down video. For example, the EDL can be sent to the server via Real Time Messaging Protocol in an amf3 format.
[0044] In one embodiment, if there is only one audio-only asset, no mix-down is required. The single audio-only asset 0401
(compressed or not) is directly uploaded to the server or selected on the server. In another embodiment, the mixed-down video is a silent video. Therefore, the steps 180 and 182 are not performed and the mixed-down video results from the mix- down of video-only mixed-down frames on the server.
[0045] At step 176, the bitmaps 208, 212, and 216 are sent to the server and stored in the server's memory. If received tiles Tn to Ti9, T25, T29, T3I, T37, and T39 are compressed, the server decompresses them before storing them in memory. At step 178, the server reconstructs a complete bitmap using tiles of a preceding bitmap, as illustrated in Figure 9. As bitmap 208 is complete (each position of the bitmap contains a tile) , the server recreates video-only mixed-down frame 200 and stores it in memory. Bitmap 212 is incomplete as only tile T25 and T29 are present. The server fills the missing tiles T2i, T22, T23, T24, T26, T27, and T28 with their corresponding tiles in the preceding bitmap 208. As a result Tn substitutes T21, T12 substitutes T22, Ti3 substitutes T23, etc, to give rise to bitmap 220. From bitmap 220, video-only mixed-down frame 202 is reconstructed by the server and stored in memory for further processing. Bitmap 216 is also incomplete as only tiles T31, T37 and T39 are present. The missing tiles are taken from the preceding bitmap 220. Tile Ti2 substitutes tile T32, tile T13 substitutes tile T33, tile T14 substitutes tile T34, tile T25 substitutes tile T35, tile Ti6 substitutes tile T36, and tile T18 substitutes tile T38. This process results in complete bitmap 222 from which video- only mixed-down frame 204 can be reconstructed and stored in memory . 0401
[0046] While figures 7 and 9 refer to dividing video-only mixed-down frames into tiles, it should be understood that any method for dividing the video-only mixed-down frames can be used. For example, they can be divided in columns or rows. The number of tiles, columns, or rows is chosen as part of a design choice. It should also be understood that the number of mixed-down frames 200, 202, 204 is exemplary only and can equal to any integer superior or equal to two.
[0047] Referring back to figure 6, once the video-only mixed-down frames have been reconstructed and stored in memory, the next step 184 is the creation of a mixed-down video using these reconstructed video-only mixed-down frames and the audio track as illustrated in figure 5. Effects can also be added to the video either during the creation of the mixed-down video or after the creation of the mixed-down video. The resulting mixed-down video is encoded in a format adapted to be delivered as video over the Internet. No rendering process is required to play back the mixed-down video. The last step 186 consists in returning the mixed-down video to the user.
[0048] In one embodiment, the creation of the mixed-down video occurs in several steps. A video track is created using the video-only mixed-down frames according to the method illustrated in figure 4. Alternatively, the method illustrated in figure 5 can be used and effects are added to the video track. Sound effects can be added to the audio track created using the audio-only mixed-down frames. The last step is the combination of the audio track with the video track to obtain the mixed-down video. 0401
[0049] While, in figure 6, the mixing-down of audio-only frames is performed on the server, it should be understood that this process can be performed on the user's computer. In this case, the user selects audio-only frames present on his computer or downloads audio-only frames available on the server. The creation of the mixed-down audio-only frames is performed on the user's computer and they are uploaded to the server with instructions such as an EDL to create the audio track of the mixed-down video.
[0050] Figure 10 illustrates one embodiment of a system 300 for creating a mixed-down video. The system comprises a computer 302 in connection with a server 302. Computer 302 and server 304 can be connected via an Internet connection or an intranet connection, for example. Computer 302 is also connected to a monitor 320 and a command unit 322. Computer 302 comprises a processor 306, a memory 308 and a communication unit 310. Media assets used to create the mixed-down video are stored in memory 308. An application coupled to processor 306 displays options such as a list of media assets on monitor 320. The user has access to all media assets stored in his computer via the application. A user selects the media assets to be used to create the mixed-down video via command unit 322. Processor 306 has access to all media assets stored in memory 308. Furthermore, processor 306 is adapted to combine the media assets chosen by the user in order to obtain mixed-down frames according to the method illustrated in figure 3. The mixed-down frames are stored in memory 308 before being sent to server 304 through communication unit 310. 0401
[0051] In one embodiment, after selecting the media assets to be combined, the user clicks on a "render" button displayed on monitor 320 by the application coupled to processor 306 using command unit 322. The trigger of the "render" button starts the process of creating the mixed-down video .
[0052] Communication unit 318 of server 304 receives the mixed-down frames and stores them in memory 316. Processor 316 is adapted to combine the mixed-down frames stored in memory 316 in order to create a mixed-down video according to the method illustrated in figure 4. The resulting mixed-down video is encoded in a format adapted to be delivered as video over the Internet and is stored in memory 316 before being sent to computer 302 through communication unit 318 or directly streamed to computer 302 without being saved in memory 316. Communication unit 310 receives the mixed-down video and stores it in memory 308.
[0053] In one embodiment, processor 306 is further connected to communication unit 310 and the mixed-down frames are directly sent to server 304 through communication unit 310 without being stored in memory 308.
[0054] In one embodiment, processor 314 is further connected to communication unit 318 and the mixed-down video is sent directly to computer 302 without being stored in memory 316.
[0055] In one embodiment, processor 314 is further adapted to add effects either during the creation of the mixed-down video or after the creation of the mixed-down video following 0401
the method described in figure 5. In this embodiment, the application coupled to processor 306 displays a list of effects that can be added to the video. The user selects the effects to be inserted into the mixed-down video via command unit 322. The user can also choose effect parameters such as the duration of an effect, the position of the effect within the video, etc. The effects and their corresponding parameters are sent to server 318 via processor 306 and communication unit 310 prior to the creation the mixed-down video.
[0056] In one embodiment, processor 306 is further adapted to break down media assets in order to obtain audio-only frames and video-only assets according to the method illustrated in figure 6. Processor 306 is also adapted to simplify the video-only mixed-down frames according to the method illustrated in figure 7 and processor 314 is further adapted to reconstruct the video-only mixed-down frames according to the method illustrated in figure 8, to create the audio-only mixed-down frames, and to combine them with the audio-only mixed-down frames in order to obtain a mixed- down video encoded in a format adapted to be delivered as video over the Internet. Alternatively, processor 306 is further adapted to create the audio-only mixed-down frames and to send them to processor 314 via communication units 310 and 318.
[0057] While memory 308 is part of computer 302 in figure 10, it should be noted that memory 308 may be external to computer 302. 0401
[0058] In one embodiment, computer 302 and server 304 are connected via the Internet.
[0059] In one embodiment, the mix-down of media assets and the mix-down of frames is performed by a Flash runtime application.
[0060] In one embodiment, the mixed-down format is encoded in a standard format and can be played on any generic player such as QuickTime™, Windows Media Player™, Realplayer™, KMplayer™, etc.
[0061] In one embodiment, the mixed-down frames are compressed by the user' s computer before being sent to the server and subsequently decompressed by the server before the creation of the mixed-down video.
[0062] In one embodiment, the mixed-down video is compressed by the server before being sent to the user's computer. The mixed-down video is then decompressed by the user's computer before being played back.
[0063] It should be understood that any elements transmitted between the computer 302 and the server 304 may be compressed before being sent and subsequently decompressed once received. In this case, the processors 306 and 314 are adapted to compress and/or decompress the transmitted elements .
[0064] It should be noted that the embodiments of the invention described above are intended to be exemplary only. The scope of the invention is therefore intended to be limited solely by the scope of the appended claims.

Claims

15750401I/WE CLAIM:
1. A method for creating a video comprising: mixing down a plurality of media assets in order to obtain at least two mixed-down frames on a computer; transmitting said at least two mixed-down frames from said computer to a server; mixing down on said server said at least two mixed-down frames in order to obtain a mixed-down video; and returning said mixed-down video to said computer encoded in a format adapted to be delivered as video over an Internet .
2. The method as claimed in claim 1, wherein said mixing-down said plurality of media assets comprises splitting at least one of said media assets into media asset frames.
3. The method as claimed in claim 1, wherein said mixing-down said plurality of media assets comprises mixing-down at least two of a video frame, a graphic, a picture, a background image, and an audio track.
4. The method as claimed in any one of claims 1 to 3 , further comprising adding effects during said mixing-down said at least two mixed-down frames.
5. The method as claimed in any one of claims 1 to 3 , further comprising adding effects after said mixing-down said at least two mixed-down frames. 15750401
6. The method as claimed in claim 4 or 5, wherein adding effects comprises adding at least one of a track effect, a transition effect and an audio effect.
7. The method as claimed in any one of claims 1 to 6, wherein said mixing down said at least two mixed-down frames comprises : mixing down said at least two mixed-down frames to obtain at least two flattened videos; and mixing down said at least two flattened videos to obtain said mixed-down video.
8. The method as claimed in any one of claims 1 to 7, wherein said mixing-down said plurality of media assets comprises mixing down at least two video-only assets in order to obtain at least two video-only mixed-down frames.
9. The method as claimed in claim 8, further comprising breaking down at least one of said plurality of media assets into at least one audio-only asset and one of said at least two video-only assets.
10. The method as claimed in claim 8, further comprising converting said at least two video-only mixed-down frames in at least two corresponding bitmaps.
11. The method as claimed in claim 10, further comprising, for each one of said at least two corresponding bitmaps, determining particular bits different from bits of a previous one of said at least two corresponding bitmaps. 15750401
12. The method as claimed in claim 11, wherein said transmitting said at least two mixed-down frames comprises transmitting said particular bits for each one of said at least two corresponding bitmaps to said server.
13. The method as claimed in claim 12, further comprising reconstructing said at least two video-only mixed-down frames by filling in missing bits, for each one of said at least two corresponding bitmaps, with corresponding bits of said previous one of said at least two corresponding bitmaps.
14. The method as claimed in claim 8, wherein said mixing down said at least two mixed-down frames comprises mixing down at least one of said at least two video-only mixed-down frames and an audio track in order to obtain said mixed-down video.
15. The method as claimed in claim 14, further comprising mixing down audio-only frames in order to obtain said audio track.
16. A system for creating a video, comprising: a processor in a server; a memory accessible by said processor; communication means for receiving and transmitting data; and an application coupled to said processor, said application being configured for: receiving at least two mixed-down frames created from at least two media assets,- 15750401
mixing down said at least two mixed-down frames in order to obtain a mixed-down video and encoding said mixed-down video in a format adapted to be delivered as video over an Internet; and returning said mixed-down video to said computer via said communication means.
17. The system as claimed in claim 16, wherein said application is configured for receiving at least two mixed- down frames created from at least two of a video frame, a graphic, a picture, a background image, and an audio track.
18. The system as claimed in claim 16 or 17, wherein said application is configured to add effects while mixing-down said at least two mixed-down frames.
19. The system as claimed in claim 16 or 17, wherein said application is configured to add effects after mixing-down said at least two mixed-down frames.
20. The system as claimed in claim 18 or 19, wherein said application is configured to add at least one of a track effect, a transition effect and an audio effect to said mixed-down video.
21. The system as claimed in any one of claims 16 to 20, wherein said application is further configured for: mixing down said at least two mixed-down frames to obtain at least two flattened videos; and mixing down said at least two flattened videos to obtain said mixed-down video. 15750401
22. The system as claimed in any one of claims 16 to 21, wherein said application is configured: for receiving, via said communication means, at least two video-only mixed-down frames,- and mixing down at least one of said at least two video-only mixed-down frames and an audio track in order to obtain said mixed-down video.
23. The system as claimed in claim 20, wherein said application is further configured for: receiving at least two bitmaps each corresponding to one of said at least two video-only mixed-down frames; and reconstructing said at least two video-only mixed-down frames by filing, for each one of said at least two bitmaps, missing bits with corresponding bits of a previous one of said at least two bitmaps.
24. The system as claimed in claim 22, wherein said application is further configured for mixing down audio-only frames in order to obtain said audio track.
25. A system for creating a video, comprising: a processor in a computer; a memory accessible by said processor; communication means for receiving and transmitting data; and an application coupled to said processor, said application being configured for: 15750401
mixing down a plurality of media assets stored in said memory in order to obtain at least two mixed-down frames,- transmitting said at least two mixed-down frames to a server via said communication means; and receiving a mixed-down video from said server via said communication means, said mixed-down video resulting from mixing down said at least two mixed-down frames and being encoded in a format adapted to be delivered as video over an Internet.
26. The system as claimed in claim 25, wherein said application is further configured for splitting at least one of said media assets into media asset frames.
27. The system as claimed in claim 25 or 26, wherein said application is configured for mixing down at least two of a video frame, a graphic, a picture, a background image, and an audio track in order to obtain said at least two mixed-down frames .
28. The system as claimed in any one of claims 25 to 27, wherein said application is configured for storing said mixed-down video in said memory.
29. The system as claimed in any one of claims 25 to 28, wherein said application is further configured for transmitting effects to be added during or after said mixing- down said at least two mixed-down frames to said server via said communication means. 15750401
30. The system as claimed in claim 29, wherein said application is configured for transmitting at least one of a track effect, a transition effect and an audio effect to said server via said communication means.
31. The system as claimed in any one of claims 25 to 30, wherein said application is further configured for: mixing down at least two video-only assets in order to obtain at least two video-only mixed-down frames; and transmitting said at least two video-only mixed-down frames to said server via said communication means .
32. The system as claimed in claim 31, wherein the application is further configured for breaking down at least one of said plurality of media assets into at least one audio-only asset and one of said at least two video-only assets .
33. The system as claimed in claim 31, wherein said application is further configured for converting said at least two video-only mixed-down frames in at least two corresponding bitmaps.
34. The system as claimed in claim 31, wherein said application is further configured for simplifying said at least two video-only mixed-down frames and transmitting said at least two simplified video-only mixed-down frames to said server via said communication means, said simplifying comprising, for each one of said at least two corresponding bitmaps, determining particular bits different from bits of a previous one of said at least two corresponding bitmaps and 15750401
said transmitting said at least two simplified video-only mixed-down frames comprising transmitting said different bits, for each one of said at least two corresponding bitmaps .
35. The system as claimed in claim 34, wherein said application is further configured for transmitting at least one audio-only frame to said server via said communication means, said at least one audio-only frame being mixed-down on said server to obtain au audio track, said mixed-down video resulting from a mix-down of said audio track and said at least two video-only mixed-down frames.
36. A method for creating a video comprising: receiving, at a server and from a remote computer, at least two mixed-down frames created from at least two media assets; mixing down said at least two mixed-down frames in order to obtain a mixed-down video and encoding said mixed-down video in a format adapted to be delivered as video over an Internet ; and returning said mixed-down video to said remote computer.
37. A method for creating a video comprising: mixing down, at a remote computer, a plurality of media assets stored in a memory in order to obtain at least two mixed-down frames,- transmitting said at least two mixed-down frames to a server; and receiving, at said remote computer, a mixed-down video from said server, said mixed-down video resulting from mixing 0401
down said at least two mixed-down frames and being encoded in a format adapted to be delivered as video over an Internet.
PCT/CA2009/000647 2008-05-12 2009-05-12 Video mix-down WO2009137919A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US5255008P 2008-05-12 2008-05-12
US61/052,550 2008-05-12

Publications (1)

Publication Number Publication Date
WO2009137919A1 true WO2009137919A1 (en) 2009-11-19

Family

ID=41318310

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CA2009/000647 WO2009137919A1 (en) 2008-05-12 2009-05-12 Video mix-down

Country Status (1)

Country Link
WO (1) WO2009137919A1 (en)

Cited By (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012114329A1 (en) * 2011-02-24 2012-08-30 Jbf Interlude 2009 Ltd. System and method for seamless multimedia assembly
WO2013055802A1 (en) 2011-10-10 2013-04-18 Genarts, Inc. Network-based rendering and steering of visual effects
US8600220B2 (en) 2012-04-02 2013-12-03 JBF Interlude 2009 Ltd—Israel Systems and methods for loading more than one video content at a time
US8860882B2 (en) 2012-09-19 2014-10-14 JBF Interlude 2009 Ltd—Israel Systems and methods for constructing multimedia content modules
CN104427259A (en) * 2013-09-03 2015-03-18 广州市千钧网络科技有限公司 Method and device for playing video and audio special effects in real time
US9009619B2 (en) 2012-09-19 2015-04-14 JBF Interlude 2009 Ltd—Israel Progress bar for branched videos
US9190110B2 (en) 2009-05-12 2015-11-17 JBF Interlude 2009 LTD System and method for assembling a recorded composition
US9257148B2 (en) 2013-03-15 2016-02-09 JBF Interlude 2009 LTD System and method for synchronization of selectably presentable media streams
US9520155B2 (en) 2013-12-24 2016-12-13 JBF Interlude 2009 LTD Methods and systems for seeking to non-key frames
US9530454B2 (en) 2013-10-10 2016-12-27 JBF Interlude 2009 LTD Systems and methods for real-time pixel switching
US9641898B2 (en) 2013-12-24 2017-05-02 JBF Interlude 2009 LTD Methods and systems for in-video library
US9653115B2 (en) 2014-04-10 2017-05-16 JBF Interlude 2009 LTD Systems and methods for creating linear video from branched video
US9672868B2 (en) 2015-04-30 2017-06-06 JBF Interlude 2009 LTD Systems and methods for seamless media creation
US9792026B2 (en) 2014-04-10 2017-10-17 JBF Interlude 2009 LTD Dynamic timeline for branched video
US9792957B2 (en) 2014-10-08 2017-10-17 JBF Interlude 2009 LTD Systems and methods for dynamic video bookmarking
US9832516B2 (en) 2013-06-19 2017-11-28 JBF Interlude 2009 LTD Systems and methods for multiple device interaction with selectably presentable media streams
US10218760B2 (en) 2016-06-22 2019-02-26 JBF Interlude 2009 LTD Dynamic summary generation for real-time switchable videos
US10257578B1 (en) 2018-01-05 2019-04-09 JBF Interlude 2009 LTD Dynamic library display for interactive videos
US10448119B2 (en) 2013-08-30 2019-10-15 JBF Interlude 2009 LTD Methods and systems for unfolding video pre-roll
US10460765B2 (en) 2015-08-26 2019-10-29 JBF Interlude 2009 LTD Systems and methods for adaptive and responsive video
US10462202B2 (en) 2016-03-30 2019-10-29 JBF Interlude 2009 LTD Media stream rate synchronization
US10582265B2 (en) 2015-04-30 2020-03-03 JBF Interlude 2009 LTD Systems and methods for nonlinear video playback using linear real-time video players
US11050809B2 (en) 2016-12-30 2021-06-29 JBF Interlude 2009 LTD Systems and methods for dynamic weighting of branched video paths
US11128853B2 (en) 2015-12-22 2021-09-21 JBF Interlude 2009 LTD Seamless transitions in large-scale video
US11164548B2 (en) 2015-12-22 2021-11-02 JBF Interlude 2009 LTD Intelligent buffering of large-scale video
US11232458B2 (en) 2010-02-17 2022-01-25 JBF Interlude 2009 LTD System and method for data mining within interactive multimedia
US11245961B2 (en) 2020-02-18 2022-02-08 JBF Interlude 2009 LTD System and methods for detecting anomalous activities for interactive videos
US11412276B2 (en) 2014-10-10 2022-08-09 JBF Interlude 2009 LTD Systems and methods for parallel track transitions
US11490047B2 (en) 2019-10-02 2022-11-01 JBF Interlude 2009 LTD Systems and methods for dynamically adjusting video aspect ratios
US11601721B2 (en) 2018-06-04 2023-03-07 JBF Interlude 2009 LTD Interactive video dynamic adaptation and user profiling
US11856271B2 (en) 2016-04-12 2023-12-26 JBF Interlude 2009 LTD Symbiotic interactive video
US11882337B2 (en) 2021-05-28 2024-01-23 JBF Interlude 2009 LTD Automated platform for generating interactive videos
US11934477B2 (en) 2021-09-24 2024-03-19 JBF Interlude 2009 LTD Video player integration within websites

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1315373A1 (en) * 2000-08-30 2003-05-28 Sony Corporation Effect adding device, effect adding method, effect adding program, storage medium where effect adding program is stored
WO2006016185A1 (en) * 2004-08-13 2006-02-16 Rpptv Limited Remote program production

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1315373A1 (en) * 2000-08-30 2003-05-28 Sony Corporation Effect adding device, effect adding method, effect adding program, storage medium where effect adding program is stored
WO2006016185A1 (en) * 2004-08-13 2006-02-16 Rpptv Limited Remote program production

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
SONY, VEGAS 7 PROFESSION HD VIDEO AND AUDIO PRODUCTION MANUAL, 26 April 2007 (2007-04-26), Retrieved from the Internet <URL:http://www.sonycreativesoftware.com/dl/dl.aspx?dwnid=206> [retrieved on 20090810] *

Cited By (48)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9190110B2 (en) 2009-05-12 2015-11-17 JBF Interlude 2009 LTD System and method for assembling a recorded composition
US9607655B2 (en) 2010-02-17 2017-03-28 JBF Interlude 2009 LTD System and method for seamless multimedia assembly
US11232458B2 (en) 2010-02-17 2022-01-25 JBF Interlude 2009 LTD System and method for data mining within interactive multimedia
WO2012114329A1 (en) * 2011-02-24 2012-08-30 Jbf Interlude 2009 Ltd. System and method for seamless multimedia assembly
WO2013055802A1 (en) 2011-10-10 2013-04-18 Genarts, Inc. Network-based rendering and steering of visual effects
EP2766816A4 (en) * 2011-10-10 2016-01-27 Vivoom Inc Network-based rendering and steering of visual effects
US8600220B2 (en) 2012-04-02 2013-12-03 JBF Interlude 2009 Ltd—Israel Systems and methods for loading more than one video content at a time
US9271015B2 (en) 2012-04-02 2016-02-23 JBF Interlude 2009 LTD Systems and methods for loading more than one video content at a time
US8860882B2 (en) 2012-09-19 2014-10-14 JBF Interlude 2009 Ltd—Israel Systems and methods for constructing multimedia content modules
US9009619B2 (en) 2012-09-19 2015-04-14 JBF Interlude 2009 Ltd—Israel Progress bar for branched videos
US10474334B2 (en) 2012-09-19 2019-11-12 JBF Interlude 2009 LTD Progress bar for branched videos
US9257148B2 (en) 2013-03-15 2016-02-09 JBF Interlude 2009 LTD System and method for synchronization of selectably presentable media streams
US10418066B2 (en) 2013-03-15 2019-09-17 JBF Interlude 2009 LTD System and method for synchronization of selectably presentable media streams
US9832516B2 (en) 2013-06-19 2017-11-28 JBF Interlude 2009 LTD Systems and methods for multiple device interaction with selectably presentable media streams
US10448119B2 (en) 2013-08-30 2019-10-15 JBF Interlude 2009 LTD Methods and systems for unfolding video pre-roll
CN104427259A (en) * 2013-09-03 2015-03-18 广州市千钧网络科技有限公司 Method and device for playing video and audio special effects in real time
US9530454B2 (en) 2013-10-10 2016-12-27 JBF Interlude 2009 LTD Systems and methods for real-time pixel switching
US9520155B2 (en) 2013-12-24 2016-12-13 JBF Interlude 2009 LTD Methods and systems for seeking to non-key frames
US9641898B2 (en) 2013-12-24 2017-05-02 JBF Interlude 2009 LTD Methods and systems for in-video library
US9792026B2 (en) 2014-04-10 2017-10-17 JBF Interlude 2009 LTD Dynamic timeline for branched video
US11501802B2 (en) 2014-04-10 2022-11-15 JBF Interlude 2009 LTD Systems and methods for creating linear video from branched video
US10755747B2 (en) 2014-04-10 2020-08-25 JBF Interlude 2009 LTD Systems and methods for creating linear video from branched video
US9653115B2 (en) 2014-04-10 2017-05-16 JBF Interlude 2009 LTD Systems and methods for creating linear video from branched video
US11348618B2 (en) 2014-10-08 2022-05-31 JBF Interlude 2009 LTD Systems and methods for dynamic video bookmarking
US9792957B2 (en) 2014-10-08 2017-10-17 JBF Interlude 2009 LTD Systems and methods for dynamic video bookmarking
US11900968B2 (en) 2014-10-08 2024-02-13 JBF Interlude 2009 LTD Systems and methods for dynamic video bookmarking
US10885944B2 (en) 2014-10-08 2021-01-05 JBF Interlude 2009 LTD Systems and methods for dynamic video bookmarking
US10692540B2 (en) 2014-10-08 2020-06-23 JBF Interlude 2009 LTD Systems and methods for dynamic video bookmarking
US11412276B2 (en) 2014-10-10 2022-08-09 JBF Interlude 2009 LTD Systems and methods for parallel track transitions
US10582265B2 (en) 2015-04-30 2020-03-03 JBF Interlude 2009 LTD Systems and methods for nonlinear video playback using linear real-time video players
US9672868B2 (en) 2015-04-30 2017-06-06 JBF Interlude 2009 LTD Systems and methods for seamless media creation
US10460765B2 (en) 2015-08-26 2019-10-29 JBF Interlude 2009 LTD Systems and methods for adaptive and responsive video
US11804249B2 (en) 2015-08-26 2023-10-31 JBF Interlude 2009 LTD Systems and methods for adaptive and responsive video
US11164548B2 (en) 2015-12-22 2021-11-02 JBF Interlude 2009 LTD Intelligent buffering of large-scale video
US11128853B2 (en) 2015-12-22 2021-09-21 JBF Interlude 2009 LTD Seamless transitions in large-scale video
US10462202B2 (en) 2016-03-30 2019-10-29 JBF Interlude 2009 LTD Media stream rate synchronization
US11856271B2 (en) 2016-04-12 2023-12-26 JBF Interlude 2009 LTD Symbiotic interactive video
US10218760B2 (en) 2016-06-22 2019-02-26 JBF Interlude 2009 LTD Dynamic summary generation for real-time switchable videos
US11553024B2 (en) 2016-12-30 2023-01-10 JBF Interlude 2009 LTD Systems and methods for dynamic weighting of branched video paths
US11050809B2 (en) 2016-12-30 2021-06-29 JBF Interlude 2009 LTD Systems and methods for dynamic weighting of branched video paths
US10257578B1 (en) 2018-01-05 2019-04-09 JBF Interlude 2009 LTD Dynamic library display for interactive videos
US11528534B2 (en) 2018-01-05 2022-12-13 JBF Interlude 2009 LTD Dynamic library display for interactive videos
US10856049B2 (en) 2018-01-05 2020-12-01 Jbf Interlude 2009 Ltd. Dynamic library display for interactive videos
US11601721B2 (en) 2018-06-04 2023-03-07 JBF Interlude 2009 LTD Interactive video dynamic adaptation and user profiling
US11490047B2 (en) 2019-10-02 2022-11-01 JBF Interlude 2009 LTD Systems and methods for dynamically adjusting video aspect ratios
US11245961B2 (en) 2020-02-18 2022-02-08 JBF Interlude 2009 LTD System and methods for detecting anomalous activities for interactive videos
US11882337B2 (en) 2021-05-28 2024-01-23 JBF Interlude 2009 LTD Automated platform for generating interactive videos
US11934477B2 (en) 2021-09-24 2024-03-19 JBF Interlude 2009 LTD Video player integration within websites

Similar Documents

Publication Publication Date Title
WO2009137919A1 (en) Video mix-down
US11410704B2 (en) Generation and use of user-selected scenes playlist from distributed digital content
KR100987880B1 (en) Method, computer readable medium and system for editing a low-resolution media asset to generate a high-resolution edited media asset
US8868465B2 (en) Method and system for publishing media content
US9031963B2 (en) Method of interactive video blogging
KR100976887B1 (en) Method and system for creating and applying dynamic media specification creator and applicator
CN101960844B (en) For the system and method strengthening track for the application be included in media file of encoding
US8639086B2 (en) Rendering of video based on overlaying of bitmapped images
US8265457B2 (en) Proxy editing and rendering for various delivery outlets
US20080013916A1 (en) Systems and methods for encoding, editing and sharing multimedia files
KR100991583B1 (en) Method, computer-readable storage medium and system for combining edit information with media content
KR100908144B1 (en) Multimedia editing / playback system and its operation method
KR100987862B1 (en) Method and system for recording edits to media content
JP2004040502A (en) Information-reproducing apparatus, information-reproducing method, and information reproducing system
CN101395909A (en) Method and system for combining edit information with media content
JP2008048091A (en) Motion picture tagging program, motion picture tag system, and motion picture distributing method
JP2015510727A (en) Method and system for providing file data for media files
JP2024510139A (en) Methods, apparatus and computer programs for supporting pre-roll and mid-roll during media streaming and playback
Khasawneh Multimedia essentials and challenges
KR20160009921A (en) Method for sharing video file based on slide show

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 09745334

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 09745334

Country of ref document: EP

Kind code of ref document: A1