TWI698117B - Generating method and playing method of multimedia file, multimedia file generation apparatus and multimedia file playback apparatus - Google Patents
Generating method and playing method of multimedia file, multimedia file generation apparatus and multimedia file playback apparatus Download PDFInfo
- Publication number
- TWI698117B TWI698117B TW107127500A TW107127500A TWI698117B TW I698117 B TWI698117 B TW I698117B TW 107127500 A TW107127500 A TW 107127500A TW 107127500 A TW107127500 A TW 107127500A TW I698117 B TWI698117 B TW I698117B
- Authority
- TW
- Taiwan
- Prior art keywords
- multimedia file
- data track
- positions
- file
- image object
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/02—Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
- G11B27/031—Electronic editing of digitised analogue information signals, e.g. audio or video signals
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/102—Programmed access in sequence to addressed parts of tracks of operating record carriers
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Television Signal Processing For Recording (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
Abstract
Description
本發明是有關於一種影片播放技術,且特別是有關於一種多媒體檔案的產生方法與多媒體檔案產生裝置,以及多媒體檔案的播放方法與多媒體檔案播放裝置。The invention relates to a video playback technology, and in particular to a multimedia file generation method and multimedia file generation device, as well as a multimedia file playback method and multimedia file playback device.
現今360度影片以及全景相機越來越流行,使用者已經可透過電腦螢幕觀看360度影片(亦稱為全景影片)。透過多個鏡頭所拍攝的不同角度的影像並經過後製影像縫合處理,可產生無視野死角的360度影片,其能夠提供使用者身歷其境的效果。Nowadays, 360-degree videos and panoramic cameras are becoming more and more popular, and users can already watch 360-degree videos (also known as panoramic videos) through the computer screen. Images of different angles shot through multiple lenses and processed by post-production image stitching can produce 360-degree videos with no blind spots in the field of view, which can provide the user with an immersive effect.
當使用者利用電腦螢幕觀賞360度影片時,使用者事實上只能看到整個360度場景中的一隅。使用者需要調整360度影片的播放視角,才有辦法觀賞到360度場景中的不同角落。因此,當使用者觀賞360度影片時,使用者需要手動調整播放視角來搜尋感興趣物件,此舉大幅降低使用者觀看360度影片的便利性。再者,受限於一般消費性電子產品的效能,一般消費性電子裝置也難以針對360度影片進行即時的影像物件辨識與追蹤。因此,當使用者想要鎖定某一個感興趣物件進行觀看時,使用者需要隨著感興趣物件的移動而隨時隨地手動控制播放視角。When a user watches a 360-degree video on a computer screen, the user can actually only see a corner of the entire 360-degree scene. The user needs to adjust the viewing angle of the 360-degree video to be able to watch different corners of the 360-degree scene. Therefore, when a user watches a 360-degree video, the user needs to manually adjust the viewing angle to search for objects of interest, which greatly reduces the convenience for the user to watch the 360-degree video. Furthermore, limited by the performance of general consumer electronic products, it is also difficult for general consumer electronic devices to identify and track real-time image objects for 360-degree videos. Therefore, when the user wants to lock a certain object of interest for viewing, the user needs to manually control the playback angle of view anytime and anywhere as the object of interest moves.
有鑑於此,本發明提供一種多媒體檔案的產生方法與多媒體檔案產生裝置,可基於影像物件的位置資訊建立多媒體檔案的特定資料軌,以產生包括全景影片且記錄有位置資訊的該多媒體檔案。In view of this, the present invention provides a multimedia file generating method and a multimedia file generating device, which can create a specific data track of a multimedia file based on the location information of an image object to generate the multimedia file including a panoramic video and recorded with location information.
並且,本發明提供一種多媒體檔案的播放方法與多媒體檔案播放裝置,可依據多媒體檔案中的特定資料軌獲取影像物件的位置資訊,以依據使用者感興趣的影像物件的位置資訊動態調整播放視角。In addition, the present invention provides a multimedia file playback method and multimedia file playback device, which can obtain position information of image objects according to a specific data track in the multimedia file, and dynamically adjust the playback angle of view according to the position information of the image object that the user is interested in.
本發明的實施例提供一種多媒體檔案的產生方法,適用於多媒體檔案產生裝置,所述方法包括下列步驟。取得關聯於時間軸的全景影片,其中此全景影片包括至少一影像物件。取得此影像物件相對於時間軸的多個物件位置。將這些物件位置製作成物件位置檔案。依據此物件位置檔案產生多媒體檔案的至少一資料軌,以生成包括全景影片且記錄有物件位置的多媒體檔案。The embodiment of the present invention provides a method for generating multimedia files, which is suitable for a device for generating multimedia files. The method includes the following steps. Obtain a panoramic video related to the time axis, where the panoramic video includes at least one image object. Obtain multiple object positions of this image object relative to the timeline. Make these object locations into object location files. At least one data track of the multimedia file is generated based on the object location file to generate a multimedia file including a panoramic video and recording the object location.
本發明的實施例提供一種多媒體檔案產生裝置,包括儲存裝置與處理器。儲存裝置儲存有多個模組。處理器耦接儲存裝置,載入並執行儲存裝置中的模組。所述模組包括影片取得模組、位置取得模組、檔案製作模組,以及檔案嵌入模組。影片取得模組取得關聯於時間軸的全景影片,其中此全景影片包括至少一影像物件。位置取得模組取得影像物件相對於時間軸的多個物件位置。檔案製作模組將這些物件位置製作成物件位置檔案。檔案嵌入模組依據物件位置檔案產生多媒體檔案的至少一資料軌,以生成包括全景影片且記錄有物件位置的多媒體檔案。An embodiment of the present invention provides a multimedia file generating device, including a storage device and a processor. The storage device stores multiple modules. The processor is coupled to the storage device, and loads and executes the modules in the storage device. The modules include a video acquisition module, a location acquisition module, a file production module, and a file embedding module. The video obtaining module obtains a panoramic video associated with the time axis, where the panoramic video includes at least one image object. The position obtaining module obtains the position of multiple objects of the image object relative to the time axis. The file creation module creates these object locations into object location files. The file embedding module generates at least one data track of the multimedia file according to the object location file to generate the multimedia file including the panoramic video and recording the object location.
對應的,本發明的實施例提供一種多媒體檔案的播放方法,適用於多媒體檔案播放裝置,所述方法包括下列步驟。接收包括關聯於時間軸的全景影片的多媒體檔案。提取多媒體檔案的第一資料軌,以取得全景影片中第一影像物件相對於時間軸的多個第一物件位置。當播放全景影片,顯示對應於第一影像物件的圖示於螢幕的畫面。響應於偵測到施於圖示的選擇操作,依據第一資料軌所記錄的第一物件位置決定用以播放全景影片的播放視角,並基於播放視角播放包括第一影像物件的畫面。Correspondingly, an embodiment of the present invention provides a multimedia file playback method, which is suitable for a multimedia file playback device, and the method includes the following steps. Receive multimedia files including panoramic videos associated with the time axis. The first data track of the multimedia file is extracted to obtain the positions of the first image objects relative to the time axis in the panoramic video. When the panoramic video is played, an icon corresponding to the first image object is displayed on the screen. In response to detecting the selection operation performed on the icon, the playback angle of view for playing the panoramic video is determined according to the position of the first object recorded in the first data track, and the screen including the first image object is played based on the playback angle.
對應的,本發明的實施例提供一種多媒體檔案播放裝置,包括螢幕、儲存有多個模組的儲存裝置,以及處理器。處理器耦接儲存裝置與螢幕,載入並執行儲存裝置中的所述模組。所述模組包括影片接收模組、資料軌提取模組、介面提供模組,以及影片播放模組。影片接收模組接收包括關聯於時間軸的全景影片的多媒體檔案。資料軌提取模組提取多媒體檔案的第一資料軌,以取得全景影片中第一影像物件相對於時間軸的多個第一物件位置。當播放全景影片,介面提供模組顯示對應於第一影像物件的圖示於螢幕的畫面。影片播放模組響應於偵測到施於圖示的選擇操作,依據第一資料軌所記錄的第一物件位置決定用以播放全景影片的播放視角,並基於播放視角播放包括第一影像物件的畫面。Correspondingly, an embodiment of the present invention provides a multimedia file playback device, including a screen, a storage device storing a plurality of modules, and a processor. The processor is coupled to the storage device and the screen, and loads and executes the module in the storage device. The modules include a video receiving module, a data track extraction module, an interface providing module, and a video playback module. The video receiving module receives a multimedia file including a panoramic video related to the time axis. The data track extraction module extracts the first data track of the multimedia file to obtain a plurality of first object positions of the first image object relative to the time axis in the panoramic video. When the panoramic video is played, the interface providing module displays the icon corresponding to the first image object on the screen. In response to detecting the selection operation performed on the icon, the video playback module determines the playback angle of view for playing the panoramic video based on the position of the first object recorded in the first data track, and based on the playback angle of view, plays the first image object Picture.
基於上述,多媒體檔案產生裝置可依據出現於全景影片中之影像物件的物件位置建立多媒體檔案的特定資料軌。基此,當播放全景影片時,多媒體檔案播放裝置可從多媒體檔案的特定資料軌中獲取影像物件於全景影片中的物件位置,進而依據影像物件的物件位置動態決定播放視角。藉此,使用者不須隨時手動調整全景影片的播放視角,就可以鎖定全景影片中的特定影像物件進行觀看。Based on the above, the multimedia file generating device can create a specific data track of the multimedia file according to the object position of the image object appearing in the panoramic video. Based on this, when the panoramic video is played, the multimedia file playback device can obtain the object position of the image object in the panoramic video from the specific data track of the multimedia file, and then dynamically determine the playback angle of view according to the object position of the image object. In this way, the user does not need to manually adjust the viewing angle of the panoramic video at any time, and can lock a specific image object in the panoramic video for viewing.
為讓本發明的上述特徵和優點能更明顯易懂,下文特舉實施例,並配合所附圖式作詳細說明如下。In order to make the above-mentioned features and advantages of the present invention more comprehensible, the following specific embodiments are described in detail in conjunction with the accompanying drawings.
本發明的部份實施例接下來將會配合附圖來詳細描述,以下的描述所引用的元件符號,當不同附圖出現相同的元件符號將視為相同或相似的元件。這些實施例只是本發明的一部份,並未揭示所有本發明的可實施方式。更確切的說,這些實施例只是本發明的專利申請範圍中的方法以及頭戴顯示裝置的範例。Part of the embodiments of the present invention will be described in detail in conjunction with the accompanying drawings. The reference symbols in the following description will be regarded as the same or similar elements when the same symbol appears in different drawings. These embodiments are only a part of the present invention, and do not disclose all the possible implementation modes of the present invention. More precisely, these embodiments are only examples of methods and head-mounted display devices within the scope of the patent application of the present invention.
圖1是依照本發明實施例所繪示之多媒體檔案產生裝置的方塊圖,但此僅是為了方便說明,並不用以限制本發明。首先圖1先介紹多媒體檔案產生裝置之所有構件以及配置關係,詳細功能將配合圖2一併揭露。FIG. 1 is a block diagram of a multimedia file generating device according to an embodiment of the present invention, but this is only for convenience of description and is not intended to limit the present invention. First, Figure 1 first introduces all the components and configuration relationships of the multimedia file generating device, and detailed functions will be disclosed in conjunction with Figure 2.
請參照圖1,多媒體檔案產生裝置10可以是具備運算能力的各式電子裝置,例如桌上型電腦、筆記型電腦或伺服器等等,本發明對此並不限制。多媒體檔案產生裝置10包括處理器110與儲存裝置120,其功能分述如下:Please refer to FIG. 1, the multimedia
儲存裝置120例如是任何型態的固定式或可移動式隨機存取記憶體(random access memory,RAM)、唯讀記憶體(read-only memory,ROM)、快閃記憶體(flash memory)或類似元件或上述元件的組合。在本實施例中,儲存裝置120用以記錄影片取得模組121、位置取得模組122、檔案製作模組123,以及檔案嵌入模組123。The
處理器110例如是中央處理單元(Central Processing Unit,CPU),或是其他可程式化之一般用途或特殊用途的微處理器(Microprocessor)、數位訊號處理器(Digital Signal Processor,DSP)、可程式化控制器、特殊應用積體電路(Application Specific Integrated Circuits,ASIC)、可程式化邏輯裝置(Programmable Logic Device,PLD)或其他類似裝置或這些裝置的組合,其與儲存裝置120連接。The
在本實施例中,儲存在儲存裝置120中的模組例如是電腦程式,而可由處理器110載入,據以執行本實施例的多媒體檔案的產生方法。In this embodiment, the modules stored in the
圖2是依照本發明實施例所繪示之多媒體檔案的產生方法的流程圖,而圖2的方法流程可以圖1的多媒體檔案產生裝置10的各元件實現。請同時參照圖1及圖2,以下即搭配圖1中多媒體檔案產生裝置10的各項元件與裝置,說明本實施例之多媒體檔案的產生方法的詳細步驟。FIG. 2 is a flowchart of a method for generating a multimedia file according to an embodiment of the present invention, and the method flow of FIG. 2 can be implemented by the components of the multimedia
於步驟S201,影片取得模組121取得關聯於時間軸的全景影片,其中此全景影片包括至少一影像物件。在此,影片取得模組121可以是自多媒體檔案產生裝置10本身的影像擷取模組(未繪示)或者是自其它電子裝置取得全景影片。全景影片也可稱為360度影片,其由對應至一時間軸上不同時間戳記(timestamp)的視頻幀所構成,而這些視頻幀為以特定格式保存的360度影像。上述特定格式例如是Equiangular格式等等。需說明的是,於本發明的實施例中,全景影片包括拍攝至少一被攝物件而產生的至少一影像物件,亦即,所述影像物件呈現於全景影片的視頻幀之中。全景影片中的影像物件例如是人臉,但本發明對此並不限制,其也可能是其他種類的影像物件。In step S201, the
於步驟S202,位置取得模組122取得影像物件相對於時間軸的多個物件位置。於一實施例中,影像物件的多個物件位置可由影片編輯者事先透過肉眼觀看並經由手動編輯而產生。換言之,透過影片編輯者肉眼觀賞全景影片並標註影像物件的多個物件位置,位置取得模組122可取得影像物件於一立體座標系下的多個物件位置。或者,於一實施例中,影像物件相對於時間軸的多個物件位置可由影像處理技術的物件偵測與辨識演算法而自動產生。換言之,透過利用物件偵測與辨識演算法追蹤全景影片中的特定影像物件,位置取得模組122可取得影像物件於一立體座標系下相對於不同時間區間的多個物件位置。影像物件的物件位置可例如由球座標系的球座標來表示。In step S202, the
於一實施例中,此影像物件的多個物件位置分別對應至時間軸上的多個時間區間。也就是說,影像物件的這些物件位置是可依據固定或不固定的時間間隔而被取樣出來。請參照圖3A,圖3A是依據本發明實施例所繪示之對應至多個時間區間的多個物件位置的示意圖。對於一影像物件而言,位置取得模組122可取得對應於時間區間P1的物件位置(r1,θ1,ψ1)、對應於時間區間P2的物件位置(r2,θ2,ψ2),以及對應於時間區間P3的物件位置(r3,θ3,ψ3)。需說明的是,時間區間P1~P3的時間長度可相同或相異,本發明對此並不限制。In one embodiment, the multiple object positions of the image object correspond to multiple time intervals on the time axis, respectively. In other words, the object positions of the image objects can be sampled according to fixed or non-fixed time intervals. Please refer to FIG. 3A. FIG. 3A is a schematic diagram illustrating the positions of multiple objects corresponding to multiple time intervals according to an embodiment of the present invention. For an image object, the
此外,於一實施例中,全景影片中影像物件的數量可以是兩個以上。像是,全景影片中的至少一影像物件可包括第一影像物件與第二影像物件。對應的,相對於時間軸的物件位置將包括第一影像物件的多個第一物件位置與第二影像物件的多個第二物件位置。請參照圖3B,圖3B是依據本發明實施例所繪示之對應至多個時間區間的多個物件位置的示意圖。對於第一影像物件而言,位置取得模組122可取得對應於時間區間P1的物件位置(r4,θ4,ψ4)與對應於時間區間P2的物件位置(r5,θ5,ψ5)。對於第二影像物件而言,位置取得模組122可取得對應於時間區間P1的物件位置(r6,θ6,ψ6)以及對應於時間區間P2的物件位置(r7,θ7,ψ7)。In addition, in one embodiment, the number of image objects in the panoramic video may be more than two. For example, the at least one image object in the panoramic video may include a first image object and a second image object. Correspondingly, the position of the object relative to the time axis will include a plurality of first object positions of the first image object and a plurality of second object positions of the second image object. Please refer to FIG. 3B. FIG. 3B is a schematic diagram of multiple object positions corresponding to multiple time intervals according to an embodiment of the present invention. For the first image object, the
接著,回到圖2的流程,於步驟S203,檔案製作模組123將這些物件位置製作成物件位置檔案。具體而言,檔案製作模組123可以預設的檔案格式將對應至時間軸上各時間區間的物件位置編寫為一物件位置檔案。於一實施例中,產生物件位置檔案的方式可類似於產生影片字幕檔的方式。請參照圖4,圖4是依據本發明實施例所繪示之物件位置檔案的範例。物件位置檔案40記錄有分別命名為“物件名稱A”與“物件名稱B”的兩個影像物件於全景影片中的物件位置,而這些物件位置是以固定的時間間隔被記錄下來。圖4所示範例係以時間間隔為1秒為例,但本發明並不以此為限。例如,於時間點00:01.000,命名為“物件名稱A”的影像物件的物件位置為(r6,θ6,ψ6),而命名為“物件名稱B”的影像物件的物件位置為(r7,θ7,ψ7)。於時間點00:02.000,命名為“物件名稱A”的影像物件的物件位置為(r8,θ8,ψ8),而命名為“物件名稱B”的影像物件的物件位置為(r9,θ9,ψ9)。Then, returning to the flow of FIG. 2, in step S203, the
此外,於一實施例中,檔案製作模組123可將記錄為多個立體位置座標的物件位置映射成多個二維位置座標,並將這些二維位置座標記錄於物件位置檔案之中。一般而言,全景影片中的各張視頻幀系透過將全景影像映射為二維影像的方式來進行保存,像是以Equiangular格式來進行保存。這些記錄為多個立體位置座標(例如球座標)的物件位置也可被映射為二維座標系統下的二維位置座標並予以儲存,以降低物件位置檔案的資料量。In addition, in one embodiment, the
之後,於步驟S204,檔案嵌入模組124依據物件位置檔產生多媒體檔案的至少一資料軌,以生成包括全景影片且記錄有物件位置的多媒體檔案。具體而言,圖5是依據本發明實施例所繪示之多媒體檔案架構的示意圖。多媒體檔案50包括標頭51與多媒體資料52,而多媒體資料52包括了可分類為多個資料軌的多媒體資料。換言之,多媒體檔案50可包括多個資料軌。標頭51中記錄有這些資料軌之特性的描述與這些資料軌的數目,這些資料軌可包含視訊資料軌(video track)521、音訊資料軌(audio track)522、字幕資料軌(sub-picture track)523以及物件位置資料軌(object position track)524。其中,視訊資料軌是用來對視訊資料分類;音訊資料軌則是用來對音訊資料分類,不同的音訊資料軌可代表不同的語言;字幕資料軌是用來對字幕資料分類,不同的字幕資料軌可代表不同的語言的字幕。Then, in step S204, the
於一實施例中,當物件位置檔案包括第一影像物件的多個第一物件位置與第二影像物件的多個第二物件位置時(例如圖4的範例所示),檔案嵌入模組124可產生對應至第一影像物件的第一資料軌,並將物件位置檔案中的第一物件位置(例如圖4的(r4,θ4,ψ4)、(r6,θ6,ψ6)、(r6,θ6,ψ6))嵌入至第一資料軌。另一方面,檔案嵌入模組124可產生對應至第二影像物件的第二資料軌,並將物件位置檔案中的第二物件位置(例如圖4的(r5,θ5,ψ5)、(r7,θ7,ψ7)、(r9,θ9,ψ9))嵌入至第二資料軌。也就是說,物件位置資料軌的數目將被標註之視影像物件的數目而決定,而每一個影像物件的物件位置係由對應的物件位置資料軌來記錄。亦即不同的物件位置資料軌可代表不同的影像物件的位置資訊。In one embodiment, when the object position file includes a plurality of first object positions of a first image object and a plurality of second object positions of a second image object (for example, as shown in the example of FIG. 4), the
值得注意的是,相較於習知的多媒體檔案,本實施例之多媒體檔案50更包括用以記錄物件位置的物件位置資料軌524。檔案嵌入模組124可依據物件位置檔案建立多媒體檔案50的至少一資料軌(即物件位置資料軌524),例如將圖4所示的物件位置檔案40中的資料嵌入至多媒體檔案50的物件位置資料軌524。於此,將特定資料嵌入至多媒體檔案50的至少一資料軌代表將特定資料嵌入至多媒體檔案50中所述資料軌的資料塊中。再者,標頭51更記錄有物件位置資料軌之特性的描述與物件位置資料軌的數目。如此一來,用以播放多媒體檔案50的播放器除了可播放多媒體檔案50中的全景影片之外,更可從物件位置資料軌524得知一或多個影像物件於全景影片中的位置資訊。It is worth noting that, compared with the conventional multimedia file, the
在說明完如何產生記錄有影像物件於全景影片中的物件位置的多媒體檔案之後,以下將列舉實施例說明如何依據本揭露的多媒體檔案播放全景影片。After explaining how to generate a multimedia file recording the object position of the image object in the panoramic video, the following examples illustrate how to play the panoramic video according to the disclosed multimedia file.
圖6是依照本發明實施例所繪示之多媒體檔案播放裝置的方塊圖,但此僅是為了方便說明,並不用以限制本發明。首先圖6先介紹多媒體檔案播放裝置之所有構件以及配置關係,詳細功能將配合圖7一併揭露。FIG. 6 is a block diagram of a multimedia file playback device according to an embodiment of the present invention, but this is only for convenience of description and is not intended to limit the present invention. First, Figure 6 first introduces all the components and configuration relationships of the multimedia file playback device. The detailed functions will be disclosed together with Figure 7.
請參照圖6,多媒體檔案播放裝置60可以是具備運算能力與畫面顯示能力的各式電子裝置,例如桌上型電腦、筆記型電腦、智慧型手機、平板等等,本發明對此並不限制。多媒體檔案播放裝置60包括處理器610、儲存裝置620與螢幕630。Please refer to FIG. 6, the multimedia
儲存裝置620例如是任何型態的固定式或可移動式隨機存取記憶體、唯讀記憶體、快閃記憶體或類似元件或上述元件的組合。在本實施例中,儲存裝置620用以記錄影片接收模組621、資料軌提取模組622、介面提供模組623,以及影片播放模組624。於一實施例中,上述模組可實施為一軟體播放器。The
處理器910例如是中央處理單元,或是其他可程式化之一般用途或特殊用途的微處理器、數位訊號處理器、可程式化控制器、特殊應用積體電路、可程式化邏輯裝置或其他類似裝置或這些裝置的組合,其與儲存裝置620連接。The processor 910 is, for example, a central processing unit, or other programmable general-purpose or special-purpose microprocessors, digital signal processors, programmable controllers, special application integrated circuits, programmable logic devices, or other A similar device or a combination of these devices is connected to the
螢幕630其係用以顯示多媒體檔案播放裝置60輸出的畫面而提供給使用者觀看。在本實施例中,多媒體檔案播放裝置60例如是液晶顯示器(Liquid Crystal Display,LCD)、發光二極體(Light-Emitting Diode,LED)顯示器、場發射顯示器(Field Emission Display,FED)或其他種類的顯示器。The
在本實施例中,儲存在儲存裝置620中的模組例如是電腦程式,而可由處理器610載入,據以執行本實施例的多媒體檔案的播放方法。In this embodiment, the module stored in the
圖7是依照本發明實施例所繪示之多媒體檔案的播放方法的流程圖,而圖7的方法流程可以圖6的多媒體檔案播放裝置60的各元件實現。請同時參照圖6及圖7,以下即搭配圖6中多媒體檔案產生裝置60的各項元件與裝置,說明本實施例之多媒體檔案的播放方法的詳細步驟。FIG. 7 is a flowchart of a multimedia file playback method according to an embodiment of the present invention, and the method flow of FIG. 7 can be implemented by the components of the multimedia
於步驟S701,影片接收模組621接收包括關聯於時間軸的全景影片的多媒體檔案。影片接收模組621可經由有線或無線網路接收包括全景影片的多媒體檔案,也可讀取儲存於儲存裝置620或其他外接儲存裝置中的多媒體檔案。於步驟S702,資料軌提取模組622提取多媒體檔案的第一資料軌,以取得全景影片中第一影像物件相對於時間軸的多個第一物件位置。具體而言,資料軌提取模組622可解多工(demux)多媒體檔案,以獲取對應至各個資料軌的多媒體資料。於一實施例中,多媒體檔案的資料軌可包括視頻資料軌、音訊資料軌、字幕資料軌以及物件位置資料軌。資料軌提取模組622可從多媒體檔案提取出分類至物件位置資料軌的多媒體資料,分類至物件位置資料軌的多媒體資料即為全景影片中第一影像物件相對於時間軸的多個第一物件位置。關於物件位置資料軌之中的物件位置已於前述實施例詳加描述,於此不再贅述。相似的,資料軌提取模組622也可從多媒體檔案提取出分類至視頻資料軌的視頻資料,並解碼視頻資料而獲取全景影片的多個視頻幀。In step S701, the
之後,於步驟S703,當播放全景影片時,介面提供模組623顯示對應於第一影像物件的圖示於螢幕610的畫面。具體而言,介面提供模組623可提供播放器的使用者介面,其可包括畫面播放區與播放控制列。值得注意的是,藉由解析多媒體檔案的標頭(例如圖5所示的標頭51)中物件位置資料軌的數目,介面提供模組623可得知全景影片的影片內容包括多少個事先被標註之影像物件。於是,於播放全景影片的同時,介面提供模組623可顯示事先被標註之影像物件的圖示於螢幕610的畫面。在此的圖示可以是任何形狀的互動式物件,並且各個圖示中將會呈現所對應的影像物件的名稱或代表圖案,以快速導覽使用者有關於全景影像的重點。此外,各個圖示可顯示於播放畫面的邊緣或是播放器的播放控制列中,以避免影響使用者觀看全景影片。After that, in step S703, when the panoramic video is played, the
接著,處理器610將會不斷地偵測使用者是否針對任何圖示進行選擇,而響應於偵測到使用者施於某一圖示的選擇操作。因此,於步驟S704,影片播放模組624響應於偵測到施於圖示的選擇操作,依據第一資料軌所記錄的第一物件位置決定用以播放全景影片的播放視角,並基於播放視角播放包括第一影像物件的畫面。也就是說,當使用者選擇對應至第一影像物件的圖示時,影片播放模組624可從物件位置資料軌得知第一影像物件於全景影片中當下的物件位置。接著,影片播放模組624可依據第一影像物件當下的物件位置決定播放視角,而播放畫面將會從全景影片的預設區域位移至第一影像物件所在的第一區域,以讓使用者可快速地觀看所選擇的重點對象。Then, the
值得注意的是,被選定的第一影像物件的第一物件位置可能改變。以圖3B為例,第一影像物件的第一物件位置可能從(r4,θ4,ψ4)轉變為(r6,θ6,ψ6)。若未調整播放視角,第一影像物件可能消失於播放畫面之中。於一實施例中,影片播放模組624可響應於識別到第一物件位置的改變,重新依據改變後的第一物件位置來切換播放視角。以圖3B為例,響應於第一影像物件的第一物件位置從(r4,θ4,ψ4)改變為(r6,θ6,ψ6),影片播放模組624將播放視角從第一視角切換為第二視角。對應的,播放畫面將會從原本的第一區域調整為第一影像物件所在的第二區域。亦即,影片播放模組624於時間區間P1中以第一視角播放全景影片的第一區域,接著於時間區間P2中以第二視角播放全景影片的第二區域。如此,使用者可在沒有手動調整播放視角的情況下持續觀看到所選擇的重點對象。It is worth noting that the position of the first object of the selected first image object may change. Taking FIG. 3B as an example, the position of the first object of the first image object may be changed from (r4, θ4, ψ4) to (r6, θ6, ψ6). If the playback angle is not adjusted, the first image object may disappear in the playback screen. In one embodiment, the
可知的,影像物件的數目可以為兩個以上。於一實施例中,除了提取多媒體檔案的第一資料軌,資料軌提取模組622更可提取多媒體檔案的第二資料軌,以取得全景影片中第二影像物件相對於該時間軸的多個第二物件位置。於是,當播放全景影片,介面提供模組623也將顯示對應於第二影像物件的另一圖示於螢幕610的畫面。於是,影片播放模組624響應於偵測到施於另一圖示的選擇操作,依據第二資料軌所記錄的第二物件位置切換播放視角,並基於所切換的播放視角播放包括第二影像物件的畫面。It can be seen that the number of image objects can be more than two. In one embodiment, in addition to extracting the first data track of the multimedia file, the data
舉例而言,圖8A與8B是依照本發明實施例所繪示之多媒體檔案的播放範例示意圖。請先參照圖8A,當多媒體檔案播放裝置60播放本揭露所產生的多媒體檔案時,全景影片隨時間軸播放。使用者可藉由對虛擬控制鈕82的操作來調整全景影片的播放視角。多媒體檔案播放裝置60可依據多媒體檔案的標頭而獲取物件位置資料軌之描述的特性與物件位置資料軌的數目,從而得知事先被標註之影像物件的數目與物件名稱等等。於本範例中,假設事先被標註之影像物件的數目為3,因此多媒體檔案播放裝置60將顯示三個圖示I1~I3於畫面80,且這三個圖示I1~I3分別呈現有此三個影像物件的代表名稱‘A’、‘B’、‘C’。For example, FIGS. 8A and 8B are schematic diagrams showing examples of playing multimedia files according to an embodiment of the present invention. Please refer to FIG. 8A first. When the multimedia
假設使用者欲觀看對應於圖示I1的影像物件83(即關鍵人物A),響應於偵測到使用者針對圖示I1進行選擇操作,多媒體檔案播放裝置60將依據物件位置資料軌所記錄之影像物件83的物件位置決定播放全景影片的播放視角,以依據剛決定的播放視角來播放包括影像物件83的畫面80。於本範例中,被選定的影像物件83將位於畫面80的中間。之後,假設使用者欲觀看對應於圖示I2的影像物件84(即關鍵人物B),響應於偵測到使用者針對圖示I2進行選擇操作,多媒體檔案播放裝置60將依據物件位置資料軌所記錄之影像物件84的物件位置切換播放視角,並基於所切換的播放視角播放包括影像物件84的畫面86。於本範例中,切換播放視角之後,被選定的影像物件84將位於畫面80的中間。Assuming that the user wants to view the
綜上所述,於本發明實施例中,包括全景影片的多媒體檔案更包括記錄有影像物件之位置資訊的物件位置資料軌。多媒體檔案產生裝置將影像物件的物件位置嵌入至多媒體檔案之中,致使多媒體檔案播放裝置於播放全景影片期間可依據物件位置資料軌即時得知特定影像物件的物件位置。基此,使用者的多媒體播放裝置不須具備強大的運算能力來進行影像物件的辨識與追蹤。此外,當使用者選擇感興趣的影像物件之後,多媒體檔案播放裝置可依據此影像物件的物件位置動態調整全景影片的播放視角,來達到追蹤特定影像物件的播放功能。因此,使用者無須手動調整播放視角來確保有觀看到感興趣的影像物件,從而大幅提昇觀賞360度影片的便利性。本發明更可讓使用者快速地瀏覽全景影片中的重點,以讓使用者觀看全景影片時可擁有直覺快速的操作以及觀看體驗。In summary, in the embodiment of the present invention, the multimedia file including the panoramic video further includes an object location data track recording the location information of the image object. The multimedia file generating device embeds the object position of the image object into the multimedia file, so that the multimedia file playback device can instantly know the object position of the specific image object according to the object position data track during the playback of the panoramic video. Based on this, the user's multimedia playback device does not need to have powerful computing capabilities to identify and track image objects. In addition, after the user selects the image object of interest, the multimedia file playback device can dynamically adjust the viewing angle of the panoramic video according to the object position of the image object to achieve the playback function of tracking the specific image object. Therefore, the user does not need to manually adjust the playback angle of view to ensure that the image object of interest is viewed, thereby greatly improving the convenience of viewing 360-degree videos. The present invention allows the user to quickly browse the key points in the panoramic video, so that the user can have intuitive and fast operation and viewing experience when watching the panoramic video.
雖然本發明已以實施例揭露如上,然其並非用以限定本發明,任何所屬技術領域中具有通常知識者,在不脫離本發明的精神和範圍內,當可作些許的更動與潤飾,故本發明的保護範圍當視後附的申請專利範圍所界定者為準。Although the present invention has been disclosed in the above embodiments, it is not intended to limit the present invention. Anyone with ordinary knowledge in the technical field can make slight changes and modifications without departing from the spirit and scope of the present invention. The scope of protection of the present invention shall be subject to those defined by the attached patent scope.
10:多媒體檔案產生裝置 60:多媒體檔案播放裝置 110、610:處理器 120、620:儲存裝置 630:螢幕 121:影片取得模組 122:位置取得模組 123:檔案製作模組 124:檔案嵌入模組 621:影片接收模組 622:資料軌提取模組 623:介面提供模組 624:影片播放模組 P1~P3:時間區間 40:物件位置檔案 50:多媒體檔案 51:標頭 52:多媒體資料 521:視訊資料軌 522:音訊資料軌 523:字幕資料軌 524:物件位置資料軌 80、86:畫面 82:虛擬控制鈕 I1~I3:圖示 83、84:影像物件 S201~S204、S701~S704:步驟10: Multimedia file generating device 60: Multimedia
圖1是依照本發明實施例所繪示之多媒體檔案產生裝置的方塊圖。 圖2是依照本發明實施例所繪示之多媒體檔案的產生方法的流程圖。 圖3A與圖3B是依據本發明實施例所繪示之對應至多個時間區間的多個物件位置的示意圖。 圖4是依據本發明實施例所繪示之物件位置檔案的範例。 圖5是依據本發明實施例所繪示之多媒體檔案架構的示意圖。 圖6是依照本發明實施例所繪示之多媒體檔案播放裝置的方塊圖。 圖7是依照本發明實施例所繪示之多媒體檔案的播放方法的流程圖。 圖8A與8B是依照本發明實施例所繪示之多媒體檔案的播放範例示意圖。FIG. 1 is a block diagram of a multimedia file generating device according to an embodiment of the invention. FIG. 2 is a flowchart of a method for generating a multimedia file according to an embodiment of the present invention. 3A and 3B are schematic diagrams illustrating the positions of multiple objects corresponding to multiple time intervals according to an embodiment of the present invention. FIG. 4 is an example of an object location file drawn according to an embodiment of the present invention. FIG. 5 is a schematic diagram of a multimedia file structure according to an embodiment of the present invention. FIG. 6 is a block diagram of a multimedia file playback device according to an embodiment of the invention. FIG. 7 is a flowchart of a method for playing multimedia files according to an embodiment of the present invention. 8A and 8B are schematic diagrams showing examples of playing multimedia files according to an embodiment of the present invention.
S701~S704:步驟S701~S704: steps
Claims (20)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW107127500A TWI698117B (en) | 2018-08-07 | 2018-08-07 | Generating method and playing method of multimedia file, multimedia file generation apparatus and multimedia file playback apparatus |
US16/532,511 US11211097B2 (en) | 2018-08-07 | 2019-08-06 | Generating method and playing method of multimedia file, multimedia file generation apparatus and multimedia file playback apparatus |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW107127500A TWI698117B (en) | 2018-08-07 | 2018-08-07 | Generating method and playing method of multimedia file, multimedia file generation apparatus and multimedia file playback apparatus |
Publications (2)
Publication Number | Publication Date |
---|---|
TW202008781A TW202008781A (en) | 2020-02-16 |
TWI698117B true TWI698117B (en) | 2020-07-01 |
Family
ID=69406348
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
TW107127500A TWI698117B (en) | 2018-08-07 | 2018-08-07 | Generating method and playing method of multimedia file, multimedia file generation apparatus and multimedia file playback apparatus |
Country Status (2)
Country | Link |
---|---|
US (1) | US11211097B2 (en) |
TW (1) | TWI698117B (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111163267B (en) * | 2020-01-07 | 2021-12-21 | 影石创新科技股份有限公司 | Panoramic video editing method, device, equipment and storage medium |
CN114026595A (en) * | 2020-06-29 | 2022-02-08 | 深圳市大疆创新科技有限公司 | Image processing method, device and storage medium |
CN111917979B (en) * | 2020-07-27 | 2022-09-23 | 维沃移动通信有限公司 | Multimedia file output method and device, electronic equipment and readable storage medium |
CN111954012A (en) * | 2020-08-12 | 2020-11-17 | 上海遥知信息技术有限公司 | Multimedia resource file playing method and device, computer equipment and storage medium |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWI478581B (en) * | 2008-06-10 | 2015-03-21 | Sunplus Technology Co Ltd | Method for seamless playback of multiple multimedia files |
US9620168B1 (en) * | 2015-12-21 | 2017-04-11 | Amazon Technologies, Inc. | Cataloging video and creating video summaries |
WO2017103065A1 (en) * | 2015-12-17 | 2017-06-22 | Koninklijke Kpn N.V. | Generating output video from video streams |
US20170195576A1 (en) * | 2016-01-05 | 2017-07-06 | 360fly, Inc. | Dynamic field of view adjustment for panoramic video content |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080294691A1 (en) * | 2007-05-22 | 2008-11-27 | Sunplus Technology Co., Ltd. | Methods for generating and playing multimedia file and recording medium storing multimedia file |
WO2011114610A1 (en) * | 2010-03-18 | 2011-09-22 | パナソニック株式会社 | Omnidirectional image processing device and omnidirectional image processing method |
JP6167703B2 (en) * | 2013-07-08 | 2017-07-26 | 株式会社リコー | Display control device, program, and recording medium |
TW201738649A (en) | 2016-04-28 | 2017-11-01 | Yun-Hong Shen | Corresponding method between panorama video and message and system thereof having cloud software calculator to obtain an angle value and a first time point from the cloud database, and calculate and search a picture of panorama video corresponding to the message |
US11272160B2 (en) * | 2017-06-15 | 2022-03-08 | Lenovo (Singapore) Pte. Ltd. | Tracking a point of interest in a panoramic video |
-
2018
- 2018-08-07 TW TW107127500A patent/TWI698117B/en active
-
2019
- 2019-08-06 US US16/532,511 patent/US11211097B2/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWI478581B (en) * | 2008-06-10 | 2015-03-21 | Sunplus Technology Co Ltd | Method for seamless playback of multiple multimedia files |
WO2017103065A1 (en) * | 2015-12-17 | 2017-06-22 | Koninklijke Kpn N.V. | Generating output video from video streams |
US9620168B1 (en) * | 2015-12-21 | 2017-04-11 | Amazon Technologies, Inc. | Cataloging video and creating video summaries |
US20170195576A1 (en) * | 2016-01-05 | 2017-07-06 | 360fly, Inc. | Dynamic field of view adjustment for panoramic video content |
Also Published As
Publication number | Publication date |
---|---|
TW202008781A (en) | 2020-02-16 |
US11211097B2 (en) | 2021-12-28 |
US20200051594A1 (en) | 2020-02-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
TWI698117B (en) | Generating method and playing method of multimedia file, multimedia file generation apparatus and multimedia file playback apparatus | |
US8875212B2 (en) | Systems and methods for remote control of interactive video | |
US8656282B2 (en) | Authoring tool for providing tags associated with items in a video playback | |
US8559683B2 (en) | Electronic apparatus and scene-type display method | |
US9484065B2 (en) | Intelligent determination of replays based on event identification | |
Chen et al. | An autonomous framework to produce and distribute personalized team-sport video summaries: A basketball case study | |
KR20160097870A (en) | System and method for browsing summary image | |
ES2914124T3 (en) | Media targeting | |
TW201520827A (en) | System and method of providing augmented reality effect for multi-media data | |
US20190130193A1 (en) | Virtual Reality Causal Summary Content | |
US8244005B2 (en) | Electronic apparatus and image display method | |
CN104023181A (en) | Information processing method and device | |
JP6203188B2 (en) | Similar image search device | |
TWM506428U (en) | Display system for video stream on augmented reality | |
EP2942949A1 (en) | System for providing complex-dimensional content service using complex 2d-3d content file, method for providing said service, and complex-dimensional content file therefor | |
KR102138835B1 (en) | Apparatus and method for providing information exposure protecting image | |
JP5850188B2 (en) | Image display system | |
US20120128325A1 (en) | Authoring device and authoring method | |
CN110929056B (en) | Multimedia file generating method, multimedia file playing method, multimedia file generating device and multimedia file playing device | |
WO2021087773A1 (en) | Recognition method and apparatus, electronic device, and storage medium | |
US20110231763A1 (en) | Electronic apparatus and image processing method | |
TWI762830B (en) | System for displaying hint in augmented reality to play continuing film and method thereof | |
US11962743B2 (en) | 3D display system and 3D display method | |
US20230326094A1 (en) | Integrating overlaid content into displayed data via graphics processing circuitry and processing circuitry using a computing memory and an operating system memory | |
JP2010157960A (en) | Imaging apparatus |