JP2016110645A

JP2016110645A - Dividing device, analysis device, and program

Info

Publication number: JP2016110645A
Application number: JP2015230099A
Authority: JP
Inventors: 馬場　秋継; Akitsugu Baba; 秋継馬場; 悠樹広中; Yuki Hironaka; 藤澤　和也; Kazuya Fujisawa; 和也藤澤; 謙二郎加井; Kenjiro Kai; 洋一所; Yoichi Tokoro
Original assignee: Nippon Hoso Kyokai NHK
Current assignee: Japan Broadcasting Corp
Priority date: 2014-11-28
Filing date: 2015-11-25
Publication date: 2016-06-20
Anticipated expiration: 2035-11-25
Also published as: JP6796376B2

Abstract

PROBLEM TO BE SOLVED: To provide a dividing device, an analysis device, and a program for appropriately dividing and sending out timed text such as subtitle text.SOLUTION: An acquisition unit 11 acquires text document data including a plurality of text sentences in which time-of-day information is added. A time-of-day analysis unit 12 generates fragmentation information for fragmenting the text document data into a plurality of groups including the text sentences on the basis of the time-of-day information. A reference relation analysis unit 14 analyzes, for each fragment that is a group of fragmented text sentences, the header description information of the text document referenced from the fragment, and generates reference relation information indicating the relationship of the fragment with the header description referenced from the fragment. A sending information generation unit generates fragmented text document sending information that includes the fragmentation information and the reference relation information.SELECTED DRAWING: Figure 1

Description

本発明は、データを分割するための分割装置および解析装置、ならびにプログラムに関する。 The present invention relates to a dividing device, an analyzing device, and a program for dividing data.

テレビ放送における字幕テキストを伝送し、表示するために、タイムドテキストの技術が用いられる。タイムドテキストとは、時刻情報を伴うテキストデータを構造化したものである。字幕テキストに関しては、時刻情報として提示時刻が付加される。放送局側から、映像や音声のコンテンツと共にタイムドテキストを送信し、受信機側では、付加された提示時刻に基づいて、そのテキストを、映像や音声と共に提示する。 Timed text technology is used to transmit and display subtitle text in television broadcasts. Timed text is structured text data with time information. For caption text, a presentation time is added as time information. The broadcast station transmits timed text together with video and audio content, and the receiver presents the text together with video and audio based on the added presentation time.

非特許文献１には、標準化された規格であるタイムドテキストマークアップ言語（ＴＴＭＬ）によるデータの記述方法が記載されている。
また、非特許文献２には、非特許文献１のＴＴＭＬをベースとして、テキストに加え、画像、音声、ＷＥＢフォントによる非組込フォントの提示にも対応したタイムドテキストマークアップ言語（ＡＲＩＢ−ＴＴＭＬ）によるデータの記述方法が記載されている。
さらに、非特許文献３の、例えば図９−２（ｐ．１１４）には、ＡＲＩＢ−ＴＴＭＬ文書ファイルを含む一連のファイルを伝送する方式の概要が記載されている。 Non-Patent Document 1 describes a data description method using a timed text markup language (TTML), which is a standardized standard.
Non-Patent Document 2 includes a timed text markup language (ARIB-TTML) that is based on the TTML of Non-Patent Document 1 and supports the presentation of non-embedded fonts in addition to text, images, sounds, and WEB fonts. ) Describes the data description method.
Further, for example, FIG. 9-2 (p. 114) of Non-Patent Document 3 describes an outline of a system for transmitting a series of files including an ARIB-TTML document file.

World Wide Web Consortium（Ｗ３Ｃ，ワールド・ワイド・ウェブ・コンソーシアム），「Timed Text Markup Language 1 (TTML1) (Second Edition)」，西暦２０１３年（平成２５年）９月２４日，［平成２６年１１月９日検索］，インターネット＜ＵＲＬ：http://www.w3.org/TR/ttaf1-dfxp/＞World Wide Web Consortium (W3C, World Wide Web Consortium), “Timed Text Markup Language 1 (TTML1) (Second Edition)”, September 24, 2013 (November 2014) 9 days search], Internet <URL: http://www.w3.org/TR/ttaf1-dfxp/> 「標準規格ＡＲＩＢＳＴＤ−Ｂ６２１．０版デジタル放送におけるマルチメディア符号化方式（第２世代）」，「第一編第３部第３章字幕・文字スーパーの記述言語」，ｐ．６３−７８，平成２６年７月３１日，一般社団法人電波産業会“Standard ARIB STD-B62 Version 1.0 Multimedia Coding System for Digital Broadcasting (Second Generation)”, “Volume 1 Part 3 Chapter 3 Subtitle / Text Super Description Language”, p. 63-78, July 31, 2014, Japan Radio Industry Association 「標準規格ＡＲＩＢＳＴＤ−Ｂ６０１．０版デジタル放送におけるＭＭＴによるメディアトランスポート方式」，「第９章字幕・文字スーパーの伝送」，ｐ．１１４−１２１，平成２６年７月３１日，一般社団法人電波産業会“Standard ARIB STD-B60 Version 1.0 Digital Transport with MMT Media Transport System”, “Chapter 9 Transmission of Subtitles / Superimposed Characters”, p. 114-121, July 31, 2014, Japan Radio Industry Association

従来の技術では、テレビ番組等、映像コンテンツの字幕テキストは、番組全体を単位として一つのＴＴＭＬ文書ファイルとして構成されている。ＤＶＤやブルーレイディスクなどの記録媒体に映像コンテンツを記録して販売する場合も同様である。また、ビデオオンデマンドのサービス（要求に応じてインターネット等の通信回線を用いてコンテンツを配信するサービス）においても、ひとつのまとまった番組の全体の字幕テキストを一度に送信する形態がとられる。 In the conventional technology, caption text of video content such as a television program is configured as one TTML document file for the entire program. The same applies when video content is recorded on a recording medium such as a DVD or a Blu-ray disc. Also, a video-on-demand service (a service that distributes content using a communication line such as the Internet in response to a request) also takes a form in which the entire subtitle text of a single program is transmitted at a time.

しかしながら、例えば、３０分ないしは数時間におよぶ映像コンテンツの字幕テキストのデータ量は膨大であり、これを短時間内に放送波にのせて伝送することは困難である。また、受信機側では、視聴者は任意のタイミングで、受信機の電源をオンにしたり、放送サービス（放送チャンネル）を切り替えたりする。このため、視聴者があるタイミングで特定の放送番組の視聴を開始したときに、そのタイミングにおいて必要な字幕テキストをすばやく伝送する必要がある。番組の全体の字幕テキストを一度に送信する形態の場合、データ量が膨大であるため、すばやく伝送する事が困難であることに加え、番組が放送中の間に、繰り返し全体の字幕テキストを送信する必要があるため、放送の伝送帯域の多くを消費してしまう。 However, for example, the amount of subtitle text data of video content over 30 minutes or several hours is enormous, and it is difficult to transmit this over broadcast waves within a short time. On the receiver side, the viewer turns on the receiver power or switches the broadcast service (broadcast channel) at an arbitrary timing. For this reason, when the viewer starts viewing a specific broadcast program at a certain timing, it is necessary to quickly transmit the necessary subtitle text at that timing. In the case of transmitting the entire subtitle text of the program at once, the amount of data is enormous, making it difficult to transmit quickly, and it is necessary to repeatedly transmit the entire subtitle text while the program is broadcast This consumes much of the broadcast transmission bandwidth.

したがって、放送局側の設備として、一番組全体の分がまとまったＴＴＭＬ文書ファイル（字幕テキスト等）を、適切なサイズの断片に分割したり、分割された断片を単位として放送信号に載せて送出したりすることが求められる。
また、放送時にリアルタイムで字幕テキストを送出するためには、ＴＴＭＬ文書ファイルの分割処理の負荷を軽減することが求められる。 Therefore, as a facility on the broadcasting station side, a TTML document file (subtitle text, etc.) for a whole program is divided into pieces of an appropriate size, or the divided pieces are put on a broadcast signal and sent as a unit. It is required to do.
Also, in order to transmit subtitle text in real time during broadcasting, it is required to reduce the load of the TTML document file division process.

本発明は、上記の課題認識に基づいて行なわれたものであり、例えば放送用の一番組全体の字幕テキスト等のタイムドテキストを、断片に分割するための、分割装置および解析装置、ならびにプログラムを提供するものである。 The present invention has been made based on the above problem recognition. For example, a dividing device, an analyzing device, and a program for dividing timed text such as subtitle text of an entire program for broadcasting into pieces. Is to provide.

［１］上記の課題を解決するため、本発明の一態様による解析装置は、時刻情報が付加された複数のテキスト文を含むテキスト文書データを取得する取得部と、前記時刻情報に基づいて前記テキスト文書データを、前記テキスト文を含む複数のグループに断片化するための断片化情報を生成する時刻解析部と、前記断片化された前記テキスト文のグループである断片ごとに、前記断片から参照される前記テキスト文書のヘッダ記述の情報を解析し、前記断片と前記断片から参照される前記ヘッダ記述との関係を表す参照関係情報を生成する参照関係解析部と、前記断片化情報と前記参照関係情報とを含んだ断片化テキスト文書送出情報を生成する送出情報生成部と、を具備することを特徴とする。 [1] In order to solve the above-described problem, an analysis apparatus according to an aspect of the present invention includes an acquisition unit that acquires text document data including a plurality of text sentences to which time information is added, and the above-described time information. A time analysis unit for generating fragmentation information for fragmenting text document data into a plurality of groups including the text sentence, and referring to each fragment that is a group of the fragmented text sentence from the fragment Analyzing the header description information of the text document to be generated, generating a reference relationship information representing a relationship between the fragment and the header description referenced from the fragment, the fragmentation information and the reference A transmission information generation unit that generates fragmented text document transmission information including relation information.

［２］また、本発明の一態様は、上記の解析装置において、前記断片を放送により伝送する際の、前記断片に含まれる前記テキスト文から参照される画像ファイルや音声ファイルや非組込フォントファイルのロケーション情報と、前記画像ファイルや前記音声ファイルや前記非組込フォントファイルの前記ロケーション情報が前記テキスト文書データのどの部分に記述されているかを示すロケーション情報記述位置指定情報と、前記画像ファイルや前記音声ファイルや前記非組込フォントファイルを前記断片と共に放送により伝送する際の放送信号中のリソースの取得位置を特定するための放送の名前空間による放送ロケーション情報と、を含んだ放送ロケーション変換情報を生成する変換情報解析部、をさらに具備し、前記送出情報生成部は、前記放送ロケーション変換情報をも含んだ断片化テキスト文書送出情報を生成する、ことを特徴とする。 [2] Further, according to one aspect of the present invention, in the above analysis device, when the fragment is transmitted by broadcasting, an image file, an audio file, or a non-embedded font referred to from the text sentence included in the fragment File location information, location information description position specifying information indicating in which part of the text document data the location information of the image file, the audio file, and the non-embedded font file is described, and the image file And broadcast location information according to the broadcast name space for specifying the resource acquisition position in the broadcast signal when the audio file and the non-embedded font file are transmitted together with the fragment by broadcast. A conversion information analysis unit for generating information, and the transmission information generation unit , It generates the fragmentation text document delivery information including also broadcast location conversion information, wherein the.

［３］また、本発明の一態様は、上記［１］の解析装置において、前記送出情報生成部は、前記取得部によって取得された前記テキスト文書データに前記断片化情報と前記参照関係情報とを含んだ前記断片化テキスト文書送出情報を付加して、情報付加済テキスト文書データとして出力する、ことを特徴とする。
［４］また、本発明の一態様は、上記［２］の解析装置において、前記送出情報生成部は、前記取得部によって取得された前記テキスト文書データに前記断片化情報と前記参照関係情報と前記放送ロケーション変換情報とを含んだ前記断片化テキスト文書送出情報を付加して、情報付加済テキスト文書データとして出力する、ことを特徴とする。 [3] Further, according to an aspect of the present invention, in the analysis device according to [1], the transmission information generation unit includes the fragmentation information, the reference relationship information, and the reference information in the text document data acquired by the acquisition unit. The fragmented text document transmission information including the information is added and output as information-added text document data.
[4] Further, according to an aspect of the present invention, in the analysis device according to [2], the transmission information generation unit includes the fragmentation information, the reference relationship information, and the reference information in the text document data acquired by the acquisition unit. The fragmented text document transmission information including the broadcast location conversion information is added and output as information-added text document data.

［５］また、本発明の一態様は、上記の解析装置において、前記断片化情報に含まれる個々の断片に関する情報は、当該断片に含まれる前記テキスト文のグループを特定するための、
（１）前記断片に含まれる、前記テキスト文に付加されていた前記テキスト文を識別するＩＤのリスト、
（２）前記断片に含まれる前記テキスト文のうち一番時間順が早い前記テキスト文に付加されていた開始時刻の情報、
（３）前記断片に含まれる前記テキスト文のうち一番時間順が早い前記テキスト文に付加されていた開始時刻および一番時間順が遅い前記テキスト文に付加されていた終了時刻の情報、
の少なくともいずれかを含むものであり、前記参照関係情報は、前記断片の提示に必要な前記テキスト文書のヘッダ記述として、非組込フォントの情報と、埋め込み画像の情報、テキストのスタイルの情報と、テキスト提示の領域の情報との、少なくともいずれかを含むものである、ことを特徴とする。 [5] Further, according to one aspect of the present invention, in the above analysis device, the information about each fragment included in the fragmentation information is for specifying a group of the text sentences included in the fragment.
(1) A list of IDs for identifying the text sentence included in the fragment and attached to the text sentence;
(2) Information of the start time added to the text sentence having the earliest time order among the text sentences included in the fragment;
(3) Information on a start time added to the text sentence with the earliest time order among the text sentences included in the fragment and an end time added to the text sentence with the latest time order;
The reference relationship information includes, as header description of the text document necessary for presentation of the fragment, information on non-embedded font, information on embedded image, information on text style, , Including at least one of the text presentation area information.

［６］上記の課題を解決するため、本発明の一態様による分割装置は、時刻情報が付加された複数のテキスト文を含むテキスト文書データに加え、前記時刻情報に基づいて前記テキスト文書データを前記テキスト文の複数のグループに断片化するための断片化情報と、前記断片化された前記テキスト文のグループである断片ごとに、前記断片から参照される前記テキスト文書のヘッダ記述との関係を表す参照関係情報とを含んだ断片化テキスト文書送出情報を読み込み、前記断片化情報に基づいて前記テキスト文書データを前記テキスト文の複数のグループに分割するとともに、前記参照関係情報に基づいて、分割された断片である前記テキスト文のグループに、前記断片から参照される前記テキスト文書のヘッダ記述の情報を付加する分割部と、前記分割部によって分割された前記テキスト文の断片から参照されるリソースファイルを取得するリソースファイルデータ取得部と、前記分割部によって分割された前記テキスト文と、前記リソースファイルデータ取得部によって取得された前記リソースファイルとを含むデータを出力する出力部と、を具備することを特徴とする。 [6] In order to solve the above-described problem, a dividing device according to an aspect of the present invention, in addition to text document data including a plurality of text sentences to which time information is added, adds the text document data based on the time information. The relationship between fragmentation information for fragmenting into a plurality of groups of the text sentence and the header description of the text document referenced from the fragment for each fragment that is a group of the fragmented text sentence. Reading fragmented text document transmission information including reference relationship information to be expressed, dividing the text document data into a plurality of groups of the text sentences based on the fragmentation information, and dividing based on the reference relationship information Division for adding information of the header description of the text document referred to by the fragment to the group of the text sentence that is a fragment A resource file data acquisition unit that acquires a resource file referenced from the fragment of the text sentence divided by the division unit, the text sentence divided by the division unit, and the resource file data acquisition unit And an output unit for outputting data including the resource file.

［７］また、本発明の一態様は、上記の分割装置において、前記分割部は、前記断片を放送により伝送する際の、前記断片に含まれる前記テキスト文から参照される画像ファイルや音声ファイルや非組込フォントファイルのロケーション情報と、前記画像ファイルや前記音声ファイルや前記非組込フォントファイルの前記ロケーション情報が前記テキスト文書データのどの部分に記述されているかを示すロケーション情報記述位置指定情報と、前記画像ファイルや前記音声ファイルや前記非組込フォントファイルを前記断片と共に放送により伝送する際の放送信号中のリソースの取得位置を特定するための放送の名前空間による放送ロケーション情報と、を含んだ放送ロケーション変換情報を更に含む、前記断片化テキスト文書送出情報を読み込み、前記放送ロケーション変換情報に基づいて、前記断片に含まれる前記画像ファイルや前記音声ファイルや前記非組込フォントファイルのロケーション情報を、放送の名前空間によるロケーション情報に書き換えて前記断片に分割する、ことを特徴とする。 [7] In addition, according to one aspect of the present invention, in the above dividing device, the dividing unit transmits an image file or an audio file referred to from the text sentence included in the fragment when the fragment is transmitted by broadcasting. And location information description position designation information indicating in which part of the text document data the location information of the image file, the audio file, and the non-embedded font file is described. Broadcast location information based on a broadcast name space for specifying a resource acquisition position in a broadcast signal when the image file, the audio file, and the non-embedded font file are transmitted together with the fragment by broadcast. The fragmented text document transmission information further including the included broadcast location conversion information is read. Based on the broadcast location conversion information, the location information of the image file, the audio file, and the non-embedded font file included in the fragment is rewritten into location information based on a broadcast name space and divided into the fragments. It is characterized by that.

［８］また、本発明の一態様は、上記の分割装置において、前記分割部は、時刻情報が付加されたテキストを含むテキスト文書データに、前記断片化テキスト文書送出情報が付加されている情報付加済テキスト文書データを読み込み、前記断片化テキスト文書送出情報に含まれる前記断片化情報に基づいて前記テキスト文書データを、テキスト文の複数のグループに分割するとともに、前記参照関係情報に基づいて分割された断片である前記テキスト文のグループに、前記断片から参照される前記テキスト文書のヘッダ記述の情報を付加し、また、前記分割部は、前記断片化テキスト文書情報に前記放送ロケーション変換情報が含まれる場合は、前記放送ロケーション変換情報に基づいて、前記断片に含まれる前記リソースファイルのロケーション情報を、放送の名前空間によるロケーション情報に書き換える、ことを特徴とする。 [8] Further, according to one aspect of the present invention, in the above dividing device, the dividing unit includes information in which the fragmented text document transmission information is added to text document data including text to which time information is added. Read the added text document data, divide the text document data into a plurality of groups of text sentences based on the fragmentation information included in the fragmented text document transmission information, and divide based on the reference relation information Information of a header of the text document referenced from the fragment is added to the group of text sentences that are the fragmented, and the dividing unit includes the broadcast location conversion information in the fragmented text document information. If included, the location of the resource file included in the fragment is based on the broadcast location conversion information. The ® down information, rewrites the location information by name space of broadcasting, characterized in that.

［９］また、本発明の一態様による分割装置は、時刻情報が付加された複数のテキスト文を含むテキスト文書データを取得する取得部と、前記時刻情報に基づいて前記テキスト文書データを、前記テキスト文を含む複数のグループに断片化するための断片化情報を生成する時刻解析部と、前記断片化された前記テキスト文のグループである断片ごとに、前記断片から参照される前記テキスト文書のヘッダ記述の情報を解析し、前記断片と前記断片から参照される前記ヘッダ記述との関係を表す参照関係情報を生成する参照関係解析部と、前記テキスト文書データに加え、前記断片化情報と前記参照関係情報とを含んだ断片化テキスト文書送出情報を読み込み、前記断片化情報に基づいて前記テキスト文書データを前記テキスト文の複数のグループに分割するとともに、前記参照関係情報に基づいて分割された断片である前記テキスト文のグループに前記断片から参照される前記テキスト文書のヘッダ記述の情報を付加する分割部と、前記分割部によって分割された前記テキスト文の断片から参照されるリソースファイルを取得するリソースファイルデータ取得部と、前記分割部によって分割された前記テキスト文と、前記リソースファイルデータ取得部によって取得されたリソースファイルとを含むデータを出力する出力部と、を具備することを特徴とする。 [9] Further, the dividing device according to one aspect of the present invention includes an acquisition unit that acquires text document data including a plurality of text sentences to which time information is added, and the text document data based on the time information. A time analysis unit that generates fragmentation information for fragmentation into a plurality of groups including a text sentence, and for each fragment that is a group of the fragmented text sentence, the text document referenced from the fragment Analyzing header description information, generating a reference relationship information representing a relationship between the fragment and the header description referenced from the fragment; in addition to the text document data, the fragmentation information and the fragmentation information The fragmented text document transmission information including the reference relation information is read, and the text document data is converted into a plurality of groups of the text sentence based on the fragmentation information. A division unit for adding information on a header description of the text document referred to from the fragment to a group of the text sentence that is a fragment divided based on the reference relation information; and A resource file data acquisition unit that acquires a resource file referenced from the fragment of the divided text sentence, the text sentence divided by the division unit, and a resource file acquired by the resource file data acquisition unit And an output unit that outputs data including the output data.

［１０］また、本発明の一態様は、上記の分割装置において、前記出力部は、前記断片に含まれる前記テキスト文に付加された前記提示時刻情報のうち、一番早い提示開始時刻にしたがって、分割された前記テキスト文と、前記リソースファイルとを含むデータを順次出力する、ことを特徴とする。 [10] Further, according to one aspect of the present invention, in the dividing device, the output unit according to the earliest presentation start time among the presentation time information added to the text sentence included in the fragment. The data including the divided text sentence and the resource file are sequentially output.

［１１］また、本発明の一態様は、上記の分割装置において、前記断片化情報に含まれる個々の断片に関する情報は、当該断片に含まれる前記テキスト文のグループを特定するための、
（１）前記断片に含まれる、前記テキスト文に付加されていた前記テキスト文を識別するＩＤのリスト、
（２）前記断片に含まれる前記テキスト文のうち一番時間順が早い前記テキスト文に付加されていた開始時刻の情報、
（３）前記断片に含まれる前記テキスト文のうち一番時間順が早い前記テキスト文に付加されていた開始時刻および一番時間順が遅い前記テキスト文に付加されていた終了時刻の情報、
の少なくともいずれかを含むものであり、前記参照関係情報は、前記断片の提示に必要な前記テキスト文書のヘッダ記述として、非組込フォントの情報と、埋め込み画像の情報、テキストのスタイルの情報と、テキスト提示の領域の情報との、少なくともいずれかを含むものである、ことを特徴とする。 [11] Further, according to one aspect of the present invention, in the above dividing device, the information about each fragment included in the fragmentation information is for specifying a group of the text sentences included in the fragment.
(1) A list of IDs for identifying the text sentence included in the fragment and attached to the text sentence;
(2) Information of the start time added to the text sentence having the earliest time order among the text sentences included in the fragment;
(3) Information on a start time added to the text sentence with the earliest time order among the text sentences included in the fragment and an end time added to the text sentence with the latest time order;
The reference relationship information includes, as header description of the text document necessary for presentation of the fragment, information on non-embedded font, information on embedded image, information on text style, , Including at least one of the text presentation area information.

［１２］また、本発明の一態様は、上記の分割装置において、前記参照関係情報は、前記断片の提示に必要な前記テキスト文書のヘッダ記述として、非組込フォントの情報と、埋め込み画像の情報、テキストのスタイルの情報と、テキスト提示の領域の情報との、少なくともいずれかを含むものである、ことを特徴とする。 [12] Further, according to an aspect of the present invention, in the above dividing apparatus, the reference relation information includes non-embedded font information, embedded image information as a header description of the text document necessary for presentation of the fragment. It includes at least one of information, text style information, and text presentation area information.

［１３］また、本発明の一態様は、上記の解析装置としてコンピューターを機能させるためのプログラムである。 [13] One embodiment of the present invention is a program for causing a computer to function as the analysis apparatus.

［１４］また、本発明の一態様は、上記の分割装置としてコンピューターを機能させるためのプログラムである。 [14] One embodiment of the present invention is a program for causing a computer to function as the above-described dividing device.

本発明によれば、時刻情報が付加されたテキスト情報を、放送等の伝送に適した形に分割して、出力することができる。
また、本発明による字幕情報が付加されたテキスト情報は、一番組全体のテキスト情報として記述、管理できるため、インターネットでのビデオオンデマンドサービスにおいて一般的に用いられる一番組全体のテキスト情報を一括して送信することにも対応でき、インターネットでの字幕テキストの提供に適した形式でも出力することができる。 According to the present invention, text information to which time information is added can be divided and output in a form suitable for transmission such as broadcasting.
In addition, since the text information to which the caption information according to the present invention is added can be described and managed as text information of the entire program, the text information of the entire program generally used in the video-on-demand service on the Internet is collected. Can also be output, and can be output in a format suitable for providing subtitle text on the Internet.

本発明の第１実施形態による分割装置（送出装置）の概略機能構成を示すブロック図である。It is a block diagram which shows schematic function structure of the division | segmentation apparatus (transmission apparatus) by 1st Embodiment of this invention. 同実施形態による分割装置が取得するテキスト文書データの構成を示す概略図である。It is the schematic which shows the structure of the text document data which the dividing device by the same embodiment acquires. 同実施形態による分割装置によって解析される情報を示す概略図である。It is the schematic which shows the information analyzed by the division | segmentation apparatus by the embodiment. 同実施形態による分割装置の処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of the process of the division | segmentation apparatus by the embodiment. 本発明の第２実施形態による装置構成を示す概略ブロック図である。It is a schematic block diagram which shows the apparatus structure by 2nd Embodiment of this invention. 同実施形態による解析装置の概略機能構成を示すブロック図である。It is a block diagram which shows the schematic function structure of the analyzer by the same embodiment. 同実施形態による分割装置の概略機能構成を示すブロック図である。It is a block diagram which shows schematic function structure of the division | segmentation apparatus by the embodiment. 同実施形態による解析装置の処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of the process of the analyzer by the same embodiment. 同実施形態による分割装置の処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of the process of the division | segmentation apparatus by the embodiment. 第１実施形態および第２実施形態における解析結果の情報を付加したテキスト文書データの例を示す概略図（１／６）である。It is the schematic (1/6) which shows the example of the text document data which added the information of the analysis result in 1st Embodiment and 2nd Embodiment. 第１実施形態および第２実施形態における解析結果の情報を付加したテキスト文書データの例を示す概略図（２／６）である。It is the schematic (2/6) which shows the example of the text document data which added the information of the analysis result in 1st Embodiment and 2nd Embodiment. 第１実施形態および第２実施形態における解析結果の情報を付加したテキスト文書データの例を示す概略図（３／６）である。It is the schematic (3/6) which shows the example of the text document data which added the information of the analysis result in 1st Embodiment and 2nd Embodiment. 第１実施形態および第２実施形態における解析結果の情報を付加したテキスト文書データの例を示す概略図（４／６）である。It is the schematic (4/6) which shows the example of the text document data which added the information of the analysis result in 1st Embodiment and 2nd Embodiment. 第１実施形態および第２実施形態における解析結果の情報を付加したテキスト文書データの例を示す概略図（５／６）である。It is the schematic (5/6) which shows the example of the text document data which added the information of the analysis result in 1st Embodiment and 2nd Embodiment. 第１実施形態および第２実施形態における解析結果の情報を付加したテキスト文書データの例を示す概略図（６／６）である。It is the schematic (6/6) which shows the example of the text document data which added the information of the analysis result in 1st Embodiment and 2nd Embodiment. 第１実施形態および第２実施形態において出力される断片化テキスト文書データの例を示す概略図である。It is the schematic which shows the example of the fragmented text document data output in 1st Embodiment and 2nd Embodiment. 第１実施形態および第２実施形態において出力される、パッケージ化した字幕データの構造の例を示す概略図である。It is the schematic which shows the example of the structure of the packaged subtitle data output in 1st Embodiment and 2nd Embodiment.

次に、図面を参照しながら、本発明の実施形態について説明する。
［第１実施形態］
図１は、第１実施形態による分割装置（送出装置）の概略機能構成を示すブロック図である。図示するように、分割装置１は、取得部１１と、時刻解析部１２と、変換情報解析部１３と、参照関係解析部１４と、分割部１５と、出力部１７と、リソースファイルデータ取得部１８とを含んで構成される。また、図示するテキスト文書データ８１と断片化字幕データ８５とは、適宜、記録媒体等に記録された形態で保持される。具体的には、データ記憶手段としては、磁気ハードディスク装置や、半導体メモリ等が用いられる。 Next, embodiments of the present invention will be described with reference to the drawings.
[First Embodiment]
FIG. 1 is a block diagram showing a schematic functional configuration of a dividing device (sending device) according to the first embodiment. As illustrated, the dividing device 1 includes an acquisition unit 11, a time analysis unit 12, a conversion information analysis unit 13, a reference relationship analysis unit 14, a division unit 15, an output unit 17, and a resource file data acquisition unit. 18. Further, the illustrated text document data 81 and fragmented caption data 85 are appropriately stored in a form recorded on a recording medium or the like. Specifically, a magnetic hard disk device, a semiconductor memory, or the like is used as the data storage means.

取得部１１は、時刻情報が付加されたテキストを含むテキスト文書データ８１を外部から取得する。テキスト文書データ８１は、このテキスト文書データ８１の詳細については後述する。
時刻解析部１２は、テキスト文書データ８１に含まれる時刻情報に基づいて、テキスト文書データ８１を断片化するための断片化情報を生成する。ここで、断片化とは、時間軸にしたがって、テキスト文書データ８１を、より短い適切な時間範囲を有する複数のグループに分割することである。なお、断片化によって分割された各グループは、１個または複数個のテキスト文を含む。また、時刻解析部１２によって分割されたテキスト（所定の時間の範囲内のテキスト文）を、以後、断片（フラグメント）と呼ぶ場合がある。適切な時間範囲とは、例えば、テレビ放送の字幕としての伝送用に適した時間範囲である。
時刻解析部１２は、生成した断片化情報を分割部１５に渡す。 The acquisition unit 11 acquires text document data 81 including text to which time information is added from the outside. Details of the text document data 81 will be described later.
The time analysis unit 12 generates fragmentation information for fragmenting the text document data 81 based on the time information included in the text document data 81. Here, the fragmentation is to divide the text document data 81 into a plurality of groups having a shorter appropriate time range according to the time axis. Each group divided by fragmentation includes one or a plurality of text sentences. Further, the text (text sentence within a predetermined time range) divided by the time analysis unit 12 may be hereinafter referred to as a fragment. An appropriate time range is, for example, a time range suitable for transmission as captions for television broadcasting.
The time analysis unit 12 passes the generated fragmentation information to the dividing unit 15.

参照関係解析部１４は、断片化されたテキスト文のグループである断片ごとに、その断片から参照されるテキスト文書のヘッダ記述の情報を解析し、その断片と、その断片から参照されるヘッダ記述との関係を表す参照関係情報を生成する。
参照関係解析部１４は、生成した参照関係情報を分割部１５に渡す。
なお、ヘッダ記述とは、テキスト文から参照されるテキスト文書データのヘッダ部に記述されているフォントの情報や埋め込み画像の情報やスタイル定義情報や字幕提示の領域情報などである。ヘッダ記述の詳細については後述する。 The reference relationship analysis unit 14 analyzes, for each fragment that is a fragmented text sentence group, the header description information of the text document referenced from the fragment, and the header description referenced from the fragment. Reference relationship information representing the relationship between and is generated.
The reference relationship analyzing unit 14 passes the generated reference relationship information to the dividing unit 15.
The header description includes font information, embedded image information, style definition information, caption presentation area information, and the like described in a header portion of text document data referred to from a text sentence. Details of the header description will be described later.

変換情報解析部１３は、断片化された字幕テキストのグループに含まれるリソースファイルを参照するためのロケーション情報を解析する。そして、変換情報解析部１３は、元のロケーション情報の記述を放送の名前空間によるロケーション情報へ書き換えるための、放送ロケーション変換情報を生成する。
変換情報解析部１３は、生成した放送ロケーション変換情報を分割部１５に渡す。 The conversion information analysis unit 13 analyzes location information for referring to a resource file included in a fragmented subtitle text group. Then, the conversion information analysis unit 13 generates broadcast location conversion information for rewriting the description of the original location information into location information in the broadcast name space.
The conversion information analysis unit 13 passes the generated broadcast location conversion information to the dividing unit 15.

分割部１５は、テキスト文書データ８１と、時刻解析部１２から渡される断片化情報と、参照関係解析部１４から渡される参照関係情報とを取得する。そして、分割部１５は、断片化情報に基づいてテキスト文書データ８１を、テキスト文を含んだ複数のグループに分割するとともに、分割された断片であるテキスト文のグループに、その断片から参照されるテキスト文書のヘッダ記述の情報を付加する。
出力部１７は、分割部１５によって分割されたテキスト文のグループである断片と、その断片から参照されるリソースファイルのデータとを、放送等で利用される伝送フォーマットにて出力する。このとき、出力部１７は、断片に含まれるテキスト文に付加された時刻情報のうち一番早い開始時間にしたがって、分割されたテキスト文のグループである断片化テキスト文書データと、関連付けられたリソースのデータを放送等で利用される伝送フォーマットにて順次出力する。なお、出力部１７は、リソースファイルのデータを、リソースファイルデータ取得部１８から受け取る。
リソースファイルデータ取得部１８は、テキスト文書データ８１から参照されている外部のリソースファイル８７を取得して、上記の出力部１７に渡す。 The dividing unit 15 acquires the text document data 81, fragmentation information passed from the time analysis unit 12, and reference relationship information passed from the reference relationship analysis unit 14. Then, the dividing unit 15 divides the text document data 81 into a plurality of groups including the text sentence based on the fragmentation information, and is referenced from the fragment to the group of text sentences that are the divided fragments. Add header description information of text document.
The output unit 17 outputs a fragment that is a group of text sentences divided by the dividing unit 15 and data of a resource file referenced from the fragment in a transmission format used in broadcasting or the like. At this time, the output unit 17 generates the fragmented text document data that is a group of text sentences divided according to the earliest start time among the time information added to the text sentence included in the fragment, and the associated resource. Are sequentially output in a transmission format used in broadcasting or the like. The output unit 17 receives the resource file data from the resource file data acquisition unit 18.
The resource file data acquisition unit 18 acquires an external resource file 87 referred to from the text document data 81 and passes it to the output unit 17.

図２は、分割装置１が取得するテキスト文書データ８１の概略構成を示す概略図である。同図に示すテキスト文書データ８１は、テレビ放送の字幕のデータであり、ＴＴＭＬ（Timed Text Markup Language，タイムドテキストマークアップ言語）の形式によるものである。ＴＴＭＬは、例えば「標準規格ＡＲＩＢＳＴＤ−Ｂ６２１．０版デジタル放送におけるマルチメディア符号化方式（第2世代）」，「第一編第3部第3章字幕・文字スーパーの記述言語」（ｐ．６３−７８，平成２６年７月３１日，一般社団法人電波産業会）で規定されたＡＲＩＢ−ＴＴＭＬにしたがう。ＴＴＭＬ文書は、時刻情報が付加された複数のテキスト文を保持することができる。本実施形態におけるＴＴＭＬ文書は、テレビ放送の字幕テキストおよびそのテキストの提示時刻（presentation time）の情報を含む。ＴＴＭＬ文書は、ＸＭＬ（Extensible Markup Language）文書の一種であり、時刻情報以外にも種々の情報を含んでいる。 FIG. 2 is a schematic diagram showing a schematic configuration of the text document data 81 acquired by the dividing device 1. The text document data 81 shown in the figure is television broadcast subtitle data, and is in a TTML (Timed Text Markup Language) format. TTML is, for example, “Standard ARIB STD-B62 Version 1.0, Multi-Media Coding System for Digital Broadcasting (2nd Generation)”, “Part 1, Part 3, Chapter 3, Subtitle / Text Super Description Language” (p 63-78, July 31, 2014, the Radio Industry Association of Japan) according to ARIB-TTML. The TTML document can hold a plurality of text sentences to which time information is added. The TTML document in the present embodiment includes subtitle text of television broadcasting and information on the presentation time of the text. The TTML document is a kind of XML (Extensible Markup Language) document and includes various information in addition to time information.

図示するように、テキスト文書データ８１は、ヘッダ部（ｈｅａｄ要素）に、埋め込みイメージ情報や、非組込フォント情報や、スタイル情報や、字幕提示の領域情報を含む。
具体的には、テキスト文書データ８１は、メタデータ（ｍｅｔａｄａｔａ要素）の一部として、埋め込みイメージ情報を持っている。埋め込みイメージ情報は、ｓｍｐｔｅ：ｉｍａｇｅ要素として保持されるものであり、バイナリー形式のイメージを適宜コード化して文字としてテキスト文書データ８１内に含まれる。
また、テキスト文書データ８１は、スタイリング情報（ｓｔｙｌｉｎｇ要素）の一部として、非組込フォント情報（ａｒｉｂ−ｔｔ：ｆｏｎｔ−ｆａｃｅ要素）を持っている。非組込フォント情報には、ＴＴＭＬ文書とともに表示可能な非組込フォントのリソースフィルのロケーション情報等を記述する。
また、テキスト文書データ８１は、スタイリング情報の一部として、スタイル情報（ｓｔｙｌｅ要素）を持っている。このスタイル情報は、文字色や、フォントファミリーや、フォントサイズや、文字の配置（アラインメント指定）などの情報を含む。後続のｂｏｄｙ要素内に記述される字幕本文から、ここで定義したスタイル情報を参照して利用できる。
また、テキスト文書データ８１は、レイアウト情報（ｌａｙｏｕｔ要素）の一部として、字幕提示の領域情報（ｒｅｇｉｏｎ要素）を含む。この領域情報は、テキストを表示する領域（座標範囲）に関する情報である。後続のｂｏｄｙ要素内に記述される字幕本文から、ここで定義した領域情報を参照して利用できる。 As shown in the figure, the text document data 81 includes embedded image information, non-embedded font information, style information, and subtitle presentation area information in a header portion (head element).
Specifically, the text document data 81 has embedded image information as part of metadata (metadata element). The embedded image information is held as a “smpte: image” element, and an image in a binary format is appropriately encoded and included in the text document data 81 as characters.
Further, the text document data 81 has non-embedded font information (arib-tt: font-face element) as a part of styling information (styling element). The non-embedded font information describes the location information of the resource file of the non-embedded font that can be displayed together with the TTML document.
The text document data 81 has style information (style element) as part of styling information. This style information includes information such as character color, font family, font size, and character arrangement (alignment designation). It can be used by referring to the style information defined here from the subtitle text described in the subsequent body element.
Further, the text document data 81 includes caption presentation area information (region element) as part of layout information (layout element). This area information is information relating to an area (coordinate range) for displaying text. It can be used by referring to the area information defined here from the caption text described in the subsequent body element.

また、テキスト文書データ８１は、ボディ部（ｂｏｄｙ要素）に字幕本文のテキストの情報を保持する。字幕本文は、ｐ要素や、ｄｉｖ要素として、テキスト文書データ８１内に含まれる。なお、字幕本文を保持するｐ要素やｄｉｖ要素は、上記のヘッダ部内の各情報（埋め込みイメージ情報、非組込フォント情報、スタイル情報、字幕提示の領域情報）を参照する。 Further, the text document data 81 holds the text information of the subtitle text in the body part (body element). The subtitle text is included in the text document data 81 as a p element or a div element. Note that the p element and div element that hold the caption text refer to each piece of information (embedded image information, non-embedded font information, style information, and caption presentation area information) in the header section.

分割装置１の入力となるテキスト文書データ８１は、例えば放送番組の単位でひとまとまりのファイルである。番組の長さは、多くの場合、数分から数時間の範囲内のものである。このテキスト文書データ８１は、例えば、ＤＶＤやブルーレイディスク等の記録媒体に記録されたパッケージの一部として組み込まれる場合には特段の不都合はないが、そのままでは、放送等のように逐次伝送される形態のコンテンツには向かない。 The text document data 81 to be input to the dividing device 1 is a group of files in units of broadcast programs, for example. The length of the program is often in the range of minutes to hours. The text document data 81 is not particularly inconvenient when it is incorporated as a part of a package recorded on a recording medium such as a DVD or a Blu-ray disc. Not suitable for form content.

分割装置１は、そのようなテキスト文書データ８１を入力し、この文書に含まれるテキスト文をより短い時間帯ごとに分割して、放送用字幕用の複数のフラグメント（断片）のＴＴＭＬファイルとして出力する。分割装置によって分割された後のフラグメントのＴＴＭＬファイルは、各時間帯のテキスト文のグループの情報（１つまたは複数のｐ要素やｄｉｖ要素の情報）に、それらのｐ要素やｄｉｖ要素から参照されるテキスト文書データ８１のヘッダ部に記述されている、埋め込みイメージ情報、非組込フォント情報、スタイル情報、字幕提示の領域情報を追加したＴＴＭＬの記述方式に従った文書である。なお、分割装置１は、入力する文書ファイルの中から、分割後のファイルに必要な要素のみを適宜選択して出力する。 The dividing device 1 inputs such text document data 81, divides a text sentence included in the document into shorter time zones, and outputs the divided text as a TTML file of a plurality of fragments for broadcasting subtitles. To do. The TTML file of the fragment after being divided by the dividing device is referred to by the text element group information (information of one or more p elements and div elements) of each time zone from those p elements and div elements. This is a document according to the TTML description method to which embedded image information, non-embedded font information, style information, and subtitle presentation area information described in the header portion of the text document data 81 is added. The dividing device 1 appropriately selects and outputs only the elements necessary for the divided file from the input document file.

つまり、分割装置１の時刻解析部１２は、入力されたテキスト文書データ８１に含まれる各テキスト文に付加された時刻情報（提示時刻の情報）に基づいて、テキスト文のグループへの断片化を行う。そして、時刻解析部１２は、入力されたデータに、時刻解析の結果の情報を付加する。時刻解析の結果とは、入力されたデータを時間軸に沿っていかに断片化するかを表す情報である。つまり、時刻解析部１２によって付加される断片化情報とは、各断片の開始時刻（および必要に応じて終了時刻）を表す情報である。ＴＴＭＬにおいては、字幕本文の各時間帯のテキスト文（p要素やｄｉｖ要素）に開始時刻等を表す情報が記述されているため、ｐ要素やｄｉｖ要素の属性値として記述されているｉｄ情報を指定することで、上記の各断片の開始時刻（および必要に応じて終了時刻）を特定でき、各断片に含まれるテキスト文のｉｄ情報を、いかに断片化するかを表す断片化情報とすることもできる。複数のテキスト文をまたがった時間を指定する場合には、複数のテキスト文のｉｄ値のリストを指定することもできる。 That is, the time analysis unit 12 of the dividing device 1 performs fragmentation of text sentences into groups based on time information (presentation time information) added to each text sentence included in the input text document data 81. Do. Then, the time analysis unit 12 adds information on the result of time analysis to the input data. The result of time analysis is information indicating whether input data is fragmented along the time axis. That is, the fragmentation information added by the time analysis unit 12 is information representing the start time (and the end time if necessary) of each fragment. In TTML, information indicating the start time and the like is described in the text sentence (p element and div element) in each time zone of the subtitle body. Therefore, the id information described as the attribute value of the p element or div element is used. By specifying, the start time (and end time if necessary) of each fragment above can be specified, and the id information of the text sentence included in each fragment should be fragmented information indicating how to fragment. You can also. When specifying a time spanning a plurality of text sentences, a list of id values of the plurality of text sentences can be specified.

また、参照関係解析部１４は、入力されたテキスト文書データ８１を分割するために、テキスト文書データ８１に含まれる時間帯（時間軸で区切った断片）ごとの、テキスト文書データ８１のヘッダ部のうち必要な部分の記述への参照の状況を解析する。そして、参照関係解析部１４は、解析した結果である参照関係情報を、入力データに付加する。分割部１５は、これらの、解析結果が付加されたデータを受け取り、それに基づいて分割されたファイルを生成する。
また、変換情報解析部１３は、断片化された字幕テキストのグループに含まれるリソースファイルの参照のためのロケーション情報を解析し、元のロケーション情報の記述を放送の名前空間によるロケーション情報へ書き換えるための、放送ロケーション変換情報を生成する。 In addition, the reference relationship analysis unit 14 divides the input text document data 81 in the header portion of the text document data 81 for each time zone (fragment divided by the time axis) included in the text document data 81. Analyze the situation of reference to the description of the necessary part. Then, the reference relationship analysis unit 14 adds the reference relationship information that is the analysis result to the input data. The dividing unit 15 receives the data to which the analysis result is added, and generates a divided file based on the data.
In addition, the conversion information analysis unit 13 analyzes the location information for referring to the resource file included in the fragmented subtitle text group, and rewrites the original location information description into the location information in the broadcast namespace. The broadcast location conversion information is generated.

図３は、分割装置１の時刻解析部１２と参照関係解析部１４と変換情報解析部１３とによってそれぞれ解析された結果として付加される、データを時間軸に沿っていかに断片化するかをＴＴＭＬ文書内に記述するためのＸＭＬの構造を示す概略図である。この付加情報を含むデータが、分割部１５に渡される。同図は、便宜上、ＸＭＬ形式のデータの階層構造を表として表した形である。なお、同図における横方向のインデントの位置は、階層の深さに対応している。但し、分割部１５が受け取るデータ（解析結果を付加したデータ）は、ＸＭＬ形式に限らず、同等の他の形式のデータであっても良い。また、この例ではＴＴＭＬ文書に中のｍｅｔａｄａｔａ要素としてデータを付加する例を示したが、字幕用のＴＴＭＬ文書とは別に、付加情報のファイルとして別のファイルを生成し、管理するようにしても良い。図示するように、図３は、ＴＴＭＬ文書ファイル内の階層構成のタグ情報およびパラメーターの種類と、同ファイル内に含まれる各要素の出現回数を示している。 FIG. 3 shows TTML whether data to be fragmented along the time axis is added as a result of analysis by the time analysis unit 12, the reference relationship analysis unit 14, and the conversion information analysis unit 13 of the dividing device 1. It is the schematic which shows the structure of XML for describing in a document. Data including this additional information is passed to the dividing unit 15. This figure shows a hierarchical structure of XML data as a table for convenience. The position of the indent in the horizontal direction in the figure corresponds to the depth of the hierarchy. However, the data received by the dividing unit 15 (data to which the analysis result is added) is not limited to the XML format, and may be data in another equivalent format. In this example, data is added as a metadata element in a TTML document. However, a separate file is generated as an additional information file and managed separately from the caption TTML document. good. As shown in FIG. 3, FIG. 3 shows the tag information and parameter types of the hierarchical structure in the TTML document file, and the number of appearances of each element included in the file.

なお、同図においては、各要素および各属性の出現回数の情報をも示している。出現回数の欄に「１」と示す属性は、共通の上位要素に属する同一レベルのものとしては１回出現する。出現回数の欄に「０．．１」と示す属性は、共通の上位要素に属する同一レベルのものとしては０回ないしは１回出現する。出現回数の欄に「０．．ｎ」と示す属性は、共通の上位要素に属する同一レベルのものとしては０回ないしはｎ回（ｎは自然数）出現する。 In the figure, information on the number of appearances of each element and each attribute is also shown. The attribute indicated by “1” in the appearance count column appears once for the same level belonging to the common upper element. The attribute indicated by “0..1” in the column of the number of appearances appears 0 times or once for the same level belonging to the common upper element. The attribute indicated by “0..n” in the appearance count column appears 0 times or n times (n is a natural number) at the same level belonging to a common upper element.

以下、各々の要素および属性について説明する。
ｔｔ要素は、ＴＴＭＬ文書ファイルにおける最上位の要素である。
ｈｅａｄ要素は、ＴＴＭＬ文書ファイルにおけるヘッダ部（ｈｅａｄ要素）である。
ｍｅｔａｄａｔａ要素は、ヘッダ部の中に含まれているメタデータである。ＴＴＭＬにおいては、ＴＴＭＬ文書に関する任意の情報おｍｅｄａｄａｔａ要素下に記述することができる。
ｃａｐｔｉｏｎＥｘｃｈａｎｇｅＩｎｆｏｒｍａｔｉｏｎ要素は、メタデータの一部として含まれている、字幕キャプションの交換に関する情報である。
ｔｒａｎｓｍｉｓｓｉｏｎＩｎｆｏｒｍａｔｉｏｎ要素は、ｃａｐｔｉｏｎＥｘｃｈａｎｇｅＩｎｆｏｒｍａｔｉｏｎの一部として含まれている、伝送に関する情報である。
ｔｒａｎｓｍｉｓｓｉｏｎＵｎｉｔｓ要素は、放送における伝送単位である「ｕｎｉｔ」を格納するための親要素である。 Hereinafter, each element and attribute will be described.
The tt element is the highest element in the TTML document file.
The head element is a header part (head element) in the TTML document file.
The metadata element is metadata included in the header part. In TTML, it can be described under any information and metadata elements related to a TTML document.
The captionExchangeInformation element is information regarding the exchange of caption captions included as part of the metadata.
The transmissionInformation element is information relating to transmission that is included as part of the captionExchangeInformation.
The transmissionUnits element is a parent element for storing “unit” which is a transmission unit in broadcasting.

ｕｎｉｔ要素は、放送で伝送される字幕データの伝送単位を示す要素である。
＠ｘｍｌ：ｉｄ属性は、ｕｎｉｔの属性であり、字幕テキストの伝送単位の識別子を示す。この識別子により、伝送単位ごとの字幕データを番号等で管理することができる。なお、ｕｎｉｔを識別するために、連番等を値として持つ＠ｎｕｍｂｅｒ要素を用いるようにしても良い。
＠ｔｉｍｅｃｏｄｅ属性は、ｕｎｉｔの属性であり、提示時刻を示す。提示時刻は、当該伝送単位として伝送される字幕データを提示する時刻であり、例えば番組開始時点からの相対時刻で表される。提示時刻を表す形式は、例えば、「ｈｈ：ｍｍ：ｓｓ：ｎｎｎ」（時−分−秒−ミリ秒）である。放送局側の送出装置（本実施形態における分割装置１）は、この提示時刻に基づき、字幕データを送出する。なお、提示時刻よりも所定時間（伝送や処理等に要するオーバーヘッド時間）前に、送出装置は、字幕データを送出する。なお、＠ｔｉｍｅｃｏｄｅ属性の値としての提示時刻には、当該伝送単位に含まれる各字幕テキストの提示開始時刻のうち、一番早い開始時間の値を用いる。これにより、放送信号を受信する受信機側での提示に間に合うように、断片化字幕データ８５を送出することができる。 The unit element is an element indicating a transmission unit of caption data transmitted by broadcasting.
The @xml: id attribute is a unit attribute and indicates an identifier of a transmission unit of subtitle text. With this identifier, subtitle data for each transmission unit can be managed by a number or the like. In order to identify a unit, an @number element having a serial number or the like as a value may be used.
The @timecode attribute is a unit attribute and indicates a presentation time. The presentation time is a time at which caption data transmitted as the transmission unit is presented, and is represented by, for example, a relative time from the program start time. The format representing the presentation time is, for example, “hh: mm: ss: nnn” (hour-minute-second-millisecond). The broadcast station-side transmission device (dividing device 1 in the present embodiment) transmits subtitle data based on this presentation time. Note that the sending apparatus sends the caption data before a predetermined time (overhead time required for transmission, processing, etc.) before the presentation time. As the presentation time as the value of the @timecode attribute, the value of the earliest start time among the presentation start times of the subtitle texts included in the transmission unit is used. As a result, the fragmented caption data 85 can be sent out in time for presentation on the receiver side that receives the broadcast signal.

ｒｅｓｏｕｒｃｅ要素は、字幕データの伝送単位に含まれる各リソース（ｒｅｓｏｕｒｃｅ要素）に対応する要素である。ｒｅｓｏｕｒｃｅ要素は、そのリソースを構成するために必要な情報やデータを指し示すための情報を属性として含む。ｒｅｓｏｕｒｃｅ要素のｄａｔａｔｙｐｅ属性（下記）に応じて、記述可能な他の属性を切り替える。具体的には、ｄａｔａｔｙｐｅ＝「００００」の場合と、ｄａｔａｔｙｐｅ≠「００００」との場合で切り替える。 The resource element is an element corresponding to each resource (resource element) included in the transmission unit of caption data. The resource element includes information necessary for configuring the resource and information for indicating data as attributes. Other attributes that can be described are switched according to the datatype attribute (described below) of the resource element. Specifically, switching is performed between datatype = “0000” and datatype ≠ “0000”.

＠ｄａｔａｔｙｐｅ属性は、ｒｅｓｏｕｒｃｅの属性であり、データタイプを表す。例えば、ＡＲＩＢ標準規格である「デジタル放送におけるＭＭＴによるメディアトランスポート方式」（ARIB STD-B60 1.0版，２００４年７月３１日策定）の第１１７ページには、表９−１として、伝送時のデータタイプの一覧が示されている。ここでのｄａｔａｔｙｐｅ属性は、上記規格に準ずるものとして考えることができる。具体的には、ｄａｔａｔｙｐｅの値が「００００」であることは、当該リソースが字幕テキストそのもの（ＡＲＩＢ−ＴＴＭＬ文書ファイル）であることを示す。また、ｄａｔａｔｙｐｅの値が「００００」以外であることは、ＴＴＭＬ文書ファイル以外の外部リソースであることを示す。例えば、ｄａｔａｔｙｐｅの値が「０００１」であるとき、そのリソースはＰＮＧ形式の画像ファイルである。また、ｄａｔａｔｙｐｅの値が「００１０」であるとき、そのリソースはＳＶＧ形式の画像ファイルである。また、ｄａｔａｔｙｐｅの値が「０１１０」であるとき、そのリソースはＳＶＧ形式のフォントファイルである。また、ｄａｔａｔｙｐｅの値が「０１１１」であるとき、そのリソースはＷＯＦＦ形式のフォントファイルである。 The @datatype attribute is a resource attribute and represents a data type. For example, on page 117 of the ARIB standard "Media transport system using MMT in digital broadcasting" (ARIB STD-B60 1.0 version, formulated on July 31, 2004), as shown in Table 9-1, A list of data types is shown. The datatype attribute here can be considered as conforming to the above standard. Specifically, the value “datatype” being “0000” indicates that the resource is the caption text itself (ARIB-TTML document file). A datatype value other than “0000” indicates an external resource other than a TTML document file. For example, when the value of datatype is “0001”, the resource is an image file in PNG format. Further, when the value of datatype is “0010”, the resource is an image file in the SVG format. When the value of datatype is “0110”, the resource is a font file in the SVG format. When the value of datatype is “0111”, the resource is a font file in the WOFF format.

ｄａｔａｔｙｐｅに後続する属性の種類は、＠ｄａｔａｓｉｚｅ属性を除き、上記のｄａｔａｔｙｐｅの値に応じて異なる。ｄａｔａｔｙｐｅの値が「００００」の場合は、後続する属性として、＠ｆｏｎｔ−ｆａｃｅ、＠ｓｔｙｌｅ、＠ｒｅｇｉｏｎ、＠ｓｕｂｔｉｔｌｅが用いられる。ｄａｔａｔｙｐｅの値が「００００」以外の場合は、後続する属性として、＠ｉｄｒｅｆ、＠ｓｒｃｐａｔｈ、＠ｓｒｃｖａｌｕｅ、＠ｒｅｐｌａｃｅｔｏが用いられる。ｄａｔａｔｙｐｅの値が「００００」以外の場合のこれらの属性情報は、外部リソースファイルのパスの情報や、放送伝送の名前空間への書き換えに関する情報を含むものである。
図中では、ｄａｔａｔｙｐｅ＝「００００」の場合と、ｄａｔａｔｙｐｅ≠「００００」との場合とのそれぞれに、異なるハッチングパターンを付して示している。 The type of the attribute following datatype differs according to the value of the above datatype except for the @datasize attribute. When the value of datatype is “0000”, @ font-face, @style, @region, and @subtitle are used as subsequent attributes. When the value of datatype is other than “0000”, @idref, @srcpath, @srcvalue, and @replaceto are used as subsequent attributes. These attribute information when the value of datatype is other than “0000” includes information on the path of the external resource file and information on rewriting to the name space of broadcast transmission.
In the figure, different hatching patterns are shown for datatype = “0000” and datatype ≠ “0000”, respectively.

＠ｄａｔａｓｉｚｅ属性は、当該リソースのデータサイズを示すものである。この属性は、ｄａｔａｔｙｐｅの値によらず記述することができる。 The @datasize attribute indicates the data size of the resource. This attribute can be described regardless of the value of datatype.

次に挙げる＠ｉｍａｇｅ属性、＠ｆｏｎｔ−ｆａｃｅ属性、＠ｓｔｙｌｅ属性、＠ｒｅｇｉｏｎ属性、＠ｓｕｂｔｉｔｌｅ属性は、いずれも、ｄａｔａｔｙｐｅの値が「００００」の場合（ＴＴＭＬ文書を表す）に記述されるものである。また、これらの＠ｉｍａｇｅ属性、＠ｆｏｎｔ−ｆａｃｅ属性、＠ｓｔｙｌｅ属性、＠ｒｅｇｉｏｎ属性、＠ｓｕｂｔｉｔｌｅ属性の値は、本ｕｎｉｔ要素で伝送する字幕データを伝送するため、番組単位のＴＴＭＬ文書を分割する際に、抽出すべき要素を指定している。つまり、分割後のＴＴＭＬ文書に含まれるテキスト文と、それらのテキスト文から参照するヘッダ部に記述された情報の参照関係を予め解析しておき、その解析結果（参照される要素の識別子の情報）を伝送情報（ｔｒａｎｓｍｉｓｓｉｏｎＩｎｆｏｒｍａｔｉｏｎ）の一部として含めておく。言い換えれば、伝送単位（ｕｎｉｔ）ごとに、含まれるテキスト文の識別子と、参照先のヘッダ部内の情報の識別子の情報を保持しておくようにする。識別子はＴＴＭＬ文書内のｉｍａｇｅ要素、ｆｏｎｔ−ｆａｃｅ要素、ｓｔｙｌｅ要素、ｒｅｇｉｏｎ要素、字幕本文のｄｉｖ要素やｐ要素のｘｍｌ：ｉｄ属性として指定された識別子を利用する。＠ｉｍａｇｅ属性、＠ｆｏｎｔ−ｆａｃｅ属性、＠ｓｔｙｌｅ属性、＠ｒｅｇｉｏｎ属性、＠ｓｕｂｔｉｔｌｅ属性には、それぞれ複数の識別子を記述することができ、複数の属性値を記述した場合は、複数の要素を分割後のＴＴＭＬ文書に含めることを意味する。なお、ｄａｔａｔｙｐｅの値が「００００」であるｒｅｓｏｕｒｃｅ要素は必ず１つのみ存在する。 The following @image attribute, @ font-face attribute, @style attribute, @region attribute, and @subtitle attribute are all described when the value of datatype is “0000” (represents a TTML document). is there. Also, the values of these @image attribute, @ font-face attribute, @style attribute, @region attribute, and @subtitle attribute divide the TTML document for each program in order to transmit subtitle data transmitted by this unit element. The element to be extracted is specified. In other words, the reference relation between the text sentence included in the divided TTML document and the information described in the header section referred to from the text sentence is analyzed in advance, and the analysis result (information on the identifier of the referenced element) ) Is included as part of transmission information (transmissionInformation). In other words, for each transmission unit (unit), the identifier of the included text sentence and the information of the identifier of the information in the header part of the reference destination are held. As an identifier, an identifier specified as an xml: id attribute of an image element, a font-face element, a style element, a region element, a div element of a caption text, or a p element in a TTML document is used. In @image attribute, @ font-face attribute, @style attribute, @region attribute, and @subtitle attribute, multiple identifiers can be described respectively. When multiple attribute values are described, multiple elements are divided. It means to be included in a later TTML document. Note that there is always only one resource element whose datatype value is “0000”.

ｒｅｓｏｕｒｃｅ要素の＠ｓｕｂｔｉｔｌｅ属性は、ＴＴＭＬ文書中のｔｔ／ｂｏｄｙ／ｄｉｖ／ｄｉｖ要素もしくはｔｔ／ｂｏｄｙ／ｄｉｖ／ｐ要素（これらはいずれも、字幕テキスト）における識別子を規定する。なお、ｔｔ／ｂｏｄｙ／ｄｉｖ／ｄｉｖ要素およびｔｔ／ｂｏｄｙ／ｄｉｖ／ｐ要素においては、ｘｍｌ：ｉｄ属性によってその字幕テキストの識別子を規定する。＠ｓｕｂｔｉｔｌｅ属性に記述する情報は、番組単位の字幕テキスト文のうち、当該伝送単位（ｕｎｉｔ）にどのテキスト文を含めるかの情報であり、時刻解析部１２が生成する断片化情報に該当する。 The @subtitle attribute of the resource element defines an identifier in the tt / body / div / div element or the tt / body / div / p element (both are subtitle text) in the TTML document. In the tt / body / div / div element and the tt / body / div / p element, the identifier of the caption text is defined by the xml: id attribute. The information described in the @subtitle attribute is information indicating which text sentence is included in the transmission unit (unit) among the subtitle text sentences of the program unit, and corresponds to the fragmentation information generated by the time analysis unit 12.

ｒｅｓｏｕｒｃｅ要素の＠ｉｍａｇｅ属性は、ＴＴＭＬ文書中の、ｔｔ／ｈｅａｄ／ｍｅｔａｄａｔａ／ｓｍｐｔｅ：ｉｍａｇｅ要素（イメージ）における識別子を指定する。なお、ｔｔ／ｈｅａｄ／ｍｅｔａｄａｔａ／ｓｍｐｔｅ：ｉｍａｇｅ要素においては、＠ｘｍｌ：ｉｄ属性によってそのイメージの識別子を規定する。 The @image attribute of the resource element specifies an identifier in the tt / head / metadata / smpte: image element (image) in the TTML document. In the tt / head / metadata / smpte: image element, the identifier of the image is defined by the @xml: id attribute.

ｒｅｓｏｕｒｃｅ要素の＠ｆｏｎｔ−ｆａｃｅ属性は、ＴＴＭＬ文書中のｔｔ／ｈｅａｄ／ｓｔｙｌｉｎｇ／ａｒｉｂ−ｔｔ：ｆｏｎｔ−ｆａｃｅ要素（フォント）における識別子を指定する。なお、ｔｔ／ｈｅａｄ／ｓｔｙｌｉｎｇ／ａｒｉｂ−ｔｔ：ｆｏｎｔ−ｆａｃｅ要素においては、ｉｄ属性によってそのフォントフェースの識別子を規定する。 The @ font-face attribute of the resource element specifies an identifier in the tt / head / styling / arib-tt: font-face element (font) in the TTML document. In the tt / head / styling / arib-tt: font-face element, the identifier of the font face is defined by the id attribute.

ｒｅｓｏｕｒｃｅ要素の＠ｓｔｙｌｅ属性は、ＴＴＭＬ文書中のｔｔ／ｈｅａｄ／ｓｔｙｌｉｎｇ／ｓｔｙｌｅ要素（様々な表示スタイルの規定）における識別子を指定する。なお、ｔｔ／ｈｅａｄ／ｓｔｙｌｉｎｇ／ｓｔｙｌｅ要素においては、ｘｍｌ：ｉｄ属性によってそのスタイルの識別子を規定する。 The @style attribute of the resource element specifies an identifier in a tt / head / styling / style element (definition of various display styles) in the TTML document. In the tt / head / styling / style element, the identifier of the style is defined by the xml: id attribute.

ｒｅｓｏｕｒｃｅ要素の＠ｒｅｇｉｏｎ属性は、ＴＴＭＬ文書中のｔｔ／ｈｅａｄ／ｌａｙｏｕｔ／ｒｅｇｉｏｎ要素（表示の領域）における識別子を規定する。なお、ｔｔ／ｈｅａｄ／ｌａｙｏｕｔ／ｒｅｇｉｏｎ要素においては、ｘｍｌ：ｉｄ属性によってその領域の識別子を規定する。 The @region attribute of the resource element defines an identifier in the tt / head / layout / region element (display area) in the TTML document. In the tt / head / layout / region element, the identifier of the area is defined by the xml: id attribute.

なお、上記＠ｉｍａｇｅ属性、＠ｆｏｎｔ−ｆａｃｅ属性、＠ｓｔｙｌｅ属性、＠ｒｅｇｉｏｎ属性に記述する情報は、参照関係解析部１４が生成する各断片から参照されるテキスト文書のヘッダ記述の情報であり、それぞれ埋め込み画像の情報、非組込フォントの情報、スタイルの情報、字幕提示の領域情報などの参照関係情報に該当する。 Note that the information described in the @image attribute, @ font-face attribute, @style attribute, and @region attribute is information on the header description of the text document referenced from each fragment generated by the reference relationship analysis unit 14, These correspond to reference relationship information such as embedded image information, non-embedded font information, style information, and subtitle presentation area information.

ｒｅｓｏｕｒｃｅ要素のｄａｔａｔｙｐｅの値が「００００」以外の場合は、前述のとおり、リソースがＡＲＩＢ−ＴＴＭＬ文書以外であることを示し、ｒｅｓｏｕｒｃｅ要素には、当該伝送単位に含まれるが外部参照するリソースに関する情報を、リソース毎に記述する。つまり、ｄａｔａｔｙｐｅの値が「００００」以外のｒｅｓｏｕｒｃｅ要素が、ｕｎｉｔ要素にi個記述されている場合は、ＡＲＩＢ−ＴＴＭＬ文書以外に、i個のリソースを伝送単位として送出する事を意味する。
ｒｅｓｏｕｒｃｅ要素の＠ｉｄｒｅｆ属性は、伝送単位として一緒に送出するＡＲＩＢ−ＴＴＭＬ文書において、外部リソースの参照を行っている要素を指定するものである。具体的には、＠ｉｄｒｅｆ属性は、外部リソースを行っている要素の識別子（ｘｍｌ：ｉｄ属性）を用いる。
ｒｅｓｏｕｒｃｅ要素の＠ｓｒｃｐａｔｈ属性（ソースパス）は、上記のｉｄｒｅｆ属性で指定した要素を起点とした、リソースファイルのロケーションを指定する属性へのパスをｘｐａｔｈ（ｈｔｔｐ：／／ｗｗｗ．ｗ３．ｏｒｇ／ＴＲ／ｘｐａｔｈ／）により指定するものである。
ｒｅｓｏｕｒｃｅ要素の＠ｓｒｃｖａｌｕｅ属性（ソースバリュー）は、上記のｓｒｃｐａｔｈ属性で示した属性の値（リソースファイルのロケーション情報）である。
ｒｅｓｏｕｒｃｅ要素の＠ｒｅｐｌａｃｅｔｏ属性（リプレース・トゥ）は、当該リソースを放送で伝送した場合に、受信機が当該リソースを放送信号中から取得できるようにするため、放送の名前空間によるリソースのロケーションを指定するものである。つまり、ｒｅｐｌａｃｅｔｏ属性は、放送として伝送されるときには、ファイルの元の名（ｓｒｃｖａｌｕｅ属性で指定される値）を、このｒｅｐｌａｃｅｔｏ属性で指定される名に置き換えることを指定するものである。
ｒｅｓｏｕｒｃｅ要素の＠ｉｄｒｅｆ属性、＠ｓｒｃｐａｔｈ属性、＠ｓｒｃｖａｌｕｅ属性、＠ｒｅｐｌａｃｅｔｏ属性の一連の属性によって、ｕｎｉｔ属性で指定した字幕の伝送単位において、ＡＲＩＢ−ＴＴＭＬ文書から参照するリソースファイルの存在と、リソースファイルのロケーション情報を置き換えに必要な情報を指定することができる。つまり、これらの情報は、変換情報解析部１３が生成する、放送ロケーション変換情報に該当する。 When the datatype value of the resource element is other than “0000”, as described above, it indicates that the resource is other than the ARIB-TTML document, and the resource element includes information on the resource included in the transmission unit but externally referenced. Is described for each resource. That is, when i resource elements other than “0000” in the datatype value are described in the unit element, this means that i resources are transmitted as a transmission unit in addition to the ARIB-TTML document.
The @idref attribute of the resource element specifies an element that refers to an external resource in the ARIB-TTML document that is sent together as a transmission unit. Specifically, for the @idref attribute, an identifier (xml: id attribute) of an element that performs an external resource is used.
For the @srcpath attribute (source path) of the resource element, the path to the attribute that specifies the location of the resource file starting from the element specified by the idref attribute is xpath (http://www.w3.org/TR / Xpath /).
The @srcvalue attribute (source value) of the resource element is the attribute value (resource file location information) indicated by the srcpath attribute.
The resource element's @replaceto attribute (replace to) specifies the location of the resource in the broadcast namespace so that the receiver can obtain the resource from the broadcast signal when the resource is transmitted in broadcast To do. That is, the replaceto attribute specifies that the original name of the file (value specified by the srcvalue attribute) is replaced with the name specified by the replaceto attribute when transmitted as a broadcast.
The existence of a resource file to be referred to from the ARIB-TTML document and the resource file in the transmission unit of the subtitle specified by the unit attribute by the series of attributes of @resource attribute, @srcref attribute, @srcpath attribute, @srcvalue attribute, and @replaceto attribute It is possible to specify information necessary for replacing the location information. That is, these pieces of information correspond to broadcast location conversion information generated by the conversion information analysis unit 13.

なお、上記の、ｒｅｓｏｕｒｃｅ要素の＠ｓｒｃｖａｌｕｅ属性は、断片に含まれるテキスト文から参照されるリソースファイル（画像ファイルや音声ファイルや非組込フォントファイル等）のロケーション情報である。
また、上記の、ｒｅｓｏｕｒｃｅ要素の＠ｓｒｃｐａｔｈ属性は、前記リソースファイル（画像ファイルや音声ファイルや非組込フォントファイル等）のロケーション情報がテキスト文書データのどの部分に記述されているかを示すロケーション情報記述位置指定情報である。
また、上記の、ｒｅｓｏｕｒｃｅ要素の＠ｒｅｐｌａｃｅｔｏ属性は、前記リソースファイル（画像ファイルや音声ファイルや非組込フォントファイル等を断片と共に放送により伝送する際の放送信号中のリソースの取得位置を特定するための放送の名前空間による放送ロケーション情報である。
ここで述べたロケーション情報と、ロケーション情報記述位置指定情報と、放送の名前空間による放送ロケーション情報とを含むものが、放送ロケーション変換情報である。 Note that the @srcvalue attribute of the resource element is location information of a resource file (such as an image file, an audio file, or a non-embedded font file) that is referenced from a text sentence included in the fragment.
Also, the @srcpath attribute of the resource element described above is a location information description that indicates in which part of the text document data the location information of the resource file (image file, audio file, non-embedded font file, etc.) is described. It is position designation information.
In addition, the above-described @replaceto attribute of the resource element is used to specify the resource acquisition position in the broadcast signal when transmitting the resource file (image file, audio file, non-embedded font file, etc. together with the fragment) Broadcast location information by the name space of the broadcast.
Broadcast location conversion information includes the location information described here, location information description position designation information, and broadcast location information based on a broadcast name space.

次に、本実施形態における処理手順について説明する。
図４は、分割装置１による処理の手順を示すフローチャートである。
同図に示すように、まずステップＳ１１において、取得部１１は、テキスト文書データ８１を取得し、取得したテキスト文書データ８１に含まれる各要素にＩＤ（識別子）を付与済みであるか否かを判断する。この判断は、テキスト文書データ８１の各要素に関してＩＤが付与済みである場合には、それらの付与済みのＩＤを利用することによって、再付与の処理をスキップするためのものである。そして、ＩＤが付与済みである場合（ステップＳ１１：ＹＥＳ）には、ステップＳ１３に飛ぶ。また、ＩＤが付与されていない場合（ステップＳ１１：ＮＯ）には、次のステップＳ１２に進む。 Next, a processing procedure in the present embodiment will be described.
FIG. 4 is a flowchart illustrating a processing procedure performed by the dividing apparatus 1.
As shown in the figure, first, in step S11, the acquisition unit 11 acquires text document data 81, and determines whether or not an ID (identifier) has been assigned to each element included in the acquired text document data 81. to decide. This determination is for skipping the reassignment process by using the assigned IDs when the IDs have been assigned to the elements of the text document data 81. If the ID has been assigned (step S11: YES), the process jumps to step S13. If no ID is assigned (step S11: NO), the process proceeds to the next step S12.

次にステップＳ１２に進んだ場合、取得部１１は、テキスト文書データ８１に含まれている各要素の適宜ＩＤを付与する。なお、ここで付与するＩＤは、要素を識別できるものであれば充分である。具体的に、本ステップにおいてＩＤが付与される要素は、ＴＴＭＬ文書データにおける、次の６種類の要素である。即ち；
− ｔｔ／ｈｅａｄ／ｍｅｔａｄａｔａ／ｓｍｐｔｅ：ｉｍａｇｅ
− ｔｔ／ｈｅａｄ／ｓｔｙｌｉｎｇ／ａｒｉｂ−ｔｔ／ｆｏｎｔ−ｆａｃｅ
− ｔｔ／ｈｅａｄ／ｓｔｙｌｉｎｇ／ｓｔｙｌｅ
− ｔｔ／ｈｅａｄ／ｌａｙｏｕｔ／ｒｅｇｉｏｎ
− ｔｔ／ｈｅａｄ／ｄｉｖ／ｄｉｖ
− ｔｔ／ｈｅａｄ／ｄｉｖ／ｐ Next, when the process proceeds to step S <b> 12, the acquisition unit 11 assigns an appropriate ID of each element included in the text document data 81. The ID assigned here is sufficient if it can identify the element. Specifically, the elements to which IDs are assigned in this step are the following six types of elements in the TTML document data. Ie;
-Tt / head / metadata / smpte: image
-Tt / head / styling / arib-tt / font-face
-Tt / head / styling / style
-Tt / head / layout / region
-Tt / head / div / div
-Tt / head / div / p

ここで、要素の種類を容易に区別できるようなＩＤの付与のしかたをしても良い。例えば、次の通りである。
ｔｔ／ｈｅａｄ／ｍｅｔａｄａｔａ／ｓｍｐｔｅ：ｉｍａｇｅの要素に対しては、「ＳＭＰＴＥ」で始まるＩＤを付与する。一例としては、「ＳＭＰＴＥ＿ｌｏｇｏ１６」などといったＩＤを付与する。
ｔｔ／ｈｅａｄ／ｓｔｙｌｉｎｇ／ａｒｉｂ−ｔｔ／ｆｏｎｔ−ｆａｃｅの要素に対しては、「ｆ」で始まり、その後に連続番号を伴うＩＤを付与する。一例としては、「ｆ０１」、「ｆ０２」、・・・などといったＩＤを付与する。
ｔｔ／ｈｅａｄ／ｓｔｙｌｉｎｇ／ｓｔｙｌｅの要素やｔｔ／ｈｅａｄ／ｌａｙｏｕｔ／ｒｅｇｉｏｎの要素に対しては、「ｓ」で始まり、その後に連続番号を伴うＩＤを付与する。一例としては、「ｓ１」、「ｓ２」、・・・などといったＩＤを付与する。
ｔｔ／ｈｅａｄ／ｄｉｖ／ｄｉｖやｔｔ／ｈｅａｄ／ｄｉｖ／ｐの要素に対しては、「ｃ」で始まり、その後に連続番号を伴うＩＤを付与する。一例としては、「ｃ００１」、「ｃ００２」、・・・などといったＩＤを付与する。
このように各要素にＩＤを付与することにより、以後の処理において、そのＩＤによってそれぞれの要素を参照することができる。 Here, an ID may be given so that the types of elements can be easily distinguished. For example:
tt / head / metadata / smpte: An ID starting with “SMPTE” is assigned to the image element. As an example, an ID such as “SMPTE_logo16” is assigned.
For an element of tt / head / styling / arib-tt / font-face, an ID starting with “f” followed by a serial number is assigned. As an example, IDs such as “f01”, “f02”,.
For an element of tt / head / styling / style and an element of tt / head / layout / region, an ID starting with “s” and subsequently accompanied by a serial number is assigned. As an example, IDs such as “s1”, “s2”,.
For the elements of tt / head / div / div and tt / head / div / p, an ID starting with “c” and followed by a serial number is assigned. As an example, IDs such as “c001”, “c002”,.
By assigning an ID to each element in this way, each element can be referred to by the ID in the subsequent processing.

次にステップＳ１３において、時刻解析部１２は、テキスト文書データ８１に含まれる全ての字幕文テキストに付与されている提示時刻の解析を行い、そして各々が字幕文テキストを有する複数のグループに断片化する断片化情報を生成する。時刻解析部１２による断片化の方法は、任意である。通常は、放送番組において特定の字幕テキストが表示されている時間（提示開始時刻から提示終了時刻まで）は、数秒から、せいぜい十数秒の範囲内に収まることが多い。また、番組の途中から視聴を開始する視聴者がいることを考慮すると、１つの伝送単位があまり長い時間（例えば１０秒、あるいはそれ以上）に渡ることは好ましくない。一例として、時刻解析部１２は、所定の時間（数秒程度）を超えるごとに伝送単位を区切る。また、他の例として、時刻解析部１２は、字幕テキストに対応する１つのｄｉｖ要素あるいはｐ要素ごとに、伝送単位を区切る。その他、伝送容量を考慮して、時刻解析部１２による伝送単位の区切り方を決めても良い。伝送単位の区切り方の詳細は、一種の設計事項である。
いずれの方法を取るにせよ、時刻解析部１２は、断片化した結果の時刻の区切りに含まれる字幕テキスト文（ｔｔ／ｈｅａｄ／ｄｉｖ／ｄｉｖ要素やｔｔ／ｈｅａｄ／ｄｉｖ／ｐ要素）のＩＤのリストの情報である断片化情報を生成して、参照関係解析部１４および分割部１５に渡す。 Next, in step S13, the time analysis unit 12 analyzes the presentation times given to all the caption texts included in the text document data 81, and fragments into a plurality of groups each having caption texts. Generate fragmentation information. The method of fragmentation by the time analysis unit 12 is arbitrary. Usually, the time during which a specific subtitle text is displayed in a broadcast program (from the presentation start time to the presentation end time) often falls within a range from several seconds to at most ten and several seconds. Also, considering that there is a viewer who starts watching from the middle of a program, it is not preferable that one transmission unit extends for a very long time (for example, 10 seconds or more). As an example, the time analysis unit 12 divides a transmission unit every time a predetermined time (about several seconds) is exceeded. As another example, the time analysis unit 12 divides the transmission unit for each div element or p element corresponding to the caption text. In addition, the transmission unit may be determined by the time analysis unit 12 in consideration of the transmission capacity. The details of how to separate transmission units are a kind of design matter.
Regardless of which method is used, the time analysis unit 12 uses the ID of the subtitle text sentence (tt / head / div / div element or tt / head / div / p element) included in the fragmented time separator. Fragmentation information, which is list information, is generated and passed to the reference relationship analysis unit 14 and the division unit 15.

次にステップＳ１４において、参照関係解析部１４は、断片化情報をもとに断片化された字幕テキストのグループからの、テキスト文書データ８１のヘッダ部に記述された情報への参照関係を解析し、参照関係情報を生成する。ここで、参照関係の解析の対象となるヘッダ部内の情報は、次の通りである。即ち、スタイル（ｓｔｙｌｅ要素）や、字幕提示の領域（ｒｅｇｉｏｎ要素）や、埋め込みイメージ（ｓｍｐｔｅ：ｉｍａｇｅ要素）や、非組込フォント（ａｒｉｂ−ｔ：ｆｏｎｔ−ｆａｃｅ要素）などである。
そして、参照関係解析部１４は、断片化情報に参照関係に関する情報を付加して、変換情報解析部１３および分割部１５に渡す。 Next, in step S14, the reference relationship analysis unit 14 analyzes the reference relationship from the fragmented caption text group to the information described in the header portion of the text document data 81 from the fragmented subtitle text group. , Generate reference relationship information. Here, the information in the header part to be analyzed for the reference relationship is as follows. That is, a style (style element), a subtitle presentation area (region element), an embedded image (smpte: image element), a non-embedded font (arib-t: font-face element), and the like.
Then, the reference relationship analysis unit 14 adds information related to the reference relationship to the fragmentation information and passes it to the conversion information analysis unit 13 and the division unit 15.

次にステップＳ１５において、変換情報解析部１３は、断片化された字幕テキストのグループに含まれるリソースファイルの参照のためのロケーション情報を解析し、放送ロケーション変換情報を生成し分割部１５に渡す。なお、変換情報解析部１３が生成する放送ロケーション変換情報は、具体的には、ｒｅｓｏｕｒｃｅ要素における＠ｓｒｃｐａｔｈ属性と、＠ｓｒｃｖａｌｕｅ属性と、＠ｒｅｐｌａｃｅｔｏ属性の、それぞれの値である。なお、放送ロケーション変換情報に含まれるこれらの属性については、図３を参照しながら説明した通りである。 Next, in step S15, the conversion information analysis unit 13 analyzes location information for referring to the resource file included in the fragmented subtitle text group, generates broadcast location conversion information, and passes it to the division unit 15. Note that the broadcast location conversion information generated by the conversion information analysis unit 13 is specifically the values of the @srcpath attribute, @srcvalue attribute, and @replaceto attribute in the resource element. Note that these attributes included in the broadcast location conversion information are as described with reference to FIG.

なお、時刻解析部１２、参照関係解析部１４、変換情報解析部１３による解析結果の情報生成（付加）の一例は、後で、図１０，図１１，図１２，図１３，図１４，図１５を参照しながら説明する。 An example of information generation (addition) of analysis results by the time analysis unit 12, the reference relationship analysis unit 14, and the conversion information analysis unit 13 will be described later with reference to FIG. 10, FIG. 11, FIG. 12, FIG. This will be described with reference to FIG.

次にステップＳ１６において、分割部１５は、ステップＳ１３において生成した断片化の情報とステップＳ１４において生成した参照関係の情報と、ステップＳ１５において生成した放送ロケーション情報への変換に関する情報に基づいて、入力されたテキスト文書データ８１を分割し、断片化された複数のテキスト文書データを生成する。 Next, in step S16, the dividing unit 15 inputs based on the fragmentation information generated in step S13, the reference relationship information generated in step S14, and the information related to the conversion to the broadcast location information generated in step S15. The divided text document data 81 is divided to generate a plurality of fragmented text document data.

次にステップＳ１７において、出力部１７は、分割部１５によって生成された断片化テキスト文書データが、外部イメージファイル、外部オーディオファイル、外部非組込フォントファイルなどを参照する場合は、断片化テキスト文書データと、リソースファイルデータ取得部１８によって取得した外部のリソースファイル８７を結合し、放送等で提供するフォーマットにしたがいパッケージ化した断片化字幕データ８５を生成する。この断片化字幕データ８５は、断片化によって区切られた時間帯の字幕テキストデータと、参照リソースファイルのデータとを含む。
そして、出力部１７は、複数の断片化字幕データ８５のそれぞれを、各断片に含まれる字幕テキストのうち一番早い提示開始時刻に合わせて（受信機側での提示に間に合うようなタイミングで）、出力（送出）する。
なお、分割部１５によるファイル分割、および出力部１７によるデータの送出の詳細な処理手順は、第２実施形態において説明する図９の手順と同様のものとしても良い。
以上で、本フローチャート全体の処理を終了する。 Next, in step S17, the output unit 17 determines that the fragmented text document data generated by the dividing unit 15 refers to an external image file, an external audio file, an external non-embedded font file, and the like. The data and the external resource file 87 acquired by the resource file data acquisition unit 18 are combined to generate fragmented subtitle data 85 packaged according to a format provided by broadcasting or the like. The fragmented subtitle data 85 includes subtitle text data in a time zone delimited by fragmentation, and reference resource file data.
Then, the output unit 17 matches each of the plurality of fragmented caption data 85 with the earliest presentation start time among the caption texts included in each fragment (at a timing in time for presentation on the receiver side). , Output (send).
The detailed processing procedure of file division by the dividing unit 15 and data transmission by the output unit 17 may be the same as the procedure of FIG. 9 described in the second embodiment.
Above, the process of the whole flowchart is complete | finished.

［第２実施形態］
次に、第２実施形態について説明する。なお、上述した実施形態と共通の事項については記載を省略し、本実施形態に特有の事項を中心に以下の説明を行う。
図５は、本実施形態による装置構成を示す概略ブロック図である。図示するように、本実施形態によるテキスト（字幕等）の分割・送出システムは、テキスト文書データ８１と、解析装置５と、情報付加済テキスト文書データ８３と、分割装置２（送出装置）と、断片化字幕データ８５とを含んで構成される。なお、テキスト文書データ８１と、情報付加済テキスト文書データ８３と、断片化字幕データ８５とは、適宜、記録媒体等に記録された形態で保持される。具体的には、データ記憶手段としては、磁気ハードディスク装置や、半導体メモリ等が用いられる。 [Second Embodiment]
Next, a second embodiment will be described. In addition, description is abbreviate | omitted about the matter which is common in embodiment mentioned above, and the following description is performed focusing on the matter peculiar to this embodiment.
FIG. 5 is a schematic block diagram showing a device configuration according to the present embodiment. As shown in the figure, the text (caption etc.) dividing / sending system according to the present embodiment includes a text document data 81, an analysis device 5, an information-added text document data 83, a dividing device 2 (sending device), And fragmented subtitle data 85. The text document data 81, the information-added text document data 83, and the fragmented caption data 85 are appropriately stored in a form recorded on a recording medium or the like. Specifically, a magnetic hard disk device, a semiconductor memory, or the like is used as the data storage means.

同図に示す解析装置５は、テキスト文書データ８１を読み込み、断片化のための解析を行い、解析結果を付加して、情報付加済テキスト文書データ８３を出力する。解析装置５が行う解析には、断片化するための提示時刻の解析と、断片化した結果のテキスト文からのテキスト文書データ８１のヘッダ部の情報への参照の解析と、断片化した結果のテキスト文がリソースファイルを参照する場合の、リソースファイルのロケーション情報の解析の結果を含む。
また、分割装置２は、上記の情報付加済テキスト文書データ８３を読み込み、各断片の字幕テキストに対応した複数の断片化テキスト文書データを生成し、生成された断片化テキスト文書データが、外部イメージファイル、外部オーディオファイル、外部非組込フォントファイルなどを参照する場合は、断片化テキスト文書データと、リソースファイルデータ取得部によって取得した外部ファイルデータを結合し、放送等で提供するフォーマットにしたがいパッケージ化した断片化字幕データ８５を生成する。断片化字幕データ８５は、入力された字幕テキストを、所定の提示時刻の範囲で区切って独立のまとまった単位のファイルとして構成されるものである。 The analysis device 5 shown in the figure reads the text document data 81, performs analysis for fragmentation, adds the analysis result, and outputs the information-added text document data 83. The analysis performed by the analysis apparatus 5 includes analysis of the presentation time for fragmentation, analysis of the reference to the information in the header part of the text document data 81 from the text sentence resulting from fragmentation, and the result of fragmentation. Contains the result of analyzing the location information of the resource file when the text statement references the resource file.
The dividing device 2 reads the information-added text document data 83, generates a plurality of fragmented text document data corresponding to the subtitle text of each fragment, and the generated fragmented text document data is an external image. When referring to files, external audio files, external non-embedded font files, etc., a package according to the format provided by broadcasting etc. by combining fragmented text document data and external file data acquired by the resource file data acquisition unit Fragmented fragmented caption data 85 is generated. The fragmented subtitle data 85 is configured as an independent unit file by dividing input subtitle text within a predetermined presentation time range.

図６は、本実施形態による解析装置５の概略機能構成を示すブロック図である。図示するように、解析装置５は、取得部３１と、時刻解析部３２と、変換情報解析部３３と、参照関係解析部３４と、付加部３６（送出情報生成部）とを含んで構成される。 FIG. 6 is a block diagram illustrating a schematic functional configuration of the analysis apparatus 5 according to the present embodiment. As shown in the figure, the analysis device 5 includes an acquisition unit 31, a time analysis unit 32, a conversion information analysis unit 33, a reference relationship analysis unit 34, and an addition unit 36 (transmission information generation unit). The

取得部３１は、時刻情報が付加されたテキストを含むテキスト文書データ８１を外部から取得する。
時刻解析部３２は、テキスト文書データ８１に含まれる各テキスト文に付加された時刻情報に基づいて、テキスト文書データ８１を複数のテキスト文のグループに断片化するための断片化情報を生成する。なお、断片化された後の各グループは、元のテキスト文書データ８１に含まれるテキスト文の時間範囲よりも、短い時間範囲のテキスト文を含むものである。
なお、時刻解析部３２は、生成した断片化情報を参照関係解析部３４および付加部３６に渡す。
参照関係解析部３４は、断片化されたテキスト文のグループである断片ごとに、その断片に含まれるテキスト文から参照されるテキスト文書データ８１のヘッダ部の情報を解析し、その断片と、参照される前記ヘッダ部の情報（ヘッダ記述）との関係を表す参照関係情報を生成する。
参照関係解析部３４は、断片化情報と生成した参照関係情報を変換情報解析部および付加部３６に渡す。
変換情報解析部３３は、断片化されたテキスト文のグループに含まれるリソースファイルの参照のためのロケーション情報を解析し、元のロケーション情報の記述を放送の名前空間によるロケーション情報へ書き換えるための、放送ロケーション変換情報を生成する。
変換情報解析部３３は、生成した放送ロケーション変換情報を付加部３６に渡す。
付加部３６は、取得部３１によって取得されたテキスト文書データ８１に、時刻解析部３２から渡された断片化情報と参照関係解析部３４から渡された参照関係情報と変換解析部３３から渡された放送ロケーション変換情報を付加して、情報付加済テキスト文書データ８３として出力する。 The acquisition unit 31 acquires text document data 81 including text to which time information is added from the outside.
The time analysis unit 32 generates fragmentation information for fragmenting the text document data 81 into a plurality of text sentence groups based on the time information added to each text sentence included in the text document data 81. Each group after fragmentation includes a text sentence in a time range shorter than the time range of the text sentence included in the original text document data 81.
The time analysis unit 32 passes the generated fragmentation information to the reference relationship analysis unit 34 and the addition unit 36.
The reference relationship analysis unit 34 analyzes the information of the header part of the text document data 81 referenced from the text sentence included in the fragment for each fragment which is a fragmented text sentence group, and the fragment and reference The reference relationship information representing the relationship with the information (header description) of the header portion to be generated is generated.
The reference relationship analysis unit 34 passes the fragmentation information and the generated reference relationship information to the conversion information analysis unit and addition unit 36.
The conversion information analysis unit 33 analyzes the location information for referring to the resource file included in the fragmented text sentence group, and rewrites the description of the original location information into the location information based on the broadcast namespace. Broadcast location conversion information is generated.
The conversion information analysis unit 33 passes the generated broadcast location conversion information to the addition unit 36.
The adding unit 36 is passed from the conversion analysis unit 33 to the text document data 81 acquired by the acquisition unit 31, the fragmentation information passed from the time analysis unit 32, the reference relationship information passed from the reference relationship analysis unit 34, and the conversion analysis unit 33. The broadcast location conversion information is added and output as information-added text document data 83.

図７は、本実施形態による分割装置２の概略機能構成を示すブロック図である。図示するように、分割装置２は、分割部３５と、出力部３７と、リソースファイルデータ取得部３８とを含んで構成される。 FIG. 7 is a block diagram illustrating a schematic functional configuration of the dividing device 2 according to the present embodiment. As illustrated, the dividing device 2 includes a dividing unit 35, an output unit 37, and a resource file data acquisition unit 38.

分割部３５は、情報付加済テキスト文書データ８３を読み込み、情報付加済テキスト文書データ８３に含まれる断片化情報に基づいて情報付加済テキスト文書データ８３に含まれるテキスト文書を分割し断片化テキストデータを生成する。
なお、分割部３５が読み込む情報付加済テキスト文書データ８３は、前述の通り、時刻情報が付加された複数のテキスト文を含むテキスト文書データ８１に、時刻情報に基づいてテキスト文の複数のグループに断片化するための断片化情報を付加し、さらに断片化されたテキスト文の各グループごとに、その断片から参照されるテキスト文書データ８１のヘッダ部の情報との関係を表す参照関係情報を付加し、さらに断片化されたテキスト文の各グループがリソースファイルを参照する場合に、リソースァイルの参照のためのロケーション情報を元のロケーション情報の記述から放送の名前空間によるロケーション情報へ書き換えるための放送ロケーション変換情報を付加してなるものである。 The dividing unit 35 reads the information-added text document data 83, divides the text document included in the information-added text document data 83 based on the fragmentation information included in the information-added text document data 83, and generates fragmented text data. Is generated.
Note that the information-added text document data 83 read by the dividing unit 35 is, as described above, the text document data 81 including a plurality of text sentences to which time information is added, and a plurality of groups of text sentences based on the time information. Fragmentation information for fragmentation is added, and for each group of fragmented text sentences, reference relation information representing the relationship with the information of the header portion of the text document data 81 referenced from the fragment is added In addition, when each group of fragmented text statements refers to a resource file, the broadcast for rewriting the location information for referring to the resource file from the original location information description to the location information in the broadcast namespace Location conversion information is added.

出力部３７は、分割部３５によって分割されたテキストに加え、分割されたテキスト文書が、外部イメージファイル、外部オーディオファイル、外部非組込フォントファイルなどを参照する場合は、分割されたテキストと、リソースファイルデータ取得部によって取得した外部リソースファイルデータを結合し、放送等で提供するフォーマットにしたがいパッケージ化した断片化字幕データ８５を生成し出力する。断片化字幕データ８５は、入力された字幕テキストを、所定の提示時刻の範囲で区切って独立のまとまった単位のファイルとして構成されるものである。このとき、出力部３７は、各断片に含まれる字幕テキストのうち一番早い提示時刻情報に合わせて、分割されたテキスト含む断片化字幕データを順次出力する。
リソースファイルデータ取得部３８は、情報付加済テキスト文書データ８３から参照されているリソースファイル８７を取得し、上記の出力部３７に渡す。 When the divided text document refers to an external image file, an external audio file, an external non-embedded font file, etc. in addition to the text divided by the dividing unit 35, the output unit 37 The external resource file data acquired by the resource file data acquisition unit is combined to generate and output fragmented subtitle data 85 packaged according to a format provided by broadcasting or the like. The fragmented subtitle data 85 is configured as an independent unit file by dividing input subtitle text within a predetermined presentation time range. At this time, the output unit 37 sequentially outputs fragmented subtitle data including the divided text in accordance with the earliest presentation time information among the subtitle texts included in each fragment.
The resource file data acquisition unit 38 acquires the resource file 87 referred to from the information-added text document data 83 and passes it to the output unit 37.

次に、本実施形態における処理手順について説明する。
図８は、解析装置５による処理の手順を示すフローチャートである。
同図に示すように、まずステップＳ３１において、取得部３１は、テキスト文書データ８１を取得し、取得したテキスト文書データ８１に含まれる各要素にＩＤ（識別子）を付与済みであるか否かを判断する。この判断は、テキスト文書データ８１の各要素に関してＩＤが付与済みである場合には、それらの付与済みのＩＤを利用することによって、再付与の処理をスキップするためのものである。そして、ＩＤが付与済みである場合（ステップＳ３１：ＹＥＳ）には、ステップＳ３３に飛ぶ。また、ＩＤが付与されていない場合（ステップＳ３１：ＮＯ）には、次のステップＳ３２に進む。 Next, a processing procedure in the present embodiment will be described.
FIG. 8 is a flowchart illustrating a processing procedure performed by the analysis apparatus 5.
As shown in the figure, first, in step S31, the acquisition unit 31 acquires the text document data 81, and determines whether or not each element included in the acquired text document data 81 has been given an ID (identifier). to decide. This determination is for skipping the reassignment process by using the assigned IDs when the IDs have been assigned to the elements of the text document data 81. If the ID has been assigned (step S31: YES), the process jumps to step S33. If no ID is assigned (step S31: NO), the process proceeds to the next step S32.

次にステップＳ３２に進んだ場合、取得部３１は、テキスト文書データ８１に含まれている各要素の適宜ＩＤを付与する。なお、本ステップにおける処理は、第１実施形態でのステップＳ１２における処理と同様である。よって、ここでは詳細な説明を省略する。 Next, when the process proceeds to step S <b> 32, the acquisition unit 31 assigns an appropriate ID of each element included in the text document data 81. Note that the processing in this step is the same as the processing in step S12 in the first embodiment. Therefore, detailed description is omitted here.

次にステップＳ３３において、時刻解析部３２は、テキスト文書データ８１に含まれる全ての字幕文テキストに付与されている提示時刻の解析を行い、そして複数の字幕文テキストのグループに断片化する断片化情報を生成する。なお、本ステップにおける処理は、第１実施形態でのステップＳ１３における処理と同様である。よって、ここでは詳細な説明を省略する。 Next, in step S33, the time analysis unit 32 analyzes the presentation times given to all the subtitle texts included in the text document data 81, and fragments into a plurality of subtitle sentence text groups. Generate information. Note that the processing in this step is the same as the processing in step S13 in the first embodiment. Therefore, detailed description is omitted here.

次にステップＳ３４において、参照関係解析部３４は、断片化情報をもとに断片化された字幕テキストのグループからの、テキスト文書データ８１のヘッダ部に記述された情報への参照関係を解析し、参照関係情報を付加する。なお、本ステップにおける処理は、第１実施形態でのステップＳ１４における処理と同様である。よって、ここでは詳細な説明を省略する。 Next, in step S34, the reference relationship analysis unit 34 analyzes the reference relationship from the fragmented subtitle text group to the information described in the header portion of the text document data 81 from the fragmented text group. Reference relation information is added. Note that the processing in this step is the same as the processing in step S14 in the first embodiment. Therefore, detailed description is omitted here.

次にステップＳ３５において、変換情報解析部３３は、断片化された字幕テキストのグループに含まれるリソースファイルの参照のためのロケーション情報を解析し、放送ロケーション変換情報を生成し、分割部に渡す。なお、本ステップにおける処理は、第１実施形態でのステップＳ１５における処理と同様である。よって、ここでは詳細な説明を省略する。 Next, in step S35, the conversion information analysis unit 33 analyzes the location information for referring to the resource file included in the fragmented caption text group, generates broadcast location conversion information, and passes it to the dividing unit. Note that the processing in this step is the same as the processing in step S15 in the first embodiment. Therefore, detailed description is omitted here.

次にステップＳ３６において、付加部３６は、テキスト文書データ８１を適切に分割するために必要な情報を付加する。ここで付加部３６が付加する情報は、大きく、ステップＳ３３において生成された断片化情報と、ステップＳ３４において生成された参照関係情報と、ステップＳ３５において生成された放送ロケーション変換情報である。付加部３６は、ＴＴＭＬ文書データとしてのテキスト文書データ８１におけるヘッダ部（ｈｅａｄ要素）の中の、メタデータ（ｍｅｔａｄａｔａ要素）の部分に上記の付加情報を格納し、情報付加済テキスト文書データ８３として出力する。 In step S36, the adding unit 36 adds information necessary for appropriately dividing the text document data 81. Here, the information added by the adding unit 36 is largely fragmentation information generated in step S33, reference relationship information generated in step S34, and broadcast location conversion information generated in step S35. The adding unit 36 stores the additional information in the metadata (metadata element) portion of the header (head element) in the text document data 81 as the TTML document data, and the information added text document data 83 is stored. Output.

なお、本実施形態においても、テキスト文書データ８１におけるヘッダ部に上記の付加情報を格納することによって情報付加済テキスト文書データ８３を出力する代わりに、付加情報を例えば別ファイルの形態として生成し、分割装置２に渡すようにしても良い。 In this embodiment, instead of outputting the information-added text document data 83 by storing the additional information in the header portion of the text document data 81, the additional information is generated in the form of a separate file, for example. You may make it pass to the division | segmentation apparatus 2. FIG.

次に、分割装置２の処理手順について説明する。
図９は、分割装置２による処理の手順を示すフローチャートである。
同図に示すように、まずステップＳ４１において、分割部３５は、付加情報を含むテキスト文書データである情報付加済テキスト文書データ８３を読み込む。この情報付加済テキスト文書データ８３はＸＭＬ文書データの一種であり、分割部３５は読み込んだＸＭＬデータをパージングすることにより、ＤＯＭ（ドキュメントオブジェクトモデル，Document Object Model）を作成する。これにより、分割部３５は、読み込んだ情報付加済テキスト文書データ８３の構成をツリー構造で把握する。 Next, a processing procedure of the dividing device 2 will be described.
FIG. 9 is a flowchart illustrating a processing procedure performed by the dividing device 2.
As shown in the figure, first, in step S41, the dividing unit 35 reads the information added text document data 83 which is text document data including additional information. This information-added text document data 83 is a kind of XML document data, and the dividing unit 35 parses the read XML data to create a DOM (Document Object Model). Thereby, the dividing unit 35 grasps the configuration of the read information-added text document data 83 in a tree structure.

次のステップＳ４２からＳ４５までの処理は、ステップＳ４１で読み込んだデータのメタデータ内に含まれる伝送単位（ｕｎｉｔ要素）ごとに繰り返す。 The processing from the next step S42 to S45 is repeated for each transmission unit (unit element) included in the metadata of the data read in step S41.

ステップＳ４２において、分割部３５は、ユニット（伝送単位、ｕｎｉｔ要素）内の１つ目のリソース（ｒｅｓｏｕｒｃｅ要素）を読み込み、出力要素を追加する。ここで、出力要素とは、埋め込み画像（ｉｍａｇｅ属性により指定）と、非組込フォントフォント（ｆｏｎｔ−ｆａｃｅ属性により指定）と、スタイル（ｓｔｙｌｅ属性により指定）と、字幕提示の領域（ｒｅｇｉｏｎ属性により指定）と、字幕テキスト文（ｓｕｂｔｉｔｌｅ属性により指定）のための要素（ｐ要素やｄｉｖ要素）を追加する。なお、あるユニット内の最初のリソースに関して、データタイプ（ｄａｔａｔｙｐｅ属性）の値は、必ず「００００」である。 In step S42, the dividing unit 35 reads the first resource (resource element) in the unit (transmission unit, unit element), and adds an output element. Here, the output elements are an embedded image (specified by the image attribute), a non-embedded font font (specified by the font-face attribute), a style (specified by the style attribute), and a subtitle presentation area (by the region attribute). And an element (p element or div element) for subtitle text (specified by the subtitle attribute). Note that the value of the data type (datatype attribute) is always “0000” for the first resource in a unit.

なお、ステップＳ４２における処理の詳細は次の通りである。
分割部３５は、ユニット内の１つ目のリソースを読み込み、そのリソース（ｒｅｓｏｕｒｃｅ要素）の属性ごとに、下記の（１）から（５）までの処理を行うことによって、空のＴＴＭＬ文書に要素を追加する。なお、空のＴＴＭＬ文書とは、「＜ｔｔ＞＜／ｔｔ＞」（ｔｔ要素の開始と終了）のみからなる文書である。なお、下の説明において、ＩＤリストとは、単数または複数のＩＤ（識別子）を持ち得るリストの表現である。具体的な例として、ＩＤリストが複数のＩＤを含む場合には、それら複数のＩＤを空白文字で区切って並べた文字列が、ＩＤリストである。 Details of the processing in step S42 are as follows.
The dividing unit 35 reads the first resource in the unit, and performs the following processing (1) to (5) for each attribute of the resource (resource element), thereby generating an element in an empty TTML document. Add An empty TTML document is a document consisting only of “<tt></tt>” (start and end of tt element). In the description below, an ID list is an expression of a list that can have one or more IDs (identifiers). As a specific example, when the ID list includes a plurality of IDs, a character string in which the plurality of IDs are separated by a blank character and arranged is the ID list.

（１）１つ目のｒｅｓｏｕｒｃｅ要素のｉｍａｇｅ属性に指定されたＩＤリストを基に、入力側のＴＴＭＬ文書（情報付加済テキスト文書データ８３のこと。以下においても、同様。）中のｔｔ／ｈｅａｄ／ｍｅｔａｄａｔａ／ｓｍｐｔｅ：ｉｍａｇｅ要素であって上記ＩＤリストと同一のＩＤを持つ要素を、出力側のＴＴＭＬ文書（断片化テキスト文書データのこと。以下においても、同様。）中にコピーする。
（２）１つ目のｒｅｓｏｕｒｃｅ要素のｆｏｎｔ−ｆａｃｅ属性に指定されたＩＤリストを基に、入力側のＴＴＭＬ文書中のｔｔ／ｈｅａｄ／ｓｔｙｌｉｎｇ／ａｒｉｂ−ｔｔ：ｆｏｎｔ−ｆａｃｅ要素であって同一のＩＤを持つ要素を、出力側のＴＴＭＬ文書中にコピーする。
（３）１つ目のｒｅｓｏｕｒｃｅ要素のｓｔｙｌｅ属性に指定されたＩＤリストを基に、入力側のＴＴＭＬ文書中のｔｔ／ｈｅａｄ／ｓｙｌｉｎｇ／ｓｔｙｌｅ要素であって同一のＩＤを持つ要素を、出力側のＴＴＭＬ文書中にコピーする。
（４）１つ目のｒｅｓｏｕｒｃｅ要素のｒｅｇｉｏｎ属性に指定されたＩＤリストを基に、入力側のＴＴＭＬ文書中のｔｔ／ｈｅａｄ／ｌａｙｏｕｔ／ｒｅｇｉｏｎ要素であって同一のＩＤを持つ要素を、出力側のＴＴＭＬ文書中にコピーする。
（５）１つ目のｒｅｓｏｕｒｃｅ要素のｓｕｂｔｉｔｌｅ属性に指定されたＩＤリストを基に、入力側のＴＴＭＬ文書中のｔｔ／ｂｏｄｙ／ｄｉｖ要素の下の、ｄｉｖ要素またはｐ要素であって、同一のＩＤを持つ要素を、出力側のＴＴＭＬ文書中にコピーする。 (1) tt / head in the TTML document (information-added text document data 83; the same applies hereinafter) based on the ID list specified in the image attribute of the first resource element / Metadata / smpte: An image element having the same ID as that in the ID list is copied into an output TTML document (fragmented text document data; the same applies hereinafter).
(2) tt / head / styling / arib-tt: font-face elements in the TTML document on the input side based on the ID list specified in the font-face attribute of the first resource element, which are the same The element having the ID is copied into the TTML document on the output side.
(3) Based on the ID list specified in the style attribute of the first resource element, an tt / head / syling / style element in the TTML document on the input side having the same ID is output on the output side. Copy into the TTML document.
(4) Based on the ID list specified in the region attribute of the first resource element, the tt / head / layout / region element having the same ID in the TTML document on the input side is output to the output side. Copy into the TTML document.
(5) Based on the ID list specified in the subtitle attribute of the first resource element, it is a div element or p element under the tt / body / div element in the input side TTML document, and the same The element having the ID is copied into the TTML document on the output side.

次のステップＳ４３の処理は、現在のｕｎｉｔ要素内の２つ目以後のｒｅｓｏｕｒｃｅ要素の各々について、繰り返して実行する。なお、２つ目以後のｒｅｓｏｕｒｃｅ要素においては、ｄａｔａｔｙｐｅ属性の値は「００００」以外である。
ステップＳ４３において、分割部３５は、ｕｎｉｔ要素内の２つ目以後のリソース要素を読み込み、下記の（１）、（２）の処理を行う。
（１）出力側の文書中のｉｄｒｅｆ要素で指定されたＩＤを持つ要素を起点に、ｓｒｃｐａｔｈ要素に記述されているｘｐａｔｈによって指定された要素または属性の値を、ｒｅｐｌａｃｅｔｏ要素で指定さえた値に置き換える。
（２）ｓｒｃｖａｌｕｅ属性で指定された外部リソースファイルを、放送伝送用のデータ形式に変換する。なお、具体的な変換方法は、伝送方式によって異なる。例えばＴＳ（トランスポートストリーム）方式の場合には、カルーセル伝送用のデータに変換する。また、ＭＭＴ（ＭＰＥＧメディアトランスポート）方式の場合には、ＭＭＴのＭＦＵ（メディアフラグメントユニット）に変換する。 The processing in the next step S43 is repeatedly executed for each of the second and subsequent resource elements in the current unit element. In the second and subsequent resource elements, the value of the datatype attribute is other than “0000”.
In step S43, the dividing unit 35 reads the second and subsequent resource elements in the unit element, and performs the following processes (1) and (2).
(1) Starting from the element having the ID specified by the idref element in the output side document, the value of the element or attribute specified by xpath described in the srcpath element is set to the value specified by the replaceto element. replace.
(2) The external resource file specified by the srcvalue attribute is converted into a data format for broadcast transmission. The specific conversion method varies depending on the transmission method. For example, in the case of the TS (transport stream) system, the data is converted into data for carousel transmission. Also, in the case of the MMT (MPEG media transport) system, conversion to MMT MFU (media fragment unit) is performed.

当該ｕｎｉｔ要素内のすべてのｒｅｓｏｕｒｃｅ要素についてのステップＳ４３の処理が終了すると、次のステップＳ４４の処理に移る。
ステップＳ４４において、出力部３７は、伝送単位に応じて生成されたＴＴＭＬ文書（断片化テキスト文書データの１つ）を、放送として送出伝送するためのデータ形式に変換する。なお、具体的な変換方法は、伝送方式によって異なる。例えばＴＳ（トランスポートストリーム）方式の場合には、カルーセル伝送用のデータに変換する。また、ＭＭＴ（ＭＰＥＧメディアトランスポート）方式の場合には、ＭＭＴのＭＦＵ（メディアフラグメントユニット）に変換する。 When the process of step S43 is completed for all resource elements in the unit element, the process proceeds to the next step S44.
In step S44, the output unit 37 converts the TTML document (one of the fragmented text document data) generated according to the transmission unit into a data format for transmission as a broadcast. The specific conversion method varies depending on the transmission method. For example, in the case of the TS (transport stream) system, the data is converted into data for carousel transmission. Also, in the case of the MMT (MPEG media transport) system, conversion to MMT MFU (media fragment unit) is performed.

次にステップＳ４５において、出力部３７は、生成されたＴＴＭＬ文書（断片化テキスト文書データの１つ）とリソースファイル用のデータを、放送として送出伝送するためのデータ形式にパッケージ化し、ｔｉｍｅｃｏｄｅ要素（タイムコード）で指定された提示時刻での受信機側での提示に間に合うように、放送に多重するようにして送出する。具体的には、出力部３７は、指定された提示時刻と、伝送に要する時間と、送出装置側および受信機側のそれぞれの側での処理のオーバーヘッドとして必要な時間とに基づいて、間に合うようにデータの送出を行う。例えば、ＭＭＴ方式により伝送する場合は、ＭＦＵ（メディアフラグメントユニット）をＭＰＵ（メディアプロセッシングユニット）にパッケージ化して送出する。なお、パッケージ化した字幕データの構造の例については、後で、図１７を参照しながら説明する。 In step S45, the output unit 37 packages the generated TTML document (one of the fragmented text document data) and the data for the resource file into a data format for transmission as a broadcast, and a timecode element ( The broadcast is multiplexed and sent out in time for presentation on the receiver side at the presentation time designated by (time code). Specifically, the output unit 37 is in time for the designated presentation time, the time required for transmission, and the time required as processing overhead on each of the sending device side and the receiver side. Send data to. For example, in the case of transmission by the MMT method, an MFU (Media Fragment Unit) is packaged in an MPU (Media Processing Unit) and sent out. An example of the structure of packaged caption data will be described later with reference to FIG.

すべての伝送ユニット（ｕｎｉｔ要素）に関して、以上のＳ４１からＳ４５までの処理が終了すると、分割装置２は、このフローチャート全体の処理を終了する。 When the above processing from S41 to S45 is completed for all transmission units (unit elements), the dividing device 2 ends the processing of the entire flowchart.

次に、実際のデータ例について説明する。図１０、図１１、図１２、図１３、図１４、図１５は、図５において示した情報付加済テキスト文書データ８３の一例を示す概略図である。
この図１０、図１１、図１２、図１３、図１４、図１５を順に連結したデータが、１件の情報付加済テキスト文書データ８３に当たる。なお、情報付加済テキスト文書データ８３は、一種のＸＭＬデータである。これらの図の中において、便宜上、ファイル内の行番号を示している。これらの行番号自体は、ファイル内に含まれているものではない。以下では、これらの図および行番号を参照しながら、情報付加済テキスト文書データ８３の例について説明する。 Next, actual data examples will be described. 10, FIG. 11, FIG. 12, FIG. 13, FIG. 14 and FIG. 15 are schematic views showing an example of the text document data 83 with information added shown in FIG.
Data obtained by concatenating FIG. 10, FIG. 11, FIG. 12, FIG. 13, FIG. 14 and FIG. 15 corresponds to one piece of information-added text document data 83. The information-added text document data 83 is a kind of XML data. In these figures, line numbers in the file are shown for convenience. These line numbers themselves are not included in the file. Hereinafter, an example of the information-added text document data 83 will be described with reference to these drawings and line numbers.

なお、第１実施形態では分割装置１の分割部１５が、情報付加済テキスト文書データ８３と同等のデータを受け取る。ここで、情報付加済テキスト文書データ８３と同等のデータとは、テキスト文書データをどう分割するかを表す情報と、分割後のテキスト文書データからテキスト文書データのヘッダ部に記述された情報を参照するための参照関係を示す情報と、分割後のテキスト文書がリソースファイルを参照する場合にリソースファイルのロケーション情報を放送の名前空間に対応するようにテキスト文章データ中のロケーション情報をどう書き換えるかを表す情報であり、分割装置１の内部的な情報の形式は任意である。
また、第２実施形態では分割装置２の分割部３５が、解析装置５から渡される情報付加済テキスト文書データ８３を読み込む。 In the first embodiment, the dividing unit 15 of the dividing device 1 receives data equivalent to the information-added text document data 83. Here, the data equivalent to the information-added text document data 83 refers to information indicating how the text document data is divided and information described in the header portion of the text document data from the divided text document data. Information that indicates the reference relationship to be used, and how the location information in the text document data is rewritten so that the location information of the resource file corresponds to the broadcast namespace when the divided text document references the resource file The format of the internal information of the dividing device 1 is arbitrary.
In the second embodiment, the dividing unit 35 of the dividing device 2 reads the information-added text document data 83 passed from the analyzing device 5.

第２行目から第１０５行目までは、ｔｔ要素である。
第１０行目から第８５行目までは、ヘッダ部（ｈｅａｄ要素）である。
第８６行目から第１０４行目までは、ボディ部（ｂｏｄｙ要素）である。 The second to 105th lines are tt elements.
The 10th to 85th lines are header parts (head elements).
The 86th to 104th lines are body parts (body elements).

ヘッダ部内において、第１２行目から第４８行目までは、メタデータ（ｍｅｔａｄａｔａ要素）である。このメタデータは、字幕交換情報（ｃａｐｔｉｏｎＥｘｃｈａｎｇｅＩｎｆｏｒｍａｔｉｏｎ要素）と、埋め込みイメージ（ｓｍｐｔｅ：ｉｍａｇｅ要素）とを含む。
第１３行目から第４２行目までがキャプション交換情報である。
キャプション交換情報は、伝送情報（ｔｒａｎｓｍｉｓｓｉｏｎＩｎｆｏｒｍａｔｉｏｎ要素）を含む。
第１４行目から第４１行目までが伝送情報である。
また、第４５行目から第４７行目までが、埋め込みイメージである。
伝送情報は、複数の伝送単位のまとまり（ｔｒａｎｓｍｉｓｓｉｏｎＵｎｉｔｓ要素）を含んでいる。第１５行目から第４０行目までがｔｒａｎｓｍｉｓｓｉｏｎＵｎｉｔｓ要素である。
このｔｒａｎｓｍｉｓｓｉｏｎＵｎｉｔｓ要素は、個々に番号付けされた複数の伝送単位（ｕｎｉｔ要素）を有している。 In the header part, the 12th to 48th lines are metadata (metadata elements). This metadata includes caption exchange information (captionExchangeInformation element) and an embedded image (smpte: image element).
The 13th to 42nd lines are caption exchange information.
The caption exchange information includes transmission information (transmissionInformation element).
The transmission information is from the 14th line to the 41st line.
The 45th to 47th lines are embedded images.
The transmission information includes a group of transmission units (transmissionUnits element). The transmissionUnits element is from the 15th line to the 40th line.
The transmissionUnits element has a plurality of transmission units (unit elements) numbered individually.

個々の伝送単位（ｕｎｉｔ要素）は、時刻解析部（１２または３２）によって解析された結果として断片化された、断片に対応している。個々の伝送単位は、提示時刻（ｔｉｍｅｃｏｄｅ属性）を有している。提示時刻は、番組開始時点をゼロとする相対時刻であり、「ｈｈ：ｍｍ：ｓｓ．ｎｎｎ」（時−分−秒−ミリ秒）の形式の文字列として表現される。なお、＠ｔｉｍｅｃｏｄｅ属性の提示時刻は当該伝送単位に含まれる字幕テキストの提示開始時刻のうち、一番早い開始時間の値である。
本例においては、６個の伝送単位（ｕｎｉｔ要素）が存在し、それらのそれぞれがｘｍｌ：ｉｄ属性として「１」から「６」までの値をもっている。ｘｍｌ：ｉｄ属性が「１」である伝送単位は、第１６行目から第１９行目までである。ｘｍｌ：ｉｄ属性が「２」である伝送単位は、第２０行目から第２４行目までである。ｘｍｌ：ｉｄ属性が「３」である伝送単位は、第２５行目から第２８行目までである。ｘｍｌ：ｉｄ属性が「４」である伝送単位は、第２８行目から第３２行目までである。ｘｍｌ：ｉｄ属性が「５」である伝送単位は、第３４行目から第３６行目までである。ｘｍｌ：ｉｄ属性が「６」である伝送単位は、第３７行目から第３９行目までである。 Each transmission unit (unit element) corresponds to a fragment that is fragmented as a result of analysis by the time analysis unit (12 or 32). Each transmission unit has a presentation time (timecode attribute). The presentation time is a relative time with the program start time being zero, and is represented as a character string in the format of “hh: mm: ss.nnn” (hour-minute-second-millisecond). The presentation time of the @timecode attribute is the earliest start time value among the presentation start times of the caption text included in the transmission unit.
In this example, there are six transmission units (unit elements), each of which has a value from “1” to “6” as an xml: id attribute. The transmission unit having the xml: id attribute “1” is from the 16th line to the 19th line. The transmission unit having the xml: id attribute “2” is from the 20th line to the 24th line. The transmission unit having the xml: id attribute “3” is from the 25th line to the 28th line. The transmission unit having the xml: id attribute “4” is from the 28th line to the 32nd line. The transmission unit having the xml: id attribute “5” is from the 34th line to the 36th line. The transmission unit having the xml: id attribute “6” is from the 37th line to the 39th line.

各伝送単位の情報は、その伝送単位に含まれる字幕テキストの断片と、参照されるリソースとの関係を含んでいる。なお、字幕テキストの断片そのものもリソースの一種である。参照関係解析部（１４または３４）によって解析された結果、各伝送単位において必要とされるリソースの参照関係情報のみが、ｕｎｉｔ要素の中に含まれる。これにより、後で実際に断片化ファイルを生成する際に、余分な情報を参照したり解析したりする必要がなく、直接必要な情報のみを取り出しやすい。 The information of each transmission unit includes the relationship between the subtitle text fragment included in the transmission unit and the resource to be referred to. Note that subtitle text fragments themselves are also a type of resource. As a result of the analysis by the reference relationship analysis unit (14 or 34), only the reference relationship information of the resources required in each transmission unit is included in the unit element. As a result, when the fragmented file is actually generated later, it is not necessary to refer to or analyze extra information, and it is easy to extract only necessary information.

ｘｍｌ：ｉｄ属性が「１」である伝送単位は、２つのリソース（ｒｅｓｏｕｒｃｅ要素）を含んでいる。
その第１のリソースのデータタイプ（ｄａｔａｔｙｐｅ属性）は「００００」であり、これは字幕テキストそのものに対応している。このリソースは、非組込フォント（ｆｏｎｔ−ｆａｃｅ属性）、スタイル（ｓｔｙｌｅ属性）、字幕提示の領域（ｒｅｇｉｏｎ属性）、字幕テキスト文（ｓｕｂｔｉｔｌｅ属性）を有している。各属性の値は、参照のためのＩＤである。なお、ｓｕｂｔｉｔｌｅ属性の値は「Ｃ００１」である。なお、このリソース（ｒｅｓｏｕｒｃｅ要素）において、ｆｏｎｔ−ｆａｃｅ属性や、ｓｔｙｌｅ属性や、ｒｅｇｉｏｎ属性が、参照関係情報の例である。また、このリソースにおいて、ｓｕｂｔｉｔｌｅ属性が、断片化情報の例であり、「Ｃ００１」という値によってテキスト文の断片（グループ）を参照している。これらの参照関係情報や断片化情報は、以下のリソース（データタイプが「００００」）においても同様である。
また、第２のリソースのデータタイプは「０１１０」であり、これはリソースがフォントであることを表す。このリソースは、ｉｄｒｅｆ属性を有し、その値は「ｆ０５」である。これは、参照のために用いられるＩＤである。また、ｓｒｃｐａｔｈ属性は、リソースファイルのロケーション記述へのパス（ｉｄｒｅｆ属性を有する要素を起点としたリソースファイルのロケーション情報の記述へのＸＰＡＴＨ情報）を示す。また、ｓｒｃｖａｌｕｅ属性は、リソースファイルのロケーション情報を指定するものである。また、ｒｅｐｌａｅｔｏ属性は、放送として送出される際の放送の名前空間によるロケーション情報の値を示す。なお、このリソース（ｒｅｓｏｕｒｃｅ要素）において、ｓｒｃｐａｔｈ属性や、ｓｒｃｖａｌｕｅ属性や、ｒｅｐｌａｃｅｔｏ属性が、放送ロケーション変換情報の例である。放送ロケーション変換情報については、以下のリソース（データタイプが「００００」ではない）においても同様である。 A transmission unit having an xml: id attribute of “1” includes two resources (resource elements).
The data type (datatype attribute) of the first resource is “0000”, which corresponds to the caption text itself. This resource has a non-embedded font (font-face attribute), a style (style attribute), a subtitle presentation area (region attribute), and a subtitle text sentence (subtitle attribute). The value of each attribute is an ID for reference. The value of the subtitle attribute is “C001”. In this resource (resource element), the front-face attribute, the style attribute, and the region attribute are examples of the reference relationship information. Also, in this resource, the subtitle attribute is an example of fragmentation information, and a text sentence fragment (group) is referred to by the value “C001”. These reference relationship information and fragmentation information are the same in the following resources (data type is “0000”).
The data type of the second resource is “0110”, which represents that the resource is a font. This resource has an idref attribute, and its value is “f05”. This is an ID used for reference. The srcpath attribute indicates the path to the location description of the resource file (XPATH information to the description of the location information of the resource file starting from the element having the idref attribute). The srcvalue attribute specifies location information of the resource file. The replay attribute indicates the value of location information based on the name space of the broadcast when it is transmitted as a broadcast. In this resource (resource element), the srcpath attribute, the srcvalue attribute, and the replaceto attribute are examples of broadcast location conversion information. The same applies to the broadcast location conversion information in the following resources (data type is not “0000”).

ｘｍｌ：ｉｄ属性が「２」である伝送単位は、３つのリソース（ｒｅｓｏｕｒｃｅ要素）を含んでいる。
第１のリソースのデータタイプ（ｄａｔａｔｙｐｅ属性）は「００００」であり、字幕テキストそのものを示す。このリソースにおけるｆｏｎｔｏ−ｆａｃｅ属性は、「ｆ０３」および「ｆ０４」という２つのＩＤを示すものであり、これら両者を空白で連結したものを属性値としている。なお、ｓｕｂｔｉｔｌｅ属性の値は「Ｃ００２」である。
第２および第３のリソースのデータタイプは「０１１０」であり、これはフォントに対応する。フォントであるリソースの属性については、既に述べたとおりである。 A transmission unit having an xml: id attribute of “2” includes three resources (resource elements).
The data type (datatype attribute) of the first resource is “0000”, which indicates the caption text itself. The front-face attribute in this resource indicates two IDs “f03” and “f04”, and the attribute value is obtained by concatenating both of them with a blank. The value of the subtitle attribute is “C002”.
The data type of the second and third resources is “0110”, which corresponds to the font. The attribute of a resource that is a font has already been described.

ｘｍｌ：ｉｄ属性が「３」である伝送単位は、２つのリソース（ｒｅｓｏｕｒｃｅ要素）を含んでいる。
第１のリソースのデータタイプ（ｄａｔａｔｙｐｅ属性）は「００００」であり、字幕テキストそのものを示す。なお、ｓｕｂｔｉｔｌｅ属性の値は「Ｃ００３」である。
第２のリソースのデータタイプは「０００１」であり、これは画像に対応する。このリソースは、外部の画像に対応する。 A transmission unit having an xml: id attribute of “3” includes two resources (resource elements).
The data type (datatype attribute) of the first resource is “0000”, which indicates the caption text itself. The value of the subtitle attribute is “C003”.
The data type of the second resource is “0001”, which corresponds to an image. This resource corresponds to an external image.

ｘｍｌ：ｉｄ属性が「４」である伝送単位は、３つのリソース（ｒｅｓｏｕｒｃｅ要素）を含んでいる。
第１のリソースのデータタイプ（ｄａｔａｔｙｐｅ属性）は「００００」であり、字幕テキストそのものを示す。なお、ｓｕｂｔｉｔｌｅ属性の値は「Ｃ００４」および「Ｃ００５」（両者を空白で連結）である。
第２および第３のリソースおデータタイプは「０００１」であり、これは画像に対応する。これらのリソースは、それぞれ、外部の画像に対応する。 A transmission unit having an xml: id attribute of “4” includes three resources (resource elements).
The data type (datatype attribute) of the first resource is “0000”, which indicates the caption text itself. The value of the subtitle attribute is “C004” and “C005” (both are connected with a blank).
The second and third resource data types are “0001”, which corresponds to an image. Each of these resources corresponds to an external image.

ｘｍｌ：ｉｄ属性が「５」である伝送単位は、１つのリソース（ｒｅｓｏｕｒｃｅ要素）を含んでいる。そのリソースのデータタイプの値は「００００」である。また、このリソースは、埋め込み画像に関する情報を含むものであり、ｉｍａｇｅ属性として「ＳＭＰＴＥ＿ｌｏｇｏ１６」という値を有する。この「ＳＭＰＴＥ＿ｌｏｇｏ１６」は、埋め込み画像を参照するためのＩＤである。なお、このリソースのｓｕｂｔｉｔｌｅ属性の値は、「Ｃ００６」である。 A transmission unit having an xml: id attribute of “5” includes one resource (resource element). The value of the data type of the resource is “0000”. Further, this resource includes information related to an embedded image and has a value of “SMPTE_logo16” as an image attribute. This “SMPTE_logo16” is an ID for referring to an embedded image. Note that the value of the subtitle attribute of this resource is “C006”.

ｘｍｌ：ｉｄ属性が「６」である伝送単位は、1つのリソース（ｒｅｓｏｕｒｃｅ要素）を含んでいる。
第１のリソースのデータタイプの値は「００００」である。このリソースのｓｕｂｔｉｔｌｅ属性の値は、「Ｃ００7」である。 A transmission unit having an xml: id attribute of “6” includes one resource (resource element).
The value of the data type of the first resource is “0000”. The value of the subtitle attribute of this resource is “C007”.

第４５行目から第４７行目までは、埋め込み画像（ｓｍｐｔｅ：ｉｍａｇｅ要素）である。ｘｍｌ：ｉｄ属性はこの埋め込み画像のＩＤを示すものであり、その値は「ＳＭＰＴＥ＿ｌｏｇｏ１６」である。ｉｍａｇｅＴｙｐｅ属性は、画像ファイルの形式を表しており、その値は「ＰＮＧ」である。また、ｅｎｃｏｄｉｎｇ属性は、バイナリーデータを文字データとして表現する際の符号化の方式を表しており、その値は「ＢＡＳＥ６４」である。また、「ｉＶＢＯＲｗ・・・」という文字列が、画像そのものを表すものである。 The 45th to 47th lines are embedded images (smpte: image element). The xml: id attribute indicates the ID of this embedded image, and its value is “SMPTE_logo16”. The imageType attribute represents the format of the image file, and its value is “PNG”. The encoding attribute represents the encoding method used when expressing binary data as character data, and the value thereof is “BASE64”. The character string “iVBORw...” Represents the image itself.

ヘッダ部内における、メタデータ（ｍｅｔａｄａｔａ要素）の次の、第４９行目から第７４行目までは、スタイリング（ｓｔｙｌｉｎｇ要素）である。
本例におけるこのｓｔｙｌｉｎｇ要素は、５個のフォント（ａｒｉｂ−ｔｔ：ｆｏｎｔ−ｆａｃｅ要素）と、１個のスタイル（ｓｔｙｌｅ要素）とを持つ。
第５１行目から第６５行目までが、５個の非組込フォントの情報である。第１から第５までのフォントのｉｄ要素の値は、それぞれ、「ｆ０１」、「ｆ０２」、「ｆ０３」、「ｆ０４」、「ｆ０５」であり、これらは参照のためのＩＤである。
また、第６６行目から第７３行目までが１個のスタイルである。このスタイルのｘｍｌ：ｉｄ属性の値は「ｓ１」である。また、このスタイルは、色（ｔｔｓ：ｃｏｌｏｒ属性）と、フォント（ｔｔｓ：ｆｏｎｔＦａｍｉｌｙ属性）と、フォントサイズ（ｆｏｎｔＳｉｚｅ属性）と、テキスト位置揃えの調整（ｔｔｓ：ｔｅｘｔＡｌｉｇｎ属性）と、領域の背景色がいつ提示されるかの指定（ｔｔｓ：ｓｈｏｗＢａｃｋｇｒｏｕｎｄ）とを有する。 From the 49th line to the 74th line after the metadata (metadata element) in the header part is styling (styling element).
The styling element in this example has five fonts (arib-tt: font-face element) and one style (style element).
The information from the 51st line to the 65th line is information of five non-embedded fonts. The values of the id elements of the first to fifth fonts are “f01”, “f02”, “f03”, “f04”, and “f05”, respectively, and these are IDs for reference.
Further, the 66th to 73rd lines are one style. The value of the xml: id attribute of this style is “s1”. In addition, this style includes color (tts: color attribute), font (tts: fontFamily attribute), font size (fontSize attribute), text alignment adjustment (tts: textAlign attribute), and area background color. It has a designation (tts: showBackground) when it is presented.

ヘッダ部内における、スタイリング（ｓｔｙｌｉｎｇ要素）の次の、第７５行目から第８４行目までは、レイアウト（ｌａｙｏｕｔ要素）である。このレイアウトは、領域（ｒｅｇｉｏｎ要素）を含む。本例において、第７６行目から第８３行目までが、ひとつの領域を表すものである。この領域のｘｍｌ：ｉｄ属性の値は「ａｌｌ」である。つまり、この領域は、ＩＤ「ａｌｌ」を用いて参照される。また、この領域は、スタイル（ｓｔｙｌｅ属性）と、その領域の原点（ｔｔｓ：ｏｒｉｇｉｎ属性）と、その領域の最大座標点（ｔｔｓ：ｅｘｔｅｎｔ属性）と、縦方向および横方向のパディングサイズ（ｔｔｓ：ｐａｄｄｉｎｇ属性）と、提示する位置揃えの指定（ｔｔｓ：ｄｉｓｐｌａｙＡｌｉｇｎ属性）と、領域の背景色がいつ提示されるかの指定（ｔｔｓ：ｓｈｏｗＢａｃｋｇｒｏｕｎｄ）とを有する。
なお、この領域における第７７行目で指定しているスタイルのＩＤは「ｓ１」である。これは、即ち、第６６行目から始まるスタイルを参照している。 The 75th to 84th lines after the styling (styling element) in the header part are layouts (layout elements). This layout includes a region (region element). In this example, the 76th to 83rd lines represent one area. The value of the xml: id attribute of this area is “all”. That is, this area is referred to using the ID “all”. In addition, this area includes a style (style attribute), an origin of the area (tts: origin attribute), a maximum coordinate point of the area (tts: extent attribute), and padding sizes in the vertical and horizontal directions (tts: a padding attribute), a designation of alignment to be presented (tts: displayAlign attribute), and a designation of when the background color of the region is presented (tts: showBackground).
The ID of the style specified in the 77th line in this area is “s1”. This refers to the style starting on line 66.

ヘッダ部の説明は以上である。次にボディ部について説明する。 This is the end of the description of the header part. Next, the body part will be described.

ボディ部は、領域（ｒｅｇｉｏｎ属性）を指定した１個のｄｉｖ要素を有する。このｄｉｖ要素は、第８７行目から第１０３行目に記述されている。 The body part has one div element designating a region (region attribute). This div element is described from the 87th line to the 103rd line.

上記のｄｉｖ要素（ｂｏｄｙ要素の直下のｄｉｖ要素）は、その下のレベルに、７個の要素を有する。
第１の要素は、第８８行目に記述されているパラグラフ（段落、ｐ要素）であり、そのｘｍｌ：ｉｄ属性の値は「Ｃ００１」である。
第２の要素は、第８９行目に記述されているパラグラフ（ｐ要素）であり、そのｘｍｌ：ｉｄ属性の値は「Ｃ００２」である。
第３の要素は、第９０行目から第９２行目に記述されているｄｉｖ要素であり、そのｘｍｌ：ｉｄ属性の値は「Ｃ００３」である。
第４の要素は、第９３行目から第９５行目に記述されているｄｉｖ要素であり、そのｘｍｌ：ｉｄ属性の値は「Ｃ００４」である。
第５の要素は、第９６行目から第９８行目に記述されているｄｉｖ要素であり、そのｘｍｌ：ｉｄ属性の値は「Ｃ００５」である。
第６の要素は、第９９行目から第１０１行目に記述されているｄｉｖ要素であり、そのｘｍｌ：ｉｄ属性の値は「Ｃ００６」である。
第７の要素は、第１０２行目に記述されているパラグラフ（ｐ要素）であり、そのｘｍｌ：ｉｄ属性の値は「Ｃ００７」である。 The div element (div element immediately below the body element) has seven elements at the lower level.
The first element is a paragraph (paragraph, p element) described in the 88th line, and the value of its xml: id attribute is “C001”.
The second element is a paragraph (p element) described in the 89th line, and the value of its xml: id attribute is “C002”.
The third element is a div element described from the 90th line to the 92nd line, and the value of the xml: id attribute is “C003”.
The fourth element is a div element described from the 93rd line to the 95th line, and the value of the xml: id attribute is “C004”.
The fifth element is a div element described from the 96th line to the 98th line, and the value of its xml: id attribute is “C005”.
The sixth element is a div element described from the 99th line to the 101st line, and the value of its xml: id attribute is “C006”.
The seventh element is a paragraph (p element) described in the 102nd line, and the value of its xml: id attribute is “C007”.

以上説明したように、分割部１５（第１実施形態）または分割部３５（第２実施形態）が受け取るデータは、予め解析された結果として、内部で論理的に伝送単位の断片に分けられているデータである。また、同データは、各断片から参照されるデータとの関係を、情報としてすぐに取り出せる形で含んでいる。よって、断片化テキスト文書データ、および断片化テキスト文書データとリソースファイルのデータを含む断片化字幕データ８５を素早く生成し、リアルタイムな放送信号の伝送に間に合うように出力することができるようになる。 As described above, the data received by the dividing unit 15 (first embodiment) or the dividing unit 35 (second embodiment) is logically divided into transmission unit fragments internally as a result of analysis in advance. Data. The data also includes the relationship with the data referenced from each fragment in a form that can be readily extracted as information. Therefore, the fragmented text document data and the fragmented caption data 85 including the fragmented text document data and the resource file data can be quickly generated and output in time for transmission of the real-time broadcast signal.

図１６は、断片化字幕データ８５に含まれる断片化テキスト文書データの例を示す概略図である。第１実施形態においては、断片化テキスト文書データを含む断片化字幕データ８５は、分割装置１の出力部１７から出力される。第２実施形態においては、断片化テキスト文書データを含む断片化字幕データ８５は、分割装置２の出力部３７から出力される。なお、断片化テキスト文書データもまた、ＴＴＭＬ文書データであり、一種のＸＭＬデータである。同図において、便宜上、ファイル内の行番号を示している。これらの行番号自体は、ファイル内に含まれているものではない。以下では、これらの図および行番号を参照しながら、断片化テキスト文書データの例について説明する。 FIG. 16 is a schematic diagram illustrating an example of fragmented text document data included in the fragmented subtitle data 85. In the first embodiment, fragmented subtitle data 85 including fragmented text document data is output from the output unit 17 of the dividing device 1. In the second embodiment, fragmented subtitle data 85 including fragmented text document data is output from the output unit 37 of the dividing device 2. The fragmented text document data is also TTML document data, which is a kind of XML data. In the figure, for convenience, line numbers in the file are shown. These line numbers themselves are not included in the file. Hereinafter, an example of fragmented text document data will be described with reference to these drawings and line numbers.

同図に示す断片化テキスト文書データは、図１０の第１６行目から第１９行目において記述されているｕｎｉｔ要素（ｘｍｌ：ｉｄ属性は「１」）の内容と、それに対応する図１４の第８８行目に記述されているｐ要素とに基づく。これらの両者は、ＩＤ「Ｃ００１」によって相互に関連付いている。断片化テキスト文書データは、このように、分割装置１（第１実施形態の場合）あるいは解析装置５（第２実施形態の場合）による解析結果の情報に基づいて生成されるものである。 The fragmented text document data shown in the figure includes the contents of the unit element (xml: id attribute is “1”) described in the 16th to 19th lines of FIG. 10 and the corresponding FIG. Based on the p element described in the 88th line. Both of these are related to each other by the ID “C001”. As described above, the fragmented text document data is generated based on the analysis result information by the dividing device 1 (in the case of the first embodiment) or the analyzing device 5 (in the case of the second embodiment).

図１６において、第２行目から第２２行目までが、ｔｔ要素である。
そして、第３行目から第１４行目までは、ヘッダ部（ｈｅａｄ要素）である。
また、第１５行目から第２１行目までは、ボディ部（ｂｏｄｙ要素）である。 In FIG. 16, the 2nd to 22nd lines are tt elements.
The third to the 14th lines are header parts (head elements).
The 15th to 21st lines are body parts (body elements).

ヘッダ部内において、第５行目から第１０行目までは、スタイリング（ｓｔｙｌｉｎｇ要素）である。また、第１１行目から第１３行目までは、レイアウト（ｌａｙｏｕｔ要素）である。
上記のスタイリングには、フォント（ａｒｉｂ−ｔｔ：ｆｏｎｔ−ｆａｃｅ要素）と、スタイル（ｓｔｙｌｅ要素）とが含まれる。 In the header part, the 5th to 10th lines are styling (styling elements). The 11th to 13th lines are layouts (layout elements).
The styling includes a font (arib-tt: font-face element) and a style (style element).

図１６内のこのフォント（ａｒｉｂ−ｔｔ：ｆｏｎｔ−ｆａｃｅ要素）におけるｉｄ属性の値は「ｆ０５」である。これは、図１０における第１７行目のリソース（データタイプは「００００」）が、「ｆ０５」というＩＤを用いて、図１０における第１８行目のリソース（フォントのリソース）を参照していることに対応する。また、図１６内のフォントにおけるｆｏｎｔ−ｆａｍｉｌｙ属性の値は「ＦＡ丸ゴシックＭ」である。これは、図１２における第６３行目におけるｆｏｎｔ−ｆａｍｉｌｙの定義に対応している。また、図１６内のフォントにおけるｕｎｉｃｏｄｅ−ｒａｎｇｅ属性の値は「Ｕ＋Ｆ００２−Ｆ００３」である。これは、図１２における第６３行目におけるｕｎｉｃｏｄｅ−ｒａｎｇｅの定義に対応している。
また、図１６の第７行目のａｒｉｂ−ｔｔ：ｓｒｃ要素は、ｕｒｌ属性を有している。このｕｒｌ属性の値は、「ｓｕｂｔ：／／１」であり、放送名前空間におけるフォントの所在を示している。これは、図１０の第１８行目の定義におけるｒｅｐｌａｃｅｔｏ属性にしたがって置き換えられた後の名前である。 The value of the id attribute in this font (arib-tt: font-face element) in FIG. 16 is “f05”. This is because the resource (data type is “0000”) on the 17th line in FIG. 10 refers to the resource (font resource) on the 18th line in FIG. 10 using the ID “f05”. Corresponding to that. Also, the value of the font-family attribute in the font in FIG. 16 is “FA Maru Gothic M”. This corresponds to the definition of font-family on the 63rd line in FIG. Also, the value of the Unicode-range attribute in the font in FIG. 16 is “U + F002-F003”. This corresponds to the definition of Unicode-range in the 63rd line in FIG.
Also, the arib-tt: src element on the seventh line in FIG. 16 has a url attribute. The value of this url attribute is “sub: /// 1”, which indicates the location of the font in the broadcast name space. This is the name after being replaced according to the replaceto attribute in the definition on the 18th line in FIG.

図１６内の上記のスタイル（第９行目、ｓｔｙｌｅ要素）では、ｘｍｌ：ｉｄ属性の値は「ｓ１」である。これは、図１０の第１７行目における、ｓｔｙｌｅ属性の値「ｓ１」に対応するものである。また、このスタイルは、ＩＤ「ｓ１」によって図１３の第６６行目から第７３行目に記述されているスタイルに関連付けられている。したがって、図１６の第９行目におけるスタイル（ｓｔｙｌｅ要素）の属性は、図１３の第６６行目から記述されているスタイルを引き継いでいる。即ち、具体的には、ｔｔｓ：ｃｏｌｏｒ属性の値が「ｗｈｉｔｅ」であり、ｔｔｓ：ｆｏｎｔ−ｆａｍｉｌｙ属性の値が「ＦＡ丸ゴシックＭ」であり、ｔｔｓ：ｆｏｎｔ−ｓｉｚｅの値が「８０ｐｘ」（８０ピクセル）であり、ｔｔｓ：ｔｅｘｔＡｌｉｇｎ属性の値が「ｌｅｆｔ」（左揃え）であり、ｔｔｓ：ｓｈｏｗＢａｃｋｇｒｏｕｎｄ属性の値が「ｗｈｅｎＡｃｔｉｖｅ」（アクティブなとき）である。 In the above-described style in FIG. 16 (9th line, style element), the value of the xml: id attribute is “s1”. This corresponds to the value “s1” of the style attribute in the 17th line in FIG. This style is associated with the style described in the 66th to 73rd lines in FIG. 13 by the ID “s1”. Accordingly, the style (style element) attribute in the ninth line in FIG. 16 inherits the style described from the 66th line in FIG. Specifically, the value of the tts: color attribute is “white”, the value of the tts: font-family attribute is “FA Maru Gothic M”, and the value of tss: font-size is “80 px” ( 80 pixels), the value of the tts: textAlign attribute is “left” (left-aligned), and the value of the tts: showBackground attribute is “whenActive” (when active).

図１６の第１１行目から第１３行目までのレイアウト（ｌａｙｏｕｔ要素）は、直下のレベルに領域（ｒｅｇｉｏｎ要素）を含む。この領域は、図１６の第１２行目に記述されている。この領域のｘｍｌ：ｉｄ属性の値は「ａｌｌ」である。これは、図１０の第１７目においてｒｅｇｉｏｎ属性の値としてＩＤ「ａｌｌ」が指定されていることに対応する。図１６の第１２行目に記述されている領域（ｒｅｇｉｏｎ要素）の属性は、図１３の第７６行目から第８３行目に記述されている属性を引き継いでいるものである。即ち、具体的には、ｓｔｙｌｅ属性の値は「ａｌｌ」である。また、ｔｔｓ：ｏｒｉｇｉｎ属性（領域の開始点のｘ−ｙ座標値（百分率））は「０％０％」である。また、ｔｔｓ：ｅｘｔｅｎｔ属性（領域の終点のｘ−ｙ座標値（百分率））は「１００％１００％」である。また、ｔｔｓ：ｐａｄｄｉｎｇ属性（領域内における表示位置のための外周隙間のｘ方向およびｙ方向）は「０ｐｘ０ｐｘ」（縦横共に０ピクセル）である。また、ｔｔｓ：ｔｅｘｔＡｌｉｇｎ属性の値は「ｌｅｆｔ」（左揃え）である。また、ｔｔｓ：ｓｈｏｗＢａｃｋｇｒｏｕｎｄ属性の値は「ｗｈｅｎＡｃｔｉｖｅ」（アクティブなとき）である。 The layout (layout element) from the 11th line to the 13th line in FIG. 16 includes a region (region element) at a level immediately below. This area is described in the twelfth line of FIG. The value of the xml: id attribute of this area is “all”. This corresponds to the fact that the ID “all” is designated as the value of the region attribute in the 17th item of FIG. 10. The attributes of the region (region element) described in the 12th line of FIG. 16 are inherited from the attributes described in the 76th to 83rd lines of FIG. Specifically, the value of the style attribute is “all”. Also, the tts: origin attribute (the xy coordinate value (percentage) of the start point of the area) is “0% 0%”. Also, the tts: extent attribute (the xy coordinate value (percentage) of the end point of the area) is “100% 100%”. In addition, the tts: padding attribute (the x direction and the y direction of the outer peripheral gap for the display position in the region) is “0 px 0 px” (both vertically and horizontally 0 pixels). The value of the tts: textAlign attribute is “left” (left alignment). Also, the value of the tts: showBackground attribute is “whenActive” (when active).

図１６に示すボディ部（ｂｏｄｙ要素）には、ｄｉｖ要素が含まれており、その直下のレベルにｐ要素が含まれている。
図１６の第１７行目に記述されている、ｐ要素のｘｍｌ：ｉｄ属性の値は「Ｃ００１」である。これは、図１０の第１７行目のｓｕｂｔｉｔｌｅ属性の値「Ｃ００１」に対応している。
また、図１６の第１7行目に記述されている通り、ｐ要素のｒｅｇｉｏｎ属性の値は「ａｌｌ」である。これは、図１０の第１７行目のｒｅｓｏｕｒｃｅ要素におけるｒｅｇｉｏｎ属性の値を引き継いでいる。
また、図１６の第１７行目に記述されているｐ要素の全体を、図１４の第８８行目に記述されているｐ要素から引き継いでいる。 The body part (body element) shown in FIG. 16 includes a div element, and a p element is included at a level immediately below the div element.
The value of the xml: id attribute of the p element described in the 17th line in FIG. 16 is “C001”. This corresponds to the value “C001” of the subtitle attribute on the 17th line in FIG. 10.
Also, as described in the 17th line in FIG. 16, the value of the region attribute of the p element is “all”. This takes over the value of the region attribute in the resource element on the 17th line in FIG.
Further, the entire p element described in the 17th line in FIG. 16 is inherited from the p element described in the 88th line in FIG.

以上のように分割部１５（第１実施形態）または分割部３５（第２実施形態）は、簡単な処理で素早く断片化テキスト、および断片化テキストを含む断片化字幕データを生成し、送出することができる。 As described above, the dividing unit 15 (first embodiment) or the dividing unit 35 (second embodiment) quickly generates and transmits fragmented text and fragmented subtitle data including the fragmented text by simple processing. be able to.

図１７は、パッケージ化した字幕データの構造の例を示す図である。第１実施形態においては、分割装置１の出力部１７がこの字幕データを出力する。また、第２実施形態においては、分割装置２の出力部３７がこの字幕データを出力する。同図に示す例は、ＭＭＴによる伝送を行う場合のものである。図示する１つのＭＰＵ（メディアプロセッシングユニット，Media Processing Unit）が、１つの断片に相当する。ＭＰＵは、複数のＭＦＵ（メディアフラグメントユニット，Media Fragment Unit）を含む。ＭＰＵ中の１つのＭＦＵは、ＴＴＭＬ文書を格納している。そのＭＦＵは、ヘッダとＴＴＭＬ文書そのものを含んで構成される。他のＭＦＵは、ＴＴＭＬ文書から参照されるリソースを格納している。同図に示すＭＰＵは、参照リソース１，２，・・・，ｎを含む。参照リソースは、画像や非組込フォントなどである。これらの各ＭＦＵもまた、ヘッダと参照リソースそのものを含んで構成される。このように分割装置１（第１実施形態）や分割装置２（第２実施形態）は、字幕の断片と、関連する参照リソースとを、パッケージとして送出する。 FIG. 17 is a diagram illustrating an example of the structure of packaged caption data. In the first embodiment, the output unit 17 of the dividing device 1 outputs the caption data. In the second embodiment, the output unit 37 of the dividing device 2 outputs the caption data. The example shown in the figure is for transmission using MMT. One MPU (Media Processing Unit) shown in the figure corresponds to one fragment. The MPU includes a plurality of MFUs (Media Fragment Units). One MFU in the MPU stores a TTML document. The MFU includes a header and the TTML document itself. Other MFUs store resources referenced from TTML documents. The MPU shown in the figure includes reference resources 1, 2,. Reference resources include images and non-embedded fonts. Each of these MFUs is also configured to include a header and the reference resource itself. As described above, the dividing device 1 (first embodiment) and the dividing device 2 (second embodiment) transmit the subtitle fragments and the related reference resources as a package.

なお、上述した実施形態における解析装置や分析装置の機能をコンピューターで実現するようにしても良い。その場合、これらの機能を実現するためのプログラムをコンピューター読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをコンピューターシステムに読み込ませ、実行することによって実現しても良い。なお、ここでいう「コンピューターシステム」とは、ＯＳや周辺機器等のハードウェアを含むものとする。また、「コンピューター読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ−ＲＯＭ等の可搬媒体、コンピューターシステムに内蔵されるハードディスク等の記憶装置のことをいう。さらに「コンピューター読み取り可能な記録媒体」とは、インターネット等のネットワークや電話回線等の通信回線を介してプログラムを送信する場合の通信線のように、短時間の間、動的にプログラムを保持するもの、その場合のサーバーやクライアントとなるコンピューターシステム内部の揮発性メモリのように、一定時間プログラムを保持しているものも含んでも良い。また上記プログラムは、前述した機能の一部を実現するためのものであっても良く、さらに前述した機能をコンピューターシステムにすでに記録されているプログラムとの組み合わせで実現できるものであっても良い。 In addition, you may make it implement | achieve the function of the analyzer and analyzer in embodiment mentioned above with a computer. In that case, the program for realizing these functions may be recorded on a computer-readable recording medium, and the program recorded on the recording medium may be read into a computer system and executed. Here, the “computer system” includes an OS and hardware such as peripheral devices. The “computer-readable recording medium” refers to a storage device such as a flexible disk, a magneto-optical disk, a portable medium such as a ROM and a CD-ROM, and a hard disk incorporated in a computer system. Furthermore, a “computer-readable recording medium” dynamically holds a program for a short time, like a communication line when transmitting a program via a network such as the Internet or a communication line such as a telephone line. In this case, a volatile memory inside a computer system serving as a server or a client in that case may be included, and a program that holds a program for a certain time. The program may be a program for realizing a part of the above-described functions, or may be a program that can realize the above-described functions in combination with a program already recorded in a computer system.

以上、複数の実施形態を説明したが、本発明はさらに次のような変形例でも実施することが可能である。
例えば、放送だけでなく、ビデオオンデマンドのサービスにおいて利用者からの要求に応じて特定のコンテンツを通信等で配信する場合に、本発明を適用しても良い。これにより、一時に大量の字幕テキストを送信するためにまとまった時間を必要とすることを、解消することができる。 Although a plurality of embodiments have been described above, the present invention can also be implemented in the following modifications.
For example, the present invention may be applied not only to broadcasting but also to distributing specific content by communication or the like in response to a request from a user in a video on demand service. As a result, it is possible to eliminate the need for a large amount of time to transmit a large amount of subtitle text at a time.

以上、この発明の実施形態について図面を参照して詳述してきたが、具体的な構成はこの実施形態に限られるものではなく、この発明の要旨を逸脱しない範囲の設計等も含まれる。 The embodiment of the present invention has been described in detail with reference to the drawings. However, the specific configuration is not limited to this embodiment, and includes designs and the like that do not depart from the gist of the present invention.

本発明は、例えば放送事業やビデオオンデマンドサービス事業など、映像コンテンツを提供するしくみの一部などとして利用可能である。 The present invention can be used as a part of a mechanism for providing video content such as a broadcasting business and a video-on-demand service business.

１，２分割装置（送出装置）
５解析装置
１１取得部
１２時刻解析部
１３変換情報解析部
１４参照関係解析部
１５分割部
１７出力部
１８リソースファイルデータ取得部
３１取得部
３２時刻解析部
３３変換情報解析部
３４参照関係解析部
３５分割部
３６付加部（送出情報生成部）
３７出力部
３８リソースファイルデータ取得部 1, 2 Split device (sending device)
5 Analysis Device 11 Acquisition Unit 12 Time Analysis Unit 13 Conversion Information Analysis Unit 14 Reference Relationship Analysis Unit 15 Division Unit 17 Output Unit 18 Resource File Data Acquisition Unit 31 Acquisition Unit 32 Time Analysis Unit 33 Conversion Information Analysis Unit 34 Reference Relationship Analysis Unit 35 Dividing unit 36 adding unit (transmission information generating unit)
37 Output unit 38 Resource file data acquisition unit

Claims

時刻情報が付加された複数のテキスト文を含むテキスト文書データを取得する取得部と、
前記時刻情報に基づいて前記テキスト文書データを、前記テキスト文を含む複数のグループに断片化するための断片化情報を生成する時刻解析部と、
前記断片化された前記テキスト文のグループである断片ごとに、前記断片から参照される前記テキスト文書のヘッダ記述の情報を解析し、前記断片と前記断片から参照される前記ヘッダ記述との関係を表す参照関係情報を生成する参照関係解析部と、
前記断片化情報と前記参照関係情報とを含んだ断片化テキスト文書送出情報を生成する送出情報生成部と、
を具備することを特徴とする解析装置。 An acquisition unit for acquiring text document data including a plurality of text sentences to which time information is added;
A time analysis unit that generates fragmentation information for fragmenting the text document data into a plurality of groups including the text sentence based on the time information;
For each fragment that is a group of the fragmented text sentence, the header description information of the text document referenced from the fragment is analyzed, and the relationship between the fragment and the header description referenced from the fragment is determined. A reference relationship analysis unit that generates reference relationship information to represent,
A transmission information generation unit for generating fragmented text document transmission information including the fragmentation information and the reference relation information;
An analysis apparatus comprising:

前記断片を放送により伝送する際の、前記断片に含まれる前記テキスト文から参照される画像ファイルや音声ファイルや非組込フォントファイルのロケーション情報と、前記画像ファイルや前記音声ファイルや前記非組込フォントファイルの前記ロケーション情報が前記テキスト文書データのどの部分に記述されているかを示すロケーション情報記述位置指定情報と、前記画像ファイルや前記音声ファイルや前記非組込フォントファイルを前記断片と共に放送により伝送する際の放送信号中のリソースの取得位置を特定するための放送の名前空間による放送ロケーション情報と、を含んだ放送ロケーション変換情報を生成する変換情報解析部、
をさらに具備し、
前記送出情報生成部は、前記放送ロケーション変換情報をも含んだ断片化テキスト文書送出情報を生成する、
ことを特徴とする請求項１に記載の解析装置。 Location information of an image file, an audio file, or a non-embedded font file referenced from the text sentence included in the fragment, and the image file, the audio file, or the non-embedded file when the fragment is transmitted by broadcasting The location information description position designation information indicating in which part of the text document data the location information of the font file is described, and the image file, the audio file, and the non-embedded font file are transmitted together with the fragment by broadcasting. A conversion information analysis unit for generating broadcast location conversion information including broadcast location information based on a broadcast name space for specifying a resource acquisition position in a broadcast signal when
Further comprising
The transmission information generation unit generates fragmented text document transmission information including the broadcast location conversion information;
The analysis apparatus according to claim 1, wherein:

前記送出情報生成部は、前記取得部によって取得された前記テキスト文書データに前記断片化情報と前記参照関係情報とを含んだ前記断片化テキスト文書送出情報を付加して、情報付加済テキスト文書データとして出力する、
ことを特徴とする請求項１に記載の解析装置。 The transmission information generation unit adds the fragmented text document transmission information including the fragmentation information and the reference relation information to the text document data acquired by the acquisition unit, and adds information-added text document data Output as
The analysis apparatus according to claim 1, wherein:

前記送出情報生成部は、前記取得部によって取得された前記テキスト文書データに前記断片化情報と前記参照関係情報と前記放送ロケーション変換情報とを含んだ前記断片化テキスト文書送出情報を付加して、情報付加済テキスト文書データとして出力する、
ことを特徴とする請求項２に記載の解析装置。 The transmission information generation unit adds the fragmented text document transmission information including the fragmentation information, the reference relation information, and the broadcast location conversion information to the text document data acquired by the acquisition unit, Output as text document data with information added,
The analysis apparatus according to claim 2, wherein:

前記断片化情報に含まれる個々の断片に関する情報は、当該断片に含まれる前記テキスト文のグループを特定するための、
（１）前記断片に含まれる、前記テキスト文に付加されていた前記テキスト文を識別するＩＤのリスト、
（２）前記断片に含まれる前記テキスト文のうち一番時間順が早い前記テキスト文に付加されていた開始時刻の情報、
（３）前記断片に含まれる前記テキスト文のうち一番時間順が早い前記テキスト文に付加されていた開始時刻および一番時間順が遅い前記テキスト文に付加されていた終了時刻の情報、
の少なくともいずれかを含むものであり、
前記参照関係情報は、前記断片の提示に必要な前記テキスト文書のヘッダ記述として、非組込フォントの情報と、埋め込み画像の情報、テキストのスタイルの情報と、テキスト提示の領域の情報との、少なくともいずれかを含むものである、
ことを特徴とする請求項１から４までのいずれか一項に記載の解析装置。 Information on each fragment included in the fragmentation information is used to specify the group of text sentences included in the fragment.
(1) A list of IDs for identifying the text sentence included in the fragment and attached to the text sentence;
(2) Information of the start time added to the text sentence having the earliest time order among the text sentences included in the fragment;
(3) Information on a start time added to the text sentence with the earliest time order among the text sentences included in the fragment and an end time added to the text sentence with the latest time order;
Including at least one of
The reference relationship information includes, as a header description of the text document necessary for presenting the fragment, information on a non-embedded font, information on an embedded image, information on a text style, and information on a text presentation area. Including at least one of the following:
The analysis apparatus according to any one of claims 1 to 4, wherein

時刻情報が付加された複数のテキスト文を含むテキスト文書データに加え、前記時刻情報に基づいて前記テキスト文書データを前記テキスト文の複数のグループに断片化するための断片化情報と、前記断片化された前記テキスト文のグループである断片ごとに、前記断片から参照される前記テキスト文書のヘッダ記述との関係を表す参照関係情報とを含んだ断片化テキスト文書送出情報を読み込み、前記断片化情報に基づいて前記テキスト文書データを前記テキスト文の複数のグループに分割するとともに、前記参照関係情報に基づいて、分割された断片である前記テキスト文のグループに、前記断片から参照される前記テキスト文書のヘッダ記述の情報を付加する分割部と、
前記分割部によって分割された前記テキスト文の断片から参照されるリソースファイルを取得するリソースファイルデータ取得部と、
前記分割部によって分割された前記テキスト文と、前記リソースファイルデータ取得部によって取得された前記リソースファイルとを含むデータを出力する出力部と、
を具備することを特徴とする分割装置。 In addition to text document data including a plurality of text sentences to which time information is added, fragmentation information for fragmenting the text document data into a plurality of groups of the text sentences based on the time information, and the fragmentation For each fragment that is a group of the text sentence, the fragmented text document transmission information including reference relationship information that represents the relationship with the header description of the text document referenced from the fragment is read, and the fragmentation information is read The text document data is divided into a plurality of groups of the text sentence based on the text document, and the text document referred to from the fragment into the group of the text sentence which is a divided fragment based on the reference relation information A division part for adding information of the header description of
A resource file data acquisition unit for acquiring a resource file referenced from a fragment of the text sentence divided by the division unit;
An output unit that outputs data including the text sentence divided by the dividing unit and the resource file acquired by the resource file data acquisition unit;
A dividing apparatus comprising:

前記分割部は、前記断片を放送により伝送する際の、前記断片に含まれる前記テキスト文から参照される画像ファイルや音声ファイルや非組込フォントファイルのロケーション情報と、前記画像ファイルや前記音声ファイルや前記非組込フォントファイルの前記ロケーション情報が前記テキスト文書データのどの部分に記述されているかを示すロケーション情報記述位置指定情報と、前記画像ファイルや前記音声ファイルや前記非組込フォントファイルを前記断片と共に放送により伝送する際の放送信号中のリソースの取得位置を特定するための放送の名前空間による放送ロケーション情報と、を含んだ放送ロケーション変換情報を更に含む、前記断片化テキスト文書送出情報を読み込み、前記放送ロケーション変換情報に基づいて、前記断片に含まれる前記画像ファイルや前記音声ファイルや前記非組込フォントファイルのロケーション情報を、放送の名前空間によるロケーション情報に書き換えて前記断片に分割する、
ことを特徴とする請求項６に記載の分割装置。 The division unit includes location information of an image file, an audio file, or a non-embedded font file referred to from the text sentence included in the fragment, and the image file or the audio file when the fragment is transmitted by broadcasting. And location information description position designation information indicating in which part of the text document data the location information of the non-embedded font file, the image file, the audio file, and the non-embedded font file are The fragmented text document transmission information further including broadcast location conversion information including broadcast location information according to a broadcast name space for specifying a resource acquisition position in a broadcast signal when transmitted by broadcast together with the fragment. Read, based on the broadcast location conversion information, Wherein the image file and the location information of the audio files and the non-embedded font file is divided into the fragment rewrite the location information Namespaced broadcasting contained in,
The dividing apparatus according to claim 6.

前記分割部は、時刻情報が付加されたテキストを含むテキスト文書データに、前記断片化テキスト文書送出情報が付加されている情報付加済テキスト文書データを読み込み、前記断片化テキスト文書送出情報に含まれる前記断片化情報に基づいて前記テキスト文書データを、テキスト文の複数のグループに分割するとともに、前記参照関係情報に基づいて分割された断片である前記テキスト文のグループに、前記断片から参照される前記テキスト文書のヘッダ記述の情報を付加し、
また、前記分割部は、前記断片化テキスト文書情報に前記放送ロケーション変換情報が含まれる場合は、前記放送ロケーション変換情報に基づいて、前記断片に含まれる前記リソースファイルのロケーション情報を、放送の名前空間によるロケーション情報に書き換える、
ことを特徴とする請求項６または７のいずれか一項に記載の分割装置。 The dividing unit reads information-added text document data to which the fragmented text document transmission information is added to text document data including text to which time information is added, and is included in the fragmented text document transmission information. The text document data is divided into a plurality of groups of text sentences based on the fragmentation information, and is referenced from the fragments to the group of text sentences, which is a fragment divided based on the reference relation information. Add header description information of the text document,
Further, when the fragmented text document information includes the broadcast location conversion information, the dividing unit converts the location information of the resource file included in the fragment based on the broadcast location conversion information to a broadcast name. Rewrite location information by space,
The dividing apparatus according to claim 6, wherein the dividing apparatus is characterized in that

時刻情報が付加された複数のテキスト文を含むテキスト文書データを取得する取得部と、
前記時刻情報に基づいて前記テキスト文書データを、前記テキスト文を含む複数のグループに断片化するための断片化情報を生成する時刻解析部と、
前記断片化された前記テキスト文のグループである断片ごとに、前記断片から参照される前記テキスト文書のヘッダ記述の情報を解析し、前記断片と前記断片から参照される前記ヘッダ記述との関係を表す参照関係情報を生成する参照関係解析部と、
前記テキスト文書データに加え、前記断片化情報と前記参照関係情報とを含んだ断片化テキスト文書送出情報を読み込み、前記断片化情報に基づいて前記テキスト文書データを前記テキスト文の複数のグループに分割するとともに、前記参照関係情報に基づいて分割された断片である前記テキスト文のグループに前記断片から参照される前記テキスト文書のヘッダ記述の情報を付加する分割部と、
前記分割部によって分割された前記テキスト文の断片から参照されるリソースファイルを取得するリソースファイルデータ取得部と、
前記分割部によって分割された前記テキスト文と、前記リソースファイルデータ取得部によって取得されたリソースファイルとを含むデータを出力する出力部と、
を具備することを特徴とする分割装置。 An acquisition unit for acquiring text document data including a plurality of text sentences to which time information is added;
A time analysis unit that generates fragmentation information for fragmenting the text document data into a plurality of groups including the text sentence based on the time information;
For each fragment that is a group of the fragmented text sentence, the header description information of the text document referenced from the fragment is analyzed, and the relationship between the fragment and the header description referenced from the fragment is determined. A reference relationship analysis unit that generates reference relationship information to represent,
Read fragmented text document transmission information including the fragmentation information and the reference relation information in addition to the text document data, and divide the text document data into a plurality of groups of the text sentences based on the fragmentation information And a dividing unit for adding header description information of the text document referenced from the fragment to the group of text sentences that is a fragment divided based on the reference relationship information;
A resource file data acquisition unit for acquiring a resource file referenced from a fragment of the text sentence divided by the division unit;
An output unit that outputs data including the text sentence divided by the dividing unit and the resource file acquired by the resource file data acquisition unit;
A dividing apparatus comprising:

前記出力部は、前記断片に含まれる前記テキスト文に付加された前記提示時刻情報のうち、一番早い提示開始時刻にしたがって、分割された前記テキスト文と、前記リソースファイルとを含むデータを順次出力する、
ことを特徴とする請求項６から９までのいずれか一項に記載の分割装置。 The output unit sequentially includes data including the divided text sentence and the resource file according to the earliest presentation start time among the presentation time information added to the text sentence included in the fragment. Output,
The dividing device according to claim 6, wherein the dividing device is characterized in that:

前記断片化情報に含まれる個々の断片に関する情報は、当該断片に含まれる前記テキスト文のグループを特定するための、
（１）前記断片に含まれる、前記テキスト文に付加されていた前記テキスト文を識別するＩＤのリスト、
（２）前記断片に含まれる前記テキスト文のうち一番時間順が早い前記テキスト文に付加されていた開始時刻の情報、
（３）前記断片に含まれる前記テキスト文のうち一番時間順が早い前記テキスト文に付加されていた開始時刻および一番時間順が遅い前記テキスト文に付加されていた終了時刻の情報、
の少なくともいずれかを含むものであり、
前記参照関係情報は、前記断片の提示に必要な前記テキスト文書のヘッダ記述として、非組込フォントの情報と、埋め込み画像の情報、テキストのスタイルの情報と、テキスト提示の領域の情報との、少なくともいずれかを含むものである、
ことを特徴とする請求項６から１０までのいずれか一項に記載の分割装置。 Information on each fragment included in the fragmentation information is used to specify the group of text sentences included in the fragment.
(1) A list of IDs for identifying the text sentence included in the fragment and attached to the text sentence;
(2) Information of the start time added to the text sentence having the earliest time order among the text sentences included in the fragment;
(3) Information on a start time added to the text sentence with the earliest time order among the text sentences included in the fragment and an end time added to the text sentence with the latest time order;
Including at least one of
The reference relationship information includes, as a header description of the text document necessary for presenting the fragment, information on a non-embedded font, information on an embedded image, information on a text style, and information on a text presentation area. Including at least one of the following:
The dividing device according to claim 6, wherein the dividing device is characterized in that:

前記参照関係情報は、前記断片の提示に必要な前記テキスト文書のヘッダ記述として、非組込フォントの情報と、埋め込み画像の情報、テキストのスタイルの情報と、テキスト提示の領域の情報との、少なくともいずれかを含むものである、
ことを特徴とする請求項１１に記載の分割装置。 The reference relationship information includes, as a header description of the text document necessary for presenting the fragment, information on a non-embedded font, information on an embedded image, information on a text style, and information on a text presentation area. Including at least one of the following:
The dividing apparatus according to claim 11.

請求項１から５までのいずれか一項に記載の解析装置としてコンピューターを機能させるためのプログラム。 The program for functioning a computer as an analysis apparatus as described in any one of Claim 1-5.

請求項６から１２までのいずれか一項に記載の分割装置としてコンピューターを機能させるためのプログラム。 The program for functioning a computer as a division | segmentation apparatus as described in any one of Claim 6-12.