JP2008283667A

JP2008283667A - Voip communication device

Info

Publication number: JP2008283667A
Application number: JP2008034717A
Authority: JP
Inventors: Terutaka Mita; 輝貴三田; Yoshihiro Ariyama; 義博有山
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 2007-04-11
Filing date: 2008-02-15
Publication date: 2008-11-20
Anticipated expiration: 2028-02-15
Also published as: JP5211736B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a voice over Internet protocol (VoIP) communication device capable of communicating non-audio data in audio data real-time transport protocol (RTP) stream packet communication. <P>SOLUTION: A VoIP communication device according to the present invention prepares audio data RTP packets to be transmitted to a communication network by receiving or generating a stream of the audio data RTP packets, generates non-audio data RTP packets in a different data size from the prepared audio data RTP packets, replaces at least one of the prepared audio data RTP packets with the relevant non-audio data RTP packets, and transmits a stream of audio data RTP packets including the replacing non-audio data RTP packets to the relevant communication network. Furthermore, the VoIP communication device according to the present invention extracts the non-audio data RTP packets contained in the stream of audio data RTP packets received from the communication network based on the data size of the non-audio data RTP packets and reads information contained in the extracted non-audio data RTP packets. <P>COPYRIGHT: (C)2009,JPO&INPIT

Description

本発明は音声データＲＴＰパケットのストリームを送受信して通信を行うＶｏＩＰ通信装置に関する。 The present invention relates to a VoIP communication apparatus that performs communication by transmitting and receiving a stream of voice data RTP packets.

ＶｏＩＰ(Voice Over Internet Protocol)通信装置にて音声をストリーミング再生するための伝送プロトコルとして音声ＲＴＰ(Real-time Transport Protocol)が知られている。音声データＲＴＰパケットのストリーミングによる音声データの通信と併行した非音声データの通信が要望される場合がある。例えばインターネットなどのネットワークを通じて音声などのデジタルデータをやり取りする際に、通信途中で第三者に盗み見られたり改ざんされたりしないように暗号化することが多いが、この場合、暗号鍵という非音声データを通信することになる。ＲＴＰパケットの暗号化についてはＩＥＴＦ(Internet Engineering Task Force)により発行されたＲＦＣ(Request For Comments)３７１１にてＳＲＴＰ(The Secure Real-time Transport Protocol)として標準化されている。ここでは暗号化のために必要な暗号鍵は予め登録するかあるいは別のプロトコルを使用して交換する方法が採用されている。ＲＴＰパケットの暗号通信では、暗号化のために必要な暗号鍵情報を、通信を行う装置に予め設定しない場合には、適宜、通信相手の端末に暗号鍵情報を送信する必要がある。非音声データの通知方法として例えば、特許文献１にはＲＴＰとは別のプロトコルであるＲＴＣＰ(RTP Control Protocol)を使用する通信装置及び暗号通信方法が開示されている。ここでの非音声データ（暗号鍵）通信方法は、通信装置自身のセッション鍵および当該鍵の識別情報を通話相手の公開鍵で暗号化し、これをＲＴＣＰパケットに格納して通話相手へ送信する。また、通話相手から受信したＲＴＣＰパケットに格納されている暗号化された通話相手のセッション鍵および当該鍵の識別情報を通信装置自身の秘密鍵で復号化する方法が開示されている。
特開２００５−１５９９５９号公報 Voice RTP (Real-time Transport Protocol) is known as a transmission protocol for streaming reproduction of voice in a VoIP (Voice Over Internet Protocol) communication apparatus. In some cases, non-voice data communication in parallel with voice data communication by streaming voice data RTP packets is desired. For example, when exchanging digital data such as voice over a network such as the Internet, it is often encrypted so that it is not stolen or altered by a third party during communication. In this case, non-voice data called an encryption key is used. Will communicate. RTP packet encryption is standardized as SRTP (The Secure Real-time Transport Protocol) in RFC (Request For Comments) 3711 issued by IETF (Internet Engineering Task Force). Here, a method of registering an encryption key necessary for encryption in advance or exchanging it using another protocol is adopted. In encryption communication of RTP packets, if encryption key information necessary for encryption is not set in advance in a device that performs communication, it is necessary to appropriately transmit the encryption key information to a communication partner terminal. As a non-voice data notification method, for example, Patent Document 1 discloses a communication device and an encryption communication method using RTCP (RTP Control Protocol), which is a protocol different from RTP. In this non-voice data (encryption key) communication method, the session key of the communication device itself and the identification information of the key are encrypted with the public key of the other party, stored in an RTCP packet, and transmitted to the other party. Also disclosed is a method of decrypting the encrypted session partner's session key and identification information of the key stored in the RTCP packet received from the other party with the private key of the communication device itself.
JP 2005-159959 A

ところで、ＲＴＣＰはデータの送受信チェックや送信者／受信者間などの付随情報を伝達するために用いられ、ＲＴＰのサブプロトコルとして位置付けられる。セキュリティの観点から、ＲＴＣＰの如き付随情報を通信するために利用されるプロトコルによる情報通信を制限しているネットワークがしばしば見られる。ＲＴＣＰパケットの通信路内に、ＲＴＣＰプロトコルを利用した情報通信を制限しているネットワークが１つでも存在した場合、特許文献１に開示される通信方法を適用できず、暗号鍵などの非音声データを通信できないという問題点があった。 By the way, RTCP is used for transmitting / receiving data transmission / reception and accompanying information such as between a sender and a receiver, and is positioned as a sub-protocol of RTP. From the viewpoint of security, there are often seen networks that restrict information communication by a protocol used for communicating accompanying information such as RTCP. If there is at least one network that restricts information communication using the RTCP protocol in the communication path of the RTCP packet, the communication method disclosed in Patent Document 1 cannot be applied, and non-voice data such as an encryption key. There was a problem that could not communicate.

本発明は上記した如き問題点に鑑みてなされたものであって、音声データＲＴＰストリームパケット通信において非音声データの通信を可能とするＶｏＩＰ通信装置を提供することを目的とする。 The present invention has been made in view of the above-described problems, and an object thereof is to provide a VoIP communication apparatus that enables non-voice data communication in voice data RTP stream packet communication.

本発明によるＶｏＩＰ通信装置は、通信ネットワークを介して音声データＲＴＰパケットのストリームを送受信するＶｏＩＰ通信装置であって、前記通信ネットワークへ送信すべき音声データＲＴＰパケットのストリームを受信若しくは生成して準備する送信音声データＲＴＰストリームパケット準備手段と、当該準備した音声データＲＴＰパケットのデータサイズと異なるデータサイズの非音声データＲＴＰパケットを生成する非音声データＲＴＰパケット生成部と、当該準備した音声データＲＴＰパケットの内の少なくとも１つを前記非音声データＲＴＰパケットに置き換える非音声データＲＴＰパケット挿入部と、当該置き換えた非音声データＲＴＰパケットを含む音声データＲＴＰパケットのストリームを前記通信ネットワークに送信するＲＴＰストリームパケット送信手段と、を含むことを特徴とする。 A VoIP communication apparatus according to the present invention is a VoIP communication apparatus that transmits and receives a stream of voice data RTP packets via a communication network, and receives or generates a stream of voice data RTP packets to be transmitted to the communication network. A transmission voice data RTP stream packet preparation means, a non-voice data RTP packet generator for generating a non-voice data RTP packet having a data size different from the data size of the prepared voice data RTP packet, and the prepared voice data RTP packet A non-voice data RTP packet insertion unit that replaces at least one of them with the non-voice data RTP packet, and a stream of voice data RTP packets including the replaced non-voice data RTP packet to the communication network. Characterized in that it comprises a RTP stream packet transmitting means for, the.

また、本発明によるＶｏＩＰ通信装置は、通信ネットワークを介して音声データＲＴＰパケットのストリームを送受信するＶｏＩＰ通信装置であって、前記通信ネットワークから受信した音声データＲＴＰパケットのストリームに含まれる非音声データＲＴＰパケットを当該非音声データＲＴＰパケットのデータサイズに基づいて抽出する非音声データＲＴＰパケット抽出部と、当該抽出した非音声データＲＴＰパケットに含まれる情報を読み取る非音声データＲＴＰパケット読取部と、を含むことを特徴とする。 The VoIP communication apparatus according to the present invention is a VoIP communication apparatus that transmits and receives a stream of voice data RTP packets via a communication network, and includes non-voice data RTP included in the stream of voice data RTP packets received from the communication network. A non-voice data RTP packet extraction unit that extracts packets based on the data size of the non-voice data RTP packet; and a non-voice data RTP packet reading unit that reads information contained in the extracted non-voice data RTP packet. It is characterized by that.

以下、本発明に係る実施例について添付の図面を参照しつつ詳細に説明する。 Hereinafter, embodiments according to the present invention will be described in detail with reference to the accompanying drawings.

図１は本発明によるＶｏＩＰ通信装置を通信ネットワークと共に表すブロック図である。ＶｏＩＰ通信装置１００は、ＲＴＰ制御部１１０と、非音声データ読取生成部１２０と、ＤＳＰ部１３０と、ＷＡＮポート１４０と、ＳＬＩＣ部１５０と、を含む。 FIG. 1 is a block diagram showing a VoIP communication apparatus according to the present invention together with a communication network. The VoIP communication apparatus 100 includes an RTP control unit 110, a non-voice data reading / generating unit 120, a DSP unit 130, a WAN port 140, and an SLIC unit 150.

ＲＴＰ制御部１１０は、ＲＴＰパケット送受信部１１１と、非音声データＲＴＰパケット挿入部１１２と、非音声データＲＴＰパケット抽出部１１３と、を含む。 The RTP control unit 110 includes an RTP packet transmission / reception unit 111, a non-voice data RTP packet insertion unit 112, and a non-voice data RTP packet extraction unit 113.

ＲＴＰパケット送受信部１１１は、ＷＡＮポート１４０経由でインターネットなどの通信ネットワーク２００を介して、音声データＲＴＰパケットのストリームを送受信する。ＲＴＰパケット送受信部１１１は、ＷＡＮポート１４０経由で通信ネットワーク２００から音声データＲＴＰパケットを受信したら、これをＲＴＰパケット送受信部１１１に供給する。ＲＴＰパケット送受信部１１１は、音声データＲＴＰパケットのストリーム内に非音声データＲＴＰパケットが含まれていれば、これもＲＴＰパケット送受信部１１１に供給する。また、ＲＴＰパケット送受信部１１１は、非音声データＲＴＰパケット挿入部１１２から供給され且つ非音声データＲＴＰパケットを含む音声データＲＴＰパケットのストリームをＷＡＮポート１４０経由で通信ネットワーク２００に送信する。なお、本実施例におけるＲＴＰパケットはＩＥＴＦ（Internet Engineering Task Force）によって発行されたＲＦＣ（Request For Comments）１８８９の規定に準拠したものであれば良い。 The RTP packet transmission / reception unit 111 transmits / receives a stream of voice data RTP packets via the WAN port 140 and the communication network 200 such as the Internet. When the RTP packet transmission / reception unit 111 receives the voice data RTP packet from the communication network 200 via the WAN port 140, the RTP packet transmission / reception unit 111 supplies this to the RTP packet transmission / reception unit 111. The RTP packet transmission / reception unit 111 supplies the non-voice data RTP packet to the RTP packet transmission / reception unit 111 if the stream of the voice data RTP packet includes the non-voice data RTP packet. Further, the RTP packet transmission / reception unit 111 transmits a stream of voice data RTP packets supplied from the non-voice data RTP packet insertion unit 112 and including the non-voice data RTP packets to the communication network 200 via the WAN port 140. The RTP packet in this embodiment may be any packet that conforms to the provisions of RFC (Request For Comments) 1889 issued by the Internet Engineering Task Force (IETF).

非音声データＲＴＰパケット挿入部１１２は、音声データＲＴＰパケット生成部１３１から受け取った音声データＲＴＰパケットの内の少なくとも１つを、非音声データＲＴＰパケット生成部１２１から受け取った非音声データＲＴＰパケットに置き換える。非音声データＲＴＰパケット挿入部１１２は、好ましいタイミングにて音声データＲＴＰパケットを非音声データＲＴＰパケットに置き換えれば良い。非音声データＲＴＰパケット挿入部１１２は、当該置き換えた非音声データＲＴＰパケットを含む音声データＲＴＰパケットのストリームをＲＴＰパケット送受信部１１１に供給する。 The non-voice data RTP packet insertion unit 112 replaces at least one of the voice data RTP packets received from the voice data RTP packet generation unit 131 with the non-voice data RTP packet received from the non-voice data RTP packet generation unit 121. . The non-voice data RTP packet insertion unit 112 may replace the voice data RTP packet with a non-voice data RTP packet at a preferred timing. The non-voice data RTP packet insertion unit 112 supplies a stream of voice data RTP packets including the replaced non-voice data RTP packet to the RTP packet transmission / reception unit 111.

非音声データＲＴＰパケット抽出部１１３は、ＲＴＰパケット送受信部１１１から受け取った音声データＲＴＰパケットのストリームに含まれる非音声データＲＴＰパケットを当該非音声データＲＴＰパケットのデータサイズに基づいて抽出する。非音声データＲＴＰパケット抽出部１１３は、当該抽出した非音声データＲＴＰパケットを非音声データＲＴＰパケット読取部１２２に供給する。また、非音声データＲＴＰパケット抽出部１１３は、当該抽出した非音声データＲＴＰパケット以外の音声データＲＴＰパケットのストリームを音声信号変換部１３２に供給する。 The non-voice data RTP packet extraction unit 113 extracts the non-voice data RTP packet included in the stream of voice data RTP packets received from the RTP packet transmission / reception unit 111 based on the data size of the non-voice data RTP packet. The non-voice data RTP packet extraction unit 113 supplies the extracted non-voice data RTP packet to the non-voice data RTP packet reading unit 122. Further, the non-voice data RTP packet extraction unit 113 supplies a stream of voice data RTP packets other than the extracted non-voice data RTP packet to the voice signal conversion unit 132.

パケットのデータサイズはペイロード部分に含まれるデータ量によって異なってくる。通常、パケットのヘッダ内にパケット全体のサイズ情報が格納されている。非音声データＲＴＰパケット抽出部１１３は、当該サイズ情報を参照するなどしてパケットのデータサイズを判別することができる。非音声データＲＴＰパケット抽出部１１３は、音声データＲＴＰパケットの所定データサイズを予め記憶しておく。非音声データＲＴＰパケット抽出部１１３は、音声データＲＴＰパケットまたは非音声データＲＴＰパケットを受け取る度に、当該受け取ったＲＴＰパケットのヘッダ内に格納されているサイズ情報を参照するなどして、パケットのデータサイズを判別する。非音声データＲＴＰパケット抽出部１１３は、当該判別したサイズが予め記憶してある所定データサイズと異なる場合に、受け取ったＲＴＰパケットが非音声データＲＴＰパケットであると判別して、これを抽出する。 The data size of the packet varies depending on the amount of data included in the payload portion. Usually, the size information of the entire packet is stored in the header of the packet. The non-voice data RTP packet extraction unit 113 can determine the data size of the packet by referring to the size information. The non-voice data RTP packet extraction unit 113 stores a predetermined data size of the voice data RTP packet in advance. Each time the non-voice data RTP packet extraction unit 113 receives a voice data RTP packet or a non-voice data RTP packet, the non-voice data RTP packet extraction unit 113 refers to the size information stored in the header of the received RTP packet. Determine the size. When the determined size is different from the predetermined data size stored in advance, the non-voice data RTP packet extraction unit 113 determines that the received RTP packet is a non-voice data RTP packet and extracts it.

例えば、非音声データＲＴＰパケット抽出部１１３は、音声データＲＴＰパケットの所定データサイズを１０００バイトとして予め記憶しておく。非音声データＲＴＰパケット抽出部１１３は、音声データＲＴＰパケットまたは非音声データＲＴＰパケットを受け取る度に、当該受け取ったＲＴＰパケットのヘッダ内に格納されているサイズ情報を参照し、当該サイズ情報が予め記憶してある所定データサイズの１０００バイトと一致するか否かを判別する。例えば当該判別したサイズが１４００バイトであった場合、受け取ったＲＴＰパケットが非音声データＲＴＰパケットであると判別して、これを抽出する。 For example, the non-voice data RTP packet extraction unit 113 stores the predetermined data size of the voice data RTP packet as 1000 bytes in advance. Each time the non-voice data RTP packet extraction unit 113 receives the voice data RTP packet or the non-voice data RTP packet, the non-voice data RTP packet extraction unit 113 refers to the size information stored in the header of the received RTP packet and stores the size information in advance. It is determined whether or not the predetermined data size matches 1000 bytes. For example, if the determined size is 1400 bytes, it is determined that the received RTP packet is a non-voice data RTP packet, and this is extracted.

非音声データ読取生成部１２０は、非音声データＲＴＰパケット生成部１２１と、非音声データＲＴＰパケット読取部１２２と、を含む。 The non-voice data reading / generating unit 120 includes a non-voice data RTP packet generating unit 121 and a non-voice data RTP packet reading unit 122.

非音声データＲＴＰパケット生成部１２１は、音声データＲＴＰパケット生成部１３１が生成している音声データＲＴＰパケットと異なるデータサイズの非音声データＲＴＰパケットを生成する。非音声データＲＴＰパケット生成部１２１は、生成すべき非音声データＲＴＰパケットの所定データサイズを予め記憶しておき、当該所定データサイズの非音声データＲＴＰパケットを生成する。非音声データＲＴＰパケットのサイズは、音声データＲＴＰパケット生成部１３１が生成する音声データＲＴＰパケットのサイズと異なるサイズであれば良く、音声データＲＴＰパケットのサイズと比較した場合の大小は問わない。パケットのデータサイズはペイロード部分に含まれるデータ量によって異なるため、非音声データＲＴＰパケットにおいて当該データ量を音声データＲＴＰパケットのそれと異なるデータ量にすれば良い。 The non-voice data RTP packet generator 121 generates a non-voice data RTP packet having a data size different from that of the voice data RTP packet generated by the voice data RTP packet generator 131. The non-voice data RTP packet generation unit 121 stores a predetermined data size of the non-voice data RTP packet to be generated in advance, and generates a non-voice data RTP packet having the predetermined data size. The size of the non-voice data RTP packet may be any size different from the size of the voice data RTP packet generated by the voice data RTP packet generation unit 131, and the size of the non-voice data RTP packet is not limited when compared with the size of the voice data RTP packet. Since the data size of the packet differs depending on the amount of data included in the payload portion, the data amount in the non-voice data RTP packet may be set to a data quantity different from that of the voice data RTP packet.

例えば、非音声データＲＴＰパケット生成部１２１は、生成すべき非音声データＲＴＰパケットの所定データサイズを１４００バイトとして予め記憶しておき、データサイズが１４００バイトの非音声データＲＴＰパケットを生成する。非音声データＲＴＰパケット生成部１２１は好ましいタイミングにて非音声データＲＴＰパケットを生成すれば良い。非音声データＲＴＰパケット生成部１２１がペイロード部分に含めるデータは、例えば、端末識別情報、端末使用状況情報及びサービス対応状況情報などである。 For example, the non-voice data RTP packet generation unit 121 stores a predetermined data size of the non-voice data RTP packet to be generated as 1400 bytes in advance, and generates a non-voice data RTP packet with a data size of 1400 bytes. The non-voice data RTP packet generation unit 121 may generate the non-voice data RTP packet at a preferable timing. The data included in the payload portion by the non-voice data RTP packet generation unit 121 is, for example, terminal identification information, terminal usage status information, and service response status information.

端末識別情報とは例えば、当該ＶｏＩＰ通信装置の識別情報などであり、ＩＰアドレスやＭＡＣアドレスなども含む。また、通信相手となるＶｏＩＰ通信装置の同情報を確認情報として含むこともありうる。端末使用状況情報とは例えばアナログ電話端末３００の使用状況や応答状況などである。サービス対応状況情報とはアナログ電話端末３００がファクシミリやテレビ電話などのサービスに対応しているか否かを表す情報である。ＶｏＩＰ装置１００は予めアナログ電話端末３００からこれらの情報を入手しておくなどして、好ましいタイミングにて当該端末識別情報を含む非音声データＲＴＰパケットを生成する。非音声データＲＴＰパケットを受信する側のＶｏＩＰ通信装置は、当該パケットに含まれる端末識別情報を送信する側のＶｏＩＰ通信装置の認証処理などに利用できる。 The terminal identification information is, for example, identification information of the VoIP communication device, and includes an IP address and a MAC address. In addition, the same information of the VoIP communication device as a communication partner may be included as confirmation information. The terminal usage status information is, for example, the usage status or response status of the analog telephone terminal 300. The service support status information is information indicating whether or not the analog telephone terminal 300 supports services such as facsimile and videophone. The VoIP device 100 obtains such information from the analog telephone terminal 300 in advance, and generates a non-voice data RTP packet including the terminal identification information at a preferred timing. The VoIP communication device on the side that receives the non-voice data RTP packet can be used for authentication processing of the VoIP communication device on the side that transmits the terminal identification information included in the packet.

非音声データＲＴＰパケット読取部１２２は、非音声データＲＴＰパケット抽出部１１３から供給された非音声データＲＴＰパケットに含まれる情報を読み取る。当該情報は例えば、上述した端末識別情報、端末使用状況情報及びサービス対応状況情報などである。ＶｏＩＰ通信装置１００は、非音声データＲＴＰパケット読取部１２２が読み取った端末識別情報に基づいて、送信側のＶｏＩＰ通信装置の認証処理などを行うことができる。 The non-voice data RTP packet reading unit 122 reads information included in the non-voice data RTP packet supplied from the non-voice data RTP packet extraction unit 113. The information includes, for example, the above-described terminal identification information, terminal usage status information, service response status information, and the like. The VoIP communication apparatus 100 can perform authentication processing of the transmission-side VoIP communication apparatus based on the terminal identification information read by the non-voice data RTP packet reading unit 122.

ＤＳＰ(Digital Signal Processor)部１３０は、音声データＲＴＰパケット生成部１３１と、音声信号変換部１３２と、を含む。 The DSP (Digital Signal Processor) unit 130 includes an audio data RTP packet generation unit 131 and an audio signal conversion unit 132.

音声データＲＴＰパケット生成部１３１は、ＳＬＩＣ部１５０から供給された音声信号に基づいて音声データＲＴＰパケットのストリームを生成する。音声データＲＴＰパケット生成部１３１は、当該生成によって準備した音声データＲＴＰパケットを非音声データＲＴＰパケット挿入部１１２に供給する。音声データＲＴＰパケット生成部１３１は、生成すべき音声データＲＴＰパケットの所定データサイズを予め記憶しておき、当該所定データサイズの音声データＲＴＰパケットを生成する。例えば、音声データＲＴＰパケット生成部１３１は、生成すべき音声データＲＴＰパケットの所定データサイズを１０００バイトとして予め記憶しておき、１０００バイトの音声データＲＴＰパケットを生成する。 The audio data RTP packet generation unit 131 generates a stream of audio data RTP packets based on the audio signal supplied from the SLIC unit 150. The voice data RTP packet generation unit 131 supplies the voice data RTP packet prepared by the generation to the non-voice data RTP packet insertion unit 112. The voice data RTP packet generation unit 131 stores a predetermined data size of the voice data RTP packet to be generated in advance, and generates a voice data RTP packet having the predetermined data size. For example, the voice data RTP packet generation unit 131 stores a predetermined data size of the voice data RTP packet to be generated as 1000 bytes in advance, and generates a 1000 byte voice data RTP packet.

音声信号変換部１３２は、非音声データＲＴＰパケット抽出部１１３から供給された音声データＲＴＰパケットのストリームを音声信号に変換してＳＬＩＣ部１５０に供給する。また、音声信号変換部１３２は、ＰＬＣ(packet Loss Concealment)などの補間機能を備えており、非音声データＲＴＰパケット抽出部１１３によって抽出された非音声データＲＴＰパケットを穴埋めするための音声データを生成できる。 The audio signal converter 132 converts the audio data RTP packet stream supplied from the non-audio data RTP packet extractor 113 into an audio signal and supplies the audio signal to the SLIC unit 150. The audio signal conversion unit 132 has an interpolation function such as PLC (packet loss concealment), and generates audio data for filling in the non-audio data RTP packet extracted by the non-audio data RTP packet extraction unit 113. it can.

ＷＡＮ(World Area Network)ポート１４０は、ＲＴＰパケット送受信部１１１と通信ネットワーク２００との間にあって音声データＲＴＰパケットを中継する。 A WAN (World Area Network) port 140 is located between the RTP packet transmitting / receiving unit 111 and the communication network 200 and relays voice data RTP packets.

ＳＬＩＣ(Subscriber Line Interface Circuit)部１５０は、アナログ電話端末３００から受信した音声信号を音声データＲＴＰパケット生成部１３１に供給する。また、ＳＬＩＣ部１５０は、音声信号変換部１３２から供給された音声信号をアナログ電話端末３００に送信する。 A SLIC (Subscriber Line Interface Circuit) unit 150 supplies a voice signal received from the analog telephone terminal 300 to the voice data RTP packet generation unit 131. In addition, the SLIC unit 150 transmits the audio signal supplied from the audio signal conversion unit 132 to the analog telephone terminal 300.

図２Ａ〜２Ｃは送信時における非音声データＲＴＰパケット挿入部でのＲＴＰパケットのストリームの一例を表す図である。 2A to 2C are diagrams illustrating an example of a stream of RTP packets in the non-voice data RTP packet insertion unit at the time of transmission.

図２Ａは非音声データＲＴＰパケット挿入部１１２が音声データＲＴＰパケット生成部１３１から受け取った音声データＲＴＰパケットのストリームである。当該音声データＲＴＰパケットのストリームは、アナログ電話端末３００から送信された音声信号をＳＬＩＣ部１５０が受信し、当該音声信号に基づいて音声データＲＴＰパケット生成部１３１が生成したものである。非音声データＲＴＰパケット挿入部１１２は音声データＲＴＰパケットＰ１〜Ｐ５を音声データＲＴＰパケット生成部１３１から順次、受け取る。 FIG. 2A shows a stream of voice data RTP packets received by the non-voice data RTP packet insertion unit 112 from the voice data RTP packet generation unit 131. The audio data RTP packet stream is generated by the audio data RTP packet generation unit 131 based on the audio signal received by the SLIC unit 150 from the audio signal transmitted from the analog telephone terminal 300. The non-voice data RTP packet insertion unit 112 sequentially receives the voice data RTP packets P1 to P5 from the voice data RTP packet generation unit 131.

図２Ｂは非音声データＲＴＰパケット挿入部１１２が音声データＲＴＰパケット生成部１３１から受け取った音声データＲＴＰパケットＰ３を、非音声データＲＴＰパケット生成部１２１から受け取った非音声データＲＴＰパケットＤ３に置き換えるときの音声データＲＴＰパケットのストリームである。このとき、非音声データＲＴＰパケット挿入部１１２は音声データＲＴＰパケットＰ３を破棄して、当該破棄した箇所に非音声データＲＴＰパケットＤ３を挿入することによって非音声データＲＴＰパケットＤ３に置き換える。非音声データＲＴＰパケット挿入部１１２は、好ましいタイミングにて音声データＲＴＰパケットを非音声データＲＴＰパケットに置き換えれば良い。ここでは、音声データＲＴＰパケットを１つだけ置き換えているが、本発明には音声データＲＴＰパケットを置き換える個数にかかる制限は無く、非音声データＲＴＰパケット生成部１２１が複数の非音声データＲＴＰパケットを生成し、非音声データＲＴＰパケット挿入部１１２がこれらと複数の音声データＲＴＰパケットとを置き換えても良い。非音声データＲＴＰパケットＤ３には非音声データＲＴＰパケット生成部１２１により端末識別情報、端末使用状況情報及びサービス対応状況情報などの非音声データが含められている。非音声データＲＴＰパケットＤ３のデータサイズは音声データＲＴＰパケットＰ１、Ｐ２、Ｐ４、Ｐ５の各々のデータサイズと異なる。 FIG. 2B illustrates a case where the non-voice data RTP packet insertion unit 112 replaces the voice data RTP packet P3 received from the voice data RTP packet generation unit 131 with a non-voice data RTP packet D3 received from the non-voice data RTP packet generation unit 121. It is a stream of audio data RTP packets. At this time, the non-voice data RTP packet insertion unit 112 discards the voice data RTP packet P3 and replaces it with the non-voice data RTP packet D3 by inserting the non-voice data RTP packet D3 at the discarded location. The non-voice data RTP packet insertion unit 112 may replace the voice data RTP packet with a non-voice data RTP packet at a preferred timing. Here, only one voice data RTP packet is replaced, but the present invention has no limitation on the number of voice data RTP packets to be replaced, and the non-voice data RTP packet generation unit 121 replaces a plurality of voice data RTP packets. The non-voice data RTP packet insertion unit 112 may generate and replace these with a plurality of voice data RTP packets. The non-voice data RTP packet D3 includes non-voice data such as terminal identification information, terminal usage status information, and service response status information by the non-voice data RTP packet generator 121. The data size of the non-voice data RTP packet D3 is different from the data size of each of the voice data RTP packets P1, P2, P4, and P5.

図２Ｃは非音声データＲＴＰパケット挿入部１１２が音声データＲＴＰパケットＰ３を非音声データＲＴＰパケットＤ３に置き換えた後の音声データＲＴＰパケットのストリームである。非音声データＲＴＰパケット挿入部１１２は音声データＲＴＰパケットＰ１、Ｐ２、非音声データＲＴＰパケットＤ３、音声データＲＴＰパケットＰ４、Ｐ５をＲＴＰパケット送受信部１１１に順次、供給する。ＲＴＰパケット送受信部１１１はこれらのＲＴＰパケットを順次、ＷＡＮポート１４０経由で通信ネットワーク２００に送信する。 FIG. 2C is a stream of voice data RTP packets after the non-voice data RTP packet insertion unit 112 replaces the voice data RTP packet P3 with the non-voice data RTP packet D3. The non-voice data RTP packet insertion unit 112 sequentially supplies the voice data RTP packets P1 and P2, the non-voice data RTP packet D3, and the voice data RTP packets P4 and P5 to the RTP packet transmission / reception unit 111. The RTP packet transmitting / receiving unit 111 sequentially transmits these RTP packets to the communication network 200 via the WAN port 140.

上記したように音声データＲＴＰパケットのストリーム送信において、ＲＴＰパケットでの非音声データの送信が可能となる。受信側の装置では非音声データＲＴＰパケットＤ３を含む音声データＲＴＰパケットのストリームを受信することにより、端末識別情報、端末使用状況情報及びサービス対応状況情報などの情報を得ることができる。 As described above, in the audio data RTP packet stream transmission, non-audio data can be transmitted in the RTP packet. By receiving the stream of the voice data RTP packet including the non-voice data RTP packet D3, the receiving apparatus can obtain information such as terminal identification information, terminal usage status information, and service response status information.

図３Ａ及び３Ｂは受信時における非音声データＲＴＰパケット抽出部でのＲＴＰパケットのストリームの一例を表す図である。図３Ｃは受信時における音声信号変換部での音声データのストリームの一例を表す図である。 3A and 3B are diagrams illustrating an example of a stream of RTP packets in the non-voice data RTP packet extraction unit at the time of reception. FIG. 3C is a diagram illustrating an example of a stream of audio data in the audio signal conversion unit at the time of reception.

図３Ａは非音声データＲＴＰパケット抽出部１１３がＲＴＰパケット送受信部１１１から受け取った音声データＲＴＰパケットのストリームである。当該音声データＲＴＰパケットのストリームは、ＲＴＰパケット送受信部１１１がＷＡＮポート１４０経由で通信ネットワーク２００から受信したものであり、非音声データＲＴＰパケットＤ３が含まれている。非音声データＲＴＰパケットＤ３のデータサイズは音声データＲＴＰパケットＰ１、Ｐ２、Ｐ４、Ｐ５の各々のデータサイズと異なる。非音声データＲＴＰパケット抽出部１１３は音声データＲＴＰパケットＰ１、Ｐ２、非音声データＲＴＰパケットＤ３、音声データＲＴＰパケットＰ４、Ｐ５をＲＴＰパケット送受信部１１１から順次、受け取る。 FIG. 3A shows a stream of voice data RTP packets received by the non-voice data RTP packet extraction unit 113 from the RTP packet transmission / reception unit 111. The stream of the voice data RTP packet is received from the communication network 200 by the RTP packet transmission / reception unit 111 via the WAN port 140, and includes a non-voice data RTP packet D3. The data size of the non-voice data RTP packet D3 is different from the data size of each of the voice data RTP packets P1, P2, P4, and P5. The non-voice data RTP packet extraction unit 113 sequentially receives the voice data RTP packets P1 and P2, the non-voice data RTP packet D3, and the voice data RTP packets P4 and P5 from the RTP packet transmission / reception unit 111.

図３Ｂは非音声データＲＴＰパケット抽出部１１３がＲＴＰパケット送受信部１１１から受け取った非音声データＲＴＰパケットＤ３を抽出するときの音声データＲＴＰパケットのストリームである。このとき、非音声データＲＴＰパケット抽出部１１３はパケットデータサイズに基づいて非音声データＲＴＰパケットＤ３を抽出して、これを非音声データＲＴＰパケット読取部１２２に供給する。非音声データＲＴＰパケット抽出部１１３はＲＴＰパケット送受信部１１１から音声データＲＴＰパケットまたは非音声データＲＴＰパケットを受け取る度に、当該受け取ったＲＴＰパケットのヘッダ内に格納されているサイズ情報を参照するなどして、パケットのデータサイズを判別する。非音声データＲＴＰパケット抽出部１１３は、当該判別したデータサイズ（例えば１４００バイト）が予め記憶してある音声データＲＴＰパケットの所定データサイズ（例えば１０００バイト）と異なる場合に、受け取ったＲＴＰパケットが非音声データＲＴＰパケットであると判別して、これを抽出する。 FIG. 3B shows a stream of voice data RTP packets when the non-voice data RTP packet extraction unit 113 extracts the non-voice data RTP packet D3 received from the RTP packet transmission / reception unit 111. At this time, the non-voice data RTP packet extraction unit 113 extracts the non-voice data RTP packet D3 based on the packet data size and supplies it to the non-voice data RTP packet reading unit 122. Each time the non-voice data RTP packet extracting unit 113 receives a voice data RTP packet or a non-voice data RTP packet from the RTP packet transmitting / receiving unit 111, the non-voice data RTP packet extracting unit 113 refers to size information stored in the header of the received RTP packet. To determine the data size of the packet. The non-speech data RTP packet extraction unit 113 determines that the received RTP packet is non-successful when the determined data size (for example, 1400 bytes) is different from the predetermined data size (for example, 1000 bytes) of the stored sound data RTP packet. A voice data RTP packet is identified and extracted.

非音声データＲＴＰパケットＤ３には例えば、端末識別情報、端末使用状況情報及びサービス対応状況情報などの非音声データが含められている。非音声データＲＴＰパケット読取部１２２は供給された非音声データＲＴＰパケットＤ３に含まれるこれらの情報を読み取ることができる。 The non-voice data RTP packet D3 includes, for example, non-voice data such as terminal identification information, terminal usage status information, and service response status information. The non-voice data RTP packet reading unit 122 can read the information included in the supplied non-voice data RTP packet D3.

非音声データＲＴＰパケット抽出部１１３は、音声データＲＴＰパケットＰ１、Ｐ２、Ｐ４及びＰ５を音声信号変換部１３２に順次、供給する。音声信号変換部１３２は、非音声データＲＴＰパケット抽出部１１３から受け取った音声データＲＴＰパケットＰ１、Ｐ２、Ｐ４及びＰ５を順次、音声データＳ１、Ｓ２、Ｓ４及びＳ５に変換する。 The non-voice data RTP packet extraction unit 113 sequentially supplies the voice data RTP packets P1, P2, P4, and P5 to the voice signal conversion unit 132. The audio signal converter 132 sequentially converts the audio data RTP packets P1, P2, P4, and P5 received from the non-audio data RTP packet extractor 113 into audio data S1, S2, S4, and S5.

図３Ｃは音声信号変換部１３２における音声データＲＴＰパケットＳ１、Ｓ２、Ｃ３、Ｓ４及びＳ５のストリームを表す図である。音声信号変換部１３２はＰＬＣ(Packet Loss Concealment)などの補間機能を備えており、抽出された非音声データＲＴＰパケットＤ３を穴埋めするための音声データＣ３を生成する。音声信号変換部１３２は、生成した音声データＣ３を含む音声データＳ１、Ｓ２、Ｃ３、Ｓ４、Ｓ５をＳＬＩＣ部１５０に順次、供給する。また、ＳＬＩＣ部１５０は、音声信号変換部１３２から供給された音声データをアナログ電話端末３００に送信する。 FIG. 3C is a diagram illustrating a stream of audio data RTP packets S1, S2, C3, S4, and S5 in the audio signal conversion unit 132. The audio signal conversion unit 132 has an interpolation function such as PLC (Packet Loss Concealment), and generates audio data C3 for filling the extracted non-audio data RTP packet D3. The audio signal conversion unit 132 sequentially supplies audio data S1, S2, C3, S4, and S5 including the generated audio data C3 to the SLIC unit 150. In addition, the SLIC unit 150 transmits the audio data supplied from the audio signal conversion unit 132 to the analog telephone terminal 300.

上記したように音声データＲＴＰパケットのストリーム受信において、ＲＴＰパケットでの非音声データの受信が可能となる。非音声データＲＴＰパケットＤ３には端末識別情報、端末使用状況情報及びサービス対応状況情報などの情報が含まれており、受信側の装置では非音声データＲＴＰパケットＤ３を含む音声データＲＴＰパケットのストリームを受信することにより、これらの情報を得ることができる。 As described above, in the stream reception of the voice data RTP packet, it is possible to receive the non-voice data by the RTP packet. The non-voice data RTP packet D3 includes information such as terminal identification information, terminal usage status information, and service response status information. The receiving-side apparatus receives a stream of voice data RTP packets including the non-voice data RTP packet D3. Such information can be obtained by receiving.

上記した如く本実施例によれば、音声データＲＴＰパケットのストリーム送受信において、ＲＴＰパケットでの非音声データの送受信が可能となる。例えばＶｏＩＰ通信装置１００が非音声データＲＴＰパケットに自身の識別情報を含めて送信すれば、受信側の装置において当該識別情報を受信することができる。なお、ここでの受信側の装置とは本発明によるＶｏＩＰ通信装置でも良いし、他の装置でも良い。これにより、通信ネットワーク内にＲＴＰとは異なるプロトコル（例えばＲＴＣＰなど）による通信を制限しているネットワークが存在した場合でも、識別情報の送受信が可能となる。受信側の装置は当該識別情報を送信側のＶｏＩＰ通信装置の認証処理に利用することができる。また、送信側の端末（例えばアナログ電話端末３００）の使用状況やファクシミリ及びＴＶ電話などのサービスの対応の有無を表すデータを非音声データＲＴＰパケットに含めて送信することにより、受信側の装置がこれらの情報を受信することができる。仮に受信側の装置が本発明によるＶｏＩＰ通信装置ではない場合、非音声データＲＴＰパケットは不完全なパケットと判別され、当該ＲＴＰパケットは破棄される。受信側の装置がＰＬＣなどの補間機能を備えていれば、当該破棄されたＲＴＰパケットに代えて音声データが補間されるため、聴感上の影響は最小限に止められる。 As described above, according to the present embodiment, in the stream transmission / reception of the voice data RTP packet, the transmission / reception of the non-voice data by the RTP packet becomes possible. For example, if the VoIP communication device 100 transmits the non-voice data RTP packet including its own identification information, the reception side device can receive the identification information. Note that the receiving-side device here may be a VoIP communication device according to the present invention or another device. As a result, even when there is a network that restricts communication using a protocol (for example, RTCP) different from RTP in the communication network, the identification information can be transmitted and received. The receiving apparatus can use the identification information for authentication processing of the transmitting VoIP communication apparatus. In addition, by transmitting the data indicating the usage status of the terminal on the transmitting side (for example, analog telephone terminal 300) and the availability of services such as facsimile and videophone in a non-voice data RTP packet, the receiving apparatus can transmit the data. These pieces of information can be received. If the receiving apparatus is not a VoIP communication apparatus according to the present invention, the non-voice data RTP packet is determined as an incomplete packet, and the RTP packet is discarded. If the receiving apparatus has an interpolating function such as PLC, the audio data is interpolated in place of the discarded RTP packet, so the influence on hearing is minimized.

図４はＬＡＮポート及び無線ＬＡＮポートを含むＶｏＩＰ通信装置を表すブロック図である。他のブロックは図１に示されるのと同様である。実施例１と同様にＳＬＩＣ部１５０は、ＤＳＰ部１３０を介して非音声データＲＴＰパケット挿入部１１２及び非音声データＲＴＰパケット抽出部１１３と音声データをやり取りする。それに対して、ＬＡＮポート１６０及び無線ＬＡＮポート１７０はＤＳＰ部１３０を介さずに非音声データＲＴＰパケット挿入部１１２及び非音声データＲＴＰパケット抽出部１１３とそれぞれ接続される。 FIG. 4 is a block diagram showing a VoIP communication apparatus including a LAN port and a wireless LAN port. The other blocks are the same as those shown in FIG. Similar to the first embodiment, the SLIC unit 150 exchanges voice data with the non-voice data RTP packet insertion unit 112 and the non-voice data RTP packet extraction unit 113 via the DSP unit 130. On the other hand, the LAN port 160 and the wireless LAN port 170 are connected to the non-voice data RTP packet insertion unit 112 and the non-voice data RTP packet extraction unit 113 without going through the DSP unit 130, respectively.

非音声データＲＴＰパケット挿入部１１２は、ＬＡＮポート１６０または無線ＬＡＮポート１７０から受け取った音声データＲＴＰパケットの内の少なくとも１つを、非音声データＲＴＰパケット生成部１２１から受け取った非音声データＲＴＰパケットに置き換える。非音声データＲＴＰパケット挿入部１１２は、当該置き換えた非音声データＲＴＰパケットを含む音声データＲＴＰパケットのストリームをＲＴＰパケット送受信部１１１に供給する。 The non-voice data RTP packet insertion unit 112 converts at least one of the voice data RTP packets received from the LAN port 160 or the wireless LAN port 170 into the non-voice data RTP packet received from the non-voice data RTP packet generation unit 121. replace. The non-voice data RTP packet insertion unit 112 supplies a stream of voice data RTP packets including the replaced non-voice data RTP packet to the RTP packet transmission / reception unit 111.

非音声データＲＴＰパケット抽出部１１３は、ＲＴＰパケット送受信部１１１から受け取った音声データＲＴＰパケットのストリームに含まれる非音声データＲＴＰパケットを当該非音声データＲＴＰパケットのデータサイズに基づいて抽出する。非音声データＲＴＰパケット抽出部１１３は、当該抽出した非音声データＲＴＰパケットを非音声データＲＴＰパケット読取部１２２に供給する。また、非音声データＲＴＰパケット抽出部１１３は、当該抽出した非音声データＲＴＰパケット以外の音声データＲＴＰパケットのストリームをＬＡＮポート１６０または無線ＬＡＮポート１７０に供給する。 The non-voice data RTP packet extraction unit 113 extracts the non-voice data RTP packet included in the stream of voice data RTP packets received from the RTP packet transmission / reception unit 111 based on the data size of the non-voice data RTP packet. The non-voice data RTP packet extraction unit 113 supplies the extracted non-voice data RTP packet to the non-voice data RTP packet reading unit 122. Further, the non-voice data RTP packet extraction unit 113 supplies a stream of voice data RTP packets other than the extracted non-voice data RTP packet to the LAN port 160 or the wireless LAN port 170.

ＬＡＮポート１６０は、ＩＰ電話端末４００から音声データＲＴＰパケットのストリームを受信する。ＬＡＮポート１６０は、当該受信によって準備した音声データＲＴＰパケットのストリームを非音声データＲＴＰパケット挿入部１１２に供給する。また、ＬＡＮポート１６０は、非音声データＲＴＰパケット抽出部１１３から供給された音声データＲＴＰパケットのストリームをＩＰ電話端末４００に送信する。 LAN port 160 receives a stream of voice data RTP packets from IP telephone terminal 400. The LAN port 160 supplies the stream of voice data RTP packets prepared by the reception to the non-voice data RTP packet insertion unit 112. Further, the LAN port 160 transmits a stream of voice data RTP packets supplied from the non-voice data RTP packet extraction unit 113 to the IP telephone terminal 400.

無線ＬＡＮポート１７０は、無線ＩＰ電話端末５００から音声データＲＴＰパケットのストリームを受信する。無線ＬＡＮポート１７０は、当該受信によって準備した音声データＲＴＰパケットのストリームを非音声データＲＴＰパケット挿入部１１２に供給する。また、無線ＬＡＮポート１７０は、非音声データＲＴＰパケット抽出部１１３から供給された音声データＲＴＰパケットのストリームを無線ＩＰ電話端末５００に送信する。 The wireless LAN port 170 receives a stream of voice data RTP packets from the wireless IP telephone terminal 500. The wireless LAN port 170 supplies the stream of voice data RTP packets prepared by the reception to the non-voice data RTP packet insertion unit 112. Further, the wireless LAN port 170 transmits a stream of voice data RTP packets supplied from the non-voice data RTP packet extraction unit 113 to the wireless IP telephone terminal 500.

再び図２Ａ〜２Ｃを参照しつつ、ＩＰ電話端末４００または無線ＩＰ電話端末５００から受信した音声データＲＴＰパケットのストリームを通信ネットワーク２００へ送信する場合における非音声データＲＴＰパケット挿入部１１２でのＲＴＰパケットのストリームについて説明する。 2A to 2C again, the RTP packet in the non-voice data RTP packet insertion unit 112 when the voice data RTP packet stream received from the IP telephone terminal 400 or the wireless IP telephone terminal 500 is transmitted to the communication network 200. Will be described.

図２Ａは非音声データＲＴＰパケット挿入部１１２がＬＡＮポート１６０または無線ＬＡＮポート１７０から受け取った音声データＲＴＰパケットのストリームである。非音声データＲＴＰパケット挿入部１１２は音声データＲＴＰパケットＰ１〜Ｐ５をＬＡＮポート１６０または無線ＬＡＮポート１７０から順次、受け取る。 FIG. 2A shows a stream of voice data RTP packets received from the LAN port 160 or the wireless LAN port 170 by the non-voice data RTP packet insertion unit 112. The non-voice data RTP packet insertion unit 112 sequentially receives voice data RTP packets P1 to P5 from the LAN port 160 or the wireless LAN port 170.

図２Ｂは非音声データＲＴＰパケット挿入部１１２がＬＡＮポート１６０または無線ＬＡＮポート１７０から受け取った音声データＲＴＰパケットＰ３を、非音声データＲＴＰパケット生成部１２１から受け取った非音声データＲＴＰパケットＤ３に置き換えるときの音声データＲＴＰパケットのストリームである。このとき、非音声データＲＴＰパケット挿入部１１２は実施例１と同様に音声データＲＴＰパケットＰ３を破棄して、当該破棄した箇所に非音声データＲＴＰパケットＤ３を挿入することによって非音声データＲＴＰパケットＤ３に置き換える。 FIG. 2B illustrates a case where the non-voice data RTP packet insertion unit 112 replaces the voice data RTP packet P3 received from the LAN port 160 or the wireless LAN port 170 with the non-voice data RTP packet D3 received from the non-voice data RTP packet generation unit 121. Is a stream of voice data RTP packets. At this time, the non-voice data RTP packet insertion unit 112 discards the voice data RTP packet P3 in the same manner as in the first embodiment, and inserts the non-voice data RTP packet D3 into the discarded portion, thereby causing the non-voice data RTP packet D3 to be discarded. Replace with

図２Ｃは非音声データＲＴＰパケット挿入部１１２が音声データＲＴＰパケットＰ３を非音声データＲＴＰパケットＤ３に置き換えた後の音声データＲＴＰパケットのストリームである。実施例１と同様に非音声データＲＴＰパケット挿入部１１２は音声データＲＴＰパケットＰ１、Ｐ２、非音声データＲＴＰパケットＤ３、音声データＲＴＰパケットＰ４、Ｐ５をＲＴＰパケット送受信部１１１に順次、供給する。ＲＴＰパケット送受信部１１１はこれらのＲＴＰパケットを受け取った順に順次、ＷＡＮポート１４０経由で通信ネットワーク２００に送信する。 FIG. 2C is a stream of voice data RTP packets after the non-voice data RTP packet insertion unit 112 replaces the voice data RTP packet P3 with the non-voice data RTP packet D3. As in the first embodiment, the non-voice data RTP packet insertion unit 112 sequentially supplies the voice data RTP packets P1 and P2, the non-voice data RTP packet D3, and the voice data RTP packets P4 and P5 to the RTP packet transmission / reception unit 111. The RTP packet transmission / reception unit 111 sequentially transmits these RTP packets to the communication network 200 via the WAN port 140 in the order received.

上記したようにＩＰ電話端末４００や無線ＩＰ電話端末５００から受信した音声データＲＴＰパケットの内の少なくとも１つを非音声データＲＴＰパケットに置き換えて通信ネットワーク２００へ送信することにより、ＲＴＰパケットでの非音声データの送信が可能となる。 As described above, at least one of the voice data RTP packets received from the IP telephone terminal 400 or the wireless IP telephone terminal 500 is replaced with a non-voice data RTP packet and transmitted to the communication network 200. Audio data can be transmitted.

再び図３Ａ〜３Ｃを参照しつつ、通信ネットワーク２００から受信した音声データＲＴＰパケットのストリームをＩＰ電話端末４００または無線ＩＰ電話端末５００へ向けて送信する場合における非音声データＲＴＰパケット抽出部１１３でのＲＴＰパケットのストリームについて説明する。 3A to 3C again, the non-voice data RTP packet extraction unit 113 in the case of transmitting the voice data RTP packet stream received from the communication network 200 to the IP telephone terminal 400 or the wireless IP telephone terminal 500. The RTP packet stream will be described.

実施例１と同様に非音声データＲＴＰパケット抽出部１１３は、図３Ａに示される如き音声データＲＴＰパケットＰ１、Ｐ２、非音声データＲＴＰパケットＤ３、音声データＲＴＰパケットＰ４、Ｐ５をＲＴＰパケット送受信部１１１から順次、受け取る。 As in the first embodiment, the non-voice data RTP packet extraction unit 113 converts the voice data RTP packets P1 and P2, the non-voice data RTP packet D3, and the voice data RTP packets P4 and P5 as shown in FIG. Receive sequentially.

実施例１と同様に非音声データＲＴＰパケット抽出部１１３は、図３Ｂに示される如くＲＴＰパケット送受信部１１１から受け取った非音声データＲＴＰパケットＤ３を抽出し、これを非音声データＲＴＰパケット読取部１２２に供給する。 As in the first embodiment, the non-voice data RTP packet extraction unit 113 extracts the non-voice data RTP packet D3 received from the RTP packet transmission / reception unit 111 as shown in FIG. To supply.

非音声データＲＴＰパケット抽出部１１３は音声データＲＴＰパケットＰ１、Ｐ２、Ｐ４及びＰ５をＬＡＮポート１６０または無線ＬＡＮポート１７０に順次、供給する。ＬＡＮポート１６０は、非音声データＲＴＰパケット抽出部１１３から受け取った音声データＲＴＰパケットを順次、ＩＰ電話端末４００に送信する。また、無線ＬＡＮポート１７０は非音声データＲＴＰパケット抽出部１１３から受け取った音声データＲＴＰパケットを順次、無線ＩＰ電話端末５００に送信する。ＩＰ電話端末４００または無線ＩＰ電話端末５００は受信した音声データＲＴＰパケットＰ１、Ｐ２、Ｐ４及びＰ５を順次、音声データＳ１、Ｓ２、Ｓ４及びＳ５に変換する。 The non-voice data RTP packet extraction unit 113 sequentially supplies the voice data RTP packets P1, P2, P4, and P5 to the LAN port 160 or the wireless LAN port 170. The LAN port 160 sequentially transmits the voice data RTP packets received from the non-voice data RTP packet extraction unit 113 to the IP telephone terminal 400. Further, the wireless LAN port 170 sequentially transmits the voice data RTP packets received from the non-voice data RTP packet extraction unit 113 to the wireless IP telephone terminal 500. IP telephone terminal 400 or wireless IP telephone terminal 500 sequentially converts received voice data RTP packets P1, P2, P4 and P5 into voice data S1, S2, S4 and S5.

図３ＣはＩＰ電話端末４００または無線ＩＰ電話端末５００における音声データＳ１、Ｓ２、Ｃ３、Ｓ４及びＳ５に変換した後の音声データのストリームを表す図である。ＩＰ電話端末４００及び無線ＩＰ電話端末５００はＰＬＣなどの補間機能によって、抽出された非音声データＲＴＰパケットＤ３を穴埋めするための音声データＣ３を生成できる。ＩＰ電話端末４００及び無線ＩＰ電話端末５００は生成した音声データＣ３を含む音声データＳ１、Ｓ２、Ｃ３、Ｓ４及びＳ５を順次、音声出力する。 FIG. 3C is a diagram showing a stream of audio data after being converted into audio data S1, S2, C3, S4, and S5 in IP telephone terminal 400 or wireless IP telephone terminal 500. The IP telephone terminal 400 and the wireless IP telephone terminal 500 can generate voice data C3 for filling the extracted non-voice data RTP packet D3 by an interpolation function such as PLC. The IP telephone terminal 400 and the wireless IP telephone terminal 500 sequentially output voice data S1, S2, C3, S4 and S5 including the generated voice data C3.

上記したように通信ネットワーク２００から受信した音声データＲＴＰパケットの内の少なくとも１つを非音声データＲＴＰパケットに置き換えてＩＰ電話端末４００や無線ＩＰ電話端末５００へ送信することにより、ＲＴＰパケットでの非音声データの送信が可能となる。 As described above, at least one of the voice data RTP packets received from the communication network 200 is replaced with a non-voice data RTP packet and transmitted to the IP telephone terminal 400 or the wireless IP telephone terminal 500, so Audio data can be transmitted.

上記した如く本実施例によれば、音声データＲＴＰパケットのストリームを受信し、当該音声データＲＴＰパケットの内の少なくとも１つを非音声データＲＴＰパケットに置き換えて送信することにより、ＲＴＰパケットでの非音声データの送信が可能となる。非音声データＲＴＰパケットに端末識別情報、端末使用状況情報及びサービス対応状況情報などの非音声データを含めて送受信すれば、実施例１で述べたのと同様の効果を得ることができる。 As described above, according to the present embodiment, a stream of voice data RTP packets is received, and at least one of the voice data RTP packets is replaced with a non-voice data RTP packet and transmitted. Audio data can be transmitted. If non-voice data including non-voice data such as terminal identification information, terminal usage status information, and service response status information is transmitted and received in the non-voice data RTP packet, the same effect as described in the first embodiment can be obtained.

実施例１及び２ではアナログ電話端末３００、ＩＰ電話端末４００及び無線ＩＰ電話端末５００の各々を各１台としたが、本発明にはかかる電話端末数の制限は無い。また、通常、アナログ電話端末３００とＳＬＩＣ部とはアナログ通信回線で、ＩＰ電話端末とＬＡＮポート１６０とはＬＡＮ網で、無線ＩＰ電話端末５００と無線ＬＡＮポート１７０とは無線ＬＡＮ網で接続されるが、本発明にはかかる接続形態の制限は無い。 In the first and second embodiments, each of the analog telephone terminal 300, the IP telephone terminal 400, and the wireless IP telephone terminal 500 is one, but the present invention has no limitation on the number of telephone terminals. In general, the analog telephone terminal 300 and the SLIC unit are connected by an analog communication line, the IP telephone terminal and the LAN port 160 are connected by a LAN network, and the wireless IP telephone terminal 500 and the wireless LAN port 170 are connected by a wireless LAN network. However, there is no limitation on the connection form in the present invention.

実施例１及び２では、ＲＴＰ制御部１００、非音声データ読取生成部１２０及びＤＳＰ部１３０がそれぞれ独立した構成となっているが、例えば、音声データ及び非音声データＲＴＰパケットの判別をＤＳＰ部１３０にて実施することも可能である。 In the first and second embodiments, the RTP control unit 100, the non-voice data reading / generating unit 120, and the DSP unit 130 are configured independently of each other. For example, the DSP unit 130 determines whether the voice data and the non-voice data RTP packet are discriminated. It is also possible to implement in.

非音声データＲＴＰパケットのサイズは音声データＲＴＰパケットのサイズより大きくても小さくても良く、また、サイズ自体にも特に制限は無い。 The size of the non-voice data RTP packet may be larger or smaller than the size of the voice data RTP packet, and the size itself is not particularly limited.

図５は識別無音データ管理部１２３を含むＶｏＩＰ通信装置１００を表すブロック図である。ＶｏＩＰ通信装置１００が、非音声データＲＴＰパケット生成部１２１、非音声データＲＴＰパケット挿入部１１２及び非音声データＲＴＰパケット抽出部１１３を含まず、識別無音データ管理部１２３を含む点が実施例１と異なる。以下、実施例１と異なる部分を主として説明する。 FIG. 5 is a block diagram showing the VoIP communication apparatus 100 including the identification silence data management unit 123. The point that the VoIP communication apparatus 100 does not include the non-voice data RTP packet generation unit 121, the non-voice data RTP packet insertion unit 112, and the non-voice data RTP packet extraction unit 113 but includes the identification silence data management unit 123 is the same as in the first embodiment. Different. Hereinafter, parts different from the first embodiment will be mainly described.

識別無音データ管理部１２３は、ＲＴＰパケットのペイロードに含めるべき識別無音データを管理している。ＩＴＵ−Ｔの勧告により規定されているＧ．７１１符号化規格においては、データ０ｘ７Ｆ及び０ｘＦＦは無音を示すデータとして定められている。識別無音データ管理部１２３は、例えばデータ０ｘ７Ｆを論理値０、データ０ｘＦＦを論理値１に対応付けて管理する。この場合、識別無音データ管理部１２３は、例えば０ｘ７Ｆ、０ｘＦＦ、０ｘ７Ｆ、０ｘＦＦ、・・・に対応する識別無音データ０１０１・・・を記憶している。識別無音データ０１０１・・・は、例えば２０ｍｓフレームであれば１６０ｂｉｔ分からなる。当該識別無音データは、受信側のＶｏＩＰ通信装置１００において非音声データＲＴＰパケットを識別するために利用される。識別無音データ管理部１２３は、音声データＲＴＰパケット生成部１３１へ識別無音データを適宜、与える。また、識別無音データ管理部１２３は、端末識別情報、端末使用状況情報及びサービス対応状況情報などの端末サービス情報を記憶しており、端末サービス情報を識別無音データと共に音声データＲＴＰパケット生成部１３１へ与える。 The identification silence data management unit 123 manages identification silence data to be included in the payload of the RTP packet. The G.C. In the 711 coding standard, data 0x7F and 0xFF are defined as data indicating silence. The identification silence data management unit 123 manages data 0x7F in association with a logical value 0 and data 0xFF in association with a logical value 1, for example. In this case, the identification silence data management unit 123 stores identification silence data 0101... Corresponding to, for example, 0x7F, 0xFF, 0x7F, 0xFF,. The identification silence data 0101... Consists of 160 bits for a 20 ms frame, for example. The identification silence data is used to identify the non-voice data RTP packet in the VoIP communication apparatus 100 on the receiving side. The identification silence data management unit 123 appropriately provides the identification silence data to the voice data RTP packet generation unit 131. Further, the identification silence data management unit 123 stores terminal service information such as terminal identification information, terminal usage status information, and service response status information, and the terminal service information is transmitted to the voice data RTP packet generation unit 131 together with the identification silence data. give.

音声データＲＴＰパケット生成部１３１は、ＳＬＩＣ部１５０からの音声信号をＧ．７１１符号化規格に従って変換して音声データＲＴＰパケットを生成しつつ、識別無音データ管理部１２３から識別無音データを受け取った場合には、当該識別無音データを変換して得られた識別非音声データを含む非音声データＲＴＰパケットを生成する。図６は識別非音声データを含むＲＴＰパケットの例を表す図である。ＲＴＰヘッダ部分には図６に示されるように通常、ヘッダに含まれるべきデータが含まれていれば良い。ＲＴＰペイロードには、音声データＲＴＰパケット生成部１３１が識別無音データ管理部１２３からの識別無音データをＧ．７１１符号化規格において無音を表すデータ０ｘ７Ｆ及び０ｘＦＦに変換して得られた識別非音声データが含まれている。識別無音データ管理部１２３からの識別無音データが０１０・・・０１０であった場合、音声データＲＴＰパケット生成部１３１は、論理値０をデータ０ｘ７Ｆに、論理値１をデータ０ｘＦＦに、それぞれ変換して識別非音声データ０ｘ７Ｆ、０ｘＦＦ、０ｘ７Ｆ、・・・、０ｘ７Ｆ、０ｘＦＦ、０ｘ７Ｆを得て、これを図６に示される如くＲＴＰペイロードに含む非音声データＲＴＰパケットを生成する。このとき、音声データＲＴＰパケット生成部１３１は、識別無音データと共に識別無音データ管理部１２３から受け取った端末サービス情報を当該非音声データＲＴＰパケットに含める。音声データＲＴＰパケット生成部１３１は、当該生成により得られた非音声データＲＴＰパケットを含む音声データＲＴＰパケットストリームをＲＴＰパケット送受信部１１１へ与える。 The audio data RTP packet generation unit 131 receives the audio signal from the SLIC unit 150 as a G.D. When the identification silence data is received from the identification silence data management unit 123 while generating the voice data RTP packet by converting according to the 711 coding standard, the identification non-voice data obtained by converting the identification silence data is A non-voice data RTP packet is generated. FIG. 6 is a diagram illustrating an example of an RTP packet including identification non-voice data. As shown in FIG. 6, the RTP header portion usually only needs to include data to be included in the header. In the RTP payload, the voice data RTP packet generation unit 131 stores the identification silence data from the identification silence data management unit 123 in the G.G. The discriminating non-speech data obtained by converting data 0x7F and 0xFF representing silence in the 711 coding standard is included. When the identification silence data from the identification silence data management unit 123 is 010... 010, the voice data RTP packet generation unit 131 converts the logical value 0 into data 0x7F and the logical value 1 into data 0xFF. Thus, identification non-voice data 0x7F, 0xFF, 0x7F,..., 0x7F, 0xFF, 0x7F are obtained, and non-voice data RTP packets including this in the RTP payload are generated as shown in FIG. At this time, the voice data RTP packet generation unit 131 includes the terminal service information received from the identification silence data management unit 123 together with the identification silence data in the non-voice data RTP packet. The voice data RTP packet generation unit 131 provides the RTP packet transmission / reception unit 111 with the voice data RTP packet stream including the non-voice data RTP packet obtained by the generation.

ＲＴＰパケット送受信部１１１は、音声データＲＴＰパケット生成部１３１からの非音声データＲＴＰパケットを含む音声データＲＴＰパケットストリームをＷＡＮポート１４０経由で通信ネットワーク２００へ送信する。 The RTP packet transmission / reception unit 111 transmits the voice data RTP packet stream including the non-voice data RTP packet from the voice data RTP packet generation unit 131 to the communication network 200 via the WAN port 140.

音声信号変換部１３２は、ＲＴＰパケット送受信部１１１から非音声データＲＴＰパケットを含む音声データＲＴＰパケットのストリームを受け取り、当該音声データＲＴＰパケットの各々についてＲＴＰペイロードに識別非音声データが含まれているか否かを判別する。音声信号変換部１３２は、ＲＴＰペイロードに識別非音声データを含む音声データＲＴＰパケットを非音声データＲＴＰパケットと識別する。音声信号変換部１３２は、ＲＴＰペイロード部分に含まれているデータ０ｘ７Ｆ及び０ｘＦＦをそれぞれ識別する機能を有しており、データ０ｘ７Ｆを論理値０、データ０ｘＦＦを論理値１へそれぞれデコードする。 The audio signal conversion unit 132 receives a stream of audio data RTP packets including non-audio data RTP packets from the RTP packet transmission / reception unit 111, and whether or not identification non-audio data is included in the RTP payload for each of the audio data RTP packets. Is determined. The audio signal conversion unit 132 identifies the audio data RTP packet including the identified non-audio data in the RTP payload as the non-audio data RTP packet. The audio signal conversion unit 132 has a function of identifying the data 0x7F and 0xFF included in the RTP payload portion, and decodes the data 0x7F into a logical value 0 and the data 0xFF into a logical value 1, respectively.

音声信号変換部１３２は、送信側のＶｏＩＰ通信装置１００の識別無音データ管理部１２３において管理されている識別無音データと同一の識別無音データを予め記憶している。例えば、送信側のＶｏＩＰ通信装置１００の識別無音データ管理部１２３において管理されている識別無音データが０１０１・・・０１０１である場合、音声信号変換部１３２は、同じく識別無音データが０１０１・・・０１０１を予め記憶している。音声信号変換部１３２は、ＲＴＰパケット送受信部１１１からの音声データＲＴＰパケットの各々についてＲＴＰペイロードに含まれているデータをデコードし、当該デコードによって得られたデータ０１０１・・・０１０１が、自身が予め記憶している識別無音データ０１０１・・・０１０１と一致した場合に、当該音声データＲＴＰパケットを非音声データＲＴＰパケットと識別する。音声信号変換部１３２は、非音声データＲＴＰパケットを非音声データＲＴＰパケット読取部１２２へ与える。 The audio signal conversion unit 132 stores in advance the identification silence data that is the same as the identification silence data managed by the identification silence data management unit 123 of the transmission-side VoIP communication apparatus 100. For example, when the identification silence data managed by the identification silence data management unit 123 of the transmission-side VoIP communication apparatus 100 is 0101... 0101, the audio signal conversion unit 132 also has the identification silence data 0101. 0101 is stored in advance. The audio signal conversion unit 132 decodes the data included in the RTP payload for each of the audio data RTP packets from the RTP packet transmission / reception unit 111, and the data 0101... If it matches the stored identification silence data 0101... 0101, the voice data RTP packet is identified as a non-voice data RTP packet. The audio signal conversion unit 132 provides the non-audio data RTP packet to the non-audio data RTP packet reading unit 122.

非音声データＲＴＰパケット読取部１２２は非音声データＲＴＰパケットに含まれている端末識別情報、端末使用状況情報及びサービス対応状況情報などの端末サービス情報を読み取る。 The non-voice data RTP packet reading unit 122 reads terminal service information such as terminal identification information, terminal usage status information, and service response status information included in the non-voice data RTP packet.

図７は、音声データＲＴＰパケットストリームの送信側及び受信側のＶｏＩＰ通信装置１００の動作を表すシーケンス図である。以下、図７を参照しつつ、音声データＲＴＰパケットストリームの送信側及び受信側のＶｏＩＰ通信装置１００の動作について説明する。 FIG. 7 is a sequence diagram showing the operation of the VoIP communication apparatus 100 on the transmission side and reception side of the voice data RTP packet stream. Hereinafter, the operation of the VoIP communication apparatus 100 on the transmission side and reception side of the voice data RTP packet stream will be described with reference to FIG.

送信側のＶｏＩＰ通信装置１００は以下のように動作する。識別無音データ管理部１２３は、自身が管理している識別無音データ及び端末識別情報、端末使用状況情報及びサービス対応状況情報などの端末サービス情報を音声データＲＴＰパケット生成部１３１へ適宜、与える（ステップＳ１０１）。 The transmitting-side VoIP communication apparatus 100 operates as follows. The identification silence data management unit 123 appropriately provides terminal service information such as identification silence data and terminal identification information, terminal usage status information, and service response status information managed by the identification silence data management unit 123 to the voice data RTP packet generation unit 131 (step). S101).

音声データＲＴＰパケット生成部１３１は、ＳＬＩＣ部１５０からの音声信号をＧ．７１１符号化規格に従って変換して音声データＲＴＰパケットを生成しつつ、識別無音データ管理部１２３から識別無音データを受け取った場合には、当該識別無音データを変換して得られた識別非音声データを含む非音声データＲＴＰパケットを生成する（ステップＳ１０２）。当該非音声データＲＴＰパケットのＲＴＰペイロードには、例えば図6に示されるようにＧ．７１１規格において無音を表すデータ０ｘ７Ｆ及び０ｘＦＦのからなる識別非音声データが含まれる。このとき、音声データＲＴＰパケット生成部１３１は、識別無音データ管理部１２３からの端末サービス情報も併せて当該非音声データＲＴＰパケットに含める。 The audio data RTP packet generation unit 131 receives the audio signal from the SLIC unit 150 as a G.D. When the identification silence data is received from the identification silence data management unit 123 while generating the voice data RTP packet by converting according to the 711 coding standard, the identification non-voice data obtained by converting the identification silence data is A non-voice data RTP packet including the same is generated (step S102). In the RTP payload of the non-voice data RTP packet, for example, as shown in FIG. In the 711 standard, identification non-speech data composed of data 0x7F and 0xFF representing silence is included. At this time, the voice data RTP packet generation unit 131 also includes the terminal service information from the identification silence data management unit 123 in the non-voice data RTP packet.

音声データＲＴＰパケット生成部１３１は、当該生成により得られた非音声データＲＴＰパケットを含む音声データＲＴＰパケットストリームをＲＴＰパケット送受信部１１１へ与える。ＲＴＰパケット送受信部１１１は、音声データＲＴＰパケット生成部１３１からの非音声データＲＴＰパケットを含む音声データＲＴＰパケットストリームをＷＡＮポート１４０経由で通信ネットワーク２００へ送信する（ステップＳ１０３）。 The voice data RTP packet generation unit 131 provides the RTP packet transmission / reception unit 111 with the voice data RTP packet stream including the non-voice data RTP packet obtained by the generation. The RTP packet transmission / reception unit 111 transmits the voice data RTP packet stream including the non-voice data RTP packet from the voice data RTP packet generation unit 131 to the communication network 200 via the WAN port 140 (step S103).

受信側のＶｏＩＰ通信装置１００は以下のように動作する。音声信号変換部１３２は、ＲＴＰパケット送受信部１１１から非音声データＲＴＰパケットを含む音声データＲＴＰパケットのストリームを受け取り（ステップＳ１０４）、ＲＴＰペイロードに識別非音声データが含まれている音声データＲＴＰパケットを非音声データＲＴＰパケットと識別する。このとき、音声信号変換部１３２は、ＲＴＰペイロード部分に含まれているデータ０ｘ７Ｆを論理値０、データ０ｘＦＦを論理値１へそれぞれデコードして得られたデータ列と、自身が予め記憶している識別無音データと、が一致した場合に当該音声データＲＴＰパケットを非音声データＲＴＰパケットと識別する（ステップＳ１０５）。 The receiving-side VoIP communication apparatus 100 operates as follows. The audio signal conversion unit 132 receives a stream of audio data RTP packets including the non-audio data RTP packet from the RTP packet transmission / reception unit 111 (step S104), and receives the audio data RTP packet whose identification non-audio data is included in the RTP payload. It is identified as a non-voice data RTP packet. At this time, the audio signal conversion unit 132 stores in advance a data string obtained by decoding the data 0x7F included in the RTP payload portion into the logical value 0 and the data 0xFF into the logical value 1, respectively. If the identified silence data matches, the voice data RTP packet is identified as a non-voice data RTP packet (step S105).

音声信号変換部１３２は、非音声データＲＴＰパケットを非音声データＲＴＰパケット読取部１２２へ与える。非音声データＲＴＰパケット読取部１２２は非音声データＲＴＰパケットに含まれている端末識別情報、端末使用状況情報及びサービス対応状況情報などの端末サービス情報を読み取る（ステップＳ１０６）。 The audio signal conversion unit 132 provides the non-audio data RTP packet to the non-audio data RTP packet reading unit 122. The non-voice data RTP packet reading unit 122 reads terminal service information such as terminal identification information, terminal usage status information, and service response status information included in the non-voice data RTP packet (step S106).

上記したように本実施例による送信側のＶｏＩＰ通信装置１００は、Ｇ．７１１符号化規格において無音を表すデータ０ｘ７Ｆ及び０ｘＦＦのデータ列からなる識別非音声データをＲＴＰペイロードに含めて非音声データＲＴＰパケットを生成し、これを音声データＲＴＰパケットのストリームと共に送信する。当該非音声データＲＴＰパケットには端末識別情報、端末使用状況情報及びサービス対応状況情報などの端末サービス情報も含められて送信される。受信側のＶｏＩＰ通信装置１００は、ＲＴＰペイロードに識別非音声データを含む音声データＲＴＰパケットを非音声データＲＴＰパケットであると識別し、当該非音声データＲＴＰパケットに含まれている端末サービス情報を取得する。このように送信側のＶｏＩＰ通信装置１００は、Ｇ．７１１符号化規格に規定される無音データからなる識別非音声データをＲＴＰパケットに含めて非音声データＲＴＰパケットを生成する。受信側のＶｏＩＰ通信装置１００は、識別非音声データに基づいて非音声データＲＴＰパケットを識別し、当該非音声データＲＴＰパケットに含まれている端末識別情報、端末使用状況情報及びサービス対応状況情報などの情報を得ることができる。 As described above, the VoIP communication apparatus 100 on the transmission side according to this embodiment is a G. The non-voice data RTP packet is generated by including the identified non-voice data composed of the data strings of data 0x7F and 0xFF representing silence in the 711 coding standard in the RTP payload, and this is transmitted together with the stream of the voice data RTP packet. The non-voice data RTP packet is transmitted including terminal service information such as terminal identification information, terminal usage status information, and service response status information. The receiving-side VoIP communication apparatus 100 identifies the voice data RTP packet including the identified non-voice data in the RTP payload as the non-voice data RTP packet, and acquires the terminal service information included in the non-voice data RTP packet To do. Thus, the VoIP communication device 100 on the transmission side The non-voice data RTP packet is generated by including identification non-voice data consisting of silence data defined in the 711 coding standard in the RTP packet. The receiving-side VoIP communication apparatus 100 identifies a non-voice data RTP packet based on the identified non-voice data, and includes terminal identification information, terminal usage status information, service response status information, and the like included in the non-voice data RTP packet. Information can be obtained.

本実施例は０ｘ７Ｆを論理値０に、０ｘＦＦを論理値１に、それぞれ対応させた例であるが、０ｘ７Ｆ及び０ｘＦＦに対応させるべき値に制限は無い。また、本実施例においては図６に示されるような識別非音声データ０ｘ７Ｆ、０ｘＦＦ、０ｘ７Ｆ、・・・、０ｘ７Ｆ、０ｘＦＦ、０ｘ７Ｆとしたが、識別非音声データを構成するデータ０ｘ７Ｆ及び０ｘＦＦの並び順には特に制限は無く、送信側及び受信側の各々のＶｏＩＰ通信装置で共通の識別非音声データが設定されていれば良い。また、本実施例においては０ｘ７Ｆ及び０ｘＦＦのデータ列全体で１つの識別用の識別非音声データを表したが、識別非音声データを構成する個々のデータに意味を持たせても良い。例えば、識別非音声データの、先頭のデータは機種を表すデータ、2番目のデータは機能１の有無を表すデータ、3番目のデータは機能２を表すデータ、・・・、などの意味を持たせる。このとき受信側のＶｏＩＰ通信装置１００は、識別非音声データを構成する個々のデータを読み取ることにより、機種及び機能の有無についての情報を得ることができる。 In this embodiment, 0x7F is associated with a logical value 0, and 0xFF is associated with a logical value 1, but there is no limitation on values that should be associated with 0x7F and 0xFF. In this embodiment, the identification non-speech data 0x7F, 0xFF, 0x7F,..., 0x7F, 0xFF, 0x7F as shown in FIG. There is no particular limitation on the order, and it is only necessary that identification non-voice data common to the VoIP communication apparatuses on the transmission side and the reception side is set. In this embodiment, one identification non-voice data for identification is represented by the entire data string of 0x7F and 0xFF. However, each piece of data constituting the identification non-voice data may have a meaning. For example, the identification non-speech data has the meaning that the top data is data representing the model, the second data is data representing the presence or absence of function 1, the third data is data representing function 2, and so on. Make it. At this time, the VoIP communication apparatus 100 on the receiving side can obtain information on the model and presence / absence of the function by reading individual data constituting the identification non-voice data.

本実施例によるＶｏＩＰ通信装置１００は図１に示される構成である。以下、実施例１と異なる部分を主として説明する。 The VoIP communication apparatus 100 according to the present embodiment has the configuration shown in FIG. Hereinafter, parts different from the first embodiment will be mainly described.

非音声データＲＴＰパケット生成部１２１は、予め識別非音声データを記憶しており、当該識別非音声データを含む非音声データＲＴＰパケットを生成する。当該識別非音声データは、例えば図６のＲＴＰペイロードに示されるように０ｘ７Ｆ及び０ｘＦＦのデータ列からなる０ｘ７Ｆ、０ｘＦＦ、０ｘ７Ｆ、・・・、０ｘ７Ｆ、０ｘＦＦ、０ｘ７Ｆなどのデータ列であり、受信側のＶｏＩＰ通信装置１００の非音声データＲＴＰパケット抽出部１１３に設定されている識別非音声データと同一である。また、このとき、非音声データＲＴＰパケット生成部１２１は、当該非音声データＲＴＰパケットに端末識別情報、端末使用状況情報及びサービス対応状況情報などの端末サービス情報も含める。 The non-voice data RTP packet generation unit 121 stores identification non-voice data in advance and generates a non-voice data RTP packet including the identification non-voice data. The identification non-speech data is, for example, a data sequence such as 0x7F, 0xFF, 0x7F, ..., 0x7F, 0xFF, 0x7F composed of data sequences of 0x7F and 0xFF as shown in the RTP payload of FIG. This is the same as the identified non-voice data set in the non-voice data RTP packet extraction unit 113 of the VoIP communication apparatus 100 of the VoIP communication device 100. At this time, the non-voice data RTP packet generator 121 also includes terminal service information such as terminal identification information, terminal usage status information, and service response status information in the non-voice data RTP packet.

非音声データＲＴＰパケット挿入部１１２は、ＳＬＩＣ部１５０からの音声信号をＧ．７１１符号化規格に従って変換して音声データＲＴＰパケットを生成しつつ、識別無音データ管理部１２３から識別非音声データを含む非音声データＲＴＰパケットを受け取った場合には、当該音声データＲＴＰパケットの内の少なくとも１つを、当該非音声データＲＴＰパケットに置き換え、当該非音声データＲＴＰパケットを含む音声データＲＴＰパケットのストリームをＲＴＰパケット送受信部１１１に供給する。 The non-voice data RTP packet insertion unit 112 receives the voice signal from the SLIC unit 150 as a G.P. When the non-voice data RTP packet including the identified non-voice data is received from the identified silence data management unit 123 while generating the voice data RTP packet by converting according to the H.711 coding standard, At least one is replaced with the non-voice data RTP packet, and a stream of voice data RTP packets including the non-voice data RTP packet is supplied to the RTP packet transceiver 111.

非音声データＲＴＰパケット抽出部１１３は、ＲＴＰパケット送受信部１１１から受け取った音声データＲＴＰパケットのストリームに含まれる非音声データＲＴＰパケットをＲＴＰペイロードに含まれているデータに基づいて抽出する。非音声データＲＴＰパケット抽出部１１３は、送信側のＶｏＩＰ通信装置１００の非音声データＲＴＰパケット生成部１２１に設定されている識別非音声データと同一の識別非音声データを予め記憶しており、ＲＴＰペイロードに当該識別非音声データが含まれている音声データＲＴＰパケットを非音声データＲＴＰパケットであると識別してこれを抽出する。当該識別非音声データは例えば図６のＲＴＰペイロードに示されるように０ｘ７Ｆ及び０ｘＦＦのデータ列からなる０ｘ７Ｆ、０ｘＦＦ、０ｘ７Ｆ、・・・、０ｘ７Ｆ、０ｘＦＦ、０ｘ７Ｆなどのデータ列である。非音声データＲＴＰパケット抽出部１１３は、当該非音声データＲＴＰパケットを非音声データＲＴＰパケット読取部１２２に与える。また、非音声データＲＴＰパケット抽出部１１３は、当該抽出した非音声データＲＴＰパケット以外の音声データＲＴＰパケットのストリームを音声信号変換部１３２に供給するようにしても良い。 The non-voice data RTP packet extraction unit 113 extracts the non-voice data RTP packet included in the stream of voice data RTP packets received from the RTP packet transmission / reception unit 111 based on the data included in the RTP payload. The non-voice data RTP packet extraction unit 113 stores in advance the same identification non-voice data as the identification non-voice data set in the non-voice data RTP packet generation unit 121 of the transmission-side VoIP communication apparatus 100, and RTP The voice data RTP packet including the identified non-voice data in the payload is identified as a non-voice data RTP packet and extracted. The identification non-speech data is, for example, a data sequence such as 0x7F, 0xFF, 0x7F,..., 0x7F, 0xFF, 0x7F composed of data sequences of 0x7F and 0xFF as shown in the RTP payload of FIG. The non-voice data RTP packet extraction unit 113 provides the non-voice data RTP packet reading unit 122 with the non-voice data RTP packet. Further, the non-audio data RTP packet extraction unit 113 may supply the audio signal conversion unit 132 with a stream of audio data RTP packets other than the extracted non-audio data RTP packet.

図８は音声データＲＴＰパケットストリームの送信側及び受信側のＶｏＩＰ通信装置１００の動作を表すシーケンス図である。以下、図８を参照しつつ、音声データＲＴＰパケットストリームの送信側及び受信側のＶｏＩＰ通信装置１００の動作について説明する。 FIG. 8 is a sequence diagram showing the operation of the VoIP communication apparatus 100 on the transmission side and reception side of the voice data RTP packet stream. Hereinafter, the operation of the VoIP communication apparatus 100 on the transmission side and reception side of the voice data RTP packet stream will be described with reference to FIG.

送信側のＶｏＩＰ通信装置１００は以下のように動作する。非音声データＲＴＰパケット生成部１２１は、予め識別非音声データを記憶しており、当該識別非音声データを含む非音声データＲＴＰパケットを生成する（ステップＳ２０１）。なお、当該識別非音声データは、受信側のＶｏＩＰ通信装置１００の非音声データＲＴＰパケット抽出部１１３に設定されている識別非音声データと同一である。また、このとき、非音声データＲＴＰパケット生成部１２１は、当該非音声データＲＴＰパケットに端末識別情報、端末使用状況情報及びサービス対応状況情報などの端末サービス情報も含める。 The transmitting-side VoIP communication apparatus 100 operates as follows. The non-voice data RTP packet generation unit 121 stores identification non-voice data in advance, and generates a non-voice data RTP packet including the identification non-voice data (step S201). The identified non-voice data is the same as the identified non-voice data set in the non-voice data RTP packet extraction unit 113 of the receiving-side VoIP communication apparatus 100. At this time, the non-voice data RTP packet generator 121 also includes terminal service information such as terminal identification information, terminal usage status information, and service response status information in the non-voice data RTP packet.

非音声データＲＴＰパケット挿入部１１２は、ＳＬＩＣ部１５０からの音声信号をＧ．７１１符号化規格に従って変換して音声データＲＴＰパケットを生成しつつ、識別無音データ管理部１２３から識別非音声データを含む非音声データＲＴＰパケットを受け取った場合には、当該音声データＲＴＰパケットの内の少なくとも１つを破棄し、当該非音声データＲＴＰパケットを当該破棄の箇所に挿入する（ステップＳ２０２）。非音声データＲＴＰパケット挿入部１１２は、当該非音声データＲＴＰパケットを含む音声データＲＴＰパケットのストリームをＲＴＰパケット送受信部１１１に供給する。 The non-voice data RTP packet insertion unit 112 receives the voice signal from the SLIC unit 150 as a G.P. When the non-voice data RTP packet including the identified non-voice data is received from the identified silence data management unit 123 while generating the voice data RTP packet by converting according to the H.711 coding standard, At least one is discarded, and the non-voice data RTP packet is inserted at the discard location (step S202). The non-voice data RTP packet insertion unit 112 supplies a stream of voice data RTP packets including the non-voice data RTP packet to the RTP packet transmission / reception unit 111.

ＲＴＰパケット送受信部１１１は当該非音声データＲＴＰパケットを含む音声データＲＴＰパケットのストリームをＷＡＮポート１４０経由で通信ネットワーク２００へ送信する（ステップＳ２０３）。 The RTP packet transmitting / receiving unit 111 transmits a stream of the voice data RTP packet including the non-voice data RTP packet to the communication network 200 via the WAN port 140 (step S203).

送信側のＶｏＩＰ通信装置１００は以下のように動作する。非音声データＲＴＰパケット抽出部１１３は、ＲＴＰパケット送受信部１１１から音声データＲＴＰパケットのストリームを受け取り（ステップＳ２０４）、当該ストリームに含まれる非音声データＲＴＰパケットを、ＲＴＰペイロードに含まれているデータに基づいて抽出する（ステップＳ２０５）。非音声データＲＴＰパケット抽出部１１３は、自身に予め設定されている識別非音声データと同一の識別非音声データＲＴＰがペイロードに含まれている音声データＲＴＰパケットを非音声データＲＴＰパケットであると識別してこれを抽出する。非音声データＲＴＰパケット抽出部１１３は、当該非音声データＲＴＰパケットを非音声データＲＴＰパケット読取部１２２に与える。このとき、非音声データＲＴＰパケット抽出部１１３は、当該非音声データＲＴＰパケットを音声信号変換部１３２にも供給するようにしても良い。 The transmitting-side VoIP communication apparatus 100 operates as follows. The non-voice data RTP packet extraction unit 113 receives a stream of voice data RTP packets from the RTP packet transmission / reception unit 111 (step S204), and converts the non-voice data RTP packets included in the stream into data included in the RTP payload. Based on the extraction (step S205). The non-voice data RTP packet extraction unit 113 identifies a voice data RTP packet that includes the same identified non-voice data RTP as the identified non-voice data set in the payload as a non-voice data RTP packet. And extract this. The non-voice data RTP packet extraction unit 113 provides the non-voice data RTP packet reading unit 122 with the non-voice data RTP packet. At this time, the non-voice data RTP packet extraction unit 113 may supply the non-voice data RTP packet to the voice signal conversion unit 132 as well.

非音声データＲＴＰパケット読取部１２２は、非音声データＲＴＰパケット抽出部１１３から供給された非音声データＲＴＰパケットに含まれる端末識別情報、端末使用状況情報及びサービス対応状況情報などの端末サービス情報を読み取る（ステップＳ２０６）。ＶｏＩＰ通信装置１００は、非音声データＲＴＰパケット読取部１２２が読み取った端末識別情報に基づいて、送信側のＶｏＩＰ通信装置の認証処理などを行うことができる。 The non-voice data RTP packet reading unit 122 reads terminal service information such as terminal identification information, terminal usage status information, and service response status information included in the non-voice data RTP packet supplied from the non-voice data RTP packet extraction unit 113. (Step S206). The VoIP communication apparatus 100 can perform authentication processing of the transmission-side VoIP communication apparatus based on the terminal identification information read by the non-voice data RTP packet reading unit 122.

音声信号変換部１３２は、非音声データＲＴＰパケット抽出部１１３から供給された音声データＲＴＰパケットのストリームを音声信号に変換してＳＬＩＣ部１５０に供給する。音声信号変換部１３２は、ＰＬＣなどの補間機能を備えており、非音声データＲＴＰパケット抽出部１１３によって抽出された非音声データＲＴＰパケットを穴埋めするための音声データを生成し、当該抽出された箇所に挿入する。また、音声信号変換部１３２は、非音声データＲＴＰパケット抽出部１１３から非音声データＲＴＰパケットを受け取った場合には、当該非音声データＲＴＰパケットを変換して得られた無音声信号をＳＬＩＣ部１５０に供給する。このとき、アナログ電話端末３００は、ＳＬＩＣ部１５０からの無音声信号に基づいて無音の音声を再生する。なお、無音となる時間は一瞬であるため、通話への影響は無い。 The audio signal converter 132 converts the audio data RTP packet stream supplied from the non-audio data RTP packet extractor 113 into an audio signal and supplies the audio signal to the SLIC unit 150. The audio signal conversion unit 132 has an interpolation function such as a PLC, generates audio data for filling the non-audio data RTP packet extracted by the non-audio data RTP packet extraction unit 113, and the extracted location Insert into. In addition, when receiving the non-voice data RTP packet from the non-voice data RTP packet extraction unit 113, the voice signal conversion unit 132 converts the non-voice signal obtained by converting the non-voice data RTP packet into the SLIC unit 150. To supply. At this time, the analog telephone terminal 300 reproduces silent sound based on the silent signal from the SLIC unit 150. Note that there is no effect on the call because the silent period is momentary.

上記したように本実施例による送信側のＶｏＩＰ通信装置１００は、Ｇ．７１１符号化規格において無音を表すデータ０ｘ７Ｆ及び０ｘＦＦのデータ列からなる識別非音声データをＲＴＰペイロードに含めて非音声データＲＴＰパケットを生成する。ＶｏＩＰ通信装置１００は、アナログ電話端末３００からの音声信号をＧ．７１１符号化規格に従って変換して音声データＲＴＰパケットを生成しつつ、当該音声データＲＴＰパケットの内の少なくとも１つを破棄して当該非音声データＲＴＰパケットを当該破棄の箇所に挿入し、これを音声データＲＴＰパケットのストリームとして送信する。ＶｏＩＰ通信装置１００は、当該非音声データＲＴＰパケットに端末識別情報、端末使用状況情報及びサービス対応状況情報などの端末サービス情報も含めて送信する。受信側のＶｏＩＰ通信装置１００は、ＲＴＰペイロードに識別非音声データを含む音声データＲＴＰパケットを非音声データＲＴＰパケットであると識別し、当該非音声データＲＴＰパケットに含まれている端末サービス情報を取得する。このように送信側のＶｏＩＰ通信装置１００は、Ｇ．７１１符号化規格に規定される無音データからなる識別非音声データをＲＴＰパケットに含めて非音声データＲＴＰパケットを生成する。受信側のＶｏＩＰ通信装置１００は、識別非音声データに基づいて非音声データＲＴＰパケットを識別し、当該非音声データＲＴＰパケットに含まれている端末識別情報、端末使用状況情報及びサービス対応状況情報などの情報を得ることができる。 As described above, the VoIP communication apparatus 100 on the transmission side according to this embodiment is a G. In the 711 encoding standard, non-voice data RTP packets are generated by including identification non-voice data consisting of data strings of data 0x7F and 0xFF representing silence in the RTP payload. The VoIP communication apparatus 100 receives a voice signal from the analog telephone terminal 300 as a G.D. While converting to generate a voice data RTP packet according to the 711 coding standard, at least one of the voice data RTP packets is discarded, and the non-voice data RTP packet is inserted into the discard location. Transmit as a stream of data RTP packets. The VoIP communication apparatus 100 transmits the non-voice data RTP packet including terminal service information such as terminal identification information, terminal usage status information, and service response status information. The receiving-side VoIP communication apparatus 100 identifies the voice data RTP packet including the identified non-voice data in the RTP payload as the non-voice data RTP packet, and acquires the terminal service information included in the non-voice data RTP packet To do. Thus, the VoIP communication device 100 on the transmission side The non-voice data RTP packet is generated by including identification non-voice data consisting of silence data defined in the 711 coding standard in the RTP packet. The receiving-side VoIP communication apparatus 100 identifies a non-voice data RTP packet based on the identified non-voice data, and includes terminal identification information, terminal usage status information, service response status information, and the like included in the non-voice data RTP packet. Information can be obtained.

ＶｏＩＰ通信装置を通信ネットワークと共に表すブロック図である。It is a block diagram showing a VoIP communication apparatus with a communication network. 送信時における非音声データＲＴＰパケット挿入部でのＲＴＰパケットのストリームの一例を表す図である。It is a figure showing an example of the stream of the RTP packet in the non audio | voice data RTP packet insertion part at the time of transmission. 送信時における非音声データＲＴＰパケット挿入部でのＲＴＰパケットのストリームの一例を表す図である。It is a figure showing an example of the stream of the RTP packet in the non audio | voice data RTP packet insertion part at the time of transmission. 送信時における非音声データＲＴＰパケット挿入部でのＲＴＰパケットのストリームの一例を表す図である。It is a figure showing an example of the stream of the RTP packet in the non audio | voice data RTP packet insertion part at the time of transmission. 受信時における非音声データＲＴＰパケット抽出部でのＲＴＰパケットのストリームの一例を表す図である。It is a figure showing an example of the stream of the RTP packet in the non audio | voice data RTP packet extraction part at the time of reception. 受信時における非音声データＲＴＰパケット抽出部でのＲＴＰパケットのストリームの一例を表す図である。It is a figure showing an example of the stream of the RTP packet in the non audio | voice data RTP packet extraction part at the time of reception. 受信時における音声信号変換部、ＩＰ電話端末及び無線ＩＰ電話端末のいずれかでの音声データのストリームの一例を表す図である。It is a figure showing an example of the stream of the audio | voice data in either the audio | voice signal conversion part at the time of reception, an IP telephone terminal, and a radio | wireless IP telephone terminal. ＬＡＮポート及び無線ＬＡＮポートを含むＶｏＩＰ通信装置を表すブロック図である。It is a block diagram showing the VoIP communication apparatus containing a LAN port and a wireless LAN port. 実施例３におけるＶｏＩＰ通信装置を通信ネットワークと共に表すブロック図である。It is a block diagram showing the VoIP communication apparatus in Example 3 with a communication network. 識別非音声データを含むＲＴＰパケットの例を表す図である。It is a figure showing the example of the RTP packet containing identification non-voice data. 実施例３における音声データＲＴＰパケットストリームの送信側及び受信側のＶｏＩＰ通信装置の動作を表すシーケンス図である。FIG. 10 is a sequence diagram illustrating an operation of a VoIP communication device on a transmission side and a reception side of a voice data RTP packet stream in the third embodiment. 実施例４における音声データＲＴＰパケットストリームの送信側及び受信側のＶｏＩＰ通信装置の動作を表すシーケンス図である。FIG. 10 is a sequence diagram illustrating an operation of a VoIP communication device on a transmission side and a reception side of a voice data RTP packet stream in the fourth embodiment.

符号の説明Explanation of symbols

１００ＶｏＩＰ通信装置
１１０ＲＴＰ制御部
１１１ＲＴＰパケット送受信部
１１２非音声データＲＴＰパケット挿入部
１１３非音声データＲＴＰパケット抽出部
１２０非音声データ読取生成部
１２１非音声データＲＴＰパケット生成部
１２２非音声データＲＴＰパケット読取部
１２３識別無音データ管理部
１３０ＤＳＰ部
１３１音声データＲＴＰパケット生成部
１３２音声信号変換部
１４０ＷＡＮポート
１５０ＳＬＩＣ部
１６０ＬＡＮポート
１７０無線ＬＡＮポート
２００通信ネットワーク
３００アナログ電話端末
４００ＩＰ電話端末
５００無線ＩＰ電話端末 100 VoIP communication apparatus 110 RTP control unit 111 RTP packet transmitting / receiving unit 112 non-voice data RTP packet inserting unit 113 non-voice data RTP packet extracting unit 120 non-voice data reading / generating unit 121 non-voice data RTP packet generating unit 122 non-voice data RTP packet Reading unit 123 Identification silent data management unit 130 DSP unit 131 Audio data RTP packet generation unit 132 Audio signal conversion unit 140 WAN port 150 SLIC unit 160 LAN port 170 Wireless LAN port 200 Communication network 300 Analog telephone terminal 400 IP telephone terminal 500 Wireless IP Phone terminal

Claims

通信ネットワークを介して音声データＲＴＰパケットのストリームを送受信するＶｏＩＰ通信装置であって、
前記通信ネットワークへ送信すべき音声データＲＴＰパケットのストリームを受信若しくは生成して準備する送信音声データＲＴＰストリームパケット準備手段と、
当該準備した音声データＲＴＰパケットのデータサイズと異なるデータサイズの非音声データＲＴＰパケットを生成する非音声データＲＴＰパケット生成部と、
当該準備した音声データＲＴＰパケットの内の少なくとも１つを前記非音声データＲＴＰパケットに置き換える非音声データＲＴＰパケット挿入部と、
当該置き換えた非音声データＲＴＰパケットを含む音声データＲＴＰパケットのストリームを前記通信ネットワークに送信するＲＴＰストリームパケット送信手段と、を含むことを特徴とするＶｏＩＰ通信装置。 A VoIP communication device for transmitting and receiving a stream of voice data RTP packets via a communication network,
Transmission voice data RTP stream packet preparation means for receiving or generating and preparing a stream of voice data RTP packets to be transmitted to the communication network;
A non-voice data RTP packet generator that generates a non-voice data RTP packet having a data size different from the data size of the prepared voice data RTP packet;
A non-voice data RTP packet insertion unit that replaces at least one of the prepared voice data RTP packets with the non-voice data RTP packet;
VoIP communication apparatus comprising: RTP stream packet transmission means for transmitting a stream of voice data RTP packets including the replaced non-voice data RTP packet to the communication network.

通信ネットワークを介して音声データＲＴＰパケットのストリームを送受信するＶｏＩＰ通信装置であって、
前記通信ネットワークから受信した音声データＲＴＰパケットのストリームに含まれる非音声データＲＴＰパケットを当該非音声データＲＴＰパケットのデータサイズに基づいて抽出する非音声データＲＴＰパケット抽出部と、
当該抽出した非音声データＲＴＰパケットに含まれる情報を読み取る非音声データＲＴＰパケット読取部と、を含むことを特徴とするＶｏＩＰ通信装置。 A VoIP communication device for transmitting and receiving a stream of voice data RTP packets via a communication network,
A non-voice data RTP packet extraction unit that extracts a non-voice data RTP packet included in a stream of voice data RTP packets received from the communication network based on a data size of the non-voice data RTP packet;
A VoIP communication apparatus comprising: a non-voice data RTP packet reading unit that reads information included in the extracted non-voice data RTP packet.

前記非音声データＲＴＰパケットは、端末識別情報、端末使用状況情報及びサービス対応状況情報の内の少なくとも１つを含むことを特徴とする請求項１又は２に記載のＶｏＩＰ通信装置。 The VoIP communication apparatus according to claim 1, wherein the non-voice data RTP packet includes at least one of terminal identification information, terminal usage status information, and service support status information.

前記非音声データＲＴＰパケット生成部は、前記非音声データＲＴＰパケットの生成に代えて識別非音声データを含めて生成したＲＴＰパケットを前記非音声データＲＴＰパケットとすることを特徴とする請求項１に記載のＶｏＩＰ通信装置。 2. The non-voice data RTP packet generation unit uses the RTP packet generated including identification non-voice data instead of the generation of the non-voice data RTP packet as the non-voice data RTP packet. VoIP communication apparatus of description.

前記非音声データＲＴＰパケット抽出部は、前記非音声データＲＴＰパケットの抽出に代えて前記非音声データＲＴＰパケットを当該非音声データＲＴＰパケットに含まれている識別非音声データに基づいて抽出することを特徴とする請求項２に記載のＶｏＩＰ通信装置。 The non-voice data RTP packet extraction unit extracts the non-voice data RTP packet based on the identified non-voice data included in the non-voice data RTP packet instead of extracting the non-voice data RTP packet. The VoIP communication apparatus according to claim 2, wherein the VoIP communication apparatus is characterized.

前記識別非音声データは、Ｇ．７１１符号化規格における無音データからなることを特徴とする請求項４又は５に記載のＶｏＩＰ通信装置。 The identification non-speech data is G. The VoIP communication apparatus according to claim 4 or 5, wherein the VoIP communication apparatus comprises silence data in accordance with the H.711 coding standard.