CN109040644B - Video point calling and recording storage method and system - Google Patents

Video point calling and recording storage method and system Download PDF

Info

Publication number
CN109040644B
CN109040644B CN201810830615.5A CN201810830615A CN109040644B CN 109040644 B CN109040644 B CN 109040644B CN 201810830615 A CN201810830615 A CN 201810830615A CN 109040644 B CN109040644 B CN 109040644B
Authority
CN
China
Prior art keywords
video
terminal
server
audio
video data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810830615.5A
Other languages
Chinese (zh)
Other versions
CN109040644A (en
Inventor
李敬来
范向东
沈楷博
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu TD Tech Ltd
Original Assignee
Chengdu TD Tech Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chengdu TD Tech Ltd filed Critical Chengdu TD Tech Ltd
Priority to CN201810830615.5A priority Critical patent/CN109040644B/en
Publication of CN109040644A publication Critical patent/CN109040644A/en
Application granted granted Critical
Publication of CN109040644B publication Critical patent/CN109040644B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/27Server based end-user applications
    • H04N21/274Storing end-user multimedia data in response to end-user request, e.g. network recorder
    • H04N21/2743Video hosting of uploaded data from client
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/643Communication protocols
    • H04N21/6437Real-time Transport Protocol [RTP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/76Television signal recording

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Telephonic Communication Services (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The application discloses a method and a system for storing video point call videos, wherein the method comprises the following steps: in the process of video call, two terminals in conversation respectively send video data and audio data generated by the two terminals to a video server; after the video call process is finished, the video recording server performs audio mixing processing on all audio data generated in the video call process, synthesizes the obtained audio mixing audio with video data generated by each terminal in the video call process, and stores two video files obtained after synthesis. The invention can ensure the synchronization of the stored audio and video contents and improve the image quality.

Description

Video point calling and recording storage method and system
Technical Field
The invention relates to a mobile communication technology, in particular to a method and a system for storing video point call videos.
Background
In the process of terminal video call, in order to keep some evidence, the voice and video data of both parties' call need to be stored in the video recording server. Currently, for storing video call data, the commonly adopted method is as follows: synthesizing the video data of the real-time calls of the two parties into a video image, mixing the audio data of the two parties, and then storing the synthesized video file and the mixed audio synthesized MP4 file.
The method for storing the video call data has the following problems:
1. the high cost of CPU resources during video image synthesis can result in the overlarge utilization rate of a CPU of a video server, and the low consumption of CPU resources used for audio mixing, thus causing the problem that the generated audio and video files may have asynchronous audio and video contents.
2. When images are combined, both images need to be compressed, which leads to a reduction in the quality of the combined image.
Therefore, the storage method of the video call data has the problems of image quality reduction and unsynchronized audio and video contents.
Disclosure of Invention
In view of the above, the main objective of the present invention is to provide a method and a system for storing a video call record, which can ensure synchronization of stored audio and video contents and improve image quality.
In order to achieve the purpose, the technical scheme provided by the invention is as follows:
a method for storing video call video includes:
in the process of video call, two terminals in conversation respectively send video data and audio data generated by the two terminals to a video server;
after the video call process is finished, the video recording server performs audio mixing processing on all audio data generated in the video call process, synthesizes the obtained audio mixing audio with video data generated by each terminal in the video call process, and stores two video files obtained after synthesis.
Preferably, the sending, by the two terminals, the video data and the audio data generated by the two terminals to the video recording server respectively includes:
in the video call process, each terminal sends video data and audio data generated by the terminal to the other terminal through a transfer server;
the transit server sends all received video data to the video server through a video channel, and sends the received video data and audio data to the video server through an audio channel.
Preferably, the method further comprises:
when the video call process is initiated, the two terminals send respective terminal identification numbers to the transfer server and the video server;
the transit server and the video recording server respectively use preset integer values to carry out complementation on the terminal identification numbers of the two terminals to obtain a remainder corresponding to each terminal;
the sending all the received video data to the video recording server includes:
when the transfer server sends each video data real-time transport protocol RTP packet, the remainder corresponding to the terminal is carried in the RTP packet;
and the video recording server acquires the terminal to which the video data in the RTP packet belongs according to the remainder in the RTP packet and the corresponding relation between the remainder and the terminal.
Preferably, when the transit server sends each RTP packet of the video data, the synchronization source SSRC of the RTP packet is used to carry the remainder corresponding to the terminal to which the video data belongs.
A system for storing video call recordings, comprising:
the terminal is used for sending the video data and the audio data generated by the terminal to the video server in the video call process;
and the video server is used for carrying out audio mixing processing on all audio data generated in the video call process after the video call process is finished, synthesizing the obtained audio mixing audio with the video data generated by each terminal in the video call process respectively, and storing the two video files obtained after the synthesis.
Preferably, the system further comprises a transit server;
the terminal is further used for sending the video data and the audio data generated by the terminal to the opposite terminal through the transfer server in the video call process;
the transit server is used for sending all the received video data to the video recording server through a video channel and sending the received video data and audio data to the video recording server through an audio channel.
Preferably, the system further comprises:
the terminal is used for sending a terminal identification number of the terminal to the transfer server and the video server when the video call process is initiated;
the transit server is used for utilizing a preset integer value to carry out complementation on the terminal identification number of the terminal to obtain a remainder corresponding to each terminal; when each video data real-time transport protocol RTP packet is sent, the remainder corresponding to the terminal is carried in the RTP packet;
the video recording server is used for utilizing the integer value to carry out complementation on the terminal identification number of the terminal to obtain a remainder corresponding to each terminal; and acquiring the terminal to which the video data in the RTP packet belongs according to the remainder in the received RTP packet and the corresponding relation between the remainder and the terminal.
Preferably, the relay server is configured to utilize a synchronization source SSRC of each RTP packet to carry the remainder corresponding to the terminal to which the video data belongs when sending each RTP packet of the video data.
In summary, the method and system for storing video call records provided by the present invention generate respective video files for each terminal to store, thereby effectively avoiding the occurrence of the problems of asynchronous audio and video contents, reduced image quality, and the like caused by image synthesis of video data of both parties of a video call, and thus ensuring the synchronization of the stored audio and video contents and improving the image quality.
Drawings
FIG. 1 is a schematic flow chart of a method according to an embodiment of the present invention;
FIG. 2 is a system configuration diagram of an embodiment of the present invention
Fig. 3 is a schematic system structure according to another embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be described in further detail with reference to the accompanying drawings and specific embodiments.
The core idea of the invention is as follows: the video data of the two parties of the video call are respectively synthesized with the audio data of the audio mixing of the video call to obtain two video files, so that the two parties of the video call respectively generate and store the respective video files, and the problems that the audio content and the video content which are generated by synthesizing the video data are possibly asynchronous, the image quality is reduced and the like can be avoided.
Fig. 1 is a schematic flow chart of a method according to an embodiment of the present invention, and as shown in fig. 1, the method for storing a video call record implemented by the embodiment mainly includes:
step 101, in the process of video call, two terminals in conversation respectively send video data and audio data generated by the two terminals to a video server.
The step is used for sending the video and audio data generated by the two parties of the conversation terminal to the video recording server so that the video recording server can generate a corresponding video file based on the data for storage.
Furthermore, in order to reduce the data processing overhead of the video recording server in the video call process, a transit server may be introduced, and the transit server is responsible for the real-time transmission of the audio and video data of the two parties of the session and sends the audio and video data of the two parties to the video recording server. Therefore, the video server only needs to generate corresponding video files according to the received audio and video data, and does not need to be responsible for forwarding real-time data of two conversation parties in the video call process, so that the CPU processing overhead of the video server can be effectively reduced, and the quality of the generated video files is ensured.
Preferably, the sending, by the two terminals, the video data and the audio data generated by the two terminals to the video recording server respectively includes:
in the video call process, each terminal sends video data and audio data generated by the terminal to the other terminal through a transfer server;
the transit server sends all received video data to the video server through a video channel, and sends the received video data and audio data to the video server through an audio channel.
Further, in order to enable the video recording server to accurately identify the home terminal of the received video data (i.e. the terminal from which the data is generated), the video recording server can distinguish the video data of different terminals to generate corresponding video files. When the video call process is initiated, the two terminals performing the call can send respective terminal identification numbers to the transit server and the video server. Therefore, the transit server can carry the terminal identification numbers of the two terminals or the information corresponding to the terminal identification numbers in the video data packet sent to the video server, so that the video server can acquire the attributive terminal of the data according to the information of the video data packet. For example, in order to reduce signaling overhead, a remainder of the terminal identification number and a preset integer value may be carried in a video data packet as an identification of a terminal to which video data belongs. Based on the above purpose, the transit server may send the received video data to the video server by using the following method:
firstly, a transit server and a video server respectively use preset integer values to carry out complementation on terminal identification numbers of two terminals for conversation to obtain a remainder corresponding to each terminal;
then, when the transit server sends each video data real-time transport protocol (RTP) packet, the remainder corresponding to the terminal is carried in the RTP packet;
for the received video data RTP packet, the video server obtains the terminal to which the video data in the RTP packet belongs according to the remainder carried in the RTP packet and the corresponding relation between the remainder and the terminal.
Preferably, when the relay server sends each RTP packet of the video data, the Synchronization Source (SSRC) of the RTP packet may be used to carry the remainder corresponding to the terminal to which the video data belongs.
And 102, after the video call process is finished, the video recording server performs audio mixing processing on all audio data generated in the video call process, synthesizes the obtained audio mixing audio with video data generated by each terminal in the video call process, and stores two video files obtained after synthesis.
The difference between the present step and the existing scheme is that the video data of both parties of the session are not combined into a video image and then combined with the audio of the audio mixing, but the audio mixing audio generated by the audio data generated by the session is directly combined with the video data of both parties of the session to obtain the video files corresponding to the respective video data, and the video files of both parties are saved. Therefore, the problems of asynchronous audio and video contents and reduced image quality caused by image merging can be avoided.
For example, suppose that terminal a and terminal B perform a video call, two video files need to be generated, which are: FILE1 and FILE 2.
FILE1 consisted of VIDEO A + MIXAudio (A + B).
FILE2 consisted of VIDEO B + MIXAudio (A + B).
Wherein, FILE1 is an MP4 FILE composed of video data video of terminal a and mixed audio MIXAudio (a + B) of terminal a and terminal B;
FILE2 is an MP4 FILE consisting of VIDEO data VIDEO B of terminal B and mixed audio MIXAudio (a + B) of terminal a and terminal B.
Fig. 2 is a schematic structural diagram of a video call record saving system corresponding to the above method, and as shown in fig. 2, the system includes:
the terminal is used for sending the video data and the audio data generated by the terminal to the video server in the video call process;
and the video server is used for carrying out audio mixing processing on all audio data generated in the video call process after the video call process is finished, synthesizing the obtained audio mixing audio with the video data generated by each terminal in the video call process respectively, and storing the two video files obtained after the synthesis.
Preferably, in order to further improve the image quality, the relay server may be used to forward real-time data of both the session terminals, and send data generated by both the session terminals to the video server. In particular, a system architecture as shown in fig. 3 may be employed for this purpose. As shown in fig. 3, the system further includes a transit server.
Correspondingly, the terminal is further used for sending the video data and the audio data generated by the terminal to the opposite terminal through the transfer server in the video call process;
the transit server is used for sending all the received video data to the video recording server through a video channel and sending the received video data and audio data to the video recording server through an audio channel.
Preferably, the system further comprises:
the terminal is used for sending a terminal identification number of the terminal to the transfer server and the video server when the video call process is initiated;
the transit server is used for utilizing a preset integer value to carry out complementation on the terminal identification number of the terminal to obtain a remainder corresponding to each terminal; when each video data real-time transport protocol RTP packet is sent, the remainder corresponding to the terminal is carried in the RTP packet;
the video recording server is used for utilizing the integer value to carry out complementation on the terminal identification number of the terminal to obtain a remainder corresponding to each terminal; and acquiring the terminal to which the video data in the RTP packet belongs according to the remainder in the received RTP packet and the corresponding relation between the remainder and the terminal.
Preferably, the relay server is configured to utilize a synchronization source SSRC of each RTP packet to carry the remainder corresponding to the terminal to which the video data belongs when sending each RTP packet of the video data.
In summary, the above description is only a preferred embodiment of the present invention, and is not intended to limit the scope of the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (8)

1. A method for storing video point call videos is characterized by comprising the following steps:
in the process of video call, two terminals in conversation respectively send video data and audio data generated by the two terminals to a video server;
after the video call process is finished, the video recording server performs audio mixing processing on all audio data generated in the video call process, synthesizes the obtained audio mixing audio with video data generated by each terminal in the video call process, and stores two video files obtained after synthesis.
2. The method of claim 1, wherein the two terminals respectively transmitting the video data and the audio data generated by the two terminals to the video recording server comprises:
in the video call process, each terminal sends video data and audio data generated by the terminal to the other terminal through a transfer server;
and the transit server sends all the received video data to the video server through a video channel and sends all the received audio data to the video server through an audio channel.
3. The method of claim 2, further comprising:
when the video call process is initiated, the two terminals send respective terminal identification numbers to the transfer server and the video server;
the transit server and the video recording server respectively use preset integer values to carry out complementation on the terminal identification numbers of the two terminals to obtain a remainder corresponding to each terminal;
the sending all the received video data to the video recording server includes:
when the transfer server sends each video data real-time transport protocol RTP packet, the remainder corresponding to the terminal is carried in the RTP packet;
and the video recording server acquires the terminal to which the video data in the RTP packet belongs according to the remainder in the RTP packet and the corresponding relation between the remainder and the terminal.
4. The method of claim 3, wherein the relay server uses a synchronization source SSRC of each RTP packet to carry the remainder corresponding to the terminal to which the video data belongs when sending each RTP packet of the video data.
5. A system for storing video call records, comprising:
the terminal is used for sending the video data and the audio data generated by the terminal to the video server in the video call process;
and the video server is used for carrying out audio mixing processing on all audio data generated in the video call process after the video call process is finished, synthesizing the obtained audio mixing audio with the video data generated by each terminal in the video call process respectively, and storing the two video files obtained after the synthesis.
6. The system of claim 5, further comprising a transit server;
the terminal is further used for sending the video data and the audio data generated by the terminal to the opposite terminal through the transfer server in the video call process;
the transit server is used for sending all the received video data to the video recording server through a video channel and sending all the received audio data to the video recording server through an audio channel.
7. The system of claim 6, further comprising:
the terminal is used for sending a terminal identification number of the terminal to the transfer server and the video server when the video call process is initiated;
the transit server is used for utilizing a preset integer value to carry out complementation on the terminal identification number of the terminal to obtain a remainder corresponding to each terminal; when each video data real-time transport protocol RTP packet is sent, the remainder corresponding to the terminal is carried in the RTP packet;
the video recording server is used for utilizing the integer value to carry out complementation on the terminal identification number of the terminal to obtain a remainder corresponding to each terminal; and acquiring the terminal to which the video data in the RTP packet belongs according to the remainder in the received RTP packet and the corresponding relation between the remainder and the terminal.
8. The system of claim 7,
and the transit server is used for carrying the remainder corresponding to the terminal to which the video data belongs by using the synchronous source SSRC of each RTP packet when each RTP packet of the video data is sent.
CN201810830615.5A 2018-07-25 2018-07-25 Video point calling and recording storage method and system Active CN109040644B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810830615.5A CN109040644B (en) 2018-07-25 2018-07-25 Video point calling and recording storage method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810830615.5A CN109040644B (en) 2018-07-25 2018-07-25 Video point calling and recording storage method and system

Publications (2)

Publication Number Publication Date
CN109040644A CN109040644A (en) 2018-12-18
CN109040644B true CN109040644B (en) 2020-12-04

Family

ID=64645282

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810830615.5A Active CN109040644B (en) 2018-07-25 2018-07-25 Video point calling and recording storage method and system

Country Status (1)

Country Link
CN (1) CN109040644B (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4847698A (en) * 1987-07-16 1989-07-11 Actv, Inc. Interactive television system for providing full motion synched compatible audio/visual displays
KR20040093297A (en) * 2003-04-29 2004-11-05 엘지전자 주식회사 Picture call saving apparatus and method for mobile communication terminal
CN101951492A (en) * 2010-09-15 2011-01-19 中兴通讯股份有限公司 Method and device for recording videos in video call
CN102027743A (en) * 2008-05-16 2011-04-20 Nxp股份有限公司 Video telephony
CN102055949A (en) * 2009-11-02 2011-05-11 华为终端有限公司 Recording method, device and system of multimedia conference and rewinding method and device
CN102833524A (en) * 2011-12-13 2012-12-19 苏州科达科技有限公司 Control method for simultaneous picture recording on local audio and calling party audio
US9160551B2 (en) * 2012-03-23 2015-10-13 Cisco Technology, Inc. Analytic recording of conference sessions
CN105635659A (en) * 2014-11-04 2016-06-01 广东中星电子有限公司 Audio processing method and apparatus for video surveillance system
CN108024085A (en) * 2016-10-31 2018-05-11 联芯科技有限公司 The method for recording and equipment of audio and video

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9060094B2 (en) * 2007-09-30 2015-06-16 Optical Fusion, Inc. Individual adjustment of audio and video properties in network conferencing

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4847698A (en) * 1987-07-16 1989-07-11 Actv, Inc. Interactive television system for providing full motion synched compatible audio/visual displays
KR20040093297A (en) * 2003-04-29 2004-11-05 엘지전자 주식회사 Picture call saving apparatus and method for mobile communication terminal
CN102027743A (en) * 2008-05-16 2011-04-20 Nxp股份有限公司 Video telephony
CN102055949A (en) * 2009-11-02 2011-05-11 华为终端有限公司 Recording method, device and system of multimedia conference and rewinding method and device
CN101951492A (en) * 2010-09-15 2011-01-19 中兴通讯股份有限公司 Method and device for recording videos in video call
CN102833524A (en) * 2011-12-13 2012-12-19 苏州科达科技有限公司 Control method for simultaneous picture recording on local audio and calling party audio
US9160551B2 (en) * 2012-03-23 2015-10-13 Cisco Technology, Inc. Analytic recording of conference sessions
CN105635659A (en) * 2014-11-04 2016-06-01 广东中星电子有限公司 Audio processing method and apparatus for video surveillance system
CN108024085A (en) * 2016-10-31 2018-05-11 联芯科技有限公司 The method for recording and equipment of audio and video

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
基于324协议栈的Android视频电话方案的研究与实现;朱家栋;《中国优秀硕士学位论文全文数据库》;20150715;第 I136-243页 *
硬件视频会议的理想选择 辰联电子;温情;《http://www.audio160.com/news/2013/8/2013_1_22836.htm》;20130826;第1页 *

Also Published As

Publication number Publication date
CN109040644A (en) 2018-12-18

Similar Documents

Publication Publication Date Title
US10356239B1 (en) Transcribing audio communication sessions
US11482240B2 (en) Presentation of communications
US11710488B2 (en) Transcription of communications using multiple speech recognition systems
US20060085823A1 (en) Media communications method and apparatus
US8478895B2 (en) Content adaptation realizing method and content adaptation server
US10362173B2 (en) Web real-time communication from an audiovisual file
US9311920B2 (en) Voice processing method, apparatus, and system
CN101079934B (en) Method and system for utilizing session initialization protocol soft terminal to record the voice
CN110943977B (en) Multimedia service data transmission method, server, equipment and storage medium
US10313502B2 (en) Automatically delaying playback of a message
US6693510B1 (en) Image communication terminal, image communication system, and mobile station
CN104579710A (en) Method for conference member to issue voice information in fragmentation asynchronous conference system
US10979562B2 (en) Call recording system, call recording method, and call recording program
CN102754448B (en) Data processing equipment and data coding device
CN109040644B (en) Video point calling and recording storage method and system
CN101959143B (en) Method for selectively recording during group calling of dispatching desk in digital trunking system
CN109714295B (en) Voice encryption and decryption synchronous processing method and device
CN104580778A (en) Method for conference member to issue picture information in fragmentation asynchronous conference system
CN110798688A (en) High-definition video compression coding system based on real-time transmission
CN101478621B (en) Encoding and decoding negotiation method for ciphered voice call, communication system and device
CN106230915A (en) A kind of method and system realizing function machine intelligent communication
CN113612759B (en) High-performance high-concurrency intelligent broadcasting system based on SIP protocol and implementation method
EP2469851A1 (en) System and method for generating interactive voice and video response menu
US20190372824A1 (en) Changing a communication session
CN107786836B (en) Method and device for realizing audio and video session service

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant