CN104167210A - Lightweight class multi-side conference sound mixing method and device - Google Patents

Lightweight class multi-side conference sound mixing method and device Download PDF

Info

Publication number
CN104167210A
CN104167210A CN201410414450.5A CN201410414450A CN104167210A CN 104167210 A CN104167210 A CN 104167210A CN 201410414450 A CN201410414450 A CN 201410414450A CN 104167210 A CN104167210 A CN 104167210A
Authority
CN
China
Prior art keywords
voice
frame
speech
value
energy
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410414450.5A
Other languages
Chinese (zh)
Inventor
王田
蔡奕侨
钟必能
陈永红
田晖
张国亮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huaqiao University
Original Assignee
Huaqiao University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huaqiao University filed Critical Huaqiao University
Priority to CN201410414450.5A priority Critical patent/CN104167210A/en
Publication of CN104167210A publication Critical patent/CN104167210A/en
Pending legal-status Critical Current

Links

Landscapes

  • Telephonic Communication Services (AREA)

Abstract

Provided is a lightweight class multi-side conference sound mixing method and device. The method comprises the steps that (1) after a client side uses an AMR encoder for encoding voice, voice PCM data and data length are obtained, the encoded PCM data are subjected to framing processing, each frame voice energy value is computed, the fact that a frame is a voice frame or a non-voice frame is determined according to the frame voice energy value and the data length, and accordingly the probability values of the voice frames in the voice PCM data are obtained in a statistics mode; and (2) a server side selects current voice streams of two speakers with the highest voice probability values according to the received voice probability values, whether the superposition principle is used for carrying out sound mixing on the at most two selected voice streams is determined according to the two voice probability values, and finally a voice packet obtained after sound mixing is transferred. According to the method, the shortcoming that portable equipment such as a mobile phone is weak in computing capacity is ingeniously overcome, meanwhile, the computing amount of a server for sound mixing operation is greatly lowered, and the lightweight class multi-side conference sound mixing method and device can be widely used in a multimedia multi-side conference system.

Description

A kind of Multi-Party Conference sound mixing method and device of lightweight
Technical field
The present invention relates to Multi-Party Conference sound mixing method and the device of Multi-Party Conference technical field of communication, particularly a kind of lightweight.
Background technology
In multipart video-meeting system, audio mixing is an important technology.Audio mixing is that the audio frequency of multiple audio-source is mixed into a road audio frequency output according to audio frequency superposition principle, makes the recipient of audio frequency feel the effect that multi-person conference exchanges.
It is server end that audio mixing can be realized in media controller, and also can realize in terminal is client.
Directly realize at server end, be that client is passed through encoder encodes voice data PCM voice signal separately, then send to server end, server is first by the audio decoder of multiple audio-source, then be mixed into the road audio frequency output of encoding again according to audio frequency superposition principle, make the recipient of audio frequency feel the effect that multi-person conference exchanges.But because server end needs multipath decoding, finally encode again, therefore calculated amount and time complexity are all larger, cause time delay also larger simultaneously.This has also just limited the range of application of this scheme.
Directly realize audio mixing in terminal, be that client is passed through encoder encodes voice data PCM voice signal, send to server end, server end is the audio frequency of client by each terminal, send to all terminals except source, each terminal is synthesized all audio streams that receive.The calculating pressure of audio mixing is in each terminal, and this scheme can cause larger pressure to network.One calculated amount of carrying out terminal increases, and this,, for the weak mobile terminal of some computing powers, cannot bear the pressure that audio mixing calculates.The voice packet of two each terminals will be transmitted to the terminal except source, takies network bandwidth resources.
Also have some schemes, do not need Code And Decode, terminal is directly issued server end voice packet, and then server end carries out audio mixing.Because terminal is not encoded and just directly given out a contract for a project voice packet, seriously take the network bandwidth.
Summary of the invention
Fundamental purpose of the present invention is the practical application request for Multi-Party Conference, takes into account the personal characteristics of the portable skinny devices such as mobile phone simultaneously, proposes a kind of novelty and simple Multi-Party Conference sound mixing method and the device of real-time lightweight fast.
The present invention adopts following technical scheme:
A kind of Multi-Party Conference sound mixing method of lightweight, it is characterized in that: 1) customer end adopted AMR scrambler obtains voice PCM data and data length after voice are encoded, to a point frame processing for the voice PCM data acquisition after coding, calculate every frame speech energy value, and determine that in conjunction with this frame speech energy value and data length thereof this frame is speech frame or non-speech frame, thereby count the probable value of speech frame in voice PCM data; 2) server end is selected two spokesmans' that current speech probability value is the highest voice flow by the speech probability value receiving, and determine whether use superposition principle that maximum two-way voice flows of selecting are carried out to audio mixing, finally to forward the voice packet after audio mixing according to these two speech probability value sizes.
Preferably, preset: client grabs a frame voice signal at set intervals, every frame voice signal comprises m sampled value, and the energy of each sampled value is r i; Set statistical window and comprise continuous n frame voice signal, the energy relative reference value of present frame is E refer; Step 1) specifically comprise as follows:
1.1) the output length after client input voice PCM data and AMR coding, the energy value of calculating present frame voice PCM data
1.2) judge whether the present frame output length after AMR coding equals 31, if so, records the energy value of this frame, as speech energy reference value, judges this frame as speech frame and adds in statistical window, enters step 1.4); If not, record the energy value of this frame, as non-voice energy reference value, enter step 1.3);
1.3) judge whether present frame energy value is greater than its energy relative reference value E refer, if so, judge that this frame is as speech frame, if not, judge that this frame is as non-speech frame; Add in new statistical window, enter step 1.4)
1.4) judge that whether statistical window is full, the if so, accounting of speech frame in counting statistics window, is expressed as 0 to 100 speech probability value; If not, enter next frame, skip to step 1.1);
Preferably, the maximal value of the non-voice energy reference value of front n successive frame of setting present frame is E noise, and the maximal value of speech energy reference value is expressed as E voise, the energy relative reference value E of present frame refercalculate with following formula:
E refer=E noise+(E voice-E noise)/10。
Preferably, step 2) specific as follows:
2.1) server receives the speech probability value that client sends over, and selects two voice flow F1, F2 that speech probability value is the highest, and its speech probability value is respectively P1, P2, P1>P2;
2.2) judge whether P1>2P2 sets up, if so, only by P 1corresponding voice flow output; If not, these two voice flows are carried out exporting after audio mixing.
A Multi-Party Conference device sound mixing for lightweight, comprises client and server, it is characterized in that:
Client comprises: obtain the AMR scrambler of voice PCM data and data length for voice are encoded, for the speech energy calculation element of every frame speech energy value of the voice PCM data after calculation code, determine in conjunction with speech energy value and data length thereof the decision maker that this frame is speech frame or non-speech frame, and count the statistic device of the probable value of speech frame in the statistical window of voice PCM data;
Server comprises: for receiving speech probability value and selecting the reception selecting arrangement of two spokesmans' that current speech probability value is the highest voice flow, determine whether use superposition principle maximum two-way voice flows of selecting to be carried out to the device sound mixing of audio mixing according to these two speech probability value sizes, and forward the dispensing device of voice packet.
From the above-mentioned description of this invention, compared with prior art, the present invention has following beneficial effect:
1, adopt the method for probability analysis, client is analyzed voice flow, and the speech probability value that server end utilization receives is carried out decision-making, makes full use of the resource of server end and client, allow it jointly share calculating pressure, algorithm is simple, easily realize, extensibility is good;
2, the calculating pressure of server end and client is little, and the reaction time is fast.Aspect client, only need to carry out AMR coding, and calculate the energy value of each Frame, and judge that every frame data are voice, quiet or noise, aspect server end, speech probability value that only need to more each client, does not need to carry out audio mixing encoding operation most of time, at most only need to carry out audio mixing to 2 road voice.
3, applied range, can adapt to the application of the lightweight such as PDA, mobile phone equipment.
Brief description of the drawings
Fig. 1 is client workflow diagram of the present invention;
Fig. 2 is server workflow diagram of the present invention.
Embodiment
Below by embodiment, the invention will be further described.
For the practical application request of Multi-Party Conference, take into account the personal characteristics of the portable skinny devices such as mobile phone simultaneously, a kind of novelty is proposed and the simple Multi-Party Conference sound mixing method of real-time lightweight fast.The basic thought of this scheme is, according to the feature of conference speech, in most cases, one-man is in speech, and maximum two people make a speech simultaneously, and other are all audiences.Therefore, the present invention carries out audio mixing at the server end the highest two-way voice of probability of selecting at most to make a speech, and the voice after audio mixing are sent to client, thereby client does not need to do audio mixing, and the audio mixing calculated amount of server end is also little simultaneously.
A Multi-Party Conference sound mixing method for lightweight, presets: client grabs a frame voice signal every 20ms, and every frame voice signal comprises 160 sampled values, and the energy of each sampled value is r i; Set statistical window and comprise 20 continuous frame voice signals, the energy relative reference value of present frame is E refer.The maximal value of setting the non-voice energy reference value of front 20 successive frames of present frame is E noise, and the maximal value of speech energy reference value is expressed as E voise, the energy relative reference value E of present frame refercalculate with following formula:
E refer=E noise+(E voice-E noise)/10。
Wherein, if being session, present frame starts certain 1 frame in rear first 20 frames, for example the 2nd frame, and 1 frame energy value before using is as energy relative reference value, if the 3rd frame is brought in formula and calculated with regard to the respective value of 1,2 frames with above, by that analogy.
Comprise the steps:
1) customer end adopted AMR scrambler obtains voice PCM data and data length after voice are encoded, to a point frame processing for the voice PCM data acquisition after coding, calculate every frame speech energy value, and determine that in conjunction with this frame speech energy value and data length thereof this frame is speech frame or non-speech frame, thereby count the probable value of speech frame in voice PCM data.AMR scrambler to voice PCM data encoding after, obtain coding after data and data length, data length represents with nsize.According to the rule of AMR coding output, nsize only has three values, in the time that nsize is 1, is mute state; In the time that nsize is 6, it is noise state; In the time that nsize is 31, it is voice status.But this division methods is inaccurate, in the time that nsize is 31, is essentially voice status, but in the time that nsize is 6, is also likely but voice status.Therefore,, in the time that nsize is 6, need the energy value of the comprehensive PCM of analysis data.With reference to Fig. 1, flow process is as follows
1.1) the output length after client input voice PCM data and AMR coding, the energy value of calculating present frame voice PCM data
1.2) judge whether the present frame output length after AMR coding equals 31, if so, records the energy value of this frame, as speech energy reference value, judges this frame as speech frame and adds in statistical window, enters step 1.4); If not, record the energy value of this frame, as non-voice energy reference value, enter step 1.3);
1.3) judge whether this frame energy value is greater than energy relative reference value E refer, if so, judge that this frame is as speech frame, if not, judge that this frame is as non-speech frame; Add in statistical window, enter step 1.4)
1.4) judge that whether statistical window is full, the if so, accounting of speech frame in counting statistics window, is expressed as 0 to 100 speech probability value; If not, enter next frame, skip to step 1.1);
2) server end is selected two spokesmans' that current speech probability value is the highest voice flow by the speech probability value receiving, and determine whether use superposition principle that maximum two-way voice flows of selecting are carried out to audio mixing, finally to forward the voice packet after audio mixing according to these two speech probability value sizes.Concrete, with reference to Fig. 2, flow process is as follows:
2.1) server receives the speech probability value that client sends over, and selects two voice flow F1, F2 that speech probability value is the highest, and its speech probability value is respectively P1, P2, P1>P2;
2.2) judge whether P1>2P2 sets up, if so, only by P 1corresponding voice flow output; If not, these two voice flows are carried out exporting after audio mixing.
The present invention also proposes a kind of Multi-Party Conference device sound mixing of lightweight, comprises client and server.
Client comprises: obtain the AMR scrambler of voice PCM data and data length for voice are encoded, for the speech energy calculation element of every frame speech energy value of the voice PCM data after calculation code, determine in conjunction with speech energy value and data length thereof the decision maker that this frame is speech frame or non-speech frame, and count the statistic device of the probable value of speech frame in the statistical window of voice PCM data.
Server comprises: for receiving speech probability value and selecting the reception selecting arrangement of two spokesmans' that current speech probability value is the highest voice flow, determine whether use superposition principle maximum two-way voice flows of selecting to be carried out to the device sound mixing of audio mixing according to these two speech probability value sizes, and forward the dispensing device of voice packet.
This device is encoded in client, and in conjunction with two of speech energy value and AMR coded data sizes because usually distinguishing speech frame and non-speech frame, thereby count its speech probability value.Server end, is gone out current speaker (maximum two s') voice flow by the decision-making of speech probability value, and uses superposition principle that maximum two-way streams of selecting are carried out to audio mixing, finally forwards the voice packet after audio mixing.The method has made up the weak defect of the portable skinny device computing powers such as mobile phone dexterously, and the calculated amount that the while greatly reduces again server carries out audio mixing operation, can be widely used in multimedia multiparty conference system.
Above are only the specific embodiment of the present invention, but design concept of the present invention is not limited to this, allly utilizes this design to carry out the change of unsubstantiality to the present invention, all should belong to the behavior of invading protection domain of the present invention.

Claims (5)

1. the Multi-Party Conference sound mixing method of a lightweight, it is characterized in that: 1) customer end adopted AMR scrambler obtains voice PCM data and data length after voice are encoded, to a point frame processing for the voice PCM data acquisition after coding, calculate every frame speech energy value, and determine that in conjunction with this frame speech energy value and data length thereof this frame is speech frame or non-speech frame, thereby count the probable value of speech frame in voice PCM data; 2) server end is selected two spokesmans' that current speech probability value is the highest voice flow by the speech probability value receiving, and determine whether use superposition principle that maximum two-way voice flows of selecting are carried out to audio mixing, finally to forward the voice packet after audio mixing according to these two speech probability value sizes.
2. the Multi-Party Conference sound mixing method of a kind of lightweight as claimed in claim 1, is characterized in that: preset: client grabs a frame voice signal at set intervals, and every frame voice signal comprises m sampled value, and the energy of each sampled value is r i; Set statistical window and comprise continuous n frame voice signal, the energy relative reference value of present frame is E refer; Step 1) specifically comprise as follows:
1.1) the output length after client input voice PCM data and AMR coding, the energy value of calculating present frame voice PCM data
1.2) judge whether the present frame output length after AMR coding equals 31, if so, records the energy value of this frame, as speech energy reference value, judges this frame as speech frame and adds in statistical window, enters step 1.4); If not, record the energy value of this frame, as non-voice energy reference value, enter step 1.3);
1.3) judge whether present frame energy value is greater than its energy relative reference value E refer, if so, judge that this frame is as speech frame, if not, judge that this frame is as non-speech frame; Add in new statistical window, enter step 1.4)
1.4) judge that whether statistical window is full, the if so, accounting of speech frame in counting statistics window, is expressed as 0 to 100 speech probability value; If not, enter next frame, skip to step 1.1).
3. the Multi-Party Conference sound mixing method of a kind of lightweight as claimed in claim 2, is characterized in that: the maximal value of setting the non-voice energy reference value of front n successive frame of present frame is E noise, and the maximal value of speech energy reference value is expressed as E voise, the energy relative reference value E of present frame refercalculate with following formula:
E refer=E noise+(E voice-E noise)/10。
4. the Multi-Party Conference sound mixing method of a kind of lightweight as claimed in claim 1, is characterized in that: step 2) specific as follows:
2.1) server receives the speech probability value that client sends over, and selects two voice flow F1, F2 that speech probability value is the highest, and its speech probability value is respectively P1, P2, P1>P2;
2.2) judge whether P1>2P2 sets up, if so, only by P 1corresponding voice flow output; If not, these two voice flows are carried out exporting after audio mixing.
5. a Multi-Party Conference device sound mixing for lightweight, comprises client and server, it is characterized in that:
Client comprises: obtain the AMR scrambler of voice PCM data and data length for voice are encoded, for the speech energy calculation element of every frame speech energy value of the voice PCM data after calculation code, determine in conjunction with speech energy value and data length thereof the decision maker that this frame is speech frame or non-speech frame, and count the statistic device of the probable value of speech frame in the statistical window of voice PCM data;
Server comprises: for receiving speech probability value and selecting the reception selecting arrangement of two spokesmans' that current speech probability value is the highest voice flow, determine whether use superposition principle maximum two-way voice flows of selecting to be carried out to the device sound mixing of audio mixing according to these two speech probability value sizes, and forward the dispensing device of voice packet.
CN201410414450.5A 2014-08-21 2014-08-21 Lightweight class multi-side conference sound mixing method and device Pending CN104167210A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410414450.5A CN104167210A (en) 2014-08-21 2014-08-21 Lightweight class multi-side conference sound mixing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410414450.5A CN104167210A (en) 2014-08-21 2014-08-21 Lightweight class multi-side conference sound mixing method and device

Publications (1)

Publication Number Publication Date
CN104167210A true CN104167210A (en) 2014-11-26

Family

ID=51910991

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410414450.5A Pending CN104167210A (en) 2014-08-21 2014-08-21 Lightweight class multi-side conference sound mixing method and device

Country Status (1)

Country Link
CN (1) CN104167210A (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107277425A (en) * 2016-04-08 2017-10-20 中兴通讯股份有限公司 A kind of server, meeting-place terminal and cloud meeting processing method
CN108922524A (en) * 2018-06-06 2018-11-30 西安Tcl软件开发有限公司 Control method, system, device, Cloud Server and the medium of intelligent sound equipment
WO2020170946A1 (en) * 2019-02-19 2020-08-27 株式会社ソニー・インタラクティブエンタテインメント Voice output control device, voice output control system, voice output control method and program
CN111770413A (en) * 2020-06-30 2020-10-13 浙江大华技术股份有限公司 Multi-sound-source sound mixing method and device and storage medium
CN113257257A (en) * 2021-07-14 2021-08-13 统信软件技术有限公司 Method, device and equipment for processing mixed sound of multiple paths of voice signals and storage medium
CN114285830A (en) * 2021-12-21 2022-04-05 北京百度网讯科技有限公司 Voice signal processing method and device, electronic equipment and readable storage medium
CN116471263A (en) * 2023-05-12 2023-07-21 杭州全能数字科技有限公司 Real-time audio routing method for video system
US11869516B2 (en) 2019-11-27 2024-01-09 Tencent Technology (Shenzhen) Company Limited Voice processing method and apparatus, computer- readable storage medium, and computer device

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1701353A (en) * 2002-01-08 2005-11-23 迪里辛姆网络控股有限公司 A transcoding scheme between CELP-based speech codes
CN1859511A (en) * 2005-04-30 2006-11-08 华为技术有限公司 Telephone conference voice mixing method
CN101098362A (en) * 2007-07-26 2008-01-02 中兴通讯股份有限公司 System and method for implementing mixed color bell tone
CN101252452A (en) * 2007-03-31 2008-08-27 红杉树(杭州)信息技术有限公司 Distributed type tone mixing system in multimedia conference
CN101414462A (en) * 2007-10-15 2009-04-22 华为技术有限公司 Audio encoding method and multi-point audio signal mixing control method and corresponding equipment
CN101420374A (en) * 2007-10-23 2009-04-29 日本电气株式会社 Multiplex communication system and method
CN101510988A (en) * 2009-02-19 2009-08-19 深圳华为通信技术有限公司 Method and apparatus for processing and playing voice signal
CN102065265A (en) * 2009-11-13 2011-05-18 华为终端有限公司 Method, device and system for realizing sound mixing
CN102664019A (en) * 2012-04-27 2012-09-12 深圳市邦彦信息技术有限公司 DSP sound mixing method and device for full-interactive conference
CN103050124A (en) * 2011-10-13 2013-04-17 华为终端有限公司 Sound mixing method, device and system
CN103220258A (en) * 2012-01-20 2013-07-24 华为技术有限公司 Conference sound mixing method, terminal and media resource server (MRS)
CN103327014A (en) * 2013-06-06 2013-09-25 腾讯科技(深圳)有限公司 Voice processing method, device and system
CN103500580A (en) * 2013-09-23 2014-01-08 广东威创视讯科技股份有限公司 Audio mixing processing method and system

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1701353A (en) * 2002-01-08 2005-11-23 迪里辛姆网络控股有限公司 A transcoding scheme between CELP-based speech codes
CN1859511A (en) * 2005-04-30 2006-11-08 华为技术有限公司 Telephone conference voice mixing method
CN101252452A (en) * 2007-03-31 2008-08-27 红杉树(杭州)信息技术有限公司 Distributed type tone mixing system in multimedia conference
CN101098362A (en) * 2007-07-26 2008-01-02 中兴通讯股份有限公司 System and method for implementing mixed color bell tone
CN101414462A (en) * 2007-10-15 2009-04-22 华为技术有限公司 Audio encoding method and multi-point audio signal mixing control method and corresponding equipment
CN101420374A (en) * 2007-10-23 2009-04-29 日本电气株式会社 Multiplex communication system and method
CN101510988A (en) * 2009-02-19 2009-08-19 深圳华为通信技术有限公司 Method and apparatus for processing and playing voice signal
CN102065265A (en) * 2009-11-13 2011-05-18 华为终端有限公司 Method, device and system for realizing sound mixing
CN103050124A (en) * 2011-10-13 2013-04-17 华为终端有限公司 Sound mixing method, device and system
CN103220258A (en) * 2012-01-20 2013-07-24 华为技术有限公司 Conference sound mixing method, terminal and media resource server (MRS)
CN102664019A (en) * 2012-04-27 2012-09-12 深圳市邦彦信息技术有限公司 DSP sound mixing method and device for full-interactive conference
CN103327014A (en) * 2013-06-06 2013-09-25 腾讯科技(深圳)有限公司 Voice processing method, device and system
CN103500580A (en) * 2013-09-23 2014-01-08 广东威创视讯科技股份有限公司 Audio mixing processing method and system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
张历卓等: ""基于概率决策的自适应跨平台多方会议方案"", 《计算机应用》 *
蔡必强: ""视频会议中混音技术研究"", 《现代电子技术》 *

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107277425A (en) * 2016-04-08 2017-10-20 中兴通讯股份有限公司 A kind of server, meeting-place terminal and cloud meeting processing method
CN108922524A (en) * 2018-06-06 2018-11-30 西安Tcl软件开发有限公司 Control method, system, device, Cloud Server and the medium of intelligent sound equipment
JP7116240B2 (en) 2019-02-19 2022-08-09 株式会社ソニー・インタラクティブエンタテインメント Audio output control system, relay device, communication device, audio output control method and program
WO2020170946A1 (en) * 2019-02-19 2020-08-27 株式会社ソニー・インタラクティブエンタテインメント Voice output control device, voice output control system, voice output control method and program
US12033655B2 (en) 2019-02-19 2024-07-09 Sony Interactive Entertainment Inc. Sound output control apparatus, sound output control system, sound output control method, and program
JPWO2020170946A1 (en) * 2019-02-19 2021-11-18 株式会社ソニー・インタラクティブエンタテインメント Audio output control device, audio output control system, audio output control method and program
US11869516B2 (en) 2019-11-27 2024-01-09 Tencent Technology (Shenzhen) Company Limited Voice processing method and apparatus, computer- readable storage medium, and computer device
CN111770413A (en) * 2020-06-30 2020-10-13 浙江大华技术股份有限公司 Multi-sound-source sound mixing method and device and storage medium
CN111770413B (en) * 2020-06-30 2021-08-27 浙江大华技术股份有限公司 Multi-sound-source sound mixing method and device and storage medium
CN113257257A (en) * 2021-07-14 2021-08-13 统信软件技术有限公司 Method, device and equipment for processing mixed sound of multiple paths of voice signals and storage medium
CN114285830A (en) * 2021-12-21 2022-04-05 北京百度网讯科技有限公司 Voice signal processing method and device, electronic equipment and readable storage medium
CN114285830B (en) * 2021-12-21 2024-05-24 北京百度网讯科技有限公司 Voice signal processing method, device, electronic equipment and readable storage medium
CN116471263A (en) * 2023-05-12 2023-07-21 杭州全能数字科技有限公司 Real-time audio routing method for video system
CN116471263B (en) * 2023-05-12 2024-02-13 杭州全能数字科技有限公司 Real-time audio routing method for video system

Similar Documents

Publication Publication Date Title
CN104167210A (en) Lightweight class multi-side conference sound mixing method and device
US11227612B2 (en) Audio frame loss and recovery with redundant frames
KR101353847B1 (en) Method and apparatus for detecting and suppressing echo in packet networks
US9456273B2 (en) Audio mixing method, apparatus and system
CN105610635B (en) Voice coding sending method and device
CN105304079B (en) A kind of multi-mode phoneme synthesizing method of multi-party call and system and server
EP2786552B1 (en) Method to select active channels in audio mixing for multi-party teleconferencing
CN102226944A (en) Audio mixing method and equipment thereof
US9917945B2 (en) In-service monitoring of voice quality in teleconferencing
CN112334980A (en) Adaptive comfort noise parameter determination
CN110024029A (en) Audio Signal Processing
CN102355484B (en) A kind of method of audio data transmission
CA2689230C (en) Method of transmitting data in a communication system
CN101488870A (en) Method, system and equipment for implementing sound mixing
CN101478616A (en) Instant voice communication method
CN1845573A (en) Simultaneous interpretation video conference system and method for supporting high capacity mixed sound
US20120095760A1 (en) Apparatus, a method and a computer program for coding
CN102436818A (en) Routing and overdubbing method for server end based on priority of energy
Chinna Rao et al. Real-time implementation and testing of VoIP vocoders with asterisk PBX using wireshark packet analyzer
US20160019903A1 (en) Optimized mixing of audio streams encoded by sub-band encoding
CN101990082B (en) Method and device for implementing video telephone
CN107113357B (en) Improved method and apparatus relating to speech quality estimation
CN106937074A (en) A kind of video conferencing system
CN101488828A (en) Acknowledgment of media waveforms between telecommunications endpoints
Yang et al. Embedded Stereo Coding Algorithm for Stereo Voice Codec

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20141126

WD01 Invention patent application deemed withdrawn after publication