CN110266995A

CN110266995A - A kind of method, MCU and the storage medium of orderly control conference terminal speech

Info

Publication number: CN110266995A
Application number: CN201910507630.0A
Authority: CN
Inventors: 张海燕; 黄书平; 王福
Original assignee: ZTE Corp
Current assignee: ZTE Corp
Priority date: 2019-06-12
Filing date: 2019-06-12
Publication date: 2019-09-20
Also published as: WO2020248713A1

Abstract

The embodiment of the present invention provides method, MCU and the storage medium of a kind of sequence control conference terminal speech, the audio data of the conference terminal input by receiving access MCU；Determine whether corresponding conference terminal has speech demand according to audio data；When conference terminal has speech demand, the competence of speech of conference terminal is controlled；The method of orderly control conference terminal speech in the embodiment of the present invention, the audio data inputted according to conference terminal, it is controlled by competence of speech of the MCU to conference terminal, it can more rapidly more accurately get the speaking request of conference terminal in real time relative to scheme in the related technology, and cooperates without conference terminal, efficiently avoid the inter-communicating problem between conference terminal and MCU.

Description

A kind of method, MCU and the storage medium of orderly control conference terminal speech

Technical field

The present embodiments relate to but be not limited to video conference field, in particular to but be not limited to a kind of orderly control Method, MCU and the storage medium of conference terminal speech processed.

Background technique

With popularizing for video conferencing system, user's ability of conference terminal is irregular, in holding video conference Often multipoint service unit, that is, MCU controls the competence of speech of terminal, only specified speech terminal is allowed to make a speech, other Terminal then carries out closing sound processing, even if non-designated speech terminal is open microphone, the audio-frequency information that it sends will not Stereo process is carried out by MCU, the competence of speech of the terminal is opened again when needing it to make a speech, MCU just can be the terminal Audio data audio mixing is added, other with can terminal can just hear the sound of the terminal.This also ensures that video conference is held Order.

Video conference is broadly divided into chairman's mode and director's mode；Ordinary terminal needs to make a speech under chairman's mode, generally has Two kinds of approach are realized:

1. chairman's roll call, the competence of speech directly to open a terminal.

2. ordinary terminal active request floor, MCU is received and it is included in sites requesting floor after request, and chairman end is controlling Platform chooses the terminal to make a speech from sites requesting floor, that is, gives competence of speech.

Ordinary terminal needs to make a speech under director's mode, and also there are two types of approach to realize:

1. conference administrator finds the terminal in Conference control platform from conference terminal list, its hair is then directly opened Say permission.

2. ordinary terminal active request floor, MCU is received and it is included in sites requesting floor after request, and conference administrator exists Conference control platform chooses the terminal to make a speech from sites requesting floor, that is, gives competence of speech.

From the above mentioned, realize that the mode of controlling terminal speech can be classified as two classes under existing both of which, first is that point recipe Formula is initiated by chairman or conference administrator；Second is that request floor mode, is initiated by conference terminal.Both modes all have one Fixed deficiency, first first way need administrator or chairman that the terminal that needs to make a speech is found out from all participant terminals, Especially in the case where vast capacity meeting, this just needs to spend many time, may management in the case where simple speech Speech all has finished on when member finds the terminal.The second way then necessarily requires participant to have certain conference terminal Operational capacity, and the conference terminal must have this request floor function of docking with MCU, need terminal and the signaling of MCU Intercommunication, this there is different manufacturers different model terminal and MCU can not intercommunication risk.

Summary of the invention

Method, MCU and the storage medium of a kind of orderly control conference terminal speech provided in an embodiment of the present invention, it is main to solve Certainly the technical issues of is that administrator existing for roll-call terminal manner of speaking or chairman's operation are numerous in video conference in the related technology Miscellaneous, conference terminal existing for terminal request floor mode use cost and different terminals and MCU can not intercommunication the problem of.

In order to solve the above technical problems, the embodiment of the present invention provides a kind of method of orderly control conference terminal speech, packet It includes:

Receive the audio data of the conference terminal input of access MCU；

Determine whether corresponding conference terminal has speech demand according to the audio data；

When conference terminal has speech demand, the competence of speech of the conference terminal is controlled.

The embodiment of the present invention also provides a kind of MCU that conference terminal is orderly made a speech, comprising:

Audio Input Modules, the audio data of the conference terminal input for receiving access MCU；

Audio processing modules, for determining whether corresponding conference terminal has speech demand according to the audio data；

Competence of speech control module, for when conference terminal has speech demand, to the competence of speech of the conference terminal It is controlled.

The embodiment of the present invention also provides a kind of storage medium, and storage medium is stored with one or more program, and described one A or multiple programs can be executed by one or more processor, to realize orderly control conference terminal speech as described above Method in step

The beneficial effects of the present invention are:

The one kind provided according to embodiments of the present invention method, MCU and storage medium that orderly control conference terminal is made a speech, lead to Cross the audio data for receiving the conference terminal input of access MCU；Determine whether corresponding conference terminal has according to the audio data Speech demand；When conference terminal has speech demand, the competence of speech of the conference terminal is controlled；The embodiment of the present invention In orderly control conference terminal speech method, according to conference terminal input audio data, by MCU to the hair of conference terminal Speech permission is controlled, and can more rapidly more accurately get the hair of conference terminal in real time relative to scheme in the related technology Speech request, and cooperate without conference terminal, efficiently avoid the inter-communicating problem between conference terminal and MCU.

Other features of the invention and corresponding beneficial effect are described in the aft section of specification, and should be managed Solution, at least partly beneficial effect is apparent from from the record in description of the invention.

Detailed description of the invention

Fig. 1 is the method flow diagram that the orderly control conference terminal of the embodiment of the present invention one is made a speech；

Fig. 2 is the system schematic of the video conference terminal in the related technology of the embodiment of the present invention two；

Fig. 3 is the MCU schematic diagram that the orderly control conference terminal of the embodiment of the present invention two is made a speech；

Fig. 4 is a kind of system schematic of video conference terminal of the embodiment of the present invention two；

Fig. 5 is the system schematic of another video conference terminal of the embodiment of the present invention two；

Fig. 6 is a kind of flow chart of method that conference terminal speech is orderly controlled based on MCU of the embodiment of the present invention three；

Fig. 7 is the flow chart that the another kind of the embodiment of the present invention three orderly controls the method for conference terminal speech based on MCU.

Specific embodiment

In order to make the objectives, technical solutions, and advantages of the present invention clearer, below by specific embodiment knot Attached drawing is closed to be described in further detail the embodiment of the present invention.It should be appreciated that specific embodiment described herein is only used to It explains the present invention, is not intended to limit the present invention.

Embodiment one:

In the related art, realize that the mode of controlling terminal speech can be classified as two classes under existing both of which, first is that point Name mode, is initiated by chairman or conference administrator；Second is that request floor mode, is initiated by conference terminal.Both modes are all deposited In certain deficiency, first first way, need administrator or chairman that the terminal that needs to make a speech is looked for from all participant terminals Out, especially in the case where vast capacity meeting, this just needs to spend many time, can in the case where simple speech Speech all has finished on when energy administrator finds the terminal.The second way then necessarily requires participant to have certain meeting Terminal operation ability is discussed, and the conference terminal must have this request floor function of docking with MCU, need terminal and MCU Signalling interworking, this there is different manufacturers different model terminal and MCU can not intercommunication risk；In order to solve above-mentioned ask It inscribes, by the way that the audio data of input is detected or identified in the embodiment of the present invention, orderly controls conference terminal in the side MCU Right to speak is not necessarily to conference terminal intervention；As shown in FIG. 1, FIG. 1 is orderly control conference terminals provided in an embodiment of the present invention to make a speech Method flow chart, this orderly control conference terminal speech method include:

The audio data that S101, the conference terminal for receiving access MCU input.

In embodiments of the present invention, conference terminal accesses MCU, and then MCU receives the audio number that conference terminal inputs in real time According to the audio data can be any audio data acquired by the microphone of conference terminal；Wherein conference terminal includes hair It says conference terminal and closes sound conference terminal, speech conference terminal refers to the specific competence of speech of the conference terminal, so that MCU is to hair Say that the audio data of conference terminal carries out audio mixing, other conference terminals can hear the speech content of the conference terminal；Close sound meeting Terminal refers to that the conference terminal does not have competence of speech, and conference terminal closes sound, and it is defeated that MUC will not close sound conference terminal to this The audio data entered carries out stereo process, other conference terminals will not hear that this closes the speech content of sound conference terminal.

S102, determine whether corresponding conference terminal has speech demand according to audio data.

It is worth noting that, determining whether corresponding conference terminal has speech in real time according to audio data in the embodiment of the present invention Demand, wherein the difference of the conference terminal according to corresponding to audio data, whether conference terminal has the method for determination of speech demand Also different.

When receiving all audio datas for closing the input of sound conference terminal, whether audio data includes effective activity Sound, if so, effectively closing sound conference terminal corresponding to activity sound has speech demand.Wherein effectively movable sound refers to closing sound meeting The speech voice of terminal, for example, audio data 1 be conference terminal 1 input " for * * viewpoint, I thinks ... .. ", audio data 2 ambient sounds (such as mobile phone jingle bell sound) inputted for conference terminal 2, then audio data 1 is effectively movable sound；Preferably, this reality Apply audio number of the speech content in the speech voice for closing sound conference terminal that effective movable sound in example refers to about session topic According to when then receiving all audio datas for closing the input of sound conference terminal, whether audio data is effective speech voice, such as It is further to be identified to the audio data, determines the speech content of the audio data, when speech content and conference content phase Guan Shi determines that the audio data is effectively movable sound, such as it is that " for * * viewpoint, I recognizes that identification, which obtains audio data 1, For ... .. ", audio data 3 are " too shy, to borrow " that conference terminal 3 inputs, and compare audio data 1, determine audio Data 1 are effectively movable sound, it is ensured that conference terminal 1 is really to make a speech；It is understood that effectively activity sound may include one It is a, it also may include multiple.

When receiving the audio data of all speech conference terminal inputs, speech recognition is carried out to audio data, judges sound Whether frequency evidence includes the keyword for requiring to close the speech of sound conference terminal；If so, closing sound conference terminal corresponding to keyword has hair Speech demand.Such as receive speech conference terminal 1 input audio data 1, and identify in audio data 1 include " please number 7 Conference terminal speech " keyword, and the conference terminal of the number 7 is currently at and closes sound-like state, it is determined that the meeting of the number 7 View terminal has speech demand.

S103, when conference terminal has speech demand, the competence of speech of conference terminal is controlled.

In the present embodiment, when closing sound conference terminal has speech demand, MCU closes the competence of speech of sound conference terminal to this It is controlled；When determining that closing sound conference terminal has speech demand according to the effective movable sound for closing sound conference terminal, MCU closes this Sites requesting floor is added in sound conference terminal, further, when audio data includes an effectively movable sound, at this time only one It is a have a speech demand close sound conference terminal, then MCU opens the competence of speech for closing sound conference terminal, and closing sound conference terminal has hair Say permission, when audio data includes at least two effectively movable sound, corresponding at least two close sound conference terminal, in order to guarantee to send out Speech permission orderly controls, and will close sound conference terminal described in corresponding at least two effectively movable sounds at this time and request floor column are added Table, this application speech list is for reminding conference administrator or chairman which closes sound conference terminal request floor, and then meeting pipe Whether reason person or chairman give this and close sound conference terminal competence of speech according to this application list arbitration of making a speech, and wherein request floor arranges Table is stored in MCU, and conference administrator or chairman select to close sound conference terminal from the sites requesting floor of MCU, gives right to speak Limit；Sites requesting floor also can store in third party device in some embodiments, such as in console, conference administrator Or chairman chooses this to close sound view terminal speech in console from sites requesting floor, that is, gives competence of speech.I.e. in the present invention In embodiment, request floor is initiated without closing sound conference terminal, efficiently avoids the inter-communicating problem between terminal and MCU, MCU will intelligently close sound conference terminal according to audio data and sites requesting floor is added, and initiate request floor to close sound conference terminal Request.

It should be noted that then having speech demand in sites requesting floor since effective movable sound may include multiple Close sound conference terminal there are multiple, efficiently arbitrated from sites requesting floor for the ease of chairman or conference administrator, MCU The label for closing sound conference terminal can also be added to sites requesting floor, wherein the label for closing sound conference terminal can be The effect activity sound corresponding determining time, such as determine that the audio data for closing sound conference terminal 1 has effectively movable sound at the t1 moment, It determines that the audio data for closing sound conference terminal 2 has effectively movable sound at the t2 moment, then includes " closing sound in sites requesting floor T1 ", conference terminal 1 " closes sound conference terminal 2, t2 ", in some embodiments, the label for closing sound conference terminal can also be this Close the identity that sound closes user corresponding to sound conference terminal.

In embodiments of the present invention, closing sound conference terminal when the keyword determination according to speech conference terminal has speech demand When, MCU closes sound conference terminal according to sound conference terminal identity information determination from participant list of closing in keyword, wherein closing sound Conference terminal identity information can be the identity information for closing sound conference terminal itself, such as conference terminal number etc., can also be with It is the identity information for closing user corresponding to sound conference terminal, such as the title of user etc., MCU is stored with user's name at this time With the corresponding relationship for closing sound conference terminal；It is understood that when the keyword of speech conference terminal is more accurate, then it can be from It determines that at least one closes sound conference terminal in participant terminal, when determining that closing sound conference terminal includes one, closes sound conference terminal Have a say limit；That is MCU can open corresponding to keyword one and close the competence of speech of sound conference terminal, be not necessarily to conference management Member or chairman authorize arbitration.

When keyword is corresponding closes sound conference terminal including at least two, sound conference terminal is closed by least two, Shen is added Please make a speech list, in order to which conference administrator or chairman control the competence of speech for closing sound conference terminal；It is accurate when failing Sound conference terminal is closed to some, and determines multiple when closing sound conference terminal, such as speech recognition goes out the sound of conference terminal 1 of making a speech Frequency includes " asking conference terminal 6 according to the keyword 1 for including " conference terminal 3 is asked to be made a speech ", the audio data of speech conference terminal 2 The keyword 2 of speech ", determines that two are closed sound conference terminal (meeting according to the sound conference terminal identity information that closes in keyword Terminal 3 and conference terminal 6)；In another example the audio data that speech recognition goes out conference terminal 1 of making a speech includes " Zhang San is asked to make a speech ", and Include two " Zhang San " users in participant, determines multiple to close sound according to the sound conference terminal identity information that closes in keyword Conference terminal closes multiple in the increased sites requesting floor of sound conference terminal, transfers to chairman or conference administrator to carry out secondary It cuts out, it is ensured that the accuracy that competence of speech is given.

In some embodiments, when speech recognition goes out the sound of only one speech conference terminal in all speech conference terminals For frequency according to including the keyword for requiring to close the speech of sound conference terminal, including in the keyword successively includes requiring first, second The instructional information of sound conference terminal speech is closed, then MCU determines this two when closing sound conference terminal, and MCU can first open the One closes the competence of speech of sound conference terminal, after first closes the speech of sound conference terminal, then opens second and closes sound conference terminal Competence of speech.

In some embodiments, when effective movable sound in the audio data that basis closes sound conference terminal, first group is closed Sites requesting floor is added in sound conference terminal, while closing sound meeting including requirement according in the audio data of speech conference terminal The keyword of terminal speech closes sound conference terminal for second group and sites requesting floor is added, it is whole to close sound meeting when first group at this time When sound conference terminal difference is closed with second group in end, in sites requesting floor, second group is closed sound conference terminal relative to first group It is forward to close sound conference terminal position, is preferably second group convenient for chairman or conference administrator and closes sound conference terminal arbitrating floor Limit.

In some embodiments, when the audio data for receiving all speech conference terminal inputs, which can be with root It is adjusted flexibly according to actual demand, such as when conference terminal of making a speech exists simultaneously multiple speeches, whether audio data The keyword of sound is closed including conference terminal of requesting to make a speech, if so, MCU closes the competence of speech of speech conference terminal.

The embodiment of the invention provides a kind of methods of orderly control conference terminal speech, and receive access MCU closes sound meeting Discuss the audio data of terminal input；When detecting that audio data includes effective movable sound, this is closed into sound conference terminal, application is added Speech list, is decided whether to open the competence of speech for closing sound conference terminal by conference administrator or chairman；Or receive all hairs The audio data for saying conference terminal determines to include the key for requiring to close the speech of sound conference terminal in audio data by speech recognition Word, and accurately one is determined when closing sound conference terminal, directly open the competence of speech for closing sound conference terminal；When determine to Few two when closing sound conference terminal, this at least two is closed sound conference terminal sites requesting floor is added；Implemented using the present invention The method for the orderly control conference terminal speech that example provides, can be more rapidly more real relative to scheme in the related technology When get the speaking request of conference terminal, and cooperate without conference terminal, efficiently avoid between conference terminal and MCU Inter-communicating problem.

Embodiment two:

As shown in Fig. 2, system schematic of the Fig. 2 for video conference terminal in the related technology, including conference terminal, MCU, Chairman or conference administrator；MCU includes Audio Input Modules, receives the audio data of the conference terminal input of access MCU, speech Permission control module determines which conference terminal is to close sound, i.e., does not participate in stereo process；Determine which conference terminal has speech Audio mixing can be added in permission, pass through message informing mix module；Mix module receives the instruction of competence of speech control module, only allows The audio data for having the conference terminal of competence of speech participates in audio mixing, and exports the audio data after audio mixing.Audio output module Audio data after audio mixing is distributed to all conference terminals of membership.Speech is initiated when conference terminal to apply, is sent out by message Give MCU；Competence of speech control module receives the speech application of conference terminal, and sites requesting floor is added；By chairman or meeting Whether administrator's arbitration gives competence of speech, feeds back to competence of speech control module；Competence of speech control module allows to be handed down to Meeting-place terminal competence of speech, and then controlling mix module by message allows the audio data in the meeting-place to enter audio mixing.

As shown in figure 3, the embodiment of the present invention provides a kind of MCU of orderly control conference terminal speech；As shown in figure 3, should MCU is included at least:

Audio Input Modules 301, the audio data of the conference terminal input for receiving access MCU；

Audio processing modules 302, for determining whether corresponding conference terminal has speech demand according to audio data；

Competence of speech control module 303, for when conference terminal has speech demand, to the competence of speech of conference terminal into Row control.

In embodiments of the present invention, MCU sound intermediate frequency processing module 302 includes audio detection module 3021；Audio input mould Block 301, for all audio datas for closing the input of sound conference terminal；Audio detection module 3021 is for detecting audio data No includes effectively movable sound, when including effectively movable sound, notifies competence of speech control module 303；Competence of speech control module 303, sites requesting floor is added by sound conference terminal is closed corresponding to effective movable sound, in order to conference administrator or chairman couple The competence of speech for closing sound conference terminal is controlled.In some embodiments, when the audio data includes an effectively activity When sound, competence of speech control module 303, which controls the corresponding sound conference terminal that closes described in effective movable sound, to have a say limit；When When the audio data includes at least two effective movable sound, competence of speech control module 303 is effectively living by described at least two Sites requesting floor is added in the sound conference terminal that closes corresponding to dynamic sound, in order to which conference administrator or chairman close sound to described The competence of speech of conference terminal is controlled.As shown in figure 4, Fig. 4 is one kind of the video conference terminal in the embodiment of the present invention System schematic, MCU further include mix module 304, audio output module 305；By chairman or conference administrator's arbitration according to the Whether the sites requesting floor of tripartite's device gives competence of speech, feeds back to competence of speech control module 303；Competence of speech control Module 303, which allows to be handed down to, closes sound conference terminal competence of speech, and then allows the meeting-place by message control mix module 304 Audio data enters audio mixing, and last audio output module 305 exports the audio data after audio mixing.

In embodiments of the present invention, audio processing modules 302 include audio identification module 3022；Audio Input Modules 301, Audio data for all speech conference terminal inputs；Audio identification module 3022, for carrying out voice knowledge to audio data Not, whether audio data includes the keyword for requiring to close sound conference terminal, when including keyword, according in keyword It closes the determination from participant list of sound conference terminal identity information and closes sound conference terminal, notify competence of speech control module 303；Speech Permission control module 303, when the keyword correspondence closes sound conference terminal including one, the right to speak of sound conference terminal is closed in control Limit；When the keyword correspondence closes sound conference terminal including at least two, sound conference terminal will be closed, sites requesting floor is added, with The competence of speech for closing sound conference terminal is controlled convenient for conference administrator or chairman；As shown in figure 5, Fig. 5 is that the present invention is real Apply another system schematic of the video conference terminal in example.

The embodiment of the present invention provides a kind of MCU of orderly control conference terminal speech, relative to MCU in the related technology, Audio detection module is increased, what detection Audio Input Modules received closes sound conference terminal audio data, has detected speech By message informing competence of speech control module after the conference terminal of demand, competence of speech control module updates request floor column Table；Or relative to MCU in the related technology, increase audio identification module, the audio data of current speech conference terminal is carried out Identification, identifies whether active conference requires other to close voice terminal speech, and then recognition result issues competence of speech control module, And without conference terminal cooperate, it is intelligent for conference terminal request floor or be conference terminal open competence of speech, effectively Avoid the inter-communicating problem between conference terminal and MCU.

Embodiment three:

In order to make it easy to understand, the present embodiment is with a more specific example, to the side of orderly control conference terminal speech Method and MCU are illustrated, as shown in fig. 6, Fig. 6 is a kind of flow chart of method that conference terminal speech is orderly controlled based on MCU, Include:

S601, audio detection module receive all audio datas for closing voice terminal acquisition that Audio Input Modules are sent.

S602, audio detection module carry out effectively movable sound detection to the audio data that Audio Input Modules are sent, and determine It closes sound conference terminal and there is really speech.

In embodiments of the present invention, first determine whether audio data is the sound of meeting personnel, if so, continuing judgement should Whether the corresponding content of audio data related to session topic, for example, audio data be " for * * viewpoint, I thinks ... .. ", then The audio data is effectively movable sound, and then determines that closing sound conference terminal corresponding to effective movable sound has really hair Speech.

S603, when detecting effectively movable sound, close sound conference terminal request floor to the output of competence of speech control module Message.

S604, this is closed sound conference terminal by competence of speech control module increases to sites requesting floor.

In embodiments of the present invention, this application speech list is for reminding conference administrator or chairman which closes sound meeting end Request floor is held, and then conference administrator or chairman close sound conference terminal hair according to whether this application speech list arbitration gives this Say permission, wherein sites requesting floor also can store in third party device, such as in console, conference administrator or chairman It chooses the view terminal to make a speech from sites requesting floor in console, that is, gives competence of speech, feed back to competence of speech control mould Block, which competence of speech control module is determined as according to the information of feedback and closes sound conference terminal opening competence of speech, and notifies mixed Sound module receives the instruction of competence of speech control module, the audio data for closing sound conference terminal for only allowing to have competence of speech Audio mixing is participated in, and exports the audio data after audio mixing, all conference terminals is enabled to hear that this closes the speech of sound conference terminal.

As shown in fig. 7, Fig. 7 is the flow chart for the method that another kind orderly controls conference terminal speech based on MCU, including

S701, audio identification module receive the audio data for all speech terminals acquisition that Audio Input Modules are sent.

S702, audio identification module identify whether audio data includes requiring to close sound meeting to audio data The keyword of terminal, if so, turning S703, if not, turning S702.

In embodiments of the present invention, keyword includes the instructional information for requiring to close the speech of sound conference terminal, such as " asks certain Certain conference terminal speech ".

S703, audio identification module, which identify, closes sound conference terminal title, and according to the title in participant terminal list Search, if accurately be matched to 1 close sound conference terminal if enter S704, if title it is inaccurate be matched to multiple meeting-place into Enter S705.

Audio identification module closes sound conference terminal title " so-and-so conference terminal " according to keyword recognition, and in participant terminal It is searched in list, which includes all terminals for participating in meeting.

S704, it has accurately been matched to and has closed sound conference terminal title, then generating opening, this closes sound conference terminal competence of speech It requests to give competence of speech control module.

After competence of speech control module receives request, the competence of speech for closing sound conference terminal is directly opened, determination is closed Sound conference terminal, which has competence of speech, can be added audio mixing.

If S705, fail precisely to be matched to and close sound conference terminal, and have it is multiple in the case where, what is be matched to several closes sound meeting View terminal submits request floor to request to give competence of speech control module, and chairman or conference administrator is transferred to arbitrate, it is ensured that The accuracy that competence of speech is given.

What detection Audio Input Modules received in the embodiment of the present invention closes sound conference terminal audio data, has detected hair By message informing competence of speech control module after the conference terminal of speech demand, competence of speech control module updates request floor column Table transfers to chairman or conference administrator to arbitrate, it is ensured that the accuracy that competence of speech is given；Or audio identification module, it is right The audio data of current speech conference terminal is identified, identifies whether active conference requires other to close voice terminal speech, into And recognition result issues competence of speech control module, and cooperates without conference terminal, it is intelligent for conference terminal request floor Or open speech for conference terminal and cancel, efficiently avoid the inter-communicating problem between conference terminal and MCU.

Example IV

The embodiment of the invention also provides a kind of storage medium, which is included in by storing information (based on such as Calculation machine readable instruction, data structure, computer program module or other data) any method or technique in the volatibility implemented Or non-volatile, removable or non-removable medium.Storage medium includes but is not limited to RAM (Random Access Memory, random access memory), ROM (Read-Only Memory, read-only memory), EEPROM (Electrically Erasable Programmable read only memory, band Electrically Erasable Programmable Read-Only Memory), flash memory or other deposit Reservoir technology, CD-ROM (Compact Disc Read-Only Memory, compact disc read-only memory), digital versatile disc (DVD) or other optical disc storages, magnetic holder, tape, disk storage or other magnetic memory apparatus or can be used for storing desired Information and any other medium that can be accessed by a computer.

Storage medium in the embodiment of the present invention can be used for storing one or more computer program, one stored Or multiple computer programs can be executed by processor, to realize at least the one of the method for above-mentioned orderly control conference terminal speech A step.

As it can be seen that those skilled in the art should be understood that whole or certain steps in method disclosed hereinabove, be Functional module/unit in system, device may be implemented as the software (computer program code that can be can be performed with computing device To realize), firmware, hardware and its combination appropriate.In hardware embodiment, the functional module that refers in the above description/ Division between unit not necessarily corresponds to the division of physical assemblies；For example, a physical assemblies can have multiple functions, or One function of person or step can be executed by several physical assemblies cooperations.Certain physical assemblies or all physical assemblies can be by realities It applies as by processor, such as the software that central processing unit, digital signal processor or microprocessor execute, or is implemented as hard Part, or it is implemented as integrated circuit, such as specific integrated circuit.

In addition, known to a person of ordinary skill in the art be, communication media generally comprises computer-readable instruction, data knot Other data in the modulated data signal of structure, computer program module or such as carrier wave or other transmission mechanisms etc, and It and may include any information delivery media.So the present invention is not limited to any specific hardware and softwares to combine.

The above content is combining specific embodiment to be further described to made by the embodiment of the present invention, cannot recognize Fixed specific implementation of the invention is only limited to these instructions.For those of ordinary skill in the art to which the present invention belongs, Without departing from the inventive concept of the premise, a number of simple deductions or replacements can also be made, all shall be regarded as belonging to the present invention Protection scope.

Claims

1. a kind of method of orderly control conference terminal speech, comprising:

Receive the audio data of the conference terminal input of access MCU；

2. the method for orderly control conference terminal speech as described in claim 1, which is characterized in that the reception accesses MCU Conference terminal input audio data, comprising:

Receive all audio datas for closing the input of sound conference terminal；

It is described to determine whether corresponding conference terminal has speech demand according to the audio data, comprising:

Judge whether the audio data includes effective movable sound；

If so, closing sound conference terminal corresponding to effective movable sound has speech demand.

3. the method for orderly control conference terminal speech as claimed in claim 2, which is characterized in that described whole to the meeting The competence of speech at end is controlled, comprising:

When the audio data includes an effectively movable sound, the corresponding sound conference terminal that closes of the effective movable sound has speech Permission；

When the audio data includes at least two effectively movable sound, by institute corresponding to described at least two effectively movable sounds It states and closes sound conference terminal sites requesting floor is added, in order to which conference administrator or chairman are to the speech for closing sound conference terminal Permission is controlled.

4. the method for orderly control conference terminal speech as described in claim 1, which is characterized in that the reception accesses MCU Conference terminal input audio data, comprising:

Receive the audio data of all speech conference terminal inputs；

Speech recognition is carried out to the audio data, judges whether the audio data includes requiring to close what sound conference terminal was made a speech Keyword；

If so, closing sound conference terminal corresponding to the keyword has speech demand.

5. the method for orderly control conference terminal speech as claimed in claim 4, which is characterized in that described whole to the meeting The competence of speech at end carries out control

Sound conference terminal is closed according to sound conference terminal identity information determination from participant list of closing in the keyword；

When it is described to close sound conference terminal include one when, the sound conference terminal that closes has a say limit.

6. the method for orderly control conference terminal speech as claimed in claim 4, which is characterized in that described whole to the meeting The competence of speech at end is controlled, further includes:

When it is described close sound conference terminal include at least two when, by described at least two close sound conference terminal be added request floor column Table, in order to which conference administrator or chairman control the competence of speech for closing sound conference terminal.

7. a kind of MCU of orderly control conference terminal speech, comprising:

Competence of speech control module, for being carried out to the competence of speech of the conference terminal when conference terminal has speech demand Control.

8. the MCU of orderly control conference terminal speech as claimed in claim 7, which is characterized in that the audio processing modules Including audio detection module；

Audio Input Modules, for all audio datas for closing the input of sound conference terminal；

The audio detection module, for detecting whether the audio data includes effective movable sound, when including effectively movable sound, Notify the competence of speech control module；

Competence of speech control module controls effective movable sound institute when the audio data includes an effectively movable sound The corresponding sound conference terminal that closes is stated to have a say limit；It, will be described when the audio data includes at least two effectively movable sound Sites requesting floor is added in the sound conference terminal that closes corresponding at least two effectively movable sounds, in order to conference administrator or Chairman controls the competence of speech for closing sound conference terminal.

9. the MCU of orderly control conference terminal speech as claimed in claim 7, which is characterized in that the audio processing modules Including audio identification module；

Audio Input Modules, the audio data for all speech conference terminal inputs；

Audio identification module judges whether the audio data includes requiring for carrying out speech recognition to the audio data The keyword for closing sound conference terminal closes sound conference terminal identity information from participant according in keyword when including keyword Sound conference terminal is closed in determination in list, notifies the competence of speech control module；

Competence of speech control module closes sound described in control when closing sound conference terminal described in the keyword is corresponding includes one Conference terminal has a say limit；When closing sound conference terminal described in the keyword is corresponding includes at least two, sound is closed by described Sites requesting floor is added in conference terminal, in order to conference administrator or chairman to the competence of speech for closing sound conference terminal into Row control.

10. a kind of storage medium, the storage medium is stored with one or more program, and one or more of programs can It is executed by one or more processor, to realize such as orderly control conference terminal hair described in any one of claims 1 to 6 Step in the method for speech.