CN114531425B

CN114531425B - Processing method and processing device

Info

Publication number: CN114531425B
Application number: CN202111672799.5A
Authority: CN
Inventors: 刘扬; 刘金
Original assignee: Lenovo Beijing Ltd
Current assignee: Lenovo Beijing Ltd
Priority date: 2021-12-31
Filing date: 2021-12-31
Publication date: 2024-06-18
Anticipated expiration: 2041-12-31
Also published as: CN114531425A

Abstract

The embodiment of the application discloses a processing method and a processing device, wherein the method comprises the following steps: after the first electronic equipment and the second electronic equipment are in communication connection, first target information in the current space is obtained, wherein the first target information is from at least one third electronic equipment connected with the first electronic equipment; performing first processing on the first target information to obtain second target information; outputting the second target information, or giving the second target information to the second electronic equipment for outputting; wherein, the output effect of the second target information is better than the output effect of the first target information.

Description

Processing method and processing device

Technical Field

The present application relates to the field of communications technologies, and in particular, to a processing method and a processing apparatus.

Background

With the rapid development of internet technology, online audio/video conference has become a common conference form in daily work of people nowadays, and users can initiate online conference through an application program with online conference function and invite other users to participate in the online conference.

In some conference scenarios, when multiple employees participate in a conference in the same space, the employees typically carry personal notebooks to participate in the conference. However, when staff use their own notebook computers and the like to participate in a conference, there are many problems such as information interference between them, which results in poor conference effect.

Disclosure of Invention

The technical scheme of the application is realized as follows:

In a first aspect, an embodiment of the present application provides a processing method, including:

After the first electronic equipment and the second electronic equipment are in communication connection, first target information in the current space is obtained, wherein the first target information is from at least one third electronic equipment connected with the first electronic equipment;

Performing first processing on the first target information to obtain second target information;

outputting the second target information, or giving the second target information to the second electronic equipment for outputting;

Wherein, the output effect of the second target information is better than the output effect of the first target information.

In a second aspect, an embodiment of the present application provides a processing apparatus, including:

the system comprises an acquisition unit, a first electronic device and a second electronic device, wherein the acquisition unit is configured to acquire first target information in a current space after the first electronic device and the second electronic device are in communication connection, and the first target information is from at least one third electronic device connected with the first electronic device;

the processing unit is configured to perform first processing on the first target information to obtain second target information;

An output unit configured to output the second target information, or to give the second target information to the second electronic device for output; wherein, the output effect of the second target information is better than the output effect of the first target information.

In a third aspect, an embodiment of the present application provides an electronic device, including:

a memory for storing a computer program capable of running on the processor;

A processor for executing the processing method as described in the first aspect when running a computer program.

In a fourth aspect, an embodiment of the present application provides a computer storage medium storing a computer program which, when executed by at least one processor, implements a processing method according to the first aspect.

According to the scheme provided by the embodiment of the application, after the first electronic equipment and the second electronic equipment are in communication connection, first target information in the current space is obtained, and the first target information is from at least one third electronic equipment connected with the first electronic equipment; performing first processing on the first target information to obtain second target information; outputting the second target information, or giving the second target information to the second electronic equipment for outputting; wherein, the output effect of the second target information is better than the output effect of the first target information. In this way, based on the connection between the first electronic device and the third electronic device, the first target information is received, the first target information is processed into the second target information, the first electronic device or the second electronic device outputs the second target information, and the output effect of the second target information after the first processing is superior to that of the first target information, so that multiple scenes can be adapted, the problems of mutual interference between audio information or video and audio information, interference between users of multiple third electronic devices and the like during multiple devices are avoided, the effects of conferences, teaching, living broadcast and the like are improved, and the quality of conferences, teaching, living broadcast and the like is further improved.

Drawings

FIG. 1 is a schematic flow chart of a processing method according to an embodiment of the present application;

fig. 2 is a schematic spatial distribution diagram of a plurality of electronic devices according to an embodiment of the present application;

Fig. 3 is an application scenario schematic diagram of a processing method according to an embodiment of the present application;

FIG. 4 is a schematic diagram of a process for sending and receiving information according to an embodiment of the present application;

fig. 5 is a schematic diagram of spatial arrangement of an electronic device according to an embodiment of the present application;

fig. 6 is a schematic diagram of a determination manner of network delay according to an embodiment of the present application;

fig. 7 is a schematic flow chart of an echo cancellation process according to an embodiment of the present application;

fig. 8 is a schematic diagram of a composition structure of a processing apparatus according to an embodiment of the present application;

fig. 9 is a schematic diagram of a composition structure of an electronic device according to an embodiment of the present application;

fig. 10 is a schematic diagram of a composition structure of another electronic device according to an embodiment of the present application.

Detailed Description

The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application. It is to be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to be limiting. It should be noted that, for convenience of description, only a portion related to the related application is shown in the drawings.

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs. The terminology used herein is for the purpose of describing embodiments of the application only and is not intended to be limiting of the application.

In the following description, reference is made to "some embodiments" which describe a subset of all possible embodiments, but it is to be understood that "some embodiments" can be the same subset or different subsets of all possible embodiments and can be combined with one another without conflict.

It should be noted that the term "first\second\third" related to the embodiments of the present application is merely to distinguish similar objects, and does not represent a specific order for the objects, it being understood that the "first\second\third" may interchange a specific order or sequencing, where allowed, so that the embodiments of the present application described herein can be implemented in an order other than that illustrated or described herein.

At present, for staff of large enterprises and small and medium-sized enterprises, the following situations are frequently met when online conferences are carried out: (1) Small and medium enterprises have no special meeting room equipment, and staff can only carry personal equipment to participate in a meeting. (2) When staff of a large enterprise are in a meeting in a large meeting room, staff at a long distance can dial in the meeting by using a personal notebook computer. (3) When participating in the same online conference together with office workers, everyone dials into the conference using a notebook computer or the like.

In these cases, employees typically take personal notebook computers to attend meetings. However, when staff attends a meeting using their own notebook computer or the like, the following pain points are generally encountered: (1) When the surrounding colleagues participate in the same conference in the same space together, if the earphone is used for participating in the conference, the voice of the speaker enables the surrounding colleagues to hear the speaking simultaneously on line and off line, the experience is poor, and meanwhile, the surrounding colleagues cannot interact and discuss with the colleagues well; if people use the external device to participate in the conference, mutual interference and howling influence can occur; if only one notebook computer is used for the participation. However, because the pick-up distance and angle of the microphone are limited, the voice of a speaker at a slightly far position cannot be picked up well; in addition, the notebook has a small volume, and can not meet the listening requirements of all people. (2) In a large conference room, colleagues farther from the main sound pickup apparatus speak, and the remote persons often cannot hear clearly.

Although some solutions exist today, for example, there are mainly two types of products: (1) megaphone station: can only be used for picking up sound, and a speaker is required to turn on the microphone by himself. It has the following disadvantages: the microphone cannot be automatically turned on and off; cannot make stereo mixing; the flexibility is poor due to the wired fixed connection; there is no horn playback. (2) a wireless audio receiver: the wireless audio device can only change the wired audio into the wireless audio, receives audio for playing, and cannot realize voice transmission. However, existing solutions can only solve some of the pain points by manual adjustment by the participant's person, and still suffer from drawbacks, resulting in poor user experience and efficiency.

Based on the above, the embodiment of the application provides a processing method, which comprises the following steps: after the first electronic equipment and the second electronic equipment are in communication connection, first target information in the current space is obtained, wherein the first target information is from at least one third electronic equipment connected with the first electronic equipment; performing first processing on the first target information to obtain second target information; outputting the second target information, or giving the second target information to the second electronic equipment for outputting; wherein, the output effect of the second target information is better than the output effect of the first target information. Based on the connection between the first electronic equipment and the third electronic equipment, the first target information is received, the first target information is processed into the second target information, the first electronic equipment or the second electronic equipment outputs the second target information, and the output effect of the second target information after the first processing is superior to that of the first target information, so that multiple scenes can be adapted, the problems of mutual interference between audio information or video information, interference among users of multiple third electronic equipment and the like during multiple equipment are avoided, the conference effect is improved, and conference quality is further improved.

Of course, the technical scheme provided by the application can also be suitable for other application scenes with pickup and/or image acquisition, such as live broadcast scenes, teaching scenes and the like. The following is mainly described in terms of conference scenarios:

Embodiments of the present application will be described in detail below with reference to the accompanying drawings.

In an embodiment of the present application, referring to fig. 1, a schematic flow chart of a processing method provided by an embodiment of the present application is shown. As shown in fig. 1, the method may include:

S101, after communication connection is established between a first electronic device and a second electronic device, first target information in a current space is obtained, and the first target information is from at least one third electronic device connected with the first electronic device.

The processing method provided by the embodiment of the application relates to a plurality of electronic devices, such as a first electronic device, a second electronic device and a third electronic device. Here, a communication connection is established between the first electronic device and the second electronic device, and a communication connection is also established between the first electronic device and the at least one third electronic device. The communication connection manner between the electronic devices may include wired connection, wireless connection, and the like, which is not limited in particular in the embodiment of the present application.

It should be noted that, taking a large conference scenario in which multiple persons participate as an example, the third electronic device may be an electronic device equipped by a participant, for example, a notebook computer carried by the participant. In addition, the actual scene may be live broadcast, teaching, etc., which is not particularly limited in the embodiment of the present application. The implementation of the processing method will be described in detail below taking a conference scenario as an example.

In the embodiment of the application, the first target information may be audio or video information, that is, the first target information may be only audio information from at least one third electronic device, or may be audio information and video graphics information from at least one third electronic device.

Based on the communication connection between the first electronic device and the third electronic device, the first electronic device is able to obtain first target information from the at least one third electronic device, and the first target information is obtained from a current space in which the first electronic device and the third electronic device are located.

For the second electronic device, in the embodiment of the present application, at least two scenes are included, where the second electronic device is an electronic device that is in the same space as the first electronic device in the local scene; in a remote scenario, the second electronic device is a remote device that is not in the same space as the first electronic device.

For the manner of obtaining the first target information, in some embodiments, obtaining the first target information in the current space may include:

Determining the position relation between a third electronic device and a first electronic device in the current space, and taking audio information or video information acquired by at least one third electronic device with a target position relation with the first electronic device as first target information; or alternatively, the first and second heat exchangers may be,

Determining user information of third electronic equipment in a current space, and taking audio information or video and audio information acquired by the third electronic equipment used by a target user as first target information; or alternatively, the first and second heat exchangers may be,

Acquiring attribute parameters of audio information or video information acquired by third electronic equipment in a current space, and determining the audio information or video information with target attribute as the first target information; or alternatively, the first and second heat exchangers may be,

If the first electronic equipment is in first communication connection with the second electronic equipment, audio information or video information acquired by at least one third electronic equipment connected with the first electronic equipment is acquired through the fourth electronic equipment.

It should be noted that the first electronic device and the third electronic device are located in the same space, such as in the same conference room. Each third electronic device can collect audio information or video and audio information through the microphone, the camera and other devices. And screening based on certain conditions, wherein at least one third electronic device can be screened out from a plurality of third electronic devices, the screened at least one third electronic device is called target electronic device, and the audio information or the video and audio information collected by the screened target electronic device is used as first target information.

Here, the embodiment of the present application may determine a positional relationship between the third electronic device and the first electronic device in the current space, and then use at least one third electronic device, which has a positional relationship with the first electronic device as a target positional relationship, as the target electronic device. Here, the target positional relationship may be a preset positional relationship, for example, a third electronic device having a distance from the first electronic device that is a preset distance may be used as the target device, or the like, and may be specifically set in connection with actual requirements, which is not specifically limited herein.

In addition, the embodiment of the application can also select the third electronic device used by the target user as the target electronic device based on the user information of the third device in the current space. Here, the target user may include a conference host, a presenter, a leader or teacher, and the like. The user information may include identification number (Identity Document, ID) information bound or registered in the third electronic device, and screening the target user according to the ID information, for example, ID information of the third electronic device used by the target user is a specific ID, or the ID of the target user has higher authority, or the ID of the target user has a specific flag, etc.; the user information may also include face information collected by a camera of the third electronic device, and the third electronic device may be used as the target electronic device when the face information is matched with the preset face information, that is, the target user is the successful match. A user having a specific relationship with other equipment devices may also be determined as a target user, for example, a user carrying a remote control, a user wearing Virtual Reality (VR)/augmented Reality (Augmented Reality, AR)/mediated Reality (MEDIATED REALITY, MR) glasses, and the like. The embodiment of the present application may also determine the third electronic device used by the target user in other feasible manners, for example, the third electronic device having a specific positional relationship with the target user, which may specifically be combined with the actual scenario and requirement, and is not limited herein specifically.

The embodiment of the application can also obtain the attribute parameters of the audio information or the video information acquired by the third electronic equipment in the current space, analyze the obtained attribute parameters and determine the audio information or the video information with the target attribute as the first target information. The attribute parameters may include a transmission time period of the audio information or the video information transmitted from the third electronic device to the first electronic device, a signal quality of the audio information or the video information received by the first electronic device, a signal energy value, a sound attribute parameter related to audio (such as a decibel value, a sampling frequency, a sampling bit number, a channel number, a frame number, a bit rate, etc.), and an attribute parameter related to an image (such as a resolution, a definition, a video frame rate, etc.). Determining audio information or video information with attribute parameters conforming to target attributes as first target information, for example, regarding a third electronic device with the strongest signal energy value as target electronic device for the same collected audio information, and taking the audio information or video information collected by the target electronic device as the first target information; or the collected video and audio information comprises video and audio information of a target user as first target information; or the audio information or the video and audio information with the shortest transmission time or the best data quality is used as the first target information. I.e. the target properties may be determined in connection with the actual scenario and requirements, which are not particularly limited.

If the first electronic device and the second electronic device are in first communication connection, the embodiment of the application can also acquire the first target information through the fourth electronic device. The first communication connection indicates that the first electronic device and the second electronic device are connected in a remote mode, that is, the second electronic device is a remote device which is not in the same space with the first electronic device. In this case, the first electronic device is used as a main device in the current space, a first communication connection (i.e. remote connection) is established between the first electronic device and the second device, the fourth electronic device is used as a transit device in the current space, the first electronic device and the fourth electronic device can be connected in a wired manner, the fourth electronic device and the third electronic device can form a local network through a wireless connection manner, the first electronic device can obtain audio information or audio-visual information acquired from at least one third electronic device through the fourth electronic device, that is, the fourth electronic device can first obtain the audio information or the audio-visual information from at least one third electronic device, and determine the first target information.

Further, in some embodiments, taking audio information or video information collected by at least one third electronic device having a target positional relationship with the first electronic device as the first target information may include:

forming a microphone array and/or a camera array corresponding to the microphone and/or the camera on the at least one third electronic device and the microphone and/or the camera on the first electronic device;

determining a position of the third electronic device using the microphone array and/or the camera array;

Taking audio information or video information acquired by at least one third electronic device with a first position relation with the electronic device as the first target information; or alternatively, the first and second heat exchangers may be,

And determining the transmission time delay between the third electronic equipment and the first electronic equipment based on the position, and taking the audio information or the video and audio information acquired by at least one third electronic equipment with a second position relation with the electronic equipment as first target information based on the transmission time delay.

It should be further noted that, each of the first electronic device and the third electronic device may include a microphone and a camera, and the microphone array and/or the camera array may be formed based on the microphone and/or the camera in the first electronic device and the microphones and/or the cameras in the plurality of third electronic devices.

Exemplary, referring to fig. 2, a schematic diagram of spatial distribution of a plurality of electronic devices according to an embodiment of the present application is shown. As shown in fig. 2, a first electronic device and a plurality of third electronic devices form a spatial array, including a microphone array and/or a camera array.

At this time, the position of the third electronic device may be determined using the microphone array and/or the camera array. For example: and taking the spatial position of the first electronic device as the reference coordinate of the microphone array and/or the camera array, and determining the position of each third electronic device in the microphone array and/or the camera array relative to the first electronic device based on the communication connection between the first electronic device and each third electronic device or the positioning information of the first electronic device and the third electronic device. Judging whether the position relation between the third electronic equipment and the first electronic equipment accords with the first position relation, if so, the third electronic equipment is the target electronic equipment, and the audio information or the video and audio information acquired by the third electronic equipment is used as first target information. Here, the first positional relationship may be that a distance between the first electronic device and the third electronic device coincides with a preset distance, or that a position of the third electronic device coincides with preset coordinates, or the like.

In addition, for the audio information, the third electronic device closest to the speaker is generally capable of obtaining the audio information with the best audio quality, so the embodiment of the present application may further determine a transmission delay between the third electronic device and the first electronic device based on the location of the third electronic device, where the transmission delay refers to a difference between a time point when the same audio information is transmitted to the third electronic device and a time point when the same audio information is transmitted to the first electronic device.

It will be appreciated that for a certain third electronic device, the distance of its corresponding speaker from that third electronic device is small, and for all third electronic devices, the distance of the speaker from its corresponding third electronic device is minimal. For example, as shown in fig. 2, for the third electronic device a, the speaker corresponding to the third electronic device a is located in front of the third electronic device a and is far away from the other third electronic devices. Since the speaker corresponding to the third electronic device a is closest to the third electronic device a, the transmission time period from the speaker to the third electronic device a may be ignored for the third electronic device a. At this time, the ratio of the distance between the first electronic device and the third electronic device to the sound velocity is the standard transmission delay between the third electronic device and the first electronic device, i.e. the standard transmission delay represents the time period for transmitting the sound of the speaker in front of a certain third electronic device to the first electronic device.

As shown in fig. 2, taking the third electronic device a and the first electronic device as an example, the transmission delay between the two is the distance between the two divided by the sound velocity. Then, for a speaker, if the speaker speaks at the location of the third electronic device a, the transmission delay between the third electronic device a and the first electronic device should be substantially identical to the standard transmission delay between the third electronic device a and the first electronic device.

And selecting at least one third electronic device with a second position relation with the first electronic device as a target electronic device based on the transmission time delay between the third electronic device and the first electronic device so as to acquire first target information. Here, the second positional relationship may be a positional relationship indicating closest to the speaker.

In order to obtain the clearest and reliable audio information, the audio information or the audio-visual information collected by the third electronic device closest to the speaker is generally used as the first target information, and when the third electronic device except for the third electronic device a is far away from the speaker, the time taken for sound to propagate to the third electronic devices is longer than the time taken for sound to propagate to the third electronic device a, so that the transmission time delay between the third electronic device except for the third electronic device a and the first electronic device is smaller than the transmission time delay between the third electronic device a and the first electronic device, which means that the third electronic device a is the third electronic device corresponding to the current speaker, and then the audio information or the audio-visual information collected by the third electronic device a is used as the first target information.

That is, the embodiment of the application can obtain the distance between the third electronic device and the first electronic device based on the position of the third electronic device, determine the transmission delay between the third electronic device and the first electronic device, and determine the third electronic device with the transmission delay closest to the standard transmission delay as the target electronic device, so as to obtain the first target information. That is, the embodiment of the application can perform sound source positioning based on the microphone array formed by the microphones of the plurality of third electronic devices and the microphones of the first electronic device, and select the third electronic device in a specific position (for example, closest to the speaker) to collect the first target information.

For the scenario that the first electronic device is remotely connected, when determining the position, the first electronic device can be replaced by the fourth electronic device to perform position judgment and target electronic device selection.

In addition, in order to determine the third electronic device closest to the speaker, the embodiment of the present application may further acquire a time point when each third electronic device acquires the same audio information, and determine the third electronic device that acquires the audio information earliest as the target electronic device. Thus, the most clear and best quality audio information can be acquired.

S102, performing first processing on the first target information to obtain second target information.

After the first target information is obtained, the first target information is subjected to first processing to obtain second target information.

In some embodiments, performing a first process on the first target information to obtain second target information may include:

determining a communication connection state between the first electronic equipment and the second electronic equipment, and carrying out noise elimination and/or enhancement processing on the audio information or the video and audio information by the first electronic equipment or the fourth electronic equipment at least based on the communication connection state to obtain second target information; or alternatively, the first and second heat exchangers may be,

If the first electronic equipment is connected with the second electronic equipment through the second communication, the first electronic equipment recognizes the audio information or the video and audio information to obtain second target information, wherein the data size of the second target information is smaller than that of the first target information.

It should be noted that, in the embodiment of the present application, the first process for the first target information may be performed based on the communication connection state between the first electronic device and the second electronic device. When the first electronic device is remotely connected with the second electronic device, the fourth electronic device can perform noise elimination processing and/or enhancement processing on the first target information (including audio information or video and audio information) so as to obtain second target information; or after the first electronic device receives the first target information forwarded by the fourth device, the first electronic device may perform corresponding processing on the first target information to obtain the second target information.

When the first electronic device and the second electronic device are in the same space, the connection mode between the first electronic device and the second electronic device is called local connection, and the first electronic device can perform noise elimination processing and/or enhancement processing on the audio information or the video and audio information, so that second target information is obtained.

In addition, the connection state between the first electronic device and the second electronic device may include a normal connection state, a disconnection state, and the like in addition to the remote connection and the local connection. When the communication between the first electronic equipment and the second electronic equipment is disconnected, the first electronic equipment cannot send information to the second electronic equipment, and the first electronic equipment performs noise elimination processing and/or enhancement processing on the audio information or the video and audio information to obtain second target information.

Further, if there is a second communication connection (i.e., a local connection) between the first electronic device and the second electronic device, the first electronic device and the second electronic device may be in the same space at this time. At this time, the first electronic device may perform recognition processing on the first target information, for example, the first electronic device may recognize and convert audio information in audio information or video information into text information, generate a conference summary, and the like, where the recognition processing may further optimize the audio information and/or convert the optimized audio information into text information. For example, blank segments and repeated sentences are deleted, abstract processing is performed, and the like, and recording information of audio information before and/or after optimization, recording information of video and audio information, and the like can be generated. At this time, since the recognition processing is performed, the blank section, the repeated sentence, or the like is deleted, and the data amount of the generated second target information is smaller than the data amount of the first target information.

Further, when the first electronic device and the second electronic device are remotely connected, the identification process may be performed on the first target information to obtain the second target information.

The first target information may be from a plurality of third electronic devices, and in this case, the embodiment of the present application may mix the first target information to obtain the second target information, so that when the second target information is sent to the remote device, audio information heard by a participant at the remote end has a stereoscopic impression. Thus, in some embodiments, performing a first process on the first target information to obtain second target information may include:

determining a position of the third electronic device and a transmission delay between the third electronic device and the first electronic device or the fourth electronic device;

and mixing the first target information from the plurality of third electronic devices based on the positions and the transmission delays to obtain second target information.

It should be noted that, when the first target information comes from the plurality of third electronic devices, each third electronic device collects audio information or audiovisual information of the speaker (or non-speaker, etc.) closest to the third electronic device, and at this time, the positions of the third electronic devices for collecting the first target information (that is, the distances between the third electronic devices and the first electronic devices) may be determined separately, and the transmission delay between each third electronic device and the first electronic device may be determined. For a scene in which the fourth electronic device is present as a transit device, determining the position of each third electronic device relative to the fourth electronic device (i.e., determining the distance between the third electronic device and the fourth electronic device), and determining the transmission delay between each third electronic device and the fourth electronic device.

And then, carrying out audio mixing processing on the first target information from the plurality of third electronic devices according to the positions and the transmission delays of the third electronic devices, so as to obtain second target information. The mixing process is to integrate sounds from various sources into a stereo or mono track. The specific implementation of the mixing process may be implemented with reference to conventional understanding by those skilled in the art, and will not be described herein.

S103, outputting second target information, or giving the second target information to second electronic equipment for outputting; wherein, the output effect of the second target information is better than the output effect of the first target information.

After the second target information is obtained, the second target information may be output, or the first target information may be given to the second electronic device, and the second electronic device may output the second target information, and since the second target information is obtained after the first target information is processed, the output effect of the second target information is better than the output effect of the first target information.

It should be noted that, when the second target information is output, the third electronic devices may be set to a mute state, so as to avoid interference between signals.

In some embodiments, outputting the second target information, or giving the second target information to the second electronic device for output, may include:

If the first electronic device and the second electronic device are in first communication connection, giving second target information to the second electronic device for output, or obtaining hardware configuration information of the first electronic device and the fourth electronic device, and determining whether the first electronic device or the fourth electronic device outputs the second target information based on the hardware configuration information; or alternatively, the first and second heat exchangers may be,

If the first electronic device and the second electronic device are in second communication connection, hardware configuration information of the first electronic device and the second electronic device is obtained, and based on the hardware configuration information, it is determined that the first electronic device or the second electronic device outputs second target information; or alternatively, the first and second heat exchangers may be,

And if the first electronic equipment and the second electronic equipment are in second communication connection, obtaining output parameters and/or current space environment of second target information, and determining whether the first electronic equipment or the second electronic equipment outputs the second target information based on the output parameters and/or the current space environment.

When the second target information is output, if the first electronic device and the second electronic device have a first communication connection, that is, a remote connection between the first electronic device and the second electronic device, the second target information may be given to the second electronic device, so that the second electronic device may output the second target information at a remote end, so that a participant at the remote end may obtain the second target information.

In the local space, the second target information can be output through the first electronic device or the fourth electronic device. At this time, the hardware configuration information of the first electronic device and the second electronic device may be obtained, where the hardware configuration information may include a playing sound size and a sound quality of the electronic device, and if the audio-visual information is output, the hardware configuration information may further include a screen size, a screen resolution, a picture quality, and the like of the electronic device. And based on the hardware configuration information of the first electronic equipment and the fourth electronic equipment, selecting the electronic equipment with better output effect from the hardware configuration information to output the second target information. For example, an electronic device capable of playing a larger sound or a larger screen or a better sound quality, image quality, or the like is selected to output the second target information.

In addition, the electronic device for outputting the second target information may also be determined based on the specific scene, the number of people in the current space, ambient noise, the location distribution of the participants, etc. in combination with the hardware configuration information of the electronic device. For example, in the case where the current space is large in number of people, the environment is noisy, and the positions of the participants are relatively scattered, an electronic device that plays louder sound is selected as the electronic device that outputs the second target information.

It should be noted that if the first electronic device and the second electronic device have a second communication connection, that is, the first electronic device and the second electronic device are in the same space, the first electronic device and the second electronic device are locally connected. At this time, the output parameters of the second target information and/or the current space environment may be obtained, where the output parameters of the second target information may include whether the second target information is stereo, only audio information or audio-visual information, and the volume of the audio information, etc.; the current spatial environment may include the size of the conference room, the layout among the first electronic device, the second electronic device, and the third electronic device, and so on.

And selecting one from the first electronic device and the second electronic device to output the second target information based on the output parameters of the second target information and/or the current space environment. For example, if the second target information is stereo, the electronic device with the stereo output function is selected to output the second target information, if the second target information is video and audio information, the electronic device with the display screen is selected to output the second target information, and if the current space is larger, the electronic device with larger playing sound is selected to output the second target information.

Further, in some embodiments, the method may further comprise:

Acquiring an echo cancellation signal sent by first electronic equipment or fourth electronic equipment;

Determining a target sound pickup apparatus based on a positional relationship between the third electronic apparatus and the first electronic apparatus or the fourth electronic apparatus, to perform an echo cancellation operation by giving an echo cancellation signal to a microphone of the first electronic apparatus or the fourth electronic apparatus, and to perform an echo cancellation operation by giving an echo cancellation signal to the target sound pickup apparatus, which is the third electronic apparatus in a sound pickup state.

It should be noted that, in the audio output process, there may be a problem of echo interference, so in the local space where the first electronic device is located, the electronic device that outputs the second target signal (i.e., the first electronic device or the fourth electronic device, or may also be the local second electronic device) may also generate the echo cancellation reference signal, and determine the target pickup device based on the positional relationship between the third electronic device and the first electronic device or the fourth electronic device, where the determining method is as described above, and the target pickup device represents the third electronic device in the pickup state.

For the first electronic device or the fourth electronic device, when performing the echo cancellation operation, the echo cancellation signal may be given to the microphone thereof to perform the echo cancellation operation, and the echo cancellation signal may also be given to the target sound pickup device, so that the target sound pickup device performs the echo cancellation operation, thereby avoiding picking up the echo and causing interference.

In some embodiments, the method may further comprise:

transmitting, by the first electronic device or the fourth electronic device, a detection signal to at least one third electronic device at a first time;

Receiving detection response signals returned by at least one third electronic device, and recording second time for receiving the detection response signals of the at least one third electronic device;

determining the network delay of each third electronic device according to the second time and the first time of each third electronic device;

Determining delay compensation time of each third electronic device according to the network delay of each third electronic device;

accordingly, after obtaining the second target information, the method may further include:

And after the delay compensation time of the third electronic equipment corresponding to the second target information is separated, executing the step of outputting the second target information or giving the second target information to the second electronic equipment for outputting.

It should be noted that, for the third electronic device, there may be a different network delay between the third electronic device and the first electronic device (if the first target signal is obtained by the fourth electronic device) due to a network reason or a hardware reason. Therefore, the embodiment of the application also carries out delay compensation of signal transmission on each third electronic device.

First, the first electronic device or the fourth electronic device respectively sends detection signals to the third electronic devices at the first time, receives detection response signals returned by each third electronic device, and records second time when the detection response signals returned by each third electronic device are received.

Then, based on the second time and the first time, a network delay of each third electronic device is determined. Specifically, one half of the time difference between the second time and the first time is taken as the network delay of the third electronic device.

And finally, determining the delay compensation time of each third electronic device based on the network delay of each third electronic device. Illustratively, the network delays of the different third electronic devices are different, some of which are short, and others of which are long, and others of which are slow. At this time, if a third electronic device with a long network delay first sends audio information, and then a third electronic device with a short network delay also sends audio information, then when the audio information of the former is not yet output, the audio information of the latter may already start to be output, thereby causing signal interference. Therefore, based on the network delay of each third electronic device, the delay compensation time of each third electronic device is respectively determined, and the time sequence consistency of the first target signal sent by each third electronic device is ensured. Assuming that there are two third electronic devices a and B, where the network delay of the third electronic device a is 3 milliseconds (ms) and the network delay of the third electronic device B is 4ms, in order to ensure that the two sending first target signals are consistent in time sequence, a delay compensation time of 4-3=1 ms is set for the third electronic device a, and the delay compensation time of the second electronic device B is 0.

Correspondingly, when the second target information is output, after the second target information is obtained, determining the source of the first target information corresponding to the second target information, namely the third electronic equipment corresponding to the second target information, and determining the delay compensation time of the third electronic equipment corresponding to the second target information. And then outputting the second target information or giving the second target information to the second electronic equipment for outputting after the delay compensation time is spaced, thereby avoiding the interference problem caused by inconsistent network delay of each third electronic equipment.

In some embodiments, the method may further comprise:

obtaining third target information from the electronic equipment outside the current space;

determining to output the third target information by the first electronic device or the fourth electronic device based on the attribute information of the third target information and/or the environmental information and/or the device information in the current space.

When there is a remote participant outside the current space, it is sometimes necessary to output third target information from the electronic device outside the current space. Under the condition that the first electronic equipment and the second electronic equipment are in remote connection, the second electronic equipment is the electronic equipment outside the current space, and the first electronic equipment obtains audio information or video information acquired by at least one third electronic equipment through fourth electronic equipment; in the case where the first electronic device establishes a local connection with the second electronic device, an electronic device outside the current space may establish a remote connection with the first electronic device or the second electronic device.

And then determining to output the third target information by the first electronic device or the fourth electronic device based on the attribute information of the third target information and/or the environmental information and/or the device information in the current space. The device information may include hardware configuration information of the first electronic device and the fourth electronic device, and a specific determination manner of selecting the first electronic device or the fourth electronic device as the output device for outputting the third target information may refer to the foregoing manner of outputting the second target information, which is not described herein again.

The embodiment of the application provides a processing method, which comprises the following steps: after the first electronic equipment and the second electronic equipment are in communication connection, first target information in the current space is obtained, wherein the first target information is from at least one third electronic equipment connected with the first electronic equipment; performing first processing on the first target information to obtain second target information; outputting the second target information, or giving the second target information to the second electronic equipment for outputting; wherein, the output effect of the second target information is better than the output effect of the first target information. In this way, based on the connection between the first electronic device and the third electronic device, the first target information is received, the first target information is processed into the second target information, the first electronic device or the second electronic device outputs the second target information, and the output effect of the second target information after the first processing is superior to that of the first target information, so that multiple scenes can be adapted, the problems of mutual interference between audio information or video information, interference among users of multiple third electronic devices and the like during multiple devices are avoided, the conference effect is improved, and conference quality is further improved.

In another embodiment of the present application, referring to fig. 3, an application scenario schematic diagram of a processing method provided by an embodiment of the present application is shown. As shown in fig. 3, the scene is a conference scene, and the local participants participate in the same conference in the same space, and the remote participants also participate in the conference through remote connection. The local space includes audio relay, main device, n local devices (local device 1, local device 2, … …, local device n), etc., and the remote space includes remote device. The audio relay may also be an audio-video relay, that is, if the first target information to be processed is audio information, the audio relay may be used to implement processing only for the audio information, and if the first target information is audio-video information, the audio-video relay may be used to implement processing only for the audio-video information and the audio information, which is specifically determined in combination with actual scene requirements.

It should be further noted that, in fig. 3, the audio transducer may be the first electronic device or the fourth electronic device in the foregoing embodiment; the master device may be the second electronic device in the foregoing embodiment when the second electronic device is in the same space as the first electronic device (i.e., a local scene), and the remote device may be the second electronic device in the foregoing embodiment when the second electronic device is in a different space as the first electronic device (i.e., a remote scene). It will be appreciated that, for a remote participant, the remote space in which the remote participant is located may include one or more remote devices, and the remote space may have the same or similar device layout as the local space in which the local participant is located, so that, for the remote space, the processing of video information or audio-visual information may also be performed according to the method provided by the embodiment of the present application, if for the remote participant, the remote space in which the remote participant is located is taken as the local space, then the local space in fig. 3 is the remote space relative to the remote participant.

In the conference scenario shown in fig. 3, the master device may integrate the function of the audio transducer, i.e. the audio transducer may be removed, and each local device directly establishes a connection with the master device. Here, as shown in fig. 3, the implementation of the processing method in this scenario will be described in detail by taking the example that the audio transducer corresponds to the first electronic device, the main device corresponds to the second electronic device, the remote device corresponds to the electronic device outside the current space, and the local device corresponds to the third electronic device.

As shown in fig. 3, wireless connection is established between the audio repeater and n local devices, and the audio repeater may receive first target information sent by the local devices through the wireless connection; the audio transfer device and the main device are connected in a wired way, so that information can be sent to the main device and received from the main device, and the audio transfer device and the main device both have the function of audio playing (can also comprise the function of playing video images); the main equipment establishes remote connection with the remote equipment, can send information to the remote equipment and receive the information sent by the remote equipment.

As shown in fig. 3, the host device may be specifically an integrated machine, a desktop computer, a notebook computer, or the like. In addition, the master device may be any local device, that is, the master device may be a specific device, for example, a conference device, a computer, etc. pre-installed in a conference room, or an electronic device such as a notebook computer, etc. carried by a local participant, and may be used as the master device when a conference is performed.

Further, referring to fig. 4, a schematic process of sending and receiving information provided by an embodiment of the present application is shown. As shown in fig. 4, each of the local apparatuses includes a sound pickup device (microphone) for picking up audio information, wherein the user of the local apparatus 1 is the user 1, the user of the local apparatus 2 is the user 2, … …, and the user of the local apparatus n is the user n, in practice, there may be a case where several users share one local apparatus or one user uses a plurality of local apparatuses, and each local apparatus is in a mute state. After obtaining the audio information of the user, the local device may first perform audio preprocessing, such as noise reduction, and the like, on the audio information, and then send the audio information after performing audio preprocessing to the audio transducer based on a wireless connection, such as a wireless fidelity (WIRELESS FIDELITY, WIFI) connection, between the local device and the audio transducer. The audio repeater can receive audio information (first target information) sent by the local equipment through a WiFi module in the audio repeater, decodes the audio signal, and further performs noise reduction processing (relative to secondary noise reduction), automatic gain control (Automatic Gain Control, AFC) and the like on the audio signal through a digital signal processor (DIGITAL SIGNAL Process, DSP) or other processors in the audio repeater to obtain processed audio information (second target information); and then, in the sending direction of the audio repeater, sending the second target information to the main device, playing the second target information by the main device through a playing path 1 (the playing path 1 represents playing by a loudspeaker and other devices on the main device), and in the case of a far-end user (such as a far-end participant), sending the second target information to the far-end device through the cloud server by the main device, so that the far-end device plays and outputs the second target information. The audio relay may play the second target information through its own play path 2 (play path 2 indicates that the second target information is played by a device such as a speaker on the audio relay).

In addition, in the receiving direction (the sending and receiving in fig. 4 are the sending and receiving of the audio repeater relative to the main device), in the case that there is a remote user, the main device also receives the third target information sent by the remote device through the cloud server, and plays and outputs the third target information through the playing path 1; or the third target information is sent to the audio relay, and the audio relay plays and outputs the third target information through the playing path 2 after receiving the third target information.

In the embodiment of the present application, at least one of n local devices may be selected for pickup, and referring to fig. 5, a schematic spatial arrangement diagram of an electronic device provided in the embodiment of the present application is shown. As shown in fig. 5, the system includes an audio repeater, a main device and n local devices, where each local device can at least pick up audio information or video information of its corresponding user. The audio relay receives the audio information or the audio-video information sent by the specific local equipment, processes the audio information or the audio-video information to obtain second target information, and sends the second target information to the main equipment to execute subsequent operation, or the audio relay and the main equipment can be integrated into a whole to realize related functions.

As shown in fig. 4, for a specific local device for transmitting the first target information, the determination manner may be: for the same audio information, because the sound transmission requires time, the local device closest to the user sending the audio information must receive the audio information earliest, and in the case that the local device is near-field pickup, the nearby local devices may pick up the audio information, and at this time, the time point when each local device obtains the same audio information can be respectively determined, the local device which receives the audio information earliest is determined as the target electronic device, and the audio information collected by the target electronic device is taken as the first target information. For some scenes with multiple speakers, the method can also obtain the target electronic device corresponding to each piece of audio information to collect the first target information. In addition, the target electronic device may also be determined based on the transmission delay between the local device and the audio repeater, and the specific manner may refer to the above embodiment.

If the local device has far-field pickup capability, the local device at a specific position can be selected to pick up sound according to the magnitude of the signal energy value on the basis of the judgment mode, so as to acquire the first target information.

Furthermore, the embodiment of the application can also locate the distance information according to different time differences of the audio information at different positions reaching the local equipment and the audio repeater, so as to judge that the first target information from a plurality of local equipment is subjected to audio mixing processing according to the distance information.

For example, when the user 1 speaks at the location of the local device 1, the time difference between the time point when the sound reaches the microphone of the local device 1 and the time point when the sound reaches the microphone of the audio transducer is Δt1, and the distance L1 between the local device 1 and the audio transducer can be obtained from the sound propagation velocity to be approximately: l1=340×Δt1, and so on, the distances L2, L3, L4, … …, ln between each local device and the audio repeater can be obtained.

In this way, in some special scenes, the microphones of the plurality of local devices are all picked up simultaneously, according to the time difference, the distance between the local device and the audio transducer of each pickup can be obtained, the audio information sub-channel (channel) picked up by the local device of each pickup is mixed by using the stereo technology to generate second target information with surround stereo effect and then transmitted to the far-end device, so that the physical talking position of each person of the sender can be perceived during the far-end playing, and better conference experience is obtained.

Further, there may be network delay for each local device in wireless transmission with the audio repeater, and at this time, delay compensation matching may be performed for each local device separately. Referring to fig. 6, a schematic diagram of a determination manner of network delay according to an embodiment of the present application is shown. As shown in fig. 6, the audio repeater may actively transmit a detection signal to each local device, receive a detection response signal returned from the local device, and determine a half of the time difference between transmitting the detection signal and receiving the detection response signal as a network delay. I.e. as shown in fig. 6, the network delay T _M1＝t₁/2 of the local device 1; network delay T _M2＝t₂/2 of the local device 2; network delay T _M3＝t₃/2 of the local device 3; network delay T _M4＝t₄/2 of the local device 4; … …; network delay T _Mn＝t_n/2 of local device n; wherein t ₁、t₂、t₃、t₄,……,t_n represents each time difference.

Therefore, the method can be combined with the method to obtain the local equipment currently picking up sound, so that the matching compensation of network delay is further carried out, and the consistency of the sound time sequence sent out by each local equipment is ensured.

Further, in order to avoid the problem of echo interference, the embodiment of the application also carries out echo cancellation on the audio signal. Referring to fig. 7, a schematic flow chart of an echo cancellation process according to an embodiment of the present application is shown. As shown in fig. 7, microphone 0 (mic 0) represents a microphone of the audio transducer, microphone 1 (mic 1) represents a microphone of the local device 1, microphone 2 (mic 2) represents a microphone of the local device 2, microphone 3 (mic 3) represents a microphone of the local device 3, mic4 represents a microphone of the local device 4, … …, and microphone n (mic n) represents a microphone of the local device n. The audio transducer and the microphones of the respective local devices are also connected based on the connection between the audio transducer and the respective local devices. As shown in fig. 7, it indicates that when the local device 3-bit sound pickup device is connected to the audio relay ON, the remaining microphones are disconnected from the audio relay ON (OFF), and at this time, for the audio information from the local device, only the audio information picked up by the microphone 4 of the local device 4 is transmitted to the audio relay as the first target information.

As shown in fig. 7, the echo cancellation may include both Downlink (Downlink) and Uplink (Uplink) directions; for a signal transmitted to the audio transducer through the downlink, which is typically a third target signal transmitted by the far-end device, at this time, after the audio signal is transmitted, the audio signal is first subjected to a series of processes such as Automatic Gain Control (AGC), dynamic range compression (DYNAMIC RANGE compression, DRC), equalization (Equalizer), amplification (Amplifier), and the like, and the resulting audio signal is then fed to the audio transducer, which generates an Echo cancellation signal (AEC REFERENCE) based on the audio signal, and transmits the Echo cancellation signal to its own microphone 0, so that the microphone 0 performs an Echo cancellation operation (AEC), echo Suppression operation (Echo Suppression), an on-process, an automatic gain control process, and the like, based on the Echo cancellation signal. It will be appreciated that in some cases, the sound of the speaker may be picked up directly by the microphone 0, for example, the speaker is closest to the audio transducer, or the device used by the person hosting the conference is the audio transducer or the master device integrated with the audio transducer, so that, based on the echo cancellation process, no echo interference occurs when the microphone 0 picks up the sound, no echo or other output is produced when the audio transducer outputs the sound, and no audio information including echo, noise or other is produced when the audio information is transmitted to the remote device through the uplink.

The audio repeater also transmits an echo cancellation signal to the local device which is picking up sound, so that the microphone of the local device performs echo cancellation based on the echo cancellation signal, and therefore, when the local device picks up audio information, the local device cannot pick up the echo information, and the local device is prevented from transmitting the echo information. After the local device performs sound pickup, automatic echo cancellation processing, echo suppression processing, noise reduction processing, and the like can be performed based on the echo cancellation signal, and then the processed audio information is sent to the audio transducer or the main device performs noise reduction processing again and performs automatic gain control, and the obtained audio information can be sent to the remote device through an uplink.

In short, the embodiment of the application can develop an audio relay (also called an audio relay enhancer) applied to a conference, a teaching and other multi-person scene, or a main device integrated with the function of the audio relay, when a plurality of persons participate in the same conference in the same space, local devices (such as a notebook and a mobile phone) in a local space can be accessed to the audio relay in a wireless connection mode to perform local networking. The audio transmission and receiving transfer is realized, and a plurality of devices in the same space participate in the same conference at the same time.

For the audio repeater, in the transmitting direction, the following three technical points are at least considered in the embodiment of the application:

(1) Fig. 5, which describes how the microphone of the local device cooperates with the microphone of the audio transducer to obtain distance information, matches the microphone of the local device in the optimal position for pick-up. This scheme is near field pickup based on the pickup ability of local equipment, and at present, electronic equipment such as notebook, cell-phone on the market also all are near field pickup. The distance information is located according to the difference of the time difference of the sound of different positions reaching the audio transducer, and the position from which the sound comes is judged so as to be matched with the microphone of the local equipment of the optimal position for pickup. For example, in fig. 5, when the user 1 speaks at the location of the local device 1, the time difference between the time point when the sound reaches the microphone of the local device 1 and the time point when the sound reaches the microphone of the audio transducer is Δt1, l1=340×Δt1 can be obtained from the sound propagation speed, and so on, L2, L3, L4, L5 … … Ln can be obtained. When a speaker speaks, according to the time difference delta T, the distance between each electronic device and the audio transducer can be obtained, the current speaker is judged to be closer to the microphone of the local device, and then the microphone of the local device closest to the current speaker is used for pickup, noise reduction and enhancement processing are carried out on the audio signal of the channel, and the sound quality of the transmitting direction is improved. If a local device with far-field pickup capability exists, the judgment of the signal energy value can be added to select the microphone of the local device with the optimal position for pickup on the judgment standard.

(2) Since there may be network delay for each local device to wirelessly transmit to the audio transducer, delay compensation matching is also performed for each local device. For each accessed local device, the audio transducer can actively transmit the detection signal, acquire network delay data of each local device and record the network delay data. In fig. 6, T _Mi (i=1, 2,3, … …, n) is the network delay between each local device and the audio repeater, and by combining the method in the technical point (1), the local device currently picking up sound can be obtained, so as to perform matching compensation of the network delay, and ensure the time sequence consistency of the audio information sent out by each local device.

(3) In some special scenes, the microphones of a plurality of local devices are all picked up at the same time, the distance information acquired by the technical point (1) can be combined, audio information of the local devices is subjected to sound mixing by a stereo technology in a split channel mode, surround stereo is generated and is sent to the remote device, and the physical speaking position of each person of a sender can be perceived during remote playing, so that better conference experience is obtained.

(4) The embodiment of the application also utilizes the cooperation of the audio transducer and the local equipment to carry out echo cancellation. Besides being transferred to the microphone reference, the echo cancellation signal generated by the audio transducer is transferred to the local equipment which is picking up the sound at present as the reference signal after the local equipment which is picking up the sound at present is determined, so that the purpose of echo cancellation is achieved.

The audio repeater provided by the embodiment of the application and the detailed explanation of the processing method performed on the audio repeater can show that the technical advantages in the transmitting direction at least comprise: (1) Each local device is placed at different positions, the microphone of each local device can be used as a pickup, and the microphones on the audio transducer are combined in space, so that distance information of a speaker is obtained, and according to the position information, the microphone at the optimal position can be automatically used for pickup. (2) The network delay of wireless transmission between the audio transit device and each local device can be automatically obtained, and further, targeted delay compensation is carried out on the local device which is picking up sound. (3) When the received audio information comes from a plurality of local devices, the audio information of each local device can be mixed into stereo by the channels, and the stereo audio information is sent to the remote device, so that the physical position of a near-end speaker can be perceived when the remote device plays. (4) Echo cancellation processing and secondary noise reduction are performed on the local device that is picking up sound, so that the remote client has better listening experience.

Its technical advantages in the receiving direction may include at least: (1) The local equipment of the network can be set to be mute, and after the sound of the far-end participant is transmitted back, the sound can be played at the main equipment or when the volume of the main equipment is not big enough, the sound is selected to be played at the audio transducer, so that the listening requirement of all the participants is met.

Therefore, the embodiment of the application integrates the two directions of sending and receiving to meet the requirements of conference audio, and simultaneously increases the processing of noise reduction, echo cancellation and the like of the conference audio, thereby providing a brand new audio experience for users. The method can enable the positioning of the audio information or the video information to be more accurate in the conference process, and the audio information or the video information has better tone quality after noise reduction, enhancement, echo cancellation and other processing.

In yet another embodiment of the present application, referring to fig. 8, a schematic diagram of the composition and structure of a processing device 80 according to an embodiment of the present application is shown. As shown in fig. 8, the processing device 80 may include:

An obtaining unit 801, configured to obtain, after the first electronic device and the second electronic device establish a communication connection, first target information in a current space, where the first target information is from at least one third electronic device connected to the first electronic device;

A processing unit 802 configured to perform a first process on the first target information to obtain second target information;

an output unit 803 configured to output the second target information, or give the second target information to the second electronic device for output; wherein, the output effect of the second target information is better than the output effect of the first target information.

It should be noted that, in the embodiment of the present application, the processing apparatus 80 may be the first electronic device, or may be integrated on the first electronic device.

In some embodiments, the obtaining unit 801 is specifically configured to determine a positional relationship between a third electronic device and a first electronic device in the current space, and take audio information or video information collected by at least one third electronic device having a target positional relationship with the first electronic device as first target information; or determining user information of the third electronic equipment in the current space, and taking audio information or video and audio information acquired by the third electronic equipment used by a target user as first target information; or, obtaining attribute parameters of audio information or video information acquired by third electronic equipment in the current space, and determining the audio information or video information with target attribute as first target information; or if the first electronic equipment is in first communication connection with the second electronic equipment, acquiring audio information or video information acquired by at least one third electronic equipment connected with the first electronic equipment through the fourth electronic equipment.

It should be noted that, in the embodiment of the present application, in the case of having the fourth electronic device, the processing apparatus 80 may be the fourth electronic device, or may be integrated on the fourth electronic device.

In some embodiments, the obtaining unit 801 is specifically configured to combine the microphone and/or the camera on the at least one third electronic device with the microphone and/or the camera on the first electronic device to form a corresponding microphone array and/or a camera array; and determining a position of the third electronic device using the microphone array and/or the camera array; the audio information or the video and audio information collected by at least one third electronic device with a first position relation with the first electronic device is used as first target information; or determining the transmission time delay between the third electronic equipment and the first electronic equipment based on the position, and taking the audio information or the video and audio information acquired by at least one third electronic equipment with a second position relation with the first electronic equipment as first target information based on the transmission time delay.

In some embodiments, the processing unit 802 is specifically configured to determine a communication connection state between the first electronic device and the second electronic device, and perform noise cancellation and/or enhancement processing on the audio information or the audio-visual information by the first electronic device or the fourth electronic device based at least on the communication connection state, so as to obtain second target information; or if the first electronic equipment is connected with the second electronic equipment through the second communication, the first electronic equipment performs identification processing on the audio information or the video and audio information to obtain second target information, wherein the data size of the second target information is smaller than that of the first target information.

In some embodiments, the processing unit 802 is specifically configured to determine a location of the third electronic device and a transmission delay between the third electronic device and the first electronic device or the fourth electronic device; and mixing the first target information from the plurality of third electronic devices based on the positions and the transmission delays to obtain second target information.

In some embodiments, the obtaining unit 801 is further configured to obtain an echo cancellation signal sent by the first electronic device or the fourth electronic device;

The processing unit 802 is further configured to determine a target sound pickup apparatus based on a positional relationship between the third electronic apparatus and the first electronic apparatus or the fourth electronic apparatus, to perform an echo cancellation operation for giving an echo cancellation signal to a microphone of the first electronic apparatus or the fourth electronic apparatus, and to perform an echo cancellation operation for giving an echo cancellation signal to the target sound pickup apparatus, which is the third electronic apparatus in a sound pickup state.

In some embodiments, the output unit 803 is specifically configured to give the second target information to the second electronic device for outputting if the first electronic device and the second electronic device have a first communication connection, or obtain hardware configuration information of the first electronic device and the fourth electronic device, and determine that the second target information is output by the first electronic device or the fourth electronic device based on the hardware configuration information; or if the first electronic device and the second electronic device are in second communication connection, hardware configuration information of the first electronic device and the second electronic device is obtained, and the first electronic device or the second electronic device is determined to output second target information based on the hardware configuration information; or if the first electronic device and the second electronic device are in second communication connection, obtaining the output parameter and/or the current space environment of the second target information, and determining that the second target information is output by the first electronic device or the second electronic device based on the output parameter and/or the current space environment.

In some embodiments, the obtaining unit 801 is further configured to obtain third target information, the third target information being from an electronic device outside the current space;

the output unit 803 is further configured to determine that the third target information is output by the first electronic device or the fourth electronic device based on the attribute information of the third target information and/or the environment information and/or the device information in the current space.

In some embodiments, as shown in fig. 8, the processing apparatus 80 may further include a determining unit 804 configured to send, by the first electronic device or the fourth electronic device, a detection signal to at least one third electronic device at a first time; receiving detection response signals returned by at least one third electronic device, and recording second time for receiving the detection response signals of the at least one third electronic device; determining the network delay of each third electronic device according to the second time and the first time of each third electronic device; determining delay compensation time of each third electronic device according to the network delay of each third electronic device;

The output unit 803 is further configured to execute the step of outputting the second target information or giving the second target information to the second electronic device for outputting after the delay compensation time of the third electronic device corresponding to the second target information is spaced.

For the embodiment of the present application, technical details not disclosed about the processing device 80 may be understood with reference to the foregoing processing method embodiment.

It will be appreciated that in this embodiment, the "unit" may be a part of a circuit, a part of a processor, a part of a program or software, etc., and may of course be a module, or may be non-modular. Furthermore, the components in the present embodiment may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit. The integrated units may be implemented in hardware or in software functional modules.

The integrated units, if implemented in the form of software functional modules, may be stored in a computer-readable storage medium, if not sold or used as separate products, and based on such understanding, the technical solution of the present embodiment may be embodied essentially or partly in the form of a software product, which is stored in a storage medium and includes several instructions to cause a computer device (which may be a personal computer, a server, or a network device, etc.) or processor to perform all or part of the steps of the method described in the present embodiment. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read Only Memory (ROM), a random access Memory (Random Access Memory, RAM), a magnetic disk, or an optical disk, or other various media capable of storing program codes.

Accordingly, the present embodiment provides a computer storage medium storing a computer program which, when executed by at least one processor, implements the steps of the processing method of any of the preceding embodiments.

Based on the above-mentioned composition of the processing device 80 and the computer storage medium, referring to fig. 9, a schematic diagram of the composition structure of an electronic apparatus 90 according to an embodiment of the present application is shown. As shown in fig. 9, may include: a communication interface 901, a memory 902, and a processor 903; the various components are coupled together by a bus system 904. It is appreciated that the bus system 904 is used to facilitate connected communications between these components. The bus system 904 includes a power bus, a control bus, and a status signal bus in addition to a data bus. But for clarity of illustration, the various buses are labeled as bus system 904 in fig. 9. The communication interface 901 is configured to receive and send signals in a process of receiving and sending information with other external network elements;

a memory 902 for storing a computer program capable of running on the processor 903;

the processor 903 is configured to execute, when executing the computer program:

It will be appreciated that the memory 902 in embodiments of the application can be either volatile memory or nonvolatile memory, or can include both volatile and nonvolatile memory. The nonvolatile Memory may be a Read-Only Memory (ROM), a Programmable ROM (PROM), an Erasable PROM (EPROM), an Electrically Erasable EPROM (EEPROM), or a flash Memory. The volatile memory may be random access memory (Random Access Memory, RAM) which acts as external cache memory. By way of example, and not limitation, many forms of RAM are available, such as static random access memory (STATIC RAM, SRAM), dynamic random access memory (DYNAMIC RAM, DRAM), synchronous dynamic random access memory (Synchronous DRAM, SDRAM), double data rate Synchronous dynamic random access memory (Double DATA RATE SDRAM, DDRSDRAM), enhanced Synchronous dynamic random access memory (ENHANCED SDRAM, ESDRAM), synchronous Link DRAM (SLDRAM), and Direct memory bus RAM (DRRAM). The memory 902 of the systems and methods described herein is intended to comprise, without being limited to, these and any other suitable types of memory.

And the processor 903 may be an integrated circuit chip with signal processing capabilities. In implementation, the steps of the above method may be performed by integrated logic circuitry of hardware in the processor 903 or instructions in the form of software. The Processor 903 may be a general purpose Processor, a digital signal Processor (DIGITAL SIGNAL Processor, DSP), an Application SPECIFIC INTEGRATED Circuit (ASIC), a field programmable gate array (Field Programmable GATE ARRAY, FPGA) or other programmable logic device, discrete gate or transistor logic device, discrete hardware components. The disclosed methods, steps, and logic blocks in the embodiments of the present application may be implemented or performed. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like. The steps of the method disclosed in connection with the embodiments of the present application may be embodied directly in the execution of a hardware decoding processor, or in the execution of a combination of hardware and software modules in a decoding processor. The software modules may be located in a random access memory, flash memory, read only memory, programmable read only memory, or electrically erasable programmable memory, registers, etc. as well known in the art. The storage medium is located in the memory 902, and the processor 903 reads information in the memory 902, and in combination with the hardware, performs the steps of the method described above.

It is to be understood that the embodiments described herein may be implemented in hardware, software, firmware, middleware, microcode, or a combination thereof. For a hardware implementation, the processing units may be implemented within one or more Application SPECIFIC INTEGRATED Circuits (ASICs), digital signal processors, digital signal processing devices (DSP DEVICE, DSPD), programmable logic devices (Programmable Logic Device, PLDs), field-Programmable gate arrays (Field-Programmable GATE ARRAY, FPGA), general purpose processors, controllers, micro-controllers, microprocessors, other electronic units configured to perform the functions described herein, or a combination thereof.

For a software implementation, the techniques described herein may be implemented with modules (e.g., procedures, functions, and so on) that perform the functions described herein. The software codes may be stored in a memory and executed by a processor. The memory may be implemented within the processor or external to the processor.

Optionally, as another embodiment, the processor 903 is further configured to perform the steps of the method of any of the preceding embodiments when the computer program is run.

In yet another embodiment of the present application, referring to fig. 10, a schematic diagram of the composition structure of another electronic device 90 according to an embodiment of the present application is shown. As shown in fig. 10, the electronic device 90 may include a processing apparatus 80 as described in any of the previous embodiments.

In some embodiments, the electronic device 90 may be the first electronic device or the fourth electronic device described previously.

As for the electronic device 90, since it includes the processing apparatus 80 in the foregoing embodiment, it is possible to avoid information interference between each other at the time of a multi-person conference, improve the conference effect, and be suitable for various scenes.

It should be noted that, in the present application, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.

The foregoing embodiment numbers of the present application are merely for the purpose of description, and do not represent the advantages or disadvantages of the embodiments.

The methods disclosed in the method embodiments provided by the application can be arbitrarily combined under the condition of no conflict to obtain a new method embodiment.

The features disclosed in the several product embodiments provided by the application can be combined arbitrarily under the condition of no conflict to obtain new product embodiments.

The features disclosed in the embodiments of the method or the apparatus provided by the application can be arbitrarily combined without conflict to obtain new embodiments of the method or the apparatus.

The foregoing is merely illustrative of the present application, and the present application is not limited thereto, and any person skilled in the art will readily recognize that variations or substitutions are within the scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims

1. A method of processing, comprising:

after the first electronic equipment and the second electronic equipment are in communication connection, first target information in a current space is obtained, wherein the first target information is from at least one third electronic equipment connected with the first electronic equipment;

Wherein the output effect of the second target information is better than the output effect of the first target information;

Wherein the outputting the second target information or outputting the second target information to the second electronic device includes at least one of:

If the first electronic device and the second electronic device have a first communication connection, giving the second target information to the second electronic device for output, or obtaining hardware configuration information of the first electronic device and the fourth electronic device, and determining that the second target information is output by the first electronic device or the fourth electronic device based on the hardware configuration information; wherein the fourth electronic device, the first electronic device, and the at least one third electronic device are within the current space;

if the first electronic device and the second electronic device are in second communication connection, hardware configuration information of the first electronic device and the second electronic device is obtained, and the first electronic device or the second electronic device is determined to output the second target information based on the hardware configuration information; or alternatively, the first and second heat exchangers may be,

And if the first electronic equipment and the second electronic equipment are in second communication connection, obtaining output parameters and/or current space environment of the second target information, and determining to output the second target information by the first electronic equipment or the second electronic equipment based on the output parameters and/or the current space environment.

2.A method of processing, comprising:

the method further comprises the steps of:

determining to output the third target information by the first electronic device or a fourth electronic device based on attribute information of the third target information and/or environmental information and/or device information in the current space; wherein the fourth electronic device, the first electronic device, and the at least one third electronic device are within the current space.

3. The method according to claim 1 or 2, wherein the obtaining the first target information in the current space comprises:

Determining the position relation between a third electronic device in the current space and the first electronic device, and taking audio information or video information acquired by at least one third electronic device with a target position relation with the first electronic device as the first target information; or alternatively, the first and second heat exchangers may be,

Determining user information of third electronic equipment in a current space, and taking audio information or video information acquired by the third electronic equipment used by a target user as first target information; or alternatively, the first and second heat exchangers may be,

If the first electronic equipment is in first communication connection with the second electronic equipment, audio information or video information acquired by at least one third electronic equipment connected with the first electronic equipment is acquired through fourth electronic equipment.

4. The method of claim 3, wherein the taking as the first target information the audio information or the audiovisual information collected by at least one third electronic device having a target positional relationship with the first electronic device, comprises:

Forming a microphone array and/or a camera array corresponding to the microphone and/or the camera on the first electronic equipment by using the microphone and/or the camera on the at least one third electronic equipment;

Taking audio information or video information acquired by at least one third electronic device with a first position relation with the first electronic device as the first target information; or alternatively, the first and second heat exchangers may be,

And determining the transmission time delay between the third electronic equipment and the first electronic equipment based on the position, and taking the audio information or the video information acquired by at least one third electronic equipment with a second position relation with the first electronic equipment as the first target information based on the transmission time delay.

5. The method of claim 4, wherein the performing the first processing on the first target information to obtain second target information includes:

Determining a communication connection state between the first electronic device and the second electronic device, and carrying out noise elimination and/or enhancement processing on the audio information or the video information by the first electronic device or the fourth electronic device at least based on the communication connection state to obtain the second target information; or alternatively, the first and second heat exchangers may be,

And if the first electronic equipment is connected with the second electronic equipment through a second communication, the first electronic equipment performs identification processing on the audio information or the video and audio information to obtain the second target information, wherein the data volume of the second target information is smaller than that of the first target information.

6. The method of claim 4, wherein the performing the first processing on the first target information to obtain second target information includes:

determining a position of a third electronic device and a transmission delay between the third electronic device and the first electronic device or a fourth electronic device;

and mixing the first target information from the plurality of third electronic devices based on the positions and the transmission delays to obtain the second target information.

7. The method of claim 4, further comprising:

Acquiring an echo cancellation signal sent by the first electronic device or the fourth electronic device;

Determining a target sound pickup apparatus, which is the third electronic apparatus in a sound pickup state, based on a positional relationship between the third electronic apparatus and the first electronic apparatus or the fourth electronic apparatus, to perform an echo cancellation operation for giving the echo cancellation signal to a microphone of the first electronic apparatus or the fourth electronic apparatus, and to perform an echo cancellation operation for giving the echo cancellation signal to the target sound pickup apparatus.

8. The method according to claim 1 or 2, further comprising:

transmitting, by the first electronic device or the fourth electronic device, a detection signal to the at least one third electronic device at a first time;

receiving detection response signals returned by the at least one third electronic device, and recording second time for receiving the detection response signals of the at least one third electronic device;

Determining a network delay of each third electronic device according to the second time and the first time of each third electronic device;

accordingly, after the second target information is obtained, the method further includes:

And after the delay compensation time of the third electronic device corresponding to the second target information is spaced, executing the step of outputting the second target information or giving the second target information to the second electronic device for outputting.

9. A processing apparatus, comprising:

The electronic device comprises an acquisition unit, a first control unit and a second control unit, wherein the acquisition unit is configured to acquire first target information in a current space after communication connection is established between a first electronic device and a second electronic device, and the first target information is from at least one third electronic device connected with the first electronic device;

an output unit configured to output the second target information or give the second target information to the second electronic device for output; wherein the output effect of the second target information is better than the output effect of the first target information;

The output unit is further configured to give the second target information to the second electronic device for output if the first electronic device and the second electronic device have a first communication connection, or obtain hardware configuration information of the first electronic device and the fourth electronic device, and determine that the second target information is output by the first electronic device or the fourth electronic device based on the hardware configuration information; wherein the fourth electronic device, the first electronic device, and the at least one third electronic device are within the current space;

10. A processing apparatus, comprising:

the acquisition unit is further configured to acquire third target information, wherein the third target information is from electronic equipment outside the current space;

The output unit is further configured to determine that the third target information is output by the first electronic device or a fourth electronic device based on attribute information of the third target information and/or environmental information and/or device information in the current space; wherein the fourth electronic device, the first electronic device, and the at least one third electronic device are within the current space.