CN108630201B - Method and device for establishing equipment association - Google Patents

Info

Publication number
CN108630201B
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810187221.2A
Other languages
Chinese (zh)
Other versions
CN108630201A (en)
Inventor
孙鹏飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Moqiu Technology Co ltd
Original Assignee
Beijing Moqiu Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Application filed by Beijing Moqiu Technology Co ltd
Priority to CN201810187221.2A
Publication of CN108630201A
Application granted
Publication of CN108630201B
Legal status: Active

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/08: Speech classification or search
    • G10L15/18: Speech classification or search using natural language modelling
    • G10L15/22: Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L15/26: Speech to text systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention provides a method and a device for establishing device association. The method comprises the following steps: receiving a first natural voice signal; and, when the first natural speech signal includes a device association instruction, transmitting a second natural speech signal. The device association instruction is used for instructing a plurality of devices to establish a device association through natural voice, and the second natural speech signal is used to establish the device association. Because the association is established through natural voice, the complexity and frequency of user operations are greatly reduced.

Description

Method and device for establishing equipment association
Technical Field
The present invention relates to the technical field of artificial intelligence, and more particularly, to a method and an apparatus for establishing device association.
Background
With the development of artificial intelligence technology, the related performance of natural speech processing technology is greatly improved. More and more intelligent devices have natural voice processing capability, such as smart speakers, smart phones, smart tablet computers, internet of things devices, and the like.
However, because smart devices are not associated with one another, each smart device can only operate independently. This is very unfriendly to human-computer interaction: the user must control each smart device individually by activating its voice commands, and cannot control the functions of other devices through the voice interaction entry point of one device. One possible approach is to associate the devices manually through conventional human-computer interaction, such as a web page or a computer application. The disadvantages of this approach are that it is inconvenient to operate and requires the user to associate and disassociate devices frequently.
Therefore, a technique is needed that establishes associations among a plurality of devices through more convenient and more natural interaction.
Disclosure of Invention
Embodiments of the invention provide a method and a device for establishing device association, which establish associations among a plurality of devices through more convenient and more natural interaction.
In order to solve the above problem, the present invention provides a method and an apparatus for establishing a device association, where the method may include:
receiving a first natural voice signal;
when the first natural speech signal includes a device association instruction, transmitting a second natural speech signal; the device association instruction is used for instructing a plurality of devices to establish device association through natural voice; the second natural speech signal is used to establish a device association.
The method may further comprise:
receiving a first natural voice signal;
when the first natural voice signal comprises a device association instruction, monitoring a second natural voice signal; the device association instruction is used for instructing a plurality of devices to establish device association through natural voice; the second natural speech signal is used to establish a device association.
Optionally, the first natural speech signal further comprises one or more combinations of an activation instruction, a device identification, and an operation instruction.
Optionally, the operation instruction includes an operation instruction for a second natural speech signal.
Optionally, the second natural speech signal includes association information, and the association information is used for establishing device association.
Optionally, part or all of the association information is loaded onto the second natural speech signal by modulation.
Optionally, the modulation mode comprises one or more combinations of amplitude modulation, frequency modulation and phase modulation.
Optionally, the association information includes one or more combinations of a device association instruction, a device identifier, a communication address, a communication mode, a communication protocol, a device function, an activation instruction, and an authorization instruction.
Optionally, the method further comprises: establishing an electrical signal-based communication connection of the plurality of devices based on the association information.
Optionally, the format of the second natural speech signal is a default setting or a custom setting.
In order to solve the above problem, the present invention provides an apparatus for establishing device association, which may include:
the first natural voice signal receiving module is used for receiving a first natural voice signal;
the second natural voice signal sending module is used for sending a second natural voice signal when the first natural voice signal comprises a device association instruction; the device association instruction is used for instructing a plurality of devices to establish device association through natural voice; the second natural speech signal is used to establish a device association.
The apparatus may further include:
the first natural voice signal receiving module is used for receiving a first natural voice signal;
the second natural voice signal monitoring module is used for monitoring a second natural voice signal when the first natural voice signal comprises a device association instruction; the device association instruction is used for instructing a plurality of devices to establish device association through natural voice; the second natural speech signal is used to establish a device association.
Optionally, the first natural speech signal further comprises one or more combinations of an activation instruction, a device identification, and an operation instruction.
Optionally, the operation instruction includes an operation instruction for a second natural speech signal.
Optionally, the second natural speech signal includes association information, and the association information is used for establishing device association.
Optionally, part or all of the association information is loaded onto the second natural speech signal by modulation.
Optionally, the modulation mode comprises one or more combinations of amplitude modulation, frequency modulation and phase modulation.
Optionally, the association information includes one or more combinations of a device association instruction, a device identifier, a communication address, a communication mode, a communication protocol, a device function, an activation instruction, and an authorization instruction.
Optionally, the apparatus further comprises:
a communication connection establishing module, which is used for establishing an electrical signal-based communication connection among the plurality of devices based on the association information.
Optionally, the format of the second natural speech signal is a default setting or a custom setting.
In the embodiments of the invention, the natural voice processing capability of the device is used to process the received first natural voice signal and determine whether it includes a device association instruction. When the first natural speech signal includes a device association instruction, a second natural speech signal is transmitted or monitored; the second natural speech signal is used to establish an association among the plurality of devices. When at least one of the plurality of devices has obtained all the association information, the association of the plurality of devices is complete. Because the association is established through natural voice, the complexity and frequency of user operations are greatly reduced.
Drawings
To illustrate the embodiments of the present application or the technical solutions in the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. Obviously, the drawings described below show only some embodiments of the present application.
FIG. 1 is a flow diagram of a method for establishing device associations according to an embodiment of the invention;
FIG. 2 is a flow diagram of a method for establishing device associations according to an embodiment of the invention;
FIG. 3 is a block diagram of an apparatus for establishing device associations according to an embodiment of the present invention;
FIG. 4 is a block diagram of an apparatus for establishing device associations according to an embodiment of the present invention;
FIG. 5 is a schematic diagram of an electronic device according to an embodiment of the invention.
Detailed Description
In order to make the objects, features and advantages of the present invention more apparent and understandable, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is apparent that the described embodiments are only a part of the embodiments of the present application, and not all the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
It will be understood by those within the art that the terms "first", "second", etc. in this application are used only to distinguish one device, module, parameter, etc., from another, and do not denote any particular technical meaning or necessary order therebetween.
Speech signal processing is an interdisciplinary field that combines phonetics and digital signal processing, and it is closely connected with psychoacoustics, linguistics, pattern recognition, artificial intelligence, and other disciplines. It both depends on the development of these disciplines and promotes their progress. The main application branches of speech signal processing at present are: speech coding and decoding for storage and transmission; speech synthesis, which lets a machine "speak"; speech recognition, which lets a machine recognize and understand the content of human language; speaker recognition, which lets a machine identify who is speaking; and speech enhancement, which improves speech clarity and intelligibility. The smart devices in the present invention have speech recognition and speech synthesis capabilities.
Talking to a machine and having it understand what is said is something people have long dreamed of. Speech recognition is a technology by which a machine converts natural speech signals into corresponding text or commands through a process of recognition and understanding; it mainly involves three aspects: feature extraction, pattern matching criteria, and model training. According to the object to be recognized, speech recognition tasks can be roughly divided into three categories: isolated word recognition, keyword recognition (also called keyword detection), and continuous speech recognition. The task of isolated word recognition is to recognize isolated words known in advance, such as "power on" and "power off". The task of continuous speech recognition is to recognize arbitrary continuous speech, such as a sentence or a passage. Keyword detection also operates on a continuous speech stream, but rather than recognizing every word it only detects where known keywords occur, for example the words "computer" and "world" in a passage of speech.
Common application areas of speech recognition include the following. Voice input: compared with keyboard input, it is closer to people's everyday habits and is more natural and efficient. Voice control: operating a device by voice is faster and more convenient than manual control, and can be used in fields such as industrial control, voice dialing systems, smart household appliances, and voice-controlled smart toys. Intelligent dialogue and query: the system acts on the customer's speech and provides users with natural, friendly database retrieval services, such as household services, hotel services, travel agency service systems, ticket booking systems, medical services, banking services, and stock inquiry services. At present, hidden Markov models (HMMs) and artificial neural networks are the mainstream technical methods in speech recognition.
Speech synthesis is a technique for generating artificial speech by mechanical or electronic means. Text-to-speech (TTS) technology is a branch of speech synthesis that converts text generated by a computer or input from outside into intelligible, fluent spoken Chinese and outputs it; it involves language processing, prosodic processing, and acoustic processing. Language processing plays an important role in a text-to-speech system. It mainly simulates the human process of understanding natural language, namely text normalization, word segmentation, syntactic analysis, and semantic analysis, so that the computer fully understands the input text and supplies the pronunciation cues required by the latter two stages. Prosodic processing plans the features of the synthesized speech, such as pitch, duration, and intensity, so that it expresses the intended meaning correctly and sounds more natural. Acoustic processing outputs the speech, that is, the synthesized speech, according to the results of the first two stages.
FIG. 1 is a flowchart of a method for establishing a device association according to an embodiment of the present invention. The method is applied to a device with natural speech processing capability, such as a smartphone, a tablet computer, a smart speaker, a notebook computer, or an Internet of Things device. Natural speech processing capability here includes speech recognition and speech synthesis. The plurality of devices may be divided into a current device and a target device, each of which may be one device or multiple devices.
Step S11, receiving a first natural voice signal;
The current device, which has natural speech processing capability, collects natural speech signals through a sound collection component such as a microphone. The first natural speech signal may be uttered by a user or emitted by another device; the embodiments of the present invention are not limited in this respect. The first natural speech signal may be uttered by the user on the user's own initiative, such as "establish device association". Alternatively, the current device may first put a query to the user, and the user answers it; for example, the current device asks "Does a device association need to be established?" and the user answers "Required". The user's answer may have to follow a standard format, for example when the device recognizes only "yes", or it may be any affirmative answer that the current device can recognize, such as "yes", "need", "good", or "establish"; the embodiments of the present invention are not limited in this respect. The current device may preprocess the received first natural speech signal, for example by noise reduction or enhancement, and the preprocessing may also be performed on another device connected to the current device, such as a cloud server.
Step S12, when the first natural voice signal includes the device association instruction, sending a second natural voice signal; the device association instruction is used for instructing a plurality of devices to establish device association through natural voice; the second natural speech signal is used to establish a device association.
The current device performs speech recognition on the received first natural speech signal to obtain the actual operation instruction. When the operation instruction includes a device association instruction, that is, when the first natural speech signal includes a device association instruction, the current device transmits a second natural speech signal. The device association instruction instructs a plurality of devices to establish a device association through natural voice; in other words, it is the trigger that starts the device association process. The device association instruction may be recognized as an isolated keyword, such as "association"; as a continuous speech stream, such as "association establishment"; or as keywords within a continuous speech stream, such as "establish" and "associate". The keywords or device association instructions may be user-defined rather than generic; the embodiments of the present invention are not limited in this respect. The second natural speech signal is used to establish the association between the current device and the target device, and carries the information that the current device and the target device need in order to establish it.
In this embodiment of the invention, the natural voice processing capability of the device is used to process the received first natural voice signal and determine whether it includes a device association instruction. When the first natural speech signal includes a device association instruction, a second natural speech signal is transmitted; the second natural speech signal is used to establish an association among the plurality of devices. Because the association is established through natural voice, the complexity and frequency of user operations are greatly reduced.
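Steps S11 and S12 can be sketched roughly as follows. This is a minimal illustration that works on text already produced by speech recognition; the keyword list, the function names, and the wording of the outgoing request are assumptions made for the example and are not specified by the source.

```python
from typing import Callable

# Hypothetical trigger words; the source leaves the exact wording open.
ASSOCIATION_KEYWORDS = ("association", "associate")


def contains_association_instruction(transcript: str) -> bool:
    """Return True if the recognized text contains a device association instruction."""
    text = transcript.lower()
    return any(keyword in text for keyword in ASSOCIATION_KEYWORDS)


def handle_first_signal(transcript: str, send: Callable[[str], None]) -> bool:
    """Steps S11 and S12 applied to already-recognized text.

    `send` stands in for speech synthesis plus playback of the second signal.
    Returns True if a second signal was sent.
    """
    if not contains_association_instruction(transcript):
        return False
    send("device association request")  # placeholder wording (assumption)
    return True


if __name__ == "__main__":
    spoken = []
    handle_first_signal("Please establish association", spoken.append)
    print(spoken)
```

Passing `send` as a callable keeps the trigger logic separate from audio output, so the same check could drive a real text-to-speech backend or a test double.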
FIG. 2 is a flowchart of a method for establishing a device association according to an embodiment of the present invention. The method is applied to a device with natural speech processing capability, such as a smartphone, a tablet computer, a smart speaker, a notebook computer, or an Internet of Things device. Natural speech processing capability here includes speech recognition and speech synthesis. The plurality of devices may be divided into a current device and a target device, each of which may be one device or multiple devices.
Step S21, receiving a first natural voice signal;
The current device, which has natural speech processing capability, collects natural speech signals through a sound collection component such as a microphone. The first natural speech signal may be uttered by a user or emitted by another device; the embodiments of the present invention are not limited in this respect. The first natural speech signal may be uttered by the user on the user's own initiative, such as "establish device association". Alternatively, the current device may first put a query to the user, and the user answers it; for example, the current device asks "Does a device association need to be established?" and the user answers "Required". The user's answer may have to follow a standard format, for example when the device recognizes only "yes", or it may be any affirmative answer that the current device can recognize, such as "yes", "need", "good", or "establish"; the embodiments of the present invention are not limited in this respect. The current device may preprocess the received first natural speech signal, for example by noise reduction or enhancement, and the preprocessing may also be performed on another device connected to the current device, such as a cloud server.
Step S22, when the first natural voice signal includes the device association instruction, monitoring a second natural voice signal; the device association instruction is used for instructing a plurality of devices to establish device association through natural voice; the second natural speech signal is used to establish a device association.
The current device performs speech recognition on the received first natural speech signal to obtain the actual operation instruction. When the operation instruction includes a device association instruction, that is, when the first natural voice signal includes a device association instruction, the current device listens for a second natural voice signal. In theory, the second natural speech signal is emitted by another device that is to be associated, but in practice it cannot be ruled out that the user utters it to help the devices establish the association. The device association instruction instructs a plurality of devices to establish a device association through natural voice; in other words, it is the trigger that starts the device association process. The device association instruction may be recognized as an isolated keyword, such as "association"; as a continuous speech stream, such as "association establishment"; or as keywords within a continuous speech stream, such as "establish" and "associate". The keywords or device association instructions may be user-defined rather than generic; the embodiments of the present invention are not limited in this respect. The second natural speech signal is used to establish the association between the current device and the target device, and carries the information that the current device and the target device need in order to establish it.
In this embodiment of the invention, the natural voice processing capability of the device is used to process the received first natural voice signal and determine whether it includes a device association instruction. When the first natural voice signal includes a device association instruction, a second natural voice signal is monitored; the second natural voice signal is used to establish an association among the plurality of devices. Because the association is established through natural voice, the complexity and frequency of user operations are greatly reduced.
For both of the above embodiments, the process of establishing the association may include sending and receiving a plurality of second natural speech signals, each of which conveys a particular piece of information. For example, after receiving a first natural speech signal that includes a device association instruction, the current device first transmits one second natural speech signal, then listens for the target device's response to it (that is, receives another second natural speech signal), then transmits a further second natural speech signal, and so on. Conversely, after receiving the first natural speech signal that includes the device association instruction, the current device may first listen for a second natural speech signal, respond after receiving it (that is, transmit a second natural speech signal), then listen for a further second natural speech signal, and so on. In either case, after transmitting a second natural speech signal the device enters a listening state, and after receiving a second natural speech signal it enters a transmitting state; sending and receiving are repeated until the association information needed to establish the association has been obtained. The device may of course also be configured to send several second natural speech signals before entering the listening state. If the device waits for a period of time after sending a second natural speech signal without receiving feedback or a new second natural speech signal, it may also retransmit automatically; the embodiments of the present invention are not limited in this respect. The process of establishing the association may further include an instruction that indicates the end of the association process.
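The alternation between sending and listening, including automatic retransmission, can be sketched as a small loop. This is only an illustration under stated assumptions: the field names, the "request ..." and "field=value" message wording, the retry limit, and the `FakePeer` test double are all hypothetical, and real devices would exchange synthesized speech rather than strings.

```python
from typing import Callable, Dict, Iterable, Optional

# Hypothetical set of fields the current device needs before it is "associated".
REQUIRED_FIELDS = ("device_id", "address", "protocol")


def exchange(
    required: Iterable[str],
    send: Callable[[str], None],
    listen: Callable[[], Optional[str]],
    max_retries: int = 3,
) -> Dict[str, str]:
    """Alternate sending and listening until every required field is known.

    `listen` returns a reply of the form "field=value", or None on timeout.
    """
    collected: Dict[str, str] = {}
    for field in required:
        for _attempt in range(max_retries):
            send(f"request {field}")      # transmitting state
            reply = listen()              # listening state
            if reply is None:
                continue                  # timed out: retransmit automatically
            key, _, value = reply.partition("=")
            if key == field:
                collected[field] = value
                break
        else:
            raise TimeoutError(f"no answer for {field!r}")
    send("association complete")          # end-of-process instruction
    return collected


class FakePeer:
    """In-memory stand-in for the target device; drops the first request."""

    def __init__(self, info: Dict[str, str]) -> None:
        self.info = info
        self.sent = []
        self.drop_next = True

    def send(self, message: str) -> None:
        self.sent.append(message)

    def listen(self) -> Optional[str]:
        if self.drop_next:
            self.drop_next = False
            return None                   # simulate a lost response
        field = self.sent[-1].split(" ", 1)[1]
        return f"{field}={self.info[field]}"
```

With the fake peer, the first request goes unanswered and is sent again, after which the remaining fields are collected one exchange at a time.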
When at least one of the plurality of devices has obtained all the association information, the association of the plurality of devices may be considered complete. The device that has obtained all the association information then acts as a master device among the plurality of devices and can control the other devices: the master device actively sends instructions and obtains feedback, and the other devices can only respond to instructions. When every device has obtained all the association information, the user can control any of the devices through any other one of them; that is, every device can actively send instructions and obtain feedback. The instruction may be an operation instruction, an information acquisition instruction, a call instruction, and so on; the embodiments of the present invention are not limited in this respect. Once the association has been established, the user can control other devices through a single device, which greatly improves convenience of use.
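The distinction between a master device and devices that can only respond can be illustrated with a short check. The representation below, a mapping from each device to the set of peers whose association information it holds, is an assumption made for the example.

```python
from typing import Dict, Set


def controlling_devices(known_peers: Dict[str, Set[str]]) -> Set[str]:
    """Return the devices that hold association information for every other device.

    `known_peers` maps a device name to the names of the peers it has
    information about (a hypothetical representation).
    """
    all_devices = set(known_peers)
    return {
        device
        for device, peers in known_peers.items()
        if all_devices - {device} <= peers
    }


if __name__ == "__main__":
    # Only A knows both other devices, so only A can act as master.
    print(controlling_devices({"A": {"B", "C"}, "B": {"A"}, "C": set()}))
```

When every device appears in the result, any device can be used to control any other, which corresponds to the fully symmetric case described above.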
Optionally, the first natural speech signal may further include one or more of an activation instruction, a device identifier, and an operation instruction. The activation instruction instructs the device to enter a natural-voice interaction state. It is a voice instruction that starts an interaction process based on natural voice; it is generally predefined and can be customized. While asleep or on standby, the device continuously listens for this instruction, and when a collected voice signal matches it, the device enters the natural-voice interaction state. The device identifier includes the identifier of the current device and the identifier of the target device, each of which may be the name or code of the device. A plurality of devices may receive the first natural speech signal at the same time while the user wishes only some of them to establish an association; in this case the first natural speech signal may directly or indirectly include the identifier of the current device and/or the target device. The operation instruction reflects the user's actual operation requirement, which may concern the current device or a particular target device. For example, in the first natural voice signal "establish association, open xxx file in device D" received by current device A, "establish association" is the device association instruction, "device D" is the identifier of the target device, and "open xxx file" is the operation instruction.
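The decomposition of the example utterance can be sketched with simple pattern matching on the recognized text. This is a toy illustration: the regular expressions, the `FirstSignal` type, and the assumption that operations start with "open" are hypothetical and far simpler than real language understanding.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass
class FirstSignal:
    association: bool              # device association instruction present?
    target_device: Optional[str]   # identifier of the target device, if any
    operation: Optional[str]       # operation instruction, if any


def parse_first_signal(transcript: str) -> FirstSignal:
    """Split recognized text into the components named in the description."""
    text = transcript.strip()
    association = "establish association" in text.lower()
    device = re.search(r"\bdevice\s+(\w+)", text, flags=re.IGNORECASE)
    # Assumption for the example: operations begin with the verb "open".
    operation = re.search(
        r"\b(open .+?)(?:\s+in\s+device\b|$)", text, flags=re.IGNORECASE
    )
    return FirstSignal(
        association=association,
        target_device=device.group(1) if device else None,
        operation=operation.group(1) if operation else None,
    )


if __name__ == "__main__":
    print(parse_first_signal("establish association, open xxx file in device D"))
```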
Further, the operation instruction may include an operation instruction concerning the second natural speech signal. For example, if the first natural voice signal received by the current device includes the operation instruction "send device association request", the current device sends a second natural voice signal; if it includes the operation instruction "wait for receiving device association request", the current device listens for a second natural speech signal. In these examples the first natural speech signal directly includes the operation instruction concerning the second natural speech signal; in practice the operation instruction may also be included indirectly, and the embodiments of the present invention are not limited as to how. Whether a device sends or listens for the second natural voice signal upon receiving a first natural voice signal that includes the device association instruction may also be predefined. For example, after entering the device association process, device A sends a device association request by default, while device B and device C wait to receive it. The predefinition may exist because the devices come from the same vendor, or because the user configured the devices before use; the embodiments of the present invention are not limited in this respect.
Optionally, the second natural speech signal includes association information, and the association information is used for establishing the device association. In order to associate the plurality of devices, at least one of them must obtain information about the other devices. In other words, the association information serves the establishment of the device association.
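The choice between sending and listening can be sketched as a lookup that prefers an explicit operation instruction and otherwise falls back to a predefined default. The `Role` enum, the default table for devices A, B, and C, and the fallback to listening for unknown devices are assumptions made for this example.

```python
from enum import Enum
from typing import Dict


class Role(Enum):
    SEND = "send"
    LISTEN = "listen"


# Predefined defaults taken from the example in the text: A sends, B and C wait.
DEFAULT_ROLES: Dict[str, Role] = {"A": Role.SEND, "B": Role.LISTEN, "C": Role.LISTEN}


def choose_role(device_id: str, transcript: str) -> Role:
    """Pick the device's role for the second natural speech signal."""
    text = transcript.lower()
    if "send device association request" in text:
        return Role.SEND
    if "wait for receiving device association request" in text:
        return Role.LISTEN
    # No explicit instruction: use the predefined role; unknown devices
    # listen by default (an assumption, not stated in the source).
    return DEFAULT_ROLES.get(device_id, Role.LISTEN)


if __name__ == "__main__":
    print(choose_role("B", "establish association, send device association request"))
    print(choose_role("A", "establish association"))
```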
Optionally, part or all of the association information is loaded onto the second natural speech signal by modulation. The modulation mode comprises one or more of amplitude modulation, frequency modulation, and phase modulation. When the association is established in plain natural voice, the association information to be exchanged is rendered as a second natural voice signal spoken between the devices and is obtained through voice interaction. When the association is established with modulated natural speech, the device loads the digital signal of the association information onto the second natural speech signal by, for example, phase or amplitude modulation, and sends and receives it as audio.
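As a rough illustration of carrying digital association information in audio, the sketch below uses binary frequency-shift keying, a simple digital form of frequency modulation. It is a simplification under stated assumptions: pure sine tones stand in for the speech carrier, the channel is noiseless and perfectly synchronized, and the sample rate, bit length, tone frequencies, and payload are all hypothetical values chosen so that each bit spans a whole number of tone cycles.

```python
import math
from typing import List, Sequence

SAMPLE_RATE = 8000       # samples per second (assumed)
SAMPLES_PER_BIT = 80     # 100 bits per second (assumed)
FREQ_ZERO = 1000.0       # tone for a 0 bit, in Hz (assumed)
FREQ_ONE = 2000.0        # tone for a 1 bit, in Hz (assumed)


def modulate(payload: bytes) -> List[float]:
    """Encode each bit, most significant first, as a short sine burst."""
    samples: List[float] = []
    for byte in payload:
        for bit_index in range(7, -1, -1):
            freq = FREQ_ONE if (byte >> bit_index) & 1 else FREQ_ZERO
            for n in range(SAMPLES_PER_BIT):
                samples.append(math.sin(2 * math.pi * freq * n / SAMPLE_RATE))
    return samples


def _tone_energy(chunk: Sequence[float], freq: float) -> float:
    """Correlate a chunk with a cosine and a sine at freq and return the energy."""
    i = sum(s * math.cos(2 * math.pi * freq * n / SAMPLE_RATE) for n, s in enumerate(chunk))
    q = sum(s * math.sin(2 * math.pi * freq * n / SAMPLE_RATE) for n, s in enumerate(chunk))
    return i * i + q * q


def demodulate(samples: Sequence[float]) -> bytes:
    """Recover the bytes by comparing the energy at the two tone frequencies."""
    bits: List[int] = []
    for start in range(0, len(samples) - SAMPLES_PER_BIT + 1, SAMPLES_PER_BIT):
        chunk = samples[start:start + SAMPLES_PER_BIT]
        one = _tone_energy(chunk, FREQ_ONE) > _tone_energy(chunk, FREQ_ZERO)
        bits.append(1 if one else 0)
    out = bytearray()
    for i in range(0, len(bits) - 7, 8):
        value = 0
        for bit in bits[i:i + 8]:
            value = (value << 1) | bit
        out.append(value)
    return bytes(out)
```

For example, `demodulate(modulate(b"device A;192.0.2.10"))` returns the original bytes in this idealized setting; a practical acoustic link would additionally need synchronization, noise tolerance, and error detection.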
Further, the association information includes one or more of a device association instruction, a device identification, a communication address, a communication mode, a communication protocol, a device function, an activation instruction, and an authorization instruction. The device association instruction is used to instruct the target device to enter the device association procedure. The device identification includes the identification of the target device and the identification of the current device. The communication mode includes wired and wireless modes. The communication address is a device address, which may be the identification of any device that sends or receives data in data communication, or may be a first sub-channel address recognized by the channel connection device. The communication protocol refers to the rules and conventions that two entities must follow to complete a communication or service. The device function refers to a function of a device that has established, or is to establish, the association. It may be an explicit function; for example, a smart speaker has the function of playing music. It may also be an implicit function; for example, a smart tablet has many functions, such as playing audio/video files, opening text files, and browsing web pages, and other devices need not learn these functions in advance but instead send a further request or query to the smart tablet after receiving an operation instruction. The activation instruction is used to instruct the device to enter the natural-voice interaction state; it is a voice instruction that starts the natural-voice interaction process, is generally predefined, and can be customized. 
While in sleep or standby, the device continuously listens for the activation instruction, and when a collected voice signal matches it, the device enters the natural-voice interaction state. Given the characteristics of natural speech, pattern recognition is an important area of current natural speech processing. Accordingly, to verify the identity of the user, some smart devices recognize only the natural voice of their owner. When another device with natural voice processing capability is to "communicate" with or control such a device, the second natural voice signal that it emits is processed to simulate the owner's natural voice. Alternatively, an authorization instruction may be preset: when the second natural voice signal sent by the former device includes the authorization instruction, the latter device recognizes the former's natural voice signal, or treats the former as having control rights. In some embodiments, the authorization instruction may be the activation instruction; that is, a single instruction both activates the device into the human-computer interaction state and serves as identity verification. The association information to be used may be determined by default or customized according to the user's needs in the actual scene; the embodiments of the present invention are not limited in this respect.
Optionally, the method further includes step S3: establishing an electrical-signal-based communication connection among the plurality of devices based on the association information. Compared with electrical-signal-based communication connections, speech signals are inferior in many respects, including transmission distance and transmission quality. Therefore, once certain association information has been obtained among the devices, communication and/or control need no longer be carried out by natural voice signals. For example, conventional electrical-signal-based communication, such as over wired Ethernet, a wireless local area network, or a wireless wide area network, is performed based on one or more of the communication identifier, the communication address, the communication protocol, and the communication mode in the association information. The communication identifier may be a device identifier or an IP address used for electrical-signal-based communication. The communication connection may be established before the device association is completed, in which case the connection is used to transmit the association information still required for the association. The communication connection may also be established after the device association is completed. In that case, each device stores one or more items of information about the associated devices, such as device identification, device function, electrical-signal-based communication address, and communication protocol, and the devices can use the stored information to communicate with and/or control one another more quickly.
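How a device might store the information about associated devices and later use it for electrical-signal-based communication can be sketched as a small registry. The `AssociationInfo` fields follow the items listed in the description, but the class names, the default values, and the URL-style endpoint string are hypothetical choices made for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class AssociationInfo:
    device_id: str                       # device identification
    address: str                         # electrical-signal-based communication address
    protocol: str = "tcp"                # communication protocol (assumed default)
    mode: str = "wireless"               # communication mode: "wired" or "wireless"
    functions: List[str] = field(default_factory=list)  # known device functions


class AssociationRegistry:
    """Per-device store of the peers with which an association has been established."""

    def __init__(self) -> None:
        self._peers: Dict[str, AssociationInfo] = {}

    def store(self, info: AssociationInfo) -> None:
        self._peers[info.device_id] = info

    def endpoint_for(self, device_id: str) -> Optional[str]:
        """Return where to reach a peer over the electrical-signal link, if known."""
        info = self._peers.get(device_id)
        return None if info is None else f"{info.protocol}://{info.address}"

    def devices_with(self, function: str) -> List[str]:
        """List the associated devices that advertise a given explicit function."""
        return sorted(d for d, i in self._peers.items() if function in i.functions)
```

A device that has stored `AssociationInfo("device B", "192.0.2.2", functions=["play video"])` could then look up `endpoint_for("device B")` instead of repeating the voice exchange.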
Optionally, the format of the second natural speech signal is a default setting or a custom setting. The format of the second natural speech signal refers to the format of the information that it includes. For example, the second natural speech signal "request to establish association, I am device A, and the IP address is yyyy" has the format: device association instruction, current device identification, communication address. The format may be a default setting or may be customized according to the user's needs in the actual scene; the customization may be set by the user before using the association function or may be included in the first natural speech signal, and the embodiments of the present invention are not limited in this respect.
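One way to picture a configurable format is as an ordered list of field names that both devices share. The sketch below composes and parses the text of a second natural speech signal from such a list; the field names, the fixed phrases, and the comma-separated layout are assumptions for illustration, and values containing commas are not handled.

```python
from typing import Dict, List, Union

ASSOCIATION_PHRASE = "request to establish association"   # hypothetical wording
FIELD_PREFIXES = {                                         # hypothetical spoken prefixes
    "device_id": "I am ",
    "address": "IP address ",
}

DEFAULT_FORMAT = ["association_instruction", "device_id", "address"]


def compose(fmt: List[str], values: Dict[str, str]) -> str:
    """Build the utterance text by emitting the fields in the order given by fmt."""
    parts = []
    for name in fmt:
        if name == "association_instruction":
            parts.append(ASSOCIATION_PHRASE)
        else:
            parts.append(FIELD_PREFIXES[name] + values[name])
    return ", ".join(parts)


def parse(fmt: List[str], utterance: str) -> Dict[str, Union[str, bool]]:
    """Recover the fields from an utterance that follows the same format."""
    parts = [p.strip() for p in utterance.split(",")]
    if len(parts) != len(fmt):
        raise ValueError("utterance does not match the expected format")
    result: Dict[str, Union[str, bool]] = {}
    for name, part in zip(fmt, parts):
        if name == "association_instruction":
            if part != ASSOCIATION_PHRASE:
                raise ValueError("missing device association instruction")
            result[name] = True
        else:
            prefix = FIELD_PREFIXES[name]
            if not part.startswith(prefix):
                raise ValueError(f"missing field: {name}")
            result[name] = part[len(prefix):]
    return result
```

A custom setting then amounts to passing a different `fmt` list, for example `["device_id", "address"]`, on both the sending and the listening device.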
For clarity, two embodiments are described in detail below.
In the first embodiment, a method is presented for two devices to establish association and communication by natural voice. For example, a user has a device A and a device B, where device A is a smartphone and device B is a smart tablet. Device A and device B come from the same vendor, Sky, and both have natural speech processing capability. Assume that the activation voice command of both devices is "Hey Sky"; that is, when the user issues the voice command "Hey Sky" with the two devices nearby, both devices enter the natural-voice interaction state and listen for the user's next voice command. It should be noted that devices from the same vendor need not share the same activation command: the activation command may be customized, and devices from different vendors may also use the same activation command. The user then issues the predefined device association instruction "establish device association", that is, the first natural voice signal. Both devices receive and recognize the device association instruction at the same time and enter the natural-voice-based association process. Because the two devices are products of the same vendor, both may store a predefined association procedure in which the smartphone sends the second natural voice signal first and the smart tablet waits to receive it. For example, the natural voice signal sent by the smartphone, that is, the second natural voice signal, includes identification information and address information: "request to establish a connection, I am device xxxx, IP address yyyy". After receiving this voice signal, the smart tablet may reply with the natural voice signal "connection request received, I am device xxx, IP address yyy". 
At this point, each device knows the device identification and IP address of the other, and the two can establish communication. Further, the two devices may exchange other association information by natural speech signals and/or electrical signals until the devices have established the association. Here, the communication connection means that both parties have obtained IP addresses in, for example, a local area network; the two smart devices thus know that they are in the same area and can communicate through the obtained IP addresses. After the association between device A and device B is completed, in one embodiment, the user may interact with the smartphone by natural voice and thereby control the smart tablet. For example, after the user says "Hey Sky", the smart tablet does not respond, and only the smartphone enters the natural-voice-based human-computer interaction state. The user may then say "play the video on yyy"; the smartphone recognizes this as a control command whose target device is the smart tablet yyy, where the target device may be determined by inference from the location of the video file "yyy". The smartphone then sends the instruction "play video yyy" to the smart tablet through the tablet's communication address. At this point, association, communication, and control among the devices are complete.
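The exchange in the first embodiment can be simulated without any audio by passing transcripts between two objects. This is a sketch under stated assumptions: the `Device` class, its method names, the exact reply wording, and the example IP addresses are hypothetical, and the activation step with "Hey Sky" is taken to have already happened.

```python
import re
from typing import Dict, Optional

ASSOCIATION_PHRASE = "establish device association"


class Device:
    def __init__(self, name: str, ip: str, sends_first: bool) -> None:
        self.name = name
        self.ip = ip
        self.sends_first = sends_first    # predefined role stored by the vendor
        self.associating = False
        self.peers: Dict[str, str] = {}   # peer identification -> peer IP address

    def hear_user(self, transcript: str) -> Optional[str]:
        """Handle the first signal; return a second signal if this device speaks first."""
        if ASSOCIATION_PHRASE not in transcript.lower():
            return None
        self.associating = True
        if self.sends_first:
            return f"request to establish a connection, I am {self.name}, IP address {self.ip}"
        return None

    def hear_device(self, utterance: str) -> Optional[str]:
        """Handle a second signal from another device; reply only to a request."""
        if not self.associating:
            return None
        match = re.search(r"I am (.+?), IP address (\S+)", utterance)
        if match is None:
            return None
        peer_name, peer_ip = match.groups()
        self.peers[peer_name] = peer_ip
        if utterance.startswith("request"):
            return f"connection request received, I am {self.name}, IP address {self.ip}"
        return None


phone = Device("device A", "192.0.2.1", sends_first=True)
tablet = Device("device B", "192.0.2.2", sends_first=False)

request = phone.hear_user("establish device association")   # the phone speaks first
tablet.hear_user("establish device association")            # the tablet starts listening
reply = tablet.hear_device(request)                          # the tablet answers the request
phone.hear_device(reply)                                     # the phone records the tablet
```

After these four calls each device holds the other's identification and address in `peers`, which is the state the description calls being able to establish communication.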
In the second embodiment, a method is presented for establishing an association by natural voice between two devices from different vendors. For example, a user has a device C from a first vendor and a device D from a second vendor. Both have natural voice processing capability, but their activation voice commands differ: the activation voice command of device C is "Hey Sky" and that of device D is "Hey Blue". Because the activation voice commands differ, the user cannot start the association process on both devices with a single voice command. Thus, in this embodiment, the user first activates device C with the voice command "Hey Sky" and then directs device C to enter the association process with a further voice command, namely the first voice signal "wait to establish device association", in which the device association instruction is "establish device association". Because the first voice signal includes the keyword "wait", device C, according to the semantic definition rule, enters a listening state, that is, it waits to receive the second voice signal. Subsequently, the user activates device D with the voice command "Hey Blue" and issues the first voice signal "establish device association with device C of the first vendor using the activation command Hey Sky". From this signal device D obtains, by analysis, the device association instruction "establish device association", the target device (device C), and the activation command of device C ("Hey Sky"), after which device D enters the association process. In this state device D first issues a voice signal to device C using device C's activation voice command: "Hey Sky, I am device xxxx, requesting to establish device association and communication", which is the second voice signal. 
Further, device C receives the voice activation instruction and replies "Hey Blue, I am device y, request received", which is also a second voice signal. The two devices then use other predefined voice commands to exchange the association information required to establish the device association. In another embodiment, the user only needs to input the first voice signal to device D: "Hey Blue, establish association and communication with device C, using the activation voice command Hey Sky". This signal includes the activation instruction of the current device, the device association instruction, the identification of the target device, and the activation instruction of the target device. Device D then automatically generates the corresponding voice command "Hey Sky, I am device xxxx, requesting to establish device association and communication". This voice command is the second voice signal from the perspective of device D and the first voice signal for device C. Device C is activated by the "Hey Sky" portion and recognizes the device association instruction "establish device association", which causes device C to enter the flow of establishing the device association. In this embodiment, "establish device association" is a special voice control command: after recognizing it, the device enters a procedure different from other human-computer interaction procedures, whose purpose is not to feed back information that the user requested but to establish an association channel with another device.
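The step in which device D turns the user's first voice signal into a second voice signal addressed to the other vendor's device can be sketched as follows. The regular expression, the function name `build_second_signal`, and the generated wording are hypothetical; the sketch works on recognized text and assumes the user phrases the command in the pattern quoted in the second embodiment.

```python
import re
from typing import Optional, Tuple

# Matches "establish device association with device C ... activation command Hey Sky".
FIRST_SIGNAL_PATTERN = re.compile(
    r"establish device association with (?P<target>device \w+)"
    r".*?activation command (?P<wake>.+)$",
    re.IGNORECASE,
)


def build_second_signal(first_signal: str, own_id: str) -> Optional[Tuple[str, str]]:
    """Return (target device, utterance to speak), or None if nothing matched."""
    match = FIRST_SIGNAL_PATTERN.search(first_signal.strip().rstrip("."))
    if match is None:
        return None
    target = match.group("target")
    wake_phrase = match.group("wake").strip()
    # Prefix the target's own activation command so that the target device wakes up.
    utterance = (
        f"{wake_phrase}, I am {own_id}, "
        "request to establish device association and communication"
    )
    return target, utterance
```

Because the generated utterance still contains the words "establish device association", the receiving device can treat it as its own first voice signal, as the description notes.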
As shown in FIG. 3, the present invention provides an apparatus for establishing device association, which is used for establishing association between intelligent devices with natural speech processing capability. The natural speech processing capability here includes: speech recognition and speech synthesis capabilities. The device comprises:
a first natural voice signal receiving module 31, configured to receive a first natural voice signal;
a second natural speech signal transmission module 32, configured to transmit a second natural speech signal when the first natural speech signal includes a device association instruction; the device association instruction is used for instructing a plurality of devices to establish device association through natural voice; the second natural speech signal is used to establish a device association.
As shown in FIG. 4, the present invention provides an apparatus for establishing device association, which is used to establish association between intelligent devices with natural speech processing capability. The natural speech processing capability here includes speech recognition and speech synthesis capabilities. The apparatus comprises:
a first natural voice signal receiving module 41, configured to receive a first natural voice signal;
a second natural speech signal monitoring module 42, configured to monitor a second natural speech signal when the first natural speech signal includes a device association instruction; the device association instruction is used for instructing a plurality of devices to establish device association through natural voice; the second natural speech signal is used to establish a device association.
In the embodiment of the present invention, the natural speech processing capability of the device is used to process the first natural speech signal received by the first natural voice signal receiving module 31 or 41 and to determine whether it includes a device association instruction. When the first natural voice signal includes the device association instruction, the second natural speech signal transmission module 32 sends a second natural voice signal, or the second natural speech signal monitoring module 42 monitors for a second natural voice signal; the second natural voice signal is used to establish an association among a plurality of devices. This natural-voice-based way of establishing association greatly reduces the complexity and frequency of user operations.
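The module structure of the two apparatuses can be pictured as small cooperating classes. This is a sketch only: the class names, the substring test for the device association instruction, and the recorded utterance are assumptions for illustration, with the comments indicating which numbered module each class is meant to mirror.

```python
from typing import List

ASSOCIATION_PHRASE = "establish device association"   # hypothetical wording


class FirstSignalReceivingModule:
    """Mirrors modules 31 and 41: receives the first natural voice signal."""

    def contains_association_instruction(self, transcript: str) -> bool:
        return ASSOCIATION_PHRASE in transcript.lower()


class SecondSignalSendingModule:
    """Mirrors module 32: sends the second natural voice signal."""

    def __init__(self) -> None:
        self.sent: List[str] = []

    def send(self, utterance: str) -> None:
        self.sent.append(utterance)   # stands in for speech synthesis and playback


class SecondSignalMonitoringModule:
    """Mirrors module 42: monitors for the second natural voice signal."""

    def __init__(self) -> None:
        self.listening = False

    def start(self) -> None:
        self.listening = True


class SendingApparatus:
    """Apparatus of the sending kind: receiving module plus sending module."""

    def __init__(self, device_id: str) -> None:
        self.device_id = device_id
        self.receiver = FirstSignalReceivingModule()
        self.sender = SecondSignalSendingModule()

    def on_first_signal(self, transcript: str) -> bool:
        if not self.receiver.contains_association_instruction(transcript):
            return False
        self.sender.send(f"request to establish association, I am {self.device_id}")
        return True


class MonitoringApparatus:
    """Apparatus of the monitoring kind: receiving module plus monitoring module."""

    def __init__(self) -> None:
        self.receiver = FirstSignalReceivingModule()
        self.monitor = SecondSignalMonitoringModule()

    def on_first_signal(self, transcript: str) -> bool:
        if not self.receiver.contains_association_instruction(transcript):
            return False
        self.monitor.start()
        return True
```

In both apparatuses the second module is triggered only when the first signal contains the device association instruction, which is the condition stated for modules 32 and 42.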
Optionally, the first natural speech signal further comprises one or more combinations of an activation instruction, a device identification, and an operation instruction.
Optionally, the operation instruction includes an operation instruction for a second natural speech signal.
Optionally, the second natural speech signal includes association information, and the association information is used for establishing device association.
Optionally, part or all of the association information is loaded onto the second natural speech signal by modulation.
Optionally, the modulation mode comprises one or more combinations of amplitude modulation, frequency modulation and phase modulation.
Optionally, the association information includes one or more combinations of a device association instruction, a device identifier, a communication address, a communication mode, a communication protocol, a device function, an activation instruction, and an authorization instruction.
Optionally, the apparatus further comprises:
a communication connection establishing module 5, configured to establish an electrical-signal-based communication connection among the plurality of devices based on the association information.
Optionally, the format of the second natural speech signal is a default setting or a custom setting.
FIG. 5 is a schematic diagram of an electronic device according to an embodiment of the invention. As shown in FIG. 5, the electronic device includes:
a memory 53 and one or more processors 51;
wherein the memory 53 is communicatively coupled to the one or more processors 51, the memory 53 having stored therein instructions executable by the one or more processors 51 to cause the one or more processors 51 to perform: receiving a first natural voice signal; when the first natural speech signal includes a device association instruction, transmitting a second natural speech signal; the device association instruction is used for instructing a plurality of devices to establish device association through natural voice; the second natural speech signal is used to establish a device association.
Alternatively, the instructions are executed by the one or more processors 51 to cause the one or more processors 51 to perform:
receiving a first natural voice signal;
when the first natural voice signal comprises a device association instruction, monitoring a second natural voice signal; the device association instruction is used for instructing a plurality of devices to establish device association through natural voice; the second natural speech signal is used to establish a device association.
Optionally, the first natural speech signal further comprises one or more combinations of an activation instruction, a device identification, and an operation instruction.
Optionally, the operation instruction includes an operation instruction for a second natural speech signal.
Optionally, the second natural speech signal includes association information, and the association information is used for establishing device association.
Optionally, part or all of the association information is loaded onto the second natural speech signal by modulation.
Optionally, the modulation mode comprises one or more combinations of amplitude modulation, frequency modulation and phase modulation.
Optionally, the association information includes one or more combinations of a device association instruction, a device identifier, a communication address, a communication mode, a communication protocol, a device function, an activation instruction, and an authorization instruction.
Optionally, the method further comprises: establishing an electrical signal-based communication connection of the plurality of devices based on the association information.
Optionally, the format of the second natural speech signal is a default setting or a custom setting.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of a hardware embodiment, a software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
The above description is only a preferred embodiment of the present invention, and is not intended to limit the scope of the present invention.

Claims (22)

1. A method for establishing device associations, the method comprising:
receiving a first natural voice signal;
when the first natural voice signal comprises a device association instruction, sending a second natural voice signal according to the device association instruction; the device association instruction is used for instructing a plurality of devices to establish device association through natural voice; the second natural speech signal is used to establish an association between the plurality of devices.
2. A method for establishing device associations, the method comprising:
receiving a first natural voice signal;
when the first natural voice signal comprises a device association instruction, monitoring a second natural voice signal according to the device association instruction; the device association instruction is used for instructing a plurality of devices to establish device association through natural voice; the second natural speech signal is used to establish an association between the plurality of devices.
3. The method of claim 1 or 2, the first natural speech signal further comprising one or more combinations of activation instructions, device identification, operational instructions.
4. The method of claim 3, the operational instructions comprising operational instructions for a second natural speech signal.
5. The method according to claim 1 or 2, the second natural speech signal comprising association information for establishing a device association.
6. The method of claim 5, wherein part or all of the association information is loaded onto the second natural speech signal by modulation.
7. The method of claim 6, the modulation scheme comprising one or more combinations of amplitude modulation, frequency modulation, and phase modulation.
8. The method of claim 5, wherein the association information comprises one or more of a combination of device association instructions, device identifiers, communication addresses, communication modes, communication protocols, device functions, activation instructions, and authorization instructions.
9. The method of claim 5, further comprising: establishing an electrical signal-based communication connection of the plurality of devices based on the association information.
10. The method of claim 1 or 2, the format of the second natural speech signal being a default setting or a custom setting.
11. An apparatus for establishing device associations, the apparatus comprising:
a first natural voice signal receiving module, configured to receive a first natural voice signal;
a second natural voice signal sending module, configured to send a second natural voice signal according to the device association instruction when the first natural voice signal comprises the device association instruction; the device association instruction is used for instructing a plurality of devices to establish device association through natural voice; the second natural speech signal is used to establish an association between the plurality of devices.
12. An apparatus for establishing device associations, the apparatus comprising:
a first natural voice signal receiving module, configured to receive a first natural voice signal;
a second natural voice signal monitoring module, configured to monitor a second natural voice signal according to the device association instruction when the first natural voice signal comprises the device association instruction; the device association instruction is used for instructing a plurality of devices to establish device association through natural voice; the second natural speech signal is used to establish an association between the plurality of devices.
13. The apparatus of claim 11 or 12, the first natural speech signal further comprising one or more combinations of activation instructions, device identifications, operational instructions.
14. The apparatus of claim 13, the operational instructions comprising operational instructions for a second natural speech signal.
15. The apparatus according to claim 11 or 12, the second natural speech signal comprising association information for establishing a device association.
16. The apparatus according to claim 15, wherein part or all of the association information is loaded onto the second natural speech signal by modulation.
17. The apparatus of claim 16, the modulation scheme comprising one or more combinations of amplitude modulation, frequency modulation, and phase modulation.
18. The apparatus of claim 16, the association information comprising one or more combinations of device association instructions, device identifications, communication addresses, communication modes, communication protocols, device functions, activation instructions, and authorization instructions.
19. The apparatus of claim 16, the apparatus further comprising:
a communication connection establishing module, configured to establish an electrical-signal-based communication connection among the plurality of devices based on the association information.
20. The apparatus of claim 11 or 12, the format of the second natural speech signal being a default setting or a custom setting.
21. An electronic device, comprising:
a memory and one or more processors;
wherein the memory is communicatively coupled to the one or more processors and has stored therein instructions executable by the one or more processors to enable the one or more processors to implement the method of any one of claims 1-10.
22. A computer-readable storage medium having computer-executable instructions stored thereon which, when executed, perform the method of any one of claims 1-10.
CN201810187221.2A 2018-03-07 2018-03-07 Method and device for establishing equipment association Active CN108630201B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810187221.2A CN108630201B (en) 2018-03-07 2018-03-07 Method and device for establishing equipment association

Publications (2)

Publication Number Publication Date
CN108630201A CN108630201A (en) 2018-10-09
CN108630201B true CN108630201B (en) 2020-09-29

Family

ID=63706103

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810187221.2A Active CN108630201B (en) 2018-03-07 2018-03-07 Method and device for establishing equipment association

Country Status (1)

Country Link
CN (1) CN108630201B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103197571A (en) * 2013-03-15 2013-07-10 张春鹏 Control method, device and system
CN103885350A (en) * 2014-03-19 2014-06-25 四川长虹电器股份有限公司 Method and device for voice control over household appliances
EP2801973A1 (en) * 2013-05-06 2014-11-12 Honeywell International Inc. Systems and methods for control of devices by voice recognition
US8996375B1 (en) * 2007-10-04 2015-03-31 Great Northern Research, LLC Speech interface system and method for control and interaction with applications on a computing system
WO2016136207A1 (en) * 2015-02-27 2016-09-01 パナソニックIpマネジメント株式会社 Voice interaction device, voice interaction system, control method of voice interaction device, and program
CN106653008A (en) * 2015-10-28 2017-05-10 中兴通讯股份有限公司 Voice control method, device and system
CN106886166A (en) * 2015-12-11 2017-06-23 美的集团股份有限公司 Method, device and the audio amplifier of household electrical appliance are controlled by audio amplifier
CN107680595A (en) * 2017-11-15 2018-02-09 安徽声讯信息技术有限公司 A kind of intelligent sound mobile phone or intelligent sound flat board

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8380503B2 (en) * 2008-06-23 2013-02-19 John Nicholas and Kristin Gross Trust System and method for generating challenge items for CAPTCHAs

Also Published As

Publication number Publication date
CN108630201A (en) 2018-10-09

Similar Documents

Publication Publication Date Title
JP6613347B2 (en) Method and apparatus for pushing information
CN108831469B (en) Voice command customizing method, device and equipment and computer storage medium
US10708423B2 (en) Method and apparatus for processing voice information to determine emotion based on volume and pacing of the voice
CN106653008B (en) Voice control method, device and system
KR100679043B1 (en) Apparatus and method for spoken dialogue interface with task-structured frames
KR20140105673A (en) Supporting Method And System For communication Service, and Electronic Device supporting the same
WO2015094907A1 (en) Attribute-based audio channel arbitration
CN105206272A (en) Voice transmission control method and system
JP6783339B2 (en) Methods and devices for processing audio
KR20180046780A (en) Method for providing of voice recognition service using double wakeup and apparatus thereof
CN110517686A (en) Method and system for opening an application by voice on a smart speaker
CN111161742A (en) Directional person communication method, system, storage medium and intelligent voice device
CN111933135A (en) Terminal control method and device, intelligent terminal and computer readable storage medium
CN108630201B (en) Method and device for establishing equipment association
CN112309396A (en) AI virtual robot state dynamic setting system
CN108766429B (en) Voice interaction method and device
US11790913B2 (en) Information providing method, apparatus, and storage medium, that transmit related information to a remote terminal based on identification information received from the remote terminal
KR102584324B1 (en) Method for providing of voice recognition service and apparatus thereof
CN115424622A (en) Man-machine voice intelligent interaction method and device
KR20220140304A (en) Video learning systems for recognizing learners' voice commands
KR20220140301A (en) Video learning systems for enabling learners to be identified through artificial intelligence and method thereof
CN111353771A (en) Method, device, equipment and medium for remotely controlling payment
CN105118507A (en) Sound control system and control method thereof
Dumitrescu Cassandra smart-home system description
KR102150902B1 (en) Apparatus and method for voice response

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant