CN109995475B - Control method and device of intelligent terminal, storage medium and electronic equipment - Google Patents

Control method and device of intelligent terminal, storage medium and electronic equipment

Info

Publication number
CN109995475B
CN109995475B CN201910271951.5A CN201910271951A CN109995475B CN 109995475 B CN109995475 B CN 109995475B CN 201910271951 A CN201910271951 A CN 201910271951A CN 109995475 B CN109995475 B CN 109995475B
Authority
CN
China
Prior art keywords
intelligent terminal
target voice
format
encoding
coding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910271951.5A
Other languages
Chinese (zh)
Other versions
CN109995475A (en)
Inventor
姚茂楠
张斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu Zimi Software Technology Co ltd
Original Assignee
Jiangsu Zimi Software Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu Zimi Software Technology Co ltd
Priority to CN201910271951.5A
Publication of CN109995475A
Application granted
Publication of CN109995475B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L1/00 Arrangements for detecting or preventing errors in the information received
    • H04L1/0001 Systems modifying transmission characteristics according to link quality, e.g. power backoff
    • H04L1/0015 Systems modifying transmission characteristics according to link quality, e.g. power backoff characterised by the adaptation strategy
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L1/00 Arrangements for detecting or preventing errors in the information received
    • H04L1/0001 Systems modifying transmission characteristics according to link quality, e.g. power backoff
    • H04L1/0033 Systems modifying transmission characteristics according to link quality, e.g. power backoff arrangements specific to the transmitter
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223 Execution procedure of a spoken command
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00 Reducing energy consumption in communication networks
    • Y02D30/70 Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Telephone Function (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The embodiment of the application discloses a control method and device of an intelligent terminal, a storage medium and electronic equipment. The method comprises the following steps: acquiring at least one of locally preset voice instructions as a target voice instruction; if a connection channel with the intelligent terminal has been established, determining a coding format supported by the intelligent terminal; and encoding the target voice instruction data using the coding format supported by the intelligent terminal to obtain a coding result, and transmitting the coding result to the intelligent terminal through the connection channel, so that the intelligent terminal decodes the coding result to obtain the target voice instruction and executes it through a voice assistant. With the technical solution provided by the embodiment, the voice instruction is transmitted as data to control the intelligent terminal, so there is no need to consider whether the communication protocol between the device and the intelligent terminal is compatible. The solution is therefore widely applicable and requires no additional software development or hardware upgrade.

Description

Control method and device of intelligent terminal, storage medium and electronic equipment
Technical Field
The embodiment of the application relates to the technical field of communication, in particular to a control method and device of an intelligent terminal, a storage medium and electronic equipment.
Background
With the rapid development of social economy and the gradual improvement of the technological level, the intelligent terminal has become an indispensable communication and entertainment tool in people's life.
Control of intelligent terminals has gradually evolved from conventional direct operation to control through interaction with other devices. A device conventionally interacts with an intelligent terminal through a physical interface or a network protocol interface, such as USB (Universal Serial Bus), Wi-Fi (Wireless Fidelity, a wireless local area network technology), or Bluetooth. However, such interaction requires the device to use the same communication protocol as the intelligent terminal. Because the communication protocols of different companies or different products are not compatible with one another, direct communication between the two cannot be achieved in those cases.
Disclosure of Invention
The embodiment of the application provides a control method and device of an intelligent terminal, a storage medium and electronic equipment, so as to solve the problem of communication incompatibility between a device and the intelligent terminal.
In a first aspect, an embodiment of the present application provides a control method of an intelligent terminal, where the method includes:
acquiring at least one of locally preset voice instructions as a target voice instruction;
if the connection channel with the intelligent terminal is established, determining the coding format supported by the intelligent terminal;
and coding the target voice instruction data by utilizing a coding format supported by the intelligent terminal to obtain a coding result, and sending the coding result to the intelligent terminal through the connecting channel so that the intelligent terminal decodes the coding result to obtain the target voice instruction, and executing the target voice instruction through a voice assistant.
Further, before the target voice instruction data is encoded by using the encoding format supported by the intelligent terminal to obtain an encoding result, the method further includes:
determining whether the storage format of the target voice instruction is a voice format;
if so, encoding the target voice instruction data using the coding format supported by the intelligent terminal to obtain a coding result.
Further, after determining whether the storage format of the target voice command is a voice format, the method further comprises:
if not, converting the target voice instruction into a voice format, and then encoding the target voice instruction data using the coding format supported by the intelligent terminal to obtain a coding result.
Further, converting the format of the target voice command into a voice format includes:
and converting the format of the target voice instruction into a voice format through a local TTS interface or through a network TTS interface.
Further, before the target voice instruction data is encoded by using the encoding format supported by the intelligent terminal to obtain an encoding result, and the encoding result is sent to the intelligent terminal through the connection channel, the method further includes:
determining the bandwidth of the connection channel;
and determining the code rate for encoding the target voice instruction data according to the bandwidth of the connecting channel.
Further, after acquiring at least one of the locally preset voice instructions as the target voice instruction, the method further includes:
and if the connection channel with the intelligent terminal is not established, stopping sending the target voice instruction.
Further, the connection channel includes: one of a BLE channel, an SPP channel, an HFP channel, and an iAP channel of the Bluetooth protocol.
In a second aspect, an embodiment of the present application provides a control device of an intelligent terminal, where the device includes:
the target voice command acquisition module is used for acquiring at least one of locally preset voice commands as a target voice command;
the coding format determining module is used for determining a coding format supported by the intelligent terminal if judging that a connection channel with the intelligent terminal is established;
and the target voice command sending module is used for coding the target voice command data by utilizing a coding format supported by the intelligent terminal to obtain a coding result, sending the coding result to the intelligent terminal through the connecting channel so that the intelligent terminal decodes the coding result to obtain the target voice command, and executing the target voice command through a voice assistant.
Further, the apparatus further includes a storage format determination module, the storage format determination module including:
a storage format judging unit for determining whether the storage format of the target voice instruction is a voice format;
if so, the target voice instruction data is encoded using the coding format supported by the intelligent terminal to obtain a coding result.
Further, the storage format determining module further includes:
and a format conversion unit, configured to convert the target voice instruction into a voice format if the storage format judging unit determines that the storage format is not a voice format, after which the target voice instruction data is encoded using the coding format supported by the intelligent terminal to obtain a coding result.
Further, the format conversion unit is specifically configured to:
and converting the format of the target voice instruction into a voice format through a local TTS interface or through a network TTS interface.
Further, the apparatus further includes a code rate determining module, where the code rate determining module includes:
a bandwidth determining unit, configured to determine a bandwidth of the connection channel;
and the code rate determining unit is used for determining the code rate for encoding the target voice instruction data according to the bandwidth of the connecting channel.
Further, the device further comprises:
and the target voice command sending control module is used for stopping sending the target voice command if judging that the connection channel with the intelligent terminal is not established.
Further, the connection channel includes: one of a BLE channel, an SPP channel, an HFP channel, and an iAP channel of the Bluetooth protocol.
In a third aspect, embodiments of the present application provide a computer-readable storage medium having stored thereon a computer program that, when executed by a processor, implements a control method of an intelligent terminal according to embodiments of the present application.
In a fourth aspect, an embodiment of the present application provides an electronic device, including a memory, a processor, and a computer program stored on the memory and capable of being executed by the processor, where the processor executes the computer program to implement a control method of an intelligent terminal according to an embodiment of the present application.
According to the technical solution provided by the embodiment of the application, at least one of the locally preset voice instructions is obtained as a target voice instruction; if a connection channel with the intelligent terminal has been established, the coding format supported by the intelligent terminal is determined; the target voice instruction data is encoded using the coding format supported by the intelligent terminal to obtain a coding result, and the coding result is sent to the intelligent terminal through the connection channel, so that the intelligent terminal decodes the coding result to obtain the target voice instruction and executes it through a voice assistant. With this technical solution, the voice instruction is transmitted as data to control the intelligent terminal, so there is no need to consider whether the communication protocol between the device and the intelligent terminal is compatible. The solution is therefore widely applicable and requires no additional software development or hardware upgrade.
Drawings
Fig. 1 is a flowchart of a control method of an intelligent terminal according to an embodiment of the present application;
Fig. 2 is a flowchart of a control method of an intelligent terminal according to a second embodiment of the present application;
Fig. 3 is a flowchart of a control method of an intelligent terminal according to a third embodiment of the present application;
Fig. 4 is a schematic structural diagram of a control device of an intelligent terminal according to a fourth embodiment of the present application;
Fig. 5 is a schematic structural diagram of an electronic device according to a sixth embodiment of the present application.
Detailed Description
The present application is described in further detail below with reference to the drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the application and not limiting thereof. It should be further noted that, for convenience of description, only some, but not all of the structures related to the present application are shown in the drawings.
Before discussing exemplary embodiments in more detail, it should be mentioned that some exemplary embodiments are described as processes or methods depicted as flowcharts. Although a flowchart depicts steps as a sequential process, many of the steps may be implemented in parallel, concurrently, or with other steps. Furthermore, the order of the steps may be rearranged. The process may be terminated when its operations are completed, but may have additional steps not included in the figures. The processes may correspond to methods, functions, procedures, subroutines, and the like.
Example 1
Fig. 1 is a flowchart of a control method of an intelligent terminal according to an embodiment of the present application, where the embodiment may be suitable for controlling the intelligent terminal by voice technology, and the method may be performed by a control device of the intelligent terminal according to the embodiment of the present application, where the device may be implemented by software and/or hardware, and may be integrated in an electronic device.
As shown in fig. 1, the control method of the intelligent terminal includes:
S110, acquiring at least one of the locally preset voice instructions as a target voice instruction.
The electronic device according to the present application may be any device capable of communicating with an intelligent terminal, for example through Bluetooth communication or wireless network communication. Specifically, the electronic device may be a smart speaker, an in-vehicle phone mount, a selfie stick, and the like. The electronic device needs to store one or more voice instructions locally. For example, the voice instructions stored locally in a smart speaker may include opening music playing software, starting, pausing, playing the next song, disconnecting the network, calling someone, and so on. The voice instructions stored locally in an in-vehicle phone mount may include switching to in-vehicle mode. The voice instructions stored locally in a selfie stick may include turning on the camera, taking a photograph, and so on. Because electronic devices serve different functions, different voice instructions can be preset for different devices, so that in actual use the electronic device can control the intelligent terminal to carry out one or more functions.
In this embodiment, at least one of the locally preset voice instructions may be obtained as the target voice instruction according to the actual need of the user. For example, when the user places a smartphone on the in-vehicle phone mount, it can be inferred that the user needs the phone switched to in-vehicle mode, so the voice instruction for switching to in-vehicle mode may be determined as the target voice instruction.
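The selection in S110 can be pictured with a minimal sketch. The device names, the instruction texts, and the function `select_target_instructions` are all hypothetical and chosen only to mirror the examples in the text; the application does not prescribe any data structure.

```python
# Hypothetical table of locally preset voice instructions, keyed by device type.
PRESET_INSTRUCTIONS = {
    "smart_speaker": ["open the music app", "pause", "play the next song"],
    "car_phone_mount": ["switch to in-vehicle mode"],
    "selfie_stick": ["open the camera", "take a photo"],
}


def select_target_instructions(device_type, wanted):
    """Return the preset instructions of device_type that appear in wanted."""
    presets = PRESET_INSTRUCTIONS.get(device_type, [])
    targets = [text for text in presets if text in wanted]
    if not targets:
        # At least one target instruction is needed before anything is sent.
        raise LookupError(f"no preset instruction matches {wanted!r}")
    return targets
```

For instance, a phone mount that detects a phone being placed on it could call `select_target_instructions("car_phone_mount", {"switch to in-vehicle mode"})`.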
S120, if a connection channel with the intelligent terminal has been established, determining the coding format supported by the intelligent terminal.
The connection channel between the electronic device and the intelligent terminal may include USB, Wi-Fi, Bluetooth, and the like. Whether a connection with the intelligent terminal has been established can be judged by checking the corresponding physical interface or communication protocol interface. If so, the coding format supported by the intelligent terminal can be determined through interaction. For example, after determining that a connection channel with the intelligent terminal has been established, the electronic device may obtain the coding format supported by the intelligent terminal by sending a coding format request.
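One possible shape for S120 is sketched below. It assumes that the coding format request returns a list of format names and that the device picks the first of its own encoders that the terminal also supports. The format names, the `DEVICE_ENCODERS` preference order, and the `request_formats` callable are illustrative assumptions, not part of the application.

```python
from typing import Callable, Optional, Sequence

# Formats this hypothetical device can encode, in order of preference.
DEVICE_ENCODERS = ("opus", "sbc", "pcm_s16le")


def determine_coding_format(
    channel_established: bool,
    request_formats: Callable[[], Sequence[str]],
) -> Optional[str]:
    """Return a coding format both sides support, or None if there is none."""
    if not channel_established:
        return None  # no channel: do not even send the coding format request
    supported = set(request_formats())  # stands in for the coding format request
    for fmt in DEVICE_ENCODERS:
        if fmt in supported:
            return fmt
    return None
```

Passing the request as a callable keeps the sketch independent of any particular transport.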
In this embodiment, optionally, after acquiring at least one of the locally preset voice instructions as the target voice instruction, the method further includes: if the connection channel with the intelligent terminal is not established, stopping sending the target voice instruction. Stopping transmission when the connection channel is not established or has been disconnected avoids the situation in which the target voice instruction is sent but never executed. Because the electronic device does not actively resend the target voice instruction after the connection is re-established, such a transmission would achieve no actual control effect.
The connection channel includes one of a BLE channel, an SPP channel, an HFP channel, and an iAP channel of the Bluetooth protocol. BLE (Bluetooth Low Energy) has the advantage of enabling low-power transmission. SPP (Serial Port Profile) defines how to set up a virtual serial port and how to connect two Bluetooth devices. Using the RFCOMM protocol, SPP provides serial port emulation based on the ETSI TS 07.10 specification, offering a wireless replacement for existing RS-232 serial communication applications and control signals. SPP also provides the basis for the DUN, FAX, HSP, and LAN profiles; its advantage is that compatibility is guaranteed. HFP (Hands-Free Profile) allows a Bluetooth device to control phone calls, for example answering, hanging up, rejecting, and voice dialing; whether rejecting and voice dialing are available depends on whether both the Bluetooth headset and the phone support them. HFP is a configurable mode that allows the Bluetooth headset to enter a high-fidelity call. For iAP, the protocol defines a Service Class UUID; if the intelligent terminal provides this service, it can display the name of the Bluetooth accessory found during Bluetooth discovery.
S130, coding the target voice instruction data by utilizing a coding format supported by the intelligent terminal to obtain a coding result, and sending the coding result to the intelligent terminal through the connecting channel so that the intelligent terminal decodes the coding result to obtain the target voice instruction, and executing the target voice instruction through a voice assistant.
After the coding format supported by the intelligent terminal is obtained, the determined target voice instruction data can be encoded in that format to obtain a coding result, which is then sent to the intelligent terminal through the connection channel. The target voice instruction data may be text data, voice data, or data in other formats. In this embodiment, the intelligent terminal only needs to receive and decode the target voice instruction.
After receiving the coding result, the intelligent terminal decodes it to obtain the target voice instruction and executes the target voice instruction through a voice assistant. The voice assistant can determine the target voice instruction by recognizing the target voice instruction data and then execute it.
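The round trip of S110 to S130 can be sketched end to end as follows. This is only an illustration under stated assumptions: `base64` and `hex` stand in for real audio coding formats, and `FakeTerminal` is a hypothetical stand-in for an intelligent terminal whose "voice assistant" simply records the decoded data.

```python
import base64
import binascii

# Stand-in codecs: (encode, decode) pairs keyed by a format name.
# A real device would use audio codecs here; these only illustrate the flow.
CODECS = {
    "base64": (base64.b64encode, base64.b64decode),
    "hex": (binascii.hexlify, binascii.unhexlify),
}


class FakeTerminal:
    """Hypothetical terminal that decodes a payload and 'executes' it."""

    supported_formats = ("hex",)

    def __init__(self):
        self.executed = []

    def receive(self, fmt, payload):
        decode = CODECS[fmt][1]
        # Handing the decoded data to the voice assistant is modelled as a log.
        self.executed.append(decode(payload))


def control_terminal(instruction_data, terminal, connected):
    """Encode and send instruction_data; return True if it was sent."""
    if not connected:
        return False  # no connection channel: stop sending
    fmt = next((f for f in terminal.supported_formats if f in CODECS), None)
    if fmt is None:
        return False  # no coding format in common
    encode = CODECS[fmt][0]
    terminal.receive(fmt, encode(instruction_data))
    return True
```

The sending side never interprets the instruction itself; it only needs a coding format the terminal can decode.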
According to the technical solution provided by the embodiment of the application, at least one of the locally preset voice instructions is obtained as a target voice instruction; if a connection channel with the intelligent terminal has been established, the coding format supported by the intelligent terminal is determined; the target voice instruction data is encoded using the coding format supported by the intelligent terminal to obtain a coding result, and the coding result is sent to the intelligent terminal through the connection channel, so that the intelligent terminal decodes the coding result to obtain the target voice instruction and executes it through a voice assistant. With this technical solution, the voice instruction is transmitted as data to control the intelligent terminal, so there is no need to consider whether the communication protocol between the device and the intelligent terminal is compatible. The solution is therefore widely applicable and requires no additional software development or hardware upgrade.
Example 2
Fig. 2 is a flowchart of a control method of an intelligent terminal according to a second embodiment of the present application. This embodiment refines the above embodiment as follows: before the target voice instruction data is encoded using the coding format supported by the intelligent terminal to obtain a coding result, the method further includes: determining whether the storage format of the target voice instruction is a voice format; if so, encoding the target voice instruction data using the coding format supported by the intelligent terminal to obtain a coding result. After determining whether the storage format of the target voice instruction is a voice format, the method further includes: if not, converting the target voice instruction into a voice format, and then encoding the target voice instruction data using the coding format supported by the intelligent terminal to obtain a coding result.
As shown in fig. 2, the control method of the intelligent terminal includes:
S210, acquiring at least one of the locally preset voice instructions as a target voice instruction.
S220, if the connection channel with the intelligent terminal is established, determining the coding format supported by the intelligent terminal.
S230, judging whether the storage format of the target voice instruction is a voice format. If not, S240 is performed; if so, S250 is performed.
Because flash storage capacity is limited on some electronic devices, the preset voice instructions may be stored in a text format. Therefore, before the voice instruction is sent to the intelligent terminal, it may be determined whether the voice instruction is stored in a voice format. If so, the determined target voice instruction can be sent directly, that is, S250 described below is performed.
S240, converting the target voice instruction into a voice format, and then performing S250.
There are many ways to convert the format of the target voice command into a voice format.
In this embodiment, preferably, converting the format of the target voice command into a voice format includes: the format of the target voice command is converted into a voice format through a local TTS (Text To Speech) interface or through a network TTS interface. The text format conversion may be implemented by a TTS interface local to the electronic device or by a TTS interface provided by a web server. The advantage of this arrangement is that it ensures that the transmitted target voice information can be recognized by the voice assistant of the intelligent terminal to determine the actual content of the operating instruction.
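The check in S230 and the conversion in S240 might look like the sketch below. Text instructions are modelled as `str` and voice data as `bytes`; `local_tts` and `network_tts` are hypothetical callables standing in for whatever TTS interfaces a device has. Trying the local interface first and falling back to the network one is an assumption of this sketch; the text only says either interface may be used.

```python
from typing import Callable, Optional, Union

TtsFn = Callable[[str], bytes]


def to_voice_format(
    instruction: Union[str, bytes],
    local_tts: Optional[TtsFn] = None,
    network_tts: Optional[TtsFn] = None,
) -> bytes:
    """Return voice data for an instruction stored as text or as voice."""
    if isinstance(instruction, (bytes, bytearray)):
        return bytes(instruction)  # already in a voice format: nothing to do
    for tts in (local_tts, network_tts):
        if tts is None:
            continue
        try:
            return tts(instruction)
        except OSError:
            continue  # this interface is unavailable, try the next one
    raise RuntimeError("no TTS interface could convert the instruction")
```

Storing instructions as text and converting on demand trades a little latency for a much smaller flash footprint.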
S250, coding the target voice instruction data by utilizing a coding format supported by the intelligent terminal to obtain a coding result.
S260, sending the coding result to the intelligent terminal through the connection channel, so that the intelligent terminal decodes the coding result to obtain the target voice instruction and executes the target voice instruction through a voice assistant.
On the basis of the above embodiment, this embodiment provides a method for identifying and processing the format of the locally preset voice instruction. The advantage is that, after receiving the target voice instruction, the intelligent terminal can recognize its content through the system voice assistant or a third-party voice assistant and be controlled accordingly. This solves the compatibility problem while keeping the solution stable, because the result is not affected by the format of the transmitted voice instruction.
On the basis of the above technical solutions, preferably, before the target voice instruction data is encoded using the coding format supported by the intelligent terminal to obtain a coding result and the coding result is sent to the intelligent terminal through the connection channel, the method further includes: determining the bandwidth of the connection channel; and determining the code rate for encoding the target voice instruction data according to the bandwidth of the connection channel. The wider the bandwidth of the established connection channel, the higher the code rate that can be used for encoding. The advantage of this arrangement is that the connection channel is used more efficiently, and the reduced transmission rate or even transmission errors caused by a code rate that is too high for the available bandwidth are avoided.
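A minimal sketch of bandwidth-driven code rate selection follows. The candidate code rates and the `headroom` factor are hypothetical values picked for illustration; the application only states that a wider channel permits a higher code rate.

```python
# Hypothetical code rates (bits per second) the device's encoder can produce.
CANDIDATE_BITRATES_BPS = (8_000, 16_000, 32_000, 64_000)


def choose_bitrate(bandwidth_bps: int, headroom: float = 0.8) -> int:
    """Pick the highest candidate code rate that fits within the channel.

    headroom reserves part of the bandwidth for protocol overhead; the
    value 0.8 is an assumption of this sketch.
    """
    budget = bandwidth_bps * headroom
    fitting = [rate for rate in CANDIDATE_BITRATES_BPS if rate <= budget]
    # If nothing fits, fall back to the lowest rate rather than not sending.
    return max(fitting) if fitting else min(CANDIDATE_BITRATES_BPS)
```

With these values, a channel of 100 kbit/s selects 64 kbit/s, while a channel of 25 kbit/s selects 16 kbit/s.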
Example 3
Fig. 3 is a flowchart of a control method of an intelligent terminal according to a third embodiment of the present application. Taking a smart phone as an example, the embodiments of the present application are explained and illustrated in more detail. As shown in fig. 3, the control method of the intelligent terminal includes:
The intelligent hardware device starts running. The intelligent hardware device may be any of the electronic devices described above. The device checks whether a transmission channel has been established. If not, it stops sending the voice instruction. If so, it further determines whether the preset instruction is a text instruction. If yes, the instruction is converted into a voice instruction and then sent; if not, the voice instruction is sent directly. After the voice instruction data is transmitted to the smartphone, the smartphone receives and processes the voice instruction, thereby achieving the operation intended by the intelligent hardware.
Specifically, the following procedure may be adopted:
(1) The intelligent hardware device starts running. A set of designated voice operation instructions is preset according to the functions to be implemented and the voice instructions supported by the voice assistant and application programs of the smartphone.
(2) Check whether a transmission channel between the intelligent hardware device and the smartphone has been established; if not, no voice instruction is sent.
(3) Because some hardware devices have limited storage space, their preset operation instructions are stored as text. Check whether the preset voice instruction is a text instruction; if so, convert it into voice instruction information through a local or network TTS interface.
(4) Encode the voice data according to the bandwidth of the transmission channel and the coding format supported by the smartphone, and then send the encoded voice instruction data to the smartphone over the transmission channel.
(5) The smartphone receives the voice instruction and decodes it accordingly. At present, speech recognition on smartphones is mostly performed in the cloud, so the phone performs cloud speech recognition and semantic analysis over the network, after which the phone's operating system or voice assistant application acts on the recognized semantics. Depending on the specific semantic operation, this process may involve multiple data requests to the intelligent hardware; when such requests are made, the intelligent hardware sends voice data multiple times according to the preset operation instructions.
(6) Through the above flow, the operation expected by the intelligent hardware is carried out through the smartphone.
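The decision branches of the flow above, on the hardware side, can be traced with a small sketch. The function `run_flow` and its callable parameters are hypothetical; the TTS, the encoder, and the channel are injected so that the sketch stays independent of any real Bluetooth or speech library.

```python
from typing import Callable, List, Union


def run_flow(
    channel_up: bool,
    preset: Union[str, bytes],
    tts: Callable[[str], bytes],
    encode: Callable[[bytes], bytes],
    send: Callable[[bytes], object],
) -> List[str]:
    """Run steps (2) to (4) once and return a trace of what happened."""
    trace = ["start"]
    if not channel_up:
        trace.append("stop: no transmission channel")  # step (2)
        return trace
    if isinstance(preset, str):  # step (3): the preset is a text instruction
        preset = tts(preset)
        trace.append("converted text to voice")
    send(encode(preset))  # step (4): encode, then transmit
    trace.append("sent")
    return trace
```

Returning a trace makes it easy to see which branch was taken for a given preset and channel state.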
The intelligent hardware function requirement is completed by using the voice instruction supported by the voice assistant of the intelligent mobile phone, and for hardware products meeting the requirement, the product requirement can be directly realized by using the voice instruction supported by the voice assistant of the mobile phone and the application program. For voice transmission channels, the invention can be implemented through BLE, SPP, HFP or iAP channels of Bluetooth protocol, and can also be implemented through other protocols supporting voice instruction transmission.
Compared with the prior art, the technical scheme provided by the invention allows hardware products that fit this approach to achieve their functions through the smart phone, using the voice instructions supported by the phone's voice assistant and application programs, without defining a private communication protocol on the phone side. The invention both saves the cost of software development and installation and improves the user interaction experience of the product.
Example four
Fig. 4 is a schematic structural diagram of a control device of an intelligent terminal according to a fourth embodiment of the present application. As shown in fig. 4, the control device of the intelligent terminal includes:
a target voice command obtaining module 410, configured to obtain at least one of locally preset voice commands as a target voice command;
a coding format determining module 420, configured to determine a coding format supported by the intelligent terminal if it is determined that a connection channel with the intelligent terminal has been established;
and a target voice command sending module 430, configured to encode the target voice command data using a coding format supported by the intelligent terminal to obtain an encoding result, and to send the encoding result to the intelligent terminal through the connection channel, so that the intelligent terminal decodes the encoding result to obtain the target voice command and executes the target voice command through a voice assistant.
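The three modules can be wired together as in the following Python sketch. It is a structural illustration under stated assumptions: `FakeTerminal` is a hypothetical stand-in for the intelligent terminal, the "encoders" are placeholder callables rather than real audio codecs, and appending to `received` stands in for transmission over the connection channel.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class FakeTerminal:
    """Hypothetical stand-in for the intelligent terminal."""
    connected: bool
    supported_formats: List[str]
    received: List[bytes] = field(default_factory=list)


class TargetVoiceCommandObtainingModule:  # corresponds to module 410
    def __init__(self, presets: Dict[str, bytes]) -> None:
        self._presets = presets

    def obtain(self, name: str) -> bytes:
        return self._presets[name]


class CodingFormatDeterminingModule:  # corresponds to module 420
    def determine(self, terminal: FakeTerminal) -> Optional[str]:
        # Formats are only determined once a connection channel exists.
        if not terminal.connected or not terminal.supported_formats:
            return None
        return terminal.supported_formats[0]


class TargetVoiceCommandSendingModule:  # corresponds to module 430
    def __init__(self, encoders: Dict[str, Callable[[bytes], bytes]]) -> None:
        self._encoders = encoders

    def send(self, terminal: FakeTerminal, data: bytes, fmt: str) -> bytes:
        encoded = self._encoders[fmt](data)
        terminal.received.append(encoded)  # stands in for the channel
        return encoded


def control(terminal: FakeTerminal,
            presets: Dict[str, bytes],
            encoders: Dict[str, Callable[[bytes], bytes]],
            name: str) -> Optional[bytes]:
    """Run modules 410, 420 and 430 in order; None means nothing was sent."""
    data = TargetVoiceCommandObtainingModule(presets).obtain(name)
    fmt = CodingFormatDeterminingModule().determine(terminal)
    if fmt is None or fmt not in encoders:
        return None
    return TargetVoiceCommandSendingModule(encoders).send(terminal, data, fmt)
```

Keeping each module behind its own small class mirrors the device structure of Fig. 4 and lets the optional modules described below be added without touching the others.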
According to the technical scheme provided by this embodiment of the application, at least one of the locally preset voice instructions is acquired as a target voice instruction; if it is determined that a connection channel with the intelligent terminal has been established, the coding format supported by the intelligent terminal is determined; and the target voice instruction data is encoded using a coding format supported by the intelligent terminal to obtain an encoding result, which is sent to the intelligent terminal through the connection channel so that the intelligent terminal decodes it to obtain the target voice instruction and executes the target voice instruction through a voice assistant. With this technical scheme, the voice instruction can be transmitted as data to control the intelligent terminal, so there is no need to consider whether the communication protocol with the intelligent terminal is compatible; no additional software development or hardware upgrade is required, and the scheme has good applicability.
On the basis of the above technical solutions, optionally, the apparatus further includes a storage format determining module, where the storage format determining module includes:
a storage format judging unit, configured to determine whether the storage format of the target voice instruction is a voice format;
and if so, to perform: encoding the target voice instruction data using a coding format supported by the intelligent terminal to obtain an encoding result.
On the basis of the above technical solutions, optionally, the storage format determining module further includes:
a format conversion unit, configured to convert the format of the target voice instruction into a voice format if the storage format judging unit determines that the storage format is not a voice format, and then to perform: encoding the target voice instruction data using a coding format supported by the intelligent terminal to obtain an encoding result.
Based on the above technical solutions, optionally, the format conversion unit is specifically configured to:
converting the format of the target voice instruction into a voice format through a local TTS interface or a network TTS interface.
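The storage format check and TTS conversion might be sketched in Python as follows. This is a sketch under assumptions: the payload type (`str` for text, `bytes` for voice) is used as the storage format marker, the TTS interfaces are passed in as plain callables, and trying the local interface before the network one is a design choice of the example, not something the text prescribes.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Union

TtsInterface = Callable[[str], bytes]


@dataclass
class PresetInstruction:
    # str means the instruction is stored as text, bytes means voice data.
    payload: Union[str, bytes]


def to_voice(instruction: PresetInstruction,
             local_tts: Optional[TtsInterface],
             network_tts: Optional[TtsInterface]) -> bytes:
    """Return voice data, converting text through TTS when necessary."""
    if isinstance(instruction.payload, bytes):
        return instruction.payload  # already in a voice format
    for tts in (local_tts, network_tts):
        if tts is None:
            continue
        try:
            return tts(instruction.payload)
        except OSError:
            # This interface is unavailable; try the next one.
            continue
    raise RuntimeError("no TTS interface could convert the instruction")
```

Storing instructions as text and converting on demand trades a little latency for storage space, which matches the motivation given for storage-limited hardware.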
On the basis of the above technical solutions, optionally, the apparatus further includes a code rate determining module, where the code rate determining module includes:
a bandwidth determining unit, configured to determine a bandwidth of the connection channel;
and a code rate determining unit, configured to determine, according to the bandwidth of the connection channel, the code rate used for encoding the target voice instruction data.
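One possible way to derive a code rate from the channel bandwidth is sketched below in Python. The candidate rates and the 0.8 headroom factor are assumptions chosen for illustration; the text only states that the code rate is determined according to the bandwidth.

```python
from typing import Optional, Sequence


def choose_bitrate(bandwidth_bps: int,
                   candidates: Sequence[int] = (64000, 32000, 16000, 8000),
                   headroom: float = 0.8) -> Optional[int]:
    """Pick the highest candidate code rate that fits the channel.

    `headroom` reserves part of the bandwidth for protocol overhead
    (an assumed margin). Returns None if no candidate fits.
    """
    budget = bandwidth_bps * headroom
    for rate in sorted(candidates, reverse=True):
        if rate <= budget:
            return rate
    return None
```

For example, under these assumed values a 30 kbit/s channel leaves a 24 kbit/s budget, so the 16 kbit/s candidate is selected.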
On the basis of the above technical solutions, optionally, the apparatus further includes:
a target voice command sending control module, configured to stop sending the target voice command if it is determined that no connection channel with the intelligent terminal has been established.
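The sending control behaviour amounts to a guard in front of the transmit call, as in this minimal Python sketch. The callables `is_connected` and `transmit` are hypothetical hooks into the channel layer.

```python
from typing import Callable


def send_if_connected(is_connected: Callable[[], bool],
                      transmit: Callable[[bytes], None],
                      encoded: bytes) -> bool:
    """Transmit only when a connection channel exists.

    Returns True if the data was handed to the channel, False if sending
    was stopped because no channel has been established.
    """
    if not is_connected():
        return False
    transmit(encoded)
    return True
```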
On the basis of the above technical solutions, optionally, the connection channel includes: one of a BLE channel, an SPP channel, an HFP channel, and an iAP channel of the Bluetooth protocol.
The above device can execute the method provided by any embodiment of the application, and has the functional modules and beneficial effects corresponding to the executed method.
Example five
This embodiment of the application also provides a storage medium containing computer-executable instructions which, when executed by a computer processor, perform a control method of an intelligent terminal, the method comprising:
acquiring at least one of locally preset voice instructions as a target voice instruction;
if the connection channel with the intelligent terminal is established, determining the coding format supported by the intelligent terminal;
and encoding the target voice instruction data using a coding format supported by the intelligent terminal to obtain an encoding result, and sending the encoding result to the intelligent terminal through the connection channel, so that the intelligent terminal decodes the encoding result to obtain the target voice instruction and executes the target voice instruction through a voice assistant.
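The step of determining the coding format can be illustrated with a small negotiation sketch in Python. It assumes the terminal answers a coding format request with a list of format names (the claims mention such a request, but not its wire format), and the codec names used in the usage note are examples only.

```python
from typing import Callable, Iterable, Optional, Sequence


def negotiate_format(request_formats: Callable[[], Iterable[str]],
                     local_encoders: Sequence[str]) -> Optional[str]:
    """Pick a coding format that both sides support.

    `request_formats` (hypothetical) sends the coding format request and
    returns the names reported by the terminal. `local_encoders` lists the
    formats this device can encode, in order of local preference.
    """
    supported = {name.lower() for name in request_formats()}
    for fmt in local_encoders:
        if fmt.lower() in supported:
            return fmt
    return None
```

For instance, if the terminal reports `["SBC", "mSBC"]` and the device prefers `["opus", "msbc", "sbc"]`, the sketch selects `"msbc"`; if the two lists share no entry, it returns `None` and nothing should be encoded.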
Storage medium: any of various types of memory devices or storage devices. The term "storage medium" is intended to include: installation media such as CD-ROM, floppy disk or tape devices; computer system memory or random access memory such as DRAM, DDR RAM, SRAM, EDO RAM, Rambus RAM, etc.; nonvolatile memory such as flash memory, magnetic media (e.g., hard disk or optical storage); registers or other similar types of memory elements, etc. The storage medium may also include other types of memory or combinations thereof. In addition, the storage medium may be located in the computer system in which the program is executed, or may be located in a different, second computer system connected to the first computer system through a network (such as the internet). The second computer system may provide program instructions to the first computer for execution. The term "storage medium" may include two or more storage media that may reside in different locations (e.g., in different computer systems connected by a network). The storage medium may store program instructions (e.g., embodied as a computer program) executable by one or more processors.
Of course, the computer-executable instructions contained in the storage medium provided in the embodiments of the present application are not limited to the control operations of the intelligent terminal described above, and may also perform related operations in the control method of the intelligent terminal provided in any embodiment of the present application.
Example six
This embodiment of the application provides an electronic device that can integrate the control device of the intelligent terminal provided by the embodiments of the application. Fig. 5 is a schematic structural diagram of an electronic device according to a sixth embodiment of the present application. As shown in fig. 5, the present embodiment provides an electronic device 500, which includes: one or more processors 520; and a storage device 510 configured to store one or more programs, where the one or more programs, when executed by the one or more processors 520, cause the one or more processors 520 to implement the control method of an intelligent terminal provided by an embodiment of the present application, the method including:
acquiring at least one of locally preset voice instructions as a target voice instruction;
if the connection channel with the intelligent terminal is established, determining the coding format supported by the intelligent terminal;
and encoding the target voice instruction data using a coding format supported by the intelligent terminal to obtain an encoding result, and sending the encoding result to the intelligent terminal through the connection channel, so that the intelligent terminal decodes the encoding result to obtain the target voice instruction and executes the target voice instruction through a voice assistant.
Of course, it will be understood by those skilled in the art that the processor 520 may also implement the technical solution of the control method of the intelligent terminal provided in any embodiment of the present application.
The electronic device 500 shown in fig. 5 is merely an example, and should not be construed as limiting the functionality and scope of use of embodiments of the present application.
As shown in fig. 5, the electronic device 500 includes a processor 520, a storage device 510, an input device 530, and an output device 540; the number of processors 520 in the electronic device may be one or more, and one processor 520 is taken as an example in fig. 5; the processor 520, the storage device 510, the input device 530, and the output device 540 in the electronic device may be connected by a bus or by other means, and connection via a bus 550 is taken as an example in fig. 5.
The storage device 510 is a computer readable storage medium, and may be used to store a software program, a computer executable program, and a module unit, such as program instructions corresponding to a control method of an intelligent terminal in the embodiment of the present application.
The storage device 510 may mainly include a program storage area and a data storage area, where the program storage area may store an operating system and an application program required for at least one function, and the data storage area may store data created according to the use of the terminal, etc. In addition, the storage device 510 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid-state storage device. In some examples, the storage device 510 may further include memory located remotely from the processor 520, which may be connected via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The input device 530 may be used to receive input numeric, character or voice information and to generate key signal inputs related to user settings and function control of the electronic device. The output device 540 may include a display screen, speakers, etc.
The electronic device provided by this embodiment of the application can transmit the voice instruction as data to control the intelligent terminal, so there is no need to consider whether the communication protocol with the intelligent terminal is compatible; no additional software development or hardware upgrade is required, and the electronic device has good applicability.
The control device of the intelligent terminal, the storage medium and the electronic device provided in the above embodiments can execute the control method of the intelligent terminal provided in any embodiment of the application, and have the functional modules and beneficial effects corresponding to the executed method. For technical details not described in the above embodiments, refer to the control method of the intelligent terminal provided in any embodiment of the present application.
Note that the above is only a preferred embodiment of the present application and the technical principle applied. Those skilled in the art will appreciate that the present application is not limited to the particular embodiments described herein, but is capable of numerous obvious changes, rearrangements and substitutions as will now become apparent to those skilled in the art without departing from the scope of the present application. Therefore, while the present application has been described in connection with the above embodiments, the present application is not limited to the above embodiments, but may include many other equivalent embodiments without departing from the spirit of the present application, the scope of which is defined by the scope of the appended claims.

Claims (8)

1. A control method of an intelligent terminal, characterized by comprising:
acquiring at least one of locally preset voice instructions as a target voice instruction;
if it is determined, by detecting a corresponding physical interface or communication protocol interface, that a connection channel with the intelligent terminal has been established, determining the coding format supported by the intelligent terminal; wherein the coding format supported by the intelligent terminal is obtained by sending a coding format request;
determining the bandwidth of the connection channel; determining the code rate for encoding the target voice instruction data according to the bandwidth of the connecting channel;
encoding the target voice instruction data by utilizing an encoding format supported by the intelligent terminal and the code rate to obtain an encoding result, and sending the encoding result to the intelligent terminal through the connecting channel so that the intelligent terminal decodes the encoding result to obtain the target voice instruction, and executing the target voice instruction through a voice assistant;
before the target voice instruction data is encoded by utilizing the encoding format supported by the intelligent terminal to obtain an encoding result, the method further comprises the following steps:
determining whether the storage format of the target voice instruction is a voice format;
if so, performing: encoding the target voice instruction data using a coding format supported by the intelligent terminal to obtain an encoding result.
2. The method of claim 1, wherein after determining whether the storage format of the target voice command is a voice format, the method further comprises:
if not, converting the format of the target voice instruction into a voice format, and performing: encoding the target voice instruction data using a coding format supported by the intelligent terminal to obtain an encoding result.
3. The method of claim 2, wherein converting the format of the target voice command to a voice format comprises:
and converting the format of the target voice instruction into a voice format through a local TTS interface or through a network TTS interface.
4. The method of claim 1, wherein after acquiring at least one of the locally preset voice instructions as the target voice instruction, the method further comprises:
and if the connection channel with the intelligent terminal is not established, stopping sending the target voice instruction.
5. The method of claim 1, wherein the connection channel comprises: one of a BLE channel, an SPP channel, an HFP channel, and an iAP channel of the Bluetooth protocol.
6. A control device of an intelligent terminal, characterized by comprising:
the target voice command acquisition module is used for acquiring at least one of locally preset voice commands as a target voice command;
the coding format determining module is used for determining a coding format supported by the intelligent terminal if it is determined, by detecting a corresponding physical interface or communication protocol interface, that a connection channel with the intelligent terminal has been established; wherein the coding format supported by the intelligent terminal is obtained by sending a coding format request;
the code rate determining module is used for determining the bandwidth of the connecting channel; determining the code rate for encoding the target voice instruction data according to the bandwidth of the connecting channel;
the target voice command sending module is used for coding the target voice command data by utilizing a coding format supported by the intelligent terminal and the code rate to obtain a coding result, sending the coding result to the intelligent terminal through the connecting channel so that the intelligent terminal decodes the coding result to obtain the target voice command, and executing the target voice command through a voice assistant;
the device further comprises a storage format determining module, wherein the storage format determining module comprises:
a storage format judging unit for determining whether the storage format of the target voice instruction is a voice format;
and if so, to perform: encoding the target voice instruction data using a coding format supported by the intelligent terminal to obtain an encoding result.
7. A computer-readable storage medium, on which a computer program is stored, characterized in that the program, when being executed by a processor, implements the control method of the intelligent terminal according to any one of claims 1-5.
8. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the control method of the intelligent terminal according to any of claims 1-5 when executing the computer program.
CN201910271951.5A 2019-04-04 2019-04-04 Control method and device of intelligent terminal, storage medium and electronic equipment Active CN109995475B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910271951.5A CN109995475B (en) 2019-04-04 2019-04-04 Control method and device of intelligent terminal, storage medium and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910271951.5A CN109995475B (en) 2019-04-04 2019-04-04 Control method and device of intelligent terminal, storage medium and electronic equipment

Publications (2)

Publication Number Publication Date
CN109995475A CN109995475A (en) 2019-07-09
CN109995475B true CN109995475B (en) 2023-05-30

Family

ID=67132415

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910271951.5A Active CN109995475B (en) 2019-04-04 2019-04-04 Control method and device of intelligent terminal, storage medium and electronic equipment

Country Status (1)

Country Link
CN (1) CN109995475B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113782015A (en) * 2021-01-06 2021-12-10 北京沃东天骏信息技术有限公司 Voice interaction method and device
CN113805961B (en) * 2021-08-27 2023-09-15 中国航空无线电电子研究所 Product software and hardware compatibility management method based on compatibility list file

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101996638A (en) * 2009-08-10 2011-03-30 北京多思科技发展有限公司 Voice codec and voice coding/decoding method
CN102625007A (en) * 2011-01-26 2012-08-01 中山爱科数字家庭产业孵化基地有限公司 Method for controlling voice-recognition home device
CN104980839A (en) * 2015-07-08 2015-10-14 广东翼卡车联网服务有限公司 Vehicle-mounted intelligent Bluetooth handset and implementation method thereof
CN105741840A (en) * 2016-04-06 2016-07-06 北京云知声信息技术有限公司 Voice control method and device
CN108022592A (en) * 2018-01-17 2018-05-11 深圳市沃特沃德股份有限公司 Voice remote control method and device
CN108492545A (en) * 2018-03-15 2018-09-04 北京金山安全软件有限公司 Infrared remote control method, device, remote control equipment and medium

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106612368B (en) * 2015-10-27 2019-11-05 比亚迪股份有限公司 Speech collecting system, voice collector and its information transmission and control method
US9912977B2 (en) * 2016-02-04 2018-03-06 The Directv Group, Inc. Method and system for controlling a user receiving device using voice commands
CN108737229A (en) * 2018-05-08 2018-11-02 芜湖琅格信息技术有限公司 A kind of intelligent domestic system based on voice control
CN108966162A (en) * 2018-06-27 2018-12-07 努比亚技术有限公司 Data communications method, communication processing equipment, terminal and readable storage medium storing program for executing
CN108989163A (en) * 2018-08-22 2018-12-11 歌尔科技有限公司 A kind of smart machine control method, device and smart machine

Also Published As

Publication number Publication date
CN109995475A (en) 2019-07-09

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant