CN108172225A - Voice interactive method and robot, computer readable storage medium, terminal - Google Patents


Info

Publication number
CN108172225A
Authority
CN
China
Prior art keywords
voice
interactive
answer sentence
user
word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711442348.6A
Other languages
Chinese (zh)
Inventor
张家重
白喜阳
王玉奎
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Financial Information Technology Co Ltd
Original Assignee
Inspur Financial Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Financial Information Technology Co Ltd
Priority to CN201711442348.6A
Publication of CN108172225A
Legal status: Pending

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/332 Query formulation
    • G06F16/3329 Natural language query formulation or dialogue systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/951 Indexing; Web crawling techniques
    • G PHYSICS
    • G07 CHECKING-DEVICES
    • G07F COIN-FREED OR LIKE APPARATUS
    • G07F9/00 Details other than those peculiar to special kinds or types of apparatus
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Computational Linguistics (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Manipulator (AREA)

Abstract

A voice interaction method and robot, a computer-readable storage medium, and a terminal. The method includes: performing face recognition on a preset monitoring area; and, when a face image is recognized, conducting a dialogue with the user corresponding to the face image and performing the corresponding operation. This scheme can raise the degree of intelligence of voice interaction and improve the user experience.

Description

Voice interaction method and robot, computer-readable storage medium, and terminal
Technical field
The present invention relates to the field of debugging technology, and in particular to a voice interaction method and robot, a computer-readable storage medium, and a terminal.
Background
Self-service terminal devices are widely used in many fields, such as finance, transport, medical care, mobile communications, and catering. Users can handle a variety of self-service transactions through a self-service terminal device without queuing at a counter, which saves human resources and improves the efficiency of transaction handling.
When handling a self-service transaction, the user places the corresponding media card on the corresponding self-service terminal to carry out the transaction.
However, existing voice interaction methods suffer from a low degree of intelligence, which seriously affects the user experience.
Summary of the invention
The technical problem solved by the embodiments of the present invention is how to raise the degree of intelligence of voice interaction and improve the user experience.
To solve the above problem, an embodiment of the present invention provides a voice interaction method, the method including:
performing face recognition on a preset monitoring area; and
when a face image is recognized, conducting a dialogue with the user corresponding to the face image and performing the corresponding operation.
Optionally, conducting a dialogue with the user corresponding to the face image and performing the corresponding operation includes:
obtaining the voice input by the user;
recognizing the voice input by the user to obtain the corresponding text;
judging whether the recognized text is a command statement or a non-command statement; and
when the recognized text is determined to be a command statement, performing the operation corresponding to the command statement.
Optionally, when the recognized text is determined to be a non-command statement, the method further includes:
when a corresponding answer sentence is matched in a preset corpus database, outputting the matched answer sentence as voice.
Optionally, when no corresponding answer sentence is matched in the preset corpus database, the method further includes:
determining a corresponding answer sentence by web search, and outputting the determined answer sentence as voice.
An embodiment of the present invention further provides a voice interaction robot, the robot including:
a face recognition unit, adapted to perform face recognition on a preset monitoring area; and
an interactive operation unit, adapted to, when a face image is recognized, conduct a dialogue with the user corresponding to the face image and perform the corresponding operation.
Optionally, the interactive operation unit is adapted to: obtain the voice input by the user; recognize the voice input by the user to obtain the corresponding text; judge whether the recognized text is a command statement or a non-command statement; and, when the recognized text is determined to be a command statement, perform the operation corresponding to the command statement.
Optionally, the interactive operation unit is further adapted to, when the recognized text is determined to be a non-command statement and a corresponding answer sentence is matched in the preset corpus database, output the matched answer sentence as voice.
Optionally, the interactive operation unit is further adapted to, when no corresponding answer sentence is matched in the preset corpus database, determine a corresponding answer sentence by web search and output the determined answer sentence as voice.
An embodiment of the present invention further provides a computer-readable storage medium storing computer instructions which, when run, perform the steps of any of the voice interaction methods described above.
An embodiment of the present invention further provides a terminal including a memory and a processor, the memory storing computer instructions executable on the processor, wherein the processor, when running the computer instructions, performs the steps of any of the voice interaction methods described above.
Compared with the prior art, the technical solution of the present invention has the following advantages:
In the above scheme, face recognition is performed on a preset monitoring area and, when a face image is recognized, a dialogue is conducted with the user corresponding to the face image and the corresponding operation is performed. This can raise the level of intelligent service of the robot and improve the user experience.
Further, when no corresponding answer sentence is matched in the preset corpus database, a corresponding answer sentence is determined by web search and output as voice, which can further improve the accuracy of voice interaction and improve the user experience.
Brief description of the drawings
Fig. 1 is a flowchart of a voice interaction method in an embodiment of the present invention;
Fig. 2 is a flowchart of another voice interaction method in an embodiment of the present invention;
Fig. 3 is a schematic structural diagram of a voice interaction robot in an embodiment of the present invention.
Detailed description
To solve the above problems in the prior art, the technical solution adopted in the embodiments of the present invention performs face recognition on a preset monitoring area and, when a face image is recognized, conducts a dialogue with the user corresponding to the face image and performs the corresponding operation, which can raise the level of intelligent service of the robot and improve the user experience.
To make the above objects, features, and advantages of the present invention clearer and easier to understand, specific embodiments of the present invention are described in detail below with reference to the accompanying drawings.
Fig. 1 shows a flowchart of a voice interaction method in an embodiment of the present invention. The voice interaction method shown in Fig. 1 may specifically include the following operations:
Step S101: perform face recognition on a preset monitoring area.
Step S102: when a face image is recognized, conduct a dialogue with the user corresponding to the face image and perform the corresponding operation.
In the above scheme, face recognition is performed on a preset monitoring area and, when a face image is recognized, a dialogue is conducted with the user corresponding to the face image and the corresponding operation is performed. This can raise the level of intelligent service of the robot and improve the user experience.
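The two steps of Fig. 1 can be sketched roughly as follows. This is only an illustrative outline in Python, not the implementation described by the patent: the function name `interaction_cycle` and the three callables `capture_frame`, `detect_face`, and `converse` are hypothetical stand-ins for the camera, the face recognizer, and the dialogue logic, none of which the text specifies.

```python
from typing import Callable, Optional


def interaction_cycle(
    capture_frame: Callable[[], object],
    detect_face: Callable[[object], Optional[str]],
    converse: Callable[[str], str],
) -> Optional[str]:
    """Run one pass of steps S101 and S102 (all three callables are assumptions)."""
    frame = capture_frame()       # image of the preset monitoring area
    face_id = detect_face(frame)  # S101: None means no face image was recognized
    if face_id is None:
        return None               # nobody present, so no dialogue is started
    return converse(face_id)      # S102: talk to that user and perform the operation
```

Passing the detector and dialogue logic in as parameters keeps the sketch independent of any particular camera or recognition library.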
The voice interaction method in the embodiments of the present invention is described in further detail below with reference to Fig. 2.
Fig. 2 shows a flowchart of a voice interaction method in an embodiment of the present invention. Referring to Fig. 2, a voice interaction method, suitable for voice interaction with a user, may specifically include the following operations:
Step S201: perform face recognition on a preset monitoring area.
In a specific implementation, the preset monitoring area is the area that the camera of the voice interaction robot in the embodiment of the present invention can capture. The camera provided on the voice interaction robot can capture images of the monitoring area, and face recognition is performed on the captured images to identify the corresponding face image.
Step S202: when a face image is recognized, obtain the voice information input by the user corresponding to the face image.
In a specific implementation, when a face image is recognized in the captured image, the voice interaction robot in the embodiment of the present invention can conduct voice interaction with the user corresponding to the recognized face image. Specifically, the voice interaction robot first obtains the voice information input by the user.
Step S203: recognize the voice input by the user to obtain the corresponding text.
In a specific implementation, when the voice information input by the user has been obtained, the voice interaction robot in the embodiment of the present invention can use a corresponding speech recognition method to recognize the voice input by the user and convert it into the corresponding text.
Step S204: judge whether the recognized text is a command statement; if so, step S205 may be performed; otherwise, step S206 may be performed.
In a specific implementation, when the text corresponding to the user's voice has been obtained by speech recognition, the voice interaction robot in the embodiment of the present invention can judge the sentence type of the recognized text, classifying it as a command statement or a non-command statement.
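The text does not say how the sentence type is judged in step S204. As one possible illustration only, the sketch below treats the recognized text as a command (imperative) statement when it begins with a known command verb; the vocabulary in `COMMAND_PREFIXES` and the function name `is_command` are assumptions made for this example.

```python
# Hypothetical command vocabulary; the patent does not list any command words.
COMMAND_PREFIXES = ("open", "print", "query", "transfer")


def is_command(text: str) -> bool:
    """Return True when the recognized text looks like a command statement."""
    normalized = text.strip().lower()
    # str.startswith accepts a tuple and checks each prefix in turn.
    return normalized.startswith(COMMAND_PREFIXES)
```

A production system would more plausibly use an intent classifier; prefix matching is used here only because it is easy to follow.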
Step S205: perform the operation corresponding to the command statement.
In a specific implementation, when the recognized text is a command statement, the voice interaction robot in the embodiment of the present invention can determine the corresponding operation based on the correspondence between command statements and the operations to be performed, and perform the determined operation, thereby completing the operation instructed by the user.
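The correspondence between command statements and operations in step S205 can be pictured as a lookup table. The following sketch assumes a plain dictionary keyed by normalized command text; the two operations `print_receipt` and `show_balance` and the table `OPERATIONS` are invented for illustration and do not come from the patent.

```python
from typing import Callable, Dict, Optional


def print_receipt() -> str:
    return "receipt printed"


def show_balance() -> str:
    return "balance shown"


# Hypothetical correspondence between command statements and operations.
OPERATIONS: Dict[str, Callable[[], str]] = {
    "print receipt": print_receipt,
    "show balance": show_balance,
}


def execute_command(text: str) -> Optional[str]:
    """Look up and run the operation for a command; None if none is registered."""
    key = " ".join(text.lower().split())  # lowercase and collapse whitespace
    operation = OPERATIONS.get(key)
    return operation() if operation is not None else None
```

Returning `None` for an unregistered command leaves the caller free to decide what to do next, for example asking the user to repeat the request.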
In an embodiment of the present invention, the voice interaction method may further include:
Step S206: judge whether a corresponding answer sentence is matched in the preset corpus database; if so, step S207 may be performed; otherwise, step S208 may be performed.
Step S207: output the matched answer sentence as voice.
In a specific implementation, when the recognized text is determined to be a non-command statement, the voice interaction robot in the embodiment of the present invention can retrieve the answer sentence corresponding to the recognized text from the preset corpus database, convert the retrieved answer sentence into the corresponding voice, and output it, thereby conducting voice interaction with the user.
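A minimal sketch of the corpus lookup in steps S206 and S207 is given below. It assumes the corpus database is an in-memory dictionary from normalized questions to answer sentences and that matching is exact after normalization; the patent does not describe the storage or matching scheme, and the entry in `CORPUS` is made up. Text-to-speech output is left out.

```python
from typing import Dict, Optional


def normalize(question: str) -> str:
    """Lowercase, trim surrounding punctuation, and collapse whitespace."""
    return " ".join(question.lower().strip(" ?!.").split())


# Hypothetical corpus content for illustration only.
CORPUS: Dict[str, str] = {
    "what are your opening hours": "We are open from 9 am to 5 pm.",
}


def match_answer(question: str, corpus: Dict[str, str]) -> Optional[str]:
    """S206: return the matched answer sentence, or None when nothing matches."""
    return corpus.get(normalize(question))
```

A `None` result corresponds to the branch in which the method moves on to a web search.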
In an embodiment of the present invention, the method further includes:
Step S208: determine a corresponding answer sentence by web search, and output the determined answer sentence as voice.
In a specific implementation, when the recognized text is determined to be a non-command statement and no corresponding answer sentence is matched in the preset corpus database, the voice interaction robot in the embodiment of the present invention can obtain an answer sentence corresponding to the text by web search, convert the answer sentence obtained by the search into the corresponding voice, and output it, thereby interacting with the user.
In a specific implementation, the corpus database in the embodiments of the present invention can be built, and the questions and answers stored in it continuously enriched, through self-learning. For example, when a web search is performed to determine an answer sentence, the correspondence between the non-command statement and the answer sentence obtained by the search can be added to the corpus database. The next time the same non-command statement is recognized, the corresponding answer sentence can be obtained directly by querying the corpus database, which can further improve the efficiency and accuracy of voice retrieval in the embodiments of the present invention, raise the degree of intelligence of voice interaction, and improve the user experience.
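The web-search fallback and the self-learning behaviour described above amount to a cache that is filled on a miss. The sketch below assumes a dictionary-backed corpus and a caller-supplied `web_search` function; both are placeholders, since the text names no search service or storage format.

```python
from typing import Callable, Dict, Optional


def answer_with_fallback(
    question: str,
    corpus: Dict[str, str],
    web_search: Callable[[str], Optional[str]],
) -> Optional[str]:
    """Answer from the corpus; on a miss, search the web and remember the result."""
    key = " ".join(question.lower().split())  # simple normalization (an assumption)
    answer = corpus.get(key)
    if answer is None:
        answer = web_search(question)         # S208: fall back to web search
        if answer is not None:
            corpus[key] = answer              # self-learning: store the new pair
    return answer
```

With this arrangement, a repeated question is answered from the corpus without a second search, which is the efficiency gain the paragraph above describes.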
The method in the embodiments of the present invention has been described in detail above; the apparatus corresponding to the method is introduced below.
Fig. 3 shows the structure of a voice interaction robot in an embodiment of the present invention. Referring to Fig. 3, a voice interaction robot 30 may include a face recognition unit 301 and an interactive operation unit 302, wherein:
The face recognition unit 301 is adapted to perform face recognition on a preset monitoring area.
The interactive operation unit 302 is adapted to, when a face image is recognized, conduct a dialogue with the user corresponding to the face image and perform the corresponding operation.
In a specific implementation, the interactive operation unit 302 is adapted to: obtain the voice input by the user; recognize the voice input by the user to obtain the corresponding text; judge whether the recognized text is a command statement or a non-command statement; and, when the recognized text is determined to be a command statement, perform the operation corresponding to the command statement.
In a specific implementation, the interactive operation unit 302 is further adapted to, when the recognized text is determined to be a non-command statement and a corresponding answer sentence is matched in the preset corpus database, output the matched answer sentence as voice.
In a specific implementation, the interactive operation unit 302 is further adapted to, when no corresponding answer sentence is matched in the preset corpus database, determine a corresponding answer sentence by web search and output the determined answer sentence as voice.
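The two-unit structure of Fig. 3 could be modelled as two small classes composed by a robot object. The sketch below is one hypothetical arrangement: the class and method names are chosen for this example, and speech capture, speech recognition, and voice output are reduced to plain strings.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class FaceRecognitionUnit:
    detect: Callable[[object], Optional[str]]  # assumed detector: frame -> user id

    def recognize(self, frame: object) -> Optional[str]:
        return self.detect(frame)


@dataclass
class InteractiveOperationUnit:
    is_command: Callable[[str], bool]  # sentence-type judgement
    execute: Callable[[str], str]      # runs the operation for a command
    answer: Callable[[str], str]       # corpus lookup with web-search fallback

    def handle(self, text: str) -> str:
        return self.execute(text) if self.is_command(text) else self.answer(text)


@dataclass
class VoiceInteractionRobot:
    face_unit: FaceRecognitionUnit
    interaction_unit: InteractiveOperationUnit

    def step(self, frame: object, heard_text: str) -> Optional[str]:
        """Interact only when a face is recognized in the frame."""
        if self.face_unit.recognize(frame) is None:
            return None
        return self.interaction_unit.handle(heard_text)
```

Keeping the units separate mirrors the division in the description, so either unit could be replaced without touching the other.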
An embodiment of the present invention further provides a computer-readable storage medium storing computer instructions which, when run, perform the steps of the voice interaction method. For the steps of the voice interaction method, refer to the preceding description; they are not repeated here.
An embodiment of the present invention further provides a terminal including a memory and a processor, the memory storing computer instructions executable on the processor, wherein the processor, when running the computer instructions, performs the steps of the voice interaction method. For the steps of the voice interaction method, refer to the preceding description; they are not repeated here.
With the above scheme of the embodiments of the present invention, face recognition is performed on a preset monitoring area and, when a face image is recognized, a dialogue is conducted with the user corresponding to the face image and the corresponding operation is performed, which can raise the level of intelligent service of the robot and improve the user experience.
Further, when no corresponding answer sentence is matched in the preset corpus database, a corresponding answer sentence is determined by web search and output as voice, which can further improve the accuracy of voice interaction and improve the user experience.
A person of ordinary skill in the art will understand that all or some of the steps in the methods of the above embodiments can be completed by a program instructing the relevant hardware. The program can be stored in a computer-readable storage medium, and the storage medium may include ROM, RAM, a magnetic disk, an optical disc, and the like.
The method and system of the embodiments of the present invention have been described in detail above, but the present invention is not limited thereto. Any person skilled in the art may make various changes or modifications without departing from the spirit and scope of the present invention, and the scope of protection of the present invention shall therefore be that defined by the claims.

Claims (10)

1. A voice interaction method, characterized by including:
performing face recognition on a preset monitoring area; and
when a face image is recognized, conducting a dialogue with the user corresponding to the face image and performing the corresponding operation.
2. The voice interaction method according to claim 1, characterized in that conducting a dialogue with the user corresponding to the face image and performing the corresponding operation includes:
obtaining the voice input by the user;
recognizing the voice input by the user to obtain the corresponding text;
judging whether the recognized text is a command statement or a non-command statement; and
when the recognized text is determined to be a command statement, performing the operation corresponding to the command statement.
3. The voice interaction method according to claim 2, characterized in that, when the recognized text is determined to be a non-command statement, the method further includes:
when a corresponding answer sentence is matched in a preset corpus database, outputting the matched answer sentence as voice.
4. The voice interaction method according to claim 3, characterized in that, when no corresponding answer sentence is matched in the preset corpus database, the method further includes:
determining a corresponding answer sentence by web search, and outputting the determined answer sentence as voice.
5. A voice interaction robot, characterized by including:
a face recognition unit, adapted to perform face recognition on a preset monitoring area; and
an interactive operation unit, adapted to, when a face image is recognized, conduct a dialogue with the user corresponding to the face image and perform the corresponding operation.
6. The voice interaction robot according to claim 5, characterized in that the interactive operation unit is adapted to: obtain the voice input by the user; recognize the voice input by the user to obtain the corresponding text; judge whether the recognized text is a command statement or a non-command statement; and, when the recognized text is determined to be a command statement, perform the operation corresponding to the command statement.
7. The voice interaction robot according to claim 6, characterized in that the interactive operation unit is further adapted to, when the recognized text is determined to be a non-command statement and a corresponding answer sentence is matched in a preset corpus database, output the matched answer sentence as voice.
8. The voice interaction robot according to claim 7, characterized in that the interactive operation unit is further adapted to, when no corresponding answer sentence is matched in the preset corpus database, determine a corresponding answer sentence by web search and output the determined answer sentence as voice.
9. A computer-readable storage medium storing computer instructions, characterized in that the computer instructions, when run, perform the steps of the voice interaction method according to any one of claims 1 to 4.
10. A terminal, characterized by including a memory and a processor, the memory storing computer instructions executable on the processor, wherein the processor, when running the computer instructions, performs the steps of the voice interaction method according to any one of claims 1 to 4.
CN201711442348.6A 2017-12-27 2017-12-27 Voice interactive method and robot, computer readable storage medium, terminal Pending CN108172225A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711442348.6A CN108172225A (en) 2017-12-27 2017-12-27 Voice interactive method and robot, computer readable storage medium, terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711442348.6A CN108172225A (en) 2017-12-27 2017-12-27 Voice interactive method and robot, computer readable storage medium, terminal

Publications (1)

Publication Number Publication Date
CN108172225A true CN108172225A (en) 2018-06-15

Family

ID=62521891

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711442348.6A Pending CN108172225A (en) 2017-12-27 2017-12-27 Voice interactive method and robot, computer readable storage medium, terminal

Country Status (1)

Country Link
CN (1) CN108172225A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108920639A (en) * 2018-07-02 2018-11-30 北京百度网讯科技有限公司 Context acquisition methods and equipment based on interactive voice
CN109272673A (en) * 2018-08-22 2019-01-25 深圳怡化电脑股份有限公司 Financial self-service equipment and its working method
CN110111784A (en) * 2019-04-11 2019-08-09 苏宁云计算有限公司 A kind of processing method and system of customer's remote assistance in night unmanned shop
WO2020125252A1 (en) * 2018-12-20 2020-06-25 达闼科技(北京)有限公司 Robot conversation switching method and apparatus, and computing device
CN111429924A (en) * 2018-12-24 2020-07-17 同方威视技术股份有限公司 Voice interaction method and device, robot and computer readable storage medium
CN115101048A (en) * 2022-08-24 2022-09-23 深圳市人马互动科技有限公司 Science popularization information interaction method, device, system, interaction equipment and storage medium

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108920639A (en) * 2018-07-02 2018-11-30 北京百度网讯科技有限公司 Context acquisition methods and equipment based on interactive voice
CN108920639B (en) * 2018-07-02 2022-01-18 北京百度网讯科技有限公司 Context obtaining method and device based on voice interaction
CN109272673A (en) * 2018-08-22 2019-01-25 深圳怡化电脑股份有限公司 Financial self-service equipment and its working method
WO2020125252A1 (en) * 2018-12-20 2020-06-25 达闼科技(北京)有限公司 Robot conversation switching method and apparatus, and computing device
CN111429924A (en) * 2018-12-24 2020-07-17 同方威视技术股份有限公司 Voice interaction method and device, robot and computer readable storage medium
CN110111784A (en) * 2019-04-11 2019-08-09 苏宁云计算有限公司 A kind of processing method and system of customer's remote assistance in night unmanned shop
CN115101048A (en) * 2022-08-24 2022-09-23 深圳市人马互动科技有限公司 Science popularization information interaction method, device, system, interaction equipment and storage medium
CN115101048B (en) * 2022-08-24 2022-11-11 深圳市人马互动科技有限公司 Science popularization information interaction method, device, system, interaction equipment and storage medium

Similar Documents

Publication Publication Date Title
CN108172225A (en) Voice interactive method and robot, computer readable storage medium, terminal
CN110377911B (en) Method and device for identifying intention under dialog framework
CN106600298B (en) Power information system customer service knowledge base construction method based on work order data analysis
US11138250B2 (en) Method and device for extracting core word of commodity short text
WO2019084810A1 (en) Information processing method and terminal, and computer storage medium
CN109635117A (en) A kind of knowledge based spectrum recognition user intention method and device
CN111428010B (en) Man-machine intelligent question-answering method and device
EP3872652A2 (en) Method and apparatus for processing video, electronic device, medium and product
CN106297777A (en) A kind of method and apparatus waking up voice service up
CN110334241A (en) Quality detecting method, device, equipment and the computer readable storage medium of customer service recording
CN110349564A (en) Across the language voice recognition methods of one kind and device
WO2020047861A1 (en) Method and device for generating ranking model
CN109344395A (en) A kind of data processing method, device, server and storage medium
CN106294505B (en) Answer feedback method and device
CN112966089A (en) Problem processing method, device, equipment, medium and product based on knowledge base
CN114625923B (en) Training method of video retrieval model, video retrieval method, device and equipment
US20220301547A1 (en) Method for processing audio signal, method for training model, device and medium
CN114817478A (en) Text-based question and answer method and device, computer equipment and storage medium
CN114090792A (en) Document relation extraction method based on comparison learning and related equipment thereof
CN112560480B (en) Task community discovery method, device, equipment and storage medium
US20230274161A1 (en) Entity linking method, electronic device, and storage medium
EP4145306A1 (en) Method and apparatus of processing data, electronic device, and medium
CN116226355A (en) Intelligent customer service method, system, electronic equipment and readable storage medium
CN109684357A (en) Information processing method and device, storage medium, terminal
CN110580899A (en) Voice recognition method and device, storage medium and computing equipment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180615
