CN108172225A - Voice interactive method and robot, computer readable storage medium, terminal - Google Patents
- Publication number
- CN108172225A (application number CN201711442348.6A)
- Authority
- CN
- China
- Prior art keywords
- voice
- interactive
- answer sentence
- user
- word
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3329—Natural language query formulation or dialogue systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G07—CHECKING-DEVICES
- G07F—COIN-FREED OR LIKE APPARATUS
- G07F9/00—Details other than those peculiar to special kinds or types of apparatus
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Computational Linguistics (AREA)
- General Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Data Mining & Analysis (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Manipulator (AREA)
Abstract
A voice interaction method and robot, a computer-readable storage medium, and a terminal. The method includes: performing face recognition on a preset monitoring area; and, when a face image is recognized, conducting a dialogue with the user corresponding to the face image and performing the corresponding operation. This scheme can make voice interaction more intelligent and improve the user experience.
Description
Technical Field
The present invention relates to the field of debugging technology, and in particular to a voice interaction method and robot, a computer-readable storage medium, and a terminal.
Background
Self-service terminal devices are widely used in many fields, such as finance, transportation, medical care, mobile communications, and catering. Users can handle a variety of self-service transactions on such a device without queuing at a service window, which saves human resources and makes transaction handling more efficient.
To handle a self-service transaction, the user places the corresponding media card on the self-service terminal.
However, existing voice interaction methods suffer from a low degree of intelligence, which seriously degrades the user experience.
Summary of the Invention
The technical problem solved by the embodiments of the present invention is how to make voice interaction more intelligent and improve the user experience.
To solve the above problem, an embodiment of the present invention provides a voice interaction method, the method comprising:
Performing face recognition on a preset monitoring area;
When a face image is recognized, conducting a dialogue with the user corresponding to the face image and performing the corresponding operation.
Optionally, conducting a dialogue with the user corresponding to the face image and performing the corresponding operation comprises:
Obtaining the speech input by the user;
Recognizing the speech input by the user to obtain the corresponding text;
Determining whether the recognized text is an imperative statement or a non-imperative statement;
When the recognized text is determined to be an imperative statement, performing the operation corresponding to the imperative statement.
Optionally, when the recognized text is determined to be a non-imperative statement, the method further comprises:
When a corresponding answer sentence is matched from a preset corpus database, outputting the matched answer sentence as speech.
Optionally, when no corresponding answer sentence is matched from the preset corpus database, the method further comprises:
Determining the corresponding answer sentence by web search, and outputting the determined answer sentence as speech.
An embodiment of the present invention further provides a voice interaction robot, the robot comprising:
A face recognition unit, adapted to perform face recognition on a preset monitoring area;
An interactive operation unit, adapted to, when a face image is recognized, conduct a dialogue with the user corresponding to the face image and perform the corresponding operation.
Optionally, the interactive operation unit is adapted to: obtain the speech input by the user; recognize the speech input by the user to obtain the corresponding text; determine whether the recognized text is an imperative statement or a non-imperative statement; and, when the recognized text is determined to be an imperative statement, perform the operation corresponding to the imperative statement.
Optionally, the interactive operation unit is further adapted to, when the recognized text is determined to be a non-imperative statement and a corresponding answer sentence is matched from a preset corpus database, output the matched answer sentence as speech.
Optionally, the interactive operation unit is further adapted to, when no corresponding answer sentence is matched from the preset corpus database, determine the corresponding answer sentence by web search and output the determined answer sentence as speech.
An embodiment of the present invention further provides a computer-readable storage medium storing computer instructions that, when run, perform the steps of any of the voice interaction methods described above.
An embodiment of the present invention further provides a terminal comprising a memory and a processor, the memory storing computer instructions that can run on the processor, wherein the processor, when running the computer instructions, performs the steps of any of the voice interaction methods described above.
Compared with the prior art, the technical solution of the present invention has the following advantages:
In the above scheme, face recognition is performed on a preset monitoring area and, when a face image is recognized, a dialogue is conducted with the user corresponding to the face image and the corresponding operation is performed. This can make the robot's service more intelligent and improve the user experience.
Further, when no corresponding answer sentence is matched from the preset corpus database, the corresponding answer sentence is determined by web search and output as speech, which can further improve the accuracy of voice interaction and the user experience.
Brief Description of the Drawings
Fig. 1 is a flowchart of a voice interaction method in an embodiment of the present invention;
Fig. 2 is a flowchart of another voice interaction method in an embodiment of the present invention;
Fig. 3 is a schematic structural diagram of a voice interaction robot in an embodiment of the present invention.
Detailed Description
To solve the above problem in the prior art, the technical solution adopted by the embodiments of the present invention performs face recognition on a preset monitoring area and, when a face image is recognized, conducts a dialogue with the user corresponding to the face image and performs the corresponding operation. This can make the robot's service more intelligent and improve the user experience.
To make the above objects, features, and advantages of the present invention easier to understand, specific embodiments of the present invention are described in detail below with reference to the accompanying drawings.
Fig. 1 shows a flowchart of a voice interaction method in an embodiment of the present invention. The voice interaction method shown in Fig. 1 may include the following operations:
Step S101: Perform face recognition on a preset monitoring area.
Step S102: When a face image is recognized, conduct a dialogue with the user corresponding to the face image and perform the corresponding operation.
In the above scheme, face recognition is performed on a preset monitoring area and, when a face image is recognized, a dialogue is conducted with the user corresponding to the face image and the corresponding operation is performed. This can make the robot's service more intelligent and improve the user experience.
The voice interaction method in the embodiments of the present invention is described in further detail below with reference to Fig. 2.
Fig. 2 shows a flowchart of another voice interaction method in an embodiment of the present invention. Referring to Fig. 2, a voice interaction method, suitable for voice interaction with a user, may include the following operations:
Step S201: Perform face recognition on a preset monitoring area.
In a specific implementation, the preset monitoring area is the area that the camera of the voice interaction robot can capture. The camera provided on the voice interaction robot captures images of the monitoring area, and face recognition is performed on the captured images to identify any face image.
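Step S201 can be sketched as a scan over captured frames. The patent does not name a face recognition algorithm, so the detector here is a hypothetical callable (`detect_faces`) supplied by the caller, and the face-box format and the "largest face wins" rule are illustrative assumptions rather than part of the described method.

```python
from typing import Callable, Iterable, List, Optional, Tuple

# Assumed representations: a frame is any image object, and a face box
# is (x, y, width, height). Neither is specified by the patent.
Frame = object
FaceBox = Tuple[int, int, int, int]


def first_frame_with_face(
    frames: Iterable[Frame],
    detect_faces: Callable[[Frame], List[FaceBox]],
) -> Optional[Tuple[Frame, FaceBox]]:
    """Scan frames of the monitoring area and return the first detected face.

    `detect_faces` is a hypothetical, injected detector that returns the
    face boxes found in one frame (an empty list if there are none).
    """
    for frame in frames:
        boxes = detect_faces(frame)
        if boxes:
            # Assumption: the largest box belongs to the nearest user.
            return frame, max(boxes, key=lambda box: box[2] * box[3])
    return None
```

Injecting the detector keeps the sketch independent of any particular vision library; a real deployment would pass in whatever detector the robot's camera pipeline provides.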
Step S202: When a face image is recognized, obtain the voice information input by the user corresponding to the face image.
In a specific implementation, when a face image is recognized in a captured image, the voice interaction robot can conduct voice interaction with the user corresponding to the recognized face image. Specifically, the voice interaction robot first obtains the voice information input by the user.
Step S203: Recognize the speech input by the user to obtain the corresponding text.
In a specific implementation, once the voice information input by the user has been obtained, the voice interaction robot can use a suitable speech recognition method to recognize the user's speech and convert it into the corresponding text.
Step S204: Determine whether the recognized text is an imperative statement. If so, step S205 is performed; otherwise, step S206 is performed.
In a specific implementation, once the text corresponding to the user's speech has been obtained by speech recognition, the voice interaction robot can judge the sentence type of the recognized text, classifying it as an imperative statement or a non-imperative statement.
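The sentence-type judgment in step S204 could be approximated with a keyword rule. The patent does not describe how the classification is done, so the command verbs below and the rule itself (first word, after an optional "please", must be a known command verb) are assumptions made purely for illustration.

```python
import re

# Hypothetical command verbs; the patent does not list any.
COMMAND_VERBS = ("open", "print", "check", "transfer", "cancel")


def is_imperative(text: str) -> bool:
    """Return True if the recognized text looks like a command."""
    words = re.findall(r"[a-z']+", text.lower())
    # Skip a leading politeness marker such as "please".
    if words and words[0] == "please":
        words = words[1:]
    return bool(words) and words[0] in COMMAND_VERBS
```

A production system would more likely use an intent classifier; the rule above only shows where the imperative/non-imperative branch sits in the flow.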
Step S205: Perform the operation corresponding to the imperative statement.
In a specific implementation, when the recognized text is an imperative statement, the voice interaction robot can determine the corresponding operation based on the correspondence between imperative statements and operations, and then perform that operation, thereby completing the operation the user instructed.
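The correspondence between imperative statements and operations in step S205 can be pictured as a lookup table. The sketch below assumes an exact match after normalizing case and whitespace; the two operations and their names are hypothetical placeholders, not operations named in the patent.

```python
from typing import Callable, Dict, Optional


def print_receipt() -> str:
    # Placeholder operation, assumed for illustration.
    return "receipt printed"


def check_balance() -> str:
    # Placeholder operation, assumed for illustration.
    return "balance shown"


# Hypothetical correspondence between imperative statements and operations.
OPERATIONS: Dict[str, Callable[[], str]] = {
    "print receipt": print_receipt,
    "check balance": check_balance,
}


def execute_command(text: str) -> Optional[str]:
    """Look up the operation for a recognized command and run it.

    Returns the operation's result, or None if no operation is registered.
    """
    key = " ".join(text.lower().split())  # normalize case and whitespace
    operation = OPERATIONS.get(key)
    return operation() if operation is not None else None
```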
In an embodiment of the present invention, the voice interaction method may further include:
Step S206: Determine whether a corresponding answer sentence is matched from the preset corpus database. If so, step S207 is performed; otherwise, step S208 is performed.
Step S207: Output the matched answer sentence as speech.
In a specific implementation, when the recognized text is determined to be a non-imperative statement, the voice interaction robot can retrieve the answer sentence corresponding to the recognized text from the preset corpus database, convert the retrieved answer sentence into speech, and output it, thereby conducting voice interaction with the user.
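Steps S206 and S207 amount to a lookup in the corpus database. A minimal sketch follows, assuming the corpus is an in-memory dictionary keyed by a normalized question; the normalization rule and the sample entry are assumptions, since the patent does not say how matching is performed. Text-to-speech output is left out.

```python
from typing import Dict, Optional


def normalize(question: str) -> str:
    """Lowercase, drop punctuation, and collapse whitespace."""
    kept = "".join(ch for ch in question.lower() if ch.isalnum() or ch.isspace())
    return " ".join(kept.split())


# Hypothetical corpus content, keyed by normalized question.
CORPUS: Dict[str, str] = {
    normalize("What are your opening hours?"): "We are open from 9 am to 5 pm.",
}


def match_answer(question: str, corpus: Dict[str, str] = CORPUS) -> Optional[str]:
    """Return the stored answer sentence, or None if nothing matches (S206)."""
    return corpus.get(normalize(question))
```

Exact matching on normalized text is the simplest choice; fuzzy or semantic matching would fit the same interface.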
In an embodiment of the present invention, the method further includes:
Step S208: Determine the corresponding answer sentence by web search, and output the determined answer sentence as speech.
In a specific implementation, when the recognized text is determined to be a non-imperative statement and no corresponding answer sentence is matched from the preset corpus database, the voice interaction robot can obtain an answer sentence corresponding to the text by web search, convert it into speech, and output it, thereby interacting with the user.
In a specific implementation, the corpus database can be built by self-learning, continuously enriching the questions and answers stored in it. For example, when a web search is performed to determine an answer sentence, the correspondence between the non-imperative statement and the answer sentence obtained by the search can be added to the corpus database. The next time the same non-imperative statement is recognized, the answer sentence can be obtained directly by querying the corpus database. This can further improve the efficiency and accuracy of retrieval, make voice interaction more intelligent, and improve the user experience.
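The web-search fallback of step S208 and the self-learning corpus can be combined in one small class. This is a sketch under stated assumptions: the search backend is a hypothetical callable passed in by the caller (the patent names no search service), the corpus lives in memory, and questions are matched after normalizing case and whitespace only.

```python
from typing import Callable, Dict, Optional


class LearningCorpus:
    """Corpus database that learns answers found by web search."""

    def __init__(self, search: Callable[[str], Optional[str]]) -> None:
        self._answers: Dict[str, str] = {}
        self._search = search  # hypothetical web-search callable

    def answer(self, question: str) -> Optional[str]:
        key = " ".join(question.lower().split())
        cached = self._answers.get(key)
        if cached is not None:
            return cached               # S207: matched in the corpus
        found = self._search(question)  # S208: fall back to web search
        if found is not None:
            self._answers[key] = found  # self-learning: store the pair
        return found
```

With this design, the second time the same question is asked the search callable is not invoked at all, which is the efficiency gain the paragraph above describes.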
The method in the embodiments of the present invention has been described in detail above; the corresponding apparatus is introduced below.
Fig. 3 shows the structure of a voice interaction robot in an embodiment of the present invention. Referring to Fig. 3, a voice interaction robot 30 may include a face recognition unit 301 and an interactive operation unit 302, wherein:
The face recognition unit 301 is adapted to perform face recognition on a preset monitoring area.
The interactive operation unit 302 is adapted to, when a face image is recognized, conduct a dialogue with the user corresponding to the face image and perform the corresponding operation.
In a specific implementation, the interactive operation unit 302 is adapted to: obtain the speech input by the user; recognize the speech input by the user to obtain the corresponding text; determine whether the recognized text is an imperative statement or a non-imperative statement; and, when the recognized text is determined to be an imperative statement, perform the operation corresponding to the imperative statement.
In a specific implementation, the interactive operation unit 302 is further adapted to, when the recognized text is determined to be a non-imperative statement and a corresponding answer sentence is matched from the preset corpus database, output the matched answer sentence as speech.
In a specific implementation, the interactive operation unit 302 is further adapted to, when no corresponding answer sentence is matched from the preset corpus database, determine the corresponding answer sentence by web search and output the determined answer sentence as speech.
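The two-unit structure of Fig. 3 might be modeled as follows. The class names mirror units 301 and 302, but every callable field (`detect`, `recognize_speech`, `is_imperative`, `execute`, `answer`) is a hypothetical hook introduced for illustration; the patent describes the units' responsibilities, not their interfaces.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class FaceRecognitionUnit:
    """Counterpart of unit 301: watches the monitoring area."""
    detect: Callable[[object], bool]  # hypothetical: True if a face is in the frame

    def face_present(self, frame: object) -> bool:
        return self.detect(frame)


@dataclass
class InteractiveOperationUnit:
    """Counterpart of unit 302: speech to text, then command or answer."""
    recognize_speech: Callable[[bytes], str]
    is_imperative: Callable[[str], bool]
    execute: Callable[[str], str]
    answer: Callable[[str], str]

    def handle(self, audio: bytes) -> str:
        text = self.recognize_speech(audio)
        if self.is_imperative(text):
            return self.execute(text)  # imperative statement: run the operation
        return self.answer(text)       # otherwise: reply with an answer sentence


@dataclass
class VoiceInteractionRobot:
    face_unit: FaceRecognitionUnit
    interaction_unit: InteractiveOperationUnit

    def step(self, frame: object, audio: bytes) -> Optional[str]:
        """Interact only when a face is recognized in the frame."""
        if not self.face_unit.face_present(frame):
            return None
        return self.interaction_unit.handle(audio)
```

Keeping the units as separate objects with injected behavior reflects the division of labor in Fig. 3 and lets each part be tested with simple stand-ins.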
An embodiment of the present invention further provides a computer-readable storage medium storing computer instructions that, when run, perform the steps of the voice interaction method. For the steps of the voice interaction method, refer to the preceding sections; they are not repeated here.
An embodiment of the present invention further provides a terminal comprising a memory and a processor, the memory storing computer instructions that can run on the processor, wherein the processor, when running the computer instructions, performs the steps of the voice interaction method. For the steps of the voice interaction method, refer to the preceding sections; they are not repeated here.
With the above scheme of the embodiments of the present invention, face recognition is performed on a preset monitoring area and, when a face image is recognized, a dialogue is conducted with the user corresponding to the face image and the corresponding operation is performed. This can make the robot's service more intelligent and improve the user experience.
Further, when no corresponding answer sentence is matched from the preset corpus database, the corresponding answer sentence is determined by web search and output as speech, which can further improve the accuracy of voice interaction and the user experience.
A person of ordinary skill in the art will understand that all or some of the steps of the methods in the above embodiments can be completed by a program instructing the relevant hardware. The program can be stored in a computer-readable storage medium, which may include ROM, RAM, a magnetic disk, an optical disc, and the like.
The method and system of the embodiments of the present invention have been described in detail above, but the present invention is not limited thereto. Any person skilled in the art can make various changes or modifications without departing from the spirit and scope of the present invention; therefore, the scope of protection of the present invention shall be subject to the scope defined by the claims.
Claims (10)
1. A voice interaction method, characterized by comprising:
Performing face recognition on a preset monitoring area;
When a face image is recognized, conducting a dialogue with the user corresponding to the face image and performing the corresponding operation.
2. The voice interaction method according to claim 1, characterized in that conducting a dialogue with the user corresponding to the face image and performing the corresponding operation comprises:
Obtaining the speech input by the user;
Recognizing the speech input by the user to obtain the corresponding text;
Determining whether the recognized text is an imperative statement or a non-imperative statement;
When the recognized text is determined to be an imperative statement, performing the operation corresponding to the imperative statement.
3. The voice interaction method according to claim 2, characterized in that, when the recognized text is determined to be a non-imperative statement, the method further comprises:
When a corresponding answer sentence is matched from a preset corpus database, outputting the matched answer sentence as speech.
4. The voice interaction method according to claim 3, characterized in that, when no corresponding answer sentence is matched from the preset corpus database, the method further comprises:
Determining the corresponding answer sentence by web search, and outputting the determined answer sentence as speech.
5. A voice interaction robot, characterized by comprising:
A face recognition unit, adapted to perform face recognition on a preset monitoring area;
An interactive operation unit, adapted to, when a face image is recognized, conduct a dialogue with the user corresponding to the face image and perform the corresponding operation.
6. The voice interaction robot according to claim 5, characterized in that the interactive operation unit is adapted to: obtain the speech input by the user; recognize the speech input by the user to obtain the corresponding text; determine whether the recognized text is an imperative statement or a non-imperative statement; and, when the recognized text is determined to be an imperative statement, perform the operation corresponding to the imperative statement.
7. The voice interaction robot according to claim 6, characterized in that the interactive operation unit is further adapted to, when the recognized text is determined to be a non-imperative statement and a corresponding answer sentence is matched from a preset corpus database, output the matched answer sentence as speech.
8. The voice interaction robot according to claim 7, characterized in that the interactive operation unit is further adapted to, when no corresponding answer sentence is matched from the preset corpus database, determine the corresponding answer sentence by web search and output the determined answer sentence as speech.
9. A computer-readable storage medium storing computer instructions, characterized in that the computer instructions, when run, perform the steps of the voice interaction method according to any one of claims 1 to 4.
10. A terminal, characterized by comprising a memory and a processor, the memory storing computer instructions that can run on the processor, wherein the processor, when running the computer instructions, performs the steps of the voice interaction method according to any one of claims 1 to 4.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711442348.6A CN108172225A (en) | 2017-12-27 | 2017-12-27 | Voice interactive method and robot, computer readable storage medium, terminal |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711442348.6A CN108172225A (en) | 2017-12-27 | 2017-12-27 | Voice interactive method and robot, computer readable storage medium, terminal |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108172225A true CN108172225A (en) | 2018-06-15 |
Family
ID=62521891
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711442348.6A Pending CN108172225A (en) | 2017-12-27 | 2017-12-27 | Voice interactive method and robot, computer readable storage medium, terminal |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108172225A (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108920639A (en) * | 2018-07-02 | 2018-11-30 | 北京百度网讯科技有限公司 | Context acquisition methods and equipment based on interactive voice |
CN109272673A (en) * | 2018-08-22 | 2019-01-25 | 深圳怡化电脑股份有限公司 | Financial self-service equipment and its working method |
CN110111784A (en) * | 2019-04-11 | 2019-08-09 | 苏宁云计算有限公司 | A kind of processing method and system of customer's remote assistance in night unmanned shop |
WO2020125252A1 (en) * | 2018-12-20 | 2020-06-25 | 达闼科技(北京)有限公司 | Robot conversation switching method and apparatus, and computing device |
CN111429924A (en) * | 2018-12-24 | 2020-07-17 | 同方威视技术股份有限公司 | Voice interaction method and device, robot and computer readable storage medium |
CN115101048A (en) * | 2022-08-24 | 2022-09-23 | 深圳市人马互动科技有限公司 | Science popularization information interaction method, device, system, interaction equipment and storage medium |
- 2017-12-27: application CN201711442348.6A filed in CN, published as CN108172225A, status Pending
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108920639A (en) * | 2018-07-02 | 2018-11-30 | 北京百度网讯科技有限公司 | Context acquisition methods and equipment based on interactive voice |
CN108920639B (en) * | 2018-07-02 | 2022-01-18 | 北京百度网讯科技有限公司 | Context obtaining method and device based on voice interaction |
CN109272673A (en) * | 2018-08-22 | 2019-01-25 | 深圳怡化电脑股份有限公司 | Financial self-service equipment and its working method |
WO2020125252A1 (en) * | 2018-12-20 | 2020-06-25 | 达闼科技(北京)有限公司 | Robot conversation switching method and apparatus, and computing device |
CN111429924A (en) * | 2018-12-24 | 2020-07-17 | 同方威视技术股份有限公司 | Voice interaction method and device, robot and computer readable storage medium |
CN110111784A (en) * | 2019-04-11 | 2019-08-09 | 苏宁云计算有限公司 | A kind of processing method and system of customer's remote assistance in night unmanned shop |
CN115101048A (en) * | 2022-08-24 | 2022-09-23 | 深圳市人马互动科技有限公司 | Science popularization information interaction method, device, system, interaction equipment and storage medium |
CN115101048B (en) * | 2022-08-24 | 2022-11-11 | 深圳市人马互动科技有限公司 | Science popularization information interaction method, device, system, interaction equipment and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108172225A (en) | Voice interactive method and robot, computer readable storage medium, terminal | |
CN110377911B (en) | Method and device for identifying intention under dialog framework | |
CN106600298B (en) | Power information system customer service knowledge base construction method based on work order data analysis | |
US11138250B2 (en) | Method and device for extracting core word of commodity short text | |
WO2019084810A1 (en) | Information processing method and terminal, and computer storage medium | |
CN109635117A (en) | A kind of knowledge based spectrum recognition user intention method and device | |
CN111428010B (en) | Man-machine intelligent question-answering method and device | |
EP3872652A2 (en) | Method and apparatus for processing video, electronic device, medium and product | |
CN106297777A (en) | A kind of method and apparatus waking up voice service up | |
CN110334241A (en) | Quality detecting method, device, equipment and the computer readable storage medium of customer service recording | |
CN110349564A (en) | Across the language voice recognition methods of one kind and device | |
WO2020047861A1 (en) | Method and device for generating ranking model | |
CN109344395A (en) | A kind of data processing method, device, server and storage medium | |
CN106294505B (en) | Answer feedback method and device | |
CN112966089A (en) | Problem processing method, device, equipment, medium and product based on knowledge base | |
CN114625923B (en) | Training method of video retrieval model, video retrieval method, device and equipment | |
US20220301547A1 (en) | Method for processing audio signal, method for training model, device and medium | |
CN114817478A (en) | Text-based question and answer method and device, computer equipment and storage medium | |
CN114090792A (en) | Document relation extraction method based on comparison learning and related equipment thereof | |
CN112560480B (en) | Task community discovery method, device, equipment and storage medium | |
US20230274161A1 (en) | Entity linking method, electronic device, and storage medium | |
EP4145306A1 (en) | Method and apparatus of processing data, electronic device, and medium | |
CN116226355A (en) | Intelligent customer service method, system, electronic equipment and readable storage medium | |
CN109684357A (en) | Information processing method and device, storage medium, terminal | |
CN110580899A (en) | Voice recognition method and device, storage medium and computing equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | Application publication date: 20180615 |