CN116312472A

CN116312472A - Method and device for designing a robot speech script group, computer equipment and storage medium

Info

Publication number: CN116312472A
Application number: CN202211097504.0A
Authority: CN
Inventors: 姚凡
Original assignee: OneConnect Financial Technology Co Ltd Shanghai
Current assignee: OneConnect Financial Technology Co Ltd Shanghai
Priority date: 2022-09-08
Filing date: 2022-09-08
Publication date: 2023-06-23

Abstract

The application relates to the technical field of voice interaction, and in particular discloses a method, a device, computer equipment and a storage medium for designing a speech script group of a robot. The method comprises: acquiring a preset speech script group and converting it into a speech script code group; generating an initialization tag of a target user from the target user's basic information, and generating a dynamic tag of the target user from the target user's voice information; and determining a target speech script group for the target user based on the initialization tag, the dynamic tag and the speech script code group. In this way, the user's initialization tag and dynamic tag are matched against the preset speech script group, so that a speech script group is designed for each target user based on that user's tags. This raises the degree of intelligence of robot speech script group design and solves the technical problem that this degree of intelligence is low.

Description

Method and device for designing a robot speech script group, computer equipment and storage medium

Technical Field

The present application relates to the field of voice interaction technologies, and in particular to a method and a device for designing a speech script group of a robot, a computer device, and a storage medium.

Background

When communicating with users, existing voice-dialogue robots perform fixed voice broadcasts and jumps according to the robot flow configured on the robot's back-end management platform. The same robot is selected and configured to make outbound calls to an entire group of users, so every user experiences the same flow, and the broadcast flow varies only with the intent the user feeds back. The robot the user encounters is therefore mechanical and cannot truly achieve the effect of person-to-person interaction. On receiving a robot call, the user soon realizes that he or she is talking to a robot and hangs up, which greatly reduces the outbound-call success rate of enterprises.

Throughout the robot outbound-call flow, a user currently has two kinds of user tags, namely initialization tags and dynamic tags. An initialization tag is a tag of the user that does not change, such as the user's gender, name, or year and month of birth. A dynamic tag is a tag generated while the user communicates with the robot, such as the user's company, place of residence, marital status, children and other dynamic information. Most outbound-call robots make poor use of these tags. If the user tags were used so that the robot broadcast different speech for different users, the intelligence of the robot could be improved. Therefore, how to overcome the low degree of intelligence of robot speech script group design has become a problem in urgent need of a solution.

Disclosure of Invention

The application provides a method, a device, computer equipment and a storage medium for designing a speech script group of a robot, so as to solve the technical problem that the degree of intelligence of robot speech script group design is low.

In a first aspect, the present application provides a method for designing a speech script group of a robot, the method comprising: acquiring a preset speech script group, and converting the preset speech script group into a speech script code group; generating an initialization tag of a target user according to basic information of the target user, and generating a dynamic tag of the target user according to voice information of the target user; and determining a target speech script group of the target user based on the initialization tag, the dynamic tag and the speech script code group.

Further, determining the target speech script group of the target user based on the initialization tag, the dynamic tag and the speech script code group comprises:

converting the initialization tag into an initialization tag code, and converting the dynamic tag into a dynamic tag code;

combining the initialization tag code and the dynamic tag code to generate a target tag code, and determining the speech script code corresponding to the target tag code in the speech script code group to serve as a target speech script group code;

and inversely converting the target speech script group code into speech scripts to obtain the target speech script group.

Further, after acquiring the voice information of the target user and taking the voice information as the dynamic tag of the target user, the method further comprises:

acquiring a preset text information base, and generating user emotion keywords based on the preset text information base;

and determining emotion information of the target user by capturing the target user's emotion keywords in text input information, and taking the emotion information of the target user as the dynamic tag.

Further, the method further comprises:

acquiring a preset voice library, and converting the preset voice library into voice codes;

and matching the dynamic tag with the voice codes based on the emotion information of the target user to generate a voice matching result, and determining a target voice library based on the voice matching result.

Further, before determining the target speech script group of the target user based on the initialization tag, the dynamic tag and the speech script code group, the method further comprises:

acquiring an interaction record with the target user, and determining the service requirement of the target user based on the interaction record;

and determining the target speech script group by matching the service requirement with the speech script code group.

Further, the method further comprises:

setting a priority for each speech script in the preset speech script group;

and if the target speech script group cannot be determined from the matching results of the initialization tag and the dynamic tag, determining the target speech script group according to the priority.

Further, the speech script group design method further comprises:

acquiring a preset sensitive word library, matching the preset speech script group against the preset sensitive word library, and filtering the matched sensitive words out of the preset speech script group.

In a second aspect, the present application further provides a speech script group design device for a robot, the speech script group design device comprising:

a speech script code group generation module, used for acquiring a preset speech script group and converting the preset speech script group into a speech script code group;

a tag acquisition module, used for generating an initialization tag of the target user according to the basic information of the target user and generating a dynamic tag of the target user according to the voice information of the target user;

and a speech script group design module, used for determining a target speech script group of the target user based on the initialization tag, the dynamic tag and the speech script code group.

In a third aspect, the present application also provides a computer device comprising a memory and a processor; the memory is used for storing a computer program; the processor is configured to execute the computer program and, when executing the computer program, to implement the method for designing a speech script group of a robot as described above.

In a fourth aspect, the present application also provides a computer readable storage medium storing a computer program which, when executed by a processor, causes the processor to implement the method for designing a speech script group of a robot as described above.

The application discloses a method, a device, computer equipment and a storage medium for designing a speech script group of a robot. The method acquires a preset speech script group and converts it into a speech script code group; generates an initialization tag of a target user according to basic information of the target user, and generates a dynamic tag of the target user according to voice information of the target user; and determines a target speech script group of the target user based on the initialization tag, the dynamic tag and the speech script code group. In this way, the user's initialization tag and dynamic tag are matched against the preset speech script group, so that a speech script group is designed for each target user based on that user's tags. This raises the degree of intelligence of robot speech script group design and solves the technical problem that this degree of intelligence is low.

Drawings

In order to illustrate the technical solutions of the embodiments of the present application more clearly, the drawings needed in the description of the embodiments are briefly introduced below. Evidently, the drawings described below show some embodiments of the present application, and a person skilled in the art may obtain other drawings from them without inventive effort.

FIG. 1 is a schematic flow chart of a first embodiment of a method for designing a speech script group of a robot provided in an embodiment of the present application;

FIG. 2 is a schematic flow chart of a second embodiment of a method for designing a speech script group of a robot provided in an embodiment of the present application;

FIG. 3 is a schematic flow chart of a third embodiment of a method for designing a speech script group of a robot provided in an embodiment of the present application;

FIG. 4 is a schematic flow chart of a fourth embodiment of a method for designing a speech script group of a robot provided in an embodiment of the present application;

FIG. 5 is a schematic block diagram of a speech script group design device for a robot provided in an embodiment of the present application;

FIG. 6 is a schematic block diagram of a computer device according to an embodiment of the present application.

Detailed Description

The following description of the embodiments of the present application will be made clearly and fully with reference to the accompanying drawings, in which it is evident that the embodiments described are some, but not all, of the embodiments of the present application. All other embodiments, which can be made by one of ordinary skill in the art without undue burden from the present disclosure, are within the scope of the present disclosure.

The flow diagrams depicted in the figures are merely illustrative and not necessarily all of the elements and operations/steps are included or performed in the order described. For example, some operations/steps may be further divided, combined, or partially combined, so that the order of actual execution may be changed according to actual situations.

It is to be understood that the terminology used in the description of the present application is for the purpose of describing particular embodiments only and is not intended to be limiting of the application. As used in this specification and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise.

It should also be understood that the term "and/or" as used in this specification and the appended claims refers to any and all possible combinations of one or more of the associated listed items, and includes such combinations.

The embodiments of the application provide a method, a device, computer equipment and a storage medium for designing a speech script group of a robot. The method can be applied to a server, and raises the degree of intelligence of robot speech script group design by matching the tags of users with speech script groups. The server may be an independent server or a server cluster.

Some embodiments of the present application are described in detail below with reference to the accompanying drawings. The following embodiments and features of the embodiments may be combined with each other without conflict.

Referring to FIG. 1, FIG. 1 is a schematic flow chart of a first embodiment of a method for designing a speech script group of a robot according to an embodiment of the present application. The method can be applied to a server and is used for matching the tags of a user with speech script groups so as to raise the degree of intelligence of robot speech script group design.

As shown in FIG. 1, the method for designing a speech script group of a robot specifically includes steps S101 to S103.

S101, acquiring a preset speech script group, and converting the preset speech script group into a speech script code group;

S102, generating an initialization tag of a target user according to basic information of the target user, and generating a dynamic tag of the target user according to voice information of the target user;

S103, determining a target speech script group of the target user based on the initialization tag, the dynamic tag and the speech script code group.
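Step S102 can be pictured with a small sketch. The source does not say how tags are derived, so everything below is an assumption made for illustration: the `UserTags` container, the field names, and the cue phrases in `DYNAMIC_CUES` are hypothetical, and a real system would sit behind speech recognition and language understanding rather than plain string matching.

```python
from dataclasses import dataclass, field


@dataclass
class UserTags:
    """Tags for one target user (field names are illustrative)."""
    initialization: dict = field(default_factory=dict)  # fixed facts, e.g. gender
    dynamic: dict = field(default_factory=dict)         # facts learned during the call


# Hypothetical cue phrases that introduce a dynamic tag in transcribed speech.
DYNAMIC_CUES = {
    "residence": "i live in ",
    "company": "i work at ",
}


def build_initialization_tags(basic_info: dict) -> dict:
    """Keep only attributes that do not change, as in the background section."""
    keys = ("gender", "name", "birth_year_month")
    return {k: basic_info[k] for k in keys if k in basic_info}


def extract_dynamic_tags(transcript: str) -> dict:
    """Pull dynamic tags out of transcribed speech by simple cue matching."""
    tags = {}
    lowered = transcript.lower()
    for tag, cue in DYNAMIC_CUES.items():
        start = lowered.find(cue)
        if start != -1:
            rest = transcript[start + len(cue):]
            # Cut the value at the next punctuation mark.
            for stop in ".,!?":
                rest = rest.split(stop)[0]
            tags[tag] = rest.strip()
    return tags


def generate_tags(basic_info: dict, transcript: str) -> UserTags:
    """S102: initialization tags from basic info, dynamic tags from speech."""
    return UserTags(build_initialization_tags(basic_info),
                    extract_dynamic_tags(transcript))
```

Fields that are not immutable attributes (a phone number, for instance) are ignored when the initialization tags are built, while the dynamic tags grow as the call proceeds.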

In this embodiment, the table structure of the speech script group is designed as follows:

a) Speech script group table:

CREATE TABLE public.aicsp_speech_group (
    id serial NOT NULL, -- primary key
    speech_group_code varchar(60) NULL, -- speech script group code
    group_name varchar(60) NULL, -- speech script group name
    description varchar(1000) NULL, -- description
    updated_date timestamp NOT NULL DEFAULT now(), -- update time
    updated_by varchar(100) NOT NULL DEFAULT 'ROOT'::character varying, -- updating user
    created_date timestamp NOT NULL DEFAULT now(), -- creation time
    created_by varchar(100) NOT NULL DEFAULT 'ROOT'::character varying, -- creating user
    app_id varchar(60) NULL, -- tenant
    scene varchar(60) NULL, -- scene
    CONSTRAINT pk_aicsp_speech_group PRIMARY KEY (id)
);

b) Speech script detail table:

CREATE TABLE public.aicsp_speech_detail (
    id serial NOT NULL, -- primary key
    speech_code varchar(60) NULL, -- speech script code
    speech varchar(1000) NULL, -- speech script text
    group_id int4 NOT NULL, -- id of the associated speech script group
    updated_date timestamp NOT NULL DEFAULT now(), -- update time
    created_date timestamp NOT NULL DEFAULT now(), -- creation time
    app_id varchar(60) NULL, -- tenant
    CONSTRAINT pk_aicsp_speech_detail PRIMARY KEY (id)
);

In a specific embodiment, the business content that users frequently consult within the current business scope is acquired from internet big data, and a large number of automatic reply speech scripts are formulated according to that content. The automatic reply speech scripts are then converted into codes, where each code bit represents one factor that influences the design of the speech script group.
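The idea of a code in which each position stands for one factor can be sketched as follows. The source does not disclose the actual code layout, so the three factors, their order, and the digit values in `FACTORS` are assumptions chosen only to make the example concrete.

```python
# Hypothetical layout: position 0 = gender, 1 = age band, 2 = emotion.
FACTORS = [
    ("gender", {"any": "0", "male": "1", "female": "2"}),
    ("age_band", {"any": "0", "under_30": "1", "30_plus": "2"}),
    ("emotion", {"any": "0", "calm": "1", "angry": "2"}),
]


def encode(factors: dict) -> str:
    """Build a code in which each position stands for one factor."""
    return "".join(table[factors.get(name, "any")] for name, table in FACTORS)


def decode(code: str) -> dict:
    """Invert encode(): map each code position back to its factor value."""
    result = {}
    for (name, table), digit in zip(FACTORS, code):
        reverse = {value: key for key, value in table.items()}
        result[name] = reverse[digit]
    return result


def build_code_group(preset_group: list) -> dict:
    """Convert a preset group of scripts into a {code: script text} mapping."""
    return {encode(item["factors"]): item["text"] for item in preset_group}
```

A fixed-width positional code keeps matching cheap, since a lookup is a single dictionary access, and it stays reversible, which the later inverse conversion back to text relies on.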

Speech script groups are introduced into robot editing: when a robot is edited, the speech script group is configured with different speech scripts according to the static tags and the dynamic tags generated by users. The robot configuration is extended so that different speech script interfaces are called according to the user tags. A user-tag acquisition node and a speech script interface call node are added to the robot flow; the corresponding broadcast function obtains the corresponding speech script content according to the user tag and the speech script group code, and the speech script for each user is configured according to the corresponding conditions.

In designing the preset speech script group, the following principles should be noted:

Opening remarks:

Keep them simple and easy to understand, and state the intent clearly. For example: "Hello (pause 1.5 s), I am an intelligent robot. May I take 20 seconds of your time to share some good news about winning customers and adding business opportunities?"

Win-back phrase:

It should highlight the promotional offer so that the user feels an impulse to buy. (After the opening remarks or the offer, a good win-back phrase can often do the work of an offer.) For example: "Replacing people with intelligent robots has now become a trend, and enterprises' traditional customer service and telemarketing models are both changing. Could you give me a chance to introduce this? It does no harm to know the current trend, does it?"

Transition phrase:

Used between the opening remarks and the offer.

This embodiment discloses a method, a device, computer equipment and a storage medium for designing a speech script group of a robot. The method acquires a preset speech script group and converts it into a speech script code group; generates an initialization tag of a target user according to basic information of the target user, and generates a dynamic tag of the target user according to voice information of the target user; and determines a target speech script group of the target user based on the initialization tag, the dynamic tag and the speech script code group. In this way, the user's initialization tag and dynamic tag are matched against the preset speech script group, so that a speech script group is designed for each target user based on that user's tags. This raises the degree of intelligence of robot speech script group design and solves the technical problem that this degree of intelligence is low.

Referring to FIG. 2, FIG. 2 is a schematic flow chart of a second embodiment of a method for designing a speech script group of a robot according to an embodiment of the present application. The method can be applied to a server and is used for matching the tags of a user with speech script groups so as to raise the degree of intelligence of robot speech script group design.

Based on the above embodiment, in this embodiment, step S103 specifically includes:

S1031, converting the initialization tag into an initialization tag code, and converting the dynamic tag into a dynamic tag code;

S1032, generating a target tag code by combining the initialization tag code and the dynamic tag code, and determining the speech script code corresponding to the target tag code in the speech script code group as a target speech script group code;

S1033, inversely converting the target speech script group code into speech scripts to obtain the target speech script group.
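Steps S1031 to S1033 can be read as encode, combine and look up, then decode. The sketch below follows that reading. The code tables, the concatenation rule, and the sample texts are all hypothetical, since the source names the steps but not the encoding.

```python
# Hypothetical code tables; the source does not specify the encoding.
INIT_CODES = {"male": "M", "female": "F"}
DYNAMIC_CODES = {"calm": "1", "angry": "2"}

# Code group: target tag code -> codes of the scripts to play, in order.
SCRIPT_CODE_GROUP = {
    "F1": ["OPEN_01", "OFFER_01"],
    "F2": ["SOOTHE_01", "OFFER_02"],
    "M1": ["OPEN_02", "OFFER_01"],
}

# Reverse table used to turn script codes back into script text.
SCRIPT_TEXT = {
    "OPEN_01": "Hello madam, this is your intelligent assistant.",
    "OPEN_02": "Hello sir, this is your intelligent assistant.",
    "OFFER_01": "May I take 20 seconds to introduce our service?",
    "OFFER_02": "We have a simpler plan that may suit you better.",
    "SOOTHE_01": "I am sorry for the trouble, and I will be brief.",
}


def target_script_group(init_tag: str, dynamic_tag: str) -> list:
    """Return the script texts selected for one user's pair of tags."""
    # S1031: convert each tag into its code.
    init_code = INIT_CODES[init_tag]
    dynamic_code = DYNAMIC_CODES[dynamic_tag]
    # S1032: combine the codes and look up the matching script codes.
    target_tag_code = init_code + dynamic_code
    target_codes = SCRIPT_CODE_GROUP.get(target_tag_code, [])
    # S1033: inversely convert the script codes into script text.
    return [SCRIPT_TEXT[code] for code in target_codes]
```

An empty result signals that no entry matches the combined tag code, which is the situation the priority-based fallback of the fourth embodiment addresses.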

In this embodiment, two pages are added to the speech script group configuration system, namely a speech script group page and a speech script detail page. They are mainly used to maintain different speech scripts according to user tags.

An external SPI function, getspecbycodeservicepost, is added. It is mainly called by the DM: within a dialogue flow, the DM calls the function according to different user tags to obtain different speech scripts.

The DM calls the SPI function (getspecbyCodeServiceParse) to obtain the speech script. SPI stands for Service Provider Interface, which is a service discovery mechanism. In essence, the fully qualified name of the interface's implementation class is configured in a file, and the service loader reads that configuration file and loads the implementation class. This allows the implementation class of an interface to be replaced dynamically at runtime.
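The loading mechanism described above, a configured fully qualified class name resolved at runtime, can be imitated in a few lines. This is only an analogue of the idea, not the Java service loader itself: `SPI_CONFIG` stands in for the configuration file, the interface key is made up, and a standard-library class is used as the "implementation" purely so that the sketch runs without extra files.

```python
import importlib

# Stand-in for an SPI configuration file: interface name -> fully qualified
# name of the implementation class. Both names are illustrative only.
SPI_CONFIG = {
    "text.Template": "string.Template",
}


def load_implementation(interface: str):
    """Resolve the configured class by its fully qualified name at runtime."""
    qualified_name = SPI_CONFIG[interface]
    module_name, _, class_name = qualified_name.rpartition(".")
    module = importlib.import_module(module_name)
    return getattr(module, class_name)
```

Changing the entry in `SPI_CONFIG` changes which class the caller receives without touching the calling code, which is the property the text attributes to SPI.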

Based on the embodiment shown in FIG. 1, after step S102, the method specifically includes:

In this embodiment, the interaction between the robot and the user is monitored for keywords in the user's voice input that clearly express emotion, such as "happy", "satisfied" or "refused"; such words express the user's current mood. The user emotion keywords in the robot's preset text information base are matched against these keywords, and when a match succeeds, the user's current emotion is taken as a dynamic tag of the user. For example, when the user is angry, a speech script group capable of soothing the user is then determined through the user's dynamic tag.
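A minimal sketch of this keyword matching follows, assuming the user's speech has already been transcribed to text. The keyword table and the two emotion labels are hypothetical stand-ins for the preset text information base.

```python
# Hypothetical keyword base: emotion keyword -> emotion label.
EMOTION_KEYWORDS = {
    "happy": "positive",
    "satisfied": "positive",
    "refuse": "negative",
    "angry": "negative",
}


def detect_emotion(utterance_text: str):
    """Return the emotion of the first keyword found in the text, else None."""
    lowered = utterance_text.lower()
    for keyword, emotion in EMOTION_KEYWORDS.items():
        if keyword in lowered:
            return emotion
    return None


def update_dynamic_tags(dynamic_tags: dict, utterance_text: str) -> dict:
    """Store a detected emotion as a dynamic tag; leave tags alone otherwise."""
    emotion = detect_emotion(utterance_text)
    if emotion is not None:
        dynamic_tags = {**dynamic_tags, "emotion": emotion}
    return dynamic_tags
```

Plain substring matching is used for brevity. It would also fire on "unhappy", so a practical system would need tokenization or negation handling on top of this.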

Based on the embodiment shown in FIG. 1, after step S102, the method further includes:

In a specific embodiment, based on the user emotion described in the above embodiment, different voices are selected for voice broadcasting according to the different emotions of different users. For example, when the user is agitated, a gentler voice can be selected for voice broadcasting to calm the user.

In addition, the preset voice library may also contain voices in multiple languages. When facing users of different nationalities and languages, the robot can, once the user's initialization tag has been identified, switch immediately to the user's native language or to an internationally common language, so that it can complete the interaction smoothly with users of different nationalities. This raises the degree of intelligence of the speech script group design.
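Voice selection by emotion and language might look like the sketch below. The voice codes, the attributes stored per voice, the emotion-to-style mapping, and the choice of English as the common fallback language are all assumptions for illustration.

```python
# Hypothetical voice library: voice code -> attributes of that voice.
VOICE_LIBRARY = {
    "V_ZH_GENTLE": {"language": "zh", "style": "gentle"},
    "V_ZH_NEUTRAL": {"language": "zh", "style": "neutral"},
    "V_EN_GENTLE": {"language": "en", "style": "gentle"},
    "V_EN_NEUTRAL": {"language": "en", "style": "neutral"},
}

# Assumed mapping from the emotion dynamic tag to a voice style.
STYLE_FOR_EMOTION = {"agitated": "gentle"}


def select_voice(language: str, emotion=None, fallback_language: str = "en"):
    """Pick a voice code by language (initialization tag) and emotion."""
    style = STYLE_FOR_EMOTION.get(emotion, "neutral")
    # Try the user's own language first, then the common fallback language.
    for lang in (language, fallback_language):
        for code, attrs in VOICE_LIBRARY.items():
            if attrs["language"] == lang and attrs["style"] == style:
                return code
    return None
```

The language comes from an initialization tag and the style from a dynamic tag, so the same lookup serves both the emotion case and the multi-language case described above.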

Referring to FIG. 3, FIG. 3 is a schematic flow chart of a third embodiment of a method for designing a speech script group of a robot according to an embodiment of the present application. The method can be applied to a server and is used for matching the tags of a user with speech script groups so as to raise the degree of intelligence of robot speech script group design.

Based on the embodiment shown in FIG. 1, before step S103, the method further includes:

S1021, acquiring an interaction record of the target user, and determining the service requirement of the target user based on the interaction record;

S1022, determining the target speech script group by matching the service requirement with the speech script code group.

In a specific embodiment, the user's usual service requirements are learned from the chat records between the user and the robot customer service, which deepens the understanding of the user's service requirements and lets the conversation get to the point more quickly. For example, in a payment collection process, if the user has failed to pay several times, the robot can switch to a sterner speech script and a corresponding voice library; in an outbound insurance sales process, different quotations can be offered according to each user's circumstances.
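Steps S1021 and S1022 can be illustrated with the payment collection example. The record format, the event name, the thresholds, and the group codes below are hypothetical; the source only states that a service requirement is derived from the interaction record and then matched.

```python
# Hypothetical code group keyed by the derived service requirement.
SCRIPT_GROUP_BY_REQUIREMENT = {
    "collection_strict": "GRP_COLLECT_STRICT",
    "collection_reminder": "GRP_COLLECT_SOFT",
    "general": "GRP_DEFAULT",
}


def service_requirement(interaction_records: list) -> str:
    """S1021: derive a coarse service requirement from past interactions.

    Each record is assumed to be a dict with an "event" field.
    """
    missed = sum(1 for r in interaction_records if r["event"] == "payment_missed")
    if missed >= 2:
        return "collection_strict"
    if missed == 1:
        return "collection_reminder"
    return "general"


def match_script_group(interaction_records: list) -> str:
    """S1022: match the service requirement to a script group code."""
    return SCRIPT_GROUP_BY_REQUIREMENT[service_requirement(interaction_records)]
```

A user with repeated missed payments is routed to the sterner group, while a user with no such history receives the default group.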

Referring to FIG. 4, FIG. 4 is a schematic flow chart of a fourth embodiment of a method for designing a speech script group of a robot according to an embodiment of the present application. The method can be applied to a server and is used for matching the tags of a user with speech script groups so as to raise the degree of intelligence of robot speech script group design.

S1023, setting a priority for each speech script in the preset speech script group;

S1024, if the target speech script group cannot be determined from the matching results of the initialization tag and the dynamic tag, determining the target speech script group according to the priority.
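The priority fallback of S1023 and S1024 can be sketched as below. It assumes that "cannot be determined" covers both the case of no match and the case of several equally good matches, and that a smaller number means a higher priority; neither detail is stated in the source.

```python
def choose_script(scripts: list, init_code: str, dynamic_code: str):
    """Pick a script by tag match; fall back to priority when tags fail.

    Each script is a dict with "code", "init", "dynamic" and "priority".
    A lower priority number is assumed to mean a higher priority.
    """
    matches = [s for s in scripts
               if s["init"] == init_code and s["dynamic"] == dynamic_code]
    # Exactly one match: the tags alone determine the script.
    if len(matches) == 1:
        return matches[0]["code"]
    # No match or several matches: decide by priority instead.
    candidates = matches or scripts
    if not candidates:
        return None
    return min(candidates, key=lambda s: s["priority"])["code"]
```

With this rule the robot always has something to say as long as the preset group is not empty, even for a user whose tags match nothing.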

Based on all the above embodiments, the present embodiment further includes:

In this embodiment, a sensitive word library is set, and all speech scripts containing sensitive words are deleted from the preset speech scripts. This prevents the robot, while interacting with the user, from automatically replying with speech scripts that contain sensitive words, which could cause violations or user dissatisfaction and lead the robot's automatic replies off the proper track.
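The filtering described here amounts to dropping every preset entry that contains a listed word. A minimal sketch, assuming case-insensitive substring matching (the source does not specify the matching rule):

```python
def filter_scripts(preset_scripts: list, sensitive_words: set) -> list:
    """Drop every script that contains at least one sensitive word."""
    lowered_words = [word.lower() for word in sensitive_words]
    return [
        script for script in preset_scripts
        if not any(word in script.lower() for word in lowered_words)
    ]
```

Running the filter once over the preset group, before it is converted into codes, keeps the check out of the live call path.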

Referring to FIG. 5, FIG. 5 is a schematic block diagram of a speech script group design device for a robot according to an embodiment of the present application, the device being used for performing the aforementioned method for designing a speech script group of a robot. The device can be configured on a server.

As shown in FIG. 5, the speech script group design device 400 for a robot includes:

a speech script code group generation module 10, configured to acquire a preset speech script group and convert the preset speech script group into a speech script code group;

a tag acquisition module 20, configured to generate an initialization tag of a target user according to basic information of the target user, and generate a dynamic tag of the target user according to voice information of the target user;

a speech script group design module 30, configured to determine a target speech script group of the target user based on the initialization tag, the dynamic tag and the speech script code group.

Further, the speech script code group generation module 10 includes:

a speech script code group generation unit, used for acquiring a preset speech script group and converting the preset speech script group into a speech script code group.

Further, the tag acquisition module 20 includes:

a tag acquisition unit, used for generating an initialization tag of the target user according to the basic information of the target user, and generating a dynamic tag of the target user according to the voice information of the target user.

Further, the speech script group design module 30 includes:

a dynamic tag code generation unit, used for converting the initialization tag into an initialization tag code and converting the dynamic tag into a dynamic tag code;

a speech script code group determination unit, configured to combine the initialization tag code and the dynamic tag code to generate a target tag code, and determine, in the speech script code group, the speech script code corresponding to the target tag code as a target speech script group code;

and a target speech script group generation unit, used for inversely converting the target speech script group code into speech scripts to obtain the target speech script group.

Further, the speech script group design device 400 further includes a user emotion matching module, which specifically includes:

a user emotion keyword acquisition unit, used for acquiring a preset text information base and generating user emotion keywords based on the preset text information base;

and a dynamic tag generation unit, used for determining emotion information of the target user by capturing the target user's emotion keywords in text input information, and taking the emotion information of the target user as the dynamic tag.

Further, the speech script group design device 400 further includes a voice matching module, which specifically includes:

a voice code generation unit, used for acquiring a preset voice library and converting the preset voice library into voice codes;

and a target voice library determination unit, used for matching the dynamic tag with the voice codes based on the emotion information of the target user, generating a voice matching result, and determining a target voice library based on the voice matching result.

Further, the speech script group design device 400 further includes a service requirement acquisition module, which specifically includes:

a service requirement acquisition unit, used for acquiring the interaction record of the target user and determining the service requirement of the target user based on the interaction record;

and a target speech script group determination unit, used for determining the target speech script group by matching the service requirement with the speech script code group.

Further, the speech script group design device 400 further includes a priority setting module, which specifically includes:

a priority setting unit, configured to set a priority for each speech script in the preset speech script group;

and a priority ranking unit, used for determining the target speech script group according to the priority if the target speech script group cannot be determined from the matching results of the initialization tag and the dynamic tag.

Further, the speech script group design device 400 further includes a sensitive word filtering module, which specifically includes:

a sensitive word filtering unit, used for acquiring a preset sensitive word library, matching the preset speech script group against the preset sensitive word library, and filtering the matched sensitive words out of the preset speech script group.

It should be noted that, for convenience and brevity of description, the specific working process of the apparatus and each module described above may refer to the corresponding process in the foregoing method embodiment, which is not described herein again.

The device described above may be implemented in the form of a computer program which is executable on a computer device as shown in FIG. 6.

Referring to FIG. 6, FIG. 6 is a schematic block diagram of a computer device according to an embodiment of the present application. The computer device may be a server.

With reference to FIG. 6, the computer device includes a processor, memory, and a network interface connected by a system bus, where the memory may include a non-volatile storage medium and an internal memory.

The non-volatile storage medium may store an operating system and a computer program. The computer program includes program instructions that, when executed, cause the processor to perform any one of the speech script group design methods for a robot.

The processor is used to provide computing and control capabilities to support the operation of the entire computer device.

The internal memory provides an environment for running the computer program stored in the non-volatile storage medium; when executed by the processor, the computer program causes the processor to perform any one of the speech script group design methods for a robot.

The network interface is used for network communication, such as transmitting assigned tasks. It will be appreciated by those skilled in the art that the structure shown in FIG. 6 is merely a block diagram of the part of the structure related to the present application and does not limit the computer device to which the present application may be applied; a particular computer device may include more or fewer components than shown, may combine certain components, or may have a different arrangement of components.

It should be appreciated that the processor may be a central processing unit (Central Processing Unit, CPU), but may also be other general purpose processors, digital signal processors (Digital Signal Processor, DSP), application specific integrated circuits (Application Specific Integrated Circuit, ASIC), field-programmable gate arrays (Field-Programmable Gate Array, FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, or the like. Wherein the general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.

In one embodiment, the processor is configured to run a computer program stored in the memory to implement the following steps:

acquiring a preset speech script group, and converting the preset speech script group into a speech script code group;

generating an initialization tag of a target user according to basic information of the target user, and generating a dynamic tag of the target user according to voice information of the target user;

and determining a target speech script group of the target user based on the initialization tag, the dynamic tag and the speech script code group.

In one embodiment, when determining the target speech script group of the target user based on the initialization tag, the dynamic tag and the speech script code group, the processor is configured to implement:

In one embodiment, after acquiring the voice information of the target user and taking the voice information as the dynamic tag of the target user, the processor is further configured to implement:

In one embodiment, before determining the target speech script group of the target user based on the initialization tag, the dynamic tag and the speech script code group, the processor is further configured to implement:

setting a priority for each speech script in the preset speech script group;

In one embodiment, the processor is further configured to implement, as part of the speech script group design method:

The embodiments of the application also provide a computer readable storage medium storing a computer program. The computer program comprises program instructions, and a processor executes the program instructions to implement any one of the methods for designing a speech script group of a robot provided by the embodiments of the application.

The computer readable storage medium may be an internal storage unit of the computer device according to the foregoing embodiment, for example, a hard disk or a memory of the computer device. The computer readable storage medium may also be an external storage device of the computer device, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), or the like, which are provided on the computer device.

While the invention has been described with reference to certain preferred embodiments, it will be understood by those skilled in the art that various changes and substitutions of equivalents may be made and equivalents will be apparent to those skilled in the art without departing from the scope of the invention. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims

1. A method for designing a speech script group of a robot, the method comprising:

2. The method for designing a speech script group of a robot according to claim 1, wherein determining the target speech script group of the target user based on the initialization tag, the dynamic tag and the speech script code group comprises:

3. The method for designing a speech script group of a robot according to claim 1, wherein after acquiring the voice information of the target user and taking the voice information as the dynamic tag of the target user, the method further comprises:

4. The method for designing a speech script group of a robot according to claim 3, wherein after acquiring the voice information of the target user and taking the voice information as the dynamic tag of the target user, the method further comprises:

5. The method for designing a speech script group of a robot according to claim 1, wherein before determining the target speech script group of the target user based on the initialization tag, the dynamic tag and the speech script code group, the method further comprises:

6. The method for designing a speech script group of a robot according to claim 1, wherein before determining the target speech script group of the target user based on the initialization tag, the dynamic tag and the speech script code group, the method further comprises:

setting a priority for each speech script in the preset speech script group;

7. The method for designing a speech script group of a robot according to any one of claims 1 to 6, further comprising:

8. A speech script group design device for a robot, the speech script group design device comprising:

9. A computer device, the computer device comprising a memory and a processor;

the memory is used for storing a computer program;

the processor is used for executing the computer program and, when executing the computer program, implementing the method for designing a speech script group of a robot according to any one of claims 1 to 7.

10. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program which, when executed by a processor, causes the processor to implement the method for designing a speech script group of a robot according to any one of claims 1 to 7.