CN112511690A

CN112511690A - Interaction method, mobile terminal and storage medium

Info

Publication number: CN112511690A
Application number: CN202011267586.XA
Authority: CN
Inventors: 朱宁焱
Original assignee: Shanghai Chuanying Information Technology Co Ltd
Current assignee: Shanghai Chuanying Information Technology Co Ltd
Priority date: 2020-11-13
Filing date: 2020-11-13
Publication date: 2021-03-16

Abstract

The application relates to an interaction method, a mobile terminal and a storage medium. Applied to a mobile terminal or a server, comprising: acquiring portrait information of an information receiver; acquiring preference information of the information receiver according to the portrait information; and executing the operation corresponding to the preference information. The interaction method, the mobile terminal and the storage medium can acquire the preference information according to the portrait information of the information receiver and execute the operation corresponding to the preference information of the information receiver so as to optimize the use experience of the information receiver.

Description

Interaction method, mobile terminal and storage medium

Technical Field

The present application belongs to the field of communications technologies, and in particular, to an interaction method, a mobile terminal, and a storage medium.

Background

With the development and popularization of mobile terminals, mobile terminals bring more and more convenience to the life of people, and especially in the aspect of communication, users can perform various communication activities such as voice call, information receiving and sending (including text information and voice information) and the like through mobile terminals such as smart phones and smart watches.

In some implementations, the voice information can be presented in a voice form or converted into a text form, but the conversion can be completed only by manual operation of a user, and cannot be automatically completed according to the preference of the user; the presentation mode of the text information is single, and the text information can only be presented in a text form, so that the user requirement of receiving the information in a voice form is not met.

The foregoing description is provided for general background information and is not admitted to be prior art.

Disclosure of Invention

In view of the above technical problems, the present application provides an interaction method, a mobile terminal, and a storage medium to optimize the use experience of an information receiver.

The application provides an interaction method, which is applied to a mobile terminal or a server and comprises the following steps:

acquiring portrait information of an information receiver; acquiring preference information of the information receiver according to the portrait information; and executing the operation corresponding to the preference information.

Optionally, the portrait information of the information receiver includes at least one of sex, age, usage habit, relationship with the information sender, and usage scenario of the execution operation.

Optionally, the portrait information of the information receiver is obtained according to information of both information receivers and senders, and optionally, the information of both information receivers and senders includes at least one of label setting of the information receiver on the information sender, nickname remark between the information receiver and the information sender, chat content of the information receiver and the information sender, and personal setting information.

Optionally, the portrait information of the information receiver is updated according to an update frequency, and optionally, the update frequency includes a fixed frequency update or an acquisition process real-time update.

Optionally, the preference information of the information receiver includes at least one of receiving information in a text form and receiving information in a voice form.

Optionally, the performing an operation corresponding to the preference information includes at least one of:

if the preference information is received in a text form, directly sending or displaying the text information when the sending information or the receiving information is the text information;

if the preference information is received information in a text form, converting the voice information into corresponding text information and then transmitting or displaying the text information when the transmitted information or the received information is the voice information;

if the preference information is information received in a voice form, converting the text information into corresponding voice information and then transmitting or displaying the voice information when the transmitting information or the receiving information is the text information;

and if the preference information is information received in a voice form, directly sending or displaying the voice information when the sending information or the receiving information is the voice information.

Optionally, before the step of converting the text information into corresponding voice information and then sending or displaying the converted text information, the method includes:

acquiring tone preference information according to the portrait information of the information receiver;

and performing tone cloning and voice synthesis according to the tone preference information.

Optionally, the tone color preference information includes at least one of voice broadcast with a tone color of an information sender and voice broadcast with a favorite tone color of an individual.

Optionally, the interaction method further includes at least one of:

if the information receiver is in a state of avoiding noise, when the received information is voice information, the voice information is converted into corresponding text information and then is sent or displayed;

if the information receiver is in an eye using state, when the received information is the text information, the text information is converted into corresponding voice information and then is sent or displayed.

Optionally, the converting the voice information into corresponding text information and then sending or displaying the corresponding text information includes synchronously sending or displaying the voice information before conversion and the corresponding text information after conversion.

Optionally, the converting the text information into corresponding voice information and then sending or displaying the corresponding voice information includes synchronously sending or displaying the text information before conversion and the corresponding voice information after conversion.

Optionally, the converting the text information into corresponding voice information and then sending or displaying the corresponding voice information includes: and selecting corresponding volume to perform voice broadcast according to the portrait information of the information receiver.

The application also provides an interaction method applied to a system, wherein the system comprises at least one receiving end, at least one sending end and at least one server, and the method comprises the following steps:

the server acquires the portrait information of the receiving end and acquires the preference information of the receiving end according to the portrait information;

the server receives the interactive information sent by the sending end;

the server converts the interactive information sent by the sending end into corresponding text or voice information according to the preference information and then sends the text or voice information to the receiving end, or sends the preference information to the sending end so that the sending end converts the interactive information to be sent into corresponding text or voice information according to the preference information and then sends the text or voice information to the receiving end, or sends the preference information to the receiving end so that the receiving end converts the interactive information into corresponding text or voice information according to the preference information;

and the receiving end displays the converted text or voice information.

The application also provides a mobile terminal, which comprises a memory, a processor and a computer program stored in the memory and capable of running on the processor, wherein the processor implements the steps of the interaction method when executing the computer program.

The present application also provides a computer-readable storage medium, in which a computer program is stored, which, when being executed by a processor, realizes the steps of the above-mentioned interaction method.

The interaction method, the mobile terminal and the storage medium can acquire preference information according to portrait information of an information receiver and execute operation corresponding to the preference information of the information receiver so as to optimize use experience of the information receiver.

Drawings

The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present application and together with the description, serve to explain the principles of the application. In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings needed to be used in the description of the embodiments will be briefly described below, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without inventive exercise.

Fig. 1 is a schematic hardware structure diagram of a mobile terminal implementing various embodiments of the present application;

fig. 2 is a communication network system architecture diagram according to an embodiment of the present application;

fig. 3 is a schematic flowchart of an interaction method according to an embodiment of the present application;

fig. 4 is a schematic flowchart of an interaction method provided in the second embodiment of the present application;

fig. 5 is a schematic specific flowchart of an interaction method provided in the third embodiment of the present application;

fig. 6 is a schematic flowchart of an interaction method provided in the fourth embodiment of the present application;

fig. 7 is a schematic structural diagram of a mobile terminal according to a fifth embodiment of the present application.

The implementation, functional features and advantages of the objectives of the present application will be further explained with reference to the accompanying drawings. With the above figures, there are shown specific embodiments of the present application, which will be described in more detail below. These drawings and written description are not intended to limit the scope of the inventive concepts in any manner, but rather to illustrate the inventive concepts to those skilled in the art by reference to specific embodiments.

Detailed Description

Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The embodiments described in the following exemplary embodiments do not represent all embodiments consistent with the present application. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the present application, as detailed in the appended claims.

It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, the recitation of an element by the phrase "comprising an … …" does not exclude the presence of additional like elements in the process, method, article, or apparatus that comprises the element, and further, where similarly-named elements, features, or elements in different embodiments of the disclosure may have the same meaning, or may have different meanings, that particular meaning should be determined by their interpretation in the embodiment or further by context with the embodiment.

It should be understood that although the terms first, second, third, etc. may be used herein to describe various information, such information should not be limited to these terms. These terms are only used to distinguish one type of information from another. For example, first information may also be referred to as second information, and similarly, second information may also be referred to as first information, without departing from the scope herein. The word "if" as used herein may be interpreted as "at … …" or "when … …" or "in response to a determination", depending on the context. Also, as used herein, the singular forms "a", "an" and "the" are intended to include the plural forms as well, unless the context indicates otherwise. It will be further understood that the terms "comprises," "comprising," "includes" and/or "including," when used in this specification, specify the presence of stated features, steps, operations, elements, components, items, species, and/or groups, but do not preclude the presence, or addition of one or more other features, steps, operations, elements, components, species, and/or groups thereof. The terms "or," "and/or," "including at least one of the following," and the like, as used herein, are to be construed as inclusive or mean any one or any combination. For example, "includes at least one of: A. b, C "means" any of the following: a; b; c; a and B; a and C; b and C; a and B and C ", again for example," A, B or C "or" A, B and/or C "means" any of the following: a; b; c; a and B; a and C; b and C; a and B and C'. An exception to this definition will occur only when a combination of elements, functions, steps or operations are inherently mutually exclusive in some way.

It should be understood that, although the steps in the flowcharts in the embodiments of the present application are shown in order as indicated by the arrows, the steps are not necessarily performed in order as indicated by the arrows. The steps are not performed in the exact order shown and may be performed in other orders unless explicitly stated herein. Moreover, at least some of the steps in the figures may include multiple sub-steps or multiple stages that are not necessarily performed at the same time, but may be performed at different times, in different orders, and may be performed alternately or at least partially with respect to other steps or sub-steps of other steps.

The words "if", as used herein, may be interpreted as "at … …" or "at … …" or "in response to a determination" or "in response to a detection", depending on the context. Similarly, the phrases "if determined" or "if detected (a stated condition or event)" may be interpreted as "when determined" or "in response to a determination" or "when detected (a stated condition or event)" or "in response to a detection (a stated condition or event)", depending on the context.

It should be noted that step numbers such as S301 and S302 are used herein for the purpose of more clearly and briefly describing the corresponding contents, and do not constitute a substantial limitation on the sequence, and those skilled in the art may perform S302 first and then S301 in specific implementation, but these should be within the scope of the present application.

It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.

In the following description, suffixes such as "module", "component", or "unit" used to denote elements are used only for the convenience of description of the present application, and have no specific meaning in themselves. Thus, "module", "component" or "unit" may be used mixedly.

The mobile terminal may be implemented in various forms. For example, the mobile terminal described in the present application may include mobile terminals such as a mobile phone, a tablet computer, a notebook computer, a palmtop computer, a Personal Digital Assistant (PDA), a Portable Media Player (PMP), a navigation device, a wearable device, a smart band, a pedometer, and the like, and fixed terminals such as a Digital TV, a desktop computer, and the like.

The following description will be given taking a mobile terminal as an example, and it will be understood by those skilled in the art that the configuration according to the embodiment of the present application can be applied to a fixed type terminal in addition to elements particularly used for mobile purposes.

Referring to fig. 1, which is a schematic diagram of a hardware structure of a mobile terminal for implementing various embodiments of the present application, the mobile terminal 100 may include: RF (Radio Frequency) unit 101, WiFi module 102, audio output unit 103, a/V (audio/video) input unit 104, sensor 105, display unit 106, user input unit 107, interface unit 108, memory 109, processor 110, and power supply 111. Those skilled in the art will appreciate that the mobile terminal architecture shown in fig. 1 is not intended to be limiting of mobile terminals, which may include more or fewer components than those shown, or some components may be combined, or a different arrangement of components.

The following describes each component of the mobile terminal in detail with reference to fig. 1:

the radio frequency unit 101 may be configured to receive and transmit signals during information transmission and reception or during a call, and specifically, receive downlink information of a base station and then process the downlink information to the processor 110; in addition, the uplink data is transmitted to the base station. Typically, radio frequency unit 101 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low noise amplifier, a duplexer, and the like. In addition, the radio frequency unit 101 can also communicate with a network and other devices through wireless communication. The wireless communication may use any communication standard or protocol, including but not limited to GSM (Global System for Mobile communications), GPRS (General Packet Radio Service), CDMA2000(Code Division Multiple Access 2000), WCDMA (Wideband Code Division Multiple Access), TD-SCDMA (Time Division-Synchronous Code Division Multiple Access), FDD-LTE (Frequency Division duplex Long Term Evolution), and TDD-LTE (Time Division duplex Long Term Evolution).

WiFi belongs to short-distance wireless transmission technology, and the mobile terminal can help a user to receive and send e-mails, browse webpages, access streaming media and the like through the WiFi module 102, and provides wireless broadband internet access for the user. Although fig. 1 shows the WiFi module 102, it is understood that it does not belong to the essential constitution of the mobile terminal, and may be omitted entirely as needed within the scope not changing the essence of the invention.

The audio output unit 103 may convert audio data received by the radio frequency unit 101 or the WiFi module 102 or stored in the memory 109 into an audio signal and output as sound when the mobile terminal 100 is in a call signal reception mode, a call mode, a recording mode, a voice recognition mode, a broadcast reception mode, or the like. Also, the audio output unit 103 may also provide audio output related to a specific function performed by the mobile terminal 100 (e.g., a call signal reception sound, a message reception sound, etc.). The audio output unit 103 may include a speaker, a buzzer, and the like.

The a/V input unit 104 is used to receive audio or video signals. The a/V input Unit 104 may include a Graphics Processing Unit (GPU) 1041 and a microphone 1042, the Graphics processor 1041 Processing image data of still pictures or video obtained by an image capturing device (e.g., a camera) in a video capturing mode or an image capturing mode. The processed image frames may be displayed on the display unit 106. The image frames processed by the graphic processor 1041 may be stored in the memory 109 (or other storage medium) or transmitted via the radio frequency unit 101 or the WiFi module 102. The microphone 1042 may receive sounds (audio data) via the microphone 1042 in a phone call mode, a recording mode, a voice recognition mode, or the like, and may be capable of processing such sounds into audio data. The processed audio (voice) data may be converted into a format output transmittable to a mobile communication base station via the radio frequency unit 101 in case of a phone call mode. The microphone 1042 may implement various types of noise cancellation (or suppression) algorithms to cancel (or suppress) noise or interference generated in the course of receiving and transmitting audio signals.

The mobile terminal 100 also includes at least one sensor 105, such as a light sensor, a motion sensor, and other sensors. Optionally, the light sensor includes an ambient light sensor that may adjust the brightness of the display panel 1061 according to the brightness of ambient light, and a proximity sensor that may turn off the display panel 1061 and/or the backlight when the mobile terminal 100 is moved to the ear. As one of the motion sensors, the accelerometer sensor can detect the magnitude of acceleration in each direction (generally, three axes), can detect the magnitude and direction of gravity when stationary, and can be used for applications of recognizing the posture of a mobile phone (such as horizontal and vertical screen switching, related games, magnetometer posture calibration), vibration recognition related functions (such as pedometer and tapping), and the like; as for other sensors such as a fingerprint sensor, a pressure sensor, an iris sensor, a molecular sensor, a gyroscope, a barometer, a hygrometer, a thermometer, and an infrared sensor, which can be configured on the mobile phone, further description is omitted here.

The display unit 106 is used to display information input by a user or information provided to the user. The Display unit 106 may include a Display panel 1061, and the Display panel 1061 may be configured in the form of a Liquid Crystal Display (LCD), an Organic Light-Emitting Diode (OLED), or the like.

The user input unit 107 may be used to receive input numeric or character information and generate key signal inputs related to user settings and function control of the mobile terminal. Alternatively, the user input unit 107 may include a touch panel 1071 and other input devices 1072. The touch panel 1071, also referred to as a touch screen, may collect a touch operation performed by a user on or near the touch panel 1071 (e.g., an operation performed by the user on or near the touch panel 1071 using a finger, a stylus, or any other suitable object or accessory), and drive a corresponding connection device according to a predetermined program. The touch panel 1071 may include two parts of a touch detection device and a touch controller. Optionally, the touch detection device detects a touch orientation of a user, detects a signal caused by a touch operation, and transmits the signal to the touch controller; the touch controller receives touch information from the touch sensing device, converts the touch information into touch point coordinates, sends the touch point coordinates to the processor 110, and can receive and execute commands sent by the processor 110. In addition, the touch panel 1071 may be implemented in various types, such as a resistive type, a capacitive type, an infrared ray, and a surface acoustic wave. In addition to the touch panel 1071, the user input unit 107 may include other input devices 1072. Optionally, other input devices 1072 may include, but are not limited to, one or more of a physical keyboard, function keys (e.g., volume control keys, switch keys, etc.), a trackball, a mouse, a joystick, and the like, and are not limited thereto.

Further, the touch panel 1071 may cover the display panel 1061, and when the touch panel 1071 detects a touch operation thereon or nearby, the touch panel 1071 transmits the touch operation to the processor 110 to determine the type of the touch event, and then the processor 110 provides a corresponding visual output on the display panel 1061 according to the type of the touch event. Although the touch panel 1071 and the display panel 1061 are shown in fig. 1 as two separate components to implement the input and output functions of the mobile terminal, in some embodiments, the touch panel 1071 and the display panel 1061 may be integrated to implement the input and output functions of the mobile terminal, and is not limited herein.

The interface unit 108 serves as an interface through which at least one external device is connected to the mobile terminal 100. For example, the external device may include a wired or wireless headset port, an external power supply (or battery charger) port, a wired or wireless data port, a memory card port, a port for connecting a device having an identification module, an audio input/output (I/O) port, a video I/O port, an earphone port, and the like. The interface unit 108 may be used to receive input (e.g., data information, power, etc.) from external devices and transmit the received input to one or more elements within the mobile terminal 100 or may be used to transmit data between the mobile terminal 100 and external devices.

The memory 109 may be used to store software programs as well as various data. The memory 109 may mainly include a program storage area and a data storage area, and optionally, the program storage area may store an operating system, an application program (such as a sound playing function, an image playing function, and the like) required by at least one function, and the like; the storage data area may store data (such as audio data, a phonebook, etc.) created according to the use of the cellular phone, and the like. Further, the memory 109 may include high speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device.

The processor 110 is a control center of the mobile terminal, connects various parts of the entire mobile terminal using various interfaces and lines, and performs various functions of the mobile terminal and processes data by operating or executing software programs and/or modules stored in the memory 109 and calling data stored in the memory 109, thereby performing overall monitoring of the mobile terminal. Processor 110 may include one or more processing units; preferably, the processor 110 may integrate an application processor and a modem processor, optionally, the application processor mainly handles operating systems, user interfaces, application programs, etc., and the modem processor mainly handles wireless communications. It will be appreciated that the modem processor described above may not be integrated into the processor 110.

The mobile terminal 100 may further include a power supply 111 (e.g., a battery) for supplying power to various components, and preferably, the power supply 111 may be logically connected to the processor 110 via a power management system, so as to manage charging, discharging, and power consumption management functions via the power management system.

Although not shown in fig. 1, the mobile terminal 100 may further include a bluetooth module or the like, which is not described in detail herein.

In order to facilitate understanding of the embodiments of the present application, a communication network system on which the mobile terminal of the present application is based is described below.

Referring to fig. 2, fig. 2 is an architecture diagram of a communication Network system according to an embodiment of the present disclosure, where the communication Network system is an LTE system of a universal mobile telecommunications technology, and the LTE system includes a UE (User Equipment) 201, an E-UTRAN (Evolved UMTS Terrestrial Radio Access Network) 202, an EPC (Evolved Packet Core) 203, and an IP service 204 of an operator, which are in communication connection in sequence.

Optionally, the UE201 may be the terminal 100 described above, and is not described herein again.

The E-UTRAN202 includes eNodeB2021 and other eNodeBs 2022, among others. Alternatively, the eNodeB2021 may be connected with other enodebs 2022 through a backhaul (e.g., X2 interface), the eNodeB2021 is connected to the EPC203, and the eNodeB2021 may provide the UE201 access to the EPC 203.

The EPC203 may include an MME (Mobility Management Entity) 2031, an HSS (Home Subscriber Server) 2032, other MMEs 2033, an SGW (Serving gateway) 2034, a PGW (PDN gateway) 2035, and a PCRF (Policy and Charging Rules Function) 2036, and the like. Optionally, the MME2031 is a control node that handles signaling between the UE201 and the EPC203, providing bearer and connection management. HSS2032 is used to provide registers to manage functions such as home location register (not shown) and holds subscriber specific information about service characteristics, data rates, etc. All user data may be sent through SGW2034, PGW2035 may provide IP address assignment for UE201 and other functions, and PCRF2036 is a policy and charging control policy decision point for traffic data flow and IP bearer resources, which selects and provides available policy and charging control decisions for a policy and charging enforcement function (not shown).

The IP services 204 may include the internet, intranets, IMS (IP Multimedia Subsystem), or other IP services, among others.

Although the LTE system is described as an example, it should be understood by those skilled in the art that the present application is not limited to the LTE system, but may also be applied to other wireless communication systems, such as GSM, CDMA2000, WCDMA, TD-SCDMA, and future new network systems.

Based on the above mobile terminal hardware structure and communication network system, various embodiments of the present application are provided.

Fig. 3 is a schematic flowchart of an interaction method provided in an embodiment of the present application, where the interaction method provided in the present application may be applied to a mobile terminal or a server such as a mobile phone, a tablet, a watch, and the like. As shown in fig. 3, the interaction method of the present application may include the following steps:

step S301: acquiring portrait information of an information receiver;

optionally, the portrait information of the information receiver is obtained according to information of both the information receiver and the information transmitter, and optionally, the information of both the information receiver and the information transmitter includes at least one of a label setting of the information receiver to the information transmitter, a nickname remark between the information receiver and the information transmitter, chat content of the information receiver and the information transmitter, and personal setting information.

Optionally, the personal setting information includes alarm clock setting, mode (vibration, ring, mute) setting, and the like, and optionally, work and rest time and work time of the information receiver can be obtained according to the alarm clock setting information, so as to obtain portrait information of the information receiver corresponding to different time periods; similarly, the portrait information of the corresponding information receiver in different modes can be obtained according to the vibration, the ring and the mute setting information, for example, in the vibration mode, the portrait information of the information receiver is in a working state or avoids a noisy state.

Optionally, the portrait information of the information receiver includes at least one of sex, age, usage habit, relationship with the information sender, usage scenario of performing operation.

Optionally, the usage scenario for performing the operation includes: at least one of work, driving, and rest.

The specific implementation mode can be as follows: 1) implemented using Natural Language Processing (NLP) text classifiers (or rule matching); 2) the method is realized by using a pre-trained Natural Language Processing (NLP) classification model for judging the relation between two chat parties according to the chat content, wherein the model takes the chat content as input, and takes the relation labels of an information sender and an information receiver, the characters of a program user and the like as output to obtain the portrait information of the information receiver.

Optionally, the portrait information of the information receiver may be updated at an update frequency, optionally, the update frequency includes a fixed frequency update or an acquisition process real-time update.

Alternatively, each time the obtained portrait information of the information receiver is saved, a portrait information database of the information receiver is established, and the portrait information in the database is comprehensively analyzed to form the portrait information of the information receiver with multiple layers in a structured manner.

Step S302: acquiring preference information of the information receiver according to the portrait information;

optionally, the preference information includes at least one of receiving the information in text form, receiving the information in speech form.

Optionally, if the portrait information of the information receiver is old people, the acquired preference information is received in a voice form; if the portrait information of the information receiver is a young person, the acquired preference information is received in a text form; if the portrait information of the information receiver is a working scene, the acquired preference information is received in a text form; if the portrait information of the information receiver is the habit voice communication, the acquired preference information is the received information in the voice form.

In other embodiments, if the information receiver is in a state of avoiding noise, including a working state, a learning state, and the like, when the received information is the voice information, the voice information is converted into corresponding text information and then is sent or displayed; and/or if the information receiving party is in an eye using state including a driving state, a motion state and the like, when the received information is text information, converting the text information into corresponding voice information and then sending or displaying the voice information.

Optionally, the working state may be obtained according to a set working day or working time period; the learning state can be obtained according to the formulated learning plan; the driving state can be obtained according to the connection with the vehicle-mounted Bluetooth; the motion state can be acquired according to the information of the acceleration sensor.

Alternatively, if the preference information of a plurality of information receivers can be obtained according to the portrait information of the information receivers, the preference information of the information receivers is determined according to the following priority: the priority of the operation is from high to low from front to back.

Optionally, if the portrait information of the information receiver includes portrait information such as a work scene and a habit voice communication, the acquired preference information is received in a text form determined according to the work scene. If the portrait information of the information receiver includes habit text communication, portrait information of old people and the like, the acquired preference information is text-form received information determined according to the preference text communication.

Step S303: and executing the operation corresponding to the preference information.

Optionally, if the preference information is received in a text form, the text information is directly sent or displayed when the sent information or the received information is the text information; and/or

If the preference information is received in a text form, when the sending information or the receiving information is voice information, the voice information is converted into corresponding text information and then sent or displayed; and/or

If the preference information is received in a voice form, when the sent information or the received information is text information, the text information is converted into corresponding voice information and then is sent or displayed; and/or

If the preference information is the information received in the voice form, the voice information is directly sent or displayed when the sending information or the receiving information is the voice information.

Optionally, the converting the text information into corresponding voice information and then sending or displaying the corresponding voice information includes: and selecting corresponding volume to perform voice broadcast according to the portrait information of the information receiver. For example, if the portrait information of the information receiver is old people, the voice broadcast is performed by selecting a volume suitable for the hearing of the old people.

Optionally, the present application further provides an interactive method applied to a system, where the system includes at least one receiving end, at least one sending end, and at least one server, and the method includes the following steps:

the server receives the interactive information sent by the sending end;

and the receiving end displays the converted text or voice information.

Optionally, the interaction method provided by the application can also actively recommend the text-to-speech interconversion function according to the portrait information of the information receiver, and the information receiver determines whether to use the text-to-speech interconversion function.

Fig. 4 is a schematic specific flowchart of an interaction method provided in the second embodiment of the present application. As shown in fig. 4, the interaction method provided by the present application includes the following steps:

step S401: acquiring preference information according to portrait information of an information receiver;

step S402: judging the preference information as text-form received information;

step S403: judging whether the received information is text information or not;

if the received information is text information, step S404 is performed: directly sending or displaying the text information;

if the received information is voice information, step S405 is performed: and converting the voice information into corresponding text information and then sending or displaying the text information.

Optionally, step S401 and step S402 are performed in step S302, which is not described herein again; step S403 is realized by identifying the information format, and if the identification result of the information format is a text format, the text information is obtained; the conversion in step S405 is realized by a voice-to-text function.

Optionally, the voice information is converted into corresponding text information and then transmitted or displayed, including synchronous transmission or display of the voice information before conversion and the corresponding text information after conversion.

Fig. 5 is a schematic specific flowchart of an interaction method provided in the third embodiment of the present application. As shown in fig. 5, the interaction method provided by the present application includes the following steps:

step S501: acquiring preference information according to portrait information of an information receiver;

step S502: judging the preference information as receiving information in a voice form;

step S503: judging whether the received information is voice information;

if the received information is voice information, step S504 is performed: directly sending or displaying the voice information;

if the received information is text information, step S505 is performed: and converting the text information into corresponding voice information and then sending or displaying the voice information.

Optionally, step S501 and step S502 synchronize step S302, which is not described herein again; step S503 is realized by identifying the information format, and if the identification result of the information format is an audio format, the information is voice information; the conversion in step S505 is realized by a text-to-speech function.

Optionally, the text information is converted into corresponding voice information and then transmitted or displayed, including synchronous transmission or display of the text information before conversion and the corresponding voice information after conversion.

Alternatively, in step S505: before the text information is converted into corresponding voice information and then sent or displayed, the method comprises the following steps: acquiring tone preference information according to portrait information of an information receiver; and performing tone cloning and voice synthesis according to the tone preference information. This is explained with reference to fig. 6.

Fig. 6 is a schematic flowchart of an interaction method provided in the fourth embodiment of the present application. As shown in fig. 6, the interaction method provided by the present application includes the following steps:

s601: acquiring tone preference information according to the portrait information of the information receiver;

optionally, the tone color preference information includes at least one of voice broadcast in a tone color of a sender of the information, and voice broadcast in a tone color preferred by an individual.

Optionally, the personal favorite timbre includes at least one of a star timbre and a cartoon timbre.

Optionally, if the portrait information of the information receiver is that an old person receives text information of children, the obtained tone preference information is that the voice broadcast is carried out by using the tone of the children; and if the image information of the information receiver is that the fan users receive the text information at the star fan group, the acquired sound color preference information is that the voice broadcast is carried out by using the sound color of the star.

S602: and performing tone cloning and voice synthesis according to the tone preference information.

Optionally, taking voiceprint mosaic timbre cloning as an example, the principle of timbre cloning is to input the voice frequency of an arbitrary target speaker into a voiceprint model (e.g., an X-vector model of Kaldi), obtain a voiceprint mosaic vector of the target speaker, and input the voiceprint mosaic vector as speaker timbre information into a model trained on large-scale multi-speaker data in advance, so as to obtain a voice synthesis model of the target speaker.

Fig. 7 is a schematic structural diagram of a mobile terminal according to a fifth embodiment of the present application. As shown in fig. 7, the mobile terminal of this embodiment includes: a processor 310, a memory 311, and a computer program 312 stored in the memory 311 and executable on the processor 310. The processor 310 implements the steps in the various interactive method embodiments described above when executing the computer program 312.

Illustratively, the computer program 312 may be partitioned into one or more modules/units that are stored in the memory 311 and executed by the processor 310 to accomplish the present application. The one or more modules/units may be a series of computer program instruction segments capable of performing specific functions, which are used to describe the execution of the computer program 312 in the mobile terminal.

The mobile terminal can be a desktop computer, a notebook, a palm computer, a server and other computing equipment. The mobile terminal may include, but is not limited to, a processor 310, a memory 311. Those skilled in the art will appreciate that fig. 7 is only an example of the mobile terminal and is not intended to be limiting, and that the mobile terminal may include more or less components than those shown, or some components may be combined, or different components, for example, the mobile terminal may also include input and output devices, network access devices, buses, etc.

The Processor 310 may be a Central Processing Unit (CPU), other general purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), an off-the-shelf Programmable Gate Array (FPGA) or other Programmable logic device, discrete Gate or transistor logic, discrete hardware components, etc. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.

The storage 311 may be an internal storage unit of the mobile terminal, such as a hard disk or a memory of the mobile terminal. The memory 311 may also be an external storage device of the mobile terminal, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), and the like, which are provided on the mobile terminal. Further, the memory 311 may also include both an internal storage unit and an external storage device of the mobile terminal. The memory 311 is used for storing the computer program and other programs and data required by the mobile terminal. The memory 311 may also be used to temporarily store data that has been output or is to be output.

Embodiments of the present application also provide a computer program product, which includes computer program code, when the computer program code runs on a computer, the computer is caused to execute the method in the above various possible embodiments.

Embodiments of the present application further provide a chip, which includes a memory and a processor, where the memory is used to store a computer program, and the processor is used to call and run the computer program from the memory, so that a device in which the chip is installed executes the method in the above various possible embodiments.

Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner. Based on such understanding, the technical solutions of the present application may be embodied in the form of a software product, which is stored in a storage medium (e.g., ROM/RAM, magnetic disk, optical disk) and includes instructions for enabling a terminal device (e.g., a mobile phone, a computer, a server, a controlled terminal, or a network device) to execute the method of each embodiment of the present application.

The above description is only a preferred embodiment of the present application, and not intended to limit the scope of the present application, and all modifications of equivalent structures and equivalent processes, which are made by the contents of the specification and the drawings of the present application, or which are directly or indirectly applied to other related technical fields, are included in the scope of the present application.

Claims

1. An interactive method, comprising the steps of:

acquiring portrait information of an information receiver;

acquiring preference information of the information receiver according to the portrait information;

and executing the operation corresponding to the preference information.

2. The interaction method as claimed in claim 1 to, wherein the obtaining of the portrait information of the information receiver comprises at least one of:

the portrait information of the information receiver is obtained according to the information of both the information transceiver and the information receiver;

the portrait information of the information receiver is updated according to the updating frequency;

3. An interaction method according to claim 1 or 2, comprising at least one of:

the portrait information of the information receiver comprises at least one of sex, age, use habit, relationship with the information sender and use scene of executing operation;

the preference information of the information receiver includes at least one of receiving information in a text form and receiving information in a voice form.

4. The interaction method of claim 3, wherein the performing the operation corresponding to the preference information comprises at least one of:

5. The interactive method of claim 4, wherein before the step of transmitting or presenting after converting the text information into corresponding voice information, comprising:

6. The interactive method of claim 5, wherein the tone color preference information includes at least one of voice broadcast in a tone color of a sender of the information, and voice broadcast in a favorite tone color of an individual.

7. The interaction method of any one of claims 4 to 6, comprising at least one of:

the step of converting the voice information into corresponding text information and then sending or displaying the corresponding text information comprises the step of synchronously sending or displaying the voice information before conversion and the corresponding text information after conversion;

and the step of converting the text information into corresponding voice information and then sending or displaying the corresponding voice information comprises the step of synchronously sending or displaying the text information before conversion and the corresponding voice information after conversion.

8. The interactive method of claim 7, wherein said converting said text message into corresponding voice message for transmission or presentation comprises: and selecting corresponding volume to perform voice broadcast according to the portrait information of the information receiver.

9. A mobile terminal comprising a memory, a processor and a computer program stored in the memory and executable on the processor, characterized in that the processor implements the steps of the interaction method according to any of claims 1 to 8 when executing the computer program.

10. A computer-readable storage medium, in which a computer program is stored which, when being executed by a processor, carries out the steps of the interaction method according to any one of claims 1 to 8.