CN109286726B

CN109286726B - Content display method and terminal equipment

Info

Publication number: CN109286726B
Application number: CN201811252706.1A
Authority: CN
Inventors: 刘奔
Original assignee: Vivo Mobile Communication Co Ltd
Current assignee: Vivo Mobile Communication Co Ltd
Priority date: 2018-10-25
Filing date: 2018-10-25
Publication date: 2021-05-14
Anticipated expiration: 2038-10-25
Also published as: CN109286726A

Abstract

The embodiment of the invention provides a content display method and terminal equipment, relates to the technical field of terminals, and aims to solve the problem that the display effect is poor due to the fact that the existing terminal equipment displays content according to a system default mode. The method comprises the following steps: acquiring a target voice signal, wherein the target voice signal is a signal input by a user voice; acquiring voice characteristic information of the target voice signal, wherein the voice characteristic information comprises at least one voice characteristic; determining a target display strategy according to the voice feature information, wherein the target display strategy comprises a display mode corresponding to each voice feature; and identifying the content of the target voice signal, and displaying the content of the target voice signal according to the target display strategy. The method can be applied to a voice input scene of the terminal equipment.

Description

Content display method and terminal equipment

Technical Field

The embodiment of the invention relates to the technical field of terminals, in particular to a content display method and terminal equipment.

Background

With the continuous development of terminal technology and internet technology, the application of terminal equipment is more and more extensive, and more users have become accustomed to communicating with others through communication application programs in the terminal equipment.

Currently, when a user uses a communication application program of a terminal device, the user can input the communication application program in a voice input mode. For example, the terminal device may recognize a voice input of the user through a voice recognition technology and display the recognized contents corresponding to the voice input on a display screen of the terminal device.

However, in the content display process based on the voice input, since the terminal device displays according to the style, format, and the like configured by default, the manner in which the terminal device displays the content is monotonous, and the display effect of the content of the terminal device is poor.

Disclosure of Invention

The embodiment of the invention provides a content display method and terminal equipment, and aims to solve the problem that the display effect is poor due to the fact that the existing terminal equipment displays content according to a system default mode.

In order to solve the technical problem, the invention is realized as follows:

in a first aspect, an embodiment of the present invention provides a content display method, which is applied to a terminal device, and the method includes: acquiring a target voice signal, wherein the target voice signal is a signal input by a user voice; acquiring voice characteristic information of a target voice signal, wherein the voice characteristic information comprises at least one voice characteristic; determining a target display strategy according to the voice feature information, wherein the target display strategy comprises a display mode corresponding to each voice feature; and identifying the content of the target voice signal, and displaying the content of the target voice signal according to the target display strategy.

In a second aspect, an embodiment of the present invention provides a terminal device, where the terminal device includes an obtaining module, a determining module, an identifying module, and a displaying module. The acquisition module is used for acquiring a target voice signal and acquiring voice characteristic information of the target voice signal, wherein the target voice signal is a signal input by a user, and the voice characteristic information comprises at least one voice characteristic; the determining module is used for determining a target display strategy according to the voice feature information acquired by the acquiring module, wherein the target display strategy comprises a display mode corresponding to each voice feature; the identification module is used for identifying the content of the target voice signal acquired by the acquisition module; and the display module is used for displaying the content of the target voice signal identified by the identification module according to the target display strategy determined by the determination module.

In a third aspect, an embodiment of the present invention provides a terminal device, where the terminal device includes a processor, a memory, and a computer program stored on the memory and executable on the processor, and the computer program, when executed by the processor, implements the steps of the content display method in the first aspect.

In a fourth aspect, an embodiment of the present invention provides a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the steps of the content display method in the first aspect.

In the embodiment of the present invention, a target speech signal (the target speech signal is a signal input by a user) may be acquired, speech feature information of the target speech signal (the speech feature information includes at least one speech feature) is acquired, a target display policy (the target display policy includes a display manner corresponding to each speech feature) is determined according to the speech feature information, and after the content of the target speech signal is identified, the content of the target speech signal is displayed according to the target display policy. According to the scheme, the target display strategy corresponding to the voice characteristic information can be determined according to the voice characteristic information of the target voice signal so as to be used for displaying the content of the target voice signal, so that the target display strategies determined according to the voice characteristic information of different target voice signals are different. Therefore, the terminal equipment can display the contents of different target voice signals according to different display strategies, so that the content display mode of the terminal equipment is rich, and the content display effect of the terminal equipment is improved.

Drawings

Fig. 1 is a schematic diagram of an architecture of a possible android operating system according to an embodiment of the present invention;

FIG. 2 is a diagram illustrating a content display method according to an embodiment of the present invention;

fig. 3 is one of schematic interfaces of an application of a content display method according to an embodiment of the present invention;

fig. 4 is a second schematic interface diagram of an application of the content display method according to the embodiment of the present invention;

FIG. 5 is a second schematic diagram illustrating a content display method according to an embodiment of the present invention;

fig. 6 is a third schematic diagram illustrating a content display method according to an embodiment of the present invention;

fig. 7 is a schematic structural diagram of a terminal device according to an embodiment of the present invention;

fig. 8 is a hardware schematic diagram of a terminal device according to an embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

The term "and/or" herein is an association relationship describing an associated object, meaning that three relationships may exist, e.g., a and/or B, may mean: a exists alone, A and B exist simultaneously, and B exists alone. The symbol "/" herein denotes a relationship in which the associated object is or, for example, a/B denotes a or B.

The terms "first" and "second," and the like, in the description and in the claims of the present invention are used for distinguishing between different objects and not for describing a particular order of the objects. For example, the first and second font sizes, etc. are used to distinguish different font sizes, rather than to describe a particular order of font sizes.

In the embodiments of the present invention, words such as "exemplary" or "for example" are used to mean serving as examples, illustrations or descriptions. Any embodiment or design described as "exemplary" or "e.g.," an embodiment of the present invention is not necessarily to be construed as preferred or advantageous over other embodiments or designs. Rather, use of the word "exemplary" or "such as" is intended to present concepts related in a concrete fashion.

In the description of the embodiments of the present invention, unless otherwise specified, "a plurality" means two or more, for example, a plurality of processing units means two or more processing units, and the like.

The embodiment of the invention provides a content display method and terminal equipment, which can acquire a target voice signal (the target voice signal is a signal input by a user), acquire voice feature information (the voice feature information comprises at least one voice feature) of the target voice signal, determine a target display strategy (the target display strategy comprises a display mode corresponding to each voice feature) according to the voice feature information, and display the content of the target voice signal according to the target display strategy after identifying the content of the target voice signal. According to the scheme, the target display strategy corresponding to the voice characteristic information can be determined according to the voice characteristic information of the target voice signal so as to be used for displaying the content of the target voice signal, so that the target display strategies determined according to the voice characteristic information of different target voice signals are different. Therefore, the terminal equipment can display the contents of different target voice signals according to different display strategies, so that the content display mode of the terminal equipment is rich, and the content display effect of the terminal equipment is improved.

The terminal device in the embodiment of the present invention may be a terminal device having an operating system. The operating system may be an Android (Android) operating system, an ios operating system, or other possible operating systems, and embodiments of the present invention are not limited in particular.

The following describes a software environment to which the content display method provided by the embodiment of the present invention is applied, by taking an android operating system as an example.

Fig. 1 is a schematic diagram of an architecture of a possible android operating system according to an embodiment of the present invention. In fig. 1, the architecture of the android operating system includes 4 layers, which are respectively: an application layer, an application framework layer, a system runtime layer, and a kernel layer (specifically, a Linux kernel layer).

The application program layer comprises various application programs (including system application programs and third-party application programs) in an android operating system.

The application framework layer is a framework of the application, and a developer can develop some applications based on the application framework layer under the condition of complying with the development principle of the framework of the application.

The system runtime layer includes libraries (also called system libraries) and android operating system runtime environments. The library mainly provides various resources required by the android operating system. The android operating system running environment is used for providing a software environment for the android operating system.

The kernel layer is an operating system layer of an android operating system and belongs to the bottommost layer of an android operating system software layer. The kernel layer provides kernel system services and hardware-related drivers for the android operating system based on the Linux kernel.

Taking an android operating system as an example, in the embodiment of the present invention, a developer may develop a software program for implementing the content display method provided in the embodiment of the present invention based on the system architecture of the android operating system shown in fig. 1, so that the content display method may operate based on the android operating system shown in fig. 1. Namely, the processor or the terminal device can implement the content display method provided by the embodiment of the invention by running the software program in the android operating system.

The terminal equipment in the embodiment of the invention can be a mobile terminal or a non-mobile terminal. For example, the mobile terminal may be a mobile phone, a tablet computer, a notebook computer, a palm top computer, a vehicle-mounted terminal, a wearable device, an ultra-mobile personal computer (UMPC), a netbook or a Personal Digital Assistant (PDA), and the like, and the non-mobile terminal may be a Personal Computer (PC), a Television (TV), a teller machine or a self-service machine, and the like, and the embodiment of the present invention is not particularly limited.

The execution main body of the content display method provided by the embodiment of the present invention may be the terminal device, or may also be a functional module and/or a functional entity capable of implementing the content display method in the terminal device, which may be determined specifically according to actual use requirements, and the embodiment of the present invention is not limited. The following takes a terminal device as an example to exemplarily explain a content display method provided by the embodiment of the present invention.

As shown in fig. 2, an embodiment of the present invention provides a content display method, which may include S200 to S203 described below.

S200, the terminal equipment acquires the target voice signal.

The target voice signal may be a signal input by a user voice.

In the embodiment of the present invention, if the user interacts with the terminal device through voice input, the user may input voice by using an application program (e.g., a voice assistant, a voice input method, etc.) having a voice recognition function in the terminal device, so as to trigger the terminal device to collect a voice signal input by the user, that is, the target voice signal.

In the embodiment of the present invention, the scene in which the user interacts with the terminal device through the voice input may include that the user performs chat in a voice input manner through a communication application program in the terminal device; a user searches in a voice input mode through a browser application program in terminal equipment; and the user interacts with the terminal equipment in a voice input mode through a voice assistant application program in the terminal equipment, and the like.

In the embodiment of the present invention, taking an example that a user and an opposite user perform chat in a voice input manner through a communication application program in a terminal device, a manner for the terminal device to obtain a target voice signal may include the following two possible implementation manners: the terminal device can acquire the target voice signal by collecting the voice signal input by the user, and the terminal device can acquire the target voice signal by receiving the voice signal input by the opposite user.

Optionally, in the embodiment of the present invention, the above S200 may be specifically implemented by the following S200a and S200 b.

S200a, the terminal equipment receives a first input of a user.

The first input is used for triggering the terminal equipment to acquire a voice signal.

S200b, the terminal device responds to the first input, and obtains the target voice signal input by the user.

In this embodiment of the present invention, the first input may include an input of a "voice input" control on an interface (e.g., an interface of a communication application) of the terminal device by a user, and a voice input of the user, which may be determined according to an actual use requirement, and the embodiment of the present invention is not limited.

Referring to fig. 3, a method for acquiring a target voice signal input by a user by a terminal device in response to a first input of the user is exemplarily described with an interface of a communication application as an example.

As shown in fig. 3 (a), a user may input and input voice (i.e., the first input mentioned above) through a "voice input" control on an interface of the communication application program, so as to trigger the terminal device to collect a voice signal. Accordingly, as shown in fig. 3 (b), the terminal device may collect a voice signal input by the user (i.e., a target voice signal) in response to the input, and recognize content corresponding to the target voice signal, and the terminal device may display the content on the interface (e.g., "shopping, i like, hey"). As a further alternative, as shown in (b) of fig. 3, the user may input a "complete" control on the interface of the terminal device to trigger the terminal device to stop collecting the voice signal.

The terminal device can directly receive the voice signal input by the opposite user and takes the received voice signal as the target voice signal. And then the terminal equipment can acquire the voice characteristic information of the target voice signal, determine a target display strategy according to the voice characteristic information, and display the content corresponding to the voice signal according to the target display strategy.

In the content display method provided by the embodiment of the invention, the terminal equipment can respond to the first input of the user and acquire the target voice signal input by the user in real time, so that the convenience of inputting voice by the user through the terminal equipment is improved.

S201, the terminal device obtains voice characteristic information of the target voice signal.

The voice feature information may include at least one voice feature.

Optionally, in an embodiment of the present invention, the at least one voice feature may include at least one of: voice tone information in the target voice signal, voice volume information in the target voice signal, voice speed information in the target voice signal, voice pitch information in the target voice signal, and voice pitch information in the target voice signal.

It is to be understood that the above listed individual phonetic features are exemplary lists, i.e., embodiments of the present invention include, but are not limited to, the above listed individual phonetic features. In practical implementation, the voice feature may further include any other possible voice feature, which may be determined according to practical use requirements, and the embodiment of the present invention is not limited.

In the embodiment of the present invention, the voice mood information in the target voice signal may be used to indicate the emotion expressed by the target voice signal; for example, the tone information may be tone information indicating an emotion such as happiness, anger, sadness, or fear. The voice volume information in the target voice signal can indicate the volume of the target voice signal; illustratively, the voice volume information may include low volume, medium volume, high volume, and the like. The voice speed information in the target voice signal can be used for indicating the speed of the voice speed information of the target voice signal; illustratively, the voice speed information may include slow speed, medium speed, fast speed, and the like. The voice pitch information in the target voice signal can be used for indicating the high and low of the voice vibration frequency; illustratively, the voice pitch information may include low frequency, medium frequency, high frequency, etc., and typically the voice pitch of males (corresponding to low frequency) is lower than the voice pitch of females (corresponding to high frequency). The voice tone information in the target voice signal can be used for indicating the voice-off control of the target voice signal; illustratively, the voice tone information may include yin-level, yang-level, up and down, up, down, and up.

It should be noted that the respective classifications of the voice tone information, the voice volume information, the voice speed information, the voice tone information, and the voice tone information in the target voice signal are exemplary illustrations, which may be determined according to actual usage requirements, and the embodiments of the present invention are not limited thereto.

S202, the terminal equipment determines a target display strategy according to the voice characteristic information.

The target display policy may include a display mode corresponding to each voice feature.

In the embodiment of the invention, after acquiring the target voice signal input by the voice of the user and acquiring the voice feature information of the target voice signal, the terminal equipment can determine the display mode corresponding to each voice feature according to each voice feature in the voice feature information, so as to obtain the target display strategy corresponding to the voice feature information.

Optionally, in this embodiment of the present invention, the target display policy may include at least one of the following: display in a predetermined text font, display in a predetermined text font size, display in a predetermined text color, display in a predetermined text interval, display in a predetermined text stroke width.

It is to be understood that the above listed respective target display policies are exemplary lists, i.e., embodiments of the present invention include, but are not limited to, the above listed respective target display policies. In practical implementation, the voice feature may further include any other possible target display policy, which may be determined according to practical usage requirements, and the embodiment of the present invention is not limited.

Optionally, in the embodiment of the present invention, one voice feature may correspond to a preset display mode. Specifically, any one of the voice characteristics such as the voice mood information in the target voice signal, the voice volume information in the target voice signal, the voice speed information in the target voice signal, the voice tone information in the target voice signal, and the voice tone information in the target voice signal may correspond to any one of the preset display modes such as displaying in a preset character font, displaying in a preset character font size, displaying in a preset character color, displaying in a preset character interval, and displaying in a preset character stroke width. The corresponding relationship between each voice feature and the preset display mode may be determined according to actual use requirements, and the embodiment of the present invention is not limited.

Specifically, in the embodiment of the present invention, the terminal device may store in advance a corresponding relationship between each voice feature in the voice feature information and the preset display mode. In this way, for each voice feature in the voice feature information acquired by the terminal device, the terminal device may determine a display mode corresponding to each acquired voice feature according to the corresponding relationship.

The terminal equipment can store preset voice characteristics. Therefore, the terminal device can compare the voice features in the voice feature information with the corresponding preset voice features after acquiring the voice feature information in the target voice signal, and then the terminal device can determine which display mode to display the content of the target voice signal according to the comparison result.

The following description will exemplarily describe that the voice mood information corresponds to a display mode displayed in a preset text color.

For example, it is assumed that the phonetic mood information in the target speech signal has a corresponding relationship with the display mode displayed in the preset text color. For example, the speech mood information "happy" may correspond to displaying text in orange, the speech mood information "angry" may correspond to displaying text in red, the speech mood information "sad" may correspond to displaying text in gray, and the speech mood information "fear" may correspond to displaying text in blue. Therefore, the terminal equipment can determine which character color to display the characters according to the voice tone information in the voice characteristic information, namely which character color to display the characters corresponding to the content of the target voice signal. Specifically, the terminal device may compare the voice tone information in the target voice signal with preset voice tone information, and determine a corresponding text color according to a comparison result, thereby determining which text color to display the text corresponding to the content of the target voice signal.

Optionally, the correspondence between the voice tone information and the display mode displayed in the preset text color may be set by default in the system, or may be set by user-defined setting, and may specifically be determined according to actual use requirements, which is not limited in the embodiment of the present invention.

It should be noted that, the correspondence between the voice tone information and the display manner displayed in the preset text color is an exemplary illustration, and may be determined specifically according to the actual use requirement, and the embodiment of the present invention is not limited.

The following description will exemplarily describe the voice volume information corresponding to the display mode displayed by the preset character size.

For example, it is assumed that the voice volume information in the target voice signal has a corresponding relationship with the display mode displayed by the preset text font size. For example, "low volume" may correspond to display in a first font size, "medium volume" may correspond to display in a second font size, and "high volume" may correspond to display in a third font size.

Alternatively, "low volume" may correspond to displaying text in a first font size, "medium volume" may correspond to displaying text in a second font size larger than the first font size, and "high volume" may correspond to displaying text in a third font size larger than the second font size. For example, the first, second, and third font sizes may be font size "8", font size "12", and font size "16", respectively. Namely, the larger the voice volume is, the larger the preset character size is, and correspondingly, the larger the characters displayed by the terminal device are. It is understood that the above is exemplified by the case that the larger the voice volume is, the larger the preset character size is. In the specific implementation, the situation that the larger the voice volume is, the smaller the preset character size is also belongs to the protection scope of the embodiment of the invention.

Therefore, the terminal equipment can determine which character font size to display the characters according to the voice volume information in the voice characteristic information, namely which character font size to display the characters corresponding to the content of the target voice signal. Specifically, the terminal device may compare the voice volume in the target voice signal with a preset voice volume threshold, and determine a corresponding text font size according to the comparison result, thereby determining which text font size displays the text corresponding to the content of the target voice signal.

For example, when the voice volume in the target voice signal is equal to a preset voice volume threshold (e.g., 50dB) (i.e., the voice volume information indicates "medium volume"), the terminal device may determine to display the text in a second font size (e.g., font size "12"). When the voice volume in the target voice signal is less than the voice volume threshold (i.e., the voice volume information indicates "low volume"), the terminal device may determine to display the text in a first font size (e.g., font size "8") that is smaller than a second font size. When the voice volume in the target voice signal is greater than the voice volume threshold (i.e., the voice volume information indicates "high volume"), the terminal device may determine to display the text in a third font size (e.g., font size "16") that is larger than the second font size.

Optionally, the correspondence between the voice volume information and the display mode displayed by the preset text font size may be a default setting of the system, or may be set by a user in a user-defined manner, and may specifically be determined according to actual use requirements, which is not limited in the embodiment of the present invention.

It should be noted that the first font size, the second font size and the third font size are exemplary illustrations, and the embodiments of the present invention are not limited to these three font sizes, nor to the size representative values of the three font sizes, and the number and the size representative values of the preset font sizes may be determined according to actual usage requirements, and the embodiments of the present invention are not limited.

The following is an exemplary description of the voice speed information corresponding to a display mode displayed at a preset text interval.

For example, it is assumed that the speech speed information in the target speech signal has a corresponding relationship with a display manner displayed at a preset text interval. For example, "slow" may correspond to display at a first text interval, "medium" may correspond to display at a second text interval, and "fast" may correspond to display at a third text interval.

Alternatively, "slow" may correspond to displaying text at a first text interval, "medium" may correspond to displaying text at a second text interval that is less than the first text interval, and "fast" may correspond to displaying text at a third text interval that is less than the second text interval. For example, the first letter spacing, the second letter spacing, and the third letter spacing may be 1.5 millimeters, 1 millimeter, and 0.5 millimeters, respectively. That is, the larger (i.e., faster) the speech speed is, the smaller the preset text interval is, and accordingly, the text displayed by the terminal device is compact. It is understood that the above is exemplified by the case that the larger the voice speed is, the smaller the preset character interval is. In the specific implementation, the situation that the speech speed is larger and the preset character interval is larger also belongs to the protection scope of the embodiment of the invention.

In this way, the terminal device may determine, according to the voice speed in the voice feature information, at which character interval the characters are displayed, that is, at which character interval the characters corresponding to the content of the target voice signal are displayed. Specifically, the terminal device may compare the speech speed in the target speech signal with a preset speech speed threshold, and determine a corresponding text interval according to the comparison result, thereby determining which text interval displays the text corresponding to the content of the target speech signal.

For example, when the speech speed in the target speech signal is equal to a preset speech speed threshold (e.g., 120 words/min), the terminal device may determine to display the words at a second word interval (e.g., 1 mm). When the speech speed in the target speech signal is less than the speech speed threshold, the terminal device may determine to display text at a first text interval (e.g., 1.5 millimeters) that is greater than a second text interval. When the speech speed in the target speech signal is greater than the speech speed threshold, the terminal device may determine to display the text at a third text interval (e.g., 0.5 mm) that is smaller than the second text interval.

Optionally, the correspondence between the voice speed information and the display mode displayed at the preset text interval may be a default setting of the system, or may be set by a user in a user-defined manner, and may specifically be determined according to actual use requirements, which is not limited in the embodiment of the present invention.

It should be noted that the first text interval, the second text interval, and the third text interval are exemplary illustrations, and the embodiment of the present invention is not limited to these three text intervals, nor to the numerical values of the three text intervals, and the number and the numerical values of the preset text numbers may be determined according to actual use requirements, and the embodiment of the present invention is not limited.

The following also exemplifies a display mode in which the voice tone information corresponds to the preset character stroke width.

For example, it is assumed that the voice pitch information in the target voice signal has a corresponding relationship with the display mode displayed by the preset character stroke width. For example, "low frequency" may correspond to display at a first text stroke width, "medium frequency" may correspond to display at a second text stroke width, and "high frequency" may correspond to display at a third text stroke width.

Alternatively, "low frequency" may correspond to displaying text at a first text stroke width (which may be referred to as "bold"), and "medium frequency" may correspond to displaying text at a second text stroke width (which may be referred to as "bold") that is less than the first text stroke width, and "high frequency" may correspond to displaying text at a third text stroke width (which may be referred to as "bold") that is less than the second text stroke width. For example, the first, second, and third text stroke widths may be 0.8, 0.6, and 0.4 millimeters, respectively. That is, the higher the voice pitch (i.e., the higher the frequency), the smaller the width of the preset character strokes, and accordingly, the slimness of the character strokes displayed by the terminal device. It is understood that the above is exemplified by the case that the higher the voice pitch, the smaller the preset character stroke width. In the concrete implementation, the condition that the width of the stroke of the preset character is larger as the voice tone is higher also belongs to the protection scope of the embodiment of the invention.

Therefore, the terminal equipment can determine which character stroke width to display the characters according to the voice tone in the voice characteristic information, namely which character stroke width to display the characters corresponding to the content of the target voice signal. Specifically, the terminal device may compare the voice tone in the target voice signal with a preset voice tone threshold, and determine a corresponding character stroke width according to the comparison result, thereby determining which character stroke width displays the character corresponding to the content of the target voice signal.

For example, the terminal device may determine to display text at a second text stroke width (e.g., 0.6 mm) when the voice pitch in the target voice signal is equal to a preset voice pitch threshold (e.g., 300 Hz). When the voice pitch in the target voice signal is below the voice pitch threshold, the terminal device may determine to display text at a first text stroke width (e.g., 0.8 millimeters) that is greater than a second text stroke width. When the voice pitch in the target voice signal is above the voice pitch threshold, the terminal device may determine to display the text with a third text stroke width (e.g., 0.4 millimeters) that is smaller than the second text stroke width.

Optionally, the correspondence between the voice tone information and the display mode displayed by the preset character stroke width may be set by default in the system, or may be set by user-defined, and may specifically be determined according to actual use requirements, which is not limited in the embodiment of the present invention.

It should be noted that the first character stroke width, the second character stroke width, and the third character stroke width are exemplary illustrations, and the embodiment of the present invention is not limited to the three character stroke widths, nor to the numerical values of the three character stroke widths, and the number and the numerical values of the preset character stroke widths may be specifically determined according to actual use requirements, and the embodiment of the present invention is not limited.

In the above embodiments, the example is described by taking an example that the voice tone information corresponds to a display mode displayed in a preset text color, the voice volume information corresponds to a display mode displayed in a preset text font size, the voice speed information corresponds to a display mode displayed in a preset text interval, and the voice tone information corresponds to a display mode displayed in a preset text stroke width. It can be understood that, in the embodiment of the present invention, the corresponding relationship between the voice feature in the voice feature information and the preset display mode is not limited to the above case, for example, the voice tone information may correspond to the display mode displayed in the preset text font, the voice tone information may also correspond to the display mode displayed in the preset text font size, the voice volume information may also correspond to the display mode displayed in the preset text color, the voice speed information may also correspond to the display mode displayed in the preset text stroke width, and the voice tone information may also correspond to the display mode displayed in the preset text interval, which may be determined specifically according to the actual use requirement, and the embodiment of the present invention is not limited thereto.

In the embodiment of the present invention, the terminal device may determine, according to each voice feature in the voice feature information, a display mode corresponding to each voice feature. And then, obtaining a target display strategy consisting of display modes corresponding to the voice features.

S203, the terminal equipment identifies the content of the target voice signal and displays the content of the target voice signal according to the target display strategy.

In the embodiment of the present invention, the terminal device may recognize the content of the target voice signal by using a voice recognition technology to obtain the text (i.e., text), and then the terminal device may display the text obtained by recognizing the content of the target voice signal according to the target display policy.

It should be noted that the present embodiment may not limit the execution sequence of the steps of the terminal device identifying the content of the target speech signal in S201 to S202 and S203 described above. That is, in the embodiment of the present invention, S201 to S202 may be executed first, and then the step of the terminal device identifying the content of the target voice signal in S203 may be executed; the step of the terminal device recognizing the content of the target speech signal in S203 may be executed first, and then S201-S202 are executed; the steps of the terminal device recognizing the content of the target voice signal in S201-S202 and S203 may also be performed simultaneously. It is understood that fig. 2 is illustrated by first performing steps S201 to S202 and then performing the step of the terminal device recognizing the content of the target speech signal in S203.

The content display method provided by the embodiment of the invention is exemplarily described below by taking an interface of a communication application as an example with reference to fig. 4.

Fig. 4 (a) is a schematic diagram showing a content display manner on an interface of a communication application in a terminal device. The terminal device can respond to the voice input of the user, recognize the content corresponding to the voice input and display the characters corresponding to the content on the interface. As shown in fig. 4 (a), in such a content display process based on voice input, since the terminal device displays according to the style, format, and the like configured by default in the system, the manner in which the terminal device displays the content is monotonous, so that the terminal device has a poor effect of displaying the content.

Fig. 4 (b) is a schematic diagram illustrating a content display manner on an interface of a communication application in a terminal device according to an embodiment of the present invention. As shown in (b) of fig. 4, if the terminal device determines that the voice speed of the user widget a is less than the preset voice speed threshold, the terminal device may determine that the contents corresponding to the voice input of the user widget a (e.g., "hi, go shopping together on weekends, how") are displayed at a first text interval (e.g., 1.5 mm). If the terminal device determines that the speech speed of the user small B is greater than the preset speech speed threshold, the terminal device may determine that the content (e.g., "good or good") corresponding to the speech signal of the user small B is displayed at a third text interval (e.g., 0.5 mm) smaller than the first text interval.

As shown in fig. 4 (B), if the terminal device determines that the voice volume of the user small B is smaller than the preset voice volume threshold, the terminal device may determine that the content (e.g., "good or good") corresponding to the voice signal of the user small B is displayed with the first font size (e.g., "font size" 8 "). If the terminal device determines that the voice volume of the user bar B is greater than the preset voice volume threshold, the terminal device may determine that the content (e.g., "don't see go") corresponding to the voice signal of the user bar B is displayed in a third font size (e.g., "16") that is larger than the first font size, i.e., in a relatively larger font size.

As shown in (b) in fig. 4, if the terminal device determines that the voice pitch of the user widget a is higher than the preset voice pitch threshold, the terminal device may determine that the content corresponding to the voice signal of the user widget a (e.g., "haha, i.e., the content corresponding to the voice signal of the user widget a) is displayed in a third character stroke width (e.g., 0.4 mm), i.e., in a solid form. If the terminal device determines that the voice pitch of the user widget B is lower than the preset voice pitch threshold, the terminal device may determine that the content (e.g., "don't see here") corresponding to the voice signal of the user widget B is displayed with a first character stroke width (e.g., 0.8 mm) smaller than the third character stroke width, i.e., in bold.

In addition, if the terminal device determines that the voice mood information of the user widget a conforms to the preset voice mood information (e.g., mood indicating "happy"), the terminal device may determine that the content (e.g., "like", "haha") corresponding to the voice signal of the user widget a is displayed in a preset text color (e.g., orange).

Compared with the monotonous effect of the manner of displaying the content shown in (a) in fig. 4, the manner of displaying the content shown in (b) in fig. 4 is rich, the effect of displaying the content by the terminal device is improved, and the interest of the user in using the terminal device is enhanced.

The content display method provided by the embodiment of the present invention may acquire a target voice signal (the target voice signal is a signal input by a user), acquire voice feature information of the target voice signal (the voice feature information includes at least one voice feature), determine a target display policy (the target display policy includes a display manner corresponding to each voice feature) according to the voice feature information, and display the content of the target voice signal according to the target display policy after identifying the content of the target voice signal. According to the scheme, the target display strategy corresponding to the voice characteristic information can be determined according to the voice characteristic information of the target voice signal so as to be used for displaying the content of the target voice signal, so that the target display strategies determined according to the voice characteristic information of different target voice signals are different. Therefore, the terminal equipment can display the contents of different target voice signals according to different display strategies, so that the content display mode of the terminal equipment is rich, and the content display effect of the terminal equipment is improved.

Optionally, in an embodiment of the present invention, the at least one voice feature includes voice mood information in the target voice signal. Accordingly, referring to fig. 2, as shown in fig. 5, the above S201 can be specifically realized by the following S201a and S201 b.

S201a, the terminal device obtains the target voice characteristics of the target voice signal.

The target voice feature is used for indicating voice tone information in the target voice signal.

In the embodiment of the present invention, the target voice feature may include at least one of the following: voice volume information in the target voice signal, voice speed information in the target voice signal, voice pitch information in the target voice signal, and phonetic character information in the target voice signal.

S201b, the terminal device obtains the voice mood in the target voice signal according to the target voice characteristics.

In the embodiment of the invention, the terminal equipment can acquire the target voice feature of the target voice signal after acquiring the target voice signal input by the voice of the user, and then acquire the voice tone information in the target voice signal according to the target voice feature, and further the terminal equipment can determine the display mode corresponding to the voice tone information according to the voice tone information in the target voice signal.

Alternatively, with reference to fig. 5, as shown in fig. 6, the above S201b may be specifically implemented by the following S201b1 and S201b 2.

S201b1, the terminal device determines a preset voice feature range corresponding to the target voice feature according to the target voice feature.

S201b2, the terminal device determines the preset speech mood information corresponding to the preset speech feature range as the speech mood information in the target speech signal.

In the embodiment of the present invention, as described above, the speech mood information in the target speech signal may be mood information indicating emotions such as "happy", "angry", "sad" or "fear", and these speech mood information may be determined by at least one of the following speech characteristics: voice volume in the target voice signal, voice speed information in the target voice signal, voice pitch information in the target voice signal, and phonetic character information in the target voice signal. Accordingly, the terminal device may determine a preset voice feature range, such as a "happy" voice feature range, an "angry" voice feature range, a "sad" voice feature range, or a "fear" voice feature range, according to at least one of the voice features.

For example, assuming that the target speech feature is speech volume information, speech speed information, speech tone information, and speech character information in the target speech signal, the terminal device may determine the "anger" speech feature range according to the speech volume information, speech speed information, speech tone information, and speech character information in the target speech signal. For example, if the terminal device determines that the voice volume information in the target voice signal is "high volume", the voice speed information is "fast", the voice tone information is "fair (i.e., rising tone), and the voice character information matches the preset character information, the terminal device may determine, according to these target voice features, that the preset voice feature range corresponding to the target voice feature is an" angry "voice feature range, and may further determine that the voice tone information" angry "corresponding to the" angry "voice feature range is the voice tone information in the target voice signal, whereby the terminal device acquires the voice tone information in the target voice signal.

As another example, assuming that the target voice feature is voice volume information, voice speed information, voice tone information, and voice character information in the target voice signal, the terminal device may determine the "sad" voice feature range according to the target voice feature in the target voice signal. For example, if the terminal device determines that the voice volume information in the target voice signal is "low volume", the voice speed information is "slow speed", the voice tone information is voice-off (i.e., tone-down), and the voice character information conforms to the preset character information, the terminal device may determine, according to the target voice features, that the preset voice feature range corresponding to the target voice features is a "sad" voice feature range, and further may determine that the voice tone information "sad" corresponding to the "sad" voice feature range is the voice tone information in the target voice signal, so that the terminal device obtains the voice tone information in the target voice signal.

It is to be understood that the above target speech features are all exemplary lists, i.e., the embodiments of the present invention include, but are not limited to, the above listed target speech features. In practical implementation, the target speech feature may further include any other possible speech feature, which may be determined according to practical use requirements, and the embodiment of the present invention is not limited.

In the content display method provided by the embodiment of the invention, the terminal equipment can determine the voice tone information in the target voice signal based on different voice characteristics in the target voice signal, so that the accuracy of determining the voice tone information by the terminal equipment can be improved.

As shown in fig. 7, an embodiment of the present invention provides a terminal device 700, where the terminal device 700 may include an obtaining module 701, a determining module 702, an identifying module 703, and a displaying module 704.

An obtaining module 701, configured to obtain a target speech signal, and obtain speech feature information of the target speech signal, where the target speech signal is a signal input by a user, and the speech feature information includes at least one speech feature; a determining module 702, configured to determine a target display policy according to the voice feature information acquired by the acquiring module 702, where the target display policy includes a display manner corresponding to each voice feature; an identifying module 703, configured to identify content of the target speech signal acquired by the acquiring module 701; a display module 704, configured to display the content of the target speech signal identified by the identification module 703 according to the target display policy determined by the determination module 702.

Optionally, in an embodiment of the present invention, the at least one voice feature may include at least one of: speech mood information in the target speech signal, speech volume information in the target speech signal, speech speed information in the target speech signal, speech pitch information in the target speech signal.

Optionally, in an embodiment of the present invention, the at least one voice feature may include voice mood information in the target voice signal. Correspondingly, the obtaining module 701 is specifically configured to obtain a target voice feature of a target voice signal, determine a preset voice feature range corresponding to the target voice feature according to the target voice feature, and determine preset voice mood information corresponding to the preset voice feature range as voice mood information in the target voice signal, where the target voice feature is used to indicate the voice mood information in the target voice signal.

Optionally, in this embodiment of the present invention, the obtaining module 701 is specifically configured to receive a first input of a user, and obtain, in response to the first input, a target voice signal input by the user, where the first input is used to trigger the terminal device 700 to obtain the voice signal.

The terminal device provided by the embodiment of the present invention can implement each process implemented by the terminal device in the above method embodiments, and is not described here again to avoid repetition.

The terminal device provided in the embodiment of the present invention may acquire a target speech signal (the target speech signal is a signal input by a user), acquire speech feature information (the speech feature information includes at least one speech feature) of the target speech signal, determine a target display policy (the target display policy includes a display manner corresponding to each speech feature) according to the speech feature information, and display the content of the target speech signal according to the target display policy after recognizing the content of the target speech signal. According to the scheme, the target display strategy corresponding to the voice characteristic information can be determined according to the voice characteristic information of the target voice signal so as to be used for displaying the content of the target voice signal, so that the target display strategies determined according to the voice characteristic information of different target voice signals are different. Therefore, the terminal equipment can display the contents of different target voice signals according to different display strategies, so that the content display mode of the terminal equipment is rich, and the content display effect of the terminal equipment is improved.

Fig. 8 is a schematic diagram of a hardware structure of a terminal device for implementing various embodiments of the present invention. As shown in fig. 8, the terminal device 800 includes but is not limited to: a radio frequency unit 801, a network module 802, an audio output unit 803, an input unit 804, a sensor 805, a display unit 806, a user input unit 807, an interface unit 808, a memory 809, a processor 810, and a power supply 811. Those skilled in the art will appreciate that the terminal device configuration shown in fig. 8 does not constitute a limitation of the terminal device, and that the terminal device may include more or fewer components than shown, or combine certain components, or a different arrangement of components. In the embodiment of the present invention, the terminal device includes, but is not limited to, a mobile phone, a tablet computer, a notebook computer, a palm computer, a vehicle-mounted terminal, a wearable device, a pedometer, and the like.

The input unit 804 is used for acquiring a target voice signal input by a user; a processor 810, configured to obtain voice feature information of a target voice signal acquired by the user input unit 807, determine a target display policy according to the voice feature information, and identify content of the target voice signal acquired by the user input unit 807, where the voice feature information includes at least one voice feature, and the target display policy includes a display manner corresponding to each voice feature; and a display unit 806, configured to display the content of the target speech signal identified by the processor 810 according to the target display policy determined by the processor 810.

The embodiment of the invention provides a terminal device, which can acquire a target voice signal (the target voice signal is a signal input by a user) and voice feature information (the voice feature information comprises at least one voice feature) of the target voice signal, determine a target display strategy (the target display strategy comprises a display mode corresponding to each voice feature) according to the voice feature information, and display the content of the target voice signal according to the target display strategy after identifying the content of the target voice signal. According to the scheme, the target display strategy corresponding to the voice characteristic information can be determined according to the voice characteristic information of the target voice signal so as to be used for displaying the content of the target voice signal, so that the target display strategies determined according to the voice characteristic information of different target voice signals are different. Therefore, the terminal equipment can display the contents of different target voice signals according to different display strategies, so that the content display mode of the terminal equipment is rich, and the content display effect of the terminal equipment is improved.

It should be understood that, in the embodiment of the present invention, the radio frequency unit 801 may be used for receiving and sending signals during a message sending and receiving process or a call process, and specifically, receives downlink data from a base station and then processes the received downlink data to the processor 810; in addition, the uplink data is transmitted to the base station. In general, radio frequency unit 801 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low noise amplifier, a duplexer, and the like. Further, the radio frequency unit 801 can also communicate with a network and other devices through a wireless communication system.

The terminal device 800 provides the user with wireless broadband internet access through the network module 802, such as helping the user send and receive e-mails, browse webpages, access streaming media, and the like.

The audio output unit 803 may convert audio data received by the radio frequency unit 801 or the network module 802 or stored in the memory 809 into an audio signal and output as sound. Also, the audio output unit 803 may also provide audio output related to a specific function performed by the terminal apparatus 800 (e.g., a call signal reception sound, a message reception sound, etc.). The audio output unit 803 includes a speaker, a buzzer, a receiver, and the like.

The input unit 804 is used for receiving an audio or video signal. The input unit 804 may include a Graphics Processing Unit (GPU) 8041 and a microphone 8042, and the graphics processor 8041 processes image data of a still picture or video obtained by an image capturing apparatus (e.g., a camera) in a video capturing mode or an image capturing mode. The processed image frames may be displayed on the display unit 806. The image frames processed by the graphics processor 8041 may be stored in the memory 809 (or other storage medium) or transmitted via the radio frequency unit 801 or the network module 802. The microphone 8042 can receive sound, and can process such sound into audio data. The processed audio data may be converted into a format output transmittable to a mobile communication base station via the radio frequency unit 801 in case of a phone call mode.

The terminal device 800 also includes at least one sensor 805, such as light sensors, motion sensors, and other sensors. Specifically, the light sensor includes an ambient light sensor that can adjust the brightness of the display panel 8061 according to the brightness of ambient light, and a proximity sensor that can turn off the display panel 8061 and/or the backlight when the terminal device 800 moves to the ear. As one of the motion sensors, the accelerometer sensor can detect the magnitude of acceleration in each direction (generally three axes), detect the magnitude and direction of gravity when stationary, and can be used to identify the terminal device posture (such as horizontal and vertical screen switching, related games, magnetometer posture calibration), vibration identification related functions (such as pedometer, tapping), and the like; the sensors 805 may also include fingerprint sensors, pressure sensors, iris sensors, molecular sensors, gyroscopes, barometers, hygrometers, thermometers, infrared sensors, etc., which are not described in detail herein.

The display unit 806 is used to display information input by the user or information provided to the user. The display unit 806 may include a display panel 8061, and the display panel 8061 may be configured in the form of a Liquid Crystal Display (LCD), an organic light-emitting diode (OLED), or the like.

The user input unit 807 is operable to receive input numeric or character information and generate key signal inputs related to user settings and function control of the terminal device. Specifically, the user input unit 807 includes a touch panel 8071 and other input devices 8072. The touch panel 8071, also referred to as a touch screen, may collect touch operations by a user on or near the touch panel 8071 (e.g., operations by a user on or near the touch panel 8071 using a finger, a stylus, or any other suitable object or accessory). The touch panel 8071 may include two portions of a touch detection device and a touch controller. The touch detection device detects the touch direction of a user, detects a signal brought by touch operation and transmits the signal to the touch controller; the touch controller receives touch information from the touch sensing device, converts the touch information into touch point coordinates, sends the touch point coordinates to the processor 810, receives a command from the processor 810, and executes the command. In addition, the touch panel 8071 can be implemented by various types such as a resistive type, a capacitive type, an infrared ray, and a surface acoustic wave. In addition to the touch panel 8071, the user input unit 807 can include other input devices 8072. In particular, other input devices 8072 may include, but are not limited to, a physical keyboard, function keys (e.g., volume control keys, switch keys, etc.), a trackball, a mouse, and a joystick, which are not described in detail herein.

Further, the touch panel 8071 can be overlaid on the display panel 8061, and when the touch panel 8071 detects a touch operation on or near the touch panel 8071, the touch operation is transmitted to the processor 810 to determine the type of the touch event, and then the processor 810 provides a corresponding visual output on the display panel 8061 according to the type of the touch event. Although in fig. 8, the touch panel 8071 and the display panel 8061 are two independent components to implement the input and output functions of the terminal device, in some embodiments, the touch panel 8071 and the display panel 8061 may be integrated to implement the input and output functions of the terminal device, and this is not limited herein.

The interface unit 808 is an interface for connecting an external device to the terminal apparatus 800. For example, the external device may include a wired or wireless headset port, an external power supply (or battery charger) port, a wired or wireless data port, a memory card port, a port for connecting a device having an identification module, an audio input/output (I/O) port, a video I/O port, an earphone port, and the like. The interface unit 808 may be used to receive input (e.g., data information, power, etc.) from an external device and transmit the received input to one or more elements within the terminal apparatus 800 or may be used to transmit data between the terminal apparatus 800 and an external device.

The memory 809 may be used to store software programs as well as various data. The memory 809 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required by at least one function (such as a sound playing function, an image playing function, etc.), and the like; the storage data area may store data (such as audio data, a phonebook, etc.) created according to the use of the cellular phone, and the like. Further, the memory 809 can include high speed random access memory, and can also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device.

The processor 810 is a control center of the terminal device, connects various parts of the whole terminal device by using various interfaces and lines, and performs various functions of the terminal device and processes data by running or executing software programs and/or modules stored in the memory 809 and calling data stored in the memory 809, thereby performing overall monitoring of the terminal device. Processor 810 may include one or more processing units; optionally, the processor 810 may integrate an application processor and a modem processor, wherein the application processor mainly handles operating systems, user interfaces, application programs, and the like, and the modem processor mainly handles wireless communications. It will be appreciated that the modem processor described above may not be integrated into processor 810.

Terminal device 800 may also include a power supply 811 (e.g., a battery) for powering the various components, and optionally, power supply 811 may be logically coupled to processor 810 via a power management system to manage charging, discharging, and power consumption management functions via the power management system.

In addition, the terminal device 800 includes some functional modules that are not shown, and are not described in detail here.

Optionally, an embodiment of the present invention further provides a terminal device, which includes the processor 810 shown in fig. 8, a memory 809, and a computer program stored in the memory 809 and capable of running on the processor 810, where the computer program, when executed by the processor 810, implements each process of the foregoing content display method embodiment, and can achieve the same technical effect, and details are not described here to avoid repetition.

The embodiment of the present invention further provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the computer program implements each process of the content display method embodiment, and can achieve the same technical effect, and in order to avoid repetition, details are not repeated here. The computer-readable storage medium may include a read-only memory (ROM), a Random Access Memory (RAM), a magnetic or optical disk, and the like.

It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.

Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner. Based on such understanding, the technical solution of the present invention or portions thereof contributing to the prior art may be embodied in the form of a software product, which is stored in a storage medium (such as ROM/RAM, magnetic disk, optical disk) and includes instructions for enabling a terminal device (such as a mobile phone, a computer, a server, an air conditioner, or a network device) to execute the method disclosed in the embodiments of the present invention.

While the present invention has been described with reference to the embodiments shown in the drawings, the present invention is not limited to the embodiments, which are illustrative and not restrictive, and it will be apparent to those skilled in the art that various changes and modifications can be made therein without departing from the spirit and scope of the invention as defined in the appended claims.

Claims

1. A content display method is applied to terminal equipment, and is characterized by comprising the following steps:

acquiring a target voice signal, wherein the target voice signal is a signal input by a user voice;

acquiring voice feature information of a target voice signal, wherein the voice feature information comprises at least one voice feature, the at least one voice feature comprises voice mood information in the target voice signal, and each voice feature corresponds to a display mode respectively;

determining a target display strategy according to the voice feature information, wherein the target display strategy comprises all display modes corresponding to the at least one voice feature;

identifying the content of the target voice signal to obtain a text, and displaying the text obtained by the content of the target voice signal according to the target display strategy;

the acquiring of the voice feature information of the target voice signal includes:

determining a preset voice feature range corresponding to the target voice feature according to the target voice feature of the target voice signal;

and determining preset voice tone information corresponding to the preset voice characteristic range as the voice tone information in the target voice signal.

2. The method of claim 1, wherein the at least one speech feature further comprises at least one of:

voice volume information in the target voice signal, voice speed information in the target voice signal, voice tone information in the target voice signal, and voice tone information in the target voice signal.

3. The method of claim 1 or 2, wherein the target display policy comprises at least one of:

display in a predetermined text font, display in a predetermined text font size, display in a predetermined text color, display in a predetermined text interval, display in a predetermined text stroke width.

4. The method of claim 1,

the acquiring of the voice feature information of the target voice signal further includes:

and acquiring a target voice characteristic of the target voice signal, wherein the target voice characteristic is used for indicating voice tone information in the target voice signal.

5. The method of claim 1, wherein the obtaining the target speech signal comprises:

receiving a first input of a user, wherein the first input is used for triggering the terminal equipment to acquire a voice signal;

and responding to the first input, and acquiring the target voice signal input by a user.

6. The terminal equipment is characterized by comprising an acquisition module, a determination module, an identification module and a display module;

the acquisition module is used for acquiring a target voice signal and acquiring voice feature information of the target voice signal, wherein the target voice signal is a signal input by a user, the voice feature information comprises at least one voice feature, the at least one voice feature comprises voice tone information in the target voice signal, and each voice feature corresponds to a display mode;

the determining module is configured to determine a target display policy according to the voice feature information acquired by the acquiring module, where the target display policy includes all display modes corresponding to the at least one voice feature;

the identification module is used for identifying the content of the target voice signal acquired by the acquisition module to obtain a text;

the display module is configured to display a text obtained from the content of the target speech signal identified by the identification module according to the target display policy determined by the determination module;

the acquisition module is specifically configured to determine a preset voice feature range corresponding to the target voice feature according to the target voice feature of the target voice signal, and determine preset voice tone information corresponding to the preset voice feature range as the voice tone information in the target voice signal.

7. The terminal device of claim 6, wherein the at least one voice characteristic further comprises at least one of:

8. The terminal device according to claim 6 or 7, wherein the target display policy comprises at least one of:

9. The terminal device of claim 6,

the obtaining module is specifically configured to obtain a target voice feature of the target voice signal, where the target voice feature is used to indicate voice mood information in the target voice signal.

10. The terminal device according to claim 6, wherein the obtaining module is specifically configured to receive a first input from a user, and obtain the target voice signal input by the user in response to the first input, where the first input is used to trigger the terminal device to obtain a voice signal.

11. A terminal device, comprising a processor, a memory and a computer program stored on the memory and executable on the processor, the computer program, when executed by the processor, implementing the steps of the content display method according to any one of claims 1 to 5.