CN112614481A - Voice tone customization method and system for automobile prompt tone - Google Patents

Voice tone customization method and system for automobile prompt tone Download PDF

Info

Publication number
CN112614481A
CN112614481A CN202011443075.9A CN202011443075A CN112614481A CN 112614481 A CN112614481 A CN 112614481A CN 202011443075 A CN202011443075 A CN 202011443075A CN 112614481 A CN112614481 A CN 112614481A
Authority
CN
China
Prior art keywords
tone
voice
data
sound
prompt
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011443075.9A
Other languages
Chinese (zh)
Inventor
李俊杰
辛慧玉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Hozon New Energy Automobile Co Ltd
Original Assignee
Zhejiang Hozon New Energy Automobile Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang Hozon New Energy Automobile Co Ltd filed Critical Zhejiang Hozon New Energy Automobile Co Ltd
Priority to CN202011443075.9A priority Critical patent/CN112614481A/en
Publication of CN112614481A publication Critical patent/CN112614481A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)

Abstract

The invention relates to the technical field of vehicle voice, in particular to a voice tone customization method and system of automobile prompt tones. The invention provides a method for customizing the voice tone of an automobile prompt tone, which comprises the following steps: step S1, inputting sound with appointed tone; step S2, storing input sound data; step S3, extracting the tone of the input voice data, synthesizing the extracted tone with the original voice prompt tone data, and generating the customized voice prompt tone data corresponding to the designated tone; and step S4, storing and outputting the customized voice prompt tone data. According to the method and the system for customizing the voice tone of the automobile prompt tone, the favorite sound is input by the user, the tone of the sound is simulated to perform subsequent TTS voice broadcast prompt, the structure is simple, the design is ingenious, the scientific and technological sense brought by voice interaction can be realized, the operation attribute of the traditional voice broadcast can be realized, and the affinity and the individuality of the automobile in driving are greatly improved.

Description

Voice tone customization method and system for automobile prompt tone
Technical Field
The invention relates to the technical field of vehicle voice, in particular to a voice tone customization method and system of automobile prompt tones.
Background
The vehicle-mounted voice control system is a novel product system which is popular in recent years and is used for replacing a traditional in-vehicle control system.
The vehicle-mounted voice control system can realize multiple functions which cannot be realized by interaction modes such as traditional entity keys and the like in a simpler interaction mode by means of a voice control mode of software, and improves the scientific and technological feeling and the luxury feeling of the vehicle.
However, in the existing typical vehicle-mounted voice control system, the corresponding voice interaction functions are generally classified into the following types:
1) and voice control cannot be carried out, and only one-way voice broadcasting can be carried out.
2) Simple voice control can be performed, such as turning on an air conditioner, and the like.
3) The broadcast sound, such as the pronouncing character of boys and girls, can be selected on the basis of voice control.
For the above-mentioned sub-tone broadcast voice function of the third vehicle-mounted voice control system, as shown in fig. 1, fig. 1 discloses a flow chart of a voice broadcast method in the prior art, and stores fixed voice files of several tones in a memory, and a User selects a favorite tone through related setting items on a User Interface (User Interface) and outputs voice broadcast of the selected tone.
The solution shown in fig. 1 has the following drawbacks:
generally, voice files with various timbres need to be stored in advance, and requirements on hardware storage equipment at a vehicle end are high.
Even if a voice file with a plurality of timbres is provided in advance, the timbres are difficult to be customized according to needs, and specific preferences of users are difficult to meet to a great extent.
Disclosure of Invention
The invention aims to provide a method and a system for customizing the voice tone of an automobile prompt tone, which solve the problem that the automobile prompt tone in the prior art is difficult to input and customize in a personalized way.
In order to achieve the aim, the invention provides a method for customizing the voice tone of an automobile prompt tone, which comprises the following steps:
step S1, inputting sound with appointed tone;
step S2, storing input sound data;
step S3, extracting the tone of the input voice data, synthesizing the extracted tone with the original voice prompt tone data, and generating the customized voice prompt tone data corresponding to the designated tone;
and step S4, storing and outputting the customized voice prompt tone data.
In an embodiment, the step S3, further includes:
step S31, analyzing the voice spectrum through Fourier change, and extracting the tone color characteristics of the input voice data;
step S32, extracting the content characteristic information of the original voice prompt tone data;
step S33 is to synthesize the tone color feature and the content feature information to generate voice guidance sound data corresponding to the specified tone color.
In an embodiment, the step S31, further includes:
step 311, decomposing the input voice data by frame;
step S312, calculating a periodic power spectrum for the audio of each frame;
step S313, applying the mel filter to the periodic power spectrum, and calculating the energy sum of each mel filter;
step S314, calculating the logarithm value of the energy sum;
step S315, discrete cosine transform is carried out on each logarithmic energy;
and step S316, reserving 2-13 coefficients of the discrete cosine transform result as timbre characteristics, and discarding the rest coefficients.
In an embodiment, the step S33, further includes:
step S331, classifying the extracted tone characteristic information according to a frequency spectrum;
s332, expanding the tone characteristic information by using the series, and taking the tone characteristic information of the main part;
step S333, sorting the content characteristic information, combining the tone characteristic information and generating voice spectrum data corresponding to the designated tone;
and step 334, performing inverse frequency domain transformation on the voice frequency spectrum data corresponding to the designated tone color, and outputting voice prompt tone data corresponding to the designated tone color.
In an embodiment, the step S33, further includes:
and (4) synthesizing the tone characteristic and the content characteristic information after training through a deep neural network algorithm.
In order to achieve the above object, the present invention provides a system for customizing the voice timbre of an automobile prompt tone, which comprises a user end, a vehicle end and a service end:
the user side is connected with the vehicle side, inputs the sound with the designated tone and outputs the customized voice prompt tone;
the vehicle end is connected with the server end, receives input sound data, stores the input sound data and sends the input sound data to the server end, sends original voice prompt sound data to the server end, receives customized voice prompt sound data, stores the customized voice prompt sound data and sends the customized voice prompt sound data to the user end;
and the server side extracts the tone of the input sound data, synthesizes the input sound data with the original voice prompt tone data and generates customized voice prompt tone data corresponding to the designated tone.
In an embodiment, the server analyzes the fourier transform into a spectrogram, extracts the tone features of the input voice data, extracts the content feature information of the original voice prompt tone data, and synthesizes the tone features and the content feature information to generate the voice prompt tone data corresponding to the specified tone.
In one embodiment, the server decomposes the input sound data by frames, calculates a periodic power spectrum for the audio frequency of each frame, applies mel filters to the periodic power spectrum, calculates the energy sum of each mel filter, calculates the logarithm value of the energy sum, performs discrete cosine transform on each logarithm energy, retains 2-13 coefficients of the discrete cosine transform result as the tone color feature, and discards the rest coefficients.
In an embodiment, the server classifies the extracted tone characteristic information according to a frequency spectrum, expands the tone characteristic information by using a series, takes tone characteristic information of a main part of the tone characteristic information, sorts the content characteristic information, combines the tone characteristic information, generates voice spectrum data corresponding to a specified tone, performs inverse frequency domain transformation on the voice spectrum data corresponding to the specified tone, and outputs voice prompt tone data corresponding to the specified tone.
In an embodiment, the server side synthesizes the tone characteristic and the content characteristic information after training through a deep neural network algorithm.
According to the method and the system for customizing the voice tone of the automobile prompt tone, the favorite sound is input by the user, the tone of the sound is simulated to perform subsequent TTS voice broadcast prompt, the structure is simple, the design is ingenious, the scientific and technological sense brought by voice interaction can be realized, the operation attribute of the traditional voice broadcast can be realized, and the affinity and the individuality of the automobile in driving are greatly improved.
Drawings
The above and other features, properties and advantages of the present invention will become more apparent from the following description of the embodiments with reference to the accompanying drawings in which like reference numerals denote like features throughout the several views, wherein:
fig. 1 discloses a flow chart of a voice broadcasting method in the prior art;
FIG. 2 is a flow chart of a method for customizing the voice tone of an automobile warning tone according to an embodiment of the present invention;
fig. 3 discloses a schematic diagram of a voice timbre customizing system for a car warning sound according to an embodiment of the invention.
The meanings of the reference symbols in the figures are as follows:
100 user terminals;
200 of a vehicle end;
300, a server side.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
Fig. 2 discloses a flow chart of a method for customizing a voice tone of an automobile warning sound according to an embodiment of the present invention, and the method for customizing the voice tone of the automobile warning sound shown in fig. 2 includes the following steps:
step S1, inputting sound with appointed tone;
and directly inputting the sound interested by the user through a microphone by an artificial intelligence technology, wherein the tone corresponding to the sound interested by the user is used as a designated tone.
Step S2, storing input sound data;
the vehicle end stores the input sound data file and transmits the input sound data file to the server end through TBOX.
The T-Box is called a vehicle-mounted intelligent terminal, serves as the only control unit capable of being networked for a vehicle body, carries a mission for monitoring and controlling the state of the vehicle body, and the TBox is mainly used for collecting vehicle-related information including position information, attitude information, vehicle state information (through connecting a CAN bus on the vehicle) and the like and then transmitting the information to the TSP platform through wireless communication.
Step S3, extracting the tone of the input voice data, synthesizing the extracted tone with the original voice prompt tone data, and generating the customized voice prompt tone data corresponding to the designated tone;
and the server side extracts the tone information through a frequency domain transformation algorithm and synthesizes the tone information and the content of the original voice prompt tone.
And step S4, storing and outputting the customized voice prompt tone data.
And the server transmits the synthesized voice prompt tone data to the vehicle terminal through the TBOX, and the tone corresponding to the synthesized voice prompt tone is the tone of the sound which is interested by the user.
When the user carries out man-machine interaction with the car machine end through the voice command, the car machine end can output voice prompt sound corresponding to the tone desired by the user.
Further, the step S3 further includes the following steps:
step S31, analyzing the voice spectrum through Fourier change, and extracting the tone color characteristics of the input voice data;
step S32, extracting the content characteristic information of the original voice prompt tone data;
step S33 is to synthesize the tone color feature and the content feature information to generate voice guidance sound data corresponding to the specified tone color.
The key points of the present invention are two steps of extracting tone color feature information and synthesizing sound in step S3.
In step S31, the algorithm for extracting the tone characteristic information further includes:
step 311, decomposing the input voice data by frame;
step S312, calculating a periodic power spectrum for the audio of each frame;
step S313, applying the mel filter to the periodic power spectrum, and calculating the energy sum of each mel filter;
human ears feel that the height of a voice signal is not in a linear relation with the frequency, so that a group of triangular filter sequences can be constructed, and sparse decomposition is carried out on the signal, namely a mel filter bank.
Step S314, calculating the logarithm value of the energy sum;
step S315, performing Discrete Cosine Transform (DCT) on each logarithmic energy;
and step S316, reserving 2-13 coefficients of the discrete cosine transform result as timbre characteristics, and discarding the rest coefficients.
The step S33, an algorithm of sound synthesis, further includes:
step S331, classifying the extracted tone characteristic information according to a frequency spectrum;
s332, expanding the tone characteristic information by using the series, and taking the tone characteristic information of the main part;
step S333, sorting the content characteristic information, combining the tone characteristic information and generating voice spectrum data corresponding to the designated tone;
and step 334, performing inverse frequency domain transformation on the voice frequency spectrum data corresponding to the designated tone color, and outputting voice prompt tone data corresponding to the designated tone color.
Furthermore, the accuracy and the precision of sound synthesis can be further improved by continuously training the samples based on the deep neural network algorithm, and synthesizing the tone characteristic information and the content characteristic information after training.
Fig. 3 discloses a schematic diagram of a voice timbre customizing system for an automobile warning sound according to an embodiment of the present invention, and the voice timbre customizing system for an automobile warning sound shown in fig. 3 includes a user terminal 100, an automobile terminal 200, and a service terminal 300:
the user end 100 is connected with the vehicle end 200, inputs the sound with the designated tone color and outputs the customized voice prompt tone;
the vehicle end 200 is connected with the server 300, receives input sound data, stores the input sound data, sends the input sound data to the server 300, sends original voice prompt sound data to the server 300, receives customized voice prompt sound data, stores the customized voice prompt sound data and sends the customized voice prompt sound data to the user end 100;
the server 300 extracts the tone of the input voice data, synthesizes the extracted tone with the original voice prompt tone data, and generates customized voice prompt tone data corresponding to the designated tone.
In the embodiment shown in fig. 3, the user terminal 100 inputs the sound of interest to the user directly through the microphone/microphone by using the artificial intelligence technique, and the tone corresponding to the sound of interest to the user is used as the designated tone.
When the user terminal 100 performs the man-machine interaction with the car terminal 200 through the voice command, the car terminal 200 outputs the voice prompt sound corresponding to the tone desired by the user to the user terminal 100.
In the embodiment shown in FIG. 3, the vehicle end 200 is an infotainment system (IHU, generally referred to as an infotainment head unit).
The car terminal 200 stores the inputted sound data file and transmits it to the server terminal 300 through the TBOX.
Optionally, the vehicle-mounted terminal 200 is an SoC terminal, and the SoC chip is an integrated circuit chip, so that the development cost of the electronic/information system product can be effectively reduced, the development period can be shortened, and the competitiveness of the product can be improved, which is the most important product development mode to be adopted in the future industry.
In the embodiment shown in fig. 3, the server 300 is a cloud processor, and performs sound synthesis and tone color replacement by analyzing the uploaded input sound file into a spectrogram through fourier transform.
Further, the server 300 analyzes the fourier transform into a spectrogram, extracts the tone features of the input voice data, extracts the content feature information of the original voice prompt tone data, and synthesizes the tone features and the content feature information to generate the voice prompt tone data corresponding to the designated tone.
Further, the server 300 extracts the tone characteristic information, and further includes:
decomposing input sound data by frames, calculating a periodic power spectrum for the audio frequency of each frame, applying mel filters to the periodic power spectrum, calculating the energy sum of each mel filter, calculating the logarithm value of the energy sum, performing discrete cosine transform on each logarithm energy, reserving 2-13 coefficients of the discrete cosine transform result as timbre characteristics, and discarding the rest coefficients.
Further, the server 300 performs sound synthesis, and further includes:
classifying the extracted tone characteristic information according to frequency spectrums, expanding the tone characteristic information by using series, taking tone characteristic information of a main part, sorting content characteristic information, combining the tone characteristic information, generating voice frequency spectrum data corresponding to the specified tone, performing frequency domain inverse transformation on the voice frequency spectrum data corresponding to the specified tone, and outputting voice prompt tone data corresponding to the specified tone.
Furthermore, the server 300 synthesizes the information of the tone features and the content features after training through a deep neural network algorithm.
According to the method and the system for customizing the voice tone of the automobile prompt tone, the favorite sound is input by the user, the tone of the sound is simulated to perform subsequent TTS voice broadcast prompt, the structure is simple, the design is ingenious, the scientific and technological sense brought by voice interaction can be realized, the operation attribute of the traditional voice broadcast can be realized, and the affinity and the individuality of the automobile in driving are greatly improved.
While, for purposes of simplicity of explanation, the methodologies are shown and described as a series of acts, it is to be understood and appreciated that the methodologies are not limited by the order of acts, as some acts may, in accordance with one or more embodiments, occur in different orders and/or concurrently with other acts from that shown and described herein or not shown and described herein, as would be understood by one skilled in the art.
Those of skill would further appreciate that the various illustrative logical blocks, modules, circuits, and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.
As used in this application and the appended claims, the terms "a," "an," "the," and/or "the" are not intended to be inclusive in the singular, but rather are intended to be inclusive in the plural unless the context clearly dictates otherwise. In general, the terms "comprises" and "comprising" merely indicate that steps and elements are included which are explicitly identified, that the steps and elements do not form an exclusive list, and that a method or apparatus may include other steps or elements.
The embodiments described above are provided to enable persons skilled in the art to make or use the invention and that modifications or variations can be made to the embodiments described above by persons skilled in the art without departing from the inventive concept of the present invention, so that the scope of protection of the present invention is not limited by the embodiments described above but should be accorded the widest scope consistent with the innovative features set forth in the claims.

Claims (10)

1. A voice timbre customizing method of an automobile prompt tone is characterized by comprising the following steps:
step S1, inputting sound with appointed tone;
step S2, storing input sound data;
step S3, extracting the tone of the input voice data, synthesizing the extracted tone with the original voice prompt tone data, and generating the customized voice prompt tone data corresponding to the designated tone;
and step S4, storing and outputting the customized voice prompt tone data.
2. The method for customizing the voice tone of an automobile warning sound according to claim 1, wherein the step S3 further comprises:
step S31, analyzing the voice spectrum through Fourier change, and extracting the tone color characteristics of the input voice data;
step S32, extracting the content characteristic information of the original voice prompt tone data;
step S33 is to synthesize the tone color feature and the content feature information to generate voice guidance sound data corresponding to the specified tone color.
3. The method for customizing the voice tone of an automobile warning sound according to claim 2, wherein the step S31 further comprises:
step 311, decomposing the input voice data by frame;
step S312, calculating a periodic power spectrum for the audio of each frame;
step S313, applying the mel filter to the periodic power spectrum, and calculating the energy sum of each mel filter;
step S314, calculating the logarithm value of the energy sum;
step S315, discrete cosine transform is carried out on each logarithmic energy;
and step S316, reserving 2-13 coefficients of the discrete cosine transform result as timbre characteristics, and discarding the rest coefficients.
4. The method for customizing the voice tone of an automobile warning sound according to claim 3, wherein the step S33 further comprises:
step S331, classifying the extracted tone characteristic information according to a frequency spectrum;
s332, expanding the tone characteristic information by using the series, and taking the tone characteristic information of the main part;
step S333, sorting the content characteristic information, combining the tone characteristic information and generating voice spectrum data corresponding to the designated tone;
and step 334, performing inverse frequency domain transformation on the voice frequency spectrum data corresponding to the designated tone color, and outputting voice prompt tone data corresponding to the designated tone color.
5. The method for customizing the voice tone of an automobile warning sound according to claim 2, wherein the step S33 further comprises:
and (4) synthesizing the tone characteristic and the content characteristic information after training through a deep neural network algorithm.
6. The utility model provides a pronunciation tone customization system of car prompt tone which characterized in that, includes user end, car end and server:
the user side is connected with the vehicle side, inputs the sound with the designated tone and outputs the customized voice prompt tone;
the vehicle end is connected with the server end, receives input sound data, stores the input sound data and sends the input sound data to the server end, sends original voice prompt sound data to the server end, receives customized voice prompt sound data, stores the customized voice prompt sound data and sends the customized voice prompt sound data to the user end;
and the server side extracts the tone of the input sound data, synthesizes the input sound data with the original voice prompt tone data and generates customized voice prompt tone data corresponding to the designated tone.
7. The system of claim 6, wherein the server analyzes the voice spectrum by fourier transform analysis, extracts tone color features of the input voice data, extracts content feature information of the original voice prompt tone data, and synthesizes the tone color features and the content feature information to generate the voice prompt tone data corresponding to the specified tone color.
8. The system of claim 7, wherein the server decomposes the input sound data by frames, calculates a periodic power spectrum for each frame of audio, applies mel-filters to the periodic power spectrum, calculates a sum of energy of each mel-filter, calculates a logarithmic value of the sum of energy, performs discrete cosine transform on each logarithmic energy, retains 2-13 coefficients of the discrete cosine transform result as timbre characteristics, and truncates the remaining coefficients.
9. The system according to claim 8, wherein the server classifies the extracted tone feature information according to a spectrum, expands the tone feature information using a series, extracts tone feature information of a main part of the tone feature information, sorts the content feature information, combines the tone feature information to generate voice spectrum data corresponding to a specified tone, performs inverse frequency domain conversion on the voice spectrum data corresponding to the specified tone, and outputs the voice prompt tone data corresponding to the specified tone.
10. The system of claim 7, wherein the server synthesizes the tone characteristic and the content characteristic information after training the tone characteristic and the content characteristic information through a deep neural network algorithm.
CN202011443075.9A 2020-12-08 2020-12-08 Voice tone customization method and system for automobile prompt tone Pending CN112614481A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011443075.9A CN112614481A (en) 2020-12-08 2020-12-08 Voice tone customization method and system for automobile prompt tone

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011443075.9A CN112614481A (en) 2020-12-08 2020-12-08 Voice tone customization method and system for automobile prompt tone

Publications (1)

Publication Number Publication Date
CN112614481A true CN112614481A (en) 2021-04-06

Family

ID=75232746

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011443075.9A Pending CN112614481A (en) 2020-12-08 2020-12-08 Voice tone customization method and system for automobile prompt tone

Country Status (1)

Country Link
CN (1) CN112614481A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113409615A (en) * 2021-06-18 2021-09-17 深圳市易流科技股份有限公司 Driver monitoring system and driver monitoring method

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105606117A (en) * 2014-11-18 2016-05-25 深圳市腾讯计算机***有限公司 Navigation prompting method and navigation prompting apparatus
CN106627570A (en) * 2016-10-21 2017-05-10 璧典寒 Vehicle driving safety prompt method and vehicle driving safety prompt system
CN107507619A (en) * 2017-09-11 2017-12-22 厦门美图之家科技有限公司 Phonetics transfer method, device, electronic equipment and readable storage medium storing program for executing
CN107770380A (en) * 2017-10-25 2018-03-06 百度在线网络技术(北京)有限公司 Information processing method and device
CN110534088A (en) * 2019-09-25 2019-12-03 招商局金融科技有限公司 Phoneme synthesizing method, electronic device and storage medium
CN111276120A (en) * 2020-01-21 2020-06-12 华为技术有限公司 Speech synthesis method, apparatus and computer-readable storage medium
CN111435591A (en) * 2020-01-17 2020-07-21 珠海市杰理科技股份有限公司 Sound synthesis method and system, audio processing chip and electronic equipment
CN111477210A (en) * 2020-04-02 2020-07-31 北京字节跳动网络技术有限公司 Speech synthesis method and device
CN111599342A (en) * 2019-02-21 2020-08-28 北京京东尚科信息技术有限公司 Tone selecting method and system

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105606117A (en) * 2014-11-18 2016-05-25 深圳市腾讯计算机***有限公司 Navigation prompting method and navigation prompting apparatus
CN106627570A (en) * 2016-10-21 2017-05-10 璧典寒 Vehicle driving safety prompt method and vehicle driving safety prompt system
CN107507619A (en) * 2017-09-11 2017-12-22 厦门美图之家科技有限公司 Phonetics transfer method, device, electronic equipment and readable storage medium storing program for executing
CN107770380A (en) * 2017-10-25 2018-03-06 百度在线网络技术(北京)有限公司 Information processing method and device
CN111599342A (en) * 2019-02-21 2020-08-28 北京京东尚科信息技术有限公司 Tone selecting method and system
CN110534088A (en) * 2019-09-25 2019-12-03 招商局金融科技有限公司 Phoneme synthesizing method, electronic device and storage medium
CN111435591A (en) * 2020-01-17 2020-07-21 珠海市杰理科技股份有限公司 Sound synthesis method and system, audio processing chip and electronic equipment
CN111276120A (en) * 2020-01-21 2020-06-12 华为技术有限公司 Speech synthesis method, apparatus and computer-readable storage medium
CN111477210A (en) * 2020-04-02 2020-07-31 北京字节跳动网络技术有限公司 Speech synthesis method and device

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113409615A (en) * 2021-06-18 2021-09-17 深圳市易流科技股份有限公司 Driver monitoring system and driver monitoring method

Similar Documents

Publication Publication Date Title
DE69031165T2 (en) SYSTEM AND METHOD FOR TEXT-LANGUAGE IMPLEMENTATION WITH THE CONTEXT-DEPENDENT VOCALALLOPHONE
CN109147804A (en) A kind of acoustic feature processing method and system based on deep learning
EP2306450A1 (en) Voice synthesis model generation device, voice synthesis model generation system, communication terminal device and method for generating voice synthesis model
CN106373580A (en) Singing synthesis method based on artificial intelligence and device
EP3879854A1 (en) Hearing device component, hearing device, computer-readable medium and method for processing an audio-signal for a hearing device
JP3660937B2 (en) Speech synthesis method and speech synthesis apparatus
CN110155075A (en) Atmosphere apparatus control method and relevant apparatus
DE112013007617T5 (en) Speech recognition device and speech recognition method
CN112562681B (en) Speech recognition method and apparatus, and storage medium
CN106601231A (en) Vehicle control method and apparatus
CN112614481A (en) Voice tone customization method and system for automobile prompt tone
CN112151071A (en) Speech emotion recognition method based on mixed wavelet packet feature deep learning
Yang et al. Kullback–Leibler divergence frequency warping scale for acoustic scene classification using convolutional neural network
CN114863905A (en) Voice category acquisition method and device, electronic equipment and storage medium
CN114927122A (en) Emotional voice synthesis method and synthesis device
CN110556092A (en) Speech synthesis method and device, storage medium and electronic device
CN102063897A (en) Sound library compression for embedded type voice synthesis system and use method thereof
JP6864322B2 (en) Voice processing device, voice processing program and voice processing method
CN113370923B (en) Vehicle configuration adjusting method and device, electronic equipment and storage medium
DE112020007096T5 (en) SYSTEMS AND METHODS FOR PROVIDING A PERSONALIZED VIRTUAL PERSONAL ASSISTANT
CN108172241B (en) Music recommendation method and music recommendation system based on intelligent terminal
CN115188363A (en) Voice processing method, system, device and storage medium
CN113012681B (en) Awakening voice synthesis method based on awakening voice model and application awakening method
CN116994553A (en) Training method of speech synthesis model, speech synthesis method, device and equipment
CN113823318A (en) Multiplying power determining method based on artificial intelligence, volume adjusting method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information
CB02 Change of applicant information

Address after: 314500 988 Tong Tong Road, Wu Tong Street, Tongxiang, Jiaxing, Zhejiang

Applicant after: United New Energy Automobile Co.,Ltd.

Address before: 314500 988 Tong Tong Road, Wu Tong Street, Tongxiang, Jiaxing, Zhejiang

Applicant before: Hozon New Energy Automobile Co., Ltd.

Address after: 314500 988 Tong Tong Road, Wu Tong Street, Tongxiang, Jiaxing, Zhejiang

Applicant after: Hozon New Energy Automobile Co., Ltd.

Address before: 314500 988 Tong Tong Road, Wu Tong Street, Tongxiang, Jiaxing, Zhejiang

Applicant before: Hozon New Energy Automobile Co., Ltd.

RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20210406